The Spark Universal Migrator is a data migration Spark application that fetches data from an Oracle database and stores it in a Hive database. It utilizes JDBC to connect to Oracle, processes and converts the data into a Hive-compatible schema, and then transfers it to Hive. The migration process employs iterators to generate queries for data retrieval and utilizes a BlockingQueue in conjunction with a ThreadPoolExecutor for parallel processing of data. Additionally, the application incorporates an Oracle ...
| License | Apache 2.0 |
|---|---|
| Tags | github |
| HomePage | https://github.com/queukat/spark_oracle_hive_streaming 🔍 Inspect URL |
| Ranking | #580496 in MvnRepository (See Top Artifacts) |