Hi Team,
Sharing the progress on ECS onboarding for DG:
- The ingestion job for the database application has been refactored to remove all HDFS dependencies.
- The refactored code runs successfully on ECS via spark-submit (in local mode).
- The job connects to the source database, reads data, and creates audit entries.
- We can also read data for the CIRAS application on ECS. For now, the job reads 10 records and prints them to the console.
- So far, reads from MySQL Server and Oracle are working.
- Next steps: test ingestion with different data sizes and validate the remaining data sources one by one.
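
For reference, here is a minimal sketch of the kind of sample read described above. It is not the actual job code: the function names, host, database, and table values are all hypothetical placeholders, and it assumes the MySQL Connector/J and Oracle thin JDBC drivers are on the Spark classpath.

```python
from typing import Dict

# JDBC driver classes for MySQL Connector/J and the Oracle thin driver.
# Assumption: the matching driver JARs are supplied to spark-submit.
DRIVERS = {
    "mysql": "com.mysql.cj.jdbc.Driver",
    "oracle": "oracle.jdbc.OracleDriver",
}


def jdbc_url(kind: str, host: str, port: int, database: str) -> str:
    """Build a JDBC URL for a supported source type (hypothetical helper)."""
    if kind == "mysql":
        return f"jdbc:mysql://{host}:{port}/{database}"
    if kind == "oracle":
        # Oracle thin driver, service-name form.
        return f"jdbc:oracle:thin:@//{host}:{port}/{database}"
    raise ValueError(f"unsupported source type: {kind}")


def jdbc_options(kind: str, host: str, port: int, database: str,
                 table: str, user: str, password: str) -> Dict[str, str]:
    """Collect the options passed to Spark's JDBC reader."""
    return {
        "url": jdbc_url(kind, host, port, database),
        "dbtable": table,
        "user": user,
        "password": password,
        "driver": DRIVERS[kind],
    }


def read_sample(spark, options: Dict[str, str], n: int = 10):
    """Read the table over JDBC and keep only the first n rows."""
    return spark.read.format("jdbc").options(**options).load().limit(n)


def main() -> None:
    # Imported here so the helpers above can be used without PySpark installed.
    from pyspark.sql import SparkSession

    # local[*] mirrors the local-mode spark-submit run mentioned above.
    spark = (SparkSession.builder
             .master("local[*]")
             .appName("ecs-ingestion-smoke-test")  # hypothetical app name
             .getOrCreate())
    # Placeholder connection details; replace with real values.
    opts = jdbc_options("mysql", "db.example.internal", 3306,
                        "appdb", "some_table", "user", "password")
    read_sample(spark, opts).show()  # prints the sampled rows to the console
    spark.stop()
```

Calling `main()` from the script submitted with spark-submit would print up to 10 rows to the console. Keeping the URL and option building separate from the Spark call should make it easier to add and check further source types one at a time.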