Design and build Informatica PowerCenter ingestion pipelines for all identified source tables to the Azure Data Lake Bronze zone, supporting both full initial loads and incremental CDC/batch loads.
Implement schema drift detection, metadata capture, parameterization for reusability across environments and table lists, and idempotency/retry logic.
Develop EDS Data Quality checks at ingress (completeness, validity, referential integrity) with alerting dashboards; produce reconciliation reports against ETHIX source counts and totals.
Build Silver-zone transformation logic on Cloudera CDP using Spark: CASA variable calculations, including window functions, aggregations, date logic, currency normalisation, product hierarchy mapping, customer de-duplication, and survivorship rules.