Position Overview
Experience Requirements: Must have QA/Testing Experience. Data Testing Experience specifically in Big Data, Hadoop, or Cloud Data Warehouse environments. Databricks Experience of testing pipelines within a Databricks environment. Automation Focus: Proven track record of moving from manual SQL checks to automated Python-based testing frameworks. Mandatory: Databricks Certified Data Engineer Associate (at minimum). Preferred: ISTQB Foundation or Advanced Level (Test Automation Engineer). Core Technical Skills: Data Validation & Frameworks Great Expectations / Pandera: Proficiency in using Python-based libraries to define data 'contracts' and automated validation suites. DLT Expectations: Deep understanding of Delta Live Tables (DLT) expectations (Fail, Drop, Quarantining bad records). Advanced SQL: Expert-level SQL for complex data reconciliation, identifying duplicates, and null-value analysis across billions of records. Python for QA (PySpark): Pytest-Spark: Experience using pytest to ...