Job Type: Permanent,C2H
Work Mode: Hybrid (3 Days from office)
Role - Data QA (Automation, Azure, SQL, ETL)
Experience - 4-6 Years
Location- Remote / Hybrid
(Bangalore/Hyderabad/Mumbai/Pune)
(Hybrid - Candidate has to work 2/3 days from office in a week)
The Data base QA Tester will play a critical role in ensuring the accuracy, integrity, and reliability of data products and data marts built by the engineering team within Microsoft fabric. This individual will be responsible for validating new features, identifying defects, and ensuring system stability through rigorous testing practices including regression, functional, integration, data quality validation and data profiling testing.
The ideal candidate brings a strong understanding of QA methodologies and a working knowledge of python, pyspark, and SQL. This position requires close collaboration with data engineers, business analysts, and product stakeholders.
Key Responsibilities:
Test Planning and Execution
• Review technical specifications and schematic model designs to develop comprehensive test plans and test cases.
• Execute functional, regression, and integration testing for new data products, transformation logic, and platform enhancements.
• Validate Pyspark and SQL-based transformations
• Toolchain experience – dbt-core, soda-core, greatexpectations with pyspark and ANSI SQL, DAX expressions
Data Validation
• Write Python notebook, SQL queries to validate data transformations, joins, filters, aggregations, and calculations across staging, transformation, and reporting layers.
• Compare results between source systems and final data products for accuracy and completeness.
• Ensure data reconciliation and integrity across pipelines.
Issue Tracking and Communication
• Log, document, and track bugs or data anomalies.
• Collaborate with data engineers and developers to troubleshoot and verify resolution of issues.
• Proactively raise risks and blockers during test cycles to ensure timely delivery.
Regression & Automation Support
• Maintain regression test cases as data products evolve in git repository
• Identify repetitive test cases suitable for automation and assist in building out test scripts as needed in python
• Ensure previous features and deliverables continue to work after new releases.
Documentation & Collaboration
• Document test plans, cases, and results clearly for audit and traceability.
• Participate in Agile/Scrum ceremonies and provide QA status updates.
• Collaborate with cross-functional teams to ensure product quality at each development stage.
Required Skills and Experience
• Strong Pyspark and SQL skills, with experience in atleast one tool (dbt-core, soda-core, greatexpectations) the ability to write complex queries for data validation and reconciliation.
• Understanding of relational semantic model design, ETL processes, and data modeling concepts.
• Experience testing data products or platforms built on Microsoft fabric and/or Databricks.
• Familiarity with testing concepts such as QA, UAT, regression, EDA, and post-production deployment testing.
• Ability to work with datasets of varying complexity, volume, and sensitivity.
• High attention to detail and ability to detect inconsistencies or inaccuracies in data.
• Strong written and verbal communication skills for documenting and explaining findings.
Preferred Qualifications
• Prior experience in data platform testing, preferably in GRC domain or regulated industries
• Familiarity with Microsoft fabric or databricks