Job Description
We are seeking a Senior Data Engineer with deep expertise in building and managing data pipelines on AWS. The ideal candidate will be responsible for designing, developing, and optimizing data workflows using AWS Glue and other related technologies. You will also play a critical role in data modeling and managing our Redshift data warehouse to support analytics and business intelligence initiatives.
Requirements
- 7+ years of experience in a Data Engineering role.
- Extensive experience with AWS data services, particularly AWS Glue (Glue Studio, Glue Data Catalog, Glue Jobs).
- Demonstrated expertise in building complex ETL/ELT processes.
- Proven experience with Amazon Redshift and a solid understanding of data warehouse concepts.
- In-depth knowledge of data modeling techniques, including Star schema and canonical models.
- Proficiency in SQL and scripting languages like Python or Scala.
- Experience with other AWS services like S3, Lambda, and IAM.
- At least 2–3 relevant demonstrable project experiences involving large-scale data pipelines and data warehousing are required.
- Prior experience working in a SaaS environment is highly desirable.
Responsibilities
- Architect and build scalable and resilient ETL pipelines using AWS Glue, including Glue crawlers, jobs, and workflows.
- Perform data modeling for a data warehouse using Star schema and canonical models.
- Manage and optimize data within Amazon Redshift to ensure high performance and cost efficiency.
- Collaborate with data analysts and business teams to understand their data requirements and translate them into technical solutions.
- Monitor data pipelines, troubleshoot issues, and ensure data quality and integrity.