About the Role
We are hiring an experienced AWS Data Engineer who excels at building scalable ETL pipelines using AWS Glue, Python, Athena, and S3. The role covers end-to-end data engineering, pipeline optimization, and cloud-based pipeline development.
🔧 Key Responsibilities
- Develop and maintain ETL/ELT pipelines using AWS Glue (PySpark/Python)
- Build scalable data ingestion and transformation workflows
- Write and optimize SQL queries in Athena (Presto/Trino)
- Manage Glue Data Catalog and schema evolution
- Organize data on Amazon S3, including partitioning and choice of data formats
- Work with Lambda, Step Functions, and CloudWatch for automation and orchestration
- Troubleshoot pipeline issues and ensure high data quality
- Work with business teams to understand and deliver data requirements
🛠 Required Technical Skills
- Strong experience with AWS Glue Jobs, Glue Crawlers, and Glue Workflows
- Hands-on experience with Athena, S3, IAM, and Python (PySpark)
- Strong SQL skills
- Knowledge of Parquet, ORC, and JSON formats
- Understanding of data warehousing concepts
- Git/version control knowledge