Amazon Data Engineer Intern
Data Engineer Intern | Amazon (ADCI HYD 13 SEZ) | Full-time Internship | Final year Bachelor’s/Master’s in CS, IT, or related field | SQL, Python, ETL, RDBMS, NoSQL, Data Warehousing, AWS (S3, Redshift, EMR, Lambda, Glue), Big Data, Hadoop, Spark, Hive, Impala, Data Lake, Data Pipelines, Communication, Agile
Worked in a fast-paced, customer-centric environment, building scalable data pipelines, supporting analytical platforms, and enabling data-driven decisions. Gained hands-on experience with core AWS data services and real-time ETL workflows while collaborating across software, analytics, and product teams.
Key Responsibilities:
- Designed and supported scalable analytical data platforms for business insights.
- Implemented ETL pipelines using AWS tools (S3, Redshift, EMR, Glue, Lambda).
- Designed and managed internal schemas, SQL/NoSQL databases, and reporting metrics.
- Extracted and transformed data from various sources using big data technologies.
- Participated in architecture discussions and performance optimization tasks.
- Collaborated with cross-functional teams to improve customer self-service analytics.
Tools/Technologies Used: SQL, Python, Hadoop, Spark, Hive, Impala, AWS (S3, EMR, Redshift, Glue, Lambda), RDBMS, NoSQL, Data Lake
Notable Achievements:
- Contributed to data pipeline development that improved reporting latency by 25%.
- Successfully integrated real-time data sources into Amazon’s analytics ecosystem.
- Built reusable scripts and automation for pipeline validation and monitoring.