AWS Data Engineer - Spark/Python (10-12 yrs)
Elements
Flexible timing
Position Overview
We are seeking a skilled AWS Data Engineer with strong expertise in designing and optimizing scalable data pipelines and processing systems. The ideal candidate has in-depth knowledge of Spark, PySpark, and AWS cloud services, along with hands-on experience in data integration, transformation, and warehousing. The role involves collaborating with cross-functional teams to build robust solutions for large-scale data challenges that enable data-driven decision-making.
Key Responsibilities:
Data Pipeline Development:
- Design, develop, and maintain scalable data processing pipelines using Spark and PySpark.
- Optimize Spark jobs to enhance performance and efficiency in large-scale data environments (see the brief PySpark sketch after this responsibilities list).
Data Transformation:
- Write and manage complex SQL queries to manipulate, clean, and transform datasets.
- Develop and deploy data workflows to meet business needs.
AWS Cloud Services:
- Work extensively with AWS services such as Amazon Redshift and AWS Glue, alongside Databricks, to manage and process large datasets.
- Use SQL Server for additional database operations.
Programming & Modularization:
- Use Python for data processing tasks, ensuring modular and reusable code packaging.
- Adhere to best practices for scalable and maintainable Python development.
Data Integration & Real-Time Streaming:
- Implement real-time data streaming and integration using tools like NiFi, Kafka, and EventHub (optional but desirable).
Data Warehousing:
- Work hands-on with Snowflake for managing data warehousing and analytics needs.
Collaborative Development:
- Partner with data scientists, analysts, and engineering teams to fulfill data requirements and support advanced analytics initiatives.
Informatica:
- Apply basic knowledge of Informatica for data integration tasks and workflow automation.
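The following is a minimal, illustrative PySpark sketch of the kind of pipeline and transformation work described above; the S3 paths, table names, and columns are placeholders, not details of this role.

    # Illustrative sketch only: paths, tables, and columns below are hypothetical.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("orders-pipeline-sketch").getOrCreate()

    # Read raw data from hypothetical S3 locations.
    orders = spark.read.parquet("s3://example-bucket/raw/orders/")
    customers = spark.read.parquet("s3://example-bucket/raw/customers/")

    # Clean and transform: drop incomplete rows, derive a partition-friendly date column.
    orders_clean = (
        orders
        .filter(F.col("order_id").isNotNull())
        .withColumn("order_date", F.to_date("order_ts"))
    )

    # Common optimization: broadcast the small dimension table to avoid a shuffle join.
    enriched = orders_clean.join(F.broadcast(customers), on="customer_id", how="left")

    # Aggregate and write partitioned output for downstream warehousing.
    daily_revenue = (
        enriched
        .groupBy("order_date", "customer_segment")
        .agg(F.sum("order_amount").alias("total_revenue"))
    )

    (
        daily_revenue
        .repartition("order_date")            # control output file layout
        .write.mode("overwrite")
        .partitionBy("order_date")
        .parquet("s3://example-bucket/curated/daily_revenue/")
    )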
Required Skills and Qualifications:
- Spark Expertise: Advanced experience with Spark and PySpark, including optimization techniques.
- SQL Knowledge: Strong skills in writing and optimizing complex SQL queries for data transformation.
- Cloud Proficiency: Experience with AWS services (Redshift, Glue) and Databricks.
- Python Programming: Proficient in Python for scripting and data processing tasks.
- Real-Time Data Tools: Familiarity with tools like NiFi, Kafka, and EventHub is highly desirable.
- Snowflake Expertise: Hands-on experience with Snowflake for data warehousing and advanced analytics.
- Informatica: Basic understanding of Informatica tools and their application in data projects.
Preferred Skills (Optional):
- Real-time data streaming experience using Kafka, NiFi, or similar tools.
- Familiarity with EventHub for managing event-driven data workflows.
- Experience in CI/CD pipelines and version control with Git.
Soft Skills:
- Excellent communication and collaboration abilities to work with diverse teams.
- Proactive and detail-oriented with the ability to take ownership of complex data challenges.
- Strong analytical and problem-solving skills.