Prama.ai - Python/PySpark Developer - Data Engineering (5-8 yrs)
Prama
posted 8d ago
About the Role:
We are seeking a highly skilled and motivated Python/PySpark Data Engineer to join our growing data engineering team.
In this role, you will play a crucial part in building and maintaining robust and efficient data pipelines that power our data-driven decision making.
You will work closely with data engineers, analysts, and other stakeholders to design, develop, and deploy high-performance data solutions on cloud platforms, primarily AWS.
Responsibilities:
Data Pipeline Development & Maintenance:
- Design, develop, and maintain data pipelines using PySpark on cloud platforms like AWS EMR, AWS Glue, and Databricks.
- Extract, transform, and load (ETL) large datasets from various sources (e.g., databases, APIs, cloud storage) into data warehouses and data lakes.
- Optimize data pipelines for performance, scalability, and cost-effectiveness using techniques like data partitioning, caching, and indexing.
- Implement data quality checks and validation procedures to ensure data accuracy and integrity.
- Troubleshoot and resolve data pipeline issues promptly and effectively.
Python & PySpark Proficiency:
- Write clean, efficient, and well-documented Python code for data processing, transformation, and analysis.
- Leverage advanced PySpark features like DataFrames, SQL, and Spark SQL for data manipulation and aggregation.
- Experience with Spark Streaming and real-time data processing is a plus.
Cloud Technologies:
- Hands-on experience with AWS services such as S3, Redshift, Glue, EMR, and IAM.
- Familiarity with cloud-native data platforms and tools is a plus (e.g., AWS Glue Data Catalog, AWS Athena).
Data Warehousing & ETL/ELT:
- Strong understanding of data warehousing concepts, including dimensional modeling, data marts, and data lakes.
- Experience with ETL/ELT processes and tools (e.g., Airflow, Prefect).
Collaboration & Communication:
- Collaborate effectively with data engineers, data analysts, data scientists, and business stakeholders to understand data requirements and translate them into technical solutions.
- Clearly communicate technical concepts and project progress to both technical and non-technical audiences.
Continuous Learning:
- Stay up-to-date with the latest advancements in data engineering technologies, best practices, and industry trends.
Qualifications:
- Bachelor's degree in Computer Science, Computer Engineering, or a related field.
- 3+ years of professional experience in Python development.
- 2+ years of hands-on experience with PySpark and the Spark ecosystem.
- Strong understanding of data structures, algorithms, and object-oriented programming principles.
- Proficiency in SQL and experience with relational databases (e.g., PostgreSQL, MySQL, Oracle).
- Experience with data warehousing concepts, ETL/ELT processes, and data modeling techniques.
- Excellent analytical and problem-solving skills with the ability to identify and resolve complex data issues.
- Strong communication and interpersonal skills with the ability to work effectively in a collaborative team environment.
- Experience with Agile development methodologies is a plus.
Bonus Points:
- Experience with containerization technologies like Docker and Kubernetes.
- Knowledge of machine learning and data science concepts.
- Experience with data visualization tools (e.g., Tableau, Power BI).