Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 2K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

WINNERS AWAITED!
- ABECA 2025
  
  WINNERS AWAITED!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

Add office photos

Employer? Claim Account for FREE

EPAM Systems

Compare

3.7

based on 1.5k Reviews

Filter interviews by

EPAM Systems Senior Data Engineer Interview Questions, Process, and Tips

Updated 13 Jan 2025

Top EPAM Systems Senior Data Engineer Interview Questions and Answers

Q1. how to migrate 1000s of tables using spark(databricks) notebooks

View answer (1)
Q2. What is the process for finding the missing number from a list?

View answer (1)
Q3. Dataflow vs Dataproc, layering processing and curated environments in gcp , Data cleaning

View answer (1)

View all 10 questions

EPAM Systems Senior Data Engineer Interview Experiences

11 interviews found

Senior Data Engineer Interview Questions & Answers

ABHISHEK JHA

posted on 11 Dec 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

Selected

I applied via LinkedIn and was interviewed in Nov 2024. There were 4 interview rounds.

Round 1 - Technical

(6 Questions)

Q1. Current day to day tasks

Ans.

My current day-to-day tasks involve designing and implementing data pipelines, optimizing data storage and retrieval, and collaborating with cross-functional teams.

Designing and implementing data pipelines to extract, transform, and load data from various sources
Optimizing data storage and retrieval processes for efficiency and scalability
Collaborating with cross-functional teams to understand data requirements and del...

Answered by AI

Add your answer

Q2. Question on data modelling(star vs snowflake)

Add your answer

Q3. Questions on data warehousing Dimensions, Facts, CDC, SCD types

Add your answer

Q4. Questions on snowflake product and variety on objects there use case and optimizations, partition pruning.

Add your answer

Q5. SQL queries: as per snowflake documentation 1. Read json file and extract data 2. sql query with subqueries to update records 3. sql query on window functions

Add your answer

Q6. Easy Pyspark coding question.

Add your answer

Round 2 - Technical

(7 Questions)

Q1. Project end to end detailed explanation and follow up questions.

Add your answer

Q2. Questions on data warehousing concepts like facts & dimensions in detail

Add your answer

Q3. Question on semi structured file formats

Add your answer

Q4. Questions on snowflake product: 1. tasks 2. streams 3. time travel 4. meta data caching 5. views vs materialized views 6. how would you spin up an warehouse cluster

Add your answer

Q5. Questions on spark 1. lazy evaluation 2. handling skew 3. data partitioning 4. pandas vs spark

Add your answer

Q6. Questions on aws 1. lambda functions, synchronous vs asynchronous calls 2. dynamic frames, transformations 3. glue crawler 4. s3 storage classes

Add your answer

Q7. SQL question using cross join, python question on string

Add your answer

Round 3 - Technical

(6 Questions)

Q1. End to end project architecture.

Ans.

The end-to-end project architecture involves designing and implementing the entire data pipeline from data ingestion to data visualization.

Data ingestion: Collecting data from various sources such as databases, APIs, and files.
Data processing: Cleaning, transforming, and aggregating the data using tools like Apache Spark or Hadoop.
Data storage: Storing the processed data in data warehouses or data lakes like Amazon S3 ...

Answered by AI

Add your answer

Q2. Questions on data modelling: 1. facts and types of facts 2. dimensions and its types 3. normalization vs denormalization

Add your answer

Q3. Questions on automating spark notebooks and optimization on spark job.

Add your answer

Q4. How to migrate 1000s of tables using spark(databricks) notebooks

Ans.

Use Spark (Databricks) notebooks to migrate 1000s of tables efficiently.

Utilize Spark's parallel processing capabilities to handle large volumes of data
Leverage Databricks notebooks for interactive data exploration and transformation
Automate the migration process using scripts or workflows
Optimize performance by tuning Spark configurations and cluster settings

Answered by AI

Add your answer

Q5. Questions on delta format

Add your answer

Q6. Sql question on joins and numbers of records for each type of join.

Add your answer

Round 4 - HR

(3 Questions)

Q1. Reason for change

Ans.

Seeking new challenges and opportunities for growth in a more dynamic environment.

Desire for new challenges and learning opportunities
Seeking a more dynamic work environment
Looking to expand skill set and experience
Interested in working on cutting-edge technologies
Seeking better career advancement prospects

Answered by AI

Add your answer

Q2. Expectations from EPAM

Ans.

I expect EPAM to provide challenging projects, opportunities for growth, a collaborative work environment, and support for continuous learning.

Challenging projects that allow me to utilize my skills and knowledge
Opportunities for professional growth and advancement within the company
A collaborative work environment where teamwork is valued
Support for continuous learning through training programs and resources

Answered by AI

Add your answer

Q3. Willing to relocate? when can you join

Ans.

Yes, I am willing to relocate for the right opportunity. I can join the company within 4 weeks.

Willing to relocate for the right opportunity
Can join within 4 weeks
Open to discussing relocation assistance if needed

Answered by AI

Add your answer

Interview Preparation Tips

Topics to prepare for EPAM Systems Senior Data Engineer interview:

work experience
SQL
Python
Spark
snowflake

Interview preparation tips for other job seekers - They have a very structured process and technical interviews last 1-1.5 hours, so you will have enough opportunities to showcase your skills. Prepare for the JD thoroughly.

Skills evaluated in this interview

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 17 Dec 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Coding Test

Big data related multiple questions around 10 Multiple choice
One sql question to find the salary details medium difficulty
One python related question

Round 2 - Technical

(2 Questions)

Q1. All basic to complex questions related to spark and its architecture

Add your answer

Q2. Big data and python questions and how to build new data source from sql server to data warehouse. Need to give detail tech stack and complete process information

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Focus on enhancing your understanding of theoretical concepts and reviewing all relevant topics.

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 13 Jan 2025

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. What is the process for finding the missing number from a list?

Ans.

To find the missing number from a list, calculate the sum of all numbers in the list and subtract it from the expected sum of the list.

Calculate the sum of all numbers in the list using a loop or a built-in function.
Calculate the expected sum of the list using the formula n*(n+1)/2, where n is the length of the list.
Subtract the sum of the list from the expected sum to find the missing number.

Answered by AI

Add your answer

Q2. What are some methods for optimizing Spark performance?

Ans.

Optimizing Spark performance involves tuning configurations, partitioning data, caching, and using efficient transformations.

Tune Spark configurations for memory allocation, parallelism, and resource management.
Partition data properly to distribute work evenly across nodes and minimize shuffling.
Cache intermediate results in memory to avoid recomputation.
Use efficient transformations like map, filter, and reduceByKey i...

Answered by AI

Add your answer

What people are saying about EPAM Systems

View All

a senior software engineer

2w (edited)

Need help to decide

I have the following offers and confused to choose one. Exp: 6.8 Java backend dev cognizant - 25 lpa (24 fixed) Epam - 28 fixed Citi - 30 fixed If i ask epam with citi offer they will increase their offer, but my question is that worth negotiating with them? Please give you suggestions. My Last working day is 31 March 2025.

Got a question about EPAM Systems?

Ask anonymously on communities.

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 24 Jan 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via Approached by Company and was interviewed in Dec 2023. There were 4 interview rounds.

Round 1 - Technical

(7 Questions)

Q1. Lineage graph,rdd,python decorators,python dict problem,sql self join probelm

Add your answer

Q2. Types of transformations,no of jobs,tasks,actions

Ans.

The question is asking about types of transformations, number of jobs, tasks, and actions in the context of a Senior Data Engineer role.

Types of transformations: Extract, Transform, Load (ETL), MapReduce, Spark transformations, SQL transformations
Number of jobs: Depends on the complexity and scale of the data engineering projects
Number of tasks: Varies based on the number of data sources, data transformations, and data...

Answered by AI

Add your answer

Q3. Fact,dimesnion,star and snowflake schema

Add your answer

Q4. Architecture of spark,airflow,bigquery,

Ans.

Spark is a distributed processing engine, Airflow is a workflow management system, and BigQuery is a fully managed data warehouse.

Spark is designed for big data processing and provides in-memory computation capabilities.
Airflow is used for orchestrating and scheduling data pipelines.
BigQuery is a serverless data warehouse that allows for fast and scalable analytics.
Spark can be integrated with Airflow to schedule and m...

Answered by AI

Add your answer

Q5. Optimisation in spark,sql,bigquery,airflow

Ans.

Optimization techniques in Spark, SQL, BigQuery, and Airflow.

Use partitioning and bucketing in Spark to optimize data processing.
Optimize SQL queries by using indexes, query rewriting, and query optimization techniques.
In BigQuery, use partitioning and clustering to improve query performance.
Leverage Airflow's task parallelism and resource allocation to optimize workflow execution.

Answered by AI

Add your answer

Q6. Delete duplicates from table in spark and sql

Ans.

To delete duplicates from a table in Spark and SQL, you can use the DISTINCT keyword or the dropDuplicates() function.

In SQL, you can use the DISTINCT keyword in a SELECT statement to retrieve unique rows from a table.
In Spark, you can use the dropDuplicates() function on a DataFrame to remove duplicate rows.
Both methods compare all columns by default, but you can specify specific columns to consider for duplicates.
You...

Answered by AI

Add your answer

Q7. Coaelesce,repartition,shuffling,CDC,acid properties,CDC types

Add your answer

Round 2 - Technical

(1 Question)

Q1. Dataflow vs Dataproc, layering processing and curated environments in gcp , Data cleaning

Ans.

Dataflow and Dataproc are both processing services in GCP, but with different approaches and use cases.

Dataflow is a fully managed service for executing batch and streaming data processing pipelines.
Dataproc is a managed Spark and Hadoop service for running big data processing and analytics workloads.
Dataflow provides a serverless and auto-scaling environment, while Dataproc offers more control and flexibility.
Dataflow...

Answered by AI

Add your answer

Round 3 - Technical

(1 Question)

Q1. Pyspark field delimited questions with group by transformations, Sql window function question Discussion on arch of spark,bq,airflow. How to define dependency between tasks and two diff dags. star and sno...

Add your answer

Round 4 - Group Discussion

This was final round with Client.
They ask questions based on my work in pyspark area.
Questions are like:
What kind of transformations you used.
Broadcast join internals.
Spark internal joins
Spark catalyst optimizer.What are the joins happens in catalyst optimizer.
Window function question: 3rd highest salary of an emp
Discussion on airflow arch and how to deploy a airflow dag in gcp.
Discussion on BQ on what kind of work i have done till now.

Interview Preparation Tips

Interview preparation tips for other job seekers - Thorough with whatever you are writing in Resume

Skills evaluated in this interview

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 31 Mar 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via LinkedIn and was interviewed in Mar 2024. There were 3 interview rounds.

Round 1 - Technical

(1 Question)

Q1. Round 1 was with indian guy who asked basic data engineering questions on spark and azure and delta lake. also one sql, one python and one pyspark questions which was intermediate level to solve.

Add your answer

Round 2 - Technical

(1 Question)

Q1. 2nd round was with ukrain guy, who asked in python questions such as accumulators, generatorsm decorators, GIL, why python is preferred over other languages. One one sql and pyspark question. And then ques...

Add your answer

Round 3 - Maneger round

(1 Question)

Q1. Basic questions about existing project and one sql query to solve. what all tools and technologies you have worked on.

Add your answer

Get interview-ready with Top EPAM Systems Interview Questions

Senior Data Engineer Interview Questions & Answers

Sourav Barik

posted on 23 Dec 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. What are decorators in python

Add your answer

Q2. Stored procedure

Add your answer

Senior Data Engineer Jobs at EPAM Systems

View all

Senior Data Engineer - Data Integration

Chennai

5-8 Yrs

Not Disclosed

Senior Data Engineer

Pune

5-8 Yrs

Not Disclosed

Senior Data Engineer

Chennai

5-8 Yrs

Not Disclosed

Senior Data Engineer

Gurgaon / Gurugram

5-8 Yrs

₹ 25-30 LPA

Senior Data Engineer

Gurgaon / Gurugram

5-8 Yrs

₹ 25-30 LPA

Senior Data Engineer

Chennai

5-8 Yrs

Not Disclosed

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 29 Oct 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

I applied via Naukri.com and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - Coding Test

Python questions were asked in this round

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 5 Apr 2023

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via Naukri.com and was interviewed in Mar 2023. There were 5 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

View all tips

Round 2 - Coding Test

2 coding question in Python where 1 of them is basic level and other one is based on binary tree. 3 question multiple choice question consists of 10 question which will be based on basic of spark and big data concepts.

Round 3 - Technical

(2 Questions)

Q1. Question will move around Azure, ADF, ADB and Pyspark. Basic check on all above topics and other will be the use cases on above topics and how you deals with the use case. Efficiently use the appropriate t...

Add your answer

Q2. Be ready for some pseudo code in Pyspark to join 2 df and replace all the null values with another df exact value.

Add your answer

Round 4 - Techno Mangerial Round

(1 Question)

Q1. This round will be more concentrated over your current project architect and optimisation you have done. Few questions on sone use case to check your ability of problem solving.

Add your answer

Round 5 - HR

(1 Question)

Q1. Salary negotiations and basic question

Add your answer

Interview Preparation Tips

Topics to prepare for EPAM Systems Senior Data Engineer interview:

Python basic
Spark
Azure data factory
azure databricks
SQL
keyVault

Interview preparation tips for other job seekers - Coding round is only the barrier. If you cleared it and you have strong command over your data engineer concepts. You will definitely clear the interview in EPAM.

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 14 Oct 2023

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Don’t add your photo or details such as gender, age, and address in your resume. These details do not add any value.

View all tips

Round 2 - One-on-one

(1 Question)

Q1. Project discussion, Spark optimisations

Add your answer

Round 3 - Coding Test

Python
Sql coding tests

Senior Data Engineer Interview Questions & Answers

dileep

posted on 12 Mar 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

Selected

I applied via Approached by Company and was interviewed before Mar 2023. There were 2 interview rounds.

Round 1 - One-on-one

(1 Question)

Q1. Data warehousing concepts,pyspark,sql

Add your answer

Round 2 - One-on-one

(1 Question)

Q1. Managerial round

Add your answer

EPAM Systems Interview FAQs

How many rounds are there in EPAM Systems Senior Data Engineer interview?

EPAM Systems interview process usually has 2-3 rounds. The most common rounds in the EPAM Systems interview process are Technical, Coding Test and HR.

How to prepare for EPAM Systems Senior Data Engineer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at EPAM Systems. The most common topics and skills that interviewers at EPAM Systems expect are Python, SQL, AWS, GCP and Cloud Computing.

What are the top questions asked in EPAM Systems Senior Data Engineer interview?

Some of the top questions asked at the EPAM Systems Senior Data Engineer interview -

how to migrate 1000s of tables using spark(databricks) notebo...read more
What is the process for finding the missing number from a li...read more
Dataflow vs Dataproc, layering processing and curated environments in gcp , Dat...read more

How long is the EPAM Systems Senior Data Engineer interview process?

The duration of EPAM Systems Senior Data Engineer interview process can vary, but typically it takes about less than 2 weeks to complete.

Tell us how to improve this page.

EPAM Systems Interviews By Designations

Interview Questions for Popular Designations

EPAM Systems Senior Data Engineer Interview Process

based on 12 interviews

3 Interview rounds

Technical Round - 1
Technical Round - 2
Technical Round - 3

Accenture Senior Data Engineer Interview Questions

3.8

• 14 Interviews

TCS Senior Data Engineer Interview Questions

3.7

• 11 Interviews

CGI Group Senior Data Engineer Interview Questions

4.0

• 2 Interviews

Fujitsu Senior Data Engineer Interview Questions

3.8

• 2 Interviews

NTT Data Senior Data Engineer Interview Questions

3.8

• 1 Interview

Optum Global Solutions Senior Data Engineer Interview Questions

4.0

• 1 Interview

Atos Senior Data Engineer Interview Questions

3.8

• 1 Interview

HERE Technologies Senior Data Engineer Interview Questions

3.8

• 1 Interview

Koch Business Solutions Senior Data Engineer Interview Questions

3.4

• 1 Interview

View all

Vasavi College of Engineering, Hyderabad Placement Questions

1 Interview

Vidya Jyothi Institute of Technology, Hyderabad Placement Questions

1 Interview

University Business School (UBS), Chandigarh Placement Questions

1 Interview

KIIT University, Bhuvaneshwar Placement Questions

1 Interview

Koneru Lakshmaiah College of Engineering (KLCE), Arni Placement Questions

1 Interview

Krishna Institute of Engineering and Technology, Ghaziabad Placement Questions

1 Interview

Technocrats Institute of Technology, Bhopal Placement Questions

1 Interview

View all

EPAM Systems Senior Data Engineer Salary

based on 75 salaries

₹12 L/yr - ₹40 L/yr

43% more than the average Senior Data Engineer Salary in India

View more details

Senior Data Engineer Jobs at EPAM Systems

Senior Data Engineer - Data Integration

Chennai

5-8 Yrs

Not Disclosed

Senior Data Engineer

Pune

5-8 Yrs

Not Disclosed

Senior Data Engineer

Chennai

5-8 Yrs

Not Disclosed

Explore more jobs

EPAM Systems Salaries in India

Senior Software Engineer 3.1k salaries	₹15 L/yr - ₹42 L/yr
Software Engineer 1.9k salaries	₹4.9 L/yr - ₹25 L/yr
Lead Software Engineer 963 salaries	₹16.5 L/yr - ₹53 L/yr
Senior Systems Engineer 322 salaries	₹12 L/yr - ₹36 L/yr
Software Developer 270 salaries	₹8.5 L/yr - ₹30 L/yr