Upload Button Icon Add office photos

KPMG India

Compare button icon Compare button icon Compare

Filter interviews by

KPMG India Data Engineer Interview Questions and Answers for Experienced

Updated 5 Nov 2024

6 Interview questions

A Data Engineer was asked
Q. Write PySpark code to change a column name and divide one column by another.
Ans. 

Pyspark code to change column name and divide one column by another column.

  • Use 'withColumnRenamed' method to change column name

  • Use 'withColumn' method to divide one column by another column

  • Example: df = df.withColumnRenamed('old_col_name', 'new_col_name').withColumn('new_col_name', df['col1']/df['col2'])

A Data Engineer was asked
Q. What are the optimization techniques applied in PySpark code?
Ans. 

Optimization techniques in PySpark code include partitioning, caching, and using broadcast variables.

  • Partitioning data based on key columns to optimize join operations

  • Caching frequently accessed data in memory to avoid recomputation

  • Using broadcast variables to efficiently share small data across nodes

  • Using appropriate data types and avoiding unnecessary type conversions

  • Avoiding shuffling of data by using appropria...

Data Engineer Interview Questions Asked at Other Companies for Experienced

asked in Sigmoid
Q1. Next Greater Element Problem Statement You are given an array arr ... read more
asked in LTIMindtree
Q2. If you are given cards numbered 1-1000 and 4 boxes, where card 1 ... read more
asked in Sigmoid
Q3. Problem: Search In Rotated Sorted Array Given a sorted array that ... read more
asked in Sigmoid
Q4. K-th Element of Two Sorted Arrays You are provided with two sorte ... read more
asked in Sigmoid
Q5. Zigzag Binary Tree Traversal Problem Statement Determine the zigz ... read more
A Data Engineer was asked
Q. Write PySpark code to read a CSV file and display the top 10 records.
Ans. 

Pyspark code to read csv file and show top 10 records.

  • Import the necessary libraries

  • Create a SparkSession

  • Read the CSV file using the SparkSession

  • Display the top 10 records using the show() method

A Data Engineer was asked
Q. Given a dictionary, how do you find the greatest number for the same key in Python?
Ans. 

Find the greatest number for same key in a Python dictionary.

  • Use max() function with key parameter to find the maximum value for each key in the dictionary.

  • Iterate through the dictionary and apply max() function on each key.

  • If the dictionary is nested, use recursion to iterate through all the keys.

What people are saying about KPMG India

View All
sankalp344
Verified Icon
3d
currently not working
Job
Do top companies like Deloitte,KPMG,Tcs,HCL,Ey,etc hire someone from correspondence graduation ?
Got a question about KPMG India?
Ask anonymously on communities.
A Data Engineer was asked
Q. How do you handle changing schema from source. What are the common issues faced in hadoop and how did you resolve it?
Ans. 

Handling changing schema from source in Hadoop

  • Use schema evolution techniques like Avro or Parquet to handle schema changes

  • Implement a flexible ETL pipeline that can handle schema changes

  • Use tools like Apache NiFi to dynamically adjust schema during ingestion

  • Common issues include data loss, data corruption, and performance degradation

  • Resolve issues by implementing proper testing, monitoring, and backup strategies

A Data Engineer was asked 9mo ago
Q. Spark optimization techniques
Ans. 

Spark optimization techniques involve partitioning, caching, and tuning resources for efficient data processing.

  • Partitioning data to distribute workload evenly

  • Caching frequently accessed data to avoid recomputation

  • Tuning resources like memory allocation and parallelism

  • Using broadcast variables for small lookup tables

KPMG India Data Engineer Interview Experiences for Experienced

3 interviews found

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Sep 2024. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. Spark optimization techniques
  • Ans. 

    Spark optimization techniques involve partitioning, caching, and tuning resources for efficient data processing.

    • Partitioning data to distribute workload evenly

    • Caching frequently accessed data to avoid recomputation

    • Tuning resources like memory allocation and parallelism

    • Using broadcast variables for small lookup tables

  • Answered by AI
  • Q2. Data warehousing questions
Round 2 - Technical 

(2 Questions)

  • Q1. Project experience
  • Ans. 

    Developed a data pipeline to ingest, process, and analyze real-time streaming data from IoT devices.

    • Designed and implemented data ingestion process using Apache Kafka

    • Utilized Apache Spark for real-time data processing and analysis

    • Developed data models and algorithms to extract insights from the data

    • Worked with stakeholders to understand requirements and deliver actionable insights

  • Answered by AI
  • Q2. Challenges faced
  • Ans. 

    Some challenges faced include data quality issues, scalability issues, and keeping up with evolving technologies.

    • Data quality issues such as missing values, inconsistencies, and errors in data sources.

    • Scalability issues when dealing with large volumes of data and ensuring efficient processing.

    • Keeping up with evolving technologies and tools in the field of data engineering.

    • Collaborating with cross-functional teams and s...

  • Answered by AI

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 18 Oct 2022

I applied via Approached by Company and was interviewed in Sep 2022. There were 5 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Coding Test 

It was a MCQ test to intepret codes and its outcomes.

Round 3 - Technical 

(4 Questions)

  • Q1. Given a dictionary, find out the greatest number for same key in Python.
  • Ans. 

    Find the greatest number for same key in a Python dictionary.

    • Use max() function with key parameter to find the maximum value for each key in the dictionary.

    • Iterate through the dictionary and apply max() function on each key.

    • If the dictionary is nested, use recursion to iterate through all the keys.

  • Answered by AI
  • Q2. Write Pyspark code to read csv file and show top 10 records.
  • Ans. 

    Pyspark code to read csv file and show top 10 records.

    • Import the necessary libraries

    • Create a SparkSession

    • Read the CSV file using the SparkSession

    • Display the top 10 records using the show() method

  • Answered by AI
  • Q3. Write pyspark code to change column name, divide one column by another column.
  • Ans. 

    Pyspark code to change column name and divide one column by another column.

    • Use 'withColumnRenamed' method to change column name

    • Use 'withColumn' method to divide one column by another column

    • Example: df = df.withColumnRenamed('old_col_name', 'new_col_name').withColumn('new_col_name', df['col1']/df['col2'])

  • Answered by AI
  • Q4. What are the optimization techniques applied in pyspark code?
  • Ans. 

    Optimization techniques in PySpark code include partitioning, caching, and using broadcast variables.

    • Partitioning data based on key columns to optimize join operations

    • Caching frequently accessed data in memory to avoid recomputation

    • Using broadcast variables to efficiently share small data across nodes

    • Using appropriate data types and avoiding unnecessary type conversions

    • Avoiding shuffling of data by using appropriate tr...

  • Answered by AI
Round 4 - Behavioral 

(1 Question)

  • Q1. How do you handle changing schema from source. What are the common issues faced in hadoop and how did you resolve it?
  • Ans. 

    Handling changing schema from source in Hadoop

    • Use schema evolution techniques like Avro or Parquet to handle schema changes

    • Implement a flexible ETL pipeline that can handle schema changes

    • Use tools like Apache NiFi to dynamically adjust schema during ingestion

    • Common issues include data loss, data corruption, and performance degradation

    • Resolve issues by implementing proper testing, monitoring, and backup strategies

  • Answered by AI
Round 5 - HR 

(1 Question)

  • Q1. Mostly it was salary negotiation

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident and pratice python, pyspark, sql for big data ppsitiom

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 27 Apr 2021

I applied via Recruitment Consultant and was interviewed in Mar 2021. There were 4 interview rounds.

Interview Questionnaire 

2 Questions

  • Q1. Questions were around my old project, what was my responsibility. Technical questions were objective questions on Python and SQL. There were three technical rounds o which final technical round was with th...
  • Q2. All technical questions were around how to process data using SQL and Python. What are triggers, functions and Store procedures and differences between them. What types of joins are possible and difference...

Interview Preparation Tips

Interview preparation tips for other job seekers - They needed someone with strong data structure knowledge and ability to wrote complex queries fast in my experience. Also tested me on best practices of a development project. If one wants a Data Engineer job role at any company SQL should be very strong as that is the building block of this profile. As I cracked interviews of other companies as well and their questions were also similar.

Interview questions from similar companies

I applied via Job Portal and was interviewed before Jan 2021. There were 2 interview rounds.

Round 1 - Aptitude Test 
Round 2 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips

Interview Preparation Tips

Topics to prepare for Deloitte Data Analyst interview:
  • Procurement
  • Marketing
  • Career goal
Interview preparation tips for other job seekers - Location should be Delhi NCR,Monday would be good

What people are saying about KPMG India

View All
sankalp344
Verified Icon
3d
currently not working
Job
Do top companies like Deloitte,KPMG,Tcs,HCL,Ey,etc hire someone from correspondence graduation ?
Got a question about KPMG India?
Ask anonymously on communities.

I applied via Naukri.com and was interviewed before Oct 2020. There were 3 interview rounds.

Interview Questionnaire 

2 Questions

  • Q1. Write a query where all the customer name start with A
  • Q2. Write a query where we'll get state wise sales summary report. Actually in this query we have to join multiple table and finding the rank .
  • Ans. 

    Query to generate state-wise sales summary report with ranking using multiple table joins.

    • Use SQL JOINs to combine sales, states, and products tables.

    • Aggregate sales data using SUM() function grouped by state.

    • Use RANK() or DENSE_RANK() to rank states based on total sales.

    • Example SQL: SELECT state, SUM(sales) AS total_sales, RANK() OVER (ORDER BY SUM(sales) DESC) AS sales_rank FROM sales_data JOIN states ON sales_data.s...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Interview was extremely easy and smooth.

Skills evaluated in this interview

I appeared for an interview before Jan 2021.

Round 1 - Coding Test 

(1 Question)

Round duration - 60 minutes
Round difficulty - Easy

This was an online aptitude round on the Talview platform which we can give from our Home. To attempt this test, we have been given a window of 12-24 hours. It was a proctored online test. The round consisted of 60 aptitude questions in 60 minutes of time without any negative marking. The aptitude test consisted of various questions of Logical Reasoning, Mental Ability, Data Interpretation Questions, Numerical Reasoning.

  • Q1. 

    Matrix Transpose Problem Statement

    Given a matrix MAT, your task is to return the transpose of the matrix. The transpose of a matrix is obtained by converting rows into columns and vice versa. Specificall...

  • Ans. 

    Transpose a given matrix by switching rows and columns.

    • Iterate through the matrix and swap elements at [i][j] with [j][i].

    • Create a new matrix to store the transposed values.

    • Ensure the dimensions of the transposed matrix are reversed from the original matrix.

  • Answered by AI
Round 2 - Video Call 

Round duration - 30 minutes
Round difficulty - Medium

This was a video interview round taken immediately after the Aptitude test, it was a proctored round on Talview platform. This round consisted of 8 questions, we have to answer each question in the span of 3 mins and 10 secs were given for reading the question, our audio and video was being recorded while answering those questions. Our soft skills were being judged in this round. The questions basically consisted of HR questions and Guesstimates

Round 3 - Video Call 

Round duration - 120 minutes
Round difficulty - Hard

This round was the case study round, in this round, we were provided a case study which we had to solve in 1 hour using pen and paper and had to share the answers of the case study questions in a PDF format to the HR, the questions were related to market share, advertisement strategies, and can be easily solved using basic mathematics. After this, there was be a one-to-one interview on Zoom Platform in which the interviewer will ask you to describe your approach for every question you have solved in the case study, they will not ask for the exact answer, all they ask is about your approach that you have used to solve a particular problem and the way you explain it. They may or may not ask some guesstimates or puzzles as well in this round. The interviewer was very friendly and he gave me hints as well if I was stuck somewhere.

Round 4 - Video Call 

(1 Question)

Round duration - 60 minutes
Round difficulty - Hard

The next round was the Behavioral interview round, in this round, the interviewer asked me about my projects and some guesstimates or puzzles. This round was for 1 hour. The interviewer was very friendly and helpful, she helped me if I was stuck somewhere, she kept me calm during the interview process.

  • Q1. How many people do you estimate watched Ramayana/Mahabharat during the first month of the Covid lockdown in 2020?
Round 5 - Video Call 

(1 Question)

Round duration - 40 minutes
Round difficulty - Hard

The fit round was my final interview round, in this, you will meet the manager/Principal from the company. They will basically check whether you are fit for the organization. They ask about the projects I have done in college/internships. Some basic questions like what are your strengths and weaknesses, why do you want to join the company and some guesstimates or puzzles as well.
On-campus, the behavioral and fit interview rounds were combined into one.
This round was of 40 minutes and the interview was taken on Zoom, the interviewer was helpful and sweet.

  • Q1. How many pizzas are sold in one day in Pune?
  • Ans. 

    The number of pizzas sold in Pune in one day varies depending on factors like day of the week, weather, events, etc.

    • The number of pizzas sold in Pune can range from hundreds to thousands in a day.

    • Factors like day of the week (weekend vs weekday), weather (rainy vs sunny), events (festivals, holidays) can impact the sales.

    • Popular pizza outlets in Pune like Domino's, Pizza Hut, etc. contribute to the overall sales.

    • Data f...

  • Answered by AI

Interview Preparation Tips

Professional and academic backgroundI applied for the job as Data Analyst in GurgaonEligibility criteriaAbove 6.5 CGPA in B. TechZS Associates interview preparation:Topics to prepare for the interview - Aptitude, Guesstimates, Puzzles, HR questions, Resume Related QuestionsTime required to prepare for the interview - 2 monthsInterview preparation tips for other job seekers

Tip 1 : Prepare well for guesstimates and puzzles.
Tip 2 : Practice Data Interpretation Questions.
Tip 3 : Be well versed with your projects.

Application resume tips for other job seekers

Tip 1 : The resume should be very precise and concise
Tip 2 : Do not add such skills in your resume in which you're not comfortable.

Final outcome of the interviewSelected

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
4-6 weeks
Result
Not Selected

I applied via Company Website and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Project related discussions
  • Q2. Meduim level SQl and DSA
Round 2 - One-on-one 

(2 Questions)

  • Q1. This was data modelling round
  • Q2. Design a uber data model
  • Ans. 

    Uber data model design for efficient storage and retrieval of ride-related information.

    • Create tables for users, drivers, rides, payments, and ratings

    • Include attributes like user_id, driver_id, ride_id, payment_id, rating_id, timestamp, location, fare, etc.

    • Establish relationships between tables using foreign keys

    • Implement indexing for faster query performance

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare SQl, Python and data modeling

Skills evaluated in this interview

Are these interview questions helpful?

I applied via Company Website and was interviewed in Dec 2021. There were 11 interview rounds.

Round 1 - Aptitude Test 

A test designed to a determination a person ability in particular skill of field knowledge

Round 2 - Coding Test 

Coding and decoding

Round 3 - Case Study 

Devolpment of a particular person group or situation over a period of time

Round 4 - Technical 

(1 Question)

  • Q1. Relating to a particular Subject art or carft or its techNical Terms
Round 5 - Group Discussion 

Is a situation faced when individuals collectively make a choice from alternative before them

Round 6 - Assignment 

A task or piEce of work allocated to someone a part of a job or course of study a home work assignment

Round 7 - Technical 

(1 Question)

  • Q1. InvolVoing or concerned with applied and industrial science
Round 8 - Coding Test 

Coding interview test candidate technical knowledge coding ability creativity typically on a white board

Round 9 - HR 

(1 Question)

  • Q1. Managing an organisation employees
Round 10 - One-on-one 

(1 Question)

  • Q1. Each player from one team is matched a single player from other team
Round 11 - Coding Test 

The from are checked and returned to the census officer for coding

Interview Preparation Tips

Interview preparation tips for other job seekers - Straight From the job Seekers themselves
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Indeed and was interviewed in Jan 2024. There were 2 interview rounds.

Round 1 - HR 

(3 Questions)

  • Q1. Basics of power bi and sql
  • Q2. What are joins and types of join
  • Q3. How to add date table in power bi
  • Ans. 

    To add a date table in Power BI, you can create a new table with a list of dates and relationships with other tables.

    • Create a new table in Power BI with a list of dates

    • Add columns for day, month, year, etc. for additional analysis

    • Establish relationships between the date table and other tables in the data model

  • Answered by AI
Round 2 - Technical 

(4 Questions)

  • Q1. Sql scenario based question on joins
  • Q2. Window function questions
  • Q3. Performance tuning
  • Q4. Power bi dax question

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Selected Selected

I applied via Naukri.com and was interviewed in Mar 2024. There were 3 interview rounds.

Round 1 - Aptitude Test 

Basic aptitude questions liek tiem distance, percentage , ratio, work and hour

Round 2 - Coding Test 

Excel, Numpy, Pandas, basic ML

Round 3 - Case Study 

Case study talking about how to optimise the supply chain using data analytics

KPMG India Interview FAQs

How many rounds are there in KPMG India Data Engineer interview for experienced candidates?
KPMG India interview process for experienced candidates usually has 3-4 rounds. The most common rounds in the KPMG India interview process for experienced candidates are Technical, Behavioral and HR.
How to prepare for KPMG India Data Engineer interview for experienced candidates?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at KPMG India. The most common topics and skills that interviewers at KPMG India expect are SQL, Python, Azure, Data Integration and ETL.
What are the top questions asked in KPMG India Data Engineer interview for experienced candidates?

Some of the top questions asked at the KPMG India Data Engineer interview for experienced candidates -

  1. How do you handle changing schema from source. What are the common issues faced...read more
  2. Write Pyspark code to read csv file and show top 10 recor...read more
  3. What are the optimization techniques applied in pyspark co...read more
How long is the KPMG India Data Engineer interview process?

The duration of KPMG India Data Engineer interview process can vary, but typically it takes about less than 2 weeks to complete.

Tell us how to improve this page.

Overall Interview Experience Rating

4/5

based on 1 interview experience

Difficulty level

Moderate 100%

Duration

Less than 2 weeks 100%
View more
KPMG India Data Engineer Salary
based on 141 salaries
₹10.3 L/yr - ₹22.4 L/yr
37% more than the average Data Engineer Salary in India
View more details

KPMG India Data Engineer Reviews and Ratings

based on 10 reviews

4.1/5

Rating in categories

4.1

Skill development

3.4

Work-life balance

3.4

Salary

3.8

Job security

3.3

Company culture

3.8

Promotions

3.4

Work satisfaction

Explore 10 Reviews and Ratings
Celonis Data Engineer - Consultant

Bangalore / Bengaluru

6-8 Yrs

Not Disclosed

Celonis Data Engineer

Bangalore / Bengaluru

4-8 Yrs

Not Disclosed

Senior - Data Engineering

Bangalore / Bengaluru

2-7 Yrs

₹ 4.8-27 LPA

Explore more jobs
Consultant
8.7k salaries
unlock blur

₹11.3 L/yr - ₹20 L/yr

Assistant Manager
7.9k salaries
unlock blur

₹15.9 L/yr - ₹27 L/yr

Associate Consultant
5.1k salaries
unlock blur

₹7.7 L/yr - ₹13 L/yr

Analyst
3.8k salaries
unlock blur

₹2.5 L/yr - ₹8 L/yr

Manager
3.5k salaries
unlock blur

₹22 L/yr - ₹38 L/yr

Explore more salaries
Compare KPMG India with

Cognizant

3.7
Compare

PwC

3.3
Compare

Capgemini

3.7
Compare

Google

4.4
Compare
write
Share an Interview