KPMG India

3.5

based on 5.5k Reviews

KPMG India Azure Data Engineer Interview Questions, Process, and Tips

Updated 17 Jan 2025

Top KPMG India Azure Data Engineer Interview Questions and Answers

Q1. What is the difference between RDD, DataFrame, and Dataset? Which of these have you used in Databricks for data analysis, and how?

Q2. What are the key components in ADF? Which of them have you used in your pipeline?

Q3. Do you create any encryption keys in Databricks? What cluster size do you use in Databricks?

KPMG India Azure Data Engineer Interview Experiences

4 interviews found

Azure Data Engineer Interview Questions & Answers

SACHIN KAUSHIK

posted on 17 Jan 2025

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. What steps are involved in fetching data from an on-premises Unix server?

Q2. What are the types of triggers in Azure Data Factory?

Azure Data Engineer Interview Questions & Answers

Anonymous

posted on 11 Sep 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

Not Selected

I applied via LinkedIn and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Technical

(3 Questions)

Q1. What is Medallion Architecture

Ans.

Medallion Architecture is a data design pattern that organizes lakehouse data into layers of progressively higher quality.

Bronze layer holds raw data as ingested from source systems
Silver layer holds cleansed, validated, and deduplicated data
Gold layer holds business-level aggregates ready for reporting and analytics
Commonly used on lakehouse platforms such as Databricks with Delta Lake
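
Medallion architecture is usually described as three layers: bronze (raw), silver (cleansed), and gold (business-level aggregates). The following is a minimal sketch of that flow, assuming plain Python lists stand in for tables. The record fields and the `to_silver` / `to_gold` helpers are hypothetical names chosen for illustration, not part of any platform API.

```python
from collections import defaultdict

# Bronze: raw records exactly as ingested (duplicates and bad rows included).
bronze = [
    {"order_id": 1, "region": "south", "amount": "120.50"},
    {"order_id": 1, "region": "south", "amount": "120.50"},  # duplicate row
    {"order_id": 2, "region": "north", "amount": "n/a"},     # unparseable amount
    {"order_id": 3, "region": "north", "amount": "80.00"},
]


def to_silver(rows):
    """Cleanse: drop unparseable amounts, cast types, deduplicate on order_id."""
    seen = set()
    silver_rows = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # reject rows whose amount cannot be parsed
        if row["order_id"] in seen:
            continue  # keep only the first copy of each order
        seen.add(row["order_id"])
        silver_rows.append(
            {"order_id": row["order_id"], "region": row["region"], "amount": amount}
        )
    return silver_rows


def to_gold(rows):
    """Aggregate: total amount per region, ready for reporting."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


silver = to_silver(bronze)
gold = to_gold(silver)
```

On a real platform each layer would typically be a persisted table rather than an in-memory list, so that downstream layers can be rebuilt from the raw data.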

Answered by AI


Q2. What is Spark Architecture

Ans.

Spark is a distributed computing framework whose driver-and-executor architecture provides an efficient way to process large datasets.

Spark Architecture consists of a driver program, cluster manager, and worker nodes.
It uses Resilient Distributed Datasets (RDDs) for fault-tolerant distributed data processing.
Spark supports various programming languages like Scala, Java, Python, and SQL.
It includes components like Spark Core, Spark SQL, Spa...

Answered by AI


Q3. Find the second highest salary in employee table

Ans.

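Use a SQL query to find the second highest salary in the employee table.

Use ORDER BY with LIMIT and an offset to skip the highest salary and return the next one
Example (MySQL syntax): SELECT DISTINCT salary FROM employee ORDER BY salary DESC LIMIT 1, 1
Equivalent form in MySQL, PostgreSQL, and SQLite: SELECT DISTINCT salary FROM employee ORDER BY salary DESC LIMIT 1 OFFSET 1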
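
The query can be tried end to end with a small sketch. It assumes an in-memory SQLite database and a hypothetical two-column `employee` table with made-up rows; it uses the `LIMIT 1 OFFSET 1` form, which SQLite accepts, and shows a `MAX`-based alternative that avoids `LIMIT` entirely.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employee VALUES (?, ?)",
    [("a", 900), ("b", 1200), ("c", 1200), ("d", 700)],  # illustrative data
)

# DISTINCT collapses ties, so the two 1200 rows count as one salary level.
row = conn.execute(
    "SELECT DISTINCT salary FROM employee ORDER BY salary DESC LIMIT 1 OFFSET 1"
).fetchone()
second_highest = row[0] if row else None

# Alternative without LIMIT: the largest salary strictly below the maximum.
alt_second_highest = conn.execute(
    "SELECT MAX(salary) FROM employee "
    "WHERE salary < (SELECT MAX(salary) FROM employee)"
).fetchone()[0]
```

The `DISTINCT` matters when several employees share the top salary: without it, the second row returned could still be the highest value.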

Answered by AI


Round 2 - Technical

(2 Questions)

Q1. How do you perform Partitioning

Ans.

Partitioning in Azure data engineering means dividing data into smaller chunks for better performance and manageability.

Partitioning can be done based on a specific column or key in the dataset
It helps in distributing data across multiple nodes for parallel processing
Partitioning can improve query performance by reducing the amount of data that needs to be scanned
In Azure Synapse Analytics, you can use ROUND_ROBIN or H
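
Partitioning on a key column, as described in the points above, can be sketched in plain Python. This is only an illustration of hash-based placement under stated assumptions: the `hash_partition` helper, the `customer_id` column, and the partition count are hypothetical, and real engines use their own hash functions and placement rules.

```python
import zlib
from collections import defaultdict


def hash_partition(rows, key, num_partitions):
    """Assign each row to a partition based on a hash of its key column."""
    partitions = defaultdict(list)
    for row in rows:
        # crc32 is deterministic across runs, unlike Python's built-in hash() for str.
        bucket = zlib.crc32(str(row[key]).encode("utf-8")) % num_partitions
        partitions[bucket].append(row)
    return dict(partitions)


# Twenty illustrative rows spread over five customer ids.
rows = [{"customer_id": f"C{i % 5}", "amount": i} for i in range(20)]
parts = hash_partition(rows, "customer_id", 4)
```

Because the bucket depends only on the key, all rows for one customer land in the same partition, which is what lets a query filtered on that key scan a single partition instead of the whole dataset.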

Answered by AI


Q2. What are your current responsibilities as Azure Data Engineer

Ans.

As an Azure Data Engineer, my current responsibilities include designing and implementing data solutions on Azure, optimizing data storage and processing, and ensuring data security and compliance.

Designing and implementing data solutions on Azure
Optimizing data storage and processing for performance and cost efficiency
Ensuring data security and compliance with regulations
Collaborating with data scientists and analysts

Answered by AI


Azure Data Engineer Interview Questions & Answers

Anonymous

posted on 10 Jun 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - One-on-one

(2 Questions)

Q1. What are your day-to-day activities?

Q2. Describe a challenging problem you have worked on.

Ans.

Designing a data pipeline to process and analyze large volumes of real-time data from multiple sources.

Identify the sources of data and their formats
Design a scalable data ingestion process
Implement data transformation and cleansing steps
Utilize Azure Data Factory, Azure Databricks, and Azure Synapse Analytics for processing and analysis

Answered by AI


Azure Data Engineer Interview Questions & Answers

Anonymous

posted on 27 Jun 2021

I applied via Naukri.com and was interviewed before Jun 2020. There were 4 interview rounds.

Interview Questionnaire

5 Questions

Q1. What are the key components in ADF? Which of them have you used in your pipeline?

Ans.

ADF key components include pipelines, activities, datasets, triggers, and linked services.

Pipelines - logical grouping of activities
Activities - individual tasks within a pipeline
Datasets - data sources and destinations
Triggers - event-based or time-based execution of pipelines
Linked Services - connections to external data sources
Examples: Copy Data activity, Lookup activity, Blob Storage dataset

Answered by AI

Q2. Do you create any encryption keys in Databricks? What cluster size do you use in Databricks?

Ans.

Yes, encryption keys can be created in Databricks. Cluster size can be adjusted based on workload.

Encryption keys are typically created and stored in Azure Key Vault and referenced from Databricks through secret scopes
Cluster size can be adjusted manually or using autoscaling based on workload
Encryption at rest can also be enabled for data stored in Databricks

Answered by AI

Q3. What is the difference between ADLS Gen1 and Gen2?

Ans.

ADLS Gen2 is an upgrade to Gen1 with improved performance, scalability, and security features.

ADLS Gen2 is built on top of Azure Blob Storage, while Gen1 is a standalone service.
ADLS Gen2 supports a hierarchical namespace, which allows for better organization and management of data.
ADLS Gen2 has better performance for large-scale analytics workloads, with faster read and write speeds.
ADLS Gen2 has improved securit...

Answered by AI


Q4. What is Semantic layer?

Ans.

Semantic layer is a virtual layer that provides a simplified view of complex data.

It acts as a bridge between the physical data and the end-user.
It provides a common business language for users to access data.
It simplifies data access by hiding the complexity of the underlying data sources.
Examples include OLAP cubes, data marts, and virtual tables.

Answered by AI


Q5. What is the difference between RDD, DataFrame, and Dataset? Which of these have you used in Databricks for data analysis, and how?

Ans.

RDD, DataFrame, and Dataset are Spark data abstractions. RDD is the low-level API, DataFrame is tabular with named columns, and Dataset combines the two by adding compile-time type safety to the DataFrame API.

RDD stands for Resilient Distributed Dataset and is a low-level structure in Spark that is immutable and fault-tolerant.
DataFrame is a tabular structure with named columns and is similar to a table in a relational database.
Dataset is a combination of RDD and DataFrame and pro...

Answered by AI


Interview Preparation Tips

Interview preparation tips for other job seekers - You should know your project thoroughly.

Azure Data Engineer Jobs at KPMG India

Azure Data Engineer - Consultant

Bangalore / Bengaluru

3-5 Yrs

₹ 18-20 LPA

Azure Data Engineer - Consultant

Bangalore / Bengaluru

4-6 Yrs

₹ 9.5-22 LPA

Azure Data Engineer - Assistant Manager

Bangalore / Bengaluru

6-8 Yrs

₹ 14.6-26.5 LPA

Azure Data Engineer - Consultant

Mumbai

3-6 Yrs

Not Disclosed

Azure Data Engineer - Assistant Manager

Mumbai

6-8 Yrs

Not Disclosed

Interview questions from similar companies

Azure Data Engineer Interview Questions & Answers

PwC

Anonymous

posted on 5 Dec 2024

Interview experience

Good

Difficulty level

Hard

Process Duration

Less than 2 weeks

Result

No response

I applied via a recruitment consultant and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - Technical

(2 Questions)

Q1. Python code for prime numbers
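
The posting does not record what answer was expected. One common approach, shown here as a minimal sketch rather than the interviewer's intended solution, is the Sieve of Eratosthenes; the function name `primes_up_to` is chosen for illustration.

```python
def primes_up_to(n):
    """Return all prime numbers <= n using the Sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    p = 2
    while p * p <= n:
        if is_prime[p]:
            # Every multiple of p from p*p upward is composite.
            for multiple in range(p * p, n + 1, p):
                is_prime[multiple] = False
        p += 1
    return [i for i, flag in enumerate(is_prime) if flag]


first_primes = primes_up_to(30)
```

Starting the inner loop at `p * p` is safe because smaller multiples of `p` were already crossed out by smaller primes.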

Q2. Write Databricks PySpark code to return the first 10 employees with their salary

Azure Data Engineer Interview Questions & Answers

Deloitte

sagnik sil

posted on 13 Jun 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

4-6 weeks

Result

No response

I applied via Referral and was interviewed in May 2024. There was 1 interview round.

Round 1 - Technical

(2 Questions)

Q1. What is polybase?

Ans.

PolyBase is a feature of SQL Server and Azure Synapse Analytics (formerly Azure SQL Data Warehouse) that allows users to query data stored in Hadoop or Azure Blob Storage.

PolyBase enables users to access and query external data sources without moving the data into the database.
It provides a virtualization layer that allows SQL queries to seamlessly integrate with data stored in Hadoop or Azure Blob Storage.
PolyBase can significantly improve query performance by levera...

Answered by AI

Q2. Explain your current project architecture.


Azure Data Engineer Interview Questions & Answers

PwC

Anonymous

posted on 12 Jul 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Not Selected

Round 1 - Technical

(3 Questions)

Q1. Describe your previous project.

Q2. What is a partition key?

Ans.

Partition key is a field used to distribute data across multiple partitions in a database for scalability and performance.

Partition key determines the partition in which a row will be stored in a database.
It helps in distributing data evenly across multiple partitions to improve query performance.
Choosing the right partition key is crucial for efficient data storage and retrieval.
For example, in Azure Cosmos DB, partit...

Answered by AI

Q3. Explain Databricks. How is it different from ADF?

Ans.

Databricks is a unified analytics platform for big data and machine learning, while ADF (Azure Data Factory) is a cloud-based data integration service.

Databricks is a unified analytics platform that provides a collaborative environment for big data and machine learning projects.
ADF is a cloud-based data integration service that allows you to create, schedule, and manage data pipelines.
Databricks supports multiple pr...

Answered by AI


Interview Preparation Tips

Topics to prepare for PwC Azure Data Engineer interview:

ADF
Databricks
SQL
PySpark
Basic queries in SQL and PySpark

Azure Data Engineer Interview Questions & Answers

Deloitte

SREYA KANDUKURU

posted on 27 Sep 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Coding Test

The coding round consists of SQL and PySpark questions at a medium difficulty level.

Azure Data Engineer Interview Questions & Answers

PwC

Anonymous

posted on 18 Jul 2023

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via LinkedIn and was interviewed in Feb 2023. There were 4 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

Round 2 - One-on-one

(5 Questions)

Q1. How do we do a delta load using ADF?

Ans.

Delta load in ADF is achieved by comparing source and target data and only loading the changed data.

Use a Lookup activity to retrieve the latest watermark or timestamp from the target table
Use a Copy activity whose source query extracts only the rows newer than the watermark or timestamp
Where records must be compared, use a Join transformation in a mapping data flow to identify the changed records
Use a Sink activity to load only the changed re
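
The watermark idea behind these steps can be sketched outside ADF. The example below is an assumption-laden illustration in plain Python, not an ADF pipeline: `source`, `target`, the `modified` column, and the `delta_load` helper are all hypothetical, and an in-memory dictionary stands in for the target table.

```python
from datetime import datetime

# Illustrative source rows, each carrying a last-modified timestamp.
source = [
    {"id": 1, "value": "a", "modified": datetime(2024, 1, 1)},
    {"id": 2, "value": "b", "modified": datetime(2024, 1, 5)},
    {"id": 3, "value": "c", "modified": datetime(2024, 1, 9)},
]
# Target already holds row 1 from an earlier load.
target = {1: {"id": 1, "value": "a", "modified": datetime(2024, 1, 1)}}
watermark = datetime(2024, 1, 1)  # highest modified time loaded so far


def delta_load(source_rows, target_rows, last_watermark):
    """Load only rows changed after the watermark; return (count, new watermark)."""
    changed = [r for r in source_rows if r["modified"] > last_watermark]
    for row in changed:
        target_rows[row["id"]] = dict(row)  # upsert by primary key
    new_watermark = max((r["modified"] for r in changed), default=last_watermark)
    return len(changed), new_watermark


loaded, watermark = delta_load(source, target, watermark)
```

Running `delta_load` again with the updated watermark loads nothing, which is the property that makes the pattern safe to schedule repeatedly.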

Answered by AI

Q2. Sql:- fourth highest salry of an employee from an employee table.

Q3. What is the difference between Blob storage and ADLS?

Ans.

Blob is a storage service for unstructured data, while ADLS is optimized for big data analytics workloads.

Blob is a general-purpose object storage service for unstructured data, while ADLS is optimized for big data analytics workloads.
ADLS offers features like file system semantics, file-level security, and scalability for big data analytics, while Blob storage is simpler and more cost-effective for general storage nee...

Answered by AI

Q4. What are the types of triggers available in ADF?

Ans.

There are three types of triggers available in Azure Data Factory: Schedule, Tumbling Window, and Event.

Schedule trigger: Runs pipelines on a specified schedule.
Tumbling Window trigger: Runs pipelines over fixed-size, non-overlapping, contiguous time intervals.
Event trigger: Runs pipelines in response to events like a file being added to a storage account.
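
What distinguishes a tumbling window from a plain schedule is that each run covers its own fixed-size, non-overlapping slice of time. A minimal sketch of how such windows line up, assuming a hypothetical `tumbling_windows` helper and arbitrary start and end times:

```python
from datetime import datetime, timedelta


def tumbling_windows(start, end, size):
    """Split [start, end) into contiguous, non-overlapping windows of equal size."""
    windows = []
    window_start = start
    while window_start + size <= end:
        windows.append((window_start, window_start + size))
        window_start += size  # next window begins where this one ends
    return windows


windows = tumbling_windows(
    datetime(2024, 1, 1, 0), datetime(2024, 1, 1, 6), timedelta(hours=2)
)
```

Each pipeline run would then process only the data falling inside its own window, so a failed window can be rerun without touching its neighbours.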

Answered by AI

Q5. What is your team size?

Round 3 - Behavioral

(4 Questions)

Q1. Why do you want to change companies?

Q2. What will you do if you get an offer from Deloitte?

Q3. What are your roles and responsibilities?

Q4. What is your expected salary?

Round 4 - HR

(2 Questions)

Q1. What is your expected salary?

Q2. Do you have any other salary figure in mind?

Azure Data Engineer Interview Questions & Answers

Deloitte

Anonymous

posted on 30 Jan 2024

Interview experience

Average

Difficulty level

Easy

Process Duration

2-4 weeks

Result

Not Selected

I applied via Job Portal and was interviewed in Dec 2023. There was 1 interview round.

Round 1 - One-on-one

(1 Question)

Q1. What are the differences between Data Lake Gen1 and Gen2?

Ans.

Data Lake Gen1 is a standalone, HDFS-compatible storage service, while Gen2 is built on Azure Blob Storage.

Data Lake Gen1 is its own storage service exposing an HDFS-compatible file system, while Gen2 stores data in Azure Blob Storage.
Both provide a hierarchical file system; Gen2 adds it through a hierarchical namespace on top of Blob Storage's otherwise flat object store.
Gen2 provides better performance, scalability, and security compared to Gen1.
Gen2 supports Azure Data Lake Storage features like tiering, lifecycle management, and acce...

Answered by AI


Interview Preparation Tips

Interview preparation tips for other job seekers - good & smooth interview experience

KPMG India Interview FAQs

How many rounds are there in KPMG India Azure Data Engineer interview?

KPMG India interview process usually has 1-2 rounds. The most common rounds in the KPMG India interview process are Technical and One-on-one Round.

How to prepare for KPMG India Azure Data Engineer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at KPMG India. The most common topics and skills that interviewers at KPMG India expect are Data Architecture, Unit Testing, Analytical Chemistry, Clinical Data Management and Cosmos.

What are the top questions asked in KPMG India Azure Data Engineer interview?

Some of the top questions asked at the KPMG India Azure Data Engineer interview:

What is the difference between RDD, DataFrame, and Dataset? Which of these have you used in Databricks for data analysis, and how?
What are the key components in ADF? Which of them have you used in your pipeline?
Do you create any encryption keys in Databricks? What cluster size do you use in Databricks?


KPMG India Azure Data Engineer Interview Process

based on 3 interviews

1 interview round

Technical Round


KPMG India Azure Data Engineer Salary

based on 33 salaries

₹7.8 L/yr - ₹26.5 L/yr

97% more than the average Azure Data Engineer Salary in India


KPMG India Salaries in India

Consultant 7.7k salaries	₹6.5 L/yr - ₹27 L/yr
Assistant Manager 6.9k salaries	₹10.3 L/yr - ₹35.1 L/yr
Associate Consultant 4.6k salaries	₹4.5 L/yr - ₹16 L/yr
Analyst 3.5k salaries	₹1 L/yr - ₹9.7 L/yr
Manager 2.9k salaries	₹15.9 L/yr - ₹50 L/yr