Upload Button Icon Add office photos

Filter interviews by

KPMG India Azure Data Engineer Interview Questions, Process, and Tips

Updated 17 Jan 2025

Top KPMG India Azure Data Engineer Interview Questions and Answers

  • Q1. Difference between RDD, Dataframe and Dataset. How and what you have used in you databricks for data anlysis
  • Q2. What are key components in ADF? What all you have used in your pipeline?
  • Q3. Do you create any encryprion key in Databricks? Cluster size in Databricks.
View all 10 questions

KPMG India Azure Data Engineer Interview Experiences

5 interviews found

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Not Selected

I applied via LinkedIn and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Technical 

(3 Questions)

  • Q1. What is Medallion Architecture
  • Ans. 

    Medallion Architecture is a data processing architecture that involves breaking down data into smaller pieces for easier processing.

    • Medallion Architecture involves breaking down data into smaller pieces for easier processing

    • It allows for parallel processing of data to improve performance

    • Commonly used in big data processing systems like Hadoop and Spark

  • Answered by AI
  • Q2. What is Spark Architecture
  • Ans. 

    Spark Architecture is a distributed computing framework that provides an efficient way to process large datasets.

    • Spark Architecture consists of a driver program, cluster manager, and worker nodes.

    • It uses Resilient Distributed Datasets (RDDs) for fault-tolerant distributed data processing.

    • Spark supports various programming languages like Scala, Java, Python, and SQL.

    • It includes components like Spark Core, Spark SQL, Spa...

  • Answered by AI
  • Q3. Find the second highest salary in employee table
  • Ans. 

    Use SQL query to find the second highest salary in employee table

    • Use SQL query with ORDER BY and LIMIT to get the second highest salary

    • Example: SELECT DISTINCT salary FROM employee ORDER BY salary DESC LIMIT 1, 1

  • Answered by AI
Round 2 - Technical 

(2 Questions)

  • Q1. How do you perform Partitioning
  • Ans. 

    Partitioning in Azure Data Engineer involves dividing data into smaller chunks for better performance and manageability.

    • Partitioning can be done based on a specific column or key in the dataset

    • It helps in distributing data across multiple nodes for parallel processing

    • Partitioning can improve query performance by reducing the amount of data that needs to be scanned

    • In Azure Synapse Analytics, you can use ROUND_ROBIN or H

  • Answered by AI
  • Q2. What are your current responsibilities as Azure Data Engineer
  • Ans. 

    As an Azure Data Engineer, my current responsibilities include designing and implementing data solutions on Azure, optimizing data storage and processing, and ensuring data security and compliance.

    • Designing and implementing data solutions on Azure

    • Optimizing data storage and processing for performance and cost efficiency

    • Ensuring data security and compliance with regulations

    • Collaborating with data scientists and analysts

  • Answered by AI

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - One-on-one 

(2 Questions)

  • Q1. Day to day activities
  • Q2. Challenging problem
  • Ans. 

    Designing a data pipeline to process and analyze large volumes of real-time data from multiple sources.

    • Identify the sources of data and their formats

    • Design a scalable data ingestion process

    • Implement data transformation and cleansing steps

    • Utilize Azure Data Factory, Azure Databricks, and Azure Synapse Analytics for processing and analysis

  • Answered by AI

Azure Data Engineer Interview Questions Asked at Other Companies

asked in TCS
Q1. 7. How can we load multiple(50)tables at a time using adf?
asked in KPMG India
Q2. Difference between RDD, Dataframe and Dataset. How and what you h ... read more
asked in Techigai
Q3. What is incremental load and other types of loads? How do you imp ... read more
asked in TCS
Q4. 2. What is the get metadata activity and what are the parameters ... read more
asked in KPMG India
Q5. What are key components in ADF? What all you have used in your pi ... read more

I applied via Naukri.com and was interviewed before Jun 2020. There were 4 interview rounds.

Interview Questionnaire 

5 Questions

  • Q1. What are key components in ADF? What all you have used in your pipeline?
  • Ans. 

    ADF key components include pipelines, activities, datasets, triggers, and linked services.

    • Pipelines - logical grouping of activities

    • Activities - individual tasks within a pipeline

    • Datasets - data sources and destinations

    • Triggers - event-based or time-based execution of pipelines

    • Linked Services - connections to external data sources

    • Examples: Copy Data activity, Lookup activity, Blob Storage dataset

  • Answered by AI
  • Q2. Do you create any encryprion key in Databricks? Cluster size in Databricks.
  • Ans. 

    Yes, encryption keys can be created in Databricks. Cluster size can be adjusted based on workload.

    • Encryption keys can be created using Azure Key Vault or Databricks secrets

    • Cluster size can be adjusted manually or using autoscaling based on workload

    • Encryption at rest can also be enabled for data stored in Databricks

  • Answered by AI
  • Q3. Difference between ADLS gen 1 and gen 2?
  • Ans. 

    ADLS gen 2 is an upgrade to gen 1 with improved performance, scalability, and security features.

    • ADLS gen 2 is built on top of Azure Blob Storage, while gen 1 is a standalone service.

    • ADLS gen 2 supports hierarchical namespace, which allows for better organization and management of data.

    • ADLS gen 2 has better performance for large-scale analytics workloads, with faster read and write speeds.

    • ADLS gen 2 has improved securit...

  • Answered by AI
  • Q4. What is Semantic layer?
  • Ans. 

    Semantic layer is a virtual layer that provides a simplified view of complex data.

    • It acts as a bridge between the physical data and the end-user.

    • It provides a common business language for users to access data.

    • It simplifies data access by hiding the complexity of the underlying data sources.

    • Examples include OLAP cubes, data marts, and virtual tables.

  • Answered by AI
  • Q5. Difference between RDD, Dataframe and Dataset. How and what you have used in you databricks for data anlysis
  • Ans. 

    RDD, Dataframe and Dataset are data structures in Spark. RDD is a low-level structure, Dataframe is tabular and Dataset is a combination of both.

    • RDD stands for Resilient Distributed Datasets and is a low-level structure in Spark that is immutable and fault-tolerant.

    • Dataframe is a tabular structure with named columns and is similar to a table in a relational database.

    • Dataset is a combination of RDD and Dataframe and pro...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - You should know your project thoroughly.

Skills evaluated in this interview

Azure Data Engineer Interview Questions & Answers

user image SACHIN KAUSHIK

posted on 17 Jan 2025

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. What steps are involved in fetching data from an on-premises Unix server?
  • Q2. Types of triggers in azure data factory

KPMG India interview questions for designations

 Data Engineer

 (12)

 Data Analyst

 (9)

 Data Scientist

 (2)

 Data Executive

 (1)

 Data Science Consultant

 (2)

 Senior Data Analyst

 (1)

 Data Migration Specialist

 (1)

 Data Science Intern

 (1)

Azure Data Engineer interview

user image Hackveda Limited

posted on 18 Mar 2022

Get interview-ready with Top KPMG India Interview Questions

Interview questions from similar companies

Interview experience
4
Good
Difficulty level
Hard
Process Duration
Less than 2 weeks
Result
No response

I applied via Recruitment Consulltant and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Python code for prime numbers
  • Q2. Data bricks pyspark code for first 10 employees with salary
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
4-6 weeks
Result
No response

I applied via Referral and was interviewed in May 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. What is polybase?
  • Ans. 

    Polybase is a feature in Azure SQL Data Warehouse that allows users to query data stored in Hadoop or Azure Blob Storage.

    • Polybase enables users to access and query external data sources without moving the data into the database.

    • It provides a virtualization layer that allows SQL queries to seamlessly integrate with data stored in Hadoop or Azure Blob Storage.

    • Polybase can significantly improve query performance by levera...

  • Answered by AI
  • Q2. Explain your current project architecture.
Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
Not Selected
Round 1 - Technical 

(3 Questions)

  • Q1. Previous project
  • Q2. What is partion key?
  • Ans. 

    Partition key is a field used to distribute data across multiple partitions in a database for scalability and performance.

    • Partition key determines the partition in which a row will be stored in a database.

    • It helps in distributing data evenly across multiple partitions to improve query performance.

    • Choosing the right partition key is crucial for efficient data storage and retrieval.

    • For example, in Azure Cosmos DB, partit...

  • Answered by AI
  • Q3. Explai data bricks,how its different from adf
  • Ans. 

    Data bricks is a unified analytics platform for big data and machine learning, while ADF (Azure Data Factory) is a cloud-based data integration service.

    • Data bricks is a unified analytics platform that provides a collaborative environment for big data and machine learning projects.

    • ADF is a cloud-based data integration service that allows you to create, schedule, and manage data pipelines.

    • Data bricks supports multiple pr...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for PwC Azure Data Engineer interview:
  • Adf
  • data bricks
  • SQL
  • pyspark
  • basic quereis on sql,pyspark

Skills evaluated in this interview

Interview experience
3
Average
Difficulty level
Easy
Process Duration
2-4 weeks
Result
Not Selected

I applied via Job Portal and was interviewed in Dec 2023. There was 1 interview round.

Round 1 - One-on-one 

(1 Question)

  • Q1. What are the difference b/w data lake gen1 and gen2
  • Ans. 

    Data Lake Gen1 is based on Hadoop Distributed File System (HDFS) while Gen2 is built on Azure Blob Storage.

    • Data Lake Gen1 uses HDFS for storing data while Gen2 uses Azure Blob Storage.

    • Gen1 has a hierarchical file system while Gen2 has a flat file system.

    • Gen2 provides better performance, scalability, and security compared to Gen1.

    • Gen2 supports Azure Data Lake Storage features like tiering, lifecycle management, and acce...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - good & smooth interview experience

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via LinkedIn and was interviewed in Feb 2023. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - One-on-one 

(5 Questions)

  • Q1. How do we do delta load using adf?
  • Ans. 

    Delta load in ADF is achieved by comparing source and target data and only loading the changed data.

    • Use a Lookup activity to retrieve the latest watermark or timestamp from the target table

    • Use a Source activity to extract data from the source system based on the watermark or timestamp

    • Use a Join activity to compare the source and target data and identify the changed records

    • Use a Sink activity to load only the changed re

  • Answered by AI
  • Q2. Sql:- fourth highest salry of an employee from an employee table.
  • Q3. What is the difference between Blob and adls
  • Ans. 

    Blob is a storage service for unstructured data, while ADLS is optimized for big data analytics workloads.

    • Blob is a general-purpose object storage service for unstructured data, while ADLS is optimized for big data analytics workloads.

    • ADLS offers features like file system semantics, file-level security, and scalability for big data analytics, while Blob storage is simpler and more cost-effective for general storage nee...

  • Answered by AI
  • Q4. What are the types of triggers available in adf?
  • Ans. 

    There are three types of triggers available in Azure Data Factory: Schedule, Tumbling Window, and Event.

    • Schedule trigger: Runs pipelines on a specified schedule.

    • Tumbling Window trigger: Runs pipelines at specified time intervals.

    • Event trigger: Runs pipelines in response to events like a file being added to a storage account.

  • Answered by AI
  • Q5. What is your team size?
Round 3 - Behavioral 

(4 Questions)

  • Q1. Why do you want to change the company?
  • Q2. What you will do if you got an offer from Deloitte?
  • Q3. What are your roles and responsibilities?
  • Q4. What is expected salary?
Round 4 - HR 

(2 Questions)

  • Q1. What is your expected salary?
  • Q2. Is there any other salary figure in your mind?

Skills evaluated in this interview

KPMG India Interview FAQs

How many rounds are there in KPMG India Azure Data Engineer interview?
KPMG India interview process usually has 1-2 rounds. The most common rounds in the KPMG India interview process are Technical and One-on-one Round.
How to prepare for KPMG India Azure Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at KPMG India. The most common topics and skills that interviewers at KPMG India expect are Azure, Python, SQL, Adf and Computer Networking.
What are the top questions asked in KPMG India Azure Data Engineer interview?

Some of the top questions asked at the KPMG India Azure Data Engineer interview -

  1. Difference between RDD, Dataframe and Dataset. How and what you have used in yo...read more
  2. What are key components in ADF? What all you have used in your pipeli...read more
  3. Do you create any encryprion key in Databricks? Cluster size in Databric...read more

Tell us how to improve this page.

KPMG India Azure Data Engineer Interview Process

based on 3 interviews in last 1 year

Interview experience

4
  
Good

People are getting interviews through

based on 2 KPMG India interviews
Job Portal
100%
Moderate Confidence
?
Moderate Confidence means the data is based on a sufficient number of responses received from the candidates

Interview Questions from Similar Companies

TCS Interview Questions
3.7
 • 10.2k Interviews
Accenture Interview Questions
3.9
 • 8k Interviews
Capgemini Interview Questions
3.8
 • 4.7k Interviews
Deloitte Interview Questions
3.8
 • 2.8k Interviews
IBM Interview Questions
4.1
 • 2.4k Interviews
PwC Interview Questions
3.4
 • 1.4k Interviews
Ernst & Young Interview Questions
3.5
 • 1.1k Interviews
ZS Interview Questions
3.4
 • 467 Interviews
View all
KPMG India Azure Data Engineer Salary
based on 31 salaries
₹7.6 L/yr - ₹26.5 L/yr
99% more than the average Azure Data Engineer Salary in India
View more details

KPMG India Azure Data Engineer Reviews and Ratings

based on 1 review

3.0/5

Rating in categories

-

Skill development

-

Work-Life balance

-

Salary & Benefits

-

Job Security

-

Company culture

-

Promotions/Appraisal

-

Work Satisfaction

Explore 1 Review and Rating
Azure Data Engineer - Consultant

Bangalore / Bengaluru

4-6 Yrs

₹ 9.5-22 LPA

Azure Data Engineer - Assistant Manager

Bangalore / Bengaluru

6-8 Yrs

₹ 14.6-26.5 LPA

Azure Data Engineer - Consultant

Mumbai

3-6 Yrs

Not Disclosed

Explore more jobs
Consultant
7.5k salaries
unlock blur

₹6.9 L/yr - ₹26.2 L/yr

Assistant Manager
6.9k salaries
unlock blur

₹10.6 L/yr - ₹32 L/yr

Associate Consultant
4.5k salaries
unlock blur

₹4.4 L/yr - ₹15.5 L/yr

Analyst
3.4k salaries
unlock blur

₹1 L/yr - ₹8 L/yr

Manager
2.8k salaries
unlock blur

₹12 L/yr - ₹46 L/yr

Explore more salaries
Compare KPMG India with

Cognizant

3.8
Compare

PwC

3.4
Compare

Capgemini

3.8
Compare

Google

4.4
Compare

Calculate your in-hand salary

Confused about how your in-hand salary is calculated? Enter your annual salary (CTC) and get your in-hand salary
Did you find this page helpful?
Yes No
write
Share an Interview