Upload Button Icon Add office photos
Premium Employer

i

This company page is being actively managed by Deloitte Team. If you also belong to the team, you can get access from here

Deloitte Verified Tick

Compare button icon Compare button icon Compare
3.8

based on 17.3k Reviews

Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards

zig zag pattern zig zag pattern

Filter interviews by

Deloitte Big Data Engineer Interview Questions and Answers

Updated 12 Aug 2024

Deloitte Big Data Engineer Interview Experiences

2 interviews found

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected
Round 1 - Technical 

(1 Question)

  • Q1. Partitioning, broadcast join
Round 2 - One-on-one 

(1 Question)

  • Q1. Client round interview questions
Round 3 - HR 

(1 Question)

  • Q1. Salary negotiation
Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(1 Question)

  • Q1. Basics of spark Some UDF questions Joins Pivot and unpivot
Round 3 - Technical 

(1 Question)

  • Q1. Scenario based question
Round 4 - HR 

(1 Question)

  • Q1. Tell me about yourself, expectations, how do you see yourself down the life, etc

Big Data Engineer Interview Questions Asked at Other Companies

Q1. Difference between partitioning and bucketing. Types of joins in ... read more
asked in TCS
Q2. What optimization techniques have you utilized in your projects? ... read more
Q3. Checking whether a fibonacci number is present between a particuk ... read more
asked in TCS
Q4. What is the difference between lineage and directed acyclic graph ... read more
asked in TCS
Q5. What is the difference between cache and persistence?

Interview questions from similar companies

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
-

I applied via Naukri.com and was interviewed in Jun 2024. There was 1 interview round.

Round 1 - One-on-one 

(11 Questions)

  • Q1. Working Experienace in current project
  • Q2. If i have large dataset to load which will not fit into the memory, How will you load the file?
  • Q3. What is Apache spark?
  • Ans. 

    Apache Spark is an open-source distributed computing system that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

    • Apache Spark is designed for speed and ease of use in processing large amounts of data.

    • It can run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.

    • Spark provides high-level APIs in Java, Scala, Python, and R, and an opt...

  • Answered by AI
  • Q4. What are core components of spark?
  • Ans. 

    Core components of Spark include Spark Core, Spark SQL, Spark Streaming, MLlib, and GraphX.

    • Spark Core: foundation of the Spark platform, provides basic functionality for distributed data processing

    • Spark SQL: module for working with structured data using SQL and DataFrame API

    • Spark Streaming: extension of the core Spark API that enables scalable, high-throughput, fault-tolerant stream processing of live data streams

    • MLlib...

  • Answered by AI
  • Q5. If we have streaming data coming from kafka and spark , how will you handle fault tolerance?
  • Ans. 

    Implement fault tolerance by using checkpointing, replication, and monitoring mechanisms.

    • Enable checkpointing in Spark Streaming to save the state of the computation periodically to a reliable storage like HDFS or S3.

    • Use replication in Kafka to ensure that data is not lost in case of node failures.

    • Monitor the health of the Kafka and Spark clusters using tools like Prometheus and Grafana to detect and address issues pro

  • Answered by AI
  • Q6. What is hive Architecture?
  • Ans. 

    Hive Architecture is a data warehousing infrastructure built on top of Hadoop for querying and analyzing large datasets.

    • Hive uses a language called HiveQL which is similar to SQL for querying data stored in Hadoop.

    • It organizes data into tables, partitions, and buckets to optimize queries and improve performance.

    • Hive metastore stores metadata about tables, columns, partitions, and their locations.

    • Hive queries are conver...

  • Answered by AI
  • Q7. What is vectorization in ?
  • Ans. 

    Vectorization is the process of converting data into a format that can be easily processed by a computer's CPU or GPU.

    • Vectorization allows for parallel processing of data, improving computational efficiency.

    • It involves performing operations on entire arrays or matrices at once, rather than on individual elements.

    • Examples include using libraries like NumPy in Python to perform vectorized operations on arrays.

    • Vectorizati...

  • Answered by AI
  • Q8. We have to do Vectorization?
  • Q9. What is partition in hive?
  • Ans. 

    Partition in Hive is a way to organize data in a table into multiple directories based on the values of one or more columns.

    • Partitions help in improving query performance by allowing Hive to only read the relevant data directories.

    • Partitions are defined when creating a table in Hive using the PARTITIONED BY clause.

    • Example: CREATE TABLE table_name (column1 INT, column2 STRING) PARTITIONED BY (column3 STRING);

  • Answered by AI
  • Q10. What are functions in SQL?
  • Ans. 

    Functions in SQL are built-in operations that can be used to manipulate data or perform calculations within a database.

    • Functions in SQL can be used to perform operations on data, such as mathematical calculations, string manipulation, date/time functions, and more.

    • Examples of SQL functions include SUM(), AVG(), CONCAT(), UPPER(), LOWER(), DATE_FORMAT(), and many others.

    • Functions can be used in SELECT statements, WHERE ...

  • Answered by AI
  • Q11. Explain Rank, Dense_rank , row_number
  • Ans. 

    Rank, Dense_rank, and row_number are window functions used in SQL to assign a rank to each row based on a specified order.

    • Rank function assigns a unique rank to each row based on the specified order.

    • Dense_rank function assigns a unique rank to each row without any gaps based on the specified order.

    • Row_number function assigns a unique sequential integer to each row based on the specified order.

  • Answered by AI

Skills evaluated in this interview

Interview experience
3
Average
Difficulty level
Hard
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Feb 2024. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. What is explode function ?
  • Ans. 

    explode function is used in Apache Spark to split a column containing arrays into multiple rows.

    • Used in Apache Spark to split a column containing arrays into multiple rows

    • Creates a new row for each element in the array

    • Syntax: explode(col: Column): Column

    • Example: df.select(explode(col('array_column')))

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Cover your basics

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
6-8 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Dec 2024. There were 4 interview rounds.

Round 1 - Aptitude Test 

Set of questions on english , aptitude , all are at easy level

Round 2 - Coding Test 

Sql basics and some query questions

Round 3 - HR 

(2 Questions)

  • Q1. Why do u want to join zs ; what do u know about zs; where do u see urself after 5 years ; question on real life scenarios ;
  • Q2. Question on real life scenarios
Round 4 - HR 

(2 Questions)

  • Q1. Salary expectation
  • Q2. When can u join

Interview Preparation Tips

Interview preparation tips for other job seekers - prepare all topics thorougly and general scenario questions and be confident and able to provide justice to ur approach
Interview experience
2
Poor
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(3 Questions)

  • Q1. What is Linear regression
  • Q2. What li list and tuple
  • Q3. Previous company project experience
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
4-6 weeks
Result
Not Selected

I applied via Company Website and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Project related discussions
  • Q2. Meduim level SQl and DSA
Round 2 - One-on-one 

(2 Questions)

  • Q1. This was data modelling round
  • Q2. Design a uber data model
  • Ans. 

    Uber data model design for efficient storage and retrieval of ride-related information.

    • Create tables for users, drivers, rides, payments, and ratings

    • Include attributes like user_id, driver_id, ride_id, payment_id, rating_id, timestamp, location, fare, etc.

    • Establish relationships between tables using foreign keys

    • Implement indexing for faster query performance

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare SQl, Python and data modeling

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Newspaper Ad and was interviewed in Aug 2024. There were 3 interview rounds.

Round 1 - Aptitude Test 

Three sections are there 1) Aptitude Test 2) SQL 3) DSA

Round 2 - Technical 

(2 Questions)

  • Q1. What is DSA , sorting , difference between array and linked list
  • Ans. 

    DSA stands for Data Structures and Algorithms. Sorting is the process of arranging data in a particular order. Array is a data structure that stores elements of the same data type in contiguous memory locations, while linked list is a data structure that stores elements in nodes with pointers to the next node.

    • DSA stands for Data Structures and Algorithms

    • Sorting is the process of arranging data in a particular order

    • Arra...

  • Answered by AI
  • Q2. Written a SQL query
Round 3 - HR 

(2 Questions)

  • Q1. Coding question like add numbers
  • Q2. Experience on your project
  • Ans. 

    I have experience working on various data analysis projects, including market research, customer segmentation, and predictive modeling.

    • Developed predictive models to forecast customer behavior and optimize marketing strategies

    • Conducted market research to identify trends and opportunities for growth

    • Performed customer segmentation analysis to target specific demographics with personalized marketing campaigns

  • Answered by AI

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. Power BI Difference between ALL() and ALLSELECTED()
  • Ans. 

    ALL() ignores all filters in the query context, while ALLSELECTED() ignores only filters on columns in the visual.

    • ALL() removes all filters from the specified column or table.

    • ALLSELECTED() removes filters from the specified column or table, but keeps filters on other columns in the visual.

    • Example: ALL('Table') would remove all filters on the 'Table' in the query context.

    • Example: ALLSELECTED('Column') would remove filte...

  • Answered by AI
  • Q2. Excel Difference between COUNT() and COUNTA()
  • Ans. 

    COUNT() counts only numeric values, while COUNTA() counts all non-empty cells.

    • COUNT() counts only cells with numerical values.

    • COUNTA() counts all non-empty cells, including text and errors.

    • Example: COUNT(A1:A5) will count only cells with numbers, while COUNTA(A1:A5) will count all non-empty cells.

  • Answered by AI
Round 2 - Technical 

(2 Questions)

  • Q1. Resume based questions like explain the projects that you have done.
  • Q2. Sample dataset questions

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Approached by Company and was interviewed in Aug 2024. There was 1 interview round.

Round 1 - Coding Test 

Maxium sub string and reverse a string

Deloitte Interview FAQs

How many rounds are there in Deloitte Big Data Engineer interview?
Deloitte interview process usually has 3-4 rounds. The most common rounds in the Deloitte interview process are Technical, HR and Resume Shortlist.
How to prepare for Deloitte Big Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Deloitte. The most common topics and skills that interviewers at Deloitte expect are Hadoop, SCALA, Big Data, Hive and Python.
What are the top questions asked in Deloitte Big Data Engineer interview?

Some of the top questions asked at the Deloitte Big Data Engineer interview -

  1. Basics of spark Some UDF questions Joins Pivot and unpi...read more
  2. Client round interview questi...read more
  3. Partitioning, broadcast j...read more

Tell us how to improve this page.

Deloitte Big Data Engineer Interview Process

based on 2 interviews

Interview experience

4.5
  
Good
View more

Big Data Engineer Interview Questions from Similar Companies

View all
Deloitte Big Data Engineer Salary
based on 60 salaries
₹5.8 L/yr - ₹19.8 L/yr
13% more than the average Big Data Engineer Salary in India
View more details

Deloitte Big Data Engineer Reviews and Ratings

based on 1 review

5.0/5

Rating in categories

5.0

Skill development

5.0

Work-life balance

5.0

Salary

5.0

Job security

5.0

Company culture

5.0

Promotions

5.0

Work satisfaction

Explore 1 Review and Rating
Consultant
33.1k salaries
unlock blur

₹6.3 L/yr - ₹23.1 L/yr

Senior Consultant
20.8k salaries
unlock blur

₹11 L/yr - ₹42 L/yr

Analyst
14.1k salaries
unlock blur

₹3.7 L/yr - ₹12.4 L/yr

Assistant Manager
10k salaries
unlock blur

₹7.8 L/yr - ₹24 L/yr

Manager
7k salaries
unlock blur

₹15.7 L/yr - ₹52 L/yr

Explore more salaries
Compare Deloitte with

Accenture

3.9
Compare

PwC

3.4
Compare

Ernst & Young

3.4
Compare

Cognizant

3.8
Compare
Did you find this page helpful?
Yes No
write
Share an Interview