Upload Button Icon Add office photos
Premium Employer

i

This company page is being actively managed by NTT Data Team. If you also belong to the team, you can get access from here

NTT Data

Compare button icon Compare button icon Compare
3.9

based on 3.6k Reviews

Filter interviews by

NTT Data Big Data Engineer Interview Questions and Answers

Updated 22 Aug 2023

NTT Data Big Data Engineer Interview Experiences

1 interview found

Interview experience
1
Bad
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Jul 2023. There were 2 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(2 Questions)

  • Q1. Basic Questions of Scala Functional Programming concepts.
  • Q2. Spark internal working and optimization techniques
  • Ans. 

    Spark internal working and optimization techniques

    • Spark uses Directed Acyclic Graph (DAG) for optimizing workflows

    • Lazy evaluation helps in optimizing transformations by combining them into a single stage

    • Caching and persistence of intermediate results can improve performance

    • Partitioning data can help in parallel processing and reducing shuffle operations

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - The interview call was abruptly terminated into 20 mins of call duration, as the HR had another conflicting call. The HR called me over cellphone and told that if the interview panel requested she will let me know and the call can be extended, but the HR did not call, The interview did not extended. Finally they rejected me just after the panel spoke 20 mins with me in the 1st round interview. This shows how unprofessional are they in scheduling an interview call and how could any panel can decide within 20 mins of a discussion. Definitely not recommending anyone to attend Bigdata Engineering interviews here.

Skills evaluated in this interview

Interview questions from similar companies

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I was interviewed in Nov 2024.

Round 1 - One-on-one 

(7 Questions)

  • Q1. Command to check disk utilisation and health in Hadoop
  • Ans. 

    Use 'hdfs diskbalancer' command to check disk utilisation and health in Hadoop

    • Run 'hdfs diskbalancer -report' to get a report on disk utilisation

    • Use 'hdfs diskbalancer -plan <path>' to generate a plan for balancing disk usage

    • Check the Hadoop logs for any disk health issues

  • Answered by AI
  • Q2. Spark Architecture & the significance of each member of spark Architecture
  • Ans. 

    Spark Architecture consists of Driver, Cluster Manager, and Executors. Driver manages the execution of Spark jobs.

    • Driver: Manages the execution of Spark jobs, converts user code into tasks, and coordinates with Cluster Manager.

    • Cluster Manager: Manages resources across the cluster and allocates resources to Spark applications.

    • Executors: Execute tasks assigned by the Driver and store data in memory or disk for further pr...

  • Answered by AI
  • Q3. Partitioning and bucketing
  • Q4. Spark optimization techniques
  • Ans. 

    Optimization techniques in Spark improve performance and efficiency of data processing.

    • Partitioning data to distribute workload evenly

    • Caching frequently accessed data in memory

    • Using broadcast variables for small lookup tables

    • Avoiding shuffling operations whenever possible

    • Tuning memory settings and garbage collection parameters

  • Answered by AI
  • Q5. Second highest salary
  • Ans. 

    I am unable to provide this information as it is confidential.

    • Confidential information about salaries in previous organizations should not be disclosed.

    • It is important to respect the privacy and confidentiality of past employers.

    • Discussing specific salary details may not be appropriate in a professional setting.

  • Answered by AI
  • Q6. Pivot table creation in SQL from not pivot one
  • Ans. 

    To create a pivot table in SQL from a non-pivot table, you can use the CASE statement with aggregate functions.

    • Use the CASE statement to categorize data into columns

    • Apply aggregate functions like SUM, COUNT, AVG, etc. to calculate values for each category

    • Group the data by the columns you want to pivot on

  • Answered by AI
  • Q7. How to create triggers
  • Ans. 

    Creating triggers in a database involves defining the trigger, specifying the event that will activate it, and writing the code to be executed.

    • Define the trigger using the CREATE TRIGGER statement

    • Specify the event that will activate the trigger (e.g. INSERT, UPDATE, DELETE)

    • Write the code or actions to be executed when the trigger is activated

    • Test the trigger to ensure it functions as intended

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Easy to medium questions were asked.
They are focusing on concept basically

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Naukri.com and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. All type of joins with example
  • Ans. 

    Different types of joins in SQL with examples

    • Inner Join: Returns rows when there is a match in both tables

    • Left Join: Returns all rows from the left table and the matched rows from the right table

    • Right Join: Returns all rows from the right table and the matched rows from the left table

    • Full Outer Join: Returns all rows when there is a match in either table

  • Answered by AI
  • Q2. How to handle large spark datasets
  • Ans. 

    Large Spark datasets can be handled by partitioning, caching, optimizing transformations, and tuning resources.

    • Partitioning data to distribute workload evenly across nodes

    • Caching frequently accessed data to avoid recomputation

    • Optimizing transformations to reduce unnecessary processing

    • Tuning resources like memory allocation and parallelism for optimal performance

  • Answered by AI
Round 2 - Technical 

(2 Questions)

  • Q1. How do you tune sparks configuration setting's to optimize query performance
  • Ans. 

    Spark configuration settings can be tuned to optimize query performance by adjusting parameters like memory allocation, parallelism, and caching.

    • Increase executor memory and cores to allow for more parallel processing

    • Adjust shuffle partitions to optimize data shuffling during joins and aggregations

    • Enable dynamic allocation to scale resources based on workload demands

    • Utilize caching to store intermediate results and avo...

  • Answered by AI
  • Q2. What strategies do you use to handle data skew and partition imbalance in spark
  • Ans. 

    To handle data skew and partition imbalance in Spark, strategies include using salting, bucketing, repartitioning, and optimizing join operations.

    • Use salting to evenly distribute skewed keys across partitions

    • Implement bucketing to pre-partition data based on a specific column

    • Repartition data based on a specific key to balance partitions

    • Optimize join operations by broadcasting small tables or using partitioning strategi

  • Answered by AI

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
-

I applied via Company Website and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - Technical 

(3 Questions)

  • Q1. Big Data fundamentals concepts on rdd, dataframe and datasets.
  • Q2. Spark optimization techniques
  • Ans. 

    Spark optimization techniques involve partitioning, caching, and tuning resource allocation.

    • Partitioning data to distribute workload evenly

    • Caching frequently accessed data to avoid recomputation

    • Tuning resource allocation for optimal performance

  • Answered by AI
  • Q3. SQL queries on windows function and joins.

Interview Preparation Tips

Interview preparation tips for other job seekers - Data Engineer fundamentals should be well prepared

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
-

I applied via Referral and was interviewed in Dec 2024. There were 2 interview rounds.

Round 1 - Aptitude Test 

30 Questions in 20 Minutes

Round 2 - Technical 

(1 Question)

  • Q1. Baiscs of SQL,Python,AWS and spark in depth question
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Coding Test 

More questions about coding in sql & pyspark

Interview Preparation Tips

Interview preparation tips for other job seekers - Great questions & interviewer is friendly

Big Data Engineer Interview Questions & Answers

Capgemini user image Manshi Raghuvanshi

posted on 26 Jun 2024

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. Coalesce vs repartition
  • Ans. 

    Coalesce is used to reduce the number of partitions in a DataFrame, while repartition is used to increase the number of partitions.

    • Coalesce is a narrow transformation that can only decrease the number of partitions without shuffling data.

    • Repartition is a wide transformation that can both increase or decrease the number of partitions and involves shuffling data across the cluster.

    • Coalesce is more efficient for reducing ...

  • Answered by AI
Round 2 - Coding Test 

Rank vs dense rank quetions ctes

Round 3 - Coding Test 

Python data structure

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Naukri.com and was interviewed in Jun 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(1 Question)

  • Q1. Scenario based questions
Round 3 - One-on-one 

(1 Question)

  • Q1. Project explanation , internal vs external table
  • Ans. 

    Internal tables are managed by Hive, while external tables are managed by the user.

    • Internal tables are stored in a Hive-managed warehouse directory, while external tables can be stored anywhere.

    • Internal tables are deleted when the table is dropped, while external tables are not.

    • External tables can be used to access data stored in non-Hive formats, such as CSV or JSON.

    • Internal tables are typically used for temporary or ...

  • Answered by AI
Round 4 - HR 

(1 Question)

  • Q1. What is your current salary and your expectations

Interview Preparation Tips

Topics to prepare for IBM Big Data Engineer interview:
  • Spark,Hive ,Scala
Interview preparation tips for other job seekers - Prepare basics well , Try to solve some real time questions on SQL,spark

I applied via Referral and was interviewed in Mar 2022. There was 1 interview round.

Round 1 - Technical 

(3 Questions)

  • Q1. Spark optimization techniques
  • Ans. 

    Spark optimization techniques improve performance and efficiency of Spark applications.

    • Partitioning data to reduce shuffling

    • Caching frequently used data

    • Using broadcast variables for small data

    • Using efficient data formats like Parquet

    • Tuning memory and CPU usage

    • Using appropriate cluster size

    • Avoiding unnecessary data shuffling

    • Using appropriate serialization formats

    • Using appropriate join strategies

  • Answered by AI
  • Q2. What type of filesystem used in ur project
  • Ans. 

    We use Hadoop Distributed File System (HDFS) for our project.

    • HDFS is a distributed file system designed to run on commodity hardware.

    • It provides high-throughput access to application data and is fault-tolerant.

    • HDFS is used by many big data processing frameworks like Hadoop, Spark, etc.

    • It stores data in a distributed manner across multiple nodes in a cluster.

    • HDFS is optimized for large files and sequential reads and wri

  • Answered by AI
  • Q3. Spark + SQL basic coding

Interview Preparation Tips

Interview preparation tips for other job seekers - Just concentrate on the basics

Skills evaluated in this interview

I applied via Recruitment Consultant and was interviewed in Jul 2021. There were 3 interview rounds.

Interview Questionnaire 

1 Question

  • Q1. Basic concepts of big data technologies such as schemas on read and write

Interview Preparation Tips

Interview preparation tips for other job seekers - Go trough all the basic content

NTT Data Interview FAQs

How many rounds are there in NTT Data Big Data Engineer interview?
NTT Data interview process usually has 2 rounds. The most common rounds in the NTT Data interview process are Resume Shortlist and Technical.
What are the top questions asked in NTT Data Big Data Engineer interview?

Some of the top questions asked at the NTT Data Big Data Engineer interview -

  1. Spark internal working and optimization techniq...read more
  2. Basic Questions of Scala Functional Programming concep...read more

Tell us how to improve this page.

NTT Data Big Data Engineer Salary
based on 16 salaries
₹6.6 L/yr - ₹17.5 L/yr
6% less than the average Big Data Engineer Salary in India
View more details
Software Engineer
935 salaries
unlock blur

₹2.8 L/yr - ₹11 L/yr

Senior Associate
761 salaries
unlock blur

₹1.2 L/yr - ₹7.3 L/yr

Network Engineer
654 salaries
unlock blur

₹1.8 L/yr - ₹10 L/yr

Software Developer
615 salaries
unlock blur

₹2.5 L/yr - ₹13 L/yr

Senior Software Engineer
510 salaries
unlock blur

₹6.5 L/yr - ₹25.5 L/yr

Explore more salaries
Compare NTT Data with

Tata Communications

4.1
Compare

Bharti Airtel

4.0
Compare

Reliance Communications

4.0
Compare

Vodafone Idea

4.1
Compare
Did you find this page helpful?
Yes No
write
Share an Interview