Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by IBM Team. If you also belong to the team, you can get access from here

IBM Verified Tick

Compare button icon Compare button icon Compare
4.1

based on 21.3k Reviews

Filter interviews by

IBM Big Data Engineer Interview Questions and Answers

Updated 1 Dec 2022

IBM Big Data Engineer Interview Experiences

3 interviews found

I applied via Naukri.com and was interviewed in Jan 2021. There were 5 interview rounds.

Interview Questionnaire 

2 Questions

  • Q1. Mostly on Hive, scala questions for first round which is of 43 theory questions and one coding question was asked for
  • Q2. 2nd and 3rd round would be on F2F online technical round where it was completely conversing about your project and the stuffs you had used while working. It would basically purely on your work experience s...

Interview Preparation Tips

Interview preparation tips for other job seekers - Do not panic, answer as much as u can never leave anything unattended, never lose hope, never have over confidence, ease yourself while attending and I say you definitely crack it if you have hands on experience
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Naukri.com and was interviewed in Jun 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(1 Question)

  • Q1. Scenario based questions
Round 3 - One-on-one 

(1 Question)

  • Q1. Project explanation , internal vs external table
  • Ans. 

    Internal tables are managed by Hive, while external tables are managed by the user.

    • Internal tables are stored in a Hive-managed warehouse directory, while external tables can be stored anywhere.

    • Internal tables are deleted when the table is dropped, while external tables are not.

    • External tables can be used to access data stored in non-Hive formats, such as CSV or JSON.

    • Internal tables are typically used for temporary or ...

  • Answered by AI
Round 4 - HR 

(1 Question)

  • Q1. What is your current salary and your expectations

Interview Preparation Tips

Topics to prepare for IBM Big Data Engineer interview:
  • Spark,Hive ,Scala
Interview preparation tips for other job seekers - Prepare basics well , Try to solve some real time questions on SQL,spark

Big Data Engineer Interview Questions Asked at Other Companies

Q1. Difference between partitioning and bucketing. Types of joins in ... read more
asked in TCS
Q2. What optimization techniques have you utilized in your projects? ... read more
Q3. Checking whether a fibonacci number is present between a particuk ... read more
asked in TCS
Q4. What is the difference between lineage and directed acyclic graph ... read more
asked in TCS
Q5. What is the difference between cache and persistence?

Big Data Engineer Interview Questions & Answers

user image Prashant Kumar

posted on 19 Nov 2021

I applied via Referral and was interviewed before Nov 2020. There were 4 interview rounds.

Interview Questionnaire 

3 Questions

  • Q1. Spark architecture, Optimization technique , Hive , sql query
  • Q2. Python programming for list and string
  • Ans. 

    Python has built-in functions for manipulating lists and strings.

    • Lists are mutable and can be modified using various methods like append(), insert(), remove(), etc.

    • Strings are immutable and can be manipulated using slicing, concatenation, and various string methods.

    • List comprehension and string formatting are powerful tools for working with lists and strings.

    • Python also has powerful libraries like NumPy and Pandas for

  • Answered by AI
  • Q3. Unix and shell scripting

Interview Preparation Tips

Interview preparation tips for other job seekers - Focus on basics and rest will be clear

Skills evaluated in this interview

Interview questions from similar companies

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Naukri.com and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. All type of joins with example
  • Ans. 

    Different types of joins in SQL with examples

    • Inner Join: Returns rows when there is a match in both tables

    • Left Join: Returns all rows from the left table and the matched rows from the right table

    • Right Join: Returns all rows from the right table and the matched rows from the left table

    • Full Outer Join: Returns all rows when there is a match in either table

  • Answered by AI
  • Q2. How to handle large spark datasets
  • Ans. 

    Large Spark datasets can be handled by partitioning, caching, optimizing transformations, and tuning resources.

    • Partitioning data to distribute workload evenly across nodes

    • Caching frequently accessed data to avoid recomputation

    • Optimizing transformations to reduce unnecessary processing

    • Tuning resources like memory allocation and parallelism for optimal performance

  • Answered by AI
Round 2 - Technical 

(2 Questions)

  • Q1. How do you tune sparks configuration setting's to optimize query performance
  • Ans. 

    Spark configuration settings can be tuned to optimize query performance by adjusting parameters like memory allocation, parallelism, and caching.

    • Increase executor memory and cores to allow for more parallel processing

    • Adjust shuffle partitions to optimize data shuffling during joins and aggregations

    • Enable dynamic allocation to scale resources based on workload demands

    • Utilize caching to store intermediate results and avo...

  • Answered by AI
  • Q2. What strategies do you use to handle data skew and partition imbalance in spark
  • Ans. 

    To handle data skew and partition imbalance in Spark, strategies include using salting, bucketing, repartitioning, and optimizing join operations.

    • Use salting to evenly distribute skewed keys across partitions

    • Implement bucketing to pre-partition data based on a specific column

    • Repartition data based on a specific key to balance partitions

    • Optimize join operations by broadcasting small tables or using partitioning strategi

  • Answered by AI

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
-

I applied via Company Website and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - Technical 

(3 Questions)

  • Q1. Big Data fundamentals concepts on rdd, dataframe and datasets.
  • Q2. Spark optimization techniques
  • Ans. 

    Spark optimization techniques involve partitioning, caching, and tuning resource allocation.

    • Partitioning data to distribute workload evenly

    • Caching frequently accessed data to avoid recomputation

    • Tuning resource allocation for optimal performance

  • Answered by AI
  • Q3. SQL queries on windows function and joins.

Interview Preparation Tips

Interview preparation tips for other job seekers - Data Engineer fundamentals should be well prepared

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
-

I applied via Company Website and was interviewed in Apr 2024. There were 3 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. Mostly about spark topics like shuffeling joins, data skew, and pyspark scenarios
  • Q2. Mostly about python and sql scenarios
Round 2 - Behavioral 

(2 Questions)

  • Q1. About behavioural questions
  • Q2. Day to day work and workflow model based questions
Round 3 - HR 

(2 Questions)

  • Q1. Why you need to join infosys
  • Ans. 

    I want to join Infosys because of its reputation for innovation and growth opportunities.

    • Infosys is known for its cutting-edge technology solutions and innovative projects.

    • I am impressed by Infosys' commitment to employee development and career growth.

    • I believe that joining Infosys will provide me with the opportunity to work on challenging projects and enhance my skills.

  • Answered by AI
  • Q2. Self introduction

Interview Preparation Tips

Interview preparation tips for other job seekers - be prepare well in pyspark and sql, python and practise well in scenario based questions
Interview experience
2
Poor
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Company Website and was interviewed in Apr 2024. There were 3 interview rounds.

Round 1 - Coding Test 

Questions on software and system designs

Round 2 - Group Discussion 

Helps employer identify particular personality traits like leadership, confidence, interpersonal and teamwork skills of potential employees

Round 3 - One-on-one 

(2 Questions)

  • Q1. Why do you want this job? Again, companies want to hire people who are passionate about the job, so you should have a great answer about why you want the ...
  • Q2. This opportunity can help me in many ways to boost my confidence as well as drive my career forward

Interview Preparation Tips

Topics to prepare for Tech Mahindra Big Data Engineer interview:
  • The role of technology in educat
Interview preparation tips for other job seekers - Your search is likely to be much more successful if you approach it with initiative, creativity and a positive attitude. Nobody "owes" you a job – the future ...
Job Search Tips & Resources
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Coding Test 

More questions about coding in sql & pyspark

Interview Preparation Tips

Interview preparation tips for other job seekers - Great questions & interviewer is friendly
Interview experience
2
Poor
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Job Portal and was interviewed in Sep 2023. There were 3 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Aptitude Test 

Easy only so prepare well that's it

Round 3 - Technical 

(5 Questions)

  • Q1. What method you use
  • Ans. 

    I use a combination of programming languages, tools, and frameworks to analyze and process large datasets.

    • Utilize programming languages like Python, Java, or Scala for data processing

    • Leverage tools like Hadoop, Spark, or Kafka for distributed computing

    • Implement frameworks like MapReduce or Apache Flink for data analysis

    • Use SQL or NoSQL databases for data storage and retrieval

  • Answered by AI
  • Q2. Why you lleave current company
  • Q3. What skill you have
  • Q4. Tell about ur project
  • Q5. What you implemented
  • Ans. 

    Implemented a real-time data processing system using Apache Kafka and Spark for analyzing customer behavior.

    • Developed data pipelines to ingest, process, and analyze large volumes of data

    • Utilized Apache Kafka for real-time data streaming

    • Implemented machine learning algorithms for predictive analytics

    • Optimized data storage and retrieval for faster query performance

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Do well easy

Skills evaluated in this interview

Big Data Engineer Interview Questions & Answers

Capgemini user image Manshi Raghuvanshi

posted on 26 Jun 2024

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. Coalesce vs repartition
  • Ans. 

    Coalesce is used to reduce the number of partitions in a DataFrame, while repartition is used to increase the number of partitions.

    • Coalesce is a narrow transformation that can only decrease the number of partitions without shuffling data.

    • Repartition is a wide transformation that can both increase or decrease the number of partitions and involves shuffling data across the cluster.

    • Coalesce is more efficient for reducing ...

  • Answered by AI
Round 2 - Coding Test 

Rank vs dense rank quetions ctes

Round 3 - Coding Test 

Python data structure

IBM Interview FAQs

How many rounds are there in IBM Big Data Engineer interview?
IBM interview process usually has 4 rounds. The most common rounds in the IBM interview process are Resume Shortlist, Technical and One-on-one Round.
How to prepare for IBM Big Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at IBM. The most common topics and skills that interviewers at IBM expect are Big Data, SQL, Spark, Python and Hadoop.
What are the top questions asked in IBM Big Data Engineer interview?

Some of the top questions asked at the IBM Big Data Engineer interview -

  1. Python programming for list and str...read more
  2. Mostly on Hive, scala questions for first round which is of 43 theory questions...read more
  3. Spark architecture, Optimization technique , Hive , sql qu...read more

Tell us how to improve this page.

People are getting interviews through

based on 3 IBM interviews
Job Portal
Referral
67%
33%
Moderate Confidence
?
Moderate Confidence means the data is based on a sufficient number of responses received from the candidates
IBM Big Data Engineer Salary
based on 250 salaries
₹6 L/yr - ₹23.9 L/yr
25% more than the average Big Data Engineer Salary in India
View more details

IBM Big Data Engineer Reviews and Ratings

based on 20 reviews

3.9/5

Rating in categories

3.8

Skill development

4.3

Work-Life balance

3.3

Salary & Benefits

3.8

Job Security

3.9

Company culture

2.7

Promotions/Appraisal

3.7

Work Satisfaction

Explore 20 Reviews and Ratings
Application Developer
11.5k salaries
unlock blur

₹5.5 L/yr - ₹23.6 L/yr

Software Engineer
5.4k salaries
unlock blur

₹4.8 L/yr - ₹22.6 L/yr

Advisory System Analyst
5.2k salaries
unlock blur

₹9.2 L/yr - ₹27 L/yr

Senior Software Engineer
5k salaries
unlock blur

₹8 L/yr - ₹30 L/yr

Senior Systems Engineer
4.6k salaries
unlock blur

₹5.6 L/yr - ₹18.5 L/yr

Explore more salaries
Compare IBM with

Oracle

3.7
Compare

TCS

3.7
Compare

Cognizant

3.8
Compare

Accenture

3.9
Compare

Calculate your in-hand salary

Confused about how your in-hand salary is calculated? Enter your annual salary (CTC) and get your in-hand salary
Did you find this page helpful?
Yes No
write
Share an Interview