Capgemini Data Engineer Interview Questions
I was interviewed in Mar 2024.
Set Spark configuration with appropriate memory and cores for efficient processing of 2 GB of data
Increase executor memory and cores to handle larger data sizes
Adjust Spark memory overhead to prevent out-of-memory errors
Optimize shuffle partitions for better performance
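A minimal PySpark sketch of the configuration points above, with illustrative values for a ~2 GB input; every figure is an assumption and should be tuned to the actual cluster (executor settings are normally fixed at application or cluster launch):

from pyspark.sql import SparkSession

# Illustrative values only; adjust to the environment.
spark = (
    SparkSession.builder
    .appName("process_2gb_dataset")
    .config("spark.executor.memory", "4g")            # executor heap
    .config("spark.executor.cores", "2")              # cores per executor
    .config("spark.executor.memoryOverhead", "512m")  # off-heap headroom to avoid OOM
    .config("spark.sql.shuffle.partitions", "32")     # fewer partitions suit a small dataset
    .getOrCreate()
)

# Hypothetical input/output paths.
df = spark.read.parquet("/data/input")
df.groupBy("key").count().write.mode("overwrite").parquet("/data/output")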
Use dbutils.notebook.run() command to run a child notebook in a parent notebook
Use dbutils.notebook.run() command with the path to the child notebook and any parameters needed
Ensure that the child notebook is accessible and has necessary permissions
Handle any return values or errors from the child notebook appropriately
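A minimal sketch of the parent-notebook call in Databricks; the child notebook path and parameter names are hypothetical:

# dbutils is available inside a Databricks notebook.
try:
    # run(path, timeout_seconds, arguments) returns whatever the child passes
    # to dbutils.notebook.exit().
    result = dbutils.notebook.run(
        "/Workspace/etl/child_notebook",   # hypothetical path
        60,                                # timeout in seconds
        {"run_date": "2024-03-01"},        # hypothetical parameters (widgets in the child)
    )
    print(f"Child notebook returned: {result}")
except Exception as err:
    print(f"Child notebook failed: {err}")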
I applied via Referral and was interviewed in Jun 2024. There was 1 interview round.
Databricks and Azure Synapse Notebook are both cloud-based platforms for data engineering and analytics.
Databricks is primarily focused on big data processing and machine learning, while Azure Synapse Notebook is part of a larger analytics platform.
Databricks provides a collaborative environment for data scientists and engineers to work together, while Azure Synapse Notebook is integrated with other Azure services for end-to-end analytics workflows.
I applied via Naukri.com and was interviewed in Feb 2024. There were 2 interview rounds.
BQ stands for BigQuery, a fully managed, serverless, and highly scalable cloud data warehouse provided by Google Cloud.
Advantages of BigQuery include fast query performance due to its distributed architecture
Scalability to handle large datasets without the need for infrastructure management
Integration with other Google Cloud services like Dataflow, Dataproc, and Data Studio
Support for standard SQL queries and real-time analytics on streaming data
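A minimal sketch of running a standard SQL query from Python with the google-cloud-bigquery client library; the project, dataset, and table names are hypothetical:

from google.cloud import bigquery

client = bigquery.Client()  # uses application default credentials

query = """
    SELECT name, COUNT(*) AS cnt
    FROM `my_project.my_dataset.events`   -- hypothetical table
    GROUP BY name
    ORDER BY cnt DESC
    LIMIT 10
"""

for row in client.query(query).result():
    print(row.name, row.cnt)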
I applied via Referral and was interviewed in Mar 2024. There were 2 interview rounds.
Snowflake architecture is a cloud-based data warehousing solution that separates storage and compute resources for scalability and performance.
Snowflake uses a unique architecture with three layers: storage, compute, and services.
Storage layer stores data in a columnar format for efficient querying.
Compute layer processes queries independently, allowing for elastic scalability.
Services layer manages metadata, security, authentication, and query optimization.
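A minimal sketch of how the compute/storage separation shows up in practice: a virtual warehouse (compute) can be created and resized independently of the tables it queries. This assumes the snowflake-connector-python package; the account, credentials, and object names are hypothetical:

import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account",   # hypothetical credentials
    user="my_user",
    password="***",
)
cur = conn.cursor()

# Compute layer: create or resize a warehouse without touching stored data.
cur.execute("CREATE WAREHOUSE IF NOT EXISTS etl_wh WAREHOUSE_SIZE = 'XSMALL'")
cur.execute("ALTER WAREHOUSE etl_wh SET WAREHOUSE_SIZE = 'MEDIUM'")

# Storage layer: the same tables are visible from any warehouse.
cur.execute("USE WAREHOUSE etl_wh")
cur.execute("SELECT COUNT(*) FROM my_db.my_schema.orders")  # hypothetical table
print(cur.fetchone()[0])

cur.close()
conn.close()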
SQL joins are used to combine rows from two or more tables based on a related column between them.
Use INNER JOIN to return rows when there is at least one match in both tables
Use LEFT JOIN to return all rows from the left table, and the matched rows from the right table
Use RIGHT JOIN to return all rows from the right table, and the matched rows from the left table
Use FULL JOIN to return rows when there is a match in one of the tables
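A minimal PySpark sketch of the four join types on hypothetical sample data; the same semantics apply in plain SQL:

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("join_examples").getOrCreate()

emp = spark.createDataFrame(
    [(1, "Asha", 10), (2, "Ravi", 20), (3, "Meera", 99)],
    ["emp_id", "name", "dept_id"],
)
dept = spark.createDataFrame(
    [(10, "Finance"), (20, "HR"), (30, "IT")],
    ["dept_id", "dept_name"],
)

emp.join(dept, "dept_id", "inner").show()  # only matching dept_ids (10, 20)
emp.join(dept, "dept_id", "left").show()   # all employees; NULL dept_name for dept_id 99
emp.join(dept, "dept_id", "right").show()  # all departments; NULL name for dept_id 30
emp.join(dept, "dept_id", "full").show()   # union of both sides, NULLs where unmatched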
Time series
Logical reasoning
Family tree
Spark is a distributed computing system that provides an interface for programming clusters with implicit data parallelism.
Spark is built on the concept of Resilient Distributed Datasets (RDDs), which are fault-tolerant collections of objects.
It supports various programming languages such as Scala, Java, Python, and R.
Spark provides high-level APIs for distributed data processing, including transformations and actions.
...
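A minimal PySpark sketch of the RDD model described above, with lazy transformations followed by an action:

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("rdd_example").getOrCreate()
sc = spark.sparkContext

# An RDD: a fault-tolerant collection partitioned across the cluster.
numbers = sc.parallelize(range(1, 101), numSlices=4)

# Transformations (lazy): build the lineage of squared even numbers.
squares_of_evens = numbers.filter(lambda n: n % 2 == 0).map(lambda n: n * n)

# Action: triggers distributed execution and returns a result to the driver.
print(squares_of_evens.sum())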
Lazy evaluation is a strategy used by Spark to delay the execution of transformations until an action is called.
Lazy evaluation improves performance by optimizing the execution plan
Transformations in Spark are not executed immediately, but rather recorded as a lineage graph
Actions trigger the execution of the transformations and produce a result
Lazy evaluation allows Spark to optimize the execution plan by combining and reordering transformations before execution
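A minimal sketch of lazy evaluation with DataFrames: the transformations below only record lineage, and nothing executes until the action at the end (the column name is illustrative):

from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("lazy_eval").getOrCreate()

df = spark.range(1_000_000)                          # no job runs yet
doubled = df.withColumn("doubled", F.col("id") * 2)  # still only lineage
filtered = doubled.filter(F.col("doubled") > 10)     # still only lineage

filtered.explain()       # shows the optimized plan Spark built from the lineage
print(filtered.count())  # action: the plan actually executes here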
Left join returns all records from the left table and the matching records from the right table.
Inner join returns only the matching records from both tables.
Left join includes all records from the left table, even if there are no matches in the right table.
Inner join excludes the non-matching records from both tables.
Left join is used to retrieve all records from one table and the matching records from another table.
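A small sketch of the difference using Spark SQL over hypothetical temp views: the left join keeps the customer with no orders, while the inner join drops it:

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("left_vs_inner").getOrCreate()

customers = spark.createDataFrame(
    [(1, "Anu"), (2, "Bala"), (3, "Chitra")], ["cust_id", "cust_name"]
)
orders = spark.createDataFrame([(101, 1), (102, 2)], ["order_id", "cust_id"])
customers.createOrReplaceTempView("customers")
orders.createOrReplaceTempView("orders")

# LEFT JOIN: 3 rows -- Chitra appears with a NULL order_id.
spark.sql("""
    SELECT c.cust_name, o.order_id
    FROM customers c
    LEFT JOIN orders o ON c.cust_id = o.cust_id
""").show()

# INNER JOIN: 2 rows -- only customers that have orders.
spark.sql("""
    SELECT c.cust_name, o.order_id
    FROM customers c
    INNER JOIN orders o ON c.cust_id = o.cust_id
""").show()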
Project structure refers to the organization of files, folders, and resources within a project.
Project structure should be logical and easy to navigate
Common structures include separating code into modules, organizing files by type (e.g. scripts, data, documentation), and using version control
Example: A data engineering project may have folders for data extraction, transformation, loading, and documentation
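A hypothetical folder layout along the lines described above, for a small data engineering project:

data-pipeline/
    extraction/        # ingestion scripts and source connectors
    transformation/    # cleaning and business-logic transforms
    loading/           # writers to the warehouse or lake
    tests/             # unit tests for each stage
    docs/              # data dictionary, runbooks
    config/            # environment-specific settings
    README.md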
The duration of the Capgemini Data Engineer interview process can vary, but it typically takes less than 2 weeks to complete (based on 40 interviews). Candidates generally reported 2 interview rounds.
Capgemini salaries by designation (reported salaries | salary range):
Consultant: 55.2k salaries | ₹5.2 L/yr - ₹17.5 L/yr
Associate Consultant: 50.8k salaries | ₹3 L/yr - ₹10 L/yr
Senior Consultant: 46.1k salaries | ₹7.5 L/yr - ₹24.5 L/yr
Senior Analyst: 20.6k salaries | ₹2 L/yr - ₹7.5 L/yr
Senior Software Engineer: 20.2k salaries | ₹3.5 L/yr - ₹12.1 L/yr