Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

VIEW WINNERS
- ABECA 2025
  
  VIEW WINNERS
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
Participate in ABECA 2026

Add office photos

Engaged Employer

TCS

Compare

3.6

based on 99k Reviews

Video summary

Filter interviews by

TCS Data Engineer Interview Questions and Answers

Updated 13 Jun 2025

78 Interview questions

A Data Engineer was asked

Q. How do you call a notebook from another notebook using Databricks?

Ans.

To call a notebook from another notebook in Databricks, use the %run command followed by the path of the notebook.

Use the %run command followed by the path of the notebook to call it from another notebook.
Make sure the notebook you want to call is in the same workspace or accessible to the notebook you are calling it from.
You can also pass parameters to the notebook being called using the %run command.

🔥 Asked by recruiter 6 times

A Data Engineer was asked

Q. Are you willing to relocate?

Ans.

Yes, I am open to relocating for the right opportunity.

I am willing to relocate for the right job opportunity
I have relocated in the past for work
I am flexible and open to new experiences

A Data Engineer was asked

Q. Are you comfortable working all shifts?

Ans.

I am flexible and willing to work all shifts to meet the team's needs and project deadlines.

I understand that data engineering often requires collaboration across different time zones.
For example, I can adjust my schedule to align with team members in other regions.
I have previously worked night shifts during critical project phases to ensure timely delivery.
I believe that flexibility in shifts can enhance team pr...

A Data Engineer was asked

Q. What is the difference between DataStage and Informatica?

Ans.

Datastage and Informatica are both ETL tools used for data integration, but they have differences in terms of features and capabilities.

Datastage is developed by IBM and is known for its parallel processing capabilities, while Informatica is developed by Informatica Corporation and is known for its strong data quality features.
Datastage has a more user-friendly interface compared to Informatica, making it easier f...

What people are saying about TCS

View All

a digital marketer

Do you think they're gonna work on employees' work-life balance, OR for just publicity?

Infosys, Infosys, TCS, Genpact Revise Workplace Policies Infosys is sending a warning mail, if an employee overshoots the daily limit while working remotely, the system triggers a notification Genpact introduced a new policy to log in before 11 am But will these companies really change, or is it just a show to mask their issues?

Got a question about TCS?

Ask anonymously on communities.

A Data Engineer was asked

Q. How do you optimize Spark jobs?

Ans.

Optimizing Spark jobs involves tuning configurations, optimizing code, and utilizing resources efficiently.

Tune Spark configurations such as executor memory, cores, and parallelism
Optimize code by reducing unnecessary shuffles, caching intermediate results, and using efficient transformations
Utilize resources efficiently by monitoring job performance, scaling cluster resources as needed, and optimizing data storag...

A Data Engineer was asked

Q. How do you ingest data into your pipeline?

Ans.

I ingest data in the pipeline using tools like Apache Kafka and Apache NiFi.

Use Apache Kafka for real-time data streaming
Utilize Apache NiFi for data ingestion and transformation
Implement data pipelines using tools like Apache Spark or Apache Flink

A Data Engineer was asked

Q. How do you add a column to a DataFrame?

Ans.

To add a column in a df, use the df['new_column'] = value syntax.

Use the df['new_column'] = value syntax to add a new column to a DataFrame.
Value can be a single value, a list, or a Series.
Example: df['new_column'] = 10

Are these interview questions helpful?

A Data Engineer was asked

Q. How can we join a table without any identity columns?

Ans.

You can join tables without identity columns using other unique columns or composite keys.

Use other unique columns or composite keys to join the tables
Consider using a combination of columns to create a unique identifier for joining
If no unique columns are available, consider using a combination of non-unique columns with additional logic to ensure accurate joins

A Data Engineer was asked

Q. How can data skewness be avoided?

Ans.

Avoid data skewness by partitioning data, using sampling techniques, and optimizing queries.

Partition data to distribute evenly across nodes
Use sampling techniques to analyze data distribution
Optimize queries to prevent skewed data distribution

A Data Engineer was asked

Q. How can we optimize stored procedures?

Ans.

Optimizing stored procedures involves improving performance by reducing execution time and resource usage.

Identify and eliminate unnecessary or redundant code
Use appropriate indexing to speed up data retrieval
Avoid using cursors and loops for better performance
Update statistics regularly to help the query optimizer make better decisions
Consider partitioning large tables to improve query performance

TCS Data Engineer Interview Experiences

97 interviews found

Data Engineer Interview Questions & Answers

Anonymous

posted on 17 May 2025

Interview experience

Bad

Difficulty level

Process Duration

Result

I appeared for an interview in Apr 2025, where I was asked the following questions.

Q1. TCS Bhubaneswar Absolutely the Worst Walk-in Interview Experience Ever! I’ve never seen such a poorly organized and mismanaged interview process in my life. The entire setup was a disaster from the start. ...

Add your answer

Q2. The chaos, the waiting time, the heat, and the indifferent attitude of the panel made it one of the most frustrating and humiliating experiences ever. If you value your time, energy, and dignity — avoid th...

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - My honest advice: Don’t fall into this trap. Avoid this company’s walk-ins(Specially TCS Bhubaneswar)unless you want to be disrespected, misled, and left exhausted and disappointed. One of the Worst Walk-in Interview Experiences Ever – A Complete Disaster! I’ve attended many interviews in my career, but this was by far the worst and most mismanaged experience ever. The level of unprofessionalism and disorganization was beyond anything I could’ve imagined. They scheduled walk-in interviews for 15–17 different technologies on the same day without any proper planning. Around 700+ candidates showed up, and there was no seating arrangement, no air conditioning, not even fans. We were all left standing for hours under the harsh sun like cattle. No water, no shelter, no communication — it was pure chaos. Now, here's where it gets even more ridiculous: I went there specifically for a Snowflake position. The person who interviewed me claimed to be from the Snowflake panel — but as it turns out, there was no Snowflake requirement at all. Shockingly, two other candidates who were interviewed after me were told the same thing — that there’s no Snowflake opening, and if they wanted to proceed, they’d have to switch and interview for Azure Databricks instead. Is this fair? If you don’t have a requirement, and if you don’t have the right panel, then why call people in the first place? Why make hundreds of people wait from 9 AM to 5 PM in the heat with zero basic facilities and then casually tell them there’s no opening for the technology they applied for? And the panelists? Extremely rude and arrogant — they talked down to candidates as if we were wasting their time, when in fact they were wasting ours.

Data Engineer Interview Questions & Answers

Anonymous

posted on 17 Oct 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

Result

Not Selected

I applied via Walk-in

Round 1 - Technical

(2 Questions)

Q1. Difference between rank and dense_rank, Left vs Left anti join

Ans.

Rank assigns unique ranks to rows, while dense_rank handles ties by assigning the same rank to tied rows. Left join includes all rows from the left table and matching rows from the right table, while left anti join includes only rows from the left table that do not have a match in the right table.

Rank assigns unique ranks to rows based on the specified order, while dense_rank handles ties by assigning the same rank to ...

Answered by AI

Add your answer

Q2. Python list comprehension, SQL query

Add your answer

Round 2 - Behavioral

(1 Question)

Q1. Project related questions

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - No response from HR, even after clearing technical and managerial rounds

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Anonymous

posted on 20 Sep 2024

Interview experience

Good

Difficulty level

Easy

Process Duration

Less than 2 weeks

Result

Selected

I applied via Recruitment Consulltant and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - One-on-one

(3 Questions)

Q1. Questions from pyspark - theoritical

Add your answer

Q2. Pyspark basic coding

Add your answer

Q3. SQL easy level coding question

Add your answer

Round 2 - One-on-one

(1 Question)

Q1. 2 nd round was managerial. The interviewer asked about projects worked and assessed how I fit for the company

Add your answer

Interview Preparation Tips

Topics to prepare for TCS Data Engineer interview:

pyspark
databricks
SQL
projects

Data Engineer Interview Questions & Answers

Anonymous

posted on 10 Oct 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Aptitude Test

Focus of quantitative maths and aptitude a bit more

Round 2 - Technical

(3 Questions)

Q1. Some basic questions on your comfortable programming knowledge

Add your answer

Q2. Some behaviour oriented questions

Add your answer

Q3. Some role related questions

Add your answer

Round 3 - HR

(2 Questions)

Q1. Some behaviour oriented

Add your answer

Q2. Salary discussion

Add your answer

Data Engineer Interview Questions & Answers

Mohammed Suboor Ahmed

posted on 24 Nov 2024

Interview experience

Excellent

Difficulty level

Easy

Process Duration

Less than 2 weeks

Result

Selected

I applied via LinkedIn and was interviewed in Oct 2024. There was 1 interview round.

Round 1 - Technical

(2 Questions)

Q1. Reverse string in python list

Ans.

Reverse strings in a Python list

Use list comprehension to iterate through the list and reverse each string
Use the slice notation [::-1] to reverse each string
Example: strings = ['hello', 'world'], reversed_strings = [s[::-1] for s in strings]

Answered by AI

Add your answer

Q2. 2nd highest salary sql

Ans.

To find the 2nd highest salary in SQL, use the 'SELECT' statement with 'ORDER BY' and 'LIMIT' clauses.

Use the 'SELECT' statement to retrieve the salary column from the table.
Use the 'ORDER BY' clause to sort the salaries in descending order.
Use the 'LIMIT' clause to limit the result to the second row.

Answered by AI

Add your answer

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Anonymous

posted on 11 Oct 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I appeared for an interview in Sep 2024.

Round 1 - One-on-one

(3 Questions)

Q1. Spark architecture

Add your answer

Q2. Databricks clusters

Add your answer

Q3. Copy activity and all features

Add your answer

Data Engineer Interview Questions & Answers

Anonymous

posted on 4 Oct 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Result

I applied via Approached by Company and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - Technical

(2 Questions)

Q1. SCD 1 vs SCD 2

Ans.

SCD 1 overwrites old data with new data, while SCD 2 keeps track of historical changes.

SCD 1 updates existing records with new data, losing historical information.
SCD 2 creates new records for each change, preserving historical data.
SCD 1 is simpler and faster, but can lead to data loss.
SCD 2 is more complex and slower, but maintains a full history of changes.

Answered by AI

Add your answer

Q2. Corrupt Record Handling in Spark

Ans.

Corrupt record handling in Spark involves identifying and handling data that does not conform to expected formats.

Use DataFrameReader option("badRecordsPath", "path/to/bad/records") to save corrupt records to a separate location for further analysis.
Use DataFrame.na.drop() or DataFrame.na.fill() to handle corrupt records by dropping or filling missing values.
Implement custom logic to identify and handle corrupt records...

Answered by AI

Add your answer

Interview Preparation Tips

Topics to prepare for TCS Data Engineer interview:

Python
PySpark
SQL
ETL

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Anonymous

posted on 30 Aug 2024

Interview experience

Average

Difficulty level

Process Duration

Result

Round 1 - One-on-one

(3 Questions)

Q1. What is oops concept

Ans.

Object-oriented programming (OOP) is a programming paradigm based on the concept of objects, which can contain data in the form of fields and code in the form of procedures.

OOP focuses on creating objects that interact with each other to solve a problem
Key concepts include encapsulation, inheritance, polymorphism, and abstraction
Encapsulation involves bundling data and methods that operate on the data into a single uni...

Answered by AI

Add your answer

Q2. Explain data engineer life cycle and its tools

Ans.

Data engineer life cycle involves collecting, storing, processing, and analyzing data using various tools.

Data collection: Gathering data from various sources such as databases, APIs, and logs.
Data storage: Storing data in databases, data lakes, or data warehouses.
Data processing: Cleaning, transforming, and enriching data using tools like Apache Spark or Hadoop.
Data analysis: Analyzing data to extract insights and mak...

Answered by AI

Add your answer

Q3. What types of spark join strategies

Ans.

Spark join strategies include broadcast join, shuffle hash join, and shuffle sort merge join.

Broadcast join is used when one of the DataFrames is small enough to fit in memory on all nodes.
Shuffle hash join is used when joining two large DataFrames by partitioning and shuffling the data based on the join key.
Shuffle sort merge join is used when joining two large DataFrames by sorting and merging the data based on the j...

Answered by AI

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - prepared hadoop ecosystem related question ,sql and one programing language (python,java,sacala)

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Anonymous

posted on 5 Oct 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. What is Spark? Why it is so popular

Ans.

Spark is a fast and general-purpose cluster computing system for big data processing.

Spark is popular for its speed and ease of use in processing large datasets.
It provides in-memory processing capabilities, making it faster than traditional disk-based processing systems.
Spark supports multiple programming languages like Java, Scala, Python, and R.
It offers a wide range of libraries for diverse tasks such as SQL, strea...

Answered by AI

Add your answer

Q2. What is Clustering? what is difference between pods and nodes?

Ans.

Clustering is the process of grouping similar data points together. Pods are groups of one or more containers, while nodes are individual machines in a cluster.

Clustering is a technique used in machine learning to group similar data points together based on certain features or characteristics.
Pods in a cluster are groups of one or more containers that share resources and are scheduled together on the same node.
Nodes ar...

Answered by AI

Add your answer

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Himanshu Pilkhwal

posted on 19 Dec 2024

Interview experience

Poor

Difficulty level

Moderate

Process Duration

Result

No response

Round 1 - Technical

(1 Question)

Q1. Regarding Pipeline scheduling multiple questions

Add your answer

TCS Interview FAQs

How many rounds are there in TCS Data Engineer interview?

TCS interview process usually has 1-2 rounds. The most common rounds in the TCS interview process are Technical, HR and Coding Test.

How to prepare for TCS Data Engineer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at TCS. The most common topics and skills that interviewers at TCS expect are Python, SQL, Spark, AWS and Big Data.

What are the top questions asked in TCS Data Engineer interview?

Some of the top questions asked at the TCS Data Engineer interview -

what is an internal and external table in H...read more
what is view in SQL and dense and dense r...read more
How to deal with data quality iss...read more

How long is the TCS Data Engineer interview process?

The duration of TCS Data Engineer interview process can vary, but typically it takes about less than 2 weeks to complete.

Tell us how to improve this page.

TCS Interviews By Designations

Interview Questions for Popular Designations

4/5

based on 101 interview experiences

Difficulty level

Easy 22%

Moderate 73%

Hard 5%

Duration

Less than 2 weeks 68%

2-4 weeks 18%

4-6 weeks 7%

6-8 weeks 5%

More than 8 weeks 2%

Top Skills for TCS Data Engineer

Python Interview Questions & Answers

400 Questions

SQL Interview Questions & Answers

250 Questions

Big Data Interview Questions & Answers

250 Questions

Cloud Computing Interview Questions & Answers

250 Questions

Data Structures Interview Questions & Answers

250 Questions

Spark Interview Questions & Answers

50 Questions

Accenture Data Engineer Interview Questions

3.8

• 80 Interviews

LTIMindtree Data Engineer Interview Questions

3.7

• 63 Interviews

IBM Data Engineer Interview Questions

4.0

• 41 Interviews

Capgemini Data Engineer Interview Questions

3.7

• 37 Interviews

Cognizant Data Engineer Interview Questions

3.7

• 32 Interviews

Infosys Data Engineer Interview Questions

3.6

• 30 Interviews

Wipro Data Engineer Interview Questions

3.7

• 26 Interviews

Tech Mahindra Data Engineer Interview Questions

3.5

• 16 Interviews

HCLTech Data Engineer Interview Questions

3.5

• 13 Interviews

Genpact Data Engineer Interview Questions

3.8

• 9 Interviews

View all

TCS Data Engineer Salary

based on 6.6k salaries

₹4.3 L/yr - ₹11 L/yr

40% less than the average Data Engineer Salary in India

View more details

Data Engineer Jobs at TCS

Data Engineer

Chennai

7-12 Yrs

Not Disclosed

Sr. AWS Databricks Data Engineer

Kolkata,

Pune

6-11 Yrs

Not Disclosed

AWS Databricks Data Engineer

Hyderabad / Secunderabad,

Bangalore / Bengaluru

6-11 Yrs

Not Disclosed

Explore more jobs

TCS Salaries in India

System Engineer 1.1L salaries	₹3.9 L/yr - ₹8.3 L/yr
IT Analyst 65.5k salaries	₹7.7 L/yr - ₹12.7 L/yr
AST Consultant 53.6k salaries	₹12 L/yr - ₹20.6 L/yr
Assistant System Engineer 33.2k salaries	₹2.5 L/yr - ₹6.4 L/yr
Associate Consultant 33k salaries	₹16.2 L/yr - ₹28 L/yr