Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 2K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

WINNERS AWAITED!
- ABECA 2025
  
  WINNERS AWAITED!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

Add office photos

Engaged Employer

Tech Mahindra

Compare

3.5

based on 34.9k Reviews

Video summary

Filter interviews by

Tech Mahindra AWS Data Engineer Interview Questions and Answers

Updated 7 Jan 2025

Tech Mahindra AWS Data Engineer Interview Experiences

1 interview found

AWS Data Engineer Interview Questions & Answers

Anonymous

posted on 7 Jan 2025

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(1 Question)

Q1. Explain How Lambda Asynchronous invokation works

Add your answer

Interview questions from similar companies

AWS Data Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 17 Mar 2024

Interview experience

Excellent

Difficulty level

Easy

Process Duration

Less than 2 weeks

Result

Selected

I applied via Approached by Company and was interviewed in Feb 2024. There were 2 interview rounds.

Round 1 - Technical

(2 Questions)

Q1. Question on Project related. Pyspark question on transformation. AWS related questions, mostly of Glue jobs.

Add your answer

Q2. Modify null salary with avg salary, find count of employees by joining date. Configurations needed Glue job. What are connecters and Data connections in Glue service.

Ans.

Use Glue job to modify null salaries with average salary and find count of employees by joining date.

Create a Glue job to read data, modify null salaries with average salary, and count employees by joining date
Use Glue connectors to connect to data sources like S3, RDS, or Redshift
Data connections in Glue service are used to define the connection information to data sources
Example: Use Glue job to read employee data fr...

Answered by AI

Add your answer

Round 2 - Technical

(1 Question)

Q1. Project related questions and some behavior related questions.

Add your answer

Skills evaluated in this interview

AWS Data Engineer Interview Questions & Answers

TCS

Anonymous

posted on 1 Apr 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via Recruitment Consulltant and was interviewed in Mar 2024. There were 2 interview rounds.

Round 1 - Technical

(2 Questions)

Q1. Question based on resume, cloud databases, EMR, Athena, Ec2, Sql

Add your answer

Q2. Scenario based questions

Add your answer

Round 2 - One-on-one

(1 Question)

Q1. Managerial questions

Add your answer

AWS Data Engineer Interview Questions & Answers

Cognizant

Anonymous

posted on 4 Feb 2023

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

Selected

I applied via Naukri.com and was interviewed in Aug 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

View all tips

Round 2 - One-on-one

(2 Questions)

Q1. Tell ne about your self , experience in brief

Add your answer

Q2. Why Not other cloud why you choose AWS

Ans.

AWS offers the most comprehensive and flexible cloud platform with unmatched scalability, security, and reliability.

AWS has the largest and most mature cloud ecosystem with a wide range of services and tools
AWS provides unmatched scalability and flexibility to meet the needs of any organization
AWS has a strong focus on security and compliance, with a wide range of certifications and compliance programs
AWS offers high a...

Answered by AI

Add your answer

Round 3 - Technical

(2 Questions)

Q1. How aws glue works ?

Ans.

AWS Glue is a fully managed ETL service that makes it easy to move data between data stores.

AWS Glue crawls your data sources and constructs a data catalog using metadata.
It then generates ETL code to transform and move data between data stores.
AWS Glue supports various data sources like Amazon S3, JDBC, Amazon RDS, etc.
It also provides a serverless environment to run your ETL jobs.
AWS Glue integrates with other AWS se

Answered by AI

Add your answer

Q2. What is AWS kinesis?

Ans.

AWS Kinesis is a managed service that enables real-time processing of streaming data at scale.

Kinesis can handle large amounts of data in real-time from various sources such as IoT devices, social media, and logs.
It allows data to be processed in real-time using AWS Lambda, Kinesis Analytics, or Kinesis Data Firehose.
Kinesis can be used for various use cases such as real-time analytics, machine learning, and fraud dete...

Answered by AI

Add your answer

Round 4 - HR

(3 Questions)

Q1. Very short discussion about salary pakage

Add your answer

Q2. They will ask fews question only related documents you have or not

Add your answer

Q3. Accepted or not as they offering you

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - As per my interview experience..do not go fully black at least you must know about basic knowledge, connections between that , concept. mostly candidate rejected because of insufficient knowledge and main thing do not fake it in front of technical staff , that time No one extract you . Whenever you apply for any position first you see you have full knowledge or related knowledge both are different. Because technical staff not like to heard you related he / she want deep knowledge if you experience or fresher you must about concepts , lab practices , connections which you said .

Skills evaluated in this interview

AWS Data Engineer Interview Questions & Answers

Accenture

Pranjal Gupta

posted on 6 Dec 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via Approached by Company and was interviewed before Dec 2023. There was 1 interview round.

Round 1 - Technical

(2 Questions)

Q1. SQL questions related to windowing functions

Add your answer

Q2. Spark theory questions

Add your answer

AWS Data Engineer Interview Questions & Answers

Wipro

Anonymous

posted on 19 Aug 2022

I applied via campus placement at Sastra University and was interviewed before Aug 2021. There were 2 interview rounds.

Round 1 - Technical

(1 Question)

Q1. As i was fresher it was like basic C language technical common questions

Add your answer

Round 2 - HR

(1 Question)

Q1. They have asked me about my final year projects and basic questions about organization like who was the founder, CEO etc

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - You can make anything possible with your confidence.

Senior Data Analyst Interview Questions & Answers

HCLTech

Anonymous

posted on 15 Jan 2025

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

4-6 weeks

Result

Selected

I applied via Walk-in and was interviewed in Dec 2024. There were 5 interview rounds.

Round 1 - HR

(1 Question)

Q1. Resume screening Self introduction Roles and responsibilities

Add your answer

Round 2 - Assignment

Given task Statics standard deviations Attrition Average of given table values and Given graph economi graph and poverty graph base on that need to gave answers 30 qustion and 60 min time duration

Round 3 - Technical

(2 Questions)

Q1. Versant test should be need to attend Hirepro will have interview prosess in YouTube will check and prepare mid level hard

Add your answer

Q2. We can goo for online prosess how it is

Add your answer

Round 4 - One-on-one

(1 Question)

Q1. Manger Level round they will ask KPI NPS AHT standard deviations Service Level handling Root cause Analysis Weighted average Same product how will calculate Totall Swer % how will get Many of like bpi <20 ...

Add your answer

Round 5 - HR

(1 Question)

Q1. Salary discussion and update structure of our role and responsibilities

Add your answer

Data Engineer Interview Questions & Answers

Genpact

Sashikanta Parida

posted on 17 Dec 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical

(3 Questions)

Q1. What are different type of joins available in Databricks?

Ans.

Different types of joins available in Databricks include inner join, outer join, left join, right join, and cross join.

Inner join: Returns only the rows that have matching values in both tables.
Outer join: Returns all rows when there is a match in either table.
Left join: Returns all rows from the left table and the matched rows from the right table.
Right join: Returns all rows from the right table and the matched rows ...

Answered by AI

Add your answer

Q2. How do you make your data pipeline fault tolerant?

Ans.

Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.

Use redundant components to ensure continuous data flow
Implement monitoring tools to detect failures and bottlenecks
Set up automated alerts for immediate response to issues
Design error handling mechanisms to gracefully handle failures
Use checkpoints and retries to ensure data integrity

Answered by AI

Add your answer

Q3. What is AutoLoader?

Ans.

AutoLoader is a feature in data engineering that automatically loads data from various sources into a data warehouse or database.

Automates the process of loading data from different sources
Reduces manual effort and human error
Can be scheduled to run at specific intervals
Examples: Apache Nifi, AWS Glue

Answered by AI

Add your answer

Round 2 - Technical

(2 Questions)

Q1. How do you connect to different services in Azure?

Ans.

To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.

Use Azure SDKs for programming languages like Python, Java, C#, etc.
Utilize REST APIs to interact with Azure services programmatically.
Access and manage services through the Azure Portal.
Leverage Azure CLI for command-line interface interactions.
Automate tasks using Azure PowerShell scripts.

Answered by AI

Add your answer

Q2. What are linked Services?

Ans.

Linked Services are connections to external data sources or destinations in Azure Data Factory.

Linked Services define the connection information needed to connect to external data sources or destinations.
They can be used in Data Factory pipelines to read from or write to external systems.
Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.

Answered by AI

Add your answer

Senior Data Engineer Interview Questions & Answers

Persistent Systems

Anonymous

posted on 17 Jul 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via Naukri.com and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Technical

(12 Questions)

Q1. Tell me about yourself and Project

Ans.

I am a Senior Data Engineer with experience in developing data pipelines and optimizing data storage for various projects.

Developed data pipelines using Apache Spark for real-time data processing
Optimized data storage using technologies like Hadoop and AWS S3
Worked on a project to analyze customer behavior and improve marketing strategies

Answered by AI

Add your answer

Q2. What was you day-to-day job in your project

Ans.

My day-to-day job in the project involved designing and implementing data pipelines, optimizing data workflows, and collaborating with cross-functional teams.

Designing and implementing data pipelines to extract, transform, and load data from various sources
Optimizing data workflows to improve efficiency and performance
Collaborating with cross-functional teams including data scientists, analysts, and business stakeholde...

Answered by AI

Add your answer

Q3. Spark Architecture

Add your answer

Q4. How DAG handle Fault tolerance?

Ans.

DAGs handle fault tolerance by rerunning failed tasks and maintaining task dependencies.

DAGs rerun failed tasks automatically to ensure completion.
DAGs maintain task dependencies to ensure proper sequencing.
DAGs can be configured to retry failed tasks a certain number of times before marking them as failed.

Answered by AI

Add your answer

Q5. What is shuffling? How to Handle Shuffling?

Ans.

Shuffling is the process of redistributing data across partitions in a distributed computing environment.

Shuffling is necessary when data needs to be grouped or aggregated across different partitions.
It can be handled efficiently by minimizing the amount of data being shuffled and optimizing the partitioning strategy.
Techniques like partitioning, combiners, and reducers can help reduce the amount of shuffling in MapRed

Answered by AI

Add your answer

Q6. What is the difference between repartition and Coelsce?

Ans.

Repartition increases or decreases the number of partitions in a DataFrame, while Coalesce only decreases the number of partitions.

Repartition can increase or decrease the number of partitions in a DataFrame, leading to a shuffle of data across the cluster.
Coalesce only decreases the number of partitions in a DataFrame without performing a full shuffle, making it more efficient than repartition.
Repartition is typically...

Answered by AI

Add your answer

Q7. How do you handle Incremental data?

Ans.

Incremental data is handled by identifying new data since the last update and merging it with existing data.

Identify new data since last update
Merge new data with existing data
Update data warehouse or database with incremental changes

Answered by AI

Add your answer

Q8. What is SCD ??

Ans.

SCD stands for Slowly Changing Dimension, a concept in data warehousing to track changes in data over time.

SCD is used to maintain historical data in a data warehouse.
There are three types of SCD - Type 1, Type 2, and Type 3.
Type 1 SCD overwrites old data with new data.
Type 2 SCD creates a new record for each change, preserving history.
Type 3 SCD maintains both old and new values in the same record.
SCD is important for...

Answered by AI

Add your answer

Q9. Scenerio based questions related to Spark ?

Add your answer

Q10. Two SQL Codes and Two Python codes like reverse a string ?

Ans.

Reverse a string using SQL and Python codes.

In SQL, use the REVERSE function to reverse a string.
In Python, use slicing with a step of -1 to reverse a string.

Answered by AI

Add your answer

Q11. Find top 5 countries with highest population in Spark and SQL

Ans.

Use Spark and SQL to find the top 5 countries with the highest population.

Use Spark to load the data and perform data processing.
Use SQL queries to group by country and sum the population.
Order the results in descending order and limit to top 5.
Example: SELECT country, SUM(population) AS total_population FROM table_name GROUP BY country ORDER BY total_population DESC LIMIT 5

Answered by AI

Add your answer

Q12. Using two tables find the different records for different joins

Ans.

To find different records for different joins using two tables

Use the SQL query to perform different joins like INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN
Identify the key columns in both tables to join on
Select the columns from both tables and use WHERE clause to filter out the different records

Answered by AI

Add your answer

Round 2 - One-on-one

(7 Questions)

Q1. What is a catalyst optimiser? How it works?

Ans.

A catalyst optimizer is a query optimization tool used in Apache Spark to improve performance by generating an optimal query plan.

Catalyst optimizer is a rule-based query optimization framework in Apache Spark.
It leverages rules to transform the logical query plan into a more optimized physical plan.
The optimizer applies various optimization techniques like predicate pushdown, constant folding, and join reordering.
By o...

Answered by AI

Add your answer

Q2. Tell me about the optimization you used in your project.

Ans.

Used query optimization techniques to improve performance in database queries.

Utilized indexing to speed up search queries.
Implemented query caching to reduce redundant database calls.
Optimized SQL queries by restructuring joins and subqueries.
Utilized database partitioning to improve query performance.
Used query profiling tools to identify and optimize slow queries.

Answered by AI

Add your answer

Q3. Pyspark question related to merging two schemas?

Add your answer

Q4. What is the best approach to finding whether the data frame is empty or not?

Ans.

Use the len() function to check the length of the data frame.

Use len() function to get the number of rows in the data frame.
If the length is 0, then the data frame is empty.
Example: if len(df) == 0: print('Data frame is empty')

Answered by AI

Add your answer

Q5. Spark Architecture

Add your answer

Q6. How do you decide on cores and worker nodes?

Ans.

Cores and worker nodes are decided based on the workload requirements and scalability needs of the data processing system.

Consider the size and complexity of the data being processed
Evaluate the processing speed and memory requirements of the tasks
Take into account the parallelism and concurrency needed for efficient data processing
Monitor the system performance and adjust cores and worker nodes as needed

Answered by AI

Add your answer

Q7. What happens when we enforce schema ?

Ans.

Enforcing schema ensures that data conforms to a predefined structure and rules.

Ensures data integrity by validating incoming data against predefined schema
Helps in maintaining consistency and accuracy of data
Prevents data corruption and errors in data processing
Can lead to rejection of data that does not adhere to the schema

Answered by AI

Add your answer

Interview Preparation Tips

Topics to prepare for Persistent Systems Senior Data Engineer interview:

SQL
Pyspark
Python
Spark
Database

Interview preparation tips for other job seekers - Be prepared with Spark core concepts and SQL Coding

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Capgemini

Brijesh yadav

posted on 9 Jan 2025

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(3 Questions)

Q1. What are the optimization techniques used in Apache Spark?

Add your answer

Q2. 2 SQL queries , 1 PySpark code and 1 Python Code .

Add your answer

Q3. 2-3 Scenario Based questions from ADF and databricks .

Add your answer

Tech Mahindra Interview FAQs

How many rounds are there in Tech Mahindra AWS Data Engineer interview?

Tech Mahindra interview process usually has 1 rounds. The most common rounds in the Tech Mahindra interview process are Technical.

How to prepare for Tech Mahindra AWS Data Engineer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Tech Mahindra. The most common topics and skills that interviewers at Tech Mahindra expect are Python, AWS, SQL, AWS Glue and AWS Lambda.

Tell us how to improve this page.

Tech Mahindra Interviews By Designations

Interview Questions for Popular Designations

Tech Mahindra AWS Data Engineer Interview Process

based on 1 interview

Interview experience

Good

TCS Interview Questions

3.7

• 10.5k Interviews

Accenture Interview Questions

3.8

• 8.2k Interviews

Infosys Interview Questions

3.6

• 7.6k Interviews

Wipro Interview Questions

3.7

• 5.7k Interviews

Cognizant Interview Questions

3.8

• 5.6k Interviews

Capgemini Interview Questions

3.7

• 4.8k Interviews

HCLTech Interview Questions

3.5

• 3.8k Interviews

Genpact Interview Questions

3.8

• 3.1k Interviews

LTIMindtree Interview Questions

3.8

• 3k Interviews

IBM Interview Questions

4.0

• 2.4k Interviews

View all

National Institute of Technology (NIT), Warangal Placement Questions

5 Interviews

Jawaharlal Nehru Technological University (JNTU) Placement Questions

3 Interviews

D Y Patil College of Engineering, Pune Placement Questions

2 Interviews

Sathyabama University Placement Questions

1 Interview

Noida Institute of Engineering & Technology, Greater Noida Placement Questions

1 Interview

Indian Institute of Technology (IIT), Chennai Placement Questions

1 Interview

Jabalpur Engineering College, Jabalpur Placement Questions

1 Interview

View all

Tech Mahindra AWS Data Engineer Salary

based on 13 salaries

₹2.7 L/yr - ₹9.4 L/yr

32% less than the average AWS Data Engineer Salary in India

View more details

Tech Mahindra Salaries in India

Software Engineer 26.4k salaries	₹2 L/yr - ₹9.2 L/yr
Senior Software Engineer 21.4k salaries	₹5.5 L/yr - ₹22.2 L/yr
Technical Lead 11.7k salaries	₹9.5 L/yr - ₹38 L/yr
Associate Software Engineer 5.4k salaries	₹1.8 L/yr - ₹8.2 L/yr
Team Lead 5k salaries	₹5.2 L/yr - ₹17 L/yr