Tech Mahindra

3.5

based on 38.4k Reviews

Tech Mahindra Gcp Data Engineer Interview Questions and Answers

Updated 20 Dec 2024

Tech Mahindra Gcp Data Engineer Interview Experiences

2 interviews found

Gcp Data Engineer Interview Questions & Answers

Anonymous

posted on 19 Dec 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - One-on-one

(7 Questions)

Q1. Explain your project

Ans.

Developed a data pipeline to ingest, process, and analyze customer feedback data for a retail company.

Used Google Cloud Platform services like BigQuery, Dataflow, and Pub/Sub for data processing.
Implemented data cleansing and transformation techniques to ensure data quality.
Created visualizations and dashboards using tools like Data Studio for stakeholders to easily interpret the data.

Answered by AI

Q2. GCP BigQuery architecture

Q3. GCP object versioning

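Q4. GCP storage class types

Ans.

Cloud Storage offers four storage classes that trade storage price against access cost and minimum storage duration.

Standard Storage: for frequently accessed data; no minimum storage duration
Nearline Storage: for data accessed about once a month or less; 30-day minimum storage duration
Coldline Storage: for data accessed about once a quarter or less; 90-day minimum storage duration
Archive Storage: for long-term retention of data accessed less than once a year; 365-day minimum storage duration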

Answered by AI

Q5. SQL optimisation techniques

Ans.

SQL optimization techniques focus on improving query performance by reducing execution time and resource usage.

Use indexes to speed up data retrieval
Avoid using SELECT * and instead specify only the columns needed
Optimize joins by using appropriate join types and conditions
Limit the use of subqueries and instead use JOINs where possible
Use EXPLAIN to analyze query execution plans and identify bottlenecks

Answered by AI
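
The index and EXPLAIN points above can be tried locally. The following is a minimal sketch using Python's built-in sqlite3 module rather than BigQuery; the orders table, its columns, and the index name are hypothetical, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# In-memory database with a hypothetical orders table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(i % 100, i * 1.5) for i in range(1000)],
)

# Select only the column that is needed instead of SELECT *.
query = "SELECT amount FROM orders WHERE customer_id = 42"


def plan(sql):
    """Return the human-readable steps of SQLite's query plan."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]


plan_before = plan(query)  # no index yet: the whole table is scanned
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
plan_after = plan(query)   # the planner can now search via the index

print(plan_before)
print(plan_after)
```

Comparing the two plans shows the scan turning into an index search. BigQuery has no user-defined indexes of this kind, so there the equivalent levers are partitioning, clustering, and column pruning.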

Q6. SQL coding: date format and join related questions

Q7. Partitioning and CLUSTER BY

Round 2 - One-on-one

(2 Questions)

Q1. Advanced SQL (recursive CTE) and Python code using a lambda function to explode a list

Q2. Project explanation, daily activities, and challenges faced in the project

Interview Preparation Tips

Topics to prepare for Tech Mahindra Gcp Data Engineer interview:

SQL
PYSPARK
GCP
PYTHON

Interview preparation tips for other job seekers - Practice SQL and Python coding extensively; they are evaluating our problem-solving approach and logic, as well as the depth of knowledge we possess.

Gcp Data Engineer Interview Questions & Answers

Manisa Sarangi

posted on 8 Jun 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(4 Questions)

Q1. Which of these two is faster: select * from table, or select * from table limit 100?

Ans.

select * from table limit 100 usually returns sooner, but in BigQuery it is not cheaper on a table that is not clustered

'select * from table' returns every row, which is slow to transfer when the table is large
'select * from table limit 100' stops after 100 rows, so less data is returned to the client
In BigQuery, LIMIT does not reduce the bytes scanned or billed on a non-clustered table; selecting only the needed columns and filtering on partition or cluster columns does

Answered by AI

Q2. Explain SCD and MERGE in BigQuery

Ans.

SCD stands for Slowly Changing Dimension and Merge is a SQL operation used to update or insert data in BigQuery.

SCD is used to track changes to data over time in a data warehouse
Merge in BigQuery is used to perform insert, update, or delete operations in a single statement
Example: MERGE INTO target_table USING source_table ON condition WHEN MATCHED THEN UPDATE SET col1 = value1 WHEN NOT MATCHED THEN INSERT (col1, col2)...

Answered by AI
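
To make the SCD idea concrete, here is a minimal sketch of Slowly Changing Dimension Type 2 logic in plain Python. In BigQuery this would normally be expressed with a MERGE statement; the function below only mirrors its matched and not-matched branches. The column names (customer_id, city, valid_from, valid_to, is_current) are assumptions chosen for illustration.

```python
from datetime import date


def apply_scd2(dimension, incoming, today):
    """Apply SCD Type 2 changes: expire changed rows and append new versions.

    dimension: list of dicts (customer_id, city, valid_from, valid_to, is_current)
    incoming:  dict mapping customer_id -> latest city
    """
    current = {row["customer_id"]: row for row in dimension if row["is_current"]}
    for customer_id, city in incoming.items():
        existing = current.get(customer_id)
        if existing is not None and existing["city"] == city:
            continue  # nothing changed, keep the current row as it is
        if existing is not None:
            # Like WHEN MATCHED with a changed value: close the old version.
            existing["valid_to"] = today
            existing["is_current"] = False
        # Like WHEN NOT MATCHED (or after a change): insert a new current version.
        dimension.append({
            "customer_id": customer_id, "city": city,
            "valid_from": today, "valid_to": None, "is_current": True,
        })
    return dimension


dim = [{"customer_id": 1, "city": "Pune", "valid_from": date(2024, 1, 1),
        "valid_to": None, "is_current": True}]
result = apply_scd2(dim, {1: "Noida", 2: "Delhi"}, date(2024, 6, 1))
for row in result:
    print(row)
```

The old row for customer 1 is kept with an end date, so the history of changes stays queryable, which is the point of Type 2.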

Q3. Architecture of BigQuery

Ans.

BigQuery is a fully managed, serverless data warehouse that enables scalable analysis over petabytes of data.

BigQuery separates compute (the Dremel query engine) from storage (Colossus), and stores data in a columnar format for efficient querying.
It supports standard SQL for querying data.
BigQuery allows for real-time data streaming for analysis.
It integrates with various data sources like Google Cloud Storage, Google Sheets, etc.
BigQuery provides automatic scaling and high availability.

Answered by AI

Q4. Dataflow function to split sentence

Ans.

In a Dataflow (Apache Beam) pipeline, a sentence is split into words with an element-wise transform

Use FlatMap, or ParDo with a DoFn, to emit one output element per word in each sentence
Apply further Map or ParDo steps to process each word individually
Use regular expressions to handle punctuation and special characters

Answered by AI
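
A minimal sketch of the splitting step follows, assuming the Apache Beam Python SDK. The names split_words and run_with_beam are hypothetical, and the regular expression is one possible choice for dropping punctuation. The Beam import is done lazily so that the splitting function can be tried without Beam installed.

```python
import re


def split_words(sentence):
    """Split a sentence into lowercase words, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", sentence.lower())


def run_with_beam(sentences):
    """Run the same function inside a Beam pipeline (needs apache-beam installed)."""
    import apache_beam as beam  # assumption: package is available where this runs

    with beam.Pipeline() as pipeline:
        (
            pipeline
            | "Create" >> beam.Create(sentences)
            | "SplitWords" >> beam.FlatMap(split_words)  # one output element per word
            | "Print" >> beam.Map(print)
        )


print(split_words("Hello, Dataflow! It's a test."))
```

FlatMap fits here because each input sentence yields zero or more output words; a plain Map would emit one list per sentence instead.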

Interview questions from similar companies

Gcp Data Engineer Interview Questions & Answers

IBM

Anonymous

posted on 24 Nov 2022

I applied via LinkedIn and was interviewed before Nov 2021. There were 3 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Properly align and format text in your resume. A recruiter will have to spend more time reading poorly aligned text, leading to high chances of rejection.

Round 2 - Technical

(1 Question)

Q1. Asked about the GCP projects we had worked on before

Round 3 - Technical

(1 Question)

Q1. Managerial questions with salary discussion

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident and try to elaborate on your projects. Easy to get into IBM.

Gcp Data Engineer Interview Questions & Answers

Capgemini

Bhau Rakhapasare

posted on 13 Apr 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Technical

(5 Questions)

Q1. What are window functions in BigQuery?

Ans.

Window functions in BigQuery perform calculations across a set of table rows related to the current row, without collapsing those rows into one as GROUP BY does.

They are used with the OVER() clause, which can define PARTITION BY and ORDER BY for the window
Common window functions include ROW_NUMBER(), RANK(), and NTILE()
They can be used to calculate moving averages, cumulative sums, and more

Answered by AI
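
The OVER() clause can be tried without a BigQuery project. This is a minimal sketch using Python's built-in sqlite3 module, assuming the bundled SQLite is version 3.25 or newer (the first release with window functions); the sales table and its rows are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, rep TEXT, amount INTEGER)")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", [
    ("north", "asha", 300), ("north", "ravi", 500),
    ("south", "meena", 200), ("south", "kiran", 400), ("south", "dev", 100),
])

# Rank reps inside each region and keep a running total, without collapsing rows.
rows = conn.execute("""
    SELECT region, rep, amount,
           ROW_NUMBER() OVER (PARTITION BY region ORDER BY amount DESC) AS rank_in_region,
           SUM(amount)  OVER (PARTITION BY region ORDER BY amount DESC) AS running_total
    FROM sales
    ORDER BY region, rank_in_region
""").fetchall()

for row in rows:
    print(row)
```

Every input row is still present in the output, which is the main difference from an aggregate with GROUP BY.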

Q2. What types of NoSQL databases are available in GCP?

Ans.

Types of NoSQL databases in GCP include Firestore, Bigtable, and Datastore.

Firestore is a flexible, scalable database for mobile, web, and server development.
Bigtable is a high-performance NoSQL database service for large analytical and operational workloads.
Datastore is a highly scalable NoSQL document database for web and mobile applications; it is now offered as Firestore in Datastore mode.

Answered by AI

Q3. Write code to find the max number of products by customer

Ans.

Code to find max number of product by customer

Iterate through each customer's purchases
Keep track of the count of each product for each customer
Find the product with the maximum count for each customer

Answered by AI
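
The three steps above can be sketched in a few lines of Python. The purchases list and the function name are hypothetical, and the question is read here as "which product did each customer buy most often".

```python
from collections import Counter, defaultdict

# Hypothetical (customer, product) purchase records.
purchases = [
    ("alice", "pen"), ("alice", "book"), ("alice", "pen"),
    ("bob", "bag"), ("bob", "bag"), ("bob", "pen"),
]


def top_product_per_customer(rows):
    """Return {customer: (product, count)} for each customer's most bought product."""
    counts = defaultdict(Counter)
    for customer, product in rows:
        counts[customer][product] += 1  # count of each product per customer
    return {c: counter.most_common(1)[0] for c, counter in counts.items()}


result = top_product_per_customer(purchases)
print(result)
```

In SQL the same result would typically come from a GROUP BY on customer and product followed by a window function that picks the top row per customer.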

Q4. Read a DataFrame in Python and PySpark

Q5. Create a DataFrame

Ans.

Creating a DataFrame in Python with pandas

Use the pandas library to create a dataframe
Provide data in the form of a dictionary or list of lists
Specify column names if needed

Answered by AI

Gcp Data Engineer Interview Questions & Answers

Cognizant

suresh p

posted on 22 Dec 2024

Interview experience

Bad

Difficulty level

Process Duration

Result

Round 1 - Technical

(6 Questions)

Q1. What are the GCP services used in your project

Ans.

The GCP services used in our project include BigQuery, Dataflow, Pub/Sub, and Cloud Storage.

BigQuery for data warehousing and analytics
Dataflow for real-time data processing
Pub/Sub for messaging and event ingestion
Cloud Storage for storing data and files

Answered by AI

Q2. What is a Cloud Function?

Ans.

Cloud Functions are event-driven functions that run in response to cloud events.

Serverless functions that automatically scale based on demand
Can be triggered by events from various cloud services
Supports multiple programming languages like Node.js, Python, etc.

Answered by AI

Q3. How to schedule a job to trigger every hour in Airflow

Ans.

To schedule a job to trigger every hour in Airflow, give the DAG an hourly cron expression or the '@hourly' preset.

Define a DAG (Directed Acyclic Graph) in Airflow
Set the schedule_interval parameter to '0 * * * *' to trigger the job at minute 0 of every hour
In Airflow 2.4 and later the parameter is named schedule, and schedule_interval is deprecated
Example: schedule_interval='0 * * * *'

Answered by AI

Q4. BigQuery architecture

Q5. How to display a string in reverse using Python

Ans.

Use Python's slicing feature to display a string in reverse order.

Use string slicing with a step of -1 to reverse the string.
Example: 'hello'[::-1] will output 'olleh'.

Answered by AI

Q6. What is Pub/Sub and where is it used in your project?

Ans.

Pub/Sub is a messaging service that allows communication between independent applications.

Pub/Sub is used for real-time messaging and event-driven systems.
It is commonly used for data ingestion, streaming analytics, and event-driven architectures.
Examples of Pub/Sub services include Google Cloud Pub/Sub, Apache Kafka, and Amazon SNS/SQS.

Answered by AI

Gcp Data Engineer Interview Questions & Answers

Capgemini

Anonymous

posted on 1 Jul 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. Questions on BigQuery, SQL, GCP data services which you have worked on

Q2. A small Python coding question and one SQL query

Gcp Data Engineer Interview Questions & Answers

Capgemini

Anonymous

posted on 18 Nov 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via LinkedIn and was interviewed in Oct 2024. There were 2 interview rounds.

Round 1 - Technical

(2 Questions)

Q1. Questions on SQL Joins and Window Functions

Q2. GCP BigQuery and Cloud Storage questions

Round 2 - HR

(2 Questions)

Q1. About overall IT experience

Q2. Project experience and services used

Ans.

I have experience working on projects involving data processing, transformation, and analysis using GCP services like BigQuery, Dataflow, and Dataproc.

Utilized BigQuery for storing and querying large datasets
Implemented data pipelines using Dataflow for real-time data processing
Utilized Dataproc for running Apache Spark and Hadoop clusters for data processing
Worked on data ingestion and transformation using Cloud Stora...

Answered by AI

Gcp Data Engineer Interview Questions & Answers

Capgemini

Kiran Gurbani

posted on 4 Dec 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

Result

No response

I applied via Naukri.com and was interviewed in Jun 2024. There was 1 interview round.

Round 1 - One-on-one

(10 Questions)

Q1. Tools and Technology used in current project

Q2. What is a managed table, an external table, and a materialized view?

Q3. What is Dataflow? How does Dataflow work?

Q4. Clusters in Dataproc: types of clusters, machine type used in the cluster

Q5. Airflow: how to add email alerts to an Airflow job, how to monitor jobs in Airflow, and the PythonOperator in Airflow

Q6. Narrow, wide, and broadcast transformations

Q7. Window functions

Q8. What is a shuffle partition?

Ans.

In Spark, shuffle partitions are the partitions that data is redistributed into when a wide transformation moves records between executors.

A shuffle brings all records with the same key into the same partition, which operations like groupBy and join require.
The number of shuffle partitions for DataFrame operations is set by spark.sql.shuffle.partitions, which defaults to 200.
For example, when joining two large datasets, the shuffle ensures that rows with matching join keys are processed together.
Imprope...

Answered by AI
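
The idea that a shuffle sends every record with the same key to the same partition can be illustrated without Spark. The sketch below is plain Python and is not Spark's implementation: zlib.crc32 stands in for Spark's own hash function, and the shuffle helper and sample records are hypothetical.

```python
import zlib
from collections import defaultdict


def shuffle(records, num_partitions):
    """Assign each (key, value) record to a partition by hashing its key."""
    partitions = defaultdict(list)
    for key, value in records:
        # Same key -> same hash -> same partition index.
        index = zlib.crc32(key.encode("utf-8")) % num_partitions
        partitions[index].append((key, value))
    return partitions


records = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("b", 5)]
parts = shuffle(records, num_partitions=4)

for index, recs in sorted(parts.items()):
    print(index, recs)
```

With too few partitions each one holds a lot of data, and with too many each task does very little work, which is why the partition count is a common tuning setting.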

Q9. How to identify file size in Python

Ans.

You can identify file size in Python using the os module or pathlib for efficient file handling.

Use os.path.getsize() to get the size of a file in bytes. Example: os.path.getsize('file.txt')
Use pathlib.Path.stat() to retrieve file size. Example: from pathlib import Path; Path('file.txt').stat().st_size
File size can also be checked using the built-in open() function with os.fstat(). Example: os.fstat(open('file.txt').fi...

Answered by AI
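
The three approaches named above can be compared side by side. This is a small sketch that writes a temporary file of known length, so the file name and contents are placeholders.

```python
import os
import tempfile
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "file.txt")
    with open(path, "wb") as f:
        f.write(b"hello world")  # 11 bytes

    size_os = os.path.getsize(path)            # os.path helper
    size_pathlib = Path(path).stat().st_size   # pathlib equivalent
    with open(path, "rb") as f:
        size_fstat = os.fstat(f.fileno()).st_size  # from an open file descriptor

print(size_os, size_pathlib, size_fstat)
```

All three report the size in bytes; os.fstat is mainly useful when the file is already open.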

Q10. Which languages can be used in Dataflow

Ans.

Google Cloud Dataflow runs Apache Beam pipelines, which can be written in Java, Python, or Go.

Java: Widely used for building robust data pipelines; example: Apache Beam SDK for Java.
Python: Popular for its simplicity and ease of use; example: Apache Beam SDK for Python.
Go: Supported through the Apache Beam SDK for Go.
The Java and Python SDKs both allow for the creation of batch and streaming data processing applications.

Answered by AI

Gcp Data Engineer Interview Questions & Answers

TCS

Anonymous

posted on 17 Jul 2024

Interview experience

Good

Difficulty level

Easy

Process Duration

2-4 weeks

Result

Selected

I applied via Naukri.com and was interviewed in Jun 2024. There was 1 interview round.

Round 1 - Technical

(3 Questions)

Q1. String is palindrome or not

Ans.

Check if a string is a palindrome or not

Compare the string with its reverse to check for palindrome
Ignore spaces and punctuation marks when comparing
Examples: 'racecar' is a palindrome, 'hello' is not

Answered by AI
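
A minimal sketch of the check described above, ignoring case, spaces, and punctuation; the function name is an assumption.

```python
def is_palindrome(text):
    """Return True if text reads the same both ways, ignoring case and non-alphanumerics."""
    cleaned = [ch.lower() for ch in text if ch.isalnum()]
    return cleaned == cleaned[::-1]


print(is_palindrome("racecar"))
print(is_palindrome("hello"))
print(is_palindrome("A man, a plan, a canal: Panama"))
```

An interviewer may also ask for a two-pointer version that avoids building the reversed copy.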

Q2. Create a GCS bucket using Python

Ans.

Use Python to create a GCS bucket

Import the necessary libraries like google.cloud.storage
Authenticate using service account credentials
Use the library functions to create a new bucket

Answered by AI

Q3. Write Python code to trigger a Dataflow job from a Cloud Function

Ans.

Python code to trigger a dataflow job in cloud function

Use the googleapiclient library to interact with the Dataflow API
Authenticate using service account credentials
Submit a job to Dataflow using the projects.locations.templates.launch endpoint

Answered by AI
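
The steps above could look roughly like the sketch below. It assumes the google-api-python-client package, Application Default Credentials, and a background Cloud Function triggered by a Cloud Storage upload. The project, region, bucket, template path, job name, and the inputFile parameter are all placeholders, and the function names are hypothetical.

```python
def build_launch_body(job_name, parameters, temp_location):
    """Build the request body for projects.locations.templates.launch."""
    return {
        "jobName": job_name,
        "parameters": parameters,
        "environment": {"tempLocation": temp_location},
    }


def launch_template(project, region, template_path, body):
    """Submit the launch request (needs google-api-python-client and credentials)."""
    from googleapiclient.discovery import build  # lazy import: only needed when launching

    dataflow = build("dataflow", "v1b3")
    request = dataflow.projects().locations().templates().launch(
        projectId=project, location=region, gcsPath=template_path, body=body
    )
    return request.execute()


def trigger_dataflow(event, context):
    """Hypothetical entry point: starts a templated job for the uploaded file."""
    body = build_launch_body(
        job_name="example-job",  # placeholder
        parameters={"inputFile": f"gs://{event['bucket']}/{event['name']}"},
        temp_location="gs://example-bucket/tmp",  # placeholder
    )
    return launch_template(
        "example-project", "us-central1",
        "gs://example-bucket/templates/example-template", body,
    )
```

The parameter names a template accepts depend on the template itself, so inputFile should be replaced with whatever the chosen template defines.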

Gcp Data Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 16 Mar 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - One-on-one

(6 Questions)

Q1. SQL: Find keys present in table A but not in B(B is old copy of A)

Ans.

Use SQL to find keys present in table A but not in table B (old copy of A).

Use a LEFT JOIN to combine tables A and B based on the key column
Filter the results where the key column in table B is NULL
This will give you the keys present in table A but not in table B

Answered by AI
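
The LEFT JOIN approach can be checked on a small example. This sketch uses Python's built-in sqlite3 module; the table names a and b follow the question, while the id column and the sample rows are assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE a (id INTEGER PRIMARY KEY);
    CREATE TABLE b (id INTEGER PRIMARY KEY);
    INSERT INTO a VALUES (1), (2), (3), (4);
    INSERT INTO b VALUES (1), (2);  -- b is the older copy of a
""")

# Rows of a with no match in b come back with NULL in b.id.
new_keys = [row[0] for row in conn.execute("""
    SELECT a.id
    FROM a
    LEFT JOIN b ON a.id = b.id
    WHERE b.id IS NULL
    ORDER BY a.id
""")]

print(new_keys)
```

NOT EXISTS or EXCEPT would give the same keys; which one performs best depends on the engine.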

Q2. SQL: 4th highest salary

Ans.

SQL query to retrieve the 4th highest salary from a salary table using various methods.

Use the 'DISTINCT' keyword to avoid duplicate salaries.
Utilize 'ORDER BY' to sort salaries in descending order.
Use 'LIMIT' with 'OFFSET' to skip the first three highest salaries.
Example SQL: 'SELECT DISTINCT salary FROM employees ORDER BY salary DESC LIMIT 1 OFFSET 3;'

Answered by AI
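
The example query above can be run against a small table to see how DISTINCT handles ties. This is a sketch using Python's built-in sqlite3 module with made-up employee rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
conn.executemany("INSERT INTO employees VALUES (?, ?)", [
    ("a", 900), ("b", 900), ("c", 800), ("d", 700), ("e", 600), ("f", 500),
])

# DISTINCT collapses the two 900s, so OFFSET 3 skips 900, 800 and 700.
row = conn.execute(
    "SELECT DISTINCT salary FROM employees ORDER BY salary DESC LIMIT 1 OFFSET 3"
).fetchone()
fourth_highest = row[0] if row else None  # None when fewer than 4 distinct salaries

print(fourth_highest)
```

A DENSE_RANK() window function filtered to rank 4 is a common alternative when the full employee rows are wanted rather than just the salary.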

Q3. Case Study: Using GCP tools, build a pipeline to transfer files from one GCS bucket to another

Ans.

Use GCP Dataflow to transfer files between GCS buckets

Create a Dataflow pipeline using Apache Beam to read from source bucket and write to destination bucket
Use GCS connector to read and write files in Dataflow pipeline
Set up appropriate permissions for Dataflow service account to access both buckets

Answered by AI

Q4. Case Study: How would you explain the flow of a project and ownership of work to a new joiner in IT? (Asked considering my 3 years of experience)

Ans.

Explaining project flow and ownership to a new IT joiner involves outlining roles, responsibilities, and collaboration.

1. Project Initiation: Discuss how projects start with requirements gathering and stakeholder meetings.
2. Role Assignment: Explain how tasks are assigned based on team members' strengths and expertise.
3. Collaboration Tools: Introduce tools like JIRA or Trello for tracking progress and ownership.
4. Reg...

Answered by AI

Q5. Explain your project and why you chose Airflow over other orchestration tools.

Ans.

Implemented a data pipeline using Airflow for ETL processes, enhancing workflow management and scheduling.

Airflow's DAG (Directed Acyclic Graph) structure allows for clear visualization of task dependencies.
It supports dynamic pipeline generation, enabling flexibility in defining workflows based on external parameters.
Airflow has a rich user interface for monitoring and managing workflows, making it easier to troublesh...

Answered by AI

Q6. Discuss other orchestration tools in GCP

Ans.

Cloud Composer is GCP's managed Airflow service; Workflows and Cloud Scheduler are other GCP options for orchestration and scheduling

Cloud Composer is a fully managed workflow orchestration service built on Apache Airflow
It allows you to author, schedule, and monitor workflows that span across GCP services
Cloud Composer provides a rich set of features like DAGs, plugins, and monitoring capabilities
It integrates seamlessly with other GCP services like BigQuery, Dataflow, and Dataproc

Answered by AI

Tech Mahindra Interview FAQs

How many rounds are there in Tech Mahindra Gcp Data Engineer interview?

Tech Mahindra interview process usually has 1-2 rounds. The most common rounds in the Tech Mahindra interview process are One-on-one Round and Technical.

How to prepare for Tech Mahindra Gcp Data Engineer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Tech Mahindra. The most common topics and skills that interviewers at Tech Mahindra expect are GCP, SQL, Python, ETL and Bigquery.

What are the top questions asked in Tech Mahindra Gcp Data Engineer interview?

Some of the top questions asked at the Tech Mahindra Gcp Data Engineer interview -

Which of these 2, select * from table and select * from table limit 100, is faster
SQL optimisation techniques
GCP storage class types

Tech Mahindra Gcp Data Engineer Salary

based on 44 salaries

₹5.4 L/yr - ₹19.9 L/yr

37% more than the average Gcp Data Engineer Salary in India

Tech Mahindra Salaries in India

Software Engineer 26.6k salaries	₹3.7 L/yr - ₹9.2 L/yr
Senior Software Engineer 22.2k salaries	₹9.1 L/yr - ₹18.5 L/yr
Technical Lead 12.5k salaries	₹16.9 L/yr - ₹30 L/yr
Associate Software Engineer 6.1k salaries	₹1.9 L/yr - ₹5.7 L/yr
Team Lead 5.4k salaries	₹5.7 L/yr - ₹17.7 L/yr