
Persistent Systems Senior Data Engineer Interview Questions and Answers

Updated 27 Sep 2024

13 Interview questions

A Senior Data Engineer was asked 8mo ago

Q. What is the difference between repartition and coalesce?

Ans.

Repartition increases or decreases the number of partitions in a DataFrame, while Coalesce only decreases the number of partitions.

Repartition can increase or decrease the number of partitions in a DataFrame, leading to a shuffle of data across the cluster.
Coalesce only decreases the number of partitions in a DataFrame without performing a full shuffle, making it more efficient than repartition.
Repartition is typi...

A Senior Data Engineer was asked 8mo ago

Q. How does DAG handle fault tolerance?

Ans.

A DAG (directed acyclic graph) supports fault tolerance by recording task dependencies, so failed work can be rerun without restarting the whole job.

Failed tasks are retried automatically, up to a configurable number of attempts, before the job is marked as failed.
The recorded dependencies ensure that retried tasks run in the correct order.
In Spark specifically, the DAG captures each RDD's lineage, so a lost partition can be recomputed from its parent data.

A Senior Data Engineer was asked 8mo ago

Q. Find the top 5 countries with the highest population using Spark and SQL.

Ans.

Aggregate population by country, sort in descending order, and keep the first five rows.

Use Spark to load the data into a DataFrame and register it as a temporary view.
Use a SQL query to group by country and sum the population.
Order the results by total population in descending order and limit the output to 5 rows.
Example: SELECT country, SUM(population) AS total_population FROM table_name GROUP BY country ORDER BY total_population DESC LIMIT 5
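
The query above can be tried without a cluster. The following is a minimal sketch that uses Python's built-in sqlite3 module as a stand-in for Spark SQL; the sample rows and the `region` column are made up for illustration, and `table_name` is simply the placeholder from the example. In Spark, the same query string could be passed to `spark.sql()` after registering the DataFrame as a temporary view.

```python
import sqlite3

# Hypothetical sample data: (country, region, population in millions).
ROWS = [
    ("India", "North", 800), ("India", "South", 620),
    ("China", "East", 900), ("China", "West", 500),
    ("USA", "All", 330), ("Indonesia", "All", 270),
    ("Pakistan", "All", 230), ("Nigeria", "All", 220),
    ("Brazil", "All", 215),
]

QUERY = """
SELECT country, SUM(population) AS total_population
FROM table_name
GROUP BY country
ORDER BY total_population DESC
LIMIT 5
"""


def top_five(rows):
    """Load rows into an in-memory table and return the five largest totals."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE table_name (country TEXT, region TEXT, population INTEGER)"
    )
    conn.executemany("INSERT INTO table_name VALUES (?, ?, ?)", rows)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for country, total in top_five(ROWS):
        print(country, total)
```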

A Senior Data Engineer was asked 8mo ago

Q. How do you decide on the number of cores and worker nodes?

Ans.

Cores and worker nodes are decided based on the workload requirements and scalability needs of the data processing system.

Consider the size and complexity of the data being processed
Evaluate the processing speed and memory requirements of the tasks
Take into account the parallelism and concurrency needed for efficient data processing
Monitor the system performance and adjust cores and worker nodes as needed
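
One way to turn these considerations into numbers is a commonly quoted rule of thumb: reserve one core and 1 GB per node for the operating system, use about five cores per executor, and keep roughly 10% of executor memory for overhead. The sketch below only encodes that heuristic; the function name, the defaults, and the example cluster are assumptions for illustration, not values from the interview, and real workloads should be tuned by monitoring.

```python
def size_executors(nodes, cores_per_node, mem_gb_per_node,
                   cores_per_executor=5, overhead_fraction=0.10):
    """Estimate executor count and memory from cluster hardware (heuristic only)."""
    usable_cores = cores_per_node - 1      # leave 1 core per node for the OS and daemons
    usable_mem_gb = mem_gb_per_node - 1    # leave 1 GB per node for the same reason
    executors_per_node = max(1, usable_cores // cores_per_executor)
    # Keep one executor slot free for the driver / application master.
    total_executors = max(1, executors_per_node * nodes - 1)
    mem_per_executor_gb = usable_mem_gb / executors_per_node
    # The part not given to the heap is left for memory overhead.
    heap_gb = int(mem_per_executor_gb * (1 - overhead_fraction))
    return {
        "executors_per_node": executors_per_node,
        "total_executors": total_executors,
        "executor_cores": cores_per_executor,
        "executor_memory_gb": heap_gb,
    }


if __name__ == "__main__":
    # Hypothetical cluster: 10 worker nodes, 16 cores and 64 GB each.
    print(size_executors(nodes=10, cores_per_node=16, mem_gb_per_node=64))
```

The resulting values correspond to the `--num-executors`, `--executor-cores`, and `--executor-memory` options of `spark-submit`.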

A Senior Data Engineer was asked 8mo ago

Q. What happens when we enforce schema?

Ans.

Enforcing schema ensures that data conforms to a predefined structure and rules.

Ensures data integrity by validating incoming data against predefined schema
Helps in maintaining consistency and accuracy of data
Prevents data corruption and errors in data processing
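Can lead to rejection of data that does not adhere to the schema; in Spark, for example, the read mode for CSV and JSON sources (PERMISSIVE, DROPMALFORMED, or FAILFAST) controls whether malformed records are kept with nulls, dropped, or cause the read to fail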

A Senior Data Engineer was asked 8mo ago

Q. Using two tables, how would you identify the different records resulting from different types of joins?

Ans.

Run each join type on the two tables and compare the rows each one returns.

Use SQL to perform the different joins: INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN
Identify the key columns in both tables to join on
To isolate records that exist in only one table, use an outer join and a WHERE clause that keeps rows where the other table's key IS NULL
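
The comparison can be sketched on two tiny tables. This example uses Python's built-in sqlite3 module so that it runs anywhere; the tables `a` and `b` and their rows are hypothetical. RIGHT JOIN and FULL JOIN are left out because older SQLite versions do not support them, so the "only in b" case is written as a LEFT JOIN with the tables swapped.

```python
import sqlite3


def join_results():
    """Run several joins on two small tables and return the rows of each."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE a (id INTEGER, name TEXT);
        CREATE TABLE b (id INTEGER, city TEXT);
        INSERT INTO a VALUES (1, 'Asha'), (2, 'Ben'), (3, 'Chen');
        INSERT INTO b VALUES (2, 'Pune'), (3, 'Delhi'), (4, 'Goa');
    """)
    queries = {
        # Rows whose key exists in both tables.
        "inner": "SELECT a.id FROM a INNER JOIN b ON a.id = b.id ORDER BY a.id",
        # Every row of a, with NULL where b has no match.
        "left": "SELECT a.id, b.city FROM a LEFT JOIN b ON a.id = b.id ORDER BY a.id",
        # Anti-join pattern: rows of a with no partner in b.
        "only_in_a": "SELECT a.id FROM a LEFT JOIN b ON a.id = b.id WHERE b.id IS NULL",
        # Same pattern with the tables swapped: rows of b with no partner in a.
        "only_in_b": "SELECT b.id FROM b LEFT JOIN a ON a.id = b.id WHERE a.id IS NULL",
    }
    out = {name: conn.execute(sql).fetchall() for name, sql in queries.items()}
    conn.close()
    return out


if __name__ == "__main__":
    for name, rows in join_results().items():
        print(name, rows)
```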

A Senior Data Engineer was asked 8mo ago

Q. What is the best approach to determine if a data frame is empty?

Ans.

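Fetch at most one row and check whether anything comes back, rather than counting the whole data frame.

In PySpark, len(df) raises a TypeError, so use df.isEmpty() (available from Spark 3.3) or check len(df.head(1)) == 0.
Avoid df.count() == 0 on large data, because it counts every row just to answer a yes/no question.
In pandas, len(df) == 0 works, and df.empty is the idiomatic check.
Example (pandas): if df.empty: print('Data frame is empty')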
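
A check that fetches at most one row works for both PySpark and pandas, because both expose `head(n)` and both results support `len()`. The helper below is a minimal sketch of that idea; `FakeFrame` is a hypothetical stand-in that mimics only `head(n)` so the example runs without either library installed.

```python
def is_empty(df):
    """Return True when df has no rows, fetching at most one row to decide."""
    return len(df.head(1)) == 0


class FakeFrame:
    """Hypothetical stand-in that mimics only the head(n) method used above."""

    def __init__(self, rows):
        self._rows = list(rows)

    def head(self, n):
        # Return up to n rows, like a data frame's head(n).
        return self._rows[:n]


if __name__ == "__main__":
    print(is_empty(FakeFrame([])))
    print(is_empty(FakeFrame([("India", 1420)])))
```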


A Senior Data Engineer was asked 8mo ago

Q. What is SCD?

Ans.

SCD stands for Slowly Changing Dimension, a concept in data warehousing to track changes in data over time.

SCD is used to maintain historical data in a data warehouse.
The three most commonly used types are Type 1, Type 2, and Type 3.
Type 1 SCD overwrites old data with new data, so no history is kept.
Type 2 SCD creates a new record for each change, preserving full history.
Type 3 SCD keeps the previous value in an extra column alongside the current value, in the same record.
SCD is importan...
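
Type 2 is the variant most often asked about in detail, so here is a minimal sketch in plain Python. The row layout (`key`, `attrs`, `start_date`, `end_date`, `is_current`) and the function name are assumptions made for illustration; a real warehouse would typically implement the same steps with a MERGE statement.

```python
from datetime import date


def apply_scd2(dimension, key, new_attrs, effective):
    """Close the current row for key if its attributes changed, then append a new version."""
    current = next(
        (row for row in dimension if row["key"] == key and row["is_current"]),
        None,
    )
    if current is not None:
        if current["attrs"] == new_attrs:
            return dimension  # nothing changed, keep history as-is
        # Expire the old version instead of overwriting it.
        current["end_date"] = effective
        current["is_current"] = False
    dimension.append({
        "key": key,
        "attrs": dict(new_attrs),
        "start_date": effective,
        "end_date": None,
        "is_current": True,
    })
    return dimension


if __name__ == "__main__":
    dim = []
    apply_scd2(dim, 1, {"city": "Pune"}, date(2024, 1, 1))
    apply_scd2(dim, 1, {"city": "Mumbai"}, date(2024, 6, 1))
    for row in dim:
        print(row)
```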

A Senior Data Engineer was asked 8mo ago

Q. How do you merge two schemas in PySpark?

Ans.

Merging two schemas in PySpark involves combining DataFrames with different structures into a unified format.

Use the `unionByName()` method to combine DataFrames by column name rather than by column position.
Example: df1.unionByName(df2, allowMissingColumns=True) merges df1 and df2, filling missing columns with nulls (the allowMissingColumns parameter requires Spark 3.1 or later).
For schema evolution, use `mergeSchema` option when reading from Parquet files.
Example: spark.read.option('mergeSchema...

A Senior Data Engineer was asked 8mo ago

Q. Two SQL coding problems and two Python coding problems, such as reversing a string.

Ans.

Reverse a string using SQL and Python.

In SQL, use the REVERSE function, which is available in dialects such as SQL Server, MySQL, PostgreSQL, and Spark SQL.
In Python, use slicing with a step of -1 to reverse a string.
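
The Python half of the answer fits in one line. A minimal sketch, with a hypothetical function name:

```python
def reverse_string(text):
    """Return text reversed, using a slice with a step of -1."""
    return text[::-1]


if __name__ == "__main__":
    print(reverse_string("spark"))
```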

Persistent Systems Senior Data Engineer Interview Experiences

2 interviews found

Senior Data Engineer Interview Questions & Answers

Anonymous

posted on 17 Jul 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via Naukri.com and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Technical

(12 Questions)

Q1. Tell me about yourself and Project

Ans.

I am a Senior Data Engineer with experience in developing data pipelines and optimizing data storage for various projects.

Developed data pipelines using Apache Spark for real-time data processing
Optimized data storage using technologies like Hadoop and AWS S3
Worked on a project to analyze customer behavior and improve marketing strategies

Answered by AI

Q2. What was your day-to-day job in your project?

Ans.

My day-to-day job in the project involved designing and implementing data pipelines, optimizing data workflows, and collaborating with cross-functional teams.

Designing and implementing data pipelines to extract, transform, and load data from various sources
Optimizing data workflows to improve efficiency and performance
Collaborating with cross-functional teams including data scientists, analysts, and business stakeholde...

Answered by AI

Q3. Spark Architecture

Q4. How does a DAG handle fault tolerance?


Q5. What is shuffling? How to Handle Shuffling?

Ans.

Shuffling is the process of redistributing data across partitions in a distributed computing environment.

Shuffling is necessary when data needs to be grouped or aggregated across different partitions.
It can be handled efficiently by minimizing the amount of data being shuffled and optimizing the partitioning strategy.
Techniques like partitioning, combiners, and reducers can help reduce the amount of shuffling in MapRed...

Answered by AI


Q6. What is the difference between repartition and coalesce?


Q7. How do you handle Incremental data?

Ans.

Incremental data is handled by identifying new data since the last update and merging it with existing data.

Identify new data since last update
Merge new data with existing data
Update data warehouse or database with incremental changes
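
The three steps above can be sketched with a watermark: remember the latest change timestamp already loaded, pick up only rows changed after it, and upsert them by key. The field names (`id`, `updated_at`) and the in-memory dictionary standing in for the target table are assumptions for illustration.

```python
from datetime import datetime


def incremental_merge(target, source_rows, last_watermark):
    """Upsert rows changed after last_watermark into target (a dict keyed by id).

    Returns the new watermark to store for the next run.
    """
    new_watermark = last_watermark
    for row in source_rows:
        if row["updated_at"] > last_watermark:
            target[row["id"]] = row  # insert new keys, overwrite changed ones
            new_watermark = max(new_watermark, row["updated_at"])
    return new_watermark


if __name__ == "__main__":
    target = {1: {"id": 1, "value": "a", "updated_at": datetime(2024, 1, 1)}}
    source = [
        {"id": 1, "value": "a2", "updated_at": datetime(2024, 2, 1)},
        {"id": 2, "value": "b", "updated_at": datetime(2024, 1, 15)},
        {"id": 3, "value": "old", "updated_at": datetime(2023, 12, 1)},
    ]
    print(incremental_merge(target, source, datetime(2024, 1, 1)))
    print(sorted(target))
```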

Answered by AI


Q8. What is SCD?


Q9. Scenario-based questions related to Spark

Q10. Two SQL coding problems and two Python coding problems, such as reversing a string.


Q11. Find top 5 countries with highest population in Spark and SQL


Q12. Using two tables find the different records for different joins


Round 2 - One-on-one

(7 Questions)

Q1. What is the Catalyst optimizer? How does it work?

Ans.

A catalyst optimizer is a query optimization tool used in Apache Spark to improve performance by generating an optimal query plan.

Catalyst is Spark SQL's extensible query optimization framework; it applies rule-based optimizations and, when table statistics are available, cost-based ones.
It leverages rules to transform the logical query plan into a more optimized physical plan.
The optimizer applies various optimization techniques like predicate pushdown, constant folding, and join reordering.
By o...

Answered by AI


Q2. Tell me about the optimization you used in your project.

Ans.

Used query optimization techniques to improve performance in database queries.

Utilized indexing to speed up search queries.
Implemented query caching to reduce redundant database calls.
Optimized SQL queries by restructuring joins and subqueries.
Utilized database partitioning to improve query performance.
Used query profiling tools to identify and optimize slow queries.

Answered by AI


Q3. Pyspark question related to merging two schemas?


Q4. What is the best approach to finding whether the data frame is empty or not?


Q5. Spark Architecture


Q6. How do you decide on cores and worker nodes?


Q7. What happens when we enforce schema?


Interview Preparation Tips

Topics to prepare for Persistent Systems Senior Data Engineer interview:

SQL
PySpark
Python
Spark
Database

Interview preparation tips for other job seekers - Be prepared with Spark core concepts and SQL Coding


Senior Data Engineer Interview Questions & Answers

kajal bomble

posted on 10 Jun 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via Naukri.com and was interviewed before Jun 2023. There were 3 interview rounds.

Round 1 - One-on-one

(2 Questions)

Q1. General questions

Q2. Questions about experience

Round 2 - Group Discussion

Just reasoning-type questions.

Round 3 - Technical

(2 Questions)

Q1. What is SSIS? How do we use it?

Ans.

SSIS stands for SQL Server Integration Services, a tool provided by Microsoft for data integration and workflow applications.

SSIS is a platform for building high-performance data integration and workflow solutions.
It allows you to create packages that move data from various sources to destinations.
SSIS includes a visual design interface for creating, monitoring, and managing data integration processes.
You can use SSIS ...

Answered by AI


Q2. When do we use SSIS packages? What is the difference between Union and Merge?

Ans.

SSIS packages are used for ETL processes in SQL Server. Union All and Merge both combine datasets vertically; Merge additionally requires sorted inputs.

SSIS packages are used for Extract, Transform, Load (ETL) processes in SQL Server.
Both Union All and Merge in SSIS combine datasets vertically, stacking rows on top of each other.
Merge accepts exactly two inputs, which must be sorted, and produces a sorted output; Merge Join is the transformation that combines datasets horizontally, matching rows based on specified columns.
Union All in SSIS combines datasets vertica...

Answered by AI


Interview questions from similar companies

Senior Software Engineer Interview Questions & Answers

Optum Global Solutions

Anonymous

posted on 26 Feb 2021

I applied via Company Website and was interviewed before Feb 2020. There were 4 interview rounds.

Interview Questionnaire

4 Questions

Q1. .NET support-related questions, for example: 1. What to do when the application is down. 2. How to check IIS error logs.

Q2. Explain the projects you worked on and your role in them.

Q3. Explain a scenario in which you handled high pressure from a client.

Ans.

Handled high pressure from client by prioritizing tasks and communicating effectively.

Identified critical issues and addressed them first
Communicated regularly with the client to provide updates and manage expectations
Collaborated with team members to delegate tasks and ensure timely delivery
Maintained a calm and professional demeanor to avoid escalating the situation

Answered by AI


Q4. Explain release management.

Ans.

Release management is the process of planning, scheduling, coordinating, and deploying software releases.

It involves identifying the scope of the release and the features to be included
Creating a release plan and schedule
Coordinating with different teams involved in the release process
Testing the release to ensure it meets quality standards
Deploying the release to production
Monitoring the release to ensure it is stable...

Answered by AI


Interview Preparation Tips

Interview preparation tips for other job seekers - For .NET support projects, first identify the problem and then think through the best-optimized solution for it. You need to know how to check error logs and should be well versed in basic SQL queries and debugging.

Senior Software Engineer Interview Questions & Answers

Hexaware Technologies

Anonymous

posted on 29 Jan 2021

I applied via Naukri.com and was interviewed in Jul 2020. There were 4 interview rounds.

Interview Questionnaire

1 Question

Q1. Java 8, J2EE, Spring, SQL


Interview Preparation Tips

Interview preparation tips for other job seekers - There were four rounds. Technical written, two technical f2f rounds, coding test.

All the best

Senior Software Engineer Interview Questions & Answers

LTIMindtree

Aradhana Singh

posted on 23 Jan 2022

I applied via Naukri.com and was interviewed in Jul 2021. There were 3 interview rounds.

Round 1 - Technical

(1 Question)

Q1. The first round was technical: filters in MVC, sealed classes, OOP concepts, extension methods, how to write custom filters in MVC, and tools for unit testing.

Round 2 - cultural

(1 Question)

Q1. The second round was a cultural round, with a mix of technical and managerial questions.

Round 3 - HR

(1 Question)

Q1. The final round was salary discussion and negotiation.

Interview Preparation Tips

Interview preparation tips for other job seekers - Learn OOP concepts, C#, and MVC; give to-the-point answers and explain each concept clearly.

Senior Software Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 28 Jan 2022

I applied via Naukri.com and was interviewed before Jan 2021. There were 3 interview rounds.

Interview Questionnaire

2 Questions

Q1. Core Java: collections, multithreading, string operations.

Q2. Spring frameworks: Boot, microservices, configuration files

Interview Preparation Tips

Interview preparation tips for other job seekers - Back end: prepare Java 8 features, collections, strings, and exception handling.
Spring framework, JPA, SQL

Senior Software Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 30 Dec 2021

I applied via Referral and was interviewed in Jun 2021. There were 3 interview rounds.

Interview Questionnaire

1 Question

Q1. Job profile related technical questions


Interview Preparation Tips

Interview preparation tips for other job seekers - If you are prepared well you will qualify for the interview...


Software Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 27 Nov 2021

I applied via Internshala and was interviewed in May 2021. There were 3 interview rounds.

Interview Questionnaire

2 Questions

Q1. They asked questions related to my resume

Ans.

Discussing my resume highlights my skills, experiences, and projects relevant to the software engineering role.

Experience with Java and Python in developing web applications.
Led a team project that improved application performance by 30%.
Contributed to open-source projects, enhancing my coding skills and collaboration.
Completed an internship at XYZ Corp, where I developed a feature that increased user engagement.

Answered by AI

Q2. What made you choose this company?

Interview Preparation Tips

Interview preparation tips for other job seekers - I suggest always being prepared to discuss what you have written in your resume, because there were no questions beyond the basics.

Software Engineer Interview Questions & Answers

LTIMindtree

Aadhish Arab

posted on 10 Aug 2022

I applied via Campus Placement and was interviewed before Aug 2021. There were 3 interview rounds.

Round 1 - Aptitude Test

The first round was an aptitude test with questions ranging from basic mathematical concepts to logical/analytical questions. English was also included in the test. The difficulty was medium and I was able to solve 70-80% of the questions.

Round 2 - Coding Test

Two coding questions were the part of the test. I was supposed to solve and pass all the test cases for both the questions. The coding questions tested my knowledge in the field of arrays, loops and pointers. I was able to solve one and partially solve another.

Round 3 - Technical

(1 Question)

Q1. The interview was a mixture of both HR as well as Technical. I was asked decent questions and tested my knowledge in the fundamentals of programming.


Interview Preparation Tips

Topics to prepare for LTIMindtree Software Engineer interview:

Python
Java
Full Stack

Interview preparation tips for other job seekers - Make sure that you are good at the fundamentals of programming (if you are looking for an IT job), or just be good at one particular thing that suits your current needs and also helps your career grow in the future. Do not be disheartened if things do not go your way; there is always hope, and a lot of companies would like to hire an intelligent candidate like you.
Cheers :)

Software Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 15 Sep 2022

I applied via Campus Placement and was interviewed before Sep 2021. There were 4 interview rounds.

Round 1 - Aptitude Test

Prepare the usual aptitude topics: maths, quantitative, and analytical reasoning.

Round 2 - Group Discussion

My GD topic was "Is the internet good for students or not?"

Round 3 - Coding Test

I didn't attempt this, as I was a beginner back in my third year of engineering.

Round 4 - HR

(1 Question)

Q1. Basic questions: can you relocate, tell me about your GD and composition

Interview Preparation Tips

Interview preparation tips for other job seekers - It's easy; give your best and prepare well. Hope you get the job. After that, Google remained just a dream for me.

Persistent Systems Interview FAQs

How many rounds are there in Persistent Systems Senior Data Engineer interview?

Persistent Systems interview process usually has 2-3 rounds. The most common rounds in the Persistent Systems interview process are One-on-one Round, Technical and Group Discussion.

How to prepare for Persistent Systems Senior Data Engineer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Persistent Systems. The most common topics and skills that interviewers at Persistent Systems expect are Python, Java, Kafka, Spark and HBase.

What are the top questions asked in Persistent Systems Senior Data Engineer interview?

Some of the top questions asked at the Persistent Systems Senior Data Engineer interview -

What is the best approach to finding whether the data frame is empty or not?
What is the difference between repartition and coalesce?
Two SQL codes and two Python codes, like reversing a string


4/5

based on 2 interview experiences

Difficulty level

Moderate 100%

Duration

Less than 2 weeks 100%


Persistent Systems Senior Data Engineer Salary

based on 39 salaries

₹6.6 L/yr - ₹28.6 L/yr

21% less than the average Senior Data Engineer Salary in India


Persistent Systems Salaries in India

Software Engineer 4.6k salaries	₹4.7 L/yr - ₹11.1 L/yr
Senior Software Engineer 4.6k salaries	₹6.8 L/yr - ₹18.7 L/yr
Lead Software Engineer 3.6k salaries	₹8.4 L/yr - ₹17.4 L/yr
Lead Engineer 3.5k salaries	₹13.7 L/yr - ₹25 L/yr
Project Lead 2.1k salaries	₹21.2 L/yr - ₹36 L/yr