Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

VIEW WINNERS
- ABECA 2025
  
  VIEW WINNERS
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
Participate in ABECA 2026

Add office photos

Employer? Claim Account for FREE

Fragma Data Systems

Compare

4.1

based on 120 Reviews

Video summary

Filter interviews by

Fragma Data Systems Interview Questions and Answers

Updated 16 Jan 2025

Popular Designations

15 Interview questions

A Data Engineer was asked 5mo ago

Q. How would you find duplicate rows in a given table?

Ans.

Identify duplicate rows in a table

Use SQL query with GROUP BY and HAVING clause to identify duplicate rows based on specific columns
Example: SELECT column1, column2, COUNT(*) FROM table_name GROUP BY column1, column2 HAVING COUNT(*) > 1

View all Data Engineer interview questions

A Databricks Engineer was asked 6mo ago

Q. Write PySpark code to connect data to ADLS with partitioning.

Ans.

Use PySpark code to connect data to ADLS with partitioning

Use SparkSession to create a Spark application
Set the configuration for ADLS storage account and container
Read data from ADLS using Spark DataFrame API
Partition the data based on a specific column while writing back to ADLS

A Databricks Engineer was asked 6mo ago

Q. Explain Windows functions and how you have used them.

Ans.

Window functions are used to perform calculations across a set of table rows that are related to the current row.

Window functions are used in SQL to perform calculations on a specific subset of rows related to the current row.
They are often used with aggregate functions like SUM, AVG, and COUNT to calculate running totals, moving averages, and rankings.
Examples of window functions include ROW_NUMBER(), RANK(), LEA...

A Data Engineer was asked 9mo ago

Q. What is a DAG?

Ans.

DAG stands for Directed Acyclic Graph, a data structure used to represent dependencies between tasks in a workflow.

DAG is a collection of nodes connected by edges, where each edge has a direction and there are no cycles.
It is commonly used in data engineering for representing data pipelines and workflows.
DAGs help in visualizing and optimizing the order of tasks to be executed in a workflow.
Popular tools like Apac...

View all Data Engineer interview questions

A Data Engineer was asked 9mo ago

Q. In SQL, how would you determine which gender has the highest purchase rate in each state?

Ans.

The answer to the question is that in which state which gender makes the most purchases.

Aggregate the data by state and gender to calculate the total purchases made by each gender in each state.
Identify the gender with the highest total purchases in each state.
Present the results in a table or chart for easy visualization.

View all Data Engineer interview questions

A Data Engineer was asked 9mo ago

Q. There are four cores and four worker nodes in Spark. How many jobs will run in parallel?

Ans.

Only one job will run in parallel in Spark with four cores and four worker nodes.

In Spark, each core can only run one task at a time, so with four cores, only four tasks can run concurrently.
Since there are four worker nodes, each with four cores, a total of four tasks can run in parallel.
Therefore, only one job will run in parallel in this scenario.

View all Data Engineer interview questions

A Data Engineer was asked 9mo ago

Q. What is ADF?

Ans.

ADF stands for Azure Data Factory, a cloud-based data integration service that allows you to create, schedule, and manage data pipelines.

ADF is used for building, scheduling, and monitoring data pipelines to move and transform data from various sources to destinations.
It supports data integration between various data stores such as Azure SQL Database, Azure Blob Storage, and on-premises data sources.
ADF provides a...

View all Data Engineer interview questions

Are these interview questions helpful?

A Data Engineer was asked 9mo ago

Q. Write a SQL query to calculate the difference in marks for each student ID across different years.

Ans.

Use SQL to calculate the difference in marks for each student ID across different years.

Use a self join on the table to compare marks for the same student ID across different years.
Calculate the difference in marks by subtracting the marks from different years.
Group the results by student ID to get the difference in marks for each student.

View all Data Engineer interview questions

A Data Engineer was asked 9mo ago

Q. How does Spark handle fault tolerance?

Ans.

Spark handles fault tolerance through resilient distributed datasets (RDDs) and lineage tracking.

Spark achieves fault tolerance through RDDs, which are immutable distributed collections of objects that can be rebuilt if a partition is lost.
RDDs track the lineage of transformations applied to the data, allowing lost partitions to be recomputed based on the original data and transformations.
Spark also replicates dat...

View all Data Engineer interview questions

A Data Engineer was asked 9mo ago

Q. What is Lineage?

Ans.

Lineage refers to the history and origin of data, including its source, transformations, and dependencies.

Lineage helps in understanding how data is generated, processed, and transformed throughout its lifecycle.
It tracks the flow of data from its source to its destination, including any intermediate steps or transformations.
Lineage is important for data governance, data quality, and troubleshooting data issues.
Ex...

View all Data Engineer interview questions

Fragma Data Systems Interview Experiences

13 interviews found

Interview Questions & Answers

Anonymous

posted on 26 Dec 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Company Website and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - One-on-one

(4 Questions)

Q1. Self introduction

Add your answer

Q2. Windows function and how you used

Ans.

Window functions are used to perform calculations across a set of table rows that are related to the current row.

Window functions are used in SQL to perform calculations on a specific subset of rows related to the current row.
They are often used with aggregate functions like SUM, AVG, and COUNT to calculate running totals, moving averages, and rankings.
Examples of window functions include ROW_NUMBER(), RANK(), LEAD(), ...

Answered by AI

Add your answer

Q3. Sql Coding questions. Window functions with joins.

Add your answer

Q4. Pyspark code for how you connect the data to ADLS with doing partition

Ans.

Use PySpark code to connect data to ADLS with partitioning

Use SparkSession to create a Spark application
Set the configuration for ADLS storage account and container
Read data from ADLS using Spark DataFrame API
Partition the data based on a specific column while writing back to ADLS

Answered by AI

Add your answer

Data Engineer Interview Questions & Answers

Anonymous

posted on 27 Sep 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Naukri.com and was interviewed in Aug 2024. There was 1 interview round.

Round 1 - Technical

(10 Questions)

Q1. Tell me about yourself, and project architecture. What are your day-to-day responsibilities?

Ans.

I am a Data Engineer with experience in designing and implementing project architectures. My day-to-day responsibilities include data processing, ETL tasks, and ensuring data quality.

Designing and implementing project architectures for data processing
Performing ETL tasks to extract, transform, and load data into the system
Ensuring data quality and integrity through data validation and cleansing
Collaborating with cross-...

Answered by AI

Add your answer

Q2. SQL: To calculate the difference in marks for each student ID and marks across different years?

Ans.

Use SQL to calculate the difference in marks for each student ID across different years.

Use a self join on the table to compare marks for the same student ID across different years.
Calculate the difference in marks by subtracting the marks from different years.
Group the results by student ID to get the difference in marks for each student.

Answered by AI

Add your answer

Q3. SQL: Statewise which gender purchase is the most?

Ans.

The answer to the question is that in which state which gender makes the most purchases.

Aggregate the data by state and gender to calculate the total purchases made by each gender in each state.
Identify the gender with the highest total purchases in each state.
Present the results in a table or chart for easy visualization.

Answered by AI

Add your answer

Q4. What is ADF ??

Ans.

ADF stands for Azure Data Factory, a cloud-based data integration service that allows you to create, schedule, and manage data pipelines.

ADF is used for building, scheduling, and monitoring data pipelines to move and transform data from various sources to destinations.
It supports data integration between various data stores such as Azure SQL Database, Azure Blob Storage, and on-premises data sources.
ADF provides a visu...

Answered by AI

Add your answer

Q5. Pysaprk Coding Question?

Add your answer

Q6. What is DAG ??

Ans.

DAG stands for Directed Acyclic Graph, a data structure used to represent dependencies between tasks in a workflow.

DAG is a collection of nodes connected by edges, where each edge has a direction and there are no cycles.
It is commonly used in data engineering for representing data pipelines and workflows.
DAGs help in visualizing and optimizing the order of tasks to be executed in a workflow.
Popular tools like Apache Ai...

Answered by AI

Add your answer

Q7. What is Lineage ??

Ans.

Lineage refers to the history and origin of data, including its source, transformations, and dependencies.

Lineage helps in understanding how data is generated, processed, and transformed throughout its lifecycle.
It tracks the flow of data from its source to its destination, including any intermediate steps or transformations.
Lineage is important for data governance, data quality, and troubleshooting data issues.
Example...

Answered by AI

Add your answer

Q8. How does Spark handle fault tolerance?

Ans.

Spark handles fault tolerance through resilient distributed datasets (RDDs) and lineage tracking.

Spark achieves fault tolerance through RDDs, which are immutable distributed collections of objects that can be rebuilt if a partition is lost.
RDDs track the lineage of transformations applied to the data, allowing lost partitions to be recomputed based on the original data and transformations.
Spark also replicates data par...

Answered by AI

Add your answer

Q9. There are four cores and four worker nodes in Spark. How many jobs will run in parallel?

Ans.

Only one job will run in parallel in Spark with four cores and four worker nodes.

In Spark, each core can only run one task at a time, so with four cores, only four tasks can run concurrently.
Since there are four worker nodes, each with four cores, a total of four tasks can run in parallel.
Therefore, only one job will run in parallel in this scenario.

Answered by AI

Add your answer

Q10. What are the optimisation techniques you have used in your project ?

Ans.

I have used techniques like indexing, query optimization, and parallel processing in my projects.

Indexing: Used to improve the speed of data retrieval by creating indexes on columns frequently used in queries.
Query optimization: Rewriting queries to improve efficiency and reduce execution time.
Parallel processing: Distributing tasks across multiple processors to speed up data processing.
Caching: Storing frequently acce...

Answered by AI

Add your answer

Skills evaluated in this interview

Interview Questions & Answers

Anonymous

posted on 6 Dec 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. Problem solving in js.

Add your answer

Q2. Javascript o/p question

Add your answer

Round 2 - HR

(1 Question)

Q1. General HR discussions

Add your answer

Round 3 - Technical

(1 Question)

Q1. Basic React questions, Javascript questions, one coding exercise

Add your answer

Data Engineer Interview Questions & Answers

Omkar Singh

posted on 16 Jan 2025

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - One-on-one

(1 Question)

Q1. Find duplicate rows in the given table

Ans.

Identify duplicate rows in a table

Use SQL query with GROUP BY and HAVING clause to identify duplicate rows based on specific columns
Example: SELECT column1, column2, COUNT(*) FROM table_name GROUP BY column1, column2 HAVING COUNT(*) > 1

Answered by AI

Add your answer

Technical Lead Interview Questions & Answers

Anonymous

posted on 30 Jul 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. Coding question for count occurence of string

Ans.

Count occurrences of a specific string in an array of strings efficiently.

Use a loop to iterate through the array and a counter to track occurrences.
Example: For array ['apple', 'banana', 'apple'], count 'apple' returns 2.
Consider using a hash map for larger datasets to improve performance.
Example: {'apple': 2, 'banana': 1} for the previous array.

Answered by AI

Add your answer

Q2. Java basic question with exception handling

Add your answer

Round 2 - One-on-one

(2 Questions)

Q1. Microservice question

Add your answer

Q2. Spring batch working flow

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Don't join the organization.. it's very unprofessional

Senior Software Engineer Interview Questions & Answers

Anonymous

posted on 19 Nov 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Technical

(1 Question)

Q1. Javascript based coding question

Add your answer

Data Engineer Interview Questions & Answers

Anonymous

posted on 10 Oct 2023

Interview experience

Good

Difficulty level

Easy

Process Duration

Less than 2 weeks

Result

Selected

I appeared for an interview before Oct 2022.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

View all tips

Round 2 - Coding Test

1.ETL Pipeline
2.PySpark Code
3.SQL

Round 3 - Technical

(2 Questions)

Q1. 1.sql easy question 2.pyspark easy code as I was fesher

Add your answer

Q2. 1.query related to group by

Ans.

Group by is a SQL clause used to aggregate data based on one or more columns.

Used to group rows that have the same values in specified columns.
Commonly used with aggregate functions like COUNT, SUM, AVG.
Example: SELECT department, COUNT(*) FROM employees GROUP BY department;
Can include HAVING clause to filter groups based on aggregate values.
Example: SELECT department, AVG(salary) FROM employees GROUP BY department HAV...

Answered by AI

Add your answer

Round 4 - Technical

(1 Question)

Q1. 1.One tricky sql query 2.What is sqoop?sqoop performs which job - map or reduce?

Add your answer

Informatica Developer Interview Questions & Answers

Anonymous

posted on 1 Dec 2021

I applied via Cutshort.io and was interviewed in Nov 2021. There were 3 interview rounds.

Interview Questionnaire

1 Question

Q1. SCD type 2 mapping, Store procedure and functions, ETL and Query Optimisation, Data warehousing and UNIX

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Be thorough with the Informatica and Data warehousing concepts.

Business Analyst Interview Questions & Answers

Anonymous

posted on 10 Sep 2022

I applied via Internshala and was interviewed before Sep 2021. There were 5 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

View all tips

Round 2 - Assignment

I was given a pdf file containing 3 Problem statements along with the output for which I had to write SQL queries.

Round 3 - Technical

(3 Questions)

Q1. Theoretical Questions related to SQL were asked like types of join, the difference between union and join, related to my project, and some behavioral questions.

Add your answer

Q2. What are different types of Joins?

Ans.

Joins are used to combine data from two or more tables based on a related column between them.

Inner Join: returns only the matching rows from both tables
Left Join: returns all rows from the left table and matching rows from the right table
Right Join: returns all rows from the right table and matching rows from the left table
Full Outer Join: returns all rows from both tables
Cross Join: returns the Cartesian product of b...

Answered by AI

Add your answer

Q3. What is the Difference between Union and Joins

Ans.

Joins combine rows from two or more tables based on a related column, while unions combine rows from two or more tables with the same structure.

Joins are used to combine data from different tables based on a related column
Unions are used to combine data from tables with the same structure
Joins can be inner, left, right, or full, while unions are always a combination of all rows
Joins can have multiple conditions, while ...

Answered by AI

Add your answer

Round 4 - Technical

(1 Question)

Q1. Scenario-based questions on SQL

Add your answer

Round 5 - HR

(1 Question)

Q1. Questions related to me like what my parents do etc. and their company profile.

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - If applying as a fresher, prepare SQL well as most of the questions will be from SQL only.

Skills evaluated in this interview

Software Engineer Interview Questions & Answers

raguvaran ravi

posted on 26 Jul 2021

I appeared for an interview in Jun 2021.

Interview Questionnaire

1 Question

Q1. Basic questions and aptitute

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - It was good, selection is one day process. salary is 3.6 lacks per annum it was very a good experience. interview was knowledgeable and experience. overall it was good but salary could have been better. work location given were too far away form my hometown

Top trending discussions

View All

Interview Tips & Stories

toobluntforu

works at

Cvent

Can speak English, can’t deliver in interviews

I feel like I can't speak fluently during interviews. I do know english well and use it daily to communicate, but the moment I'm in an interview, I just get stuck. since it's not my first language, I struggle to express what I actually feel. I know the answer in my head, but I just can’t deliver it properly at that moment. Please guide me

Got a question about Fragma Data Systems?

Ask anonymously on communities.

Fragma Data Systems Interview FAQs

How many rounds are there in Fragma Data Systems interview?

Fragma Data Systems interview process usually has 2-3 rounds. The most common rounds in the Fragma Data Systems interview process are Technical, One-on-one Round and Resume Shortlist.

How to prepare for Fragma Data Systems interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Fragma Data Systems. The most common topics and skills that interviewers at Fragma Data Systems expect are SQL, Java, Python, Spark and Angularjs.

What are the top questions asked in Fragma Data Systems interview?

Some of the top questions asked at the Fragma Data Systems interview -

There are four cores and four worker nodes in Spark. How many jobs will run in ...read more
What are the optimisation techniques you have used in your projec...read more
Pyspark code for how you connect the data to ADLS with doing partit...read more

Tell us how to improve this page.

Fragma Data Systems Interviews By Designations

Interview Questions for Popular Designations

4/5

based on 6 interview experiences

Difficulty level

Easy 50%

Moderate 50%

Duration

Less than 2 weeks 100%

Techsteck Solutions Interview Questions

4.5

• 12 Interviews

Allied Globetech Interview Questions

4.3

• 11 Interviews

WarpDrive Tech Works Interview Questions

3.8

• 9 Interviews

Rite Software Solutions And Services Interview Questions

3.4

• 7 Interviews

Buoyancy Consultants Interview Questions

4.1

• 6 Interviews

Pragmasys Consulting LLP Interview Questions

3.9

• 6 Interviews

DigiMantra Interview Questions

3.7

• 6 Interviews

Pythian Interview Questions

4.1

• 6 Interviews

Vowelweb Interview Questions

4.3

• 5 Interviews

Quokka Labs Interview Questions

3.8

• 5 Interviews

View all

Jobs at Fragma Data Systems

Senior Java Developer

Bangalore / Bengaluru

5-10 Yrs

Not Disclosed

Fragmadata - Solution Architect - Cloud Infrastructure (15-20 yrs)

15-20 Yrs

Not Disclosed

Fragmadata - Full Stack Developer (5-10 yrs)

5-10 Yrs

Not Disclosed

Explore more jobs

Fragma Data Systems Salaries in India

Data Engineer 53 salaries	₹5.8 L/yr - ₹18 L/yr
Software Engineer 29 salaries	₹4.7 L/yr - ₹24 L/yr
Business Analyst 24 salaries	₹3.5 L/yr - ₹6 L/yr
Senior Software Engineer 22 salaries	₹10 L/yr - ₹36 L/yr
Data Analyst 13 salaries	₹3.5 L/yr - ₹6.4 L/yr