Gyan Asthali Residential School

Gyan Asthali Residential School Data Engineer 1 Interview Questions and Answers

Be the first one to contribute and help others!

Interview questions from similar companies

Data Engineer 1 Interview Questions & Answers

Accenture

Anonymous

posted on 9 Aug 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Naukri.com and was interviewed in Jul 2024. There were 2 interview rounds.

Round 1 - Technical

(2 Questions)

Q1. How many types of integration runtime are there?

Ans.

There are three types of integration runtime: Azure, Self-hosted, and Azure-SSIS.

Self-hosted integration runtime is installed on a local machine or a virtual machine within an on-premises network.
Azure integration runtime is managed by Azure Data Factory and runs in the Azure cloud.
Azure-SSIS integration runtime is used to run SQL Server Integration Services packages in Azure Data Factory.

Answered by AI

Q2. How many types of triggers are there in ADF?

Ans.

There are three types of triggers in Azure Data Factory: schedule triggers, tumbling window triggers, and event-based triggers.

Schedule triggers run pipelines on a wall-clock schedule, such as at specific times or recurring intervals.
Tumbling window triggers fire over a series of fixed-size, non-overlapping, contiguous time intervals.
Event-based triggers respond to events such as a blob being created or deleted in a storage account, or to custom events.
Triggers can be used to automate the execution of pipelines in Azure Data Factory.

Answered by AI

Round 2 - HR

(2 Questions)

Q1. About my experience of leading a team in my previous company

Q2. Why did I choose Accenture?

Ans.

I chose Accenture for its reputation, global presence, and opportunities for growth.

Accenture is a renowned company known for its innovative solutions and cutting-edge technology.
The global presence of Accenture provides opportunities to work on diverse projects and collaborate with experts from around the world.
Accenture offers ample opportunities for career growth and development through training programs and mentors...

Answered by AI

Data Engineer 1 Interview Questions & Answers

Tech Mahindra

Anonymous

posted on 19 Jul 2024

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(1 Question)

Q1. Difference between Delete and Truncate

Ans.

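Delete removes specific rows from a table (or all rows if no WHERE clause is given), while Truncate removes all rows from a table.

Delete is a DML operation, while Truncate is a DDL operation.
Delete can be rolled back within a transaction; whether Truncate can be rolled back depends on the database (for example, it can in SQL Server and PostgreSQL, but not in Oracle or MySQL, where it commits implicitly).
Delete is usually slower than Truncate because it removes and logs rows one at a time.
Delete fires row-level DELETE triggers, while Truncate generally does not; neither operation drops the table's indexes or constraints.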

Answered by AI

Data Engineer 1 Interview Questions & Answers

IBM

Anonymous

posted on 3 Apr 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via LinkedIn and was interviewed before Apr 2023. There were 2 interview rounds.

Round 1 - Aptitude Test

MCQ-based technical questions covering most of the basic Ab Initio components.

Round 2 - Technical

(1 Question)

Q1. Technical questions on SCD types and on Rollup, Normalize, and Join scenarios.

Interview Preparation Tips

Interview preparation tips for other job seekers - Go through run time behaviour of components

Data Engineer 1 Interview Questions & Answers

Cognizant

Anonymous

posted on 1 Aug 2023

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Selected

I applied via Campus Placement and was interviewed before Aug 2022. There were 5 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

Round 2 - Aptitude Test

Aptitude questions related to basic quantitative aptitude, pseudocode snippets, and computer fundamentals questions related to operating systems.

Round 3 - Coding Test

There were 3 coding questions:
1. Easy (related to arrays)
2. Medium (string-related questions)
3. Medium (stack-related questions)

Round 4 - One-on-one

(1 Question)

Q1. 1. Introduction 2. Discussion about my projects 3. Questions on some terms which were mentioned in my project 4. One easy-level coding question 5. Two SQL questions related to join logic 6. Some questions relat...

Round 5 - HR

(1 Question)

Q1. Mainly for Document Verification

Interview Preparation Tips

Interview preparation tips for other job seekers - Focus more on the basics, because there is very little chance that the interviewer will ask very advanced questions.

Data Engineer 1 Interview Questions & Answers

TCS

Anonymous

posted on 22 Jan 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

Selected

I applied via Campus Placement and was interviewed before Jan 2023. There was 1 interview round.

Round 1 - HR

(2 Questions)

Q1. Are you ready to relocate?

Q2. What's your salary expectation?

Interview Preparation Tips

Interview preparation tips for other job seekers - Be honest and mindful while answering the questions

Senior Data Engineer Interview Questions & Answers

HCLTech

Anonymous

posted on 7 Jan 2025

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

I applied via Recruitment Consultant

Round 1 - Technical

(5 Questions)

Q1. Explain ETL pipeline ecosystem in Azure Databricks?

Q2. Star vs Snowflake schema, when to use?

Q3. Find Salary higher than Average department salary
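
One common way to approach this question is to compute each department's average once and join it back to the employee rows. The sketch below is illustrative only: it uses Python's built-in sqlite3 module, and the `employee` table, its columns, and the sample rows are all hypothetical.

```python
import sqlite3

# Hypothetical table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (name TEXT, department TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employee VALUES (?, ?, ?)",
    [
        ("asha", "data", 120),
        ("bala", "data", 80),
        ("chen", "web", 90),
        ("dev", "web", 90),
        ("esha", "web", 60),
    ],
)

# Aggregate once per department, then keep employees above that average.
above_average = conn.execute(
    """
    SELECT e.name
    FROM employee AS e
    JOIN (
        SELECT department, AVG(salary) AS avg_salary
        FROM employee
        GROUP BY department
    ) AS d ON d.department = e.department
    WHERE e.salary > d.avg_salary
    ORDER BY e.name
    """
).fetchall()

print(above_average)
```

A window function such as AVG(salary) OVER (PARTITION BY department) is an alternative on engines that support it.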

Q4. Implementation of SCD2 table

Q5. How is incremental loading done?

Azure Data Engineer Interview Questions & Answers

Capgemini

Anonymous

posted on 26 Dec 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical

(5 Questions)

Q1. How would you create a pipeline for ADLS to SQL data movement?

Q2. How would you create a pipeline from REST API to ADLS? What if there are 8 million rows of records?

Q3. If data needs filtering, joining, and aggregation, how would you do it with ADF?

Q4. Explain medallion architecture.

Q5. Explain medallion architecture with Databricks.

Round 2 - HR

(1 Question)

Q1. Basic questions and salary expectation.

Interview Preparation Tips

Topics to prepare for Capgemini Azure Data Engineer interview:

ADF
Databricks

Data Engineer Interview Questions & Answers

Deloitte

Anonymous

posted on 3 Dec 2024

Interview experience

Average

Difficulty level

Easy

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Company Website and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical

(2 Questions)

Q1. SQL constraints, star schema, DML and DCL commands

Q2. About current project and responsibilities

Round 2 - Technical

(2 Questions)

Q1. Current projects and responsibilities

Q2. WHERE vs HAVING, reason for job change

Interview Preparation Tips

Interview preparation tips for other job seekers - 1. Technical - about your current project and responsibilities, basic SQL questions (constraints, star schema, DML and DCL commands), and writing one SQL query.
2. Technical with senior manager - about the project, WHERE vs HAVING, and reason for job change.

Data Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 7 Nov 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.

Round 1 - Technical

(7 Questions)

Q1. How do you optimize SQL queries?

Ans.

Optimizing SQL queries involves using indexes, avoiding unnecessary joins, and optimizing the query structure.

Use indexes on columns frequently used in WHERE clauses
Avoid using SELECT * and only retrieve necessary columns
Optimize joins by using INNER JOIN instead of OUTER JOIN when possible
Use EXPLAIN to analyze query performance and make necessary adjustments
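
The effect of an index on a WHERE-clause column can be checked by comparing query plans before and after creating it. The sketch below is a minimal illustration using SQLite's EXPLAIN QUERY PLAN through Python's built-in sqlite3 module; the `orders` table, the `idx_orders_customer` index, and the data are hypothetical, and other databases have their own EXPLAIN syntax and output.

```python
import sqlite3

# Hypothetical table, index, and data, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(i % 50, float(i)) for i in range(1000)],
)

query = "SELECT amount FROM orders WHERE customer_id = 7"


def plan(sql):
    # The human-readable plan step is the last column of each plan row.
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql)
    return " | ".join(row[-1] for row in rows)


plan_before = plan(query)  # no index on customer_id yet: full table scan
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
plan_after = plan(query)  # the planner can now search via the index

print(plan_before)
print(plan_after)
```

The exact plan text varies between SQLite versions, so it is best read for the scan-versus-index distinction rather than matched literally.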

Answered by AI

Q2. How do you do performance optimization in Spark? Tell how you did it in your project.

Ans.

Performance optimization in Spark involves tuning configurations, optimizing code, and utilizing caching.

Tune Spark configurations such as executor memory, number of executors, and shuffle partitions.
Optimize code by reducing unnecessary shuffles, using efficient transformations, and avoiding unnecessary data movements.
Utilize caching to store intermediate results in memory and avoid recomputation.
Example: In my projec...

Answered by AI

Q3. What is SparkContext and SparkSession?

Ans.

SparkContext is the entry point for Spark's core, low-level functionality, while SparkSession (introduced in Spark 2.0) is the unified entry point for the DataFrame, Dataset, and SQL APIs.

SparkContext is the entry point for low-level API functionality in Spark.
SparkSession wraps a SparkContext and is the entry point for DataFrame and Spark SQL functionality.
SparkContext is used to create RDDs (Resilient Distributed Datasets) in Spark.
SparkSession provides a unified entry point for reading data from various sources and performing

Answered by AI

Q4. When a Spark job is submitted, what happens at the backend? Explain the flow.

Ans.

When a Spark job is submitted, several steps are executed at the backend to process it.

The job is submitted to the Spark driver program.
The driver program communicates with the cluster manager to request resources.
The cluster manager allocates resources (CPU, memory) to the job.
The driver program creates DAG (Directed Acyclic Graph) of the job stages and tasks.
Tasks are then scheduled and executed on worker nodes ...

Answered by AI

Q5. Calculate second highest salary using SQL as well as pyspark.

Ans.

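Calculate the second highest salary using SQL and PySpark.

In SQL, select the distinct salaries, order them in descending order, and skip the first one (for example with LIMIT 1 OFFSET 1), or rank them with DENSE_RANK() and keep rank 2.
In PySpark, apply dense_rank() over a window ordered by salary in descending order and filter for rank 2, which also handles ties.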
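
A minimal sketch of the SQL side, assuming a hypothetical `employee` table with invented rows and using Python's built-in sqlite3 module as the engine. It shows why ties matter: two employees share the top salary here, so a plain ORDER BY with an offset of one row would return the top salary again.

```python
import sqlite3

# Hypothetical table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employee VALUES (?, ?)",
    [("a", 100), ("b", 300), ("c", 300), ("d", 200)],
)

# DISTINCT collapses ties so that two people sharing the top salary
# do not make the top salary appear as the "second highest".
second_highest = conn.execute(
    "SELECT DISTINCT salary FROM employee "
    "ORDER BY salary DESC LIMIT 1 OFFSET 1"
).fetchone()[0]

print(second_highest)
```

LIMIT and OFFSET syntax differs between databases, so the same idea may need TOP, FETCH, or a ranking function elsewhere.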

Answered by AI

Q6. What are the 2 types of modes for Spark architecture?

Ans.

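The two deploy modes for a Spark application are client mode and cluster mode.

Client mode: The driver runs on the machine that submitted the application, outside the cluster, which suits interactive work and debugging.
Cluster mode: The driver runs inside the cluster on a node chosen by the cluster manager (such as YARN, Kubernetes, or Spark's own standalone manager), which suits production workloads.
Standalone is a cluster manager rather than a deploy mode, and running Spark in a single JVM on one machine is called local mode.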

Answered by AI

Q7. If you want very low latency, which is better: standalone or client mode?

Ans.

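Standalone is a cluster manager and client is a deploy mode, so the two are not direct alternatives; latency depends mainly on where the driver runs relative to the executors.

Client mode keeps the driver on the submitting machine, which gives low latency when that machine is on the same network as the worker nodes.
If the application is submitted from a machine far from the workers, cluster mode is usually preferred because it places the driver inside the cluster and minimizes driver-executor network latency.
Low latency for real-time applications therefore depends on driver placement, not on choosing standalone over client mode.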

Answered by AI

Round 2 - Technical

(2 Questions)

Q1. Scenario based. Write SQL and pyspark code for a dataset.

Q2. If you have to find the latest record based on the latest timestamp in a table for a particular customer (the table has history), how will you do it? Self join and nested query will be expensive. Optimized query...
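
One approach often suggested for this kind of question is a window function that numbers each customer's rows from newest to oldest and keeps the first. The sketch below is an assumption about what such an answer could look like, not the interviewer's expected solution. The `customer_history` table and its rows are hypothetical, and it needs SQLite 3.25 or later for window function support.

```python
import sqlite3

# Hypothetical history table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer_history (customer_id INTEGER, status TEXT, updated_at TEXT)"
)
conn.executemany(
    "INSERT INTO customer_history VALUES (?, ?, ?)",
    [
        (1, "new", "2024-01-01 10:00:00"),
        (1, "active", "2024-03-01 09:30:00"),
        (2, "new", "2024-02-15 08:00:00"),
        (1, "paused", "2024-02-01 12:00:00"),
    ],
)

# Number each customer's rows from newest to oldest, then keep row 1.
latest = conn.execute(
    """
    SELECT customer_id, status, updated_at
    FROM (
        SELECT customer_id, status, updated_at,
               ROW_NUMBER() OVER (
                   PARTITION BY customer_id
                   ORDER BY updated_at DESC
               ) AS rn
        FROM customer_history
    ) AS ranked
    WHERE rn = 1
    ORDER BY customer_id
    """
).fetchall()

print(latest)
```

For a single customer, filtering on the customer ID and ordering by timestamp with a limit of one row is simpler still, especially with an index on those two columns.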

Interview Preparation Tips

Topics to prepare for LTIMindtree Data Engineer interview:

SQL
pyspark
ETL

Interview preparation tips for other job seekers - L2 was scheduled the day after L1, so the process is fast. Brush up on your practical knowledge.

Data Engineer Interview Questions & Answers

Genpact

Sashikanta Parida

posted on 17 Dec 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Recruitment Consultant and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical

(3 Questions)

Q1. What are the different types of joins available in Databricks?

Ans.

Different types of joins available in Databricks include inner join, full outer join, left join, right join, and cross join.

Inner join: Returns only the rows that have matching values in both tables.
Full outer join: Returns all rows from both tables, with NULLs filling the columns of the side that has no match.
Left join: Returns all rows from the left table and the matched rows from the right table.
Right join: Returns all rows from the right table and the matched rows ...

Answered by AI

Q2. How do you make your data pipeline fault tolerant?

Ans.

Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.

Use redundant components to ensure continuous data flow
Implement monitoring tools to detect failures and bottlenecks
Set up automated alerts for immediate response to issues
Design error handling mechanisms to gracefully handle failures
Use checkpoints and retries to ensure data integrity
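
The retry idea from the list above can be sketched in a few lines. This is a minimal illustration, not a production pattern: `run_with_retries` and `flaky_load` are hypothetical names, and a real pipeline would typically retry only on errors known to be transient and log each attempt.

```python
import time


def run_with_retries(step, max_attempts=3, base_delay=0.01, sleep=time.sleep):
    """Run step(); on failure wait with exponential backoff and try again."""
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except Exception:
            if attempt == max_attempts:
                raise  # out of retries: surface the failure to the caller
            sleep(base_delay * 2 ** (attempt - 1))


# Hypothetical flaky step that fails twice before succeeding.
calls = {"count": 0}


def flaky_load():
    calls["count"] += 1
    if calls["count"] < 3:
        raise ConnectionError("transient failure")
    return "loaded"


result = run_with_retries(flaky_load)
print(result, calls["count"])
```

Retries are only safe when the step is idempotent, which is where checkpoints come in: they let a restarted step resume without reprocessing or duplicating data.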

Answered by AI

Q3. What is AutoLoader?

Ans.

Auto Loader is a Databricks feature that incrementally and efficiently processes new data files as they arrive in cloud storage.

Provides a Structured Streaming source called cloudFiles
Tracks which files have already been ingested, so each file is processed exactly once
Supports schema inference and schema evolution
Can load from cloud object stores such as Azure Data Lake Storage, Amazon S3, and Google Cloud Storage

Answered by AI

Round 2 - Technical

(2 Questions)

Q1. How do you connect to different services in Azure?

Ans.

To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.

Use Azure SDKs for programming languages like Python, Java, C#, etc.
Utilize REST APIs to interact with Azure services programmatically.
Access and manage services through the Azure Portal.
Leverage Azure CLI for command-line interface interactions.
Automate tasks using Azure PowerShell scripts.

Answered by AI

Q2. What are linked Services?

Ans.

Linked Services are connections to external data sources or destinations in Azure Data Factory.

Linked Services define the connection information needed to connect to external data sources or destinations.
They can be used in Data Factory pipelines to read from or write to external systems.
Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.

Answered by AI
