Upload Button Icon Add office photos

Filter interviews by

Datasee.ai Data Engineer Interview Questions and Answers for Experienced

Updated 2 Sep 2024

Datasee.ai Data Engineer Interview Experiences for Experienced

1 interview found

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via LinkedIn and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Coding Test 

I have been asked with basic python coding questions ( strings)

Round 2 - HR 

(2 Questions)

  • Q1. Where you seen yourself after 5 years?
  • Ans. 

    In 5 years, I see myself leading a team of data engineers, implementing cutting-edge technologies, and driving impactful data-driven decisions.

    • Leading a team of data engineers

    • Implementing cutting-edge technologies

    • Driving impactful data-driven decisions

  • Answered by AI
  • Q2. What are you strength and weakness
  • Ans. 

    Strength: Strong problem-solving skills. Weakness: Sometimes too detail-oriented.

    • Strength: Ability to analyze complex data sets and find efficient solutions

    • Weakness: Occasionally get caught up in minor details and lose sight of the bigger picture

  • Answered by AI

Interview questions from similar companies

Interview experience
2
Poor
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Coding Test 

Oops dsa sql network

Round 2 - Technical 

(2 Questions)

  • Q1. Dsa based python question
  • Q2. Dsa based python question on tress
Round 3 - Technical 

(2 Questions)

  • Q1. Sql question queries
  • Q2. Sql question query
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. What is the SQL query to calculate the average sales over a period of 7 days?
  • Q2. Explain Data bricks architecture?
Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. Basic Python, SQL & Spark coding questions.
  • Q2. Nth highest salary, word count in pyspark
  • Ans. 

    To find the nth highest salary in pyspark, use the window function with row_number and filter on the desired rank.

    • Use window function with row_number to assign a rank to each salary

    • Filter the result to get the row with the desired rank

    • Example: df.withColumn('rank', F.row_number().over(Window.orderBy(F.col('salary').desc()))).filter(F.col('rank') == n).select('salary')

  • Answered by AI
Round 2 - Technical 

(1 Question)

  • Q1. Questions on SQL & PySpark.
Round 3 - Behavioral 

(1 Question)

  • Q1. Questions about previous projects & optimization techniques etc. Also behavioural
Round 4 - HR 

(1 Question)

  • Q1. Salary negotiation part.

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. Explain the architecture of delta lake
  • Ans. 

    Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads.

    • Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing.

    • It stores data in Parquet format and uses Apache Spark for processing.

    • Delta Lake ensures data reliability and data quality by providing schema enforcement and data versioning.

    • It supports time tra...

  • Answered by AI
  • Q2. Tuning operations in databricks
  • Ans. 

    Tuning operations in Databricks involves optimizing performance and efficiency of data processing tasks.

    • Use cluster configuration settings to allocate resources efficiently

    • Optimize code by minimizing data shuffling and reducing unnecessary operations

    • Leverage Databricks Auto Optimize to automatically tune performance

    • Monitor job performance using Databricks Runtime Metrics and Spark UI

  • Answered by AI

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - One-on-one 

(2 Questions)

  • Q1. Normal questions on spark, hadoop, sql
  • Q2. One python coding question was there
Round 2 - One-on-one 

(2 Questions)

  • Q1. Python coding felt difficult, its a moderate difficult question
  • Q2. Questions on project, sql moderate questions
Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. Spark architecture

Data Engineer Interview Questions & Answers

HCLTech user image Manoj Chitturi

posted on 28 May 2024

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Approached by Company and was interviewed in Apr 2024. There were 3 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. Explain Spark RDD,DF
  • Ans. 

    Spark RDD and DF are two data structures in Apache Spark for processing and analyzing data.

    • RDD (Resilient Distributed Dataset) is a distributed collection of elements that can be operated on in parallel. It is immutable and fault-tolerant.

    • DF (DataFrame) is a distributed collection of data organized into named columns. It provides a more structured and efficient way to work with data compared to RDDs.

    • RDD is low-level an...

  • Answered by AI
  • Q2. Spark Architecture
Round 2 - Case Study 

End to end data engineering architecture

Round 3 - HR 

(2 Questions)

  • Q1. Salary expectation
  • Q2. Cultural fitment

Skills evaluated in this interview

Interview experience
3
Average
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. Mostly on Spark

Data Engineer Interview Questions & Answers

Wipro user image Aarthi Varatharaji

posted on 13 Jun 2024

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
Not Selected
Round 1 - Technical 

(2 Questions)

  • Q1. What is executor memory
  • Ans. 

    Executor memory is the amount of memory allocated to each executor in a Spark application.

    • Executor memory is specified using the 'spark.executor.memory' configuration property.

    • It determines how much memory each executor can use to process tasks.

    • It is important to properly configure executor memory to avoid out-of-memory errors or inefficient resource utilization.

  • Answered by AI
  • Q2. Difference between rdd and data frame
  • Ans. 

    RDD is a low-level abstraction in Spark representing distributed data, while DataFrames are higher-level structured APIs for working with data.

    • RDD is an immutable distributed collection of objects, while DataFrames are distributed collection of data organized into named columns.

    • RDDs are more suitable for unstructured data and low-level transformations, while DataFrames provide a more user-friendly API for structured da...

  • Answered by AI

Skills evaluated in this interview

Datasee.ai Interview FAQs

How many rounds are there in Datasee.ai Data Engineer interview for experienced candidates?
Datasee.ai interview process for experienced candidates usually has 2 rounds. The most common rounds in the Datasee.ai interview process for experienced candidates are Coding Test and HR.
How to prepare for Datasee.ai Data Engineer interview for experienced candidates?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Datasee.ai. The most common topics and skills that interviewers at Datasee.ai expect are Analytical, Data Processing, Database, Gaming and Machine Learning.

Tell us how to improve this page.

Datasee.ai Data Engineer Salary
based on 5 salaries
₹14 L/yr - ₹15.4 L/yr
39% more than the average Data Engineer Salary in India
View more details
Data Engineer
5 salaries
unlock blur

₹14 L/yr - ₹15.4 L/yr

GIS Analyst
5 salaries
unlock blur

₹3.7 L/yr - ₹4 L/yr

Support Engineer
4 salaries
unlock blur

₹16.5 L/yr - ₹16.5 L/yr

Front end Developer
4 salaries
unlock blur

₹3.5 L/yr - ₹5 L/yr

Business Development Associate
3 salaries
unlock blur

₹4.5 L/yr - ₹5 L/yr

Explore more salaries
Compare Datasee.ai with

Fractal Analytics

4.0
Compare

Mu Sigma

2.7
Compare

Tiger Analytics

3.7
Compare

LatentView Analytics

3.7
Compare
Did you find this page helpful?
Yes No
write
Share an Interview