I applied via Referral and was interviewed before May 2023. There were 2 interview rounds.
Transformations in Databricks manipulate data using functions such as map, filter, and flatMap.
Transformations are operations applied to RDDs in Databricks
Common transformations include map, filter, flatMap, and reduceByKey; note that reduce is an action, not a transformation
Transformations are lazily evaluated and return a new RDD; nothing is computed until an action is called
Example: a map transformation that converts each element of an RDD to uppercase
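The uppercase example can be sketched in PySpark as below. This is a minimal illustration, not a definitive implementation: the function names `to_upper` and `uppercase_words` and the length filter are hypothetical, and `sc` is assumed to be an existing SparkContext (Databricks notebooks predefine one under that name).

```python
def to_upper(word):
    """Pure function applied to each element by the map transformation."""
    return word.upper()


def uppercase_words(sc, words, max_len=5):
    """Uppercase every word, keep the short ones, and return them as a list.

    `sc` is assumed to be an existing SparkContext.
    """
    rdd = sc.parallelize(words)

    # Transformations: each call returns a new RDD and runs nothing yet.
    upper_rdd = rdd.map(to_upper)
    short_rdd = upper_rdd.filter(lambda w: len(w) <= max_len)

    # Action: collect() triggers the actual computation.
    return short_rdd.collect()
```

In a notebook, `uppercase_words(sc, ["spark", "databricks"])` would build the two lazy transformations and only execute them when `collect()` is reached.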
Spark is a distributed computing framework for processing big data.
Spark is built around the concept of Resilient Distributed Datasets (RDDs)
It supports various programming languages like Scala, Java, Python, and R
Spark provides high-level APIs like DataFrames and Datasets for structured data processing
It includes libraries for SQL, streaming, machine learning, and graph processing
Spark can run on various cluster manag
Performance enhancements in PySpark involve optimizing code, tuning configurations, and utilizing efficient data structures.
Use partitioning to distribute data evenly across nodes
Cache intermediate results to avoid recomputation
Optimize joins by broadcasting smaller tables
Use efficient data formats like Parquet or ORC
Tune Spark configurations for memory and parallelism
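Several of these points can be combined in one short PySpark sketch. Everything named here is an assumption made for illustration: the function `enrich_orders`, the `country_code` join column, and the value chosen for `spark.sql.shuffle.partitions` are hypothetical and would need tuning for a real workload.

```python
def enrich_orders(spark, orders_df, countries_df, output_path):
    """Join a large DataFrame to a small lookup table and save it as Parquet.

    `spark` is assumed to be an existing SparkSession; the column name
    `country_code` is a hypothetical join key.
    """
    # Imported inside the function so this module still loads where
    # PySpark is not installed.
    from pyspark.sql.functions import broadcast

    # Tune parallelism for shuffles (the value is only a placeholder).
    spark.conf.set("spark.sql.shuffle.partitions", "200")

    # Broadcast the small table so the large one is not shuffled for the join.
    joined = orders_df.join(broadcast(countries_df), on="country_code", how="left")

    # Cache the intermediate result because it is used twice below.
    joined.cache()
    row_count = joined.count()

    # Write in a columnar format, partitioned on disk by the join key.
    joined.write.mode("overwrite").partitionBy("country_code").parquet(output_path)
    return row_count
```

Broadcasting only pays off when the smaller table fits comfortably in executor memory; otherwise a regular shuffle join is the safer choice.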
I applied via Approached by Company and was interviewed before Feb 2023. There were 3 interview rounds.
It was a HackerEarth test with questions ranging from simple to intermediate
I applied via Naukri.com and was interviewed before Nov 2021. There were 4 interview rounds.
A language-based coding test, which may be quite specific to proficiency in data technologies
posted on 21 Mar 2022
I applied via Naukri.com and was interviewed in Sep 2021. There were 3 interview rounds.
Questions related to cloud types, ADF activities, advanced SQL, and basic OOPs concepts.
Types of cloud include public, private, and hybrid
ADF activities include data ingestion, transformation, and loading
Advanced SQL includes window functions, subqueries, and joins
Basic OOPs concepts include encapsulation, inheritance, and polymorphism
posted on 27 Mar 2024
I applied via Approached by Company and was interviewed in Sep 2023. There were 2 interview rounds.
Use SQL query with subquery to find nth highest salary
Use ORDER BY salary DESC with LIMIT 1 and OFFSET n-1 to get the nth highest salary
Use a subquery to exclude the top n-1 salaries before selecting the nth highest salary
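Both approaches can be tried quickly with Python's built-in `sqlite3` module. This is a sketch under stated assumptions: the `employees` table, its sample rows, and the function names are made up for illustration, and other databases may use a different LIMIT/OFFSET syntax.

```python
import sqlite3


def nth_highest_salary(conn, n):
    """ORDER BY + LIMIT/OFFSET: skip the top n-1 distinct salaries."""
    row = conn.execute(
        "SELECT DISTINCT salary FROM employees "
        "ORDER BY salary DESC LIMIT 1 OFFSET ?",
        (n - 1,),
    ).fetchone()
    return row[0] if row else None


def nth_highest_salary_subquery(conn, n):
    """Subquery: exclude the top n-1 distinct salaries, then take the max."""
    row = conn.execute(
        """
        SELECT MAX(salary) FROM employees
        WHERE salary NOT IN (
            SELECT DISTINCT salary FROM employees
            ORDER BY salary DESC LIMIT ?
        )
        """,
        (n - 1,),
    ).fetchone()
    return row[0]


# Hypothetical sample data; two employees share the top salary.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [("Asha", 90000), ("Ben", 120000), ("Chen", 120000),
     ("Dev", 75000), ("Eli", 60000)],
)

print(nth_highest_salary(conn, 2))           # 90000
print(nth_highest_salary_subquery(conn, 2))  # 90000
```

Using DISTINCT means tied salaries count once, so the second highest here is 90000 rather than the duplicate 120000. Both functions return `None` when fewer than n distinct salaries exist.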
posted on 7 Jan 2025
I applied via Approached by Company and was interviewed before Jan 2024. There were 3 interview rounds.
Basics of SQL, Python
Experience based questions, SQL and Python
I have worked on projects involving building data pipelines, optimizing data storage, and developing machine learning models.
Built data pipelines using Apache Spark and Airflow
Optimized data storage by implementing partitioning and indexing strategies
Developed machine learning models for predictive analytics
I applied via LinkedIn and was interviewed in Jun 2021. There were 3 interview rounds.
Prepare well for coding and programming
A topic will be given, and you should talk about it for a minute
I applied via Approached by Company and was interviewed in Jun 2022. There was 1 interview round.
Python and SQL scenarios