Filter interviews by
Spark architecture is a distributed computing framework that consists of a driver program, cluster manager, and worker nodes.
Spark architecture includes a driver program that manages the execution of the Spark application.
It also includes a cluster manager that allocates resources and coordinates tasks across worker nodes.
Worker nodes are responsible for executing tasks and storing data in memory or disk.
Spark architec...
Top trending discussions
I applied via Company Website and was interviewed in Nov 2024. There were 2 interview rounds.
posted on 31 Dec 2024
Apache Spark architecture includes a cluster manager, worker nodes, and driver program.
Apache Spark architecture consists of a cluster manager, which allocates resources and schedules tasks.
Worker nodes execute tasks and store data in memory or disk.
Driver program coordinates tasks and communicates with the cluster manager.
Spark applications run as independent sets of processes on a cluster, coordinated by the SparkCon...
reduceBy is used to aggregate data based on key, while groupBy is used to group data based on key.
reduceBy is a transformation that combines the values of each key using an associative function and a neutral 'zero value'.
groupBy is a transformation that groups the data based on a key and returns a grouped data set.
reduceBy is more efficient for aggregating data as it reduces the data before shuffling, while groupBy shu...
RDD is a low-level abstraction representing a distributed collection of objects, while DataFrame is a higher-level abstraction representing a distributed collection of data organized into named columns.
RDD is more suitable for unstructured data and low-level transformations, while DataFrame is more suitable for structured data and high-level abstractions.
DataFrames provide optimizations like query optimization and code...
The different modes of execution in Apache Spark include local mode, standalone mode, YARN mode, and Mesos mode.
Local mode: Spark runs on a single machine with one executor.
Standalone mode: Spark runs on a cluster managed by a standalone cluster manager.
YARN mode: Spark runs on a Hadoop cluster using YARN as the resource manager.
Mesos mode: Spark runs on a Mesos cluster with Mesos as the resource manager.
I applied via Recruitment Consulltant and was interviewed in Dec 2024. There was 1 interview round.
SQL order of execution determines the sequence in which different clauses are processed in a query.
SQL query is parsed and validated first
Next, the query optimizer creates an execution plan
Execution plan includes steps like table scans, index scans, joins, etc.
Finally, the query is executed and results are returned
I was interviewed in Sep 2024.
They asked various questions on guestimates and few SQL Questions
I applied via LinkedIn and was interviewed in Nov 2024. There was 1 interview round.
Aptitude test involved with quantative aptitude, logical reasoning and reading comprehensions.
I have strong skills in data processing, ETL, data modeling, and programming languages like Python and SQL.
Proficient in data processing and ETL techniques
Strong knowledge of data modeling and database design
Experience with programming languages like Python and SQL
Familiarity with big data technologies such as Hadoop and Spark
Yes, I am open to relocating for the right opportunity.
I am willing to relocate for the right job opportunity.
I have experience moving for previous roles.
I am flexible and adaptable to new locations.
I am excited about the possibility of exploring a new city or country.
Databricks is a unified analytics platform that provides a collaborative environment for data scientists, engineers, and analysts.
Databricks is built on top of Apache Spark, providing a unified platform for data engineering, data science, and business analytics.
Internals of Databricks include a cluster manager, job scheduler, and workspace for collaboration.
Optimization techniques in Databricks include query optimizati...
I applied via Job Portal and was interviewed in Aug 2024. There were 3 interview rounds.
Its mandatory test even for experience people
Software Engineer
28
salaries
| ₹4 L/yr - ₹10 L/yr |
Software Developer
11
salaries
| ₹2.9 L/yr - ₹9.5 L/yr |
Test Engineer
11
salaries
| ₹3 L/yr - ₹9 L/yr |
Devops Engineer
11
salaries
| ₹4.5 L/yr - ₹10 L/yr |
Senior Software Engineer
10
salaries
| ₹7.5 L/yr - ₹15 L/yr |
Cognizant
Teleperformance
Reliance Retail
iEnergizer