i
Ernst & Young
Filter interviews by
Spark architecture refers to the structure of Apache Spark, a distributed computing framework.
Spark architecture consists of a cluster manager, worker nodes, and a driver program.
The cluster manager allocates resources and schedules tasks across worker nodes.
Worker nodes execute tasks in parallel and store data in memory or disk.
The driver program coordinates the execution of tasks and manages the overall workflow.
Spar...
Optimizing Spark jobs involves tuning configurations, partitioning data, using appropriate data structures, and leveraging caching.
Tune Spark configurations for optimal performance
Partition data to distribute workload evenly
Use appropriate data structures like DataFrames or Datasets
Leverage caching to avoid recomputation
Optimize shuffle operations to reduce data movement
It was also very easy.
I applied via Naukri.com and was interviewed in May 2024. There was 1 interview round.
PySpark architecture is a distributed computing framework that combines Python and Spark to process big data.
PySpark architecture consists of a driver program, cluster manager, and worker nodes.
The driver program is responsible for creating SparkContext, which connects to the cluster manager.
Cluster manager allocates resources and schedules tasks on worker nodes.
Worker nodes execute the tasks and return results to the ...
Skewed partitioning is when data is not evenly distributed across partitions, leading to performance issues.
Skewed partitioning can occur when a key column has a few values that are much more common than others.
It can lead to uneven processing and resource utilization in distributed systems like Hadoop or Spark.
To address skewed partitioning, techniques like data skew detection, data skew handling, and data skew preven
MDM (Master Data Management) typically consists of three layers: operational, analytical, and data governance.
Operational layer: manages the day-to-day data operations and transactions.
Analytical layer: focuses on data analysis and reporting for decision-making.
Data governance layer: ensures data quality, security, and compliance.
Example: In a retail company, the operational layer manages customer transactions, the ana...
What people are saying about Ernst & Young
I applied via Recruitment Consulltant and was interviewed before Jul 2022. There were 4 interview rounds.
Mostly coding questions on Python, pyspark and SQL of medium difficulty.
Ernst & Young interview questions for designations
I applied via Referral and was interviewed before Oct 2019. There were 4 interview rounds.
I applied via Naukri.com and was interviewed before Nov 2020. There were 3 interview rounds.
I applied via Referral and was interviewed before Sep 2021. There were 2 interview rounds.
I applied via Walk-in and was interviewed before Feb 2021. There were 2 interview rounds.
No
I applied via Walk-in and was interviewed before Nov 2020. There were 3 interview rounds.
I applied via Walk-in and was interviewed before Nov 2020. There was 1 interview round.
based on 4 interviews
1 Interview rounds
based on 6 reviews
Rating in categories
Senior Consultant
15.9k
salaries
| ₹0 L/yr - ₹0 L/yr |
Consultant
12k
salaries
| ₹0 L/yr - ₹0 L/yr |
Manager
7.6k
salaries
| ₹0 L/yr - ₹0 L/yr |
Assistant Manager
6.4k
salaries
| ₹0 L/yr - ₹0 L/yr |
Associate Consultant
3.9k
salaries
| ₹0 L/yr - ₹0 L/yr |
Deloitte
PwC
EY Global Delivery Services ( EY GDS)
Accenture