I applied via Naukri.com and was interviewed in Oct 2023. There was 1 interview round.
3 SQL and 2 Python questions were asked. SQL was moderate; Python was tough.
I applied via Referral and was interviewed in May 2024. There were 4 interview rounds.
Python and SQL questions were asked
Python, dictionary operations
I applied via Campus Placement and was interviewed before May 2023. There was 1 interview round.
I applied via Naukri.com and was interviewed in Jun 2022. There was 1 interview round.
Optimizing Spark jobs involves tuning various parameters such as memory allocation, partitioning, and caching.
Increase memory allocation for executors and driver
Partition data appropriately to avoid skewness
Cache frequently accessed data to avoid recomputation
Use broadcast variables for small data sets
Avoid shuffling data unnecessarily
Use appropriate serialization format
Use appropriate hardware for cluster
Monitor and o
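To make the point about skewness concrete, here is a minimal plain-Python sketch of key salting, one common way to spread a hot key across partitions. It is a simulation, not Spark code: `partition_of`, `partition_sizes`, and `salt_keys` are hypothetical helpers, `zlib.crc32` stands in for whatever hash the real partitioner uses, and the 900/100 row split is made-up sample data.

```python
import zlib
from collections import Counter


def partition_of(key: str, num_partitions: int) -> int:
    """Assign a key to a partition with a deterministic hash (assumed stand-in)."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def partition_sizes(keys, num_partitions):
    """Count how many rows land in each partition."""
    counts = Counter(partition_of(k, num_partitions) for k in keys)
    return [counts.get(p, 0) for p in range(num_partitions)]


def salt_keys(keys, hot_key, num_salts):
    """Append a rotating suffix to the hot key so its rows hash to different partitions."""
    salted = []
    seen = 0
    for k in keys:
        if k == hot_key:
            salted.append(f"{k}#{seen % num_salts}")
            seen += 1
        else:
            salted.append(k)
    return salted


# Made-up sample: one key owns 90% of the rows.
keys = ["hot"] * 900 + [f"user{i}" for i in range(100)]

before = partition_sizes(keys, 4)
after = partition_sizes(salt_keys(keys, "hot", 8), 4)

print("rows per partition before salting:", before)
print("rows per partition after salting: ", after)
```

Before salting, every "hot" row hashes to the same partition, so one task does most of the work. After salting, the same rows are spread over several partitions. In a real job the other side of a join would need matching salted keys, which this sketch leaves out.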
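Coalesce and repartition both change the number of partitions in a DataFrame: coalesce merges existing partitions without a full shuffle, while repartition shuffles data across nodes. Spark also has a separate SQL function named coalesce that selects the first non-null value from a set of columns.
The coalesce() SQL function returns the first non-null value from a set of columns; it does not change the number of columns or partitions.
Repartition is used to shuffle data across nodes, typically to increase parallelism or rebalance data.
DataFrame.coalesce(n) reduces the number of partitions in a DataFrame and cannot increase it.
Repartition increases or decreases the number of partitions in a DataFrame.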
Coalesce is u...
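Since "coalesce" names two different things in Spark (a column function and a partition operation), a toy model in plain Python may help keep them apart. This is only a sketch: partitions are modelled as lists of rows, the three function names are hypothetical, and the merge and round-robin strategies are simplifying assumptions rather than Spark's actual implementation.

```python
def coalesce_values(*values):
    """SQL-style COALESCE: return the first value that is not None."""
    for v in values:
        if v is not None:
            return v
    return None


def coalesce_partitions(partitions, n):
    """Merge whole partitions into at most n groups; no partition is split (no full shuffle)."""
    if n >= len(partitions):
        # Like DataFrame.coalesce, asking for more partitions leaves the count unchanged.
        return [list(p) for p in partitions]
    merged = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        merged[i % n].extend(part)
    return merged


def repartition(partitions, n):
    """Full shuffle: redistribute every row round-robin across n partitions."""
    rows = [row for part in partitions for row in part]
    out = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        out[i % n].append(row)
    return out


# Made-up sample: four uneven partitions.
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9]]

merged = coalesce_partitions(parts, 2)
shuffled = repartition(parts, 2)

print("coalesce sizes:   ", [len(p) for p in merged])
print("repartition sizes:", [len(p) for p in shuffled])
print("first non-null:   ", coalesce_values(None, None, 3, 4))
```

In this model, merging keeps the original imbalance because whole partitions are glued together, while the full shuffle evens out the sizes at the cost of moving every row. That is the usual trade-off when choosing between the two.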
posted on 23 Sep 2024
It was really a great experience.
In 3 years, I see myself leading a team of data engineers, implementing cutting-edge technologies, and driving impactful data-driven decisions.
Leading a team of data engineers
Implementing cutting-edge technologies
Driving impactful data-driven decisions
My strengths include strong analytical skills, attention to detail, and the ability to work well in a team.
Strong analytical skills - able to analyze complex data sets and derive meaningful insights
Attention to detail - meticulous in ensuring data accuracy and quality
Team player - collaborate effectively with colleagues to achieve common goals
I have worked on projects involving building data pipelines, optimizing database performance, and creating machine learning models.
Built data pipelines using Apache Spark and Kafka
Optimized database performance by tuning queries and indexes
Created machine learning models for predictive analytics
Implemented real-time data processing using technologies like Apache Flink
My CGPA is 3.8 out of 4.0.
My CGPA is 3.8, which is considered high in my university.
I have consistently maintained a high CGPA throughout my academic career.
I have received several academic awards based on my CGPA.
My CGPA reflects my dedication and hard work towards my studies.
My hobbies include hiking, photography, and playing the guitar.
Hiking: I enjoy exploring nature trails and challenging myself physically.
Photography: I love capturing moments and landscapes through my camera lens.
Playing the guitar: I find relaxation and creativity in strumming chords and learning new songs.
| Designation | Salaries reported | Salary range |
| --- | --- | --- |
| Cloud Engineer | 56 | ₹3.4 L/yr - ₹8 L/yr |
| Regional Sales Manager | 50 | ₹6.4 L/yr - ₹25.5 L/yr |
| Customer Life Cycle Manager | 32 | ₹3.6 L/yr - ₹5.4 L/yr |
| Customer Lifecycle Manager | 23 | ₹4 L/yr - ₹5.4 L/yr |
| Accounts Manager | 20 | ₹5 L/yr - ₹11.5 L/yr |
CtrlS
Sify Technologies
Web Werks
Reliance Data Center