First round

Spark:
  • Accumulators: why and how to use them
  • repartition and coalesce: why and how
  • How Spark generates code and executes it
  • Which operation executes where (for example, df2 = df1.filter.map.collection, then df2.map)
  • Spark modules
  • Why out-of-memory exceptions and other issues occur, and how to solve them
  • When to use a broadcast join (asked indirectly)
  • Group by and count example in Spark
  • Project and data flow (where the data comes from and where it goes)

Hive:
  • Bucketing and partitioning: why and how

Python:
  • Memory management
  • Tuple and list
  • Program to find the minimum number of swaps to sort an array
  • How to handle specific errors such as KeyError using try and except
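
For the two Python coding questions above (minimum swaps to sort an array, and handling KeyError with try and except), a minimal sketch might look like the following. It is not the interviewer's expected answer; it assumes the array holds distinct values and counts swaps by decomposing the permutation into cycles. The function names `min_swaps_to_sort` and `lookup_salary` and the sample data are hypothetical.

```python
def min_swaps_to_sort(values):
    """Return the minimum number of swaps needed to sort `values`.

    Assumption: all values are distinct. A cycle of length k in the
    permutation needs k - 1 swaps to put its elements in place.
    """
    # order[k] is the original index of the element that belongs at sorted position k.
    order = sorted(range(len(values)), key=lambda i: values[i])
    visited = [False] * len(values)
    swaps = 0
    for start in range(len(values)):
        # Skip elements already in place or already counted in a cycle.
        if visited[start] or order[start] == start:
            continue
        cycle_len = 0
        j = start
        while not visited[j]:
            visited[j] = True
            j = order[j]
            cycle_len += 1
        swaps += cycle_len - 1
    return swaps


def lookup_salary(salaries, name):
    """Return the salary for `name`, or None if the key is missing.

    Catching KeyError specifically (rather than a bare `except`) lets
    unrelated errors still surface.
    """
    try:
        return salaries[name]
    except KeyError:
        return None


if __name__ == "__main__":
    print(min_swaps_to_sort([4, 3, 2, 1]))
    print(lookup_salary({"asha": 100}, "ravi"))
```

Catching only `KeyError` keeps the handler narrow; for a simple default value, `dict.get` is a common alternative to try and except.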

Impetus Technologies Big Data Engineer Interview Questions
