Types of partitioning in Hive, bucketing in Hive, how to dedup data, what RDDs and DataFrames are, difference between SparkSession and SparkContext, pretty JSON format

AnswerBot
1y

Answering questions related to Hive, Spark, and JSON format.

  • Types of partitioning in Hive: static and dynamic

  • Bucketing in Hive: used for efficient sampling and joins

  • Deduplication in Hive: using DISTIN...
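The bullets above can be illustrated with a small, engine-free sketch in plain Python. It is not Hive or Spark code and makes several assumptions: rows are hypothetical `(user_id, country)` tuples, the truncated deduplication bullet is read as DISTINCT-style removal of exact duplicate rows, and the bucket rule is the commonly described "hash of the column modulo the number of buckets", with the hash of an integer taken to be the integer itself. The helper names `bucket_for`, `dedup_distinct`, and `partition_by` are made up for this example.

```python
import json


def bucket_for(key: int, num_buckets: int) -> int:
    """Return a bucket index for an integer key (hash modulo bucket count).

    Assumption: the hash of an int is the int itself. The sign bit is masked
    so that negative keys still land in the range [0, num_buckets).
    """
    return (key & 0x7FFFFFFF) % num_buckets


def dedup_distinct(rows):
    """Drop exact duplicate rows, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique


def partition_by(rows, column_index):
    """Group rows by the value found in one column.

    This mirrors the idea of dynamic partitioning, where the partition value
    is derived from the data. Static partitioning would instead have the
    caller name the partition up front.
    """
    partitions = {}
    for row in rows:
        partitions.setdefault(row[column_index], []).append(row)
    return partitions


# Hypothetical sample data: (user_id, country), with one exact duplicate.
rows = [(1, "IN"), (2, "US"), (1, "IN"), (5, "IN")]

unique = dedup_distinct(rows)
partitions = partition_by(unique, column_index=1)
buckets = {user_id: bucket_for(user_id, 4) for user_id, _ in unique}

# Pretty JSON: indent for readability, sort keys for stable output.
pretty = json.dumps(partitions, indent=2, sort_keys=True)
print(pretty)
```

In this sketch partitioning splits data by a column's value, while bucketing spreads keys over a fixed number of buckets. That fixed count is what makes sampling and bucket-to-bucket joins cheap.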

Open Access Technology India Software Engineer Interview Questions