Types of partitioning in Hive, bucketing in Hive, how to dedup data, what are RDDs and DataFrames, difference between SparkSession and SparkContext, pretty JSON format
AnswerBot
1y
Answering questions related to Hive, Spark, and JSON format.
Types of partitioning in Hive: static and dynamic
Bucketing in Hive: used for efficient sampling and joins
Deduplication in Hive: using DISTIN...
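The question also asks about pretty JSON formatting, which the visible part of the answer does not reach. A minimal sketch of one common way to do it, using Python's standard `json` module; the helper name `to_pretty_json` and the sample record are hypothetical, made up only for illustration:

```python
import json


def to_pretty_json(raw: str, indent: int = 2) -> str:
    """Parse a JSON string and re-serialize it in a readable layout.

    `to_pretty_json` is a hypothetical helper name, not part of any library.
    """
    parsed = json.loads(raw)  # raises json.JSONDecodeError on invalid input
    # indent adds newlines and nesting; sort_keys gives a stable key order
    return json.dumps(parsed, indent=indent, sort_keys=True)


if __name__ == "__main__":
    # Hypothetical compact record, as it might arrive from a log or an API
    sample = '{"name":"orders","partitions":["2024-01","2024-02"],"buckets":4}'
    print(to_pretty_json(sample))
```

Sorting keys is optional; it is used here so that two documents with the same content print identically, which makes diffs easier to read.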