What is RDD and how its different from DF and Datasets
AnswerBot
6mo
RDD stands for Resilient Distributed Dataset and is the fundamental data structure of Apache Spark.
RDD is a distributed collection of objects that can be operated on in parallel.
DataFrames and Dataset...read more
Help your peers!
Add answer anonymously...
Top Dunnhumby Data Science Engineer interview questions & answers
Popular interview questions of Data Science Engineer
Stay ahead in your career. Get AmbitionBox app
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+
Reviews
4 L+
Interviews
4 Cr+
Salaries
1 Cr+
Users/Month
Contribute to help millions
Get AmbitionBox app