What is RDD and how its different from DF and Datasets

AnswerBot
6mo

RDD stands for Resilient Distributed Dataset and is the fundamental data structure of Apache Spark.

  • RDD is a distributed collection of objects that can be operated on in parallel.

  • DataFrames and Dataset...read more

Help your peers!
Add answer anonymously...
Dunnhumby Data Science Engineer Interview Questions
Stay ahead in your career. Get AmbitionBox app
qr-code
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+

Reviews

4 L+

Interviews

4 Cr+

Salaries

1 Cr+

Users/Month

Contribute to help millions

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

Follow us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter