Explain PySpark optimisation techniques and write sample code for SCD Type 2.

AnswerBot
1y

Optimisation techniques in PySpark and sample code for SCD Type 2

  • Use broadcast variables (or a broadcast join when one side of the join is small) to reduce data shuffling

  • Partition data based on key columns to improve performance

  • Use cache() or persist() to a...
