Add office photos
Engaged Employer

Coforge

3.3
based on 5k Reviews
Video summary
Filter interviews by

Mirabilis Design Interview Questions and Answers

Updated 5 Feb 2024
Popular Designations

Q1. Write python code to remove duplicates from list of string

Ans.

Python code to remove duplicates from list of strings

  • Use set() to remove duplicates from the list

  • Convert the set back to a list to maintain the order of strings

  • Example: input_list = ['apple', 'banana', 'apple', 'orange']

  • Output: ['apple', 'banana', 'orange']

Add your answer

Q2. Write SQL to get 2nd highest sal

Ans.

Use SQL query with ORDER BY and LIMIT to get 2nd highest salary.

  • Use SELECT statement to retrieve salary column

  • Use ORDER BY clause to sort salaries in descending order

  • Use LIMIT 1,1 to get the second highest salary

Add your answer

Q3. Remove duplicates in a dataframe

Ans.

Use drop_duplicates() method to remove duplicates in a dataframe

  • Use drop_duplicates() method on the dataframe to remove duplicates based on all columns

  • Specify subset parameter to remove duplicates based on specific columns

  • Use keep parameter to control which duplicate to keep (first, last, or False)

  • Example: df.drop_duplicates()

  • Example: df.drop_duplicates(subset=['column1', 'column2'])

Add your answer

Q4. Pyspark scd type2 implementation

Ans.

Implementing Slowly Changing Dimension Type 2 in PySpark

  • Use PySpark DataFrame operations to handle SCD Type 2 implementation

  • Maintain historical records by adding new rows with updated information and end dates for previous records

  • Utilize window functions and joins to identify changes and update records accordingly

Add your answer
Discover Mirabilis Design interview dos and don'ts from real experiences

Q5. Data skewness handling in spark

Ans.

Data skewness handling in Spark involves redistributing data to balance workload and optimize performance.

  • Use repartition() or coalesce() to redistribute data evenly across partitions

  • Consider using broadcast joins for small tables to avoid data shuffling

  • Implement custom partitioning strategies for specific use cases

  • Monitor job performance and adjust partitioning as needed

Add your answer

Q6. Optimisation technique in saprk

Ans.

Optimisation techniques in Spark improve performance by efficiently utilizing resources.

  • Use partitioning to distribute data evenly across nodes

  • Cache intermediate results to avoid recomputation

  • Use broadcast variables for small lookup tables

  • Optimize shuffle operations to reduce data movement

Add your answer
Contribute & help others!
Write a review
Share interview
Contribute salary
Add office photos

Interview Process at Mirabilis Design

based on 4 interviews
2 Interview rounds
Resume Shortlist Round
Technical Round
View more
Interview Tips & Stories
Ace your next interview with expert advice and inspiring stories

Top Senior Data Engineer Interview Questions from Similar Companies

3.7
 • 36 Interview Questions
3.7
 • 24 Interview Questions
3.5
 • 14 Interview Questions
3.7
 • 10 Interview Questions
View all
Share an Interview
Stay ahead in your career. Get AmbitionBox app
qr-code
Helping over 1 Crore job seekers every month in choosing their right fit company
75 Lakh+

Reviews

5 Lakh+

Interviews

4 Crore+

Salaries

1 Cr+

Users/Month

Contribute to help millions

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

Follow us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter