SAVE Housing Finance Interview Questions and Answers

Updated 12 Sep 2024

Q1. What is the main advantage of Delta Lake?

Ans.

Delta Lake provides ACID transactions, schema enforcement, and time travel capabilities for data lakes.

  • ACID transactions ensure data consistency and reliability.

  • Schema enforcement helps maintain data quality and prevent data corruption.

  • Time travel allows users to access and revert to previous versions of data for auditing or analysis purposes.

Q2. Find the student with marks greater than 80 in all subjects

Ans.

Filter students with marks greater than 80 in all subjects

  • Iterate through each student's marks in all subjects

  • Check if all marks are greater than 80 for a student

  • Return the student if all marks are greater than 80
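The steps above can be sketched in plain Python. This is a minimal illustration, not the interviewer's expected answer: the sample data, the `students_above` helper, and the dictionary layout (student name mapped to per-subject marks) are all assumptions made for the example.

```python
# Hypothetical sample data: student name -> {subject: marks}
marks_by_student = {
    "Asha": {"math": 91, "physics": 85, "chemistry": 88},
    "Ravi": {"math": 95, "physics": 79, "chemistry": 90},
    "Meena": {"math": 81, "physics": 82, "chemistry": 99},
    "Kiran": {"math": 80, "physics": 90, "chemistry": 92},
}


def students_above(marks_by_student, threshold=80):
    """Return names of students whose marks exceed threshold in every subject."""
    return [
        name
        for name, marks in marks_by_student.items()
        # Skip students with no recorded marks; all() on an empty set would be True.
        if marks and all(score > threshold for score in marks.values())
    ]


top_students = students_above(marks_by_student)
print(top_students)  # ['Asha', 'Meena']
```

Kiran is excluded because 80 is not strictly greater than 80. In SQL, the same check is often written by grouping on the student and keeping groups where `MIN(marks) > 80`.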

Q3. Write the syntax to define the schema of a file for loading.

Ans.

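Syntax to define the schema of a file for loading

  • Use a CREATE TABLE ... USING statement in Spark SQL (the Hive-style EXTERNAL keyword cannot be combined with USING)

  • Specify column names and data types in the schema definition

  • Example: CREATE TABLE MyTable (col1 INT, col2 STRING) USING CSV LOCATION '/path/to/files' (the path is a placeholder)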
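To show what declaring column names and data types buys you at load time, here is a small standard-library Python sketch. It is not Spark or SQL engine code: `SCHEMA`, `load_with_schema`, and the sample text are hypothetical names chosen to mirror the `(col1 INT, col2 STRING)` example.

```python
import csv
import io

# Hypothetical schema: ordered (column name, Python type) pairs,
# mirroring (col1 INT, col2 STRING).
SCHEMA = [("col1", int), ("col2", str)]


def load_with_schema(text, schema):
    """Parse headerless CSV text, casting each field to its declared type."""
    rows = []
    for line_no, fields in enumerate(csv.reader(io.StringIO(text)), start=1):
        if len(fields) != len(schema):
            raise ValueError(
                f"line {line_no}: expected {len(schema)} fields, got {len(fields)}"
            )
        # A failed cast (e.g. int("abc")) raises ValueError, rejecting the row.
        rows.append({name: cast(value) for (name, cast), value in zip(schema, fields)})
    return rows


sample = "1,alpha\n2,beta\n"
records = load_with_schema(sample, SCHEMA)
print(records)  # [{'col1': 1, 'col2': 'alpha'}, {'col1': 2, 'col2': 'beta'}]
```

Rows that do not match the declared schema fail loudly instead of being loaded with the wrong types, which is the practical reason for defining a schema up front.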

Q4. How do you do performance tuning in ADF?

Ans.

Performance tuning in Azure Data Factory involves optimizing data flows and activities to improve efficiency and reduce processing time.

  • Identify bottlenecks in data flows and activities

  • Optimize data partitioning and distribution

  • Use appropriate data integration patterns

  • Leverage caching and parallel processing

  • Monitor and analyze performance metrics

Q5. What is Azure synapse architecture?

Ans.

Azure Synapse is a cloud-based analytics service that brings together big data and data warehousing.

  • Azure Synapse integrates big data and data warehousing capabilities in a single service

  • It allows for data ingestion, preparation, management, and serving for BI and machine learning

  • Supports both serverless and provisioned resources for data processing

  • Offers integration with Azure Machine Learning, Power BI, and Azure Data Factory

Q6. What are the types of clusters in Databricks?

Ans.

Types of clusters in Databricks include Standard, High Concurrency, and Single Node clusters.

  • Standard cluster: Suitable for running single jobs or workflows.

  • High Concurrency cluster: Designed for multiple users running concurrent jobs.

  • Single Node cluster: Used for development and testing purposes.

Q7. What is a catalyst optimizer?

Ans.

The catalyst optimizer is a query optimization engine in Apache Spark that improves performance by generating optimized query plans.

  • It is a query optimization engine in Apache Spark.

  • It improves performance by generating optimized query plans.

  • It uses rule-based and cost-based optimization techniques.

  • It leverages advanced techniques like code generation and adaptive query execution.

  • Example: Catalyst optimizer in Spark SQL analyzes the query and generates an optimized query plan...read more

Q8. What is the Catalyst optimizer?

Ans.

Catalyst optimizer is a query optimization framework in Apache Spark.

  • Catalyst optimizer is a primarily rule-based optimization framework (with cost-based optimization also available) used in Apache Spark for optimizing query plans.

  • It leverages advanced programming language features in Scala to build an extensible query optimizer.

  • Catalyst optimizer performs various optimizations such as constant folding, predicate pushdown, and projection pruning.

  • It helps in improving the performance of Spark SQL queries by generating efficient que...read more
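Constant folding, one of the optimizations named above, can be illustrated with a toy rule over a tiny expression tree. This is a simplified sketch and not Spark's implementation (Catalyst itself is written in Scala); the `Literal`, `Column`, and `Add` classes and the `fold_constants` function are hypothetical names for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Literal:
    value: int


@dataclass(frozen=True)
class Column:
    name: str


@dataclass(frozen=True)
class Add:
    left: object
    right: object


def fold_constants(expr):
    """Rewrite Add(Literal, Literal) into a single Literal, bottom-up."""
    if isinstance(expr, Add):
        left = fold_constants(expr.left)
        right = fold_constants(expr.right)
        if isinstance(left, Literal) and isinstance(right, Literal):
            # Both operands are known at planning time, so compute them once.
            return Literal(left.value + right.value)
        return Add(left, right)
    # Literals and column references are already in simplest form.
    return expr


# price + (1 + 2) is rewritten to price + 3
plan = Add(Column("price"), Add(Literal(1), Literal(2)))
optimized = fold_constants(plan)
print(optimized)  # Add(left=Column(name='price'), right=Literal(value=3))
```

A rule-based optimizer applies many such tree-rewriting rules repeatedly until the plan stops changing; this sketch shows a single rule applied in one pass.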

Q9. Find the duplicate rows.

Ans.

Use a SQL query with GROUP BY and a HAVING clause to find duplicate rows.

  • Use GROUP BY to group rows with the same values

  • Use HAVING COUNT(*) > 1 to keep only the groups that occur more than once

  • Example: SELECT column1, column2, COUNT(*) FROM table_name GROUP BY column1, column2 HAVING COUNT(*) > 1
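The query can be tried end to end with Python's built-in `sqlite3` module. This is a minimal sketch: the in-memory table and its sample rows are made up for the example, and the `ORDER BY` clause is an addition that only makes the output order deterministic.

```python
import sqlite3

# In-memory database with hypothetical sample rows, some of them repeated.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table_name (column1 TEXT, column2 INTEGER)")
conn.executemany(
    "INSERT INTO table_name VALUES (?, ?)",
    [("a", 1), ("b", 2), ("a", 1), ("c", 3), ("b", 2), ("a", 1)],
)

# Group identical rows and keep only the groups seen more than once.
duplicates = conn.execute(
    """
    SELECT column1, column2, COUNT(*) AS occurrences
    FROM table_name
    GROUP BY column1, column2
    HAVING COUNT(*) > 1
    ORDER BY column1, column2
    """
).fetchall()
conn.close()

print(duplicates)  # [('a', 1, 3), ('b', 2, 2)]
```

Each result row carries the duplicated values plus how many times they occur; the row `('c', 3)` appears once and is therefore not reported.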

Interview Process at SAVE Housing Finance

Based on 7 interviews in the last year, the process has 1 interview round: Technical Round.