Filter interviews by
Clear (1)
I was interviewed in Jan 2025.
I applied via Referral and was interviewed in Feb 2024. There was 1 interview round.
Types of transformations include filtering, sorting, aggregating, joining, and pivoting.
Filtering: Selecting a subset of rows based on certain criteria.
Sorting: Arranging rows in a specific order based on one or more columns.
Aggregating: Combining multiple rows into a single result, such as summing or averaging values.
Joining: Combining data from multiple sources based on a common key.
Pivoting: Restructuring data from
Spark is lazy execution to optimize performance by delaying computation until necessary.
Spark delays execution until an action is called to optimize performance.
This allows Spark to optimize the execution plan and minimize unnecessary computations.
Lazy evaluation helps in reducing unnecessary data shuffling and processing.
Example: Transformations like map, filter, and reduce are not executed until an action like collec
SCD stands for Slowly Changing Dimension. There are three types: Type 1, Type 2, and Type 3.
SCD is used in data warehousing to track changes in dimension data over time.
Type 1 SCD overwrites old data with new data, losing historical information.
Type 2 SCD creates new records for each change, preserving historical data.
Type 3 SCD keeps both old and new data in the same record, with separate columns for each version.
A linked service is a connection to an external data source or destination in Azure Data Factory.
Linked services define the connection information needed to connect to external data sources or destinations.
They can be used in pipelines to read from or write to the linked data source.
Examples of linked services include Azure Blob Storage, Azure SQL Database, and Salesforce.
Linked services can store connection strings, a...
A dataset is a collection of data that is organized in a structured format for easy access and analysis.
A dataset can consist of tables, files, or other types of data sources.
It is used for storing and managing data for analysis and reporting purposes.
Examples of datasets include customer information, sales data, and sensor readings.
Datasets can be structured, semi-structured, or unstructured depending on the type of d
Partition pruning is a query optimization technique that reduces the amount of data scanned by excluding irrelevant partitions.
Partition pruning is used in partitioned tables to skip scanning partitions that do not contain data relevant to the query.
It helps improve query performance by reducing the amount of data that needs to be processed.
For example, if a query filters data based on a specific partition key, partiti...
Top trending discussions
I applied via Referral and was interviewed in Feb 2023. There were 3 interview rounds.
I applied via Referral and was interviewed in Oct 2023. There was 1 interview round.
Prepare well on coding n programming
Topic will be give should talk for a min
posted on 24 Nov 2024
I applied via LinkedIn and was interviewed in Oct 2024. There was 1 interview round.
In Mu Sigma APT round is divided into 3 sections Quant, Logical and verbal
I applied via Referral and was interviewed before Dec 2023. There were 2 interview rounds.
OOP is a programming paradigm based on the concept of objects, which can contain data in the form of fields and code in the form of procedures.
OOP focuses on creating objects that interact with each other to solve problems.
Key concepts include classes, objects, inheritance, polymorphism, and encapsulation.
Classes are blueprints for creating objects, while objects are instances of classes.
Inheritance allows a class to i...
Coding was good and it was based on bubble sort
I applied via Recruitment Consulltant and was interviewed in Feb 2023. There were 3 interview rounds.
Interview was based on spark... initially basics of spark, optimisation followed by coding round like apply filter, read from multi delimiter, joins, de-duplication
based on 3 interviews
1 Interview rounds
based on 6 reviews
Rating in categories
Consultant
1.1k
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Engineer
718
salaries
| ₹0 L/yr - ₹0 L/yr |
Senior Consultant
588
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Scientist
462
salaries
| ₹0 L/yr - ₹0 L/yr |
Engineer
202
salaries
| ₹0 L/yr - ₹0 L/yr |
Mu Sigma
AbsolutData
LatentView Analytics
Tiger Analytics