Filter interviews by
I applied via Campus Placement and was interviewed before Sep 2021. There were 2 interview rounds.
Logical questions and Python analytics coding test
Entropy is a measure of impurity in a decision tree node used in Random Forest algorithm.
Entropy is used to determine the best split for a decision tree node.
It measures the randomness or impurity of the data in a node.
The formula for entropy is -p*log2(p) - q*log2(q) where p and q are the probabilities of the classes.
A node with low entropy is considered pure and a good split.
Random Forest uses multiple decision trees
Cross join in SQL returns the Cartesian product of two tables.
Use the CROSS JOIN keyword to perform a cross join.
It returns all possible combinations of rows from both tables.
Syntax: SELECT * FROM table1 CROSS JOIN table2;
It is also known as a Cartesian join or product join.
Be careful when using cross join as it can result in a large number of rows.
Top trending discussions
The top 10 and bottom 10 employees based on their rank need to be identified.
Sort the employees based on their rank in ascending order.
Select the top 10 employees from the sorted list.
Select the bottom 10 employees from the sorted list.
I applied via Referral and was interviewed before Aug 2022. There were 3 interview rounds.
Normal Group discussion
I applied via Referral and was interviewed in Sep 2024. There were 2 interview rounds.
Coalesce is a function used to return the first non-null value in a list of expressions.
Coalesce function is commonly used in SQL to handle null values.
It takes multiple arguments and returns the first non-null value.
Example: COALESCE(column1, column2, 'default') will return the value of column1 if not null, else column2, and if both are null, it will return 'default'.
Repartition is the process of redistributing data across partitions in a distributed system.
Repartitioning helps in balancing the workload and improving performance in distributed computing environments.
It involves moving data between partitions based on certain criteria such as key values or hash functions.
Repartitioning can be done in Spark using operations like repartition() or coalesce().
A non repeatable character is a character that appears only once in a given string.
Iterate through the string and count the frequency of each character
Identify the characters that have a frequency of 1
Return the first non repeatable character found
Sort list without using sort method
Create a custom sorting function using loops
Compare each element with every other element to determine the correct order
Swap elements based on the comparison results
I applied via LinkedIn and was interviewed in Sep 2023. There were 2 interview rounds.
Blend combines data from different data sources in a single view, while join combines data from the same data source.
Blend is used when data comes from different data sources, join is used when data comes from the same data source
Blend creates a relationship between data sources without actually merging the data, join merges the data based on a common field
Blend can be used to combine data from different databases like...
I applied via Referral and was interviewed before Aug 2022. There were 3 interview rounds.
Normal Group discussion
I applied via Job Portal and was interviewed before Mar 2023. There were 4 interview rounds.
I applied via Job Portal and was interviewed before May 2022. There were 2 interview rounds.
I have worked as a data analyst for 5 years in various industries.
Worked as a data analyst for 5 years in various industries
Analyzed and interpreted data to provide insights and recommendations
Created reports and dashboards to visualize data
Collaborated with cross-functional teams to identify business opportunities
Developed and implemented data-driven strategies to improve business performance
I applied via Company Website and was interviewed before Apr 2023. There was 1 interview round.
Tableau architecture is a visual analytics platform that helps users to understand data through interactive visualizations.
Tableau consists of three main components: Data Connection, Data Engine, and VizQL Server.
Data Connection allows users to connect to various data sources such as databases, spreadsheets, and cloud services.
Data Engine processes and optimizes data queries for faster performance.
VizQL Server translat...
The top 10 and bottom 10 employees based on their rank need to be identified.
Sort the employees based on their rank in ascending order.
Select the top 10 employees from the sorted list.
Select the bottom 10 employees from the sorted list.
Data Analyst
21
salaries
| ₹4.5 L/yr - ₹6.8 L/yr |
Data Scientist
11
salaries
| ₹6.1 L/yr - ₹10.5 L/yr |
Software Developer
8
salaries
| ₹5.6 L/yr - ₹10.3 L/yr |
Data Engineer
6
salaries
| ₹6 L/yr - ₹15 L/yr |
Jr. Data Scientist
5
salaries
| ₹4 L/yr - ₹7 L/yr |
Fractal Analytics
Mu Sigma
Tiger Analytics
LatentView Analytics