Filter interviews by
I applied via LinkedIn and was interviewed before Aug 2023. There were 2 interview rounds.
Union combines and removes duplicates, Union All combines without removing duplicates.
Union combines result sets and removes duplicates
Union All combines result sets without removing duplicates
Union is slower than Union All as it involves removing duplicates
Example: SELECT column1 FROM table1 UNION SELECT column1 FROM table2;
Example: SELECT column1 FROM table1 UNION ALL SELECT column1 FROM table2;
To show top 5 in pandas, use the nlargest() function.
Use the nlargest() function with the 'n' parameter set to 5 to get the top 5 values in a pandas DataFrame.
For example: df['column_name'].nlargest(5) will return the top 5 values in the specified column.
A scatter plot is a better representation for 3 numerical columns.
Use a scatter plot to show the relationship between the numerical columns.
Scatter plots are effective for visualizing correlations and patterns in data.
Each point on the plot represents a data point with values from all 3 columns.
Top trending discussions
I applied via Naukri.com and was interviewed in May 2024. There were 2 interview rounds.
Excel file was give, used sumifs, countifs , index match formulas
I applied via Campus Placement and was interviewed in Feb 2022. There was 1 interview round.
I applied via LinkedIn and was interviewed in Sep 2023. There were 2 interview rounds.
XGBoost is preferred over Random Forest due to its faster execution speed and better performance in complex datasets.
XGBoost is faster than Random Forest due to its optimized implementation of gradient boosting algorithm.
XGBoost generally performs better in complex datasets with high-dimensional features.
XGBoost allows for more fine-tuning of hyperparameters compared to Random Forest.
XGBoost has regularization techniqu...
I applied via Naukri.com and was interviewed before May 2023. There was 1 interview round.
I applied via Referral and was interviewed in Nov 2021. There was 1 interview round.
Coalesce is used to select the first non-null value from a set of columns. Repartition is used to shuffle data across nodes.
Coalesce reduces the number of partitions to the minimum required.
Repartition increases or decreases the number of partitions.
Coalesce is a narrow transformation while repartition is a wide transformation.
Coalesce is used to optimize data for queries while repartition is used to balance data acros...
Optimizing joins involves selecting appropriate join types, indexing tables, and minimizing data movement.
Choose the appropriate join type based on the size and structure of the tables being joined
Index the tables on the join columns to speed up the join process
Minimize data movement by selecting only the necessary columns and filtering rows before joining
Consider using denormalization or materialized views to precompu
RDD is a low-level distributed data structure while DataFrame is a high-level structured data abstraction.
RDD is immutable and unstructured while DataFrame is structured and has a schema
DataFrames are optimized for SQL queries and can be cached in memory
RDDs are more flexible and can be used for complex data processing tasks
DataFrames are easier to use and provide a more concise syntax for data manipulation
RDDs are the...
I applied via Naukri.com and was interviewed in May 2024. There were 2 interview rounds.
Excel file was give, used sumifs, countifs , index match formulas
Online test on sql join sub query
I applied via Naukri.com and was interviewed before Oct 2023. There were 2 interview rounds.
Technical question on based of resume
Questionnaire given to solve
I applied via LinkedIn and was interviewed before Oct 2022. There were 4 interview rounds.
Basic dsa question in python and data engineering questions ,sql
Basic dsa question in python and data engineering questions
Interview experience
based on 7 reviews
Rating in categories
Hyderabad / Secunderabad,
Chennai
7-12 Yrs
Not Disclosed
Analyst
85
salaries
| ₹4.5 L/yr - ₹10 L/yr |
Senior Analyst
45
salaries
| ₹5.8 L/yr - ₹12.2 L/yr |
Data Analyst
45
salaries
| ₹3.3 L/yr - ₹14 L/yr |
Senior Associate
31
salaries
| ₹3.5 L/yr - ₹6.5 L/yr |
Associate
28
salaries
| ₹3 L/yr - ₹5 L/yr |
Groupm Media
Madison World
Dentsu Aegis Network
IPG Mediabrands