LTIMindtree
Dual axis is a feature in data visualization where two different scales are used on the same chart to represent two different data sets.
Dual axis allows for comparing two different measures on the same chart
Each measure is assigned to its own axis, allowing for easy comparison
Commonly used in tools like Tableau for creating more complex visualizations
A scatter plot is a type of data visualization that displays the relationship between two numerical variables through dots on a graph.
Scatter plots are used to identify patterns and relationships between variables.
Each dot on the plot represents a single data point with the x-axis representing one variable and the y-axis representing the other variable.
The pattern of the dots can indicate the strength and direction of ...
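One common way to put a number on the strength and direction that a scatter plot shows is the Pearson correlation coefficient. The sketch below is a minimal plain-Python version; the function name pearson_r and the sample data (hours studied vs. exam score) are hypothetical and only for illustration.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (covariance, up to a constant factor)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)

# Hypothetical data: each (hours, score) pair would be one dot on the plot
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 70, 74]
r = pearson_r(hours, scores)
```

A value near +1 corresponds to dots rising along a line, a value near -1 to dots falling along a line, and a value near 0 to no linear pattern.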
Blending is the process of combining multiple data sources or datasets to create a unified view.
Blending involves merging data from different sources to gain insights or make decisions.
It helps in creating a comprehensive dataset by combining relevant information from various sources.
Blending can be done using tools like Tableau, Power BI, or Python libraries like Pandas.
For example, blending sales data from CRM with c...
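As a rough sketch of the idea, the plain-Python example below aggregates a secondary source to the level of a shared key and then attaches the result to every row of the primary source, similar in spirit to a left join. The function name blend, the field names, and the sample sales and support-ticket rows are assumptions made for illustration; they are not taken from the truncated example above or from any specific tool.

```python
from collections import defaultdict

def blend(primary_rows, secondary_rows, key, measure):
    """Attach a measure from the secondary source, pre-aggregated on `key`,
    to every row of the primary source (unmatched rows get None)."""
    totals = defaultdict(float)
    for row in secondary_rows:
        totals[row[key]] += row[measure]

    blended = []
    for row in primary_rows:
        merged = dict(row)
        # .get() does not trigger the defaultdict factory, so a missing key yields None
        merged[measure] = totals.get(row[key])
        blended.append(merged)
    return blended

# Hypothetical primary source: sales per customer
sales = [
    {"customer_id": 1, "revenue": 100.0},
    {"customer_id": 2, "revenue": 200.0},
    {"customer_id": 3, "revenue": 50.0},
]
# Hypothetical secondary source: support tickets, several rows per customer
tickets = [
    {"customer_id": 1, "tickets": 2},
    {"customer_id": 1, "tickets": 1},
    {"customer_id": 2, "tickets": 4},
]
result = blend(sales, tickets, key="customer_id", measure="tickets")
```

Aggregating the secondary source before combining keeps the primary row count unchanged, which avoids the row duplication a plain join can cause.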
I applied via Company Website and was interviewed before Feb 2021. There were 5 interview rounds.
It was a technical MCQ round with 60 questions based on Spark, Hive, Python, and ML.
Coding scenarios such as reading XML and JSON using PySpark, flattening nested XML and JSON, and basic data transformations.
I applied via Naukri.com and was interviewed before Jul 2021. There were 2 interview rounds.
I applied via Naukri.com and was interviewed in Jul 2021. There were 4 interview rounds.
I applied via Recruitment Consultant and was interviewed before Jul 2023. There was 1 interview round.
I appeared for an interview in Feb 2025.
SCD (Slowly Changing Dimensions) manages historical data changes in data warehouses.
SCD Type 1: Overwrite old data (e.g., updating a customer's address without keeping history).
SCD Type 2: Create new records for changes (e.g., adding a new row for a customer's address change).
SCD Type 3: Store current and previous values in the same record (e.g., adding a 'previous address' column).
Implementation can be done using ETL ...
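The difference between Type 1 and Type 3 can be sketched on a single in-memory record. This is a minimal plain-Python illustration, not an ETL implementation; the helper names apply_type1 and apply_type3 and the customer record are hypothetical.

```python
def apply_type1(row, new_address):
    """Type 1: overwrite the value in place; no history is kept."""
    updated = dict(row)
    updated["address"] = new_address
    return updated

def apply_type3(row, new_address):
    """Type 3: keep only the immediately previous value in a second column."""
    updated = dict(row)
    updated["previous_address"] = row["address"]
    updated["address"] = new_address
    return updated

# Hypothetical customer dimension row
customer = {"customer_id": 42, "address": "Pune", "previous_address": None}

type1 = apply_type1(customer, "Mumbai")  # old address is lost
type3 = apply_type3(customer, "Mumbai")  # old address moves to previous_address
```

Type 3 remembers only one prior value, so a second change would overwrite "Pune"; Type 2 is the variant that keeps every version as its own row.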
Handling multiple inputs in data sources requires effective integration, transformation, and validation strategies.
Use ETL (Extract, Transform, Load) processes to consolidate data from various sources.
Implement data validation checks to ensure data quality from each input source.
Utilize data orchestration tools like Apache Airflow to manage workflows and dependencies.
Consider using a message queue (e.g., Kafka) for rea...
I applied via Naukri.com and was interviewed in Oct 2022. There were 2 interview rounds.
Questions on big data, Hadoop, Spark, Scala, Git, project and Agile.
Hadoop architecture and HDFS commands for copying and listing files in HDFS
Spark architecture, plus a question on transformations and actions
What happens when we submit a Spark program
Spark DataFrame coding question
Scala basic program on List
Git and GitHub
Project-related question
Agile-related
I applied via Referral and was interviewed in Mar 2022. There was 1 interview round.
I applied via LinkedIn and was interviewed in Feb 2024. There were 3 interview rounds.
Working with nested JSON using PySpark involves using the StructType and StructField classes to define the schema and then using the select function to access nested fields.
Define the schema using StructType and StructField classes
Use the select function to access nested fields
Use dot notation to access nested fields, for example df.select('nested_field.sub_field')
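To show what dot-notation paths resolve to, the sketch below flattens a nested JSON record into one level keyed by dot-separated paths. It deliberately uses plain Python with the standard json module rather than PySpark, so it illustrates the path idea only and is not the PySpark API; the function name flatten and the sample record are hypothetical.

```python
import json

def flatten(record, parent=""):
    """Flatten nested dicts into one level keyed by dot-separated paths."""
    flat = {}
    for name, value in record.items():
        path = f"{parent}.{name}" if parent else name
        if isinstance(value, dict):
            # Recurse into nested objects, extending the dotted path
            flat.update(flatten(value, path))
        else:
            # Scalars and lists are kept as-is under their full path
            flat[path] = value
    return flat

# Hypothetical nested record
raw = '{"id": 7, "nested_field": {"sub_field": "x", "inner": {"n": 1}}}'
flat = flatten(json.loads(raw))
```

Each key in the result, such as 'nested_field.sub_field', is the same kind of path that would be passed to select on a DataFrame with a matching struct schema. Arrays are left untouched here; in PySpark they usually need a separate step such as explode.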
Implementing SCD2 involves tracking historical changes in data over time.
Identify the business key that uniquely identifies each record
Add effective start and end dates to track when the record was valid
Insert new records with updated data and end date of '9999-12-31'
Update end date of previous record when a change occurs
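The four steps above can be sketched on an in-memory list of rows. This is a minimal plain-Python illustration under stated assumptions: the function name apply_scd2, the column names start_date and end_date, and the convention that a closed row's end date equals the new row's start date (a half-open validity interval) are all choices made here, not something the answer above prescribes.

```python
from datetime import date

OPEN_END = date(9999, 12, 31)  # sentinel end date marking the current version

def apply_scd2(dimension, incoming, key, tracked, as_of):
    """Return a new list of dimension rows with SCD2 changes applied.

    dimension: rows holding `key`, the tracked columns, start_date, end_date
    incoming:  latest source rows holding `key` and the tracked columns
    """
    result = [dict(r) for r in dimension]  # copy so the input is not mutated
    current = {r[key]: r for r in result if r["end_date"] == OPEN_END}

    for rec in incoming:
        existing = current.get(rec[key])
        if existing is not None and all(existing[c] == rec[c] for c in tracked):
            continue  # nothing changed, keep the current version open
        if existing is not None:
            existing["end_date"] = as_of  # close the previous version
        new_row = {key: rec[key]}
        new_row.update({c: rec[c] for c in tracked})
        new_row["start_date"] = as_of
        new_row["end_date"] = OPEN_END
        result.append(new_row)
        current[rec[key]] = new_row
    return result

# Hypothetical data: customer 1 moves, customer 2 is new
dim = [{"customer_id": 1, "address": "Pune",
        "start_date": date(2020, 1, 1), "end_date": OPEN_END}]
incoming = [{"customer_id": 1, "address": "Mumbai"},
            {"customer_id": 2, "address": "Delhi"}]
out = apply_scd2(dim, incoming, "customer_id", ["address"], date(2024, 2, 1))
```

In a warehouse the same logic is typically expressed as an update of the open row followed by an insert, or as a single merge statement, but the row-level effect is the same as in this sketch.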
Use a SQL query to select data from table 2 where data exists in table 1
Use a JOIN statement to link the two tables based on a common column
Specify the columns you want to select from table 2
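Use a WHERE clause (for example, WHERE EXISTS with a correlated subquery) to check for existence of data in table 1; unlike a plain JOIN, this does not duplicate table 2 rows when table 1 has repeated matches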
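A small runnable comparison of the two approaches, using Python's built-in sqlite3 module with an in-memory database. The table names table1 and table2 follow the wording above; the columns and sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table1 (id INTEGER);
    CREATE TABLE table2 (id INTEGER, name TEXT);
    -- id 1 appears twice in table1 to show the effect of duplicates
    INSERT INTO table1 VALUES (1), (1), (3);
    INSERT INTO table2 VALUES (1, 'a'), (2, 'b'), (3, 'c');
""")

# EXISTS: each table2 row is returned at most once
exists_rows = conn.execute("""
    SELECT t2.id, t2.name
    FROM table2 AS t2
    WHERE EXISTS (SELECT 1 FROM table1 AS t1 WHERE t1.id = t2.id)
    ORDER BY t2.id
""").fetchall()

# JOIN: a table2 row is repeated once per matching table1 row
join_rows = conn.execute("""
    SELECT t2.id, t2.name
    FROM table2 AS t2
    JOIN table1 AS t1 ON t1.id = t2.id
    ORDER BY t2.id
""").fetchall()

conn.close()
```

If the join form is preferred, adding DISTINCT or joining against a de-duplicated subquery of table1 gives the same rows as the EXISTS form.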
The number of records retrieved after performing joins depends on the type of join - inner, left, right, or outer.
Inner join retrieves only the matching records from both tables
Left join retrieves all records from the left table and matching records from the right table
Right join retrieves all records from the right table and matching records from the left table
Outer join retrieves all records from both tables, filling
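The row counts can be worked out from how often each join key occurs on either side. The sketch below is a plain-Python calculation, assuming an equality join on a single key with no NULL keys (in SQL, NULL keys never match); the function name join_row_counts and the sample keys are hypothetical.

```python
from collections import Counter

def join_row_counts(left_keys, right_keys):
    """Number of rows each join type returns for an equality join on one key."""
    left, right = Counter(left_keys), Counter(right_keys)
    # Each matching key contributes (left occurrences) x (right occurrences) rows
    inner = sum(n * right[k] for k, n in left.items() if k in right)
    # Unmatched rows are kept once each, padded with NULLs on the other side
    left_only = sum(n for k, n in left.items() if k not in right)
    right_only = sum(n for k, n in right.items() if k not in left)
    return {
        "inner": inner,
        "left": inner + left_only,
        "right": inner + right_only,
        "full_outer": inner + left_only + right_only,
    }

# Hypothetical keys: 1 is duplicated on both sides, 2 and 3 are left-only, 4 is right-only
counts = join_row_counts([1, 1, 2, 3], [1, 1, 4])
```

Here key 1 alone produces 2 x 2 = 4 rows, which is why a join can return more rows than either input table when keys are duplicated.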