Sigmoid
I applied via Naukri.com and was interviewed in Sep 2024. There was 1 interview round.
SCD stands for Slowly Changing Dimension in Data Warehousing.
SCD is a technique used in data warehousing to track changes to dimension data over time.
There are different types of SCDs - Type 1, Type 2, and Type 3.
Type 1 SCD overwrites old data with new data, Type 2 creates new records for changes, and Type 3 maintains both old and new values in separate columns.
Example: In a customer dimension table, if a customer chan...
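As a rough illustration of the difference between Type 1 and Type 2, here is a minimal sketch in plain Python. The column names (customer_id, city, start_date, end_date, is_current) and the sample values are assumptions made for this example, not part of the original answer; Type 3 is not shown.

```python
from datetime import date


def apply_scd_type1(rows, customer_id, new_city):
    """Type 1: overwrite the attribute in place; no history is kept."""
    for row in rows:
        if row["customer_id"] == customer_id:
            row["city"] = new_city
    return rows


def apply_scd_type2(rows, customer_id, new_city, change_date):
    """Type 2: close the current row and append a new current row."""
    for row in rows:
        if row["customer_id"] == customer_id and row["is_current"]:
            row["end_date"] = change_date
            row["is_current"] = False
    rows.append({
        "customer_id": customer_id,
        "city": new_city,
        "start_date": change_date,
        "end_date": None,
        "is_current": True,
    })
    return rows


# Hypothetical customer dimension with one current row.
dim_customer = [
    {"customer_id": 1, "city": "Pune", "start_date": date(2020, 1, 1),
     "end_date": None, "is_current": True},
]
history = apply_scd_type2(dim_customer, 1, "Bengaluru", date(2024, 9, 1))
```

After the Type 2 call the table holds two rows for the customer: the closed historical row and the new current row.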
inferSchema in PySpark is used to automatically infer the schema of a file when reading it.
inferSchema is a reader option that can be set to True so that column types are inferred from the data; this costs an extra pass over the file, and without it CSV columns are read as strings
It is useful when the schema of the file is not known beforehand
Example: df = spark.read.csv('file.csv', header=True, inferSchema=True)
RANK and DENSE_RANK both give tied rows the same rank; RANK leaves gaps after ties, while DENSE_RANK does not.
The RANK function assigns the same rank to rows with equal values and then skips the ranks those ties would have occupied.
The DENSE_RANK function also assigns the same rank to ties, but the next distinct value gets the next consecutive rank.
For example, scores of 100, 90, 90, 80 rank as 1, 2, 2, 4 with RANK and 1, 2, 2, 3 with DENSE_RANK.
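The gap behaviour can be checked with a small query. This sketch uses Python's built-in sqlite3 module (window functions need SQLite 3.25 or newer); the scores table and its values are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("a", 100), ("b", 90), ("c", 90), ("d", 80)],
)

# Both functions order by score; only RANK skips a value after the tie.
rows = conn.execute(
    """
    SELECT name, score,
           RANK()       OVER (ORDER BY score DESC) AS rnk,
           DENSE_RANK() OVER (ORDER BY score DESC) AS dense_rnk
    FROM scores
    ORDER BY score DESC, name
    """
).fetchall()

for row in rows:
    print(row)
```

Rows b and c tie on 90, so row d gets rank 4 from RANK and rank 3 from DENSE_RANK.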
Optimizing techniques in Spark involve partitioning, caching, and tuning resources for efficient data processing.
Use partitioning to distribute data evenly across nodes for parallel processing
Cache frequently accessed data in memory to avoid recomputation
Tune resources such as memory allocation and parallelism settings for optimal performance
Repartition can increase or decrease the number of partitions in a DataFrame, while coalesce can only decrease it.
Repartition involves shuffling data across the network, which can be expensive in terms of performance and resources.
Coalesce is a more efficient operation as it minimizes data movement by only merging existing partitions.
Repartition is typically used when there is a need for more paralle...
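The trade-off can be pictured with a toy model in plain Python. This is not Spark's API or its actual algorithm: partitions are modelled as lists, coalesce as merging whole partitions, and repartition as a full round-robin redistribution of rows. The function names and data are assumptions for illustration only.

```python
def coalesce(partitions, n):
    """Toy coalesce: merge whole existing partitions into at most n groups."""
    n = min(n, len(partitions))  # cannot increase the partition count
    merged = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Neighbouring partitions are glued together; rows are not split up.
        merged[i * n // len(partitions)].extend(part)
    return merged


def repartition(partitions, n):
    """Toy repartition: every row is redistributed across n new partitions."""
    rows = [row for part in partitions for row in part]
    return [rows[i::n] for i in range(n)]


# Four skewed partitions.
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9, 10]]

coalesced = coalesce(parts, 2)      # cheap, but the skew remains
shuffled = repartition(parts, 2)    # moves every row, but balances sizes
widened = repartition(parts, 8)     # only repartition can add partitions
```

In this model coalesce keeps the skew of the input (sizes 7 and 3), while repartition pays for moving every row and ends up with balanced partitions (sizes 5 and 5).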
Normalization in databases is the process of organizing data in a database to reduce redundancy and improve data integrity.
Normalization is used to eliminate redundant data and ensure data integrity.
It involves breaking down a table into smaller tables and defining relationships between them.
There are different normal forms such as 1NF, 2NF, 3NF, and BCNF.
Normalization helps in reducing data redundancy and improving qu...
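A small sketch of the idea, using Python's built-in sqlite3 module: a flat orders table that repeats the customer's city on every row is split into a customers table and an orders table linked by a key. Table names, column names, and data are invented for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Denormalized: the customer's city is repeated on every order row.
    CREATE TABLE orders_flat (order_id INTEGER, customer TEXT, city TEXT, amount REAL);
    INSERT INTO orders_flat VALUES
        (1, 'Asha', 'Pune', 250.0),
        (2, 'Asha', 'Pune', 120.0),
        (3, 'Ravi', 'Delhi', 300.0);

    -- Normalized: each customer fact is stored once and referenced by key.
    CREATE TABLE customers (
        customer_id INTEGER PRIMARY KEY,
        name TEXT UNIQUE,
        city TEXT
    );
    CREATE TABLE orders (
        order_id INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customers(customer_id),
        amount REAL
    );
    INSERT INTO customers (name, city)
        SELECT DISTINCT customer, city FROM orders_flat;
    INSERT INTO orders (order_id, customer_id, amount)
        SELECT f.order_id, c.customer_id, f.amount
        FROM orders_flat AS f JOIN customers AS c ON c.name = f.customer;
    """
)

customer_count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]
order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]

# The original view can still be recovered with a join.
totals = conn.execute(
    """
    SELECT c.name, SUM(o.amount)
    FROM orders AS o JOIN customers AS c USING (customer_id)
    GROUP BY c.name ORDER BY c.name
    """
).fetchall()
```

With this layout a change of city is a single-row update in customers rather than an update to every matching order row.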
A transformation lazily defines a new dataset from an existing one, while an action triggers execution and returns or writes a result.
A transformation returns a new RDD or DataFrame and only records the step in the execution plan; nothing is computed yet
Action performs a computation on the data and triggers the execution
Examples of transformations in Spark include map, filter, and groupBy (reduce, by contrast, is an action)
Examples of action include count, collect, and saveAsTextFile in Spark
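Lazy evaluation can be demonstrated without a cluster. The sketch below is only an analogy in plain Python, not Spark code: the built-in map and filter return lazy iterators (playing the role of transformations), and list forces evaluation (playing the role of an action). The helper name double and the log list are assumptions for the example.

```python
log = []


def double(x):
    log.append(x)  # record when the function actually runs
    return x * 2


numbers = [1, 2, 3, 4]

# "Transformations": building the pipeline runs nothing yet.
pipeline = filter(lambda v: v > 2, map(double, numbers))
calls_before_action = len(log)

# "Action": consuming the pipeline forces evaluation.
result = list(pipeline)
calls_after_action = len(log)
```

No call to double happens until list consumes the pipeline, which mirrors how Spark defers work until an action such as count or collect is invoked.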
posted on 22 Oct 2024
I was approached by the company and was interviewed in Sep 2024. There were 4 interview rounds.
Handle stakeholders' interests by understanding their needs, communicating effectively, and delivering results.
Understand the stakeholders' needs and priorities
Communicate regularly and effectively with stakeholders
Involve stakeholders in decision-making processes
Deliver results that align with stakeholders' interests
Manage expectations and address concerns promptly
SQL Rank functions.
Python easy to medium DSA programs.
Pyspark Concepts and questions.
Analytical (window) functions in SQL compute a value for each row over a set of related rows defined by an OVER clause.
Aggregate functions like SUM, AVG, COUNT, MIN, MAX, when used with an OVER clause
Window functions like ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD
Ranking functions like NTILE, PERCENT_RANK, CUME_DIST
Statistical functions like STDDEV, VARIANCE
String functions like CONCAT, SUBSTRING, TRIM are scalar functions that work on one row at a time, so they are not analytical functions
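As a small illustration of an aggregate and an offset function used over a window, the following sketch runs a query through Python's built-in sqlite3 module (SQLite 3.25 or newer). The sales table and its values are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (day INTEGER, amount INTEGER)")
conn.executemany("INSERT INTO sales VALUES (?, ?)", [(1, 10), (2, 30), (3, 20)])

# Each input row is kept; the window adds a running total and the prior value.
rows = conn.execute(
    """
    SELECT day, amount,
           SUM(amount) OVER (ORDER BY day) AS running_total,
           LAG(amount) OVER (ORDER BY day) AS previous_amount
    FROM sales
    ORDER BY day
    """
).fetchall()
```

Unlike a GROUP BY aggregate, the query returns one output row per input row, and LAG yields NULL (None in Python) for the first row because it has no predecessor.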
I applied via Referral and was interviewed before Apr 2023. There were 5 interview rounds.
General coding test and MCQs
I applied via Referral and was interviewed in Aug 2024. There were 2 interview rounds.
Utilize customer transaction data and behavior analysis to identify loyal customers for DMart and SmartBazar.
Use customer transaction history to identify frequent shoppers
Analyze customer behavior patterns such as repeat purchases and average spend
Implement loyalty programs to incentivize repeat purchases
Utilize customer feedback and reviews to gauge loyalty
Segment customers based on their shopping habits and preferences
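The first two points can be sketched in a few lines of plain Python. The transaction format (customer id, amount) and the min_visits threshold of 3 are assumptions chosen for illustration; a real analysis would pick thresholds from the data.

```python
from collections import defaultdict


def loyalty_summary(transactions, min_visits=3):
    """Return per-customer visit count and average spend, flagging frequent shoppers."""
    totals = defaultdict(lambda: {"visits": 0, "spend": 0.0})
    for customer_id, amount in transactions:
        totals[customer_id]["visits"] += 1
        totals[customer_id]["spend"] += amount
    return {
        cid: {
            "visits": t["visits"],
            "avg_spend": t["spend"] / t["visits"],
            "is_loyal": t["visits"] >= min_visits,  # hypothetical cut-off
        }
        for cid, t in totals.items()
    }


# Made-up transactions: (customer id, amount spent).
transactions = [("c1", 200.0), ("c1", 100.0), ("c1", 300.0), ("c2", 1500.0)]
summary = loyalty_summary(transactions)
```

Here c1 is flagged as a frequent shopper despite a lower spend per visit, while c2 is a single large purchase, which is the same distinction the next answer weighs.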
It depends on the business model and goals of the company.
Small transactions everyday can lead to consistent revenue streams and customer engagement.
Big transactions in a month can indicate high purchasing power and potential for larger profits.
Consider customer lifetime value, retention rates, and overall business strategy when determining value.
I would conduct a thorough analysis of the sales data to identify trends and potential causes of the decline.
Review historical sales data to identify patterns or seasonality
Conduct customer surveys or interviews to gather feedback
Analyze competitor data to understand market dynamics
Implement predictive modeling to forecast future sales
Collaborate with marketing team to develop targeted strategies
I would showcase the potential benefits and results of my innovative approach to convince the team.
Highlight the advantages of the innovative approach such as improved efficiency, accuracy, or cost-effectiveness.
Provide real-world examples or case studies where similar innovative approaches have led to successful outcomes.
Encourage open discussion and collaboration within the team to explore the potential of combining ...
1. A store runs promotional offers. How would you analyse whether the offers are working in its favour?
2. What data would you need to predict the sales of chocolate in a store?
3. Why is the data assumed to be normally distributed in linear regression?
4. What is the difference between linear and logistic regression?
5. A person senior to you is working on the same project. That person has a very bad reputation for misbehaving and being rude to people, and is behaving the same way with you. What will you do?
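For question 4, one common way to frame the difference is that both models compute the same linear score, but logistic regression passes it through a sigmoid so the output can be read as a probability. The sketch below assumes a single feature and hand-picked weights; it shows prediction only, not model fitting.

```python
import math


def linear_predict(x, weight, bias):
    """Linear regression: the output is an unbounded real number."""
    return weight * x + bias


def logistic_predict(x, weight, bias):
    """Logistic regression: the same linear score passed through a sigmoid."""
    return 1.0 / (1.0 + math.exp(-(weight * x + bias)))


# Hypothetical parameters, chosen only to show the two output ranges.
score = linear_predict(3.0, weight=2.0, bias=-1.0)
probability = logistic_predict(3.0, weight=2.0, bias=-1.0)
```

The linear output suits a continuous target such as sales volume, while the sigmoid output stays strictly between 0 and 1 and suits a binary target such as bought or did not buy.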
There is an online test with aptitude, reasoning, and other questions to assess your problem-solving skills.
They give a topic on which you have to either speak or write.
I applied via Naukri.com and was interviewed in Aug 2024. There was 1 interview round.
A scalar function returns a single value (a table-valued function returns a table), while a stored procedure can perform multiple operations and return multiple values or result sets.
Functions can be called inside SQL statements such as SELECT, while stored procedures are invoked with an EXECUTE statement.
Functions cannot have output parameters, while stored procedures can have output parameters.
Functions cannot modify server ...
Custom exceptions can be implemented in MVC by creating a new class that inherits from Exception class.
Create a new class that inherits from Exception class
Override the constructor to pass a custom message to the base Exception class
Throw the custom exception in the MVC controller or service layer
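The three steps can be sketched as follows. The original answer does not name a framework or language, so this is a language-neutral illustration written in Python; OrderNotFoundError, get_order, and order_controller are hypothetical names, and the dictionary response stands in for whatever the framework returns.

```python
class OrderNotFoundError(Exception):
    """Hypothetical custom exception carrying a domain-specific message."""

    def __init__(self, order_id):
        self.order_id = order_id
        # Pass the custom message to the base Exception class.
        super().__init__(f"Order {order_id} was not found")


def get_order(order_id, orders):
    """Service-layer lookup that raises the custom exception on a miss."""
    if order_id not in orders:
        raise OrderNotFoundError(order_id)
    return orders[order_id]


def order_controller(order_id, orders):
    """Controller-style handler that maps the exception to a response."""
    try:
        return {"status": 200, "body": get_order(order_id, orders)}
    except OrderNotFoundError as exc:
        return {"status": 404, "body": str(exc)}
```

Raising in the service layer and catching in the controller keeps the domain error separate from how it is presented to the caller.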
The first round was on HackerEarth.
I applied via Naukri.com and was interviewed in Oct 2024. There was 1 interview round.
A 1-hour exam; it is easy to crack.
Software Development Engineer II: 83 salaries, ₹14 L/yr - ₹24 L/yr
Data Scientist: 48 salaries, ₹10.5 L/yr - ₹22.5 L/yr
Data Engineer: 45 salaries, ₹8.5 L/yr - ₹25 L/yr
Senior Data Scientist: 43 salaries, ₹17 L/yr - ₹28 L/yr
Software Development Engineer: 36 salaries, ₹13.2 L/yr - ₹20.4 L/yr