Sigmoid
I applied via Naukri.com and was interviewed in Sep 2024. There was 1 interview round.
SCD stands for Slowly Changing Dimension in Data Warehousing.
SCD is a technique used in data warehousing to track changes to dimension data over time.
There are different types of SCDs - Type 1, Type 2, and Type 3.
Type 1 SCD overwrites old data with new data, Type 2 creates new records for changes, and Type 3 maintains both old and new values in separate columns.
Example: In a customer dimension table, if a customer chan...
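The three SCD types described above can be sketched in plain Python. This is a minimal illustration, not a warehouse implementation: the `city` attribute, the column names (`start_date`, `end_date`, `is_current`, `previous_city`) and the sample values are all assumptions made up for the example.

```python
from datetime import date

def apply_type1(row, new_city):
    """Type 1: overwrite the value in place; no history is kept."""
    updated = dict(row)
    updated["city"] = new_city
    return updated

def apply_type2(rows, customer_id, new_city, change_date):
    """Type 2: close the current row and append a new current row."""
    result = []
    for row in rows:
        row = dict(row)
        if row["customer_id"] == customer_id and row["is_current"]:
            row["end_date"] = change_date
            row["is_current"] = False
        result.append(row)
    result.append({
        "customer_id": customer_id,
        "city": new_city,
        "start_date": change_date,
        "end_date": None,
        "is_current": True,
    })
    return result

def apply_type3(row, new_city):
    """Type 3: keep the previous value in a separate column."""
    updated = dict(row)
    updated["previous_city"] = row["city"]
    updated["city"] = new_city
    return updated

# Hypothetical sample data
customer = {"customer_id": 1, "city": "Pune"}
history = [{
    "customer_id": 1, "city": "Pune",
    "start_date": date(2020, 1, 1), "end_date": None, "is_current": True,
}]

type1 = apply_type1(customer, "Mumbai")
type2 = apply_type2(history, 1, "Mumbai", date(2024, 9, 1))
type3 = apply_type3(customer, "Mumbai")
```

Type 2 is the only variant here that grows the table: each change adds a row, which is what makes full history queries possible.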
inferSchema is a PySpark reader option that makes Spark infer column types from the data when reading a file.
When inferSchema is set to True (for example on the CSV reader), Spark scans the data and infers each column's type instead of reading every column as a string
It is useful when the schema of the file is not known beforehand, at the cost of an extra pass over the data
Example: df = spark.read.csv('file.csv', header=True, inferSchema=True)
RANK and DENSE_RANK both give tied rows the same rank; RANK leaves gaps after ties, while DENSE_RANK does not.
RANK assigns the same rank to rows with equal values and then skips the following ranks (for example 1, 1, 3).
DENSE_RANK also assigns the same rank to ties but continues with the next consecutive rank (for example 1, 1, 2).
Neither function guarantees unique ranks; use ROW_NUMBER when every row needs a distinct number.
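The difference between RANK and DENSE_RANK is easiest to see on tied values. The sketch below uses Python's built-in sqlite3 module (it assumes the bundled SQLite is 3.25 or newer, which is when window functions were added); the `scores` table and its rows are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("a", 90), ("b", 90), ("c", 80), ("d", 70)],  # a and b are tied
)

rows = conn.execute(
    """
    SELECT name,
           score,
           RANK()       OVER (ORDER BY score DESC) AS rnk,
           DENSE_RANK() OVER (ORDER BY score DESC) AS dense_rnk
    FROM scores
    ORDER BY score DESC, name
    """
).fetchall()
conn.close()

for row in rows:
    print(row)
# ('a', 90, 1, 1)
# ('b', 90, 1, 1)
# ('c', 80, 3, 2)   <- RANK skips 2, DENSE_RANK does not
# ('d', 70, 4, 3)
```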
Optimization techniques in Spark include partitioning, caching, and tuning resources for efficient data processing.
Use partitioning to distribute data evenly across nodes for parallel processing
Cache frequently accessed data in memory to avoid recomputation
Tune resources such as memory allocation and parallelism settings for optimal performance
Repartition can increase or decrease the number of partitions in a DataFrame, while coalesce can only decrease it.
Repartition performs a full shuffle of data across the network, which can be expensive in terms of performance and resources.
Coalesce is usually cheaper because it avoids a full shuffle and only merges existing partitions, although the resulting partitions can be unevenly sized.
Repartition is typically used when there is a need for more paralle...
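The contrast can be illustrated with a toy model in plain Python. This is an analogy only, not Spark code: partitions are modelled as lists, and the `coalesce` and `repartition` functions below are hypothetical stand-ins that mimic the behaviour described above (merging whole partitions versus redistributing every record).

```python
def coalesce(partitions, n):
    """Merge whole partitions into n groups; records are never redistributed."""
    if n >= len(partitions):
        # Like Spark's coalesce, this toy version cannot increase the count.
        return [list(p) for p in partitions]
    merged = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        merged[i * n // len(partitions)].extend(part)
    return merged

def repartition(partitions, n):
    """Redistribute every record round-robin, i.e. a full shuffle."""
    out = [[] for _ in range(n)]
    i = 0
    for part in partitions:
        for record in part:
            out[i % n].append(record)
            i += 1
    return out

# Four skewed partitions (hypothetical data)
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9, 10]]

coalesced = coalesce(parts, 2)      # sizes 7 and 3: cheap, but still skewed
reshuffled = repartition(parts, 2)  # sizes 5 and 5: balanced, but every record moved
```

In this model, coalescing keeps the skew of the input, while repartitioning evens it out at the price of touching every record.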
Normalization in databases is the process of organizing data in a database to reduce redundancy and improve data integrity.
Normalization is used to eliminate redundant data and ensure data integrity.
It involves breaking down a table into smaller tables and defining relationships between them.
There are different normal forms such as 1NF, 2NF, 3NF, and BCNF.
Normalization helps in reducing data redundancy and improving qu...
A transformation lazily defines a new dataset from an existing one, while an action triggers the computation and returns or writes a result.
A transformation returns a new RDD or DataFrame and is only recorded in the execution plan; nothing is computed yet
An action triggers execution of the recorded transformations and returns a result to the driver or writes it to storage
Examples of transformations in Spark include map, filter, and flatMap
Examples of actions in Spark include count, collect, reduce, and saveAsTextFile
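Lazy evaluation can be demonstrated without a Spark cluster. The sketch below is a plain-Python analogy, not Spark itself: Python's built-in `map` and `filter` return lazy iterators (playing the role of transformations), and `list` forces evaluation (playing the role of an action). The `log` list and `double` helper exist only to show when work actually happens.

```python
log = []

def double(x):
    log.append(x)  # record each time the function really runs
    return x * 2

data = [1, 2, 3, 4]

# "Transformations": building the pipeline runs nothing yet.
pipeline = filter(lambda v: v > 2, map(double, data))
calls_before_action = len(log)  # still 0

# "Action": consuming the iterator forces the whole pipeline to execute.
result = list(pipeline)
calls_after_action = len(log)   # now 4

print(calls_before_action, result, calls_after_action)
# 0 [4, 6, 8] 4
```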
posted on 22 Oct 2024
I was approached by the company and was interviewed in Sep 2024. There were 4 interview rounds.
Handle stakeholders' interests by understanding their needs, communicating effectively, and delivering results.
Understand the stakeholders' needs and priorities
Communicate regularly and effectively with stakeholders
Involve stakeholders in decision-making processes
Deliver results that align with stakeholders' interests
Manage expectations and address concerns promptly
SQL rank functions.
Easy to medium Python DSA programs.
PySpark concepts and questions.
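Analytical functions in SQL compute a value for each row from a set of related rows (a window) defined by an OVER clause.
Aggregate functions like SUM, AVG, COUNT, MIN, MAX, which act as analytical functions when used with OVER
Window functions like ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD
Ranking functions like NTILE, PERCENT_RANK, CUME_DIST
Statistical functions like STDDEV, VARIANCE, which can also be used with OVER in databases that provide them
String functions like CONCAT, SUBSTRING, TRIM are scalar functions that work on one row at a time, so they are not analytical functions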
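As a small illustration of an aggregate and a window function both computed over a window, the sketch below runs a running total and a LAG lookup through Python's built-in sqlite3 module (assuming the bundled SQLite is 3.25 or newer). The `sales` table and its values are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (day INTEGER, amount INTEGER)")
conn.executemany("INSERT INTO sales VALUES (?, ?)", [(1, 10), (2, 20), (3, 15)])

rows = conn.execute(
    """
    SELECT day,
           amount,
           SUM(amount) OVER (ORDER BY day) AS running_total,   -- aggregate used as a window function
           LAG(amount) OVER (ORDER BY day) AS previous_amount  -- value from the preceding row
    FROM sales
    ORDER BY day
    """
).fetchall()
conn.close()

for row in rows:
    print(row)
# (1, 10, 10, None)
# (2, 20, 30, 10)
# (3, 15, 45, 20)
```

Unlike GROUP BY, the OVER clause keeps every input row in the output and attaches the computed value to it.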
I applied via Referral and was interviewed before Apr 2023. There were 5 interview rounds.
General coding test and MCQs.
I applied via Campus Placement and was interviewed in Oct 2024. There were 2 interview rounds.
A loop in a linked list occurs when a node's next pointer refers back to an earlier node, so a traversal never reaches the end of the list.
Check for loops using Floyd's cycle detection algorithm
Use two pointers, one moving twice as fast as the other, to detect a loop
If the fast pointer catches up to the slow pointer, there is a loop; if the fast pointer reaches the end of the list, there is none
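Floyd's two-pointer check described above can be sketched as follows. The `Node` class and the `build` helper are minimal stand-ins written for this example.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None

def has_cycle(head):
    """Return True if the list starting at head contains a loop."""
    slow = fast = head
    while fast is not None and fast.next is not None:
        slow = slow.next        # moves one step
        fast = fast.next.next   # moves two steps
        if slow is fast:        # the pointers can only meet inside a loop
            return True
    return False                # fast reached the end, so there is no loop

def build(values):
    """Helper: link the values into a list and return all nodes."""
    nodes = [Node(v) for v in values]
    for a, b in zip(nodes, nodes[1:]):
        a.next = b
    return nodes

straight = build([1, 2, 3, 4])
looped = build([1, 2, 3, 4])
looped[-1].next = looped[1]  # tail points back to the second node

print(has_cycle(straight[0]), has_cycle(looped[0]))
# False True
```

The check uses constant extra memory, which is the usual reason it is preferred over storing visited nodes in a set.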
I applied via Naukri.com and was interviewed in Aug 2024. There was 1 interview round.
A function returns a single value (or a table), while a stored procedure can perform multiple operations and return multiple values or result sets.
Functions return a single value or a table, while stored procedures can return multiple values.
Functions can be called within SQL statements such as SELECT, while stored procedures are invoked with an EXECUTE statement (or CALL in some databases).
In SQL Server, functions cannot have output parameters, while stored procedures can.
Functions cannot modify server ...
Custom exceptions can be implemented in MVC by creating a new class that inherits from Exception class.
Create a new class that inherits from Exception class
Define a constructor that passes a custom message to the base Exception class
Throw the custom exception in the MVC controller or service layer
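The same three steps can be sketched in Python rather than in the MVC framework the answer refers to, so treat this as an analogy of the pattern and not as framework code. `OrderNotFoundError`, `get_order`, and `order_controller` are hypothetical names invented for the example.

```python
class OrderNotFoundError(Exception):
    """Custom exception: inherits from the base Exception class."""

    def __init__(self, order_id):
        self.order_id = order_id
        # Pass a custom message up to the base class
        super().__init__(f"Order {order_id} was not found")

def get_order(order_id, orders):
    """Service layer: raises the custom exception when the lookup fails."""
    if order_id not in orders:
        raise OrderNotFoundError(order_id)
    return orders[order_id]

def order_controller(order_id, orders):
    """Controller layer: translates the exception into a status code."""
    try:
        return 200, get_order(order_id, orders)
    except OrderNotFoundError as exc:
        return 404, str(exc)

print(order_controller(1, {1: "book"}))
print(order_controller(2, {1: "book"}))
# (200, 'book')
# (404, 'Order 2 was not found')
```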
I applied via Naukri.com and was interviewed in Mar 2024. There were 2 interview rounds.
I applied via Naukri.com and was interviewed in Oct 2024. There was 1 interview round.
A 1-hour exam; it is easy to crack.
posted on 13 Mar 2024
I applied via campus placement at Zagdu Singh Charitable Trust's Thakur Institute of Management Studies & Research, Mumbai and was interviewed in Feb 2024. There were 3 interview rounds.
The aptitude test was divided into 3 parts. It did not have typical quantitative or logical questions; the questions were more application-based.
Software Development Engineer II (79 salaries): ₹14 L/yr - ₹23 L/yr
Data Engineer (45 salaries): ₹8.5 L/yr - ₹25 L/yr
Data Scientist (44 salaries): ₹10.5 L/yr - ₹22.5 L/yr
Senior Data Scientist (38 salaries): ₹15 L/yr - ₹28 L/yr
Software Development Engineer (36 salaries): ₹13.2 L/yr - ₹20.4 L/yr