Filter interviews by
I applied via Naukri.com and was interviewed in Dec 2022. There were 4 interview rounds.
Sql queries round. Window functions, grouping, intermediate sql
Spark optimization techniques
Partitioning data to optimize parallelism
Caching frequently used data to avoid recomputation
Using broadcast variables to reduce data shuffling
Avoiding unnecessary transformations
Tuning memory and executor settings
Using efficient data formats like Parquet or ORC
Using appropriate join strategies
Top trending discussions
I applied via Naukri.com and was interviewed in Dec 2024. There were 2 interview rounds.
I applied via Naukri.com and was interviewed in Dec 2024. There were 2 interview rounds.
It was well designed
I applied via Approached by Company and was interviewed in Nov 2024. There were 3 interview rounds.
I applied via campus placement at Netaji Subhas Institute of Technology (NSIT) and was interviewed in May 2024. There were 5 interview rounds.
Python Programming related questions, along with one advanced SQL query problem. The final question was a Data Science project on prediting sales potential of various outlets.
Use RAID 5 to store data across all three memory chips with parity bits for fault tolerance.
Implement RAID 5 to distribute data and parity bits across all three memory chips.
If one memory chip is corrupted, the data can be reconstructed using the parity bits from the other two chips.
Example: Store 1GB of data on each chip and use the remaining space for parity bits to ensure fault tolerance.
Find the longest common prefix string from a list of strings.
Iterate through the characters of the first string and compare with corresponding characters of other strings
Stop when a mismatch is found or when reaching the end of any string
Return the prefix found so far
Joins are used in DBMS to combine rows from two or more tables based on a related column between them.
Types of joins include INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN.
INNER JOIN returns rows when there is at least one match in both tables.
LEFT JOIN returns all rows from the left table and the matched rows from the right table.
RIGHT JOIN returns all rows from the right table and the matched rows from the left tab...
Was taken by the product manager employed in the company. Basic case study question regarding a ride share app planning to expand internationally.
A formal orientation and introduction with the VP and founder of ION India
Algorithms and law can be correlated through the use of algorithms in legal processes and decision-making.
Algorithms can be used in legal research to analyze large amounts of data and identify patterns or trends.
Predictive algorithms can be used in legal cases to assess the likelihood of success or failure.
Algorithmic tools can help in legal document review and contract analysis.
However, there are concerns about bias i...
3 question were asked in 90 min time
Precision is the ratio of correctly predicted positive observations to the total predicted positive observations.
Precision is calculated as TP / (TP + FP), where TP is true positives and FP is false positives.
It measures the accuracy of positive predictions made by the model.
A high precision indicates that the model is good at predicting positive cases without many false positives.
For example, in a binary classificatio...
A large language model is a type of artificial intelligence model that is capable of understanding and generating human language at a large scale.
Large language models use deep learning techniques to process and generate text.
Examples include GPT-3 (Generative Pre-trained Transformer 3) and BERT (Bidirectional Encoder Representations from Transformers).
Cache is used for temporary storage of data in memory, while persist is used for saving data to disk for long-term storage.
Cache is typically faster as it stores data in memory for quick access.
Persist saves data to disk for durability and long-term storage.
Cache is often used for temporary data that can be recomputed if lost, while persist is used for important data that needs to be retained.
Examples: Using cache for ...
Reverse a sentence using Python
Split the sentence into words using split() method
Reverse the list of words using list slicing
Join the reversed list of words back into a sentence using join() method
I applied via LinkedIn and was interviewed in Jun 2024. There were 3 interview rounds.
Entropy measures randomness in data, while information gain measures the reduction in uncertainty after splitting data.
Entropy is used in decision trees to measure impurity in a dataset before splitting it.
Information gain is used in decision trees to measure the effectiveness of a split in reducing uncertainty.
Entropy ranges from 0 (pure dataset) to 1 (completely impure dataset).
Information gain is calculated as the d...
LSTM for longer sequences, GRU for faster training and less complex models.
Use LSTM for tasks requiring long-term dependencies and memory retention.
Use GRU for faster training and simpler models with fewer parameters.
Consider using LSTM for tasks like language translation or speech recognition.
Consider using GRU for tasks like sentiment analysis or text generation.
Time Series data were given, we have to provide some insights
I applied via Referral and was interviewed in Nov 2024. There were 2 interview rounds.
I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.
Interview experience
based on 1 review
Rating in categories
Software Development Engineer II
82
salaries
| ₹10 L/yr - ₹15.5 L/yr |
Devops Engineer
44
salaries
| ₹6.4 L/yr - ₹18 L/yr |
Development Engineer 3
38
salaries
| ₹13.5 L/yr - ₹23 L/yr |
Software Developer
36
salaries
| ₹6 L/yr - ₹24.5 L/yr |
Engineer III
30
salaries
| ₹10 L/yr - ₹20 L/yr |
Cisco
Microsoft Corporation
Intel
IBM