Filter interviews by
I applied via Naukri.com and was interviewed in Dec 2022. There were 4 interview rounds.
Sql queries round. Window functions, grouping, intermediate sql
Spark optimization techniques
Partitioning data to optimize parallelism
Caching frequently used data to avoid recomputation
Using broadcast variables to reduce data shuffling
Avoiding unnecessary transformations
Tuning memory and executor settings
Using efficient data formats like Parquet or ORC
Using appropriate join strategies
Top trending discussions
I applied via Naukri.com and was interviewed in Dec 2024. There were 2 interview rounds.
It was well designed
I applied via Approached by Company and was interviewed in Nov 2024. There were 3 interview rounds.
I applied via campus placement at Netaji Subhas Institute of Technology (NSIT) and was interviewed in May 2024. There were 5 interview rounds.
Python Programming related questions, along with one advanced SQL query problem. The final question was a Data Science project on prediting sales potential of various outlets.
Use RAID 5 to store data across all three memory chips with parity bits for fault tolerance.
Implement RAID 5 to distribute data and parity bits across all three memory chips.
If one memory chip is corrupted, the data can be reconstructed using the parity bits from the other two chips.
Example: Store 1GB of data on each chip and use the remaining space for parity bits to ensure fault tolerance.
Find the longest common prefix string from a list of strings.
Iterate through the characters of the first string and compare with corresponding characters of other strings
Stop when a mismatch is found or when reaching the end of any string
Return the prefix found so far
Joins are used in DBMS to combine rows from two or more tables based on a related column between them.
Types of joins include INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN.
INNER JOIN returns rows when there is at least one match in both tables.
LEFT JOIN returns all rows from the left table and the matched rows from the right table.
RIGHT JOIN returns all rows from the right table and the matched rows from the left tab...
Was taken by the product manager employed in the company. Basic case study question regarding a ride share app planning to expand internationally.
A formal orientation and introduction with the VP and founder of ION India
Algorithms and law can be correlated through the use of algorithms in legal processes and decision-making.
Algorithms can be used in legal research to analyze large amounts of data and identify patterns or trends.
Predictive algorithms can be used in legal cases to assess the likelihood of success or failure.
Algorithmic tools can help in legal document review and contract analysis.
However, there are concerns about bias i...
3 question were asked in 90 min time
Precision is the ratio of correctly predicted positive observations to the total predicted positive observations.
Precision is calculated as TP / (TP + FP), where TP is true positives and FP is false positives.
It measures the accuracy of positive predictions made by the model.
A high precision indicates that the model is good at predicting positive cases without many false positives.
For example, in a binary classificatio...
A large language model is a type of artificial intelligence model that is capable of understanding and generating human language at a large scale.
Large language models use deep learning techniques to process and generate text.
Examples include GPT-3 (Generative Pre-trained Transformer 3) and BERT (Bidirectional Encoder Representations from Transformers).
I was interviewed before Feb 2024.
Written test, asked general question for testing our skills
Cache is used for temporary storage of data in memory, while persist is used for saving data to disk for long-term storage.
Cache is typically faster as it stores data in memory for quick access.
Persist saves data to disk for durability and long-term storage.
Cache is often used for temporary data that can be recomputed if lost, while persist is used for important data that needs to be retained.
Examples: Using cache for ...
Reverse a sentence using Python
Split the sentence into words using split() method
Reverse the list of words using list slicing
Join the reversed list of words back into a sentence using join() method
I applied via Referral and was interviewed in Nov 2024. There were 2 interview rounds.
A lambda function in Python is a small anonymous function defined using the lambda keyword.
Lambda functions can have any number of arguments, but can only have one expression.
Syntax: lambda arguments : expression
Example: lambda x, y : x + y
dbutils is a utility provided by Databricks for interacting with files and directories in the Databricks environment.
dbutils.fs.ls('/') - list files in root directory
dbutils.fs.cp('dbfs:/file.txt', 'file.txt') - copy file from DBFS to local file system
dbutils.fs.mkdirs('dbfs:/new_dir') - create a new directory in DBFS
A commit in SQL is a command that saves all the changes made in a transaction to the database.
A commit is used to make all the changes made in a transaction permanent.
Once a commit is issued, the changes cannot be rolled back.
It is important to use commit to ensure data integrity and consistency.
Example: COMMIT; - this command is used to commit the changes in a transaction.
It contain both Aptitude and Coding about base models and Deep learning too
Different models techniques include linear regression, decision trees, random forests, support vector machines, and neural networks.
Linear regression is used for predicting continuous values.
Decision trees are used for classification and regression tasks.
Random forests are an ensemble method based on decision trees.
Support vector machines are used for classification tasks.
Neural networks are used for complex pattern re
Different performance metrics are used for different types of machine learning models to evaluate their effectiveness.
For classification models, metrics like accuracy, precision, recall, F1 score, and ROC-AUC are commonly used.
For regression models, metrics like mean squared error (MSE), mean absolute error (MAE), and R-squared are commonly used.
For clustering models, metrics like silhouette score and Davies-Bouldin in...
Interview experience
based on 1 review
Rating in categories
Software Development Engineer II
82
salaries
| ₹10 L/yr - ₹15.5 L/yr |
Devops Engineer
44
salaries
| ₹6.4 L/yr - ₹18 L/yr |
Development Engineer 3
38
salaries
| ₹13.5 L/yr - ₹23 L/yr |
Software Developer
36
salaries
| ₹6 L/yr - ₹24.5 L/yr |
Engineer III
30
salaries
| ₹10 L/yr - ₹20 L/yr |
Cisco
Microsoft Corporation
Intel
IBM