Filter interviews by
I applied via Referral and was interviewed before Oct 2023. There was 1 interview round.
Transformers are a type of deep learning model that uses self-attention mechanisms to process sequential data.
Transformers are neural network architectures designed to handle sequential data efficiently.
They use self-attention mechanisms to weigh the importance of different input elements when making predictions.
Examples of transformer models include BERT, GPT-3, and Transformer-XL.
Identifying a target involves defining the specific outcome or variable of interest in the given use case.
Understand the objectives and goals of the project to determine the target variable
Analyze the available data to identify patterns and relationships that can help define the target
Consider the business context and stakeholders' requirements to determine the target variable
Use statistical techniques and machine lear...
Top trending discussions
I applied via Naukri.com and was interviewed in Dec 2024. There were 2 interview rounds.
I applied via Approached by Company and was interviewed in Nov 2024. There were 3 interview rounds.
I applied via campus placement at Netaji Subhas Institute of Technology (NSIT) and was interviewed in May 2024. There were 5 interview rounds.
Python Programming related questions, along with one advanced SQL query problem. The final question was a Data Science project on prediting sales potential of various outlets.
Use RAID 5 to store data across all three memory chips with parity bits for fault tolerance.
Implement RAID 5 to distribute data and parity bits across all three memory chips.
If one memory chip is corrupted, the data can be reconstructed using the parity bits from the other two chips.
Example: Store 1GB of data on each chip and use the remaining space for parity bits to ensure fault tolerance.
Find the longest common prefix string from a list of strings.
Iterate through the characters of the first string and compare with corresponding characters of other strings
Stop when a mismatch is found or when reaching the end of any string
Return the prefix found so far
Joins are used in DBMS to combine rows from two or more tables based on a related column between them.
Types of joins include INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN.
INNER JOIN returns rows when there is at least one match in both tables.
LEFT JOIN returns all rows from the left table and the matched rows from the right table.
RIGHT JOIN returns all rows from the right table and the matched rows from the left tab...
Was taken by the product manager employed in the company. Basic case study question regarding a ride share app planning to expand internationally.
A formal orientation and introduction with the VP and founder of ION India
Algorithms and law can be correlated through the use of algorithms in legal processes and decision-making.
Algorithms can be used in legal research to analyze large amounts of data and identify patterns or trends.
Predictive algorithms can be used in legal cases to assess the likelihood of success or failure.
Algorithmic tools can help in legal document review and contract analysis.
However, there are concerns about bias i...
3 question were asked in 90 min time
Precision is the ratio of correctly predicted positive observations to the total predicted positive observations.
Precision is calculated as TP / (TP + FP), where TP is true positives and FP is false positives.
It measures the accuracy of positive predictions made by the model.
A high precision indicates that the model is good at predicting positive cases without many false positives.
For example, in a binary classificatio...
A large language model is a type of artificial intelligence model that is capable of understanding and generating human language at a large scale.
Large language models use deep learning techniques to process and generate text.
Examples include GPT-3 (Generative Pre-trained Transformer 3) and BERT (Bidirectional Encoder Representations from Transformers).
I applied via LinkedIn and was interviewed in Jun 2024. There were 3 interview rounds.
Entropy measures randomness in data, while information gain measures the reduction in uncertainty after splitting data.
Entropy is used in decision trees to measure impurity in a dataset before splitting it.
Information gain is used in decision trees to measure the effectiveness of a split in reducing uncertainty.
Entropy ranges from 0 (pure dataset) to 1 (completely impure dataset).
Information gain is calculated as the d...
LSTM for longer sequences, GRU for faster training and less complex models.
Use LSTM for tasks requiring long-term dependencies and memory retention.
Use GRU for faster training and simpler models with fewer parameters.
Consider using LSTM for tasks like language translation or speech recognition.
Consider using GRU for tasks like sentiment analysis or text generation.
Time Series data were given, we have to provide some insights
It contain both Aptitude and Coding about base models and Deep learning too
Different models techniques include linear regression, decision trees, random forests, support vector machines, and neural networks.
Linear regression is used for predicting continuous values.
Decision trees are used for classification and regression tasks.
Random forests are an ensemble method based on decision trees.
Support vector machines are used for classification tasks.
Neural networks are used for complex pattern re
Different performance metrics are used for different types of machine learning models to evaluate their effectiveness.
For classification models, metrics like accuracy, precision, recall, F1 score, and ROC-AUC are commonly used.
For regression models, metrics like mean squared error (MSE), mean absolute error (MAE), and R-squared are commonly used.
For clustering models, metrics like silhouette score and Davies-Bouldin in...
Adam optimizer is an extension to the Gradient Descent optimizer with adaptive learning rates and momentum.
Adam optimizer combines the benefits of both AdaGrad and RMSProp optimizers.
Adam optimizer uses adaptive learning rates for each parameter.
Gradient Descent optimizer has a fixed learning rate for all parameters.
Adam optimizer includes momentum to speed up convergence.
Gradient Descent optimizer updates parameters b...
Use ReLU for hidden layers in deep neural networks, avoid for output layers.
ReLU is commonly used in hidden layers to introduce non-linearity and speed up convergence.
Avoid using ReLU in output layers for regression tasks as it can lead to vanishing gradients.
Consider using Leaky ReLU or Sigmoid for output layers depending on the task.
ReLU is computationally efficient and helps in preventing the vanishing gradient prob...
I applied via Naukri.com and was interviewed in Jan 2024. There was 1 interview round.
Sql and python questions were there with basic logic check
Python code with function
Define a function using 'def' keyword
Include parameters inside parentheses
Use 'return' statement to return a value from the function
based on 1 interview
Interview experience
Software Development Engineer II
94
salaries
| ₹10 L/yr - ₹15 L/yr |
Devops Engineer
45
salaries
| ₹6.4 L/yr - ₹18 L/yr |
Development Engineer 3
40
salaries
| ₹13.5 L/yr - ₹23 L/yr |
Software Developer
36
salaries
| ₹6 L/yr - ₹24.5 L/yr |
Engineer III
30
salaries
| ₹10 L/yr - ₹20 L/yr |
Cisco
Microsoft Corporation
Intel
IBM