Filter interviews by
Model Training based on the dataset and problem statement provided.
Supervised learning uses labeled data to train the model, while unsupervised learning uses unlabeled data.
Supervised learning requires a target variable to predict, while unsupervised learning does not.
In supervised learning, the model learns from labeled examples, while in unsupervised learning, the model finds patterns in data.
Examples of supervised learning include regression and classification tasks, while clusteri
I applied via Approached by Company and was interviewed in May 2024. There was 1 interview round.
Xgboost is a popular machine learning algorithm known for its speed and performance in handling large datasets.
Xgboost stands for eXtreme Gradient Boosting, which is an optimized implementation of gradient boosting.
It is widely used in Kaggle competitions and other machine learning tasks due to its high accuracy and efficiency.
Xgboost uses a technique called boosting, where multiple weak learners are combined to create...
I applied via Referral and was interviewed in Nov 2024. There were 2 interview rounds.
posted on 6 Jan 2025
SQL & aptitude question
1 coding question for 45 min
I applied via LinkedIn and was interviewed in Jul 2024. There were 3 interview rounds.
Assignment on credit risk
Model Gini is a measure of statistical dispersion used to evaluate the performance of classification models.
Model Gini is calculated as twice the area between the ROC curve and the diagonal line (random model).
It ranges from 0 (worst model) to 1 (best model), with higher values indicating better model performance.
A Gini coefficient of 0.5 indicates a model that is no better than random guessing.
Commonly used in credit
XGBoost model is trained by specifying parameters, splitting data into training and validation sets, fitting the model, and tuning hyperparameters.
Specify parameters for XGBoost model such as learning rate, max depth, and number of trees
Split data into training and validation sets using train_test_split function
Fit the XGBoost model on training data using fit method
Tune hyperparameters using techniques like grid search
I was asked Python, sql, coding questions
Case study on how would you identify the total number of footfall on a airport
posted on 7 May 2024
I applied via Job Portal and was interviewed in Nov 2023. There was 1 interview round.
Gradient descent is an optimization algorithm used to minimize a function by iteratively moving in the direction of steepest descent.
Gradient descent is used to find the minimum of a function by taking steps proportional to the negative of the gradient at the current point.
It is commonly used in machine learning to optimize the parameters of a model by minimizing the loss function.
There are different variants of gradie...
LSTM (Long Short-Term Memory) is a type of recurrent neural network designed to handle long-term dependencies.
LSTM has three gates: input gate, forget gate, and output gate.
Input gate controls the flow of information into the cell state.
Forget gate decides what information to discard from the cell state.
Output gate determines the output based on the cell state.
T-test is a statistical test used to determine if there is a significant difference between the means of two groups.
Mean is the average of a set of numbers, median is the middle value when the numbers are ordered, and mode is the most frequently occurring value.
Mean is sensitive to outliers, median is robust to outliers, and mode is useful for categorical data.
T-test is used to compare means of two groups, mean is used...
Random Forest is an ensemble learning method used for classification and regression tasks.
Random Forest is a collection of decision trees that are trained on random subsets of the data.
Each tree in the forest makes a prediction, and the final prediction is the average (regression) or majority vote (classification) of all trees.
Random Forest helps reduce overfitting and improve accuracy compared to a single decision tre...
I applied via Company Website and was interviewed before Aug 2023. There were 2 interview rounds.
Bert and transformer are models used in natural language processing for tasks like text classification and language generation.
Bert (Bidirectional Encoder Representations from Transformers) is a transformer-based model developed by Google for NLP tasks.
Transformer is a deep learning model architecture that uses self-attention mechanisms to process sequential data like text.
Both Bert and transformer have been widely use...
NLP pre processing techniques involve cleaning and preparing text data for analysis.
Tokenization: breaking text into words or sentences
Stopword removal: removing common words that do not add meaning
Lemmatization: reducing words to their base form
Normalization: converting text to lowercase
Removing special characters and punctuation
Good round take aptitude test. Prepare very well and try to solve leet code problems and also practice available aptitude questions bank available in google
based on 2 reviews
Rating in categories
Data Scientist
11
salaries
| ₹9 L/yr - ₹25 L/yr |
Director
7
salaries
| ₹60 L/yr - ₹98 L/yr |
Business Analyst
5
salaries
| ₹18.5 L/yr - ₹26.2 L/yr |
Senior Data Analyst
5
salaries
| ₹17 L/yr - ₹26 L/yr |
Senior Software Engineer
5
salaries
| ₹10 L/yr - ₹23.8 L/yr |
HDFC Bank
ICICI Bank
Axis Bank
State Bank of India