Upload Button Icon Add office photos

Filter interviews by

CIMB bank Data Scientist Interview Questions and Answers

Updated 17 Nov 2024

CIMB bank Data Scientist Interview Experiences

2 interviews found

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

Model Training based on the dataset and problem statement provided.

Round 2 - Technical 

(2 Questions)

  • Q1. Asked about Roc Auc
  • Q2. What is difference between supervised and unsupervised learning
  • Ans. 

    Supervised learning uses labeled data to train the model, while unsupervised learning uses unlabeled data.

    • Supervised learning requires a target variable to predict, while unsupervised learning does not.

    • In supervised learning, the model learns from labeled examples, while in unsupervised learning, the model finds patterns in data.

    • Examples of supervised learning include regression and classification tasks, while clusteri

  • Answered by AI

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Approached by Company and was interviewed in May 2024. There was 1 interview round.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Xgboost in-depth
  • Ans. 

    Xgboost is a popular machine learning algorithm known for its speed and performance in handling large datasets.

    • Xgboost stands for eXtreme Gradient Boosting, which is an optimized implementation of gradient boosting.

    • It is widely used in Kaggle competitions and other machine learning tasks due to its high accuracy and efficiency.

    • Xgboost uses a technique called boosting, where multiple weak learners are combined to create...

  • Answered by AI
  • Q2. Type 1 & 2 error.

Skills evaluated in this interview

Interview questions from similar companies

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Referral and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical 

(4 Questions)

  • Q1. Types of Chunking in data preparation in RAG
  • Q2. How Embedding works in Vector Databases
  • Q3. Explain ARIMA model
  • Q4. How can we decide to choose Linear Regression for a business problem
Round 2 - Technical 

(4 Questions)

  • Q1. What is token and it's limit for Open Source LLMs
  • Q2. Difference of a Regression and Time Series problem
  • Q3. Advantage of LSTM over RNN
  • Q4. Performance Metrics for Logistic Regression

Data Scientist Interview Questions & Answers

Bajaj Finserv user image Vaibhav Diwakar Gavli

posted on 6 Jan 2025

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

SQL & aptitude question

Round 2 - Coding Test 

1 coding question for 45 min

Round 3 - Technical 

(1 Question)

  • Q1. Detailed questing for machine learning model's.
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via LinkedIn and was interviewed in Jul 2024. There were 3 interview rounds.

Round 1 - Assignment 

Assignment on credit risk

Round 2 - Technical 

(1 Question)

  • Q1. Hyperparameter tuning
Round 3 - Technical 

(1 Question)

  • Q1. Case study for problem solving
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. How do you define model Gini?
  • Ans. 

    Model Gini is a measure of statistical dispersion used to evaluate the performance of classification models.

    • Model Gini is calculated as twice the area between the ROC curve and the diagonal line (random model).

    • It ranges from 0 (worst model) to 1 (best model), with higher values indicating better model performance.

    • A Gini coefficient of 0.5 indicates a model that is no better than random guessing.

    • Commonly used in credit

  • Answered by AI
  • Q2. How to you train XG boost model
  • Ans. 

    XGBoost model is trained by specifying parameters, splitting data into training and validation sets, fitting the model, and tuning hyperparameters.

    • Specify parameters for XGBoost model such as learning rate, max depth, and number of trees

    • Split data into training and validation sets using train_test_split function

    • Fit the XGBoost model on training data using fit method

    • Tune hyperparameters using techniques like grid search

  • Answered by AI

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Coding Test 

I was asked Python, sql, coding questions

Round 2 - Case Study 

Case study on how would you identify the total number of footfall on a airport

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
-
Result
No response

I applied via Job Portal and was interviewed in Nov 2023. There was 1 interview round.

Round 1 - One-on-one 

(5 Questions)

  • Q1. What is Gradient Descents?
  • Ans. 

    Gradient descent is an optimization algorithm used to minimize a function by iteratively moving in the direction of steepest descent.

    • Gradient descent is used to find the minimum of a function by taking steps proportional to the negative of the gradient at the current point.

    • It is commonly used in machine learning to optimize the parameters of a model by minimizing the loss function.

    • There are different variants of gradie...

  • Answered by AI
  • Q2. What is LSTM?, and what are the gates in it?
  • Ans. 

    LSTM (Long Short-Term Memory) is a type of recurrent neural network designed to handle long-term dependencies.

    • LSTM has three gates: input gate, forget gate, and output gate.

    • Input gate controls the flow of information into the cell state.

    • Forget gate decides what information to discard from the cell state.

    • Output gate determines the output based on the cell state.

  • Answered by AI
  • Q3. They gave me a link to dataset and started saying the operations to apply on that. E.g, value_counts, null_values, fill the values with mean,etc.
  • Q4. What is t-test? What is Mean, Median and Mode and where to use these?
  • Ans. 

    T-test is a statistical test used to determine if there is a significant difference between the means of two groups.

    • Mean is the average of a set of numbers, median is the middle value when the numbers are ordered, and mode is the most frequently occurring value.

    • Mean is sensitive to outliers, median is robust to outliers, and mode is useful for categorical data.

    • T-test is used to compare means of two groups, mean is used...

  • Answered by AI
  • Q5. What is RANDOM FOREST ?
  • Ans. 

    Random Forest is an ensemble learning method used for classification and regression tasks.

    • Random Forest is a collection of decision trees that are trained on random subsets of the data.

    • Each tree in the forest makes a prediction, and the final prediction is the average (regression) or majority vote (classification) of all trees.

    • Random Forest helps reduce overfitting and improve accuracy compared to a single decision tre...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Motilal Oswal Financial Services Data Scientist interview:
  • Machine Learning
  • Statistics
  • Pandas
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Selected Selected

I applied via Company Website and was interviewed before Aug 2023. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. What is Bert and transformer
  • Ans. 

    Bert and transformer are models used in natural language processing for tasks like text classification and language generation.

    • Bert (Bidirectional Encoder Representations from Transformers) is a transformer-based model developed by Google for NLP tasks.

    • Transformer is a deep learning model architecture that uses self-attention mechanisms to process sequential data like text.

    • Both Bert and transformer have been widely use...

  • Answered by AI
  • Q2. NLP pre processing techniques
  • Ans. 

    NLP pre processing techniques involve cleaning and preparing text data for analysis.

    • Tokenization: breaking text into words or sentences

    • Stopword removal: removing common words that do not add meaning

    • Lemmatization: reducing words to their base form

    • Normalization: converting text to lowercase

    • Removing special characters and punctuation

  • Answered by AI
Round 2 - HR 

(2 Questions)

  • Q1. Basic questions
  • Q2. Strength weakness

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

Good round take aptitude test. Prepare very well and try to solve leet code problems and also practice available aptitude questions bank available in google

CIMB bank Interview FAQs

How many rounds are there in CIMB bank Data Scientist interview?
CIMB bank interview process usually has 1-2 rounds. The most common rounds in the CIMB bank interview process are One-on-one Round, Aptitude Test and Technical.
What are the top questions asked in CIMB bank Data Scientist interview?

Some of the top questions asked at the CIMB bank Data Scientist interview -

  1. what is difference between supervised and unsupervised learn...read more
  2. Xgboost in-de...read more
  3. asked about Roc ...read more

Tell us how to improve this page.

CIMB bank Data Scientist Salary
based on 11 salaries
₹9 L/yr - ₹25 L/yr
At par with the average Data Scientist Salary in India
View more details

CIMB bank Data Scientist Reviews and Ratings

based on 2 reviews

5.0/5

Rating in categories

4.1

Skill development

5.0

Work-Life balance

4.0

Salary & Benefits

4.0

Job Security

4.9

Company culture

4.0

Promotions/Appraisal

4.9

Work Satisfaction

Explore 2 Reviews and Ratings
Data Scientist
11 salaries
unlock blur

₹9 L/yr - ₹25 L/yr

Director
7 salaries
unlock blur

₹60 L/yr - ₹98 L/yr

Business Analyst
5 salaries
unlock blur

₹18.5 L/yr - ₹26.2 L/yr

Senior Data Analyst
5 salaries
unlock blur

₹17 L/yr - ₹26 L/yr

Senior Software Engineer
5 salaries
unlock blur

₹10 L/yr - ₹23.8 L/yr

Explore more salaries
Compare CIMB bank with

HDFC Bank

3.9
Compare

ICICI Bank

4.0
Compare

Axis Bank

3.8
Compare

State Bank of India

3.8
Compare

Calculate your in-hand salary

Confused about how your in-hand salary is calculated? Enter your annual salary (CTC) and get your in-hand salary
Did you find this page helpful?
Yes No
write
Share an Interview