Upload Button Icon Add office photos

Filter interviews by

Xyz Company Data Scientist Interview Questions and Answers

Updated 27 May 2024

Xyz Company Data Scientist Interview Experiences

1 interview found

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

Good round take aptitude test. Prepare very well and try to solve leet code problems and also practice available aptitude questions bank available in google

Interview questions from similar companies

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Referral and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical 

(4 Questions)

  • Q1. Types of Chunking in data preparation in RAG
  • Q2. How Embedding works in Vector Databases
  • Q3. Explain ARIMA model
  • Q4. How can we decide to choose Linear Regression for a business problem
Round 2 - Technical 

(4 Questions)

  • Q1. What is token and it's limit for Open Source LLMs
  • Q2. Difference of a Regression and Time Series problem
  • Q3. Advantage of LSTM over RNN
  • Q4. Performance Metrics for Logistic Regression

Data Scientist Interview Questions & Answers

Bajaj Finserv user image Vaibhav Diwakar Gavli

posted on 6 Jan 2025

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

SQL & aptitude question

Round 2 - Coding Test 

1 coding question for 45 min

Round 3 - Technical 

(1 Question)

  • Q1. Detailed questing for machine learning model's.
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via LinkedIn and was interviewed in Jul 2024. There were 3 interview rounds.

Round 1 - Assignment 

Assignment on credit risk

Round 2 - Technical 

(1 Question)

  • Q1. Hyperparameter tuning
Round 3 - Technical 

(1 Question)

  • Q1. Case study for problem solving
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. How do you define model Gini?
  • Ans. 

    Model Gini is a measure of statistical dispersion used to evaluate the performance of classification models.

    • Model Gini is calculated as twice the area between the ROC curve and the diagonal line (random model).

    • It ranges from 0 (worst model) to 1 (best model), with higher values indicating better model performance.

    • A Gini coefficient of 0.5 indicates a model that is no better than random guessing.

    • Commonly used in credit

  • Answered by AI
  • Q2. How to you train XG boost model
  • Ans. 

    XGBoost model is trained by specifying parameters, splitting data into training and validation sets, fitting the model, and tuning hyperparameters.

    • Specify parameters for XGBoost model such as learning rate, max depth, and number of trees

    • Split data into training and validation sets using train_test_split function

    • Fit the XGBoost model on training data using fit method

    • Tune hyperparameters using techniques like grid search

  • Answered by AI

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Coding Test 

I was asked Python, sql, coding questions

Round 2 - Case Study 

Case study on how would you identify the total number of footfall on a airport

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
-
Result
No response

I applied via Job Portal and was interviewed in Nov 2023. There was 1 interview round.

Round 1 - One-on-one 

(5 Questions)

  • Q1. What is Gradient Descents?
  • Ans. 

    Gradient descent is an optimization algorithm used to minimize a function by iteratively moving in the direction of steepest descent.

    • Gradient descent is used to find the minimum of a function by taking steps proportional to the negative of the gradient at the current point.

    • It is commonly used in machine learning to optimize the parameters of a model by minimizing the loss function.

    • There are different variants of gradie...

  • Answered by AI
  • Q2. What is LSTM?, and what are the gates in it?
  • Ans. 

    LSTM (Long Short-Term Memory) is a type of recurrent neural network designed to handle long-term dependencies.

    • LSTM has three gates: input gate, forget gate, and output gate.

    • Input gate controls the flow of information into the cell state.

    • Forget gate decides what information to discard from the cell state.

    • Output gate determines the output based on the cell state.

  • Answered by AI
  • Q3. They gave me a link to dataset and started saying the operations to apply on that. E.g, value_counts, null_values, fill the values with mean,etc.
  • Q4. What is t-test? What is Mean, Median and Mode and where to use these?
  • Ans. 

    T-test is a statistical test used to determine if there is a significant difference between the means of two groups.

    • Mean is the average of a set of numbers, median is the middle value when the numbers are ordered, and mode is the most frequently occurring value.

    • Mean is sensitive to outliers, median is robust to outliers, and mode is useful for categorical data.

    • T-test is used to compare means of two groups, mean is used...

  • Answered by AI
  • Q5. What is RANDOM FOREST ?
  • Ans. 

    Random Forest is an ensemble learning method used for classification and regression tasks.

    • Random Forest is a collection of decision trees that are trained on random subsets of the data.

    • Each tree in the forest makes a prediction, and the final prediction is the average (regression) or majority vote (classification) of all trees.

    • Random Forest helps reduce overfitting and improve accuracy compared to a single decision tre...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Motilal Oswal Financial Services Data Scientist interview:
  • Machine Learning
  • Statistics
  • Pandas
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - HR 

(1 Question)

  • Q1. Tell me about your self
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Selected Selected

I applied via Company Website and was interviewed before Aug 2023. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. What is Bert and transformer
  • Ans. 

    Bert and transformer are models used in natural language processing for tasks like text classification and language generation.

    • Bert (Bidirectional Encoder Representations from Transformers) is a transformer-based model developed by Google for NLP tasks.

    • Transformer is a deep learning model architecture that uses self-attention mechanisms to process sequential data like text.

    • Both Bert and transformer have been widely use...

  • Answered by AI
  • Q2. NLP pre processing techniques
  • Ans. 

    NLP pre processing techniques involve cleaning and preparing text data for analysis.

    • Tokenization: breaking text into words or sentences

    • Stopword removal: removing common words that do not add meaning

    • Lemmatization: reducing words to their base form

    • Normalization: converting text to lowercase

    • Removing special characters and punctuation

  • Answered by AI
Round 2 - HR 

(2 Questions)

  • Q1. Basic questions
  • Q2. Strength weakness

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Campus Placement and was interviewed before Jul 2023. There was 1 interview round.

Round 1 - Technical 

(4 Questions)

  • Q1. Explain any Data Science project
  • Ans. 

    Developed a predictive model to forecast customer churn for a telecommunications company.

    • Identified key features such as customer tenure, monthly charges, and service usage

    • Collected and cleaned data from customer databases

    • Built a machine learning model using logistic regression or random forest algorithms

    • Evaluated model performance using metrics like accuracy, precision, and recall

    • Provided actionable insights to reduce

  • Answered by AI
  • Q2. Types of Error in Statistics
  • Ans. 

    Types of errors in statistics include sampling error, measurement error, and non-sampling error.

    • Sampling error occurs when the sample does not represent the population accurately.

    • Measurement error is caused by inaccuracies in data collection or measurement instruments.

    • Non-sampling error includes errors in data processing, analysis, and interpretation.

    • Examples: Sampling error - selecting a biased sample, Measurement err...

  • Answered by AI
  • Q3. Types of Machine learning models
  • Ans. 

    Types of machine learning models include supervised learning, unsupervised learning, and reinforcement learning.

    • Supervised learning: Models learn from labeled data, making predictions based on past examples (e.g. linear regression, support vector machines)

    • Unsupervised learning: Models find patterns in unlabeled data, clustering similar data points together (e.g. k-means clustering, PCA)

    • Reinforcement learning: Models le...

  • Answered by AI
  • Q4. Functions of pandas library, such as get_dummies()
  • Ans. 

    get_dummies() function in pandas library is used to convert categorical variables into dummy/indicator variables.

    • get_dummies() function creates dummy variables for categorical columns in a DataFrame.

    • It converts categorical variables into numerical representation for machine learning models.

    • Example: df = pd.get_dummies(df, columns=['column_name'])

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - On campus interview, Be confident, be good at project explaination.

Skills evaluated in this interview

Xyz Company Interview FAQs

How many rounds are there in Xyz Company Data Scientist interview?
Xyz Company interview process usually has 1 rounds. The most common rounds in the Xyz Company interview process are Aptitude Test.

Tell us how to improve this page.

Xyz Company Data Scientist Salary
based on 84 salaries
₹10 L/yr - ₹36.3 L/yr
62% more than the average Data Scientist Salary in India
View more details
Software Engineer
1.1k salaries
unlock blur

₹12.6 L/yr - ₹36 L/yr

Manager
881 salaries
unlock blur

₹11 L/yr - ₹45 L/yr

Software Developer
832 salaries
unlock blur

₹11.4 L/yr - ₹36 L/yr

Senior Software Engineer
407 salaries
unlock blur

₹10.1 L/yr - ₹42 L/yr

Consultant
405 salaries
unlock blur

₹8.5 L/yr - ₹29 L/yr

Explore more salaries
Compare Xyz Company with

Bajaj Finserv

4.0
Compare

Wells Fargo

3.9
Compare

JPMorgan Chase & Co.

4.0
Compare

HSBC Group

4.0
Compare
Did you find this page helpful?
Yes No
write
Share an Interview