Premium Employer

i

This company page is being actively managed by Infosys Team. If you also belong to the team, you can get access from here

Infosys Verified Tick

Compare button icon Compare button icon Compare

Filter interviews by

Infosys Data Scientist Interview Questions, Process, and Tips

Updated 19 Oct 2024

Top Infosys Data Scientist Interview Questions and Answers

  • Q1. XgBoost algorithm has 10-20 features. How are the splits decided, on which feature are they going to be divided?
  • Q2. Explain precision and recall, when are they used in which scenario?
  • Q3. What is activation function? Explain Naive Bayes? Confusion matrix? Hyperparameters in DL? Hypothesis testing
View all 14 questions

Infosys Data Scientist Interview Experiences

18 interviews found

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Not Selected

I applied via Job Portal and was interviewed in Apr 2024. There was 1 interview round.

Round 1 - Technical 

(9 Questions)

  • Q1. Explain XGBoost algoritm
  • Ans. 

    XGBoost is a powerful machine learning algorithm known for its speed and performance in handling large datasets.

    • XGBoost stands for eXtreme Gradient Boosting, which is an implementation of gradient boosting machines.

    • It is widely used in machine learning competitions and is known for its speed and performance.

    • XGBoost uses a technique called boosting, where multiple weak learners are combined to create a strong learner.

    • It...

  • Answered by AI
  • Q2. XgBoost algorithm has 10-20 features. How are the splits decided, on which feature are they going to be divided?
  • Ans. 

    XgBoost algorithm uses a greedy approach to determine splits based on feature importance.

    • XgBoost algorithm calculates the information gain for each feature to determine the best split.

    • The feature with the highest information gain is chosen for the split.

    • This process is repeated recursively for each node in the tree.

    • Features can be split based on numerical values or categories.

    • Example: If a feature like 'age' has the hi...

  • Answered by AI
  • Q3. Do you have any experience on cloud platform?
  • Ans. 

    Yes, I have experience working on cloud platforms such as AWS and Google Cloud.

    • Experience with AWS services like S3, EC2, and Redshift

    • Familiarity with Google Cloud services like BigQuery and Compute Engine

    • Utilized cloud platforms for data storage, processing, and analysis

  • Answered by AI
  • Q4. What is entropy, information gain?
  • Ans. 

    Entropy is a measure of randomness or uncertainty in a dataset, while information gain is the reduction in entropy after splitting a dataset based on a feature.

    • Entropy is used in decision tree algorithms to determine the best feature to split on.

    • Information gain measures the effectiveness of a feature in classifying the data.

    • Higher information gain indicates that a feature is more useful for splitting the data.

    • Entropy ...

  • Answered by AI
  • Q5. What is hypothesis testing?
  • Ans. 

    Hypothesis testing is a statistical method used to make inferences about a population based on sample data.

    • Hypothesis testing involves formulating a null hypothesis and an alternative hypothesis.

    • The null hypothesis is assumed to be true until there is enough evidence to reject it.

    • Statistical tests are used to determine the likelihood of observing the data if the null hypothesis is true.

    • The p-value is used to determine ...

  • Answered by AI
  • Q6. Explain precision and recall, when are they used in which scenario?
  • Ans. 

    Precision and recall are metrics used in evaluating the performance of classification models.

    • Precision measures the accuracy of positive predictions, while recall measures the ability of the model to find all positive instances.

    • Precision = TP / (TP + FP)

    • Recall = TP / (TP + FN)

    • Precision is important when false positives are costly, while recall is important when false negatives are costly.

    • For example, in a spam email de...

  • Answered by AI
  • Q7. What is data imbalance?
  • Ans. 

    Data imbalance refers to unequal distribution of classes in a dataset, where one class has significantly more samples than others.

    • Data imbalance can lead to biased models that favor the majority class.

    • It can result in poor performance for minority classes, as the model may struggle to accurately predict them.

    • Techniques like oversampling, undersampling, and using different evaluation metrics can help address data imbala...

  • Answered by AI
  • Q8. What is SMOTE? Do you have any experience working on Time Series? Code analysis of global variable?
  • Ans. 

    SMOTE stands for Synthetic Minority Over-sampling Technique, used to balance imbalanced datasets by generating synthetic samples.

    • SMOTE is commonly used in machine learning to address class imbalance by creating synthetic samples of the minority class.

    • It works by generating new instances of the minority class by interpolating between existing instances.

    • SMOTE is particularly useful in scenarios where the minority class i...

  • Answered by AI
  • Q9. Find 5th highest salary in every department. What are window functions Difference between union and union all Difference between delete and truncate.

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare basics well. Go through the top questions asked for SQL,Python,Data Science.
Well versed with resume projects and concepts used in it.

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Referral and was interviewed in Jul 2024. There were 2 interview rounds.

Round 1 - Coding Test 

Basic operations on dataframe using Pandas and SQL basics.

Round 2 - Technical 

(2 Questions)

  • Q1. Data preprocessing related questions like steps took. Experience about working projects.
  • Q2. Random forest and decision tress related questions

Data Scientist Interview Questions Asked at Other Companies

Q1. for a data with 1000 samples and 700 dimensions, how would you fi ... read more
Q2. Special Sum of Array Problem Statement Given an array 'arr' conta ... read more
asked in Affine
Q3. you have a pandas dataframe with three columns, filled with state ... read more
Q4. Clone a Linked List with Random Pointers Given a linked list wher ... read more
asked in Coforge
Q5. coding question of finding index of 2 nos. having total equal to ... read more
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. Knn and logistic regression
  • Q2. Correlation vs covariance
  • Ans. 

    Covariance measures the relationship between two variables, while correlation measures the strength and direction of the relationship.

    • Covariance can be positive, negative, or zero, indicating the direction of the relationship between variables.

    • Correlation is always between -1 and 1, with 1 indicating a perfect positive relationship, -1 indicating a perfect negative relationship, and 0 indicating no relationship.

    • Covaria...

  • Answered by AI
Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. Basic Statistics
  • Q2. Basic ML, DL question

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare well with basics

Infosys interview questions for designations

 Senior Data Analyst

 (2)

 Associate Data Analyst

 (1)

 Business Intelligence Analyst

 (1)

 Business Intelligence Developer

 (1)

 Data Analyst

 (33)

 Data Engineer

 (25)

 Data Manager

 (1)

 Data Architect

 (1)

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
-

I applied via Newspaper Ad and was interviewed in Dec 2023. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. Questions on Bert , lstm
  • Q2. Questions on bi lstm gpt
Round 2 - HR 

(1 Question)

  • Q1. Salary negotiations and bonus

Interview Preparation Tips

Topics to prepare for Infosys Data Scientist interview:
  • Machine Learning

Get interview-ready with Top Infosys Interview Questions

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
6-8 weeks
Result
Not Selected

I applied via Recruitment Consulltant and was interviewed in Feb 2024. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. What is L1 and L2 Regularization?
  • Ans. 

    L1 and L2 regularization are techniques used in machine learning to prevent overfitting by adding penalty terms to the cost function.

    • L1 regularization adds the absolute values of the coefficients as penalty term to the cost function.

    • L2 regularization adds the squared values of the coefficients as penalty term to the cost function.

    • L1 regularization can lead to sparse models by forcing some coefficients to be exactly zer...

  • Answered by AI

Skills evaluated in this interview

Data Scientist Jobs at Infosys

View all
Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. What projects you worked on
Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Company Website and was interviewed before Feb 2023. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Attended the interview on April 2023. 2 panel members joined the interview. Most of the questions are from basic ML and DL concepts. Same day I received the documents upload email and had salary discussio...
  • Q2. What is activation function? Explain Naive Bayes? Confusion matrix? Hyperparameters in DL? Hypothesis testing
  • Ans. 

    Activation function is a mathematical function used in neural networks to introduce non-linearity.

    • Activation function is applied to the weighted sum of inputs in a neural network node.

    • It helps in determining the output of a node or the activation of a neuron.

    • Common activation functions include sigmoid, tanh, ReLU, and softmax.

    • Activation functions introduce non-linearity, allowing neural networks to learn complex patter...

  • Answered by AI

Skills evaluated in this interview

Interview experience
3
Average
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(3 Questions)

  • Q1. What is multi collinearity?
  • Ans. 

    Multicollinearity is a phenomenon where two or more independent variables in a regression model are highly correlated.

    • It can lead to unstable and unreliable estimates of regression coefficients.

    • It can also make it difficult to determine the individual effect of each independent variable on the dependent variable.

    • It can be detected using methods such as correlation matrix, variance inflation factor (VIF), and eigenvalue...

  • Answered by AI
  • Q2. Machine learning algorithms - decsisin tree?
  • Q3. Solve Try catch block

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare basics of machine Learning algorithm. And have generalised overview of latest technology.

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Case Study 

Case Study for Data Analysis for marketing company

Round 3 - Group Discussion 

Group Discussion on Case Study

Round 4 - One-on-one 

(2 Questions)

  • Q1. Technical round questions about data analytics
  • Q2. Technical round about data analytics

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident!

Infosys Interview FAQs

How many rounds are there in Infosys Data Scientist interview?
Infosys interview process usually has 1-2 rounds. The most common rounds in the Infosys interview process are Technical, Resume Shortlist and HR.
How to prepare for Infosys Data Scientist interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Infosys. The most common topics and skills that interviewers at Infosys expect are Python, SQL, Data Science, Machine Learning and R.
What are the top questions asked in Infosys Data Scientist interview?

Some of the top questions asked at the Infosys Data Scientist interview -

  1. XgBoost algorithm has 10-20 features. How are the splits decided, on which feat...read more
  2. Explain precision and recall, when are they used in which scenar...read more
  3. What is activation function? Explain Naive Bayes? Confusion matrix? Hyperparame...read more

Tell us how to improve this page.

Infosys Data Scientist Interview Process

based on 15 interviews

3 Interview rounds

  • Technical Round
  • HR Round
  • Group Discussion Round
View more
Join Infosys Creating the next opportunity for people, businesses & communities
Infosys Data Scientist Salary
based on 456 salaries
₹4 L/yr - ₹16.2 L/yr
35% less than the average Data Scientist Salary in India
View more details

Infosys Data Scientist Reviews and Ratings

based on 17 reviews

4.4/5

Rating in categories

4.1

Skill development

4.2

Work-life balance

3.5

Salary

4.3

Job security

4.2

Company culture

3.3

Promotions

4.0

Work satisfaction

Explore 17 Reviews and Ratings
Data Scientist-Gen AI

Bangalore / Bengaluru

9-16 Yrs

Not Disclosed

Data Scientist

Bangalore / Bengaluru

9-11 Yrs

Not Disclosed

Data Scientist (0-2 yrs)

0-2 Yrs

Not Disclosed

Explore more jobs
Technology Analyst
55.9k salaries
unlock blur

₹3 L/yr - ₹11 L/yr

Senior Systems Engineer
50.4k salaries
unlock blur

₹2.8 L/yr - ₹8 L/yr

System Engineer
39.5k salaries
unlock blur

₹2.5 L/yr - ₹5.5 L/yr

Technical Lead
30.8k salaries
unlock blur

₹5.2 L/yr - ₹19.5 L/yr

Senior Associate Consultant
27.7k salaries
unlock blur

₹6.2 L/yr - ₹16.8 L/yr

Explore more salaries
Compare Infosys with

TCS

3.7
Compare

Wipro

3.7
Compare

Cognizant

3.8
Compare

Accenture

3.8
Compare
Did you find this page helpful?
Yes No
write
Share an Interview