Data Science

Data Science Interview Questions and Answers for Freshers

Updated 12 Jul 2025
search-icon
2d ago

Q. What are Python’s key features? Explain list vs. tuple vs. set vs. dictionary. How do you handle missing data in a dataset? What is the difference between apply() and map() in pandas? What are lambda functions...

read more
Ans.

Python is a versatile programming language known for its simplicity, readability, and extensive libraries for data science.

  • Easy to Learn: Python's syntax is clear and intuitive, making it accessible for beginners and experienced programmers alike.

  • Extensive Libraries: Python has a rich ecosystem of libraries like NumPy, Pandas, and Matplotlib for data manipulation and analysis.

  • Cross-Platform: Python runs on various operating systems, including Windows, macOS, and Linux, ensuri...read more

5d ago

Q. What is the Central Limit Theorem? Explain p-value and its importance. What are Type I and Type II errors? Describe a normal distribution and its properties. What is correlation vs. causation?

Ans.

The Central Limit Theorem states that the distribution of sample means approaches normality as sample size increases.

  • Central Limit Theorem (CLT): Regardless of the population distribution, the sampling distribution of the mean will be approximately normal if the sample size is large enough (n > 30).

  • P-Value: The p-value measures the strength of evidence against the null hypothesis; a low p-value (typically < 0.05) indicates strong evidence to reject the null hypothesis.

  • Type I ...read more

4d ago

Q. If your model’s accuracy drops in production, how would you troubleshoot? How would you deal with imbalanced data? How do you explain your model to a non-technical stakeholder?

Ans.

Troubleshooting model accuracy involves systematic checks, while imbalanced data requires specific techniques to address bias.

  • Monitor Data Drift: Check if the input data distribution has changed since the model was trained, which can affect accuracy.

  • Evaluate Model Performance: Use metrics like precision, recall, and F1-score to get a better understanding of model performance beyond accuracy.

  • Feature Importance Analysis: Identify if certain features have become less relevant or...read more

Asked in TCS

4d ago

Q. What are the differences between a list and a tuple?

Ans.

List and tuple are both data structures in Python used to store collections of items.

  • Lists are mutable, meaning their values can be changed after creation.

  • Tuples are immutable, meaning their values cannot be changed after creation.

  • Lists are defined using square brackets [], while tuples are defined using parentheses ().

  • Lists are typically used for collections of similar items, while tuples are used for collections of different items.

  • Example of a list: my_list = [1, 2, 3]

  • Examp...read more

Are these interview questions helpful?

Asked in Accenture

6d ago

Q. What are data structures?

Ans.

Data structures are ways of organizing and storing data in a computer so that it can be accessed and used efficiently.

  • Data structures can be linear or non-linear

  • Examples of linear data structures include arrays, linked lists, and stacks

  • Examples of non-linear data structures include trees and graphs

  • Choosing the right data structure is important for optimizing performance

Asked in Capgemini

3d ago

Q. What are the OOP concepts?

Ans.

OOP concepts are foundational principles in programming that enable code reusability and organization through objects and classes.

  • Encapsulation: Bundling data and methods that operate on the data within one unit (e.g., a class).

  • Inheritance: Mechanism where a new class derives properties and behavior from an existing class (e.g., a 'Dog' class inheriting from an 'Animal' class).

  • Polymorphism: Ability to present the same interface for different data types (e.g., a function that ...read more

Data Science Jobs

American Express India logo
Analyst-Data Science 0-2 years
American Express India
4.1
Gurgaon / Gurugram
American Express India logo
Analyst- Data Science 0-3 years
American Express India
4.1
Gurgaon / Gurugram
American Express India logo
Analyst-Data Science 0-4 years
American Express India
4.1
Gurgaon / Gurugram

Asked in Infosys

2d ago

Q. What is Machine Learning?

Ans.

Machine learning is a subset of artificial intelligence that enables systems to learn from data and improve over time without explicit programming.

  • Machine learning algorithms can be supervised (e.g., predicting house prices) or unsupervised (e.g., clustering customers).

  • Common algorithms include decision trees, neural networks, and support vector machines.

  • Applications range from image recognition (e.g., facial recognition) to natural language processing (e.g., chatbots).

  • Machin...read more

Interview Experiences of Popular Companies

Deloitte Logo
3.7
 • 3k Interviews
ICICI Bank Logo
4.0
 • 2.6k Interviews
HDFC Bank Logo
3.9
 • 2.5k Interviews
Tata Motors Logo
4.1
 • 1.1k Interviews
Mu Sigma Logo
2.6
 • 240 Interviews
View all
interview tips and stories logo
Interview Tips & Stories
Ace your next interview with expert advice and inspiring stories
Data Science Interview Questions
Share an Interview
Stay ahead in your career. Get AmbitionBox app
play-icon
play-icon
qr-code
Trusted by over 1.5 Crore job seekers to find their right fit company
80 L+

Reviews

10L+

Interviews

4 Cr+

Salaries

1.5 Cr+

Users

Contribute to help millions

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2025 Info Edge (India) Ltd.

Follow Us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter
Profile Image
Hello, Guest
AmbitionBox Employee Choice Awards 2025
Winners announced!
awards-icon
Contribute to help millions!
Write a review
Write a review
Share interview
Share interview
Contribute salary
Contribute salary
Add office photos
Add office photos
Add office benefits
Add office benefits