Data Science
Data Science Interview Questions and Answers for Freshers

Asked in Infotact Solutions

Q. What are Python’s key features? Explain list vs. tuple vs. set vs. dictionary. How do you handle missing data in a dataset? What is the difference between apply() and map() in pandas? What are lambda functions...
read morePython is a versatile programming language known for its simplicity, readability, and extensive libraries for data science.
Easy to Learn: Python's syntax is clear and intuitive, making it accessible for beginners and experienced programmers alike.
Extensive Libraries: Python has a rich ecosystem of libraries like NumPy, Pandas, and Matplotlib for data manipulation and analysis.
Cross-Platform: Python runs on various operating systems, including Windows, macOS, and Linux, ensuri...read more

Asked in Infotact Solutions

Q. What is the Central Limit Theorem? Explain p-value and its importance. What are Type I and Type II errors? Describe a normal distribution and its properties. What is correlation vs. causation?
The Central Limit Theorem states that the distribution of sample means approaches normality as sample size increases.
Central Limit Theorem (CLT): Regardless of the population distribution, the sampling distribution of the mean will be approximately normal if the sample size is large enough (n > 30).
P-Value: The p-value measures the strength of evidence against the null hypothesis; a low p-value (typically < 0.05) indicates strong evidence to reject the null hypothesis.
Type I ...read more

Asked in Infotact Solutions

Q. If your model’s accuracy drops in production, how would you troubleshoot? How would you deal with imbalanced data? How do you explain your model to a non-technical stakeholder?
Troubleshooting model accuracy involves systematic checks, while imbalanced data requires specific techniques to address bias.
Monitor Data Drift: Check if the input data distribution has changed since the model was trained, which can affect accuracy.
Evaluate Model Performance: Use metrics like precision, recall, and F1-score to get a better understanding of model performance beyond accuracy.
Feature Importance Analysis: Identify if certain features have become less relevant or...read more

Asked in TCS

Q. What are the differences between a list and a tuple?
List and tuple are both data structures in Python used to store collections of items.
Lists are mutable, meaning their values can be changed after creation.
Tuples are immutable, meaning their values cannot be changed after creation.
Lists are defined using square brackets [], while tuples are defined using parentheses ().
Lists are typically used for collections of similar items, while tuples are used for collections of different items.
Example of a list: my_list = [1, 2, 3]
Examp...read more

Asked in Accenture

Q. What are data structures?
Data structures are ways of organizing and storing data in a computer so that it can be accessed and used efficiently.
Data structures can be linear or non-linear
Examples of linear data structures include arrays, linked lists, and stacks
Examples of non-linear data structures include trees and graphs
Choosing the right data structure is important for optimizing performance

Asked in Capgemini

Q. What are the OOP concepts?
OOP concepts are foundational principles in programming that enable code reusability and organization through objects and classes.
Encapsulation: Bundling data and methods that operate on the data within one unit (e.g., a class).
Inheritance: Mechanism where a new class derives properties and behavior from an existing class (e.g., a 'Dog' class inheriting from an 'Animal' class).
Polymorphism: Ability to present the same interface for different data types (e.g., a function that ...read more
Data Science Jobs




Asked in Infosys

Q. What is Machine Learning?
Machine learning is a subset of artificial intelligence that enables systems to learn from data and improve over time without explicit programming.
Machine learning algorithms can be supervised (e.g., predicting house prices) or unsupervised (e.g., clustering customers).
Common algorithms include decision trees, neural networks, and support vector machines.
Applications range from image recognition (e.g., facial recognition) to natural language processing (e.g., chatbots).
Machin...read more
Interview Experiences of Popular Companies








Reviews
Interviews
Salaries
Users

