Data Science

10+ Data Science Interview Questions and Answers

Updated 21 Oct 2024

Popular Companies

search-icon

Q1. What is list and tuple?

Ans.

List and tuple are both data structures in Python used to store collections of items.

  • Lists are mutable, meaning their values can be changed after creation.

  • Tuples are immutable, meaning their values cannot be changed after creation.

  • Lists are defined using square brackets [], while tuples are defined using parentheses ().

  • Lists are typically used for collections of similar items, while tuples are used for collections of different items.

  • Example of a list: my_list = [1, 2, 3]

  • Examp...read more

Q2. What are data structures?

Ans.

Data structures are ways of organizing and storing data in a computer so that it can be accessed and used efficiently.

  • Data structures can be linear or non-linear

  • Examples of linear data structures include arrays, linked lists, and stacks

  • Examples of non-linear data structures include trees and graphs

  • Choosing the right data structure is important for optimizing performance

Frequently asked in,

Data Science Interview Questions and Answers for Freshers

illustration image

Q3. How does regression work.

Ans.

Regression is a statistical method used to establish a relationship between a dependent variable and one or more independent variables.

  • Regression helps to predict the value of the dependent variable based on the values of the independent variables.

  • It involves fitting a line or curve to the data points to minimize the difference between the predicted and actual values.

  • There are different types of regression such as linear regression, logistic regression, polynomial regression,...read more

Q4. difference betweenhaving and where clause

Ans.

HAVING is used with GROUP BY to filter groups, WHERE is used to filter rows

  • HAVING is used with GROUP BY to filter groups based on aggregate functions

  • WHERE is used to filter rows based on conditions

  • HAVING is applied after GROUP BY, WHERE is applied before GROUP BY

  • Example: SELECT department, AVG(salary) FROM employees GROUP BY department HAVING AVG(salary) > 50000;

  • Example: SELECT * FROM employees WHERE age > 30;

Are these interview questions helpful?

Q5. what is object orinted programing ?

Ans.

Object-oriented programming is a programming paradigm based on the concept of objects, which can contain data and code.

  • Objects are instances of classes, which define the structure and behavior of the objects.

  • Encapsulation, inheritance, and polymorphism are key principles of object-oriented programming.

  • Examples include Java, C++, and Python, which are popular object-oriented programming languages.

Q6. write sql query for the bank transaction ?

Ans.

SQL query to retrieve bank transactions data.

  • Use SELECT statement to retrieve data from transaction table.

  • Filter data based on account number or transaction date.

  • Group data by transaction type or amount for analysis.

Share interview questions and help millions of jobseekers 🌟

man-with-laptop

Q7. Explai how neural network works

Ans.

Neural networks are a type of machine learning algorithm inspired by the human brain, consisting of interconnected nodes that process information.

  • Neural networks consist of layers of interconnected nodes, with each node performing a simple mathematical operation.

  • Information is passed through the network via weighted connections between nodes, with the weights adjusted during training to optimize performance.

  • Neural networks are trained using labeled data to learn patterns and ...read more

Q8. Different type of metrics for regression

Ans.

Different metrics for evaluating regression models

  • Mean Squared Error (MSE)

  • Root Mean Squared Error (RMSE)

  • Mean Absolute Error (MAE)

  • R-squared (Coefficient of Determination)

Data Science Jobs

Analyst-Data Science 1-3 years
American Express Company
4.2
Gurgaon / Gurugram
Data Science Practitioner 18-23 years
Accenture Solutions Pvt Ltd
3.9
Mumbai
Data Science Practitioner 5-10 years
Accenture Solutions Pvt Ltd
3.9
Bhubaneswar

Q9. what is central limit theorem

Ans.

Central Limit Theorem states that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases.

  • The theorem states that regardless of the shape of the population distribution, the sampling distribution of the sample mean will be approximately normally distributed.

  • It is a fundamental concept in statistics and is used in hypothesis testing and confidence intervals.

  • For example, if you take multiple samples of a population and calculat...read more

Q10. Anyone ML model in depth concept

Ans.

A machine learning model is a mathematical model that learns from data to make predictions or decisions without being explicitly programmed.

  • ML models can be classified into categories such as supervised learning, unsupervised learning, and reinforcement learning.

  • Examples of ML models include linear regression, decision trees, support vector machines, and neural networks.

  • ML models require training data to learn patterns and relationships, and testing data to evaluate their per...read more

Q11. What is data preprocessing

Ans.

Data preprocessing is the process of cleaning, transforming, and organizing raw data before analysis.

  • Removing irrelevant or duplicate data

  • Handling missing values

  • Normalizing or standardizing data

  • Encoding categorical variables

  • Feature scaling

  • Data transformation (e.g. log transformation)

  • Data reduction (e.g. PCA)

  • Handling outliers

Q12. What is cluster?

Ans.

A cluster is a group of data points or objects that are similar to each other within the group and dissimilar to data points in other groups.

  • Clusters are formed based on the similarity of data points within the group.

  • Clustering is an unsupervised learning technique used in data science.

  • Examples of clustering algorithms include K-means, hierarchical clustering, and DBSCAN.

Q13. explain cosine similarity

Ans.

Cosine similarity is a measure of similarity between two non-zero vectors of an inner product space.

  • It measures the cosine of the angle between two vectors.

  • Values range from -1 (completely opposite) to 1 (identical), with 0 indicating orthogonality.

  • Used in recommendation systems, text mining, and clustering algorithms.

Q14. Explain project

Ans.

Developed a machine learning model to predict customer churn for a telecommunications company.

  • Collected and cleaned customer data including demographics, usage patterns, and customer service interactions.

  • Used classification algorithms such as logistic regression and random forest to build the predictive model.

  • Evaluated model performance using metrics like accuracy, precision, recall, and ROC curve.

  • Implemented the model into the company's CRM system to identify at-risk custome...read more

Frequently asked in, ,
Interview Tips & Stories
Ace your next interview with expert advice and inspiring stories

Interview experiences of popular companies

3.8
 • 2.8k Interviews
4.0
 • 2.4k Interviews
3.9
 • 2k Interviews
4.2
 • 964 Interviews
2.7
 • 221 Interviews
3.6
 • 208 Interviews
3.7
 • 8 Interviews
View all

Calculate your in-hand salary

Confused about how your in-hand salary is calculated? Enter your annual salary (CTC) and get your in-hand salary

Data Science Interview Questions
Share an Interview
Stay ahead in your career. Get AmbitionBox app
qr-code
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+

Reviews

4 L+

Interviews

4 Cr+

Salaries

1 Cr+

Users/Month

Contribute to help millions
Get AmbitionBox app

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

Follow us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter