GeakMinds Associate Data Scientist Interview Questions, Process, and Tips

Updated 24 Jul 2024

GeakMinds Associate Data Scientist Interview Experiences

1 interview found

Interview experience: 5 (Excellent)
Difficulty level: Moderate
Process duration: Less than 2 weeks
Result: Selected

I applied via Naukri.com and was interviewed in Jun 2024. There were 3 interview rounds.

Round 1 - System Test 

(3 Questions)

  • Q1. Write a SQL query to join two tables
  • Ans. 

    SQL query to join two tables

    • Use JOIN keyword to combine rows from two or more tables based on a related column between them

    • Specify the columns to be selected from each table

    • Use ON keyword to specify the join condition
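As a minimal sketch of the points above, the following Python snippet runs a join against an in-memory SQLite database. The `employees` and `departments` tables, their columns, and the sample rows are hypothetical and exist only to illustrate the `JOIN ... ON` syntax. A `LEFT JOIN` is included for contrast.

```python
import sqlite3

# Hypothetical tables and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE departments (id INTEGER PRIMARY KEY, dept_name TEXT);
    CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, dept_id INTEGER);
    INSERT INTO departments VALUES (1, 'Analytics'), (2, 'Engineering');
    INSERT INTO employees VALUES (1, 'Asha', 1), (2, 'Ravi', 2), (3, 'Meera', NULL);
""")

# INNER JOIN: keeps only employees whose dept_id matches a department id.
join_rows = conn.execute("""
    SELECT e.name, d.dept_name
    FROM employees AS e
    JOIN departments AS d ON e.dept_id = d.id
    ORDER BY e.id
""").fetchall()

# LEFT JOIN: keeps every employee; unmatched departments come back as NULL (None).
left_rows = conn.execute("""
    SELECT e.name, d.dept_name
    FROM employees AS e
    LEFT JOIN departments AS d ON e.dept_id = d.id
    ORDER BY e.id
""").fetchall()

print(join_rows)
print(left_rows)
```

The inner join drops the employee without a department, while the left join keeps that row with an empty department name.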

  • Answered by AI
  • Q2. Perform EDA on the provided datasets and find insights
  • Q3. Analyse the datasets and build a Machine Learning model
  • Ans. 

    Analyse the datasets and build a Machine Learning model using the following steps.

    • 1. Explore and understand the datasets to identify patterns and relationships.

    • 2. Preprocess the data by handling missing values, encoding categorical variables, and scaling numerical features.

    • 3. Split the data into training and testing sets for model evaluation.

    • 4. Choose a suitable Machine Learning algorithm based on the nature o...
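Steps 2 and 3 above can be sketched with the standard library alone. This is an illustrative sketch, not the approach used in the interview: the `ages` column, the helper names, and the 25% test fraction are all assumptions made for the example.

```python
import random

# Hypothetical numeric feature with one missing value (None).
ages = [22.0, 35.0, None, 41.0, 29.0, 50.0, 31.0, 27.0]

def impute_mean(values):
    """Replace None with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    mean = sum(observed) / len(observed)
    return [mean if v is None else v for v in values]

def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and split it into train and test lists."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

scaled = min_max_scale(impute_mean(ages))
train, test = train_test_split(scaled)
print(len(train), len(test))
```

The sketch follows the order of the steps as listed. In practice the imputation mean and the scaling bounds are usually computed on the training split only, so that no information from the test set leaks into preprocessing.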

  • Answered by AI
Round 2 - Technical 

(5 Questions)

  • Q1. Explain Assumptions of Linear Regression
  • Ans. 

    Linear regression relies on the following assumptions; when they are violated, the model's estimates and inferences can become unreliable.

    • Linear relationship between independent and dependent variables

    • Independence of residuals (errors)

    • Homoscedasticity (constant variance of residuals)

    • Normality of residuals

    • No multicollinearity among independent variables

  • Answered by AI
  • Q2. What is the difference between R-Squared and Adjusted R-Squared?
  • Ans. 

    R-Squared measures the proportion of variance explained by the model, while Adjusted R-Squared adjusts for the number of predictors in the model.

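    • R-Squared typically increases, and never decreases, as more predictors are added to the model, even if they are not relevant.

    • Adjusted R-Squared penalizes each added predictor, so it rises only when a new predictor improves the fit by more than chance would; this makes it better suited to comparing models with different numbers of predictors.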

    • R-Squared can never decrease when adding predictors, while Adjusted R-Squa...
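A short worked example may make the penalty concrete. It uses the standard formula Adjusted R² = 1 - (1 - R²)(n - 1)/(n - p - 1), where n is the number of observations and p the number of predictors. The R² of 0.80 and the sample size of 50 are hypothetical values chosen for illustration.

```python
def r_squared(y_true, y_pred):
    """Proportion of variance in y_true explained by y_pred."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1 - ss_res / ss_tot

def adjusted_r_squared(r2, n_samples, n_predictors):
    """Adjust R-squared for the number of predictors in the model."""
    return 1 - (1 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)

# Same hypothetical R-squared, different numbers of predictors.
r2 = 0.80
adj_2 = adjusted_r_squared(r2, n_samples=50, n_predictors=2)
adj_10 = adjusted_r_squared(r2, n_samples=50, n_predictors=10)
print(round(adj_2, 4), round(adj_10, 4))
```

With the same R², the model that needs ten predictors receives a lower adjusted value than the model that needs two.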

  • Answered by AI
  • Q3. What is the Central Limit Theorem?
  • Ans. 

    The Central Limit Theorem states that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases.

    • This holds regardless of the shape of the population distribution, provided the observations are independent, identically distributed, and have finite variance.

    • It is important because i...
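The theorem can be explored with a small simulation. The sketch below assumes a skewed Exponential(1) population (mean 1, standard deviation 1), chosen only for illustration, and draws repeated samples of different sizes. The mean of the sample means should stay near 1, while their spread should shrink roughly as 1/sqrt(n).

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the run is repeatable

def sample_means(sample_size, n_samples=2000):
    """Means of repeated samples drawn from a skewed Exponential(1) population."""
    return [
        statistics.fmean(rng.expovariate(1.0) for _ in range(sample_size))
        for _ in range(n_samples)
    ]

for n in (2, 30, 200):
    means = sample_means(n)
    # Columns: sample size, mean of the sample means, spread of the sample means.
    print(n, round(statistics.fmean(means), 3), round(statistics.stdev(means), 3))
```

Plotting a histogram of `means` for each sample size would show the distribution becoming more symmetric and bell-shaped as n grows.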

  • Answered by AI
  • Q4. What is the difference between Rank and Dense Rank in SQL?
  • Ans. 

    Both Rank and Dense Rank give tied rows the same rank; Rank leaves gaps in the sequence after a tie, while Dense Rank assigns consecutive ranks without gaps.

    • Rank may have gaps in ranks if there are ties, while Dense Rank does not have gaps.

    • With Rank, the rank after a tie skips ahead by the number of tied rows; with Dense Rank, the next rank always increases by exactly 1.

    • Example: If three rows have the same value and are ranked 1, 1, and 2 using...
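The difference is easiest to see side by side. The sketch below runs both window functions in an in-memory SQLite database. The `scores` table and its rows are hypothetical, and window functions require SQLite 3.25 or newer.

```python
import sqlite3

# Hypothetical table; window functions need SQLite 3.25 or newer.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE scores (name TEXT, score INTEGER);
    INSERT INTO scores VALUES ('A', 90), ('B', 90), ('C', 80), ('D', 70);
""")

ranked = conn.execute("""
    SELECT name,
           score,
           RANK() OVER (ORDER BY score DESC) AS rnk,
           DENSE_RANK() OVER (ORDER BY score DESC) AS dense_rnk
    FROM scores
    ORDER BY score DESC, name
""").fetchall()

for row in ranked:
    print(row)
```

The two rows tied at 90 share rank 1 under both functions. The next row gets rank 3 from `RANK()` because two rows precede it, and rank 2 from `DENSE_RANK()`.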

  • Answered by AI
  • Q5. What is the difference between Series and Dataframe?
  • Ans. 

    In pandas, a Series is a one-dimensional labeled array, while a DataFrame is a two-dimensional labeled data structure.

    • A Series can hold data of any type, while a DataFrame is a collection of Series that share an index.

    • A DataFrame is like a table with rows and columns, while a Series is like a single column of that table.

    • Each column of a DataFrame can have its own data type, whereas a Series has a single data type.

    • Example: Series - a column of employee names. Dataframe - a table with columns for em
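A minimal pandas sketch of the distinction, assuming pandas is installed. The employee names and ages are made-up values for illustration.

```python
import pandas as pd

# A Series: one labeled column of values (hypothetical employee names).
names = pd.Series(["Asha", "Ravi", "Meera"], name="name")

# A DataFrame: a table whose columns are Series sharing one index.
employees = pd.DataFrame({
    "name": ["Asha", "Ravi", "Meera"],
    "age": [29, 35, 41],
})

age_column = employees["age"]   # selecting one column returns a Series
first_row = employees.iloc[0]   # selecting one row also returns a Series

print(names.ndim, employees.ndim)   # number of dimensions of each structure
print(employees.shape)              # (rows, columns)
print(type(age_column).__name__, type(first_row).__name__)
```

Selecting a single column or a single row of a DataFrame yields a Series, which is why the two structures are usually learned together.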

  • Answered by AI
Round 3 - Technical 

(3 Questions)

  • Q1. Explain Random Forest algorithm
  • Ans. 

    Random Forest is an ensemble learning algorithm that creates multiple decision trees and combines their predictions.

    • Random Forest is a collection of decision trees, each trained on a bootstrap sample of the data and restricted to a random subset of features at each split.

    • Each tree predicts independently; the final prediction is the majority vote of the trees for classification, or the average of their predictions for regression.

    • Random Forest is used for classification and regression tasks, and it...
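The core idea, bootstrap sampling plus voting, can be sketched in plain Python. This is a deliberately simplified illustration and not a full Random Forest: each "tree" is a one-split decision stump, one random feature is chosen per stump rather than per split, and the tiny two-class dataset is hypothetical.

```python
import random
from collections import Counter

def fit_stump(rows, labels, feature):
    """Pick the threshold on one feature that misclassifies the fewest rows."""
    best = None
    for threshold in sorted({row[feature] for row in rows}):
        for left, right in ((0, 1), (1, 0)):
            errors = sum(
                (left if row[feature] <= threshold else right) != label
                for row, label in zip(rows, labels)
            )
            if best is None or errors < best[0]:
                best = (errors, threshold, left, right)
    _, threshold, left, right = best
    return lambda row: left if row[feature] <= threshold else right

def fit_forest(rows, labels, n_trees=25, seed=0):
    """Train stumps on bootstrap samples, each using one randomly chosen feature."""
    rng = random.Random(seed)
    n_features = len(rows[0])
    forest = []
    for _ in range(n_trees):
        # Bootstrap sample: draw len(rows) indices with replacement.
        idx = [rng.randrange(len(rows)) for _ in rows]
        feature = rng.randrange(n_features)
        forest.append(
            fit_stump([rows[i] for i in idx], [labels[i] for i in idx], feature)
        )
    return forest

def predict(forest, row):
    """Classification: return the majority vote across all stumps."""
    votes = Counter(tree(row) for tree in forest)
    return votes.most_common(1)[0][0]

# Hypothetical data: class 0 has low values on both features, class 1 high values.
rows = [(1.0, 1.0), (2.0, 3.0), (3.0, 2.0), (7.0, 8.0), (8.0, 7.0), (9.0, 9.0)]
labels = [0, 0, 0, 1, 1, 1]
forest = fit_forest(rows, labels)
print(predict(forest, (0.5, 0.5)), predict(forest, (9.5, 9.5)))
```

Individual stumps trained on an unlucky bootstrap sample can be wrong, but the majority vote smooths those errors out. For regression, the vote would be replaced by an average of the trees' numeric predictions.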

  • Answered by AI
  • Q2. What is the difference between Stemming and Lemmatization? Which one is better and why?
  • Ans. 

    Stemming reduces words to their root form, while lemmatization reduces words to their dictionary form.

    • Stemming chops off prefixes or suffixes to get the root form (e.g. 'running' becomes 'run')

    • Lemmatization uses vocabulary and morphological analysis to reduce words to their base form (e.g. 'better' becomes 'good')

    • Lemmatization is more accurate but slower than stemming

    • Stemming is faster but may not always result in a valid word
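The contrast can be illustrated with two toy functions. These are not real NLP implementations: `toy_stem` is a hypothetical rule-based suffix stripper, and `toy_lemmatize` is a hypothetical lookup in a three-word dictionary. Libraries such as NLTK or spaCy provide production versions of both.

```python
SUFFIXES = ("ing", "ed", "es", "s")

# Hypothetical mini-dictionary standing in for a real lemma vocabulary.
LEMMAS = {"running": "run", "studies": "study", "better": "good"}

def toy_stem(word):
    """Strip the first matching suffix, with no knowledge of the vocabulary."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            stem = word[: -len(suffix)]
            # Collapse a doubled final consonant, e.g. "runn" -> "run".
            if len(stem) >= 2 and stem[-1] == stem[-2] and stem[-1] not in "aeiou":
                stem = stem[:-1]
            return stem
    return word

def toy_lemmatize(word):
    """Look the word up in the dictionary; fall back to the word itself."""
    return LEMMAS.get(word, word)

for word in ("running", "studies", "better"):
    print(word, toy_stem(word), toy_lemmatize(word))
```

The stemmer turns "studies" into a non-word and cannot relate "better" to "good", while the dictionary lookup handles both at the cost of needing a vocabulary.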

  • Answered by AI
  • Q3. Questions about my previous projects

Interview Preparation Tips

Topics to prepare for GeakMinds Associate Data Scientist interview:
  • Data Science
  • Machine Learning
  • Python
  • SQL
  • Pandas
  • Statistics
Interview preparation tips for other job seekers - A good understanding of basic data science concepts is enough.
Confidence and attitude when answering questions are important.

Interview questions from similar companies

Interview experience: 3 (Average)
Difficulty level: -
Process duration: -
Result: -
Round 1 - Technical 

(2 Questions)

  • Q1. What is overfitting in machine learning?
  • Ans. 

    Overfitting occurs when a machine learning model learns the training data too well, including noise and outliers, leading to poor generalization on new data.

    • Overfitting happens when a model is too complex and captures noise in the training data.

    • It leads to poor performance on unseen data as the model fails to generalize well.

    • Techniques to prevent overfitting include cross-validation, regularization, and early stopping.

    • ...
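A small experiment can make the definition concrete. The sketch below assumes hypothetical data generated from y = 2x plus Gaussian noise. It compares a model that memorizes the training set (1-nearest-neighbour) with a simple least-squares line.

```python
import random

rng = random.Random(0)  # fixed seed so the run is repeatable

def make_data(n):
    """Noisy samples of the hypothetical relationship y = 2x + noise."""
    xs = [rng.uniform(0, 10) for _ in range(n)]
    return [(x, 2 * x + rng.gauss(0, 1)) for x in xs]

train, test = make_data(100), make_data(500)

def memorizer(x):
    """1-nearest-neighbour: return the target of the closest training point."""
    return min(train, key=lambda point: abs(point[0] - x))[1]

def fit_line(data):
    """Ordinary least squares for a single feature."""
    n = len(data)
    mean_x = sum(x for x, _ in data) / n
    mean_y = sum(y for _, y in data) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in data)
    sxx = sum((x - mean_x) ** 2 for x, _ in data)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept

def mse(model, data):
    """Mean squared error of a model on (x, y) pairs."""
    return sum((model(x) - y) ** 2 for x, y in data) / len(data)

line = fit_line(train)
for name, model in (("memorizer", memorizer), ("line", line)):
    # Columns: model name, training error, test error.
    print(name, round(mse(model, train), 3), round(mse(model, test), 3))
```

The memorizer reproduces the training targets exactly, noise included, so its training error is zero while its error on unseen data is not. The simpler line cannot fit the noise, and its training and test errors stay close to each other.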

  • Answered by AI
  • Q2. Overfitting occurs when a model learns the details... etc.
  • Ans. 

    Overfitting occurs when a model learns the details and noise in the training data to the extent that it negatively impacts the model's performance on new data.

    • Overfitting happens when a model is too complex and captures noise in the training data.

    • It leads to poor generalization and high accuracy on training data but low accuracy on new data.

    • Techniques to prevent overfitting include cross-validation, regularization, and...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Research the company before the interview.

GeakMinds Interview FAQs

How many rounds are there in GeakMinds Associate Data Scientist interview?
The GeakMinds interview process usually has 3 rounds. The most common round type is Technical.
What are the top questions asked in GeakMinds Associate Data Scientist interview?

Some of the top questions asked at the GeakMinds Associate Data Scientist interview -

  1. What is the difference between Rank and Dense Rank in S...
  2. What is the difference between Stemming and Lemmatization? Which one is better ...
  3. What is the difference between R-Squared and Adjusted R-Squar...

GeakMinds Associate Data Scientist Interview Process

based on 1 interview

Interview experience: 5 (Excellent)

Interview Questions from Similar Companies

TCS Interview Questions: 3.7 • 10.4k Interviews
Infosys Interview Questions: 3.6 • 7.6k Interviews
Wipro Interview Questions: 3.7 • 5.6k Interviews
Tech Mahindra Interview Questions: 3.5 • 3.8k Interviews
HCLTech Interview Questions: 3.5 • 3.8k Interviews
LTIMindtree Interview Questions: 3.8 • 3k Interviews
Mphasis Interview Questions: 3.4 • 805 Interviews
GeakMinds Associate Data Scientist Salary
based on 5 salaries
₹3.8 L/yr - ₹5.2 L/yr
53% less than the average Associate Data Scientist Salary in India

GeakMinds Associate Data Scientist Reviews and Ratings

based on 3 reviews

4.4/5

Rating in categories

Skill development: 4.0
Work-life balance: 4.4
Salary: 3.7
Job security: 4.5
Company culture: 4.5
Promotions: 3.7
Work satisfaction: 4.0

Software Engineer (11 salaries): ₹3.6 L/yr - ₹6.3 L/yr
Data Scientist (7 salaries): ₹3 L/yr - ₹5 L/yr
Software Test Engineer (6 salaries): ₹2 L/yr - ₹6.5 L/yr
Associate Software Engineer (6 salaries): ₹2.7 L/yr - ₹3.9 L/yr
Lead Data Scientist (5 salaries): ₹20 L/yr - ₹25 L/yr
