Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 1K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

WINNERS AWAITED!
- ABECA 2025
  
  WINNERS AWAITED!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

Add office photos

Employer? Claim Account for FREE

Zangoh

Compare

No reviews yet

Filter interviews by

Zangoh Data Scientist Interview Questions and Answers for Experienced

Updated 18 Dec 2023

Zangoh Data Scientist Interview Experiences for Experienced

1 interview found

Data Scientist Interview Questions & Answers

Shlok Jain

posted on 18 Dec 2023

Interview experience

Poor

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Approached by Company and was interviewed in Nov 2023. There were 3 interview rounds.

Round 1 - Resume Shortlist

Pro Tip by AmbitionBox:

Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.

View all tips

Round 2 - Aptitude Test

Easy, questions seems familiar to that of I practice

Round 3 - Coding Test

No testing of output was available either you code gives correct output or alot of error. And MCQ of coding round was mediocre . And at the last it was suppose to be HR round if selected, best regards.

Interview Preparation Tips

Interview preparation tips for other job seekers - As a fellow who did gave the test they have soo high expectations with output and team responsibility, if you are a team player then only apply their.

Top trending discussions

View All

Interview Tips & Stories

honestword

ex -

Oxane Partners

Disappointing Interview Experience with D.E. Shaw

I recently interviewed with D.E. Shaw, and it turned out to be one of the most draining experiences I’ve had. The interviewer’s approach felt unnecessarily harsh and dismissive, which made the entire process uncomfortable. At multiple points, the interviewer asked me to drop off the call, even when I tried asking basic questions about the designation, job description, and other details. After the interview, I found myself sitting there for a few minutes, trying to process what just happened. It left me questioning the overall tone and professionalism of the interview. Definitely not the experience I was expecting from such a well-known firm.

Got a question about Zangoh?

Ask anonymously on communities.

Interview questions from similar companies

Data Scientist Interview Questions & Answers

Infosys

Anonymous

posted on 30 May 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

Not Selected

I applied via Job Portal and was interviewed in Apr 2024. There was 1 interview round.

Round 1 - Technical

(9 Questions)

Q1. Explain XGBoost algoritm

Ans.

XGBoost is a powerful machine learning algorithm known for its speed and performance in handling large datasets.

XGBoost stands for eXtreme Gradient Boosting, which is an implementation of gradient boosting machines.
It is widely used in machine learning competitions and is known for its speed and performance.
XGBoost uses a technique called boosting, where multiple weak learners are combined to create a strong learner.
It...

Answered by AI

Add your answer

Q2. XgBoost algorithm has 10-20 features. How are the splits decided, on which feature are they going to be divided?

Ans.

XgBoost algorithm uses a greedy approach to determine splits based on feature importance.

XgBoost algorithm calculates the information gain for each feature to determine the best split.
The feature with the highest information gain is chosen for the split.
This process is repeated recursively for each node in the tree.
Features can be split based on numerical values or categories.
Example: If a feature like 'age' has the hi...

Answered by AI

View 1 more answer

Q3. Do you have any experience on cloud platform?

Ans.

Yes, I have experience working on cloud platforms such as AWS and Google Cloud.

Experience with AWS services like S3, EC2, and Redshift
Familiarity with Google Cloud services like BigQuery and Compute Engine
Utilized cloud platforms for data storage, processing, and analysis

Answered by AI

Add your answer

Q4. What is entropy, information gain?

Ans.

Entropy is a measure of randomness or uncertainty in a dataset, while information gain is the reduction in entropy after splitting a dataset based on a feature.

Entropy is used in decision tree algorithms to determine the best feature to split on.
Information gain measures the effectiveness of a feature in classifying the data.
Higher information gain indicates that a feature is more useful for splitting the data.
Entropy ...

Answered by AI

Add your answer

Q5. What is hypothesis testing?

Ans.

Hypothesis testing is a statistical method used to make inferences about a population based on sample data.

Hypothesis testing involves formulating a null hypothesis and an alternative hypothesis.
The null hypothesis is assumed to be true until there is enough evidence to reject it.
Statistical tests are used to determine the likelihood of observing the data if the null hypothesis is true.
The p-value is used to determine ...

Answered by AI

Add your answer

Q6. Explain precision and recall, when are they used in which scenario?

Ans.

Precision and recall are metrics used in evaluating the performance of classification models.

Precision measures the accuracy of positive predictions, while recall measures the ability of the model to find all positive instances.
Precision = TP / (TP + FP)
Recall = TP / (TP + FN)
Precision is important when false positives are costly, while recall is important when false negatives are costly.
For example, in a spam email de...

Answered by AI

Add your answer

Q7. What is data imbalance?

Ans.

Data imbalance refers to unequal distribution of classes in a dataset, where one class has significantly more samples than others.

Data imbalance can lead to biased models that favor the majority class.
It can result in poor performance for minority classes, as the model may struggle to accurately predict them.
Techniques like oversampling, undersampling, and using different evaluation metrics can help address data imbala...

Answered by AI

Add your answer

Q8. What is SMOTE? Do you have any experience working on Time Series? Code analysis of global variable?

Ans.

SMOTE stands for Synthetic Minority Over-sampling Technique, used to balance imbalanced datasets by generating synthetic samples.

SMOTE is commonly used in machine learning to address class imbalance by creating synthetic samples of the minority class.
It works by generating new instances of the minority class by interpolating between existing instances.
SMOTE is particularly useful in scenarios where the minority class i...

Answered by AI

Add your answer

Q9. Find 5th highest salary in every department. What are window functions Difference between union and union all Difference between delete and truncate.

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare basics well. Go through the top questions asked for SQL,Python,Data Science.
Well versed with resume projects and concepts used in it.

Skills evaluated in this interview

Data Scientist Interview Questions & Answers

Deloitte

Anonymous

posted on 30 Nov 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

4-6 weeks

Result

Selected

I applied via Approached by Company and was interviewed in May 2024. There were 3 interview rounds.

Round 1 - Coding Test

DSA was asked. And general coding language questions were asked. Previous experience based questions were asked.

Round 2 - Coding Test

Machine Learning, Generative AI, Deep learning interview questions. 2 Coding problems based on Algorithms.

Round 3 - Technical

(3 Questions)

Q1. One systems design question was there.

Add your answer

Q2. Situations based problems was there.

Add your answer

Q3. Questions on Past experiences.

Add your answer

Data Scientist Interview Questions & Answers

TCS

Anonymous

posted on 14 May 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Round 1 - One-on-one

(4 Questions)

Q1. Cosine similarity

Ans.

Cosine similarity measures the similarity between two non-zero vectors in an inner product space.

Cosine similarity ranges from -1 to 1, with 1 indicating identical vectors and -1 indicating opposite vectors.
It is commonly used in information retrieval, text mining, and recommendation systems.
Formula: cos(theta) = (A . B) / (||A|| * ||B||)
Example: Calculating similarity between two documents based on their word frequenc

Answered by AI

Add your answer

Q2. What is difference between recall and precission

Ans.

Recall is the ratio of correctly predicted positive observations to the all observations in actual class, while precision is the ratio of correctly predicted positive observations to the total predicted positive observations.

Recall is about the actual positive instances that were correctly identified by the model.
Precision is about the predicted positive instances and how many of them were actually positive.
Recall = Tr...

Answered by AI

View 1 more answer

Q3. How to remove stop words and how it works

Ans.

Stop words are common words like 'the', 'is', 'and' that are removed from text data to improve analysis.

Stop words are commonly removed from text data to improve the accuracy of natural language processing tasks.
They are typically removed before tokenization and can be done using libraries like NLTK or spaCy.
Examples of stop words include 'the', 'is', 'and', 'in', 'on', etc.

Answered by AI

Add your answer

Q4. Whats the goal of project

Add your answer

Round 2 - One-on-one

(1 Question)

Q1. Pipeline design

Add your answer

Skills evaluated in this interview

Data Scientist Interview Questions & Answers

HDFC Bank

G R

posted on 26 Apr 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

4-6 weeks

Result

No response

I applied via Naukri.com and was interviewed in Mar 2024. There were 3 interview rounds.

Round 1 - One-on-one

(3 Questions)

Q1. Machine learning algorithms.

Ans.

Machine learning algorithms are tools used to analyze data, identify patterns, and make predictions without being explicitly programmed.

Machine learning algorithms can be categorized into supervised, unsupervised, and reinforcement learning.
Examples of machine learning algorithms include linear regression, decision trees, support vector machines, and neural networks.
These algorithms require training data to learn patte...

Answered by AI

Add your answer

Q2. Credit risk life cycle

Add your answer

Q3. Pandas related questions

Add your answer

Round 2 - One-on-one

(3 Questions)

Q1. Steps of developing a credit risk model

Ans.

Developing a credit risk model involves several steps to assess the likelihood of a borrower defaulting on a loan.

1. Define the problem and objectives of the credit risk model.
2. Gather relevant data such as credit history, income, debt-to-income ratio, etc.
3. Preprocess the data by handling missing values, encoding categorical variables, and scaling features.
4. Select a suitable machine learning algorithm such as logi...

Answered by AI

Add your answer

Q2. Pandas related questions

Add your answer

Q3. Bagging and Boosting

Add your answer

Round 3 - One-on-one

(3 Questions)

Q1. Explain AIC and BIC

Ans.

AIC and BIC are statistical measures used for model selection in the context of regression analysis.

AIC (Akaike Information Criterion) is used to compare the goodness of fit of different models. It penalizes the model for the number of parameters used.
BIC (Bayesian Information Criterion) is similar to AIC but penalizes more heavily for the number of parameters, making it more suitable for model selection when the focus...

Answered by AI

Add your answer

Q2. Difference between xgboost and lightgbm

Ans.

XGBoost is a popular gradient boosting library while LightGBM is a faster and more memory-efficient alternative.

XGBoost is known for its accuracy and performance on structured/tabular data.
LightGBM is faster and more memory-efficient, making it suitable for large datasets.
LightGBM uses a histogram-based algorithm for splitting whereas XGBoost uses a level-wise tree growth strategy.

Answered by AI

Add your answer

Q3. Bagging and boosting

Add your answer

Skills evaluated in this interview

Data Scientist Interview Questions & Answers

Wipro

Anonymous

posted on 6 Aug 2024

Interview experience

Average

Difficulty level

Process Duration

Result

I applied via Naukri.com and was interviewed in Jul 2024. There was 1 interview round.

Round 1 - Technical

(1 Question)

Q1. Pandas basics and SQL joins nlp

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - It was good

Data Scientist Interview Questions & Answers

Accenture

Anonymous

posted on 21 May 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Technical

(1 Question)

Q1. Explain a project written in resume.

Add your answer

Round 2 - Technical

(1 Question)

Q1. Explain a project from you resume.

Add your answer

Round 3 - HR

(1 Question)

Q1. What are your strengths?

Add your answer

Data Scientist Interview Questions & Answers

Tech Mahindra

Anonymous

posted on 25 Jun 2024

Interview experience

Average

Difficulty level

Process Duration

Result

Selected

Round 1 - One-on-one

(2 Questions)

Q1. Difference between supervised and unsupervised learning

Ans.

Supervised learning uses labeled data to train a model, while unsupervised learning uses unlabeled data.

Supervised learning requires a target variable for training the model.
Examples of supervised learning include classification and regression.
Unsupervised learning finds patterns and relationships in data without a target variable.
Examples of unsupervised learning include clustering and dimensionality reduction.

Answered by AI

Add your answer

Q2. What is sigmoid function

Ans.

Sigmoid function is a mathematical function that maps any real value to a value between 0 and 1.

Used in machine learning for binary classification problems to produce probabilities
Commonly used in logistic regression
Has an S-shaped curve
Equation: f(x) = 1 / (1 + e^(-x))

Answered by AI

Add your answer

Skills evaluated in this interview

Data Scientist Interview Questions & Answers

Capgemini

Anonymous

posted on 10 Jul 2024

Interview experience

Poor

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. What is overfitting and underfitting

Ans.

Overfitting occurs when a model learns the training data too well, leading to poor performance on new data. Underfitting occurs when a model is too simple to capture the underlying patterns in the data.

Overfitting: Model is too complex, fits noise in the training data, performs poorly on new data
Underfitting: Model is too simple, fails to capture underlying patterns in the data, performs poorly on both training and new...

Answered by AI

Add your answer

Q2. What are LLM Models

Ans.

LLM models, or Language Model Models, are a type of machine learning model that focuses on predicting the next word in a sequence of words.

LLM models are commonly used in natural language processing tasks such as text generation, machine translation, and speech recognition.
They are trained on large amounts of text data to learn the relationships between words and predict the most likely next word in a given context.
Exa...

Answered by AI

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare for python questions

Skills evaluated in this interview

Data Scientist Interview Questions & Answers

Cognizant

Anonymous

posted on 3 Apr 2024

Interview experience

Good

Difficulty level

Moderate

Process Duration

2-4 weeks

Result

I applied via Naukri.com and was interviewed in Oct 2023. There were 2 interview rounds.

Round 1 - Technical

(3 Questions)

Q1. Discussion on projects mentioned in the Resume

Add your answer

Q2. Two Python live coding questions

Add your answer

Q3. Few DL, ML, Transformers, and NLP questions Few questions on deployment tools mentioned in the resume

Add your answer

Round 2 - HR

(1 Question)

Q1. General HR discussion

Add your answer

Zangoh Interview FAQs

How many rounds are there in Zangoh Data Scientist interview for experienced candidates?

Zangoh interview process for experienced candidates usually has 3 rounds. The most common rounds in the Zangoh interview process for experienced candidates are Resume Shortlist, Aptitude Test and Coding Test.