Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by C5i Team. If you also belong to the team, you can get access from here

C5i Verified Tick

Compare button icon Compare button icon Compare

Filter interviews by

C5i Data Scientist Interview Questions, Process, and Tips for Freshers

Updated 17 Dec 2024

C5i Data Scientist Interview Experiences for Freshers

1 interview found

Data Scientist Interview Questions & Answers

user image rustam garg

posted on 4 May 2019

I applied via Recruitment Consultant and was interviewed in Dec 2018. There were 3 interview rounds.

Interview Questionnaire 

11 Questions

  • Q1. 1. Why Machine Learning?
  • Q2. 2. Why did you choose Data Science Field?
  • Ans. 

    I chose Data Science field because of its potential to solve complex problems and make a positive impact on society.

    • Fascination with data and its potential to drive insights

    • Desire to solve complex problems and make a positive impact on society

    • Opportunity to work with cutting-edge technology and tools

    • Ability to work in a variety of industries and domains

    • Examples: Predictive maintenance in manufacturing, fraud detection

  • Answered by AI
  • Q3. 3. What about Linear Regression? (Theory Part)
  • Q4. 4. What is the difference between Linear Regression and Logistic Regression?
  • Ans. 

    Linear Regression is used for predicting continuous numerical values, while Logistic Regression is used for predicting binary categorical values.

    • Linear Regression predicts a continuous output, while Logistic Regression predicts a binary output.

    • Linear Regression uses a linear equation to model the relationship between the independent and dependent variables, while Logistic Regression uses a logistic function.

    • Linear Regr...

  • Answered by AI
  • Q5. 5. Explain Confusion Matrix?
  • Ans. 

    Confusion matrix is a table used to evaluate the performance of a classification model.

    • It is a 2x2 matrix that shows the number of true positives, false positives, true negatives, and false negatives.

    • It helps in calculating various metrics like accuracy, precision, recall, and F1 score.

    • It is useful in identifying the strengths and weaknesses of a model and improving its performance.

    • Example: In a binary classification p...

  • Answered by AI
  • Q6. 6. Can we use confusion matrix in Linear Regression?
  • Ans. 

    No, confusion matrix is not used in Linear Regression.

    • Confusion matrix is used to evaluate classification models.

    • Linear Regression is a regression model, not a classification model.

    • Evaluation metrics for Linear Regression include R-squared, Mean Squared Error, etc.

  • Answered by AI
  • Q7. 7. Explain KNN Algorithm?
  • Ans. 

    KNN is a non-parametric algorithm used for classification and regression tasks.

    • KNN stands for K-Nearest Neighbors.

    • It works by finding the K closest data points to a given test point.

    • The class or value of the test point is then determined by the majority class or average value of the K neighbors.

    • KNN can be used for both classification and regression tasks.

    • It is a simple and easy-to-understand algorithm, but can be compu

  • Answered by AI
  • Q8. 8. Explain Random Forest and Decision Tree?
  • Ans. 

    Random Forest is an ensemble learning method that builds multiple decision trees and combines their outputs to improve accuracy.

    • Random Forest is a type of supervised learning algorithm used for classification and regression tasks.

    • It creates multiple decision trees and combines their outputs to make a final prediction.

    • Each decision tree is built using a random subset of features and data points to reduce overfitting.

    • Ran...

  • Answered by AI
  • Q9. 9. One Tricky Mathematical Question !
  • Q10. 10. What are the Projects you have done?
  • Ans. 

    I have worked on various projects involving data analysis, machine learning, and predictive modeling.

    • Developed a predictive model to forecast customer churn for a telecommunications company.

    • Built a recommendation system using collaborative filtering for an e-commerce platform.

    • Performed sentiment analysis on social media data to understand customer opinions and preferences.

    • Implemented a fraud detection system using anom...

  • Answered by AI
  • Q11. I didn't get shortlisted for 2nd Round.

Interview Preparation Tips

General Tips: anyone who wants to go in data science field should actually be interested in the field not the money. They should be good in Statistics, Probability and Theory part of ML algorithms.
They will ask you about the projects you have mentioned in resume and all the questions will be from that part.
Skills: Communication, Body Language, Problem Solving, Analytical Skills
Duration: 1-4 weeks

Skills evaluated in this interview

Data Scientist Jobs at C5i

View all

Interview questions from similar companies

Interview Questionnaire 

1 Question

  • Q1. What friends think of you?
  • Ans. 

    My friends think of me as reliable, supportive, and always up for a good time.

    • Reliable - always there when they need help or support

    • Supportive - willing to listen and offer advice

    • Fun-loving - enjoys socializing and trying new things

  • Answered by AI

Interview Preparation Tips

Round: Resume Shortlist
Experience: After Resume Shortlist we had an aptitute round.
Tips: Answer according to your own judgement. Dont try to be too precise.

Round: HR Interview
Experience: I said they think I am a workaholic as I prefer to complete my work before chilling with them.

College Name: NIT Durgapur

Interview Preparation Tips

Round: HR Interview
Experience: Interview at 11 pm. Stressed environment, close to stress interview.
SELECTION PROCEDURE:
1.Online Test
2. GD
3. PI(HR)
GD TOPICS :
Topic 1 : How can education system benefit from interdisciplinary methods.
Topic 2 : Interconnected problems in the field of movie making.
INTERVIEW EXPERIENCE:
So you can speak German? Describe MS Dhoni in german. They opened Google Translate to counter check the words they wanted to be translated in both Deutsch and Spanish. Your profile speaks of an inclination towards software skills, why do you want to join an analytics company? Justify your action in two reasons as to why are you sitting here interviewing for the post of a data scientist rather than apply for a software engineer when this CV speaks highly of computer science? What is Finite Element Method? Explain. How relevant is your work in Computer Vision? Breakdown the tagline of Audi and translate accordingly. What is "Technik für Mobel" ? What are your current projects? Answer : Microsoft Xbox Kinect, Gesture Recognition. Counter question : But at Musimga you'd be doing far simpler stuff.? Counter suggestion : Why don't you go for MS?


Tips: Keep your cool during counter questions. Prepare your profile and CV well. Rest all is your hard work and groomed personal talents and acquired skills you learnt over the internet.

Skills: Ability To Cope Up With Stress, Spanish, German, Finite Element Modeling - FEM, Foreign Language
College Name: NIT Raipur
Funny Moments: Another HR enters in the midst of my interview and asks with bewildered amazement : What language is he speaking?
The other HR, "German".

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
-

I applied via Company Website and was interviewed in Aug 2024. There was 1 interview round.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Explain the RAG pipeline?
  • Ans. 

    RAG pipeline is a data processing pipeline used in data science to categorize data into Red, Amber, and Green based on certain criteria.

    • RAG stands for Red, Amber, Green which are used to categorize data based on certain criteria

    • Red category typically represents data that needs immediate attention or action

    • Amber category represents data that requires monitoring or further investigation

    • Green category represents data that...

  • Answered by AI
  • Q2. Explain Confusion metrics
  • Ans. 

    Confusion metrics are used to evaluate the performance of a classification model by comparing predicted values with actual values.

    • Confusion matrix is a table that describes the performance of a classification model.

    • It consists of four different metrics: True Positive, True Negative, False Positive, and False Negative.

    • These metrics are used to calculate other evaluation metrics like accuracy, precision, recall, and F1 s...

  • Answered by AI

Skills evaluated in this interview

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Explain any ML model.
  • Ans. 

    Random Forest is an ensemble learning method that builds multiple decision trees and merges them together to get a more accurate and stable prediction.

    • Random Forest is a popular machine learning model used for classification and regression tasks.

    • It works by creating multiple decision trees during training and outputs the mode of the classes for classification or the average prediction for regression.

    • Random Forest is ro...

  • Answered by AI
  • Q2. Create Dataframe from two lists.
  • Ans. 

    Creating a DataFrame from two lists in Python.

    • Import the pandas library

    • Create two lists of data

    • Use pd.DataFrame() to create a DataFrame from the two lists

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Nielsen Data Scientist interview:
  • Python
  • pandas
  • ML
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
6-8 weeks
Result
Selected Selected

I applied via Naukri.com and was interviewed in Oct 2023. There were 4 interview rounds.

Round 1 - Coding Test 

Sql, python, Statistics mcq, Aptitude test. These were medium level questions.

Round 2 - Technical 

(3 Questions)

  • Q1. SQL and python, time complexity
  • Q2. Make 2 lists a=[1,2,3,4] b=[9,8,5,5,2,3,3,4,1,1,10,9,2,3,4,10,10,9,7,7,8] Write a program to remove duplicate of b and keep only those elements of b which are not present in a, and the final list should ...
  • Ans. 

    Remove duplicates from list b, keep elements not in list a, and sort in ascending order.

    • Create a set from list b to remove duplicates

    • Use list comprehension to keep elements not in list a

    • Sort the final list in ascending order

  • Answered by AI
  • Q3. SQL question Remove duplicate from a table tab1
  • Ans. 

    Use the DISTINCT keyword in SQL to remove duplicates from a table.

    • Use the SELECT DISTINCT statement to retrieve unique rows from the table.

    • Identify the columns that should be used to determine uniqueness.

    • Example: SELECT DISTINCT column1, column2 FROM tab1;

  • Answered by AI
Round 3 - Case Study 

Given 2 case studies on data science and asked different possibilities to improve the models.

How to work with imbalance dataset.
How to remove null values, what is features engineering.
What is PCA
What is the working of XGBOOST

Round 4 - Project discussion 

(1 Question)

  • Q1. What was last project, tell me in detail. There were different technical questions related to my project

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident and practice SQL, python, mainly pandas and numpy. Should have good knowledge on time complexity.


All the metrics of evaluating a model.
Linear regression, logestic regression, random forest, decission tree, adaboost, Gradient boosting, XGb in detail.

Recall, precision roc_curve. Auc, f1 score, mse,mae, r2, adjusted r2 score.

Is it possible that r2 score appears in minus

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
4-6 weeks
Result
Selected Selected

I applied via Referral and was interviewed in Mar 2024. There were 4 interview rounds.

Round 1 - Aptitude Test 

Test consisted 7 sections which lasted for more than a hour. There were questions related to coding , sql , analytical questions etc.

Round 2 - Coding Test 

1 python coding questions and 2 Sql question

Round 3 - Case Study 

2 case study question

Round 4 - HR 

(1 Question)

  • Q1. This was the final round . Technical+HR .

Interview Preparation Tips

Interview preparation tips for other job seekers - All the best. Interview process is too long , have patience. And be prepared , questions will be basic.
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via LinkedIn and was interviewed before Mar 2022. There were 3 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(3 Questions)

  • Q1. Questions related to python basics.
  • Q2. Basic calculus to test math skills.
  • Q3. Machine learning metrics.
Round 3 - HR 

(3 Questions)

  • Q1. Policies, behavioral round.
  • Q2. Work culture and the selection process.
  • Q3. Discussion about previous employers and educational background.

Interview Preparation Tips

Interview preparation tips for other job seekers - keep it simple and be confident. it's good to know the reasons behind your data science projects.
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Selected Selected

I applied via Referral and was interviewed in May 2024. There were 3 interview rounds.

Round 1 - Coding Test 

I was asked to write SQL queries for 3rd highest salary of the employee, some name filtering, group by tasks.
Python code to find the index of the maximum number without using numpy.

Round 2 - One-on-one 

(1 Question)

  • Q1. Explain the Project undertaken during the research and follow-up questions
Round 3 - Technical 

(1 Question)

  • Q1. Write pandas query to separate the names as first and last name from the full name. Drop the duplicate columns and also the missing values. Write output for the Python code. Write SQL query to retrieve t...
  • Ans. 

    Answering questions related to data science concepts and techniques.

    • Recall is the ratio of correctly predicted positive observations to the total actual positives. Precision is the ratio of correctly predicted positive observations to the total predicted positives.

    • To reduce variance in an ensemble model, techniques like bagging, boosting, and stacking can be used. Bagging involves training multiple models on different ...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Nielsen Data Scientist interview:
  • Python
  • Pandas
  • SQL
  • Machine Learning
Interview preparation tips for other job seekers - Have your basics strong.

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Campus Placement and was interviewed in Jul 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Don’t add your photo or details such as gender, age, and address in your resume. These details do not add any value.
View all tips
Round 2 - Aptitude Test 

Medium level of question are there in this section

Round 3 - Group Discussion 

Basic level of question are there in this section

Round 4 - Technical 

(2 Questions)

  • Q1. Q. What is joints? Q. What is linear search? Q. What is your hobby?
  • Ans. 

    Joints are connections between bones that allow movement and provide support to the body.

    • Joints are found throughout the body, such as the knee, elbow, and shoulder.

    • They are made up of bones, cartilage, ligaments, and synovial fluid.

    • Joints enable various types of movements, including flexion, extension, rotation, and abduction.

    • Different types of joints include hinge joints, ball-and-socket joints, and pivot joints.

    • Join...

  • Answered by AI
  • Q2. What is joints and what is your hobby?
  • Ans. 

    I'm not sure how joints relate to data science, but my hobby is playing guitar.

    • Joints can refer to the connection between two bones in the body or the way two things are connected or joined together.

    • Playing guitar is a hobby that helps me relax and unwind after a long day of working with data.

    • While seemingly unrelated to data science, playing an instrument can actually improve cognitive function and creativity, which c

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Basic SQL and Java language are enough for the interview

Skills evaluated in this interview

C5i Interview FAQs

How many rounds are there in C5i Data Scientist interview for freshers?
C5i interview process for freshers usually has 2-3 rounds. The most common rounds in the C5i interview process for freshers are Assignment, Technical and HR.
How to prepare for C5i Data Scientist interview for freshers?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at C5i. The most common topics and skills that interviewers at C5i expect are Python, Data Science, Machine Learning, SQL and Deep Learning.
What are the top questions asked in C5i Data Scientist interview for freshers?

Some of the top questions asked at the C5i Data Scientist interview for freshers -

  1. 4. What is the difference between Linear Regression and Logistic Regressi...read more
  2. 2. Why did you choose Data Science Fie...read more
  3. 6. Can we use confusion matrix in Linear Regressi...read more

Tell us how to improve this page.

Interview Questions from Similar Companies

Mu Sigma Interview Questions
2.6
 • 234 Interviews
Crisil Interview Questions
3.6
 • 192 Interviews
Indegene Interview Questions
3.4
 • 152 Interviews
Nielsen Interview Questions
3.7
 • 122 Interviews
Netscribes Interview Questions
2.8
 • 86 Interviews
Cotiviti Interview Questions
3.7
 • 77 Interviews
View all
C5i Data Scientist Salary
based on 51 salaries
₹4 L/yr - ₹13.9 L/yr
38% less than the average Data Scientist Salary in India
View more details

C5i Data Scientist Reviews and Ratings

based on 6 reviews

4.0/5

Rating in categories

3.9

Skill development

3.2

Work-life balance

2.9

Salary

4.4

Job security

3.1

Company culture

2.7

Promotions

3.1

Work satisfaction

Explore 6 Reviews and Ratings
Data Scientist(GEN AI LLM)

Gurgaon / Gurugram,

Bangalore / Bengaluru

+1

2-5 Yrs

Not Disclosed

Explore more jobs
Analyst
252 salaries
unlock blur

₹3 L/yr - ₹10.8 L/yr

Senior Analyst
201 salaries
unlock blur

₹4.2 L/yr - ₹14.6 L/yr

Data Analyst
197 salaries
unlock blur

₹2.5 L/yr - ₹8.2 L/yr

Associate Manager
128 salaries
unlock blur

₹7.8 L/yr - ₹21.1 L/yr

Junior Analyst
117 salaries
unlock blur

₹2.9 L/yr - ₹7 L/yr

Explore more salaries
Compare C5i with

Crisil

3.6
Compare

CorroHealth infotech

3.3
Compare

Indegene

3.4
Compare

Acuity Knowledge Partners

3.3
Compare
Did you find this page helpful?
Yes No
write
Share an Interview