Premium Employer

i

This company page is being actively managed by Affine Team. If you also belong to the team, you can get access from here

Affine Verified Tick Work with us arrow

Compare button icon Compare button icon Compare

Filter interviews by

Affine Data Scientist Interview Questions and Answers

Updated 17 Jul 2024

19 Interview questions

A Data Scientist was asked 11mo ago
Q. What is the R-squared value?
Ans. 

R-squared value is a statistical measure that represents the proportion of the variance in the dependent variable that is predictable from the independent variable(s).

  • R-squared value ranges from 0 to 1, with 1 indicating a perfect fit.

  • It is used to evaluate the goodness of fit of a regression model.

  • A higher R-squared value indicates that the model explains a larger proportion of the variance in the dependent varia...

A Data Scientist was asked
Q. What are AIC and BIC in linear regression?
Ans. 

AIC & BIC are statistical measures used to evaluate the goodness of fit of a linear regression model.

  • AIC stands for Akaike Information Criterion and BIC stands for Bayesian Information Criterion.

  • Both AIC and BIC are used to compare different models and select the best one.

  • AIC penalizes complex models less severely than BIC.

  • Lower AIC/BIC values indicate a better fit of the model to the data.

  • AIC and BIC can be calcu...

Data Scientist Interview Questions Asked at Other Companies

Q1. for a data with 1000 samples and 700 dimensions, how would you fi ... read more
Q2. Special Sum of Array Problem Statement Given an array 'arr' conta ... read more
asked in Affine
Q3. You have a pandas dataframe with three columns filled with state ... read more
asked in Walmart
Q4. Describe the data you would analyze to solve cost and revenue opt ... read more
Q5. Clone a Linked List with Random Pointers Given a linked list wher ... read more
A Data Scientist was asked
Q. What is the purpose of a lambda function when regular functions exist? How are they different?
Ans. 

Lambda functions are anonymous functions used for short and simple operations. They are different from regular functions in their syntax and usage.

  • Lambda functions are defined without a name and keyword 'lambda' is used to define them.

  • They can take any number of arguments but can only have one expression.

  • They are commonly used in functional programming and as arguments to higher-order functions.

  • Lambda functions ar...

A Data Scientist was asked
Q. How can you create dictionaries in Python with repeated keys?
Ans. 

To create dictionaries in Python with repeated keys, use defaultdict from the collections module.

  • Import the collections module

  • Create a defaultdict object

  • Add key-value pairs to the dictionary using the same key multiple times

  • Access the values using the key

  • Example: from collections import defaultdict; d = defaultdict(list); d['key'].append('value1'); d['key'].append('value2')

A Data Scientist was asked
Q. How do you perform manipulations more quickly in pandas?
Ans. 

Use vectorized operations, avoid loops, and optimize memory usage.

  • Use vectorized operations like apply(), map(), and applymap() instead of loops.

  • Avoid using iterrows() and itertuples() as they are slower than vectorized operations.

  • Optimize memory usage by using appropriate data types and dropping unnecessary columns.

  • Use inplace=True parameter to modify the DataFrame in place instead of creating a copy.

  • Use the pd.e...

A Data Scientist was asked
Q. How can you efficiently read large .csv files using pandas?
Ans. 

Use pandas' read_csv() method with appropriate parameters to read large .csv files quickly.

  • Use the chunksize parameter to read the file in smaller chunks

  • Use the low_memory parameter to optimize memory usage

  • Use the dtype parameter to specify data types for columns

  • Use the usecols parameter to read only necessary columns

  • Use the skiprows parameter to skip unnecessary rows

  • Use the nrows parameter to read only a specific...

A Data Scientist was asked
Q. How do you retain special characters (that pandas discards by default) in the data while reading it?
Ans. 

To retain special characters in pandas data, use encoding parameter while reading the data.

  • Use encoding parameter while reading the data in pandas

  • Specify the encoding type of the data file

  • Example: pd.read_csv('filename.csv', encoding='utf-8')

Are these interview questions helpful?
A Data Scientist was asked
Q. What are the differences between merge and join operations in pandas?
Ans. 

Merge and join are used to combine dataframes in pandas.

  • Merge is used to combine dataframes based on a common column or index.

  • Join is used to combine dataframes based on their index.

  • Merge can handle different column names, while join cannot.

  • Merge can handle different types of joins (inner, outer, left, right), while join only does inner join by default.

A Data Scientist was asked
Q. How does lookup happen in a list when you do my_list[5]?
Ans. 

my_list[5] retrieves the 6th element of the list.

  • Indexing starts from 0 in Python.

  • The integer inside the square brackets is the index of the element to retrieve.

  • If the index is out of range, an IndexError is raised.

A Data Scientist was asked
Q. How will the resultant table be when you merge two tables that match on a column, and the second table has many repeated keys?
Ans. 

The resultant table will have all the columns from both tables and the rows will be a combination of matching rows.

  • The resultant table will have all the columns from both tables

  • The rows in the resultant table will be a combination of matching rows

  • If the second table has repeated keys, there will be multiple rows with the same key in the resultant table

Affine Data Scientist Interview Experiences

5 interviews found

Interview experience
1
Bad
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Referral and was interviewed in Mar 2024. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. Questions based on my work experience, basic level of python coding (return counts for duplicate integers in a list), theory questions on basic DL (activation layers, time series)

Interview Preparation Tips

Interview preparation tips for other job seekers - Do not get upset if you fail interviews, many of these end up hiring internally or are simply following the protocol of interviewing you after already having selected some other candidate. Just keep on trying!
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. What is r squared value
  • Ans. 

    R-squared value is a statistical measure that represents the proportion of the variance in the dependent variable that is predictable from the independent variable(s).

    • R-squared value ranges from 0 to 1, with 1 indicating a perfect fit.

    • It is used to evaluate the goodness of fit of a regression model.

    • A higher R-squared value indicates that the model explains a larger proportion of the variance in the dependent variable.

    • F...

  • Answered by AI

Data Scientist Interview Questions & Answers

user image Dheeraj Warudkar

posted on 8 Jul 2024

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. Machine learning
Round 2 - Coding Test 

Python , pandas, sql

I applied via Naukri.com and was interviewed in Feb 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Coding Test 

Test had a mix of questions on Statistics, Probability, Machine Learning, SQL and Python.

Round 3 - Technical 

(11 Questions)

  • Q1. How to retain special characters (that pandas discards by default) in the data while reading it?
  • Ans. 

    To retain special characters in pandas data, use encoding parameter while reading the data.

    • Use encoding parameter while reading the data in pandas

    • Specify the encoding type of the data file

    • Example: pd.read_csv('filename.csv', encoding='utf-8')

  • Answered by AI
  • Q2. How to read large .csv files in pandas quickly?
  • Ans. 

    Use pandas' read_csv() method with appropriate parameters to read large .csv files quickly.

    • Use the chunksize parameter to read the file in smaller chunks

    • Use the low_memory parameter to optimize memory usage

    • Use the dtype parameter to specify data types for columns

    • Use the usecols parameter to read only necessary columns

    • Use the skiprows parameter to skip unnecessary rows

    • Use the nrows parameter to read only a specific numb...

  • Answered by AI
  • Q3. How do perform the manipulations quicker in pandas?
  • Ans. 

    Use vectorized operations, avoid loops, and optimize memory usage.

    • Use vectorized operations like apply(), map(), and applymap() instead of loops.

    • Avoid using iterrows() and itertuples() as they are slower than vectorized operations.

    • Optimize memory usage by using appropriate data types and dropping unnecessary columns.

    • Use inplace=True parameter to modify the DataFrame in place instead of creating a copy.

    • Use the pd.eval()...

  • Answered by AI
  • Q4. Explain generators and decorators in python
  • Ans. 

    Generators are functions that allow you to iterate over a sequence of values without creating the entire sequence in memory. Decorators are functions that modify the behavior of other functions.

    • Generators use the yield keyword to return values one at a time

    • Generators are memory efficient and can handle large datasets

    • Decorators are functions that take another function as input and return a modified version of that funct...

  • Answered by AI
  • Q5. You have a pandas dataframe with three columns, filled with state names, city names and arbitrary numbers respectively. How to retrieve top 2 cities per state. (top according to the max number in the third...
  • Q6. How does look up happens in a list when you do my_list[5]?
  • Ans. 

    my_list[5] retrieves the 6th element of the list.

    • Indexing starts from 0 in Python.

    • The integer inside the square brackets is the index of the element to retrieve.

    • If the index is out of range, an IndexError is raised.

  • Answered by AI
  • Q7. How to create dictionaries in python with repeated keys?
  • Ans. 

    To create dictionaries in Python with repeated keys, use defaultdict from the collections module.

    • Import the collections module

    • Create a defaultdict object

    • Add key-value pairs to the dictionary using the same key multiple times

    • Access the values using the key

    • Example: from collections import defaultdict; d = defaultdict(list); d['key'].append('value1'); d['key'].append('value2')

  • Answered by AI
  • Q8. What is the purpose of lambda function when regural functions(of def) exist? how are they different?
  • Ans. 

    Lambda functions are anonymous functions used for short and simple operations. They are different from regular functions in their syntax and usage.

    • Lambda functions are defined without a name and keyword 'lambda' is used to define them.

    • They can take any number of arguments but can only have one expression.

    • They are commonly used in functional programming and as arguments to higher-order functions.

    • Lambda functions are oft...

  • Answered by AI
  • Q9. Merge vs join in pandas
  • Ans. 

    Merge and join are used to combine dataframes in pandas.

    • Merge is used to combine dataframes based on a common column or index.

    • Join is used to combine dataframes based on their index.

    • Merge can handle different column names, while join cannot.

    • Merge can handle different types of joins (inner, outer, left, right), while join only does inner join by default.

  • Answered by AI
  • Q10. How will the resultant table be, when you "merge" two tables that match at a column. and the second table has many of keys repeated.
  • Ans. 

    The resultant table will have all the columns from both tables and the rows will be a combination of matching rows.

    • The resultant table will have all the columns from both tables

    • The rows in the resultant table will be a combination of matching rows

    • If the second table has repeated keys, there will be multiple rows with the same key in the resultant table

  • Answered by AI
  • Q11. Some questions on spacy and NLP models and my project.
Round 4 - Technical 

(8 Questions)

  • Q1. Explain eign vectors and eign values? what purpose do they serve in ML?
  • Ans. 

    Eigenvalues and eigenvectors are linear algebra concepts used in machine learning for dimensionality reduction and feature extraction.

    • Eigenvalues represent the scaling factor of the eigenvectors.

    • Eigenvectors are the directions along which a linear transformation acts by stretching or compressing.

    • In machine learning, eigenvectors are used for principal component analysis (PCA) to reduce the dimensionality of data.

    • Eigenv...

  • Answered by AI
  • Q2. Explain PCA briefly? what can it be used for and what can it not be used for?
  • Ans. 

    PCA is a dimensionality reduction technique used to transform high-dimensional data into a lower-dimensional space.

    • PCA can be used for feature extraction, data visualization, and noise reduction.

    • PCA cannot be used for causal inference or to handle missing data.

    • PCA assumes linear relationships between variables and may not work well with non-linear data.

    • PCA can be applied to various fields such as finance, image process...

  • Answered by AI
  • Q3. What is VIF and how is it calculated?
  • Ans. 

    VIF stands for Variance Inflation Factor, a measure of multicollinearity in regression analysis.

    • VIF is calculated for each predictor variable in a regression model.

    • It measures how much the variance of the estimated regression coefficient is increased due to multicollinearity.

    • A VIF of 1 indicates no multicollinearity, while a VIF greater than 1 indicates increasing levels of multicollinearity.

    • VIF is calculated as 1 / (1...

  • Answered by AI
  • Q4. What is AIC & BIC in linear regression?
  • Ans. 

    AIC & BIC are statistical measures used to evaluate the goodness of fit of a linear regression model.

    • AIC stands for Akaike Information Criterion and BIC stands for Bayesian Information Criterion.

    • Both AIC and BIC are used to compare different models and select the best one.

    • AIC penalizes complex models less severely than BIC.

    • Lower AIC/BIC values indicate a better fit of the model to the data.

    • AIC and BIC can be calculated...

  • Answered by AI
  • Q5. Do we minimize or maximize the loss in logistic regression?
  • Ans. 

    We minimize the loss in logistic regression.

    • The goal of logistic regression is to minimize the loss function.

    • The loss function measures the difference between predicted and actual values.

    • The optimization algorithm tries to find the values of coefficients that minimize the loss function.

    • Minimizing the loss function leads to better model performance.

    • Examples of loss functions used in logistic regression are cross-entropy...

  • Answered by AI
  • Q6. How does one vs rest work for logistic regression?
  • Ans. 

    One vs Rest is a technique used to extend binary classification to multi-class problems in logistic regression.

    • It involves training multiple binary classifiers, one for each class.

    • In each classifier, one class is treated as the positive class and the rest as negative.

    • The class with the highest probability is predicted as the final output.

    • It is also known as one vs all or one vs others.

    • Example: In a 3-class problem, we ...

  • Answered by AI
  • Q7. What is one vs one classification?
  • Ans. 

    One vs one classification is a binary classification method where multiple models are trained to classify each pair of classes.

    • It is used when there are more than two classes in the dataset.

    • It involves training multiple binary classifiers for each pair of classes.

    • The final prediction is made by combining the results of all the binary classifiers.

    • Example: In a dataset with 5 classes, 10 binary classifiers will be traine...

  • Answered by AI
  • Q8. How to find the number of white cars in a city? (interviewer wanted my approach and had given me 5 minutes to come up with an apporach)
  • Ans. 

    Estimate the number of white cars using surveys, traffic data, and image recognition techniques.

    • Conduct surveys: Ask residents about car colors in their neighborhoods.

    • Use traffic cameras: Analyze footage to count white cars during peak hours.

    • Leverage social media: Analyze posts or images of cars in the city.

    • Utilize machine learning: Train a model on images of cars to identify white ones.

    • Collaborate with local authoriti...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - for the most part, practical questions were asked. so, your experience would matter the most. hence prepare accordingly.

Skills evaluated in this interview

I applied via Job Portal and was interviewed in Mar 2022. There were 2 interview rounds.

Round 1 - Coding Test 

(1 Question)

  • Q1. Machine Learning, Python, SQL, Basic Stats The difficulty level of questions was average.
Round 2 - Technical 

(1 Question)

  • Q1. Resume and project related. One or two questions around probability and stats.

Interview Preparation Tips

Interview preparation tips for other job seekers - Round 1 have MCQ questions around machine Learning , data science and sql with average difficulty
Round 2 -- Technical round. Asked mostly around resume and the project mentioned.
Also asked to do live python and sql coding

Top trending discussions

View All
Interview Tips & Stories
4d (edited)
a team lead
Why are women still asked such personal questions in interview?
I recently went for an interview… and honestly, m still trying to process what just happened. Instead of being asked about my skills, experience, or how I could add value to the company… the questions took a totally unexpected turn. The interviewer started asking things like When are you getting married? Are you engaged? And m sure, if I had said I was married, the next question would’ve been How long have you been married? What does my personal life have to do with the job m applying for? This is where I felt the gender discrimination hit hard. These types of questions are so casually thrown at women during interviews but are they ever asked to men? No one asks male candidates if they’re planning a wedding or how old their kids are. So why is it okay to ask women? Can we please stop normalising this kind of behaviour in interviews? Our careers shouldn’t be judged by our relationship status. Period.
Got a question about Affine?
Ask anonymously on communities.

Interview questions from similar companies

I applied via Company Website and was interviewed before May 2020. There were 3 interview rounds.

Interview Questionnaire 

1 Question

  • Q1. Question asked were on the fundamentals of selected Classification and Regression algorithms, Live coding(Basic and Easy) , Basic Knowledge on any cloud service, Deployment of ML projects on cloud .

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare the math and fundamentals behind each ML or DL algorithms. No need to know all State of the Art algorithms , be sure on whatever you know. Deployment using Flask and on any cloud service is required. Strong grip on python like using basic data structures like using Dictionaries, set, lists and other functions like lambda,filter, map etc.
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Walk-in and was interviewed before Oct 2023. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. Proficiency in cloud technology
  • Ans. 

    Proficiency in cloud technology is essential for data scientists to efficiently store, manage, and analyze large datasets.

    • Experience with cloud platforms like AWS, Azure, or Google Cloud

    • Knowledge of cloud storage solutions like S3, Blob Storage, or Cloud Storage

    • Understanding of cloud computing concepts like virtual machines, containers, and serverless computing

    • Ability to work with big data technologies like Hadoop, Spa...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Be honest and true about yourself

Skills evaluated in this interview

Are these interview questions helpful?
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Campus Placement and was interviewed before Nov 2023. There were 5 interview rounds.

Round 1 - Coding Test 

2 coding questions in 45 min

Round 2 - Technical 

(2 Questions)

  • Q1. Basics of java and frameworks
  • Q2. Basics of SQL and types of queries
Round 3 - Aptitude Test 

30 min of logical reasoning and basic aptitude

Round 4 - Group Discussion 

30 min of GD over a general topic to check your communication skills

Round 5 - HR 

(2 Questions)

  • Q1. Why do you want to join our company
  • Q2. Why should we hire you

Interview Questionnaire 

1 Question

  • Q1. Situation based question on experience

Interview Preparation Tips

Interview preparation tips for other job seekers - Be prepared for what is in CV
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Selected Selected

I applied via Approached by Company and was interviewed before Dec 2023. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. What information can you provide about the projects you worked on in your previous companies
  • Ans. 

    I have worked on projects involving data pipeline development, ETL processes, and data warehousing.

    • Developed and maintained data pipelines to ingest, process, and store large volumes of data

    • Implemented ETL processes to transform raw data into usable formats for analysis

    • Designed and optimized data warehouses for efficient storage and retrieval of data

    • Worked on real-time data processing using technologies like Apache Kaf...

  • Answered by AI
  • Q2. What are some SQL queries that utilize joins and window functions?
  • Ans. 

    SQL queries using joins and window functions

    • Use INNER JOIN to combine rows from two or more tables based on a related column

    • Use WINDOW functions like ROW_NUMBER() to assign a unique sequential integer to each row within a partition

    • Example: SELECT column1, column2, ROW_NUMBER() OVER(PARTITION BY column1 ORDER BY column2) AS row_num FROM table_name

  • Answered by AI
Round 2 - HR 

(2 Questions)

  • Q1. What are the reasons for your decision to leave your current organization?
  • Ans. 

    Seeking new challenges and growth opportunities in a more innovative environment.

    • Desire for career advancement

    • Lack of growth opportunities in current organization

    • Interest in working with new technologies or industries

    • Seeking a more collaborative or dynamic work environment

  • Answered by AI
  • Q2. What are your reasons for wanting to join this company?
  • Ans. 

    I am impressed by the company's innovative projects and collaborative work culture.

    • Innovative projects in the data engineering field

    • Collaborative work culture that fosters growth and learning

    • Company's reputation for valuing employee input and ideas

  • Answered by AI

Affine Interview FAQs

How many rounds are there in Affine Data Scientist interview?
Affine interview process usually has 2 rounds. The most common rounds in the Affine interview process are Technical, Coding Test and Resume Shortlist.
How to prepare for Affine Data Scientist interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Affine. The most common topics and skills that interviewers at Affine expect are Machine Learning, Python, Data Science, Artificial Intelligence and Computer Vision.
What are the top questions asked in Affine Data Scientist interview?

Some of the top questions asked at the Affine Data Scientist interview -

  1. you have a pandas dataframe with three columns, filled with state names, city n...read more
  2. How to retain special characters (that pandas discards by default) in the data ...read more
  3. how to find the number of white cars in a city? (interviewer wanted my approach...read more

Tell us how to improve this page.

Overall Interview Experience Rating

3.3/5

based on 4 interview experiences

Difficulty level

Easy 100%

Duration

Less than 2 weeks 100%
View more
Join Affine Command The New

Interview Questions from Similar Companies

HCL Infosystems Interview Questions
3.9
 • 144 Interviews
Webkul Software Interview Questions
4.0
 • 71 Interviews
Softenger Interview Questions
4.0
 • 59 Interviews
JK Tech Interview Questions
3.6
 • 36 Interviews
View all
Affine Data Scientist Salary
based on 33 salaries
₹6.5 L/yr - ₹25 L/yr
9% less than the average Data Scientist Salary in India
View more details

Affine Data Scientist Reviews and Ratings

based on 4 reviews

2.8/5

Rating in categories

2.7

Skill development

2.7

Work-life balance

1.9

Salary

3.5

Job security

2.7

Company culture

2.7

Promotions

2.7

Work satisfaction

Explore 4 Reviews and Ratings
Senior Associate
139 salaries
unlock blur

₹10 L/yr - ₹18.3 L/yr

Business Analyst
104 salaries
unlock blur

₹5.8 L/yr - ₹12.6 L/yr

Consultant
91 salaries
unlock blur

₹12 L/yr - ₹31.4 L/yr

Senior Business Analyst
87 salaries
unlock blur

₹10 L/yr - ₹18 L/yr

Data Engineer
48 salaries
unlock blur

₹8 L/yr - ₹13 L/yr

Explore more salaries
Compare Affine with

HCL Infosystems

3.9
Compare

Softenger

4.0
Compare

Capital Numbers Infotech

4.4
Compare

JK Tech

3.6
Compare
write
Share an Interview