Upload Button Icon Add office photos

Filter interviews by

Indiafilings Data Scientist Interview Questions and Answers

Updated 29 Jun 2023

Indiafilings Data Scientist Interview Experiences

1 interview found

Interview experience
1
Bad
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Approached by Company and was interviewed in May 2023. There were 3 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(2 Questions)

  • Q1. What is the difference between sigmoid and softmax activation function?
  • Ans. 

    Sigmoid is used for binary classification while softmax is used for multi-class classification.

    • Sigmoid function outputs values between 0 and 1, suitable for binary classification tasks.

    • Softmax function outputs a probability distribution over multiple classes, summing up to 1.

    • Sigmoid is used in the output layer for binary classification, while softmax is used for multi-class classification.

    • Softmax is the generalization

  • Answered by AI
  • Q2. What is an activation function ?
  • Ans. 

    An activation function is a mathematical function that determines the output of a neural network.

    • Activation functions introduce non-linearity to the neural network, allowing it to learn complex patterns in the data.

    • Common activation functions include sigmoid, tanh, ReLU, and softmax.

    • The choice of activation function can impact the performance and training speed of the neural network.

  • Answered by AI
Round 3 - Coding Test 

Build an NLP model on their dataset

Interview Preparation Tips

Interview preparation tips for other job seekers - Pathetic experience. HR Recruiter will constantly call you every day (multiple times per day) to set up the interview almost to the point of harassment, but when the interview rounds are done, there is zero communication from the entire team.

Skills evaluated in this interview

Interview questions from similar companies

I applied via Walk-in and was interviewed in Mar 2020. There was 1 interview round.

Interview Questionnaire 

10 Questions

  • Q1. What is R square and how R square is different from Adjusted R square
  • Ans. 

    R square is a statistical measure that represents the proportion of the variance in the dependent variable explained by the independent variables.

    • R square is a value between 0 and 1, where 0 indicates that the independent variables do not explain any of the variance in the dependent variable, and 1 indicates that they explain all of it.

    • It is used to evaluate the goodness of fit of a regression model.

    • Adjusted R square t...

  • Answered by AI
  • Q2. Explain what do u understand by the team WOE and IV. What's the importance. Advantages and disadvantages
  • Q3. What are variable reducing techniques
  • Ans. 

    Variable reducing techniques are methods used to identify and select the most relevant variables in a dataset.

    • Variable reducing techniques help in reducing the number of variables in a dataset.

    • These techniques aim to identify the most important variables that contribute significantly to the outcome.

    • Some common variable reducing techniques include feature selection, dimensionality reduction, and correlation analysis.

    • Fea...

  • Answered by AI
  • Q4. Which test is used in logistic regression to check the significance of the variable
  • Ans. 

    The Wald test is used in logistic regression to check the significance of the variable.

    • The Wald test calculates the ratio of the estimated coefficient to its standard error.

    • It follows a chi-square distribution with one degree of freedom.

    • A small p-value indicates that the variable is significant.

    • For example, in Python, the statsmodels library provides the Wald test in the summary of a logistic regression model.

  • Answered by AI
  • Q5. How to check multicollinearity in Logistic regression
  • Ans. 

    Multicollinearity in logistic regression can be checked using correlation matrix and variance inflation factor (VIF).

    • Calculate the correlation matrix of the independent variables and check for high correlation coefficients.

    • Calculate the VIF for each independent variable and check for values greater than 5 or 10.

    • Consider removing one of the highly correlated variables or variables with high VIF to address multicollinear...

  • Answered by AI
  • Q6. Difference between bagging and boosting
  • Ans. 

    Bagging and boosting are ensemble methods used in machine learning to improve model performance.

    • Bagging involves training multiple models on different subsets of the training data and then combining their predictions through averaging or voting.

    • Boosting involves iteratively training models on the same dataset, with each subsequent model focusing on the samples that were misclassified by the previous model.

    • Bagging reduc...

  • Answered by AI
  • Q7. Explain the logistics regression process
  • Ans. 

    Logistic regression is a statistical method used to analyze and model the relationship between a binary dependent variable and one or more independent variables.

    • It is a type of regression analysis used for predicting the outcome of a categorical dependent variable based on one or more predictor variables.

    • It uses a logistic function to model the probability of the dependent variable taking a particular value.

    • It is commo...

  • Answered by AI
  • Q8. Explain Gini coefficient
  • Ans. 

    Gini coefficient measures the inequality among values of a frequency distribution.

    • Gini coefficient ranges from 0 to 1, where 0 represents perfect equality and 1 represents perfect inequality.

    • It is commonly used to measure income inequality in a population.

    • A Gini coefficient of 0.4 or higher is considered to be a high level of inequality.

    • Gini coefficient can be calculated using the Lorenz curve, which plots the cumulati...

  • Answered by AI
  • Q9. Difference between chair and cart
  • Ans. 

    A chair is a piece of furniture used for sitting, while a cart is a vehicle used for transporting goods.

    • A chair typically has a backrest and armrests, while a cart does not.

    • A chair is designed for one person to sit on, while a cart can carry multiple items or people.

    • A chair is usually stationary, while a cart is mobile and can be pushed or pulled.

    • A chair is commonly found in homes, offices, and public spaces, while a c...

  • Answered by AI
  • Q10. How to check outliers in a variable, what treatment should you use to remove such outliers
  • Ans. 

    Outliers can be detected using statistical methods like box plots, z-score, and IQR. Treatment can be removal or transformation.

    • Use box plots to visualize outliers

    • Calculate z-score and remove data points with z-score greater than 3

    • Calculate IQR and remove data points outside 1.5*IQR

    • Transform data using log or square root to reduce the impact of outliers

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Explain the concept properly, if not able to explain properly then take a pause and try again with some examples. Be confident.

Skills evaluated in this interview

I applied via Recruitment Consulltant and was interviewed before Aug 2021. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. Difference between CNN and MLP
  • Ans. 

    CNN is used for image recognition while MLP is used for general classification tasks.

    • CNN uses convolutional layers to extract features from images while MLP uses fully connected layers.

    • CNN is better suited for tasks that require spatial understanding like object detection while MLP is better for tabular data.

    • CNN has fewer parameters than MLP due to weight sharing in convolutional layers.

    • CNN can handle input of varying

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Brush up basic statistics . Also prepare atleast 2 , 3 ML algorithms for the interview.

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-

I applied via Campus Placement

Round 1 - Technical 

(1 Question)

  • Q1. Work done in previous companty
  • Ans. 

    Developed machine learning models to predict customer churn and optimize marketing campaigns.

    • Built predictive models using Python and scikit-learn

    • Utilized SQL to extract and manipulate data for analysis

    • Collaborated with cross-functional teams to implement data-driven solutions

  • Answered by AI
Interview experience
4
Good
Difficulty level
Easy
Process Duration
2-4 weeks
Result
Selected Selected

I applied via IIM Jobs and was interviewed before Jun 2023. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. SQL basic questions
  • Q2. Python - pandas, numpy based questions

Interview Preparation Tips

Interview preparation tips for other job seekers - Great place to work
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
-
Result
No response

I applied via Job Portal and was interviewed in Nov 2023. There was 1 interview round.

Round 1 - One-on-one 

(5 Questions)

  • Q1. What is Gradient Descents?
  • Ans. 

    Gradient descent is an optimization algorithm used to minimize a function by iteratively moving in the direction of steepest descent.

    • Gradient descent is used to find the minimum of a function by taking steps proportional to the negative of the gradient at the current point.

    • It is commonly used in machine learning to optimize the parameters of a model by minimizing the loss function.

    • There are different variants of gradie...

  • Answered by AI
  • Q2. What is LSTM?, and what are the gates in it?
  • Ans. 

    LSTM (Long Short-Term Memory) is a type of recurrent neural network designed to handle long-term dependencies.

    • LSTM has three gates: input gate, forget gate, and output gate.

    • Input gate controls the flow of information into the cell state.

    • Forget gate decides what information to discard from the cell state.

    • Output gate determines the output based on the cell state.

  • Answered by AI
  • Q3. They gave me a link to dataset and started saying the operations to apply on that. E.g, value_counts, null_values, fill the values with mean,etc.
  • Q4. What is t-test? What is Mean, Median and Mode and where to use these?
  • Ans. 

    T-test is a statistical test used to determine if there is a significant difference between the means of two groups.

    • Mean is the average of a set of numbers, median is the middle value when the numbers are ordered, and mode is the most frequently occurring value.

    • Mean is sensitive to outliers, median is robust to outliers, and mode is useful for categorical data.

    • T-test is used to compare means of two groups, mean is used...

  • Answered by AI
  • Q5. What is RANDOM FOREST ?
  • Ans. 

    Random Forest is an ensemble learning method used for classification and regression tasks.

    • Random Forest is a collection of decision trees that are trained on random subsets of the data.

    • Each tree in the forest makes a prediction, and the final prediction is the average (regression) or majority vote (classification) of all trees.

    • Random Forest helps reduce overfitting and improve accuracy compared to a single decision tre...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Motilal Oswal Financial Services Data Scientist interview:
  • Machine Learning
  • Statistics
  • Pandas
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Coding Test 

I was asked Python, sql, coding questions

Round 2 - Case Study 

Case study on how would you identify the total number of footfall on a airport

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
-

I applied via Company Website and was interviewed before Feb 2023. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. ML concepts , regression, regularization etc
Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Referral and was interviewed before May 2023. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Self Intro and projects discussion
  • Q2. Feature selection methods
  • Ans. 

    Feature selection methods help in selecting the most relevant features for building predictive models.

    • Feature selection methods aim to reduce the number of input variables to only those that are most relevant.

    • Common methods include filter methods, wrapper methods, and embedded methods.

    • Examples include Recursive Feature Elimination (RFE), Principal Component Analysis (PCA), and Lasso regression.

  • Answered by AI

Skills evaluated in this interview

I appeared for an interview in May 2022.

Round 1 - Assignment 

Round duration - 60 Minutes
Round difficulty - Easy

Round 2 - Coding Test 

(1 Question)

Round duration - 60 Minutes
Round difficulty - Easy

There were 10 MCQs ranging from Aptitude to Programming MCQs to basics of Data Science.
The coding question only the optimized solution was accepted

  • Q1. 

    Special Sum of Array Problem Statement

    Given an array 'arr' containing single-digit integers, your task is to calculate the total sum of all its elements. However, the resulting sum must also be a single-...

  • Ans. 

    Calculate the total sum of array elements until a single-digit number is obtained by repeatedly summing digits.

    • Iterate through the array and calculate the sum of all elements.

    • If the sum is a single-digit number, return it. Otherwise, repeat the process of summing digits until a single-digit number is obtained.

    • Return the final single-digit sum.

  • Answered by AI
Round 3 - Video Call 

(1 Question)

Round duration - 45 minutes
Round difficulty - Easy

The interview happened in the evening. It was an online video call.
The interviewer was very cooperative. I would say it was rather a discussion session between us.

  • Q1. 

    Clone a Linked List with Random Pointers

    Given a linked list where each node contains two pointers: one pointing to the next node and another random pointer that can point to any node within the list (or ...

  • Ans. 

    Create a deep copy of a linked list with random pointers.

    • Iterate through the original linked list and create a new node for each node in the list.

    • Store the mapping of original nodes to new nodes in a hashmap to handle random pointers.

    • Update the random pointers of new nodes based on the mapping stored in the hashmap.

    • Return the head of the copied linked list.

  • Answered by AI
Round 4 - HR 

Round duration - 10 Minutes
Round difficulty - Easy

It was late night
It was a telephonic call

Interview Preparation Tips

Professional and academic backgroundI completed Computer Science Engineering from Vellore Institute of Technology. I applied for the job as Data Scientist in PuneEligibility criteriaAbove 8 CGPA. Only CSE, IT, ECE, EEE branches were allowed.Bajaj Finserv Ltd. interview preparation:Topics to prepare for the interview - Data Structures and Algorithms, OOPs, DBMS, Data Science Fundamentals, Personal ProjectsTime required to prepare for the interview - 6-8 monthsInterview preparation tips for other job seekers

Tip 1 : Start your preparation early. Start from the very basics before directly moving onto DSA. Get a grasp of the basics in each topic. Practice different varieties of questions from each topic. I would recommend at least 200 questions of DSA.
Tip 2 : Revise your projects before you attend any interview. This is extremely important. You must be able to clearly explain your project along with your role in the project in layman terms to the interviewer.
Tip 3 : Grind hard to achieve your goals but don't take much stress. There's a long way to go.

Application resume tips for other job seekers

Tip 1 : Never, I say never put false things or your friends project in your resume
Tip 2 : Make a 1 page resume. Make your resume in such a way that the interviewer must be able to see the things you want him to see in the very first scan.

Final outcome of the interviewSelected

Skills evaluated in this interview

Indiafilings Interview FAQs

How many rounds are there in Indiafilings Data Scientist interview?
Indiafilings interview process usually has 3 rounds. The most common rounds in the Indiafilings interview process are Resume Shortlist, Technical and Coding Test.
What are the top questions asked in Indiafilings Data Scientist interview?

Some of the top questions asked at the Indiafilings Data Scientist interview -

  1. What is the difference between sigmoid and softmax activation functi...read more
  2. What is an activation functio...read more

Tell us how to improve this page.

Indiafilings Data Scientist Interview Process

based on 1 interview

Interview experience

1
  
Bad
View more
Accountant
101 salaries
unlock blur

₹1.2 L/yr - ₹4.2 L/yr

Business Advisor
79 salaries
unlock blur

₹1.5 L/yr - ₹5 L/yr

Relationship Manager
60 salaries
unlock blur

₹1.5 L/yr - ₹3.5 L/yr

Accounts Manager
24 salaries
unlock blur

₹1.8 L/yr - ₹4 L/yr

HR Associate
21 salaries
unlock blur

₹2 L/yr - ₹3.2 L/yr

Explore more salaries
Compare Indiafilings with

Bajaj Finserv

4.0
Compare

Wells Fargo

3.8
Compare

JPMorgan Chase & Co.

3.9
Compare

HSBC Group

3.9
Compare
Did you find this page helpful?
Yes No
write
Share an Interview