Filter interviews by
I applied via LinkedIn and was interviewed before Sep 2023. There were 4 interview rounds.
Binomial distribution is a probability distribution that describes the number of successes in a fixed number of independent trials.
Describes the number of successes in a fixed number of independent trials
Each trial has only two possible outcomes (success or failure)
The trials are independent and the probability of success is constant
Examples: Coin toss (success = heads), Pass/fail exams, Yes/no surveys
K-means clustering is a popular unsupervised machine learning algorithm used for clustering data points into groups based on similarity.
Divides data points into K clusters based on similarity
Minimizes the sum of squared distances within each cluster
Requires specifying the number of clusters (K) beforehand
Iteratively assigns data points to the nearest cluster centroid
Commonly used in customer segmentation, image compres
Central Limit Theorem states that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases.
Central Limit Theorem is a fundamental concept in statistics.
It states that the sampling distribution of the sample mean will be approximately normally distributed regardless of the shape of the population distribution.
As the sample size increases, the sampling distribution of the...
Developed predictive model to forecast customer churn using machine learning algorithms
Collected and cleaned customer data from various sources
Performed feature engineering to create relevant predictors
Built and trained machine learning models such as logistic regression and random forest
Evaluated model performance using metrics like accuracy, precision, and recall
Implemented the model in a production environment for r
Precision-Recall AUC is a metric used to evaluate the performance of classification models, particularly in imbalanced datasets.
Precision-Recall AUC focuses on the trade-off between precision and recall for different threshold values.
It is particularly useful when dealing with imbalanced datasets where the positive class is rare.
A higher Precision-Recall AUC indicates better model performance in terms of precision and ...
Use SQL window function to calculate monthwise cumulative sum.
Use the SUM() function with OVER() clause to calculate cumulative sum.
Partition the data by month to get monthwise cumulative sum.
Order the data by date to ensure correct cumulative sum calculation.
Chi square test is a statistical test used to determine if there is a significant association between two categorical variables.
Chi square test is used to compare observed frequencies with expected frequencies in a contingency table.
It is commonly used in research to analyze data and determine if there is a relationship between two variables.
For example, it can be used to test if there is a significant difference in th...
Z test is used when sample size is large and population standard deviation is known. T test is used when sample size is small or population standard deviation is unknown.
Z test is used for hypothesis testing when sample size is large (n > 30) and population standard deviation is known.
T test is used when sample size is small (n < 30) or population standard deviation is unknown.
Z test is used for comparing means of two ...
Utilized K-means clustering to group customers based on purchasing behavior. Evaluated clusters using silhouette score and inertia.
Used K-means clustering algorithm to group customers into segments
Evaluated the quality of clusters using silhouette score and inertia
Silhouette score measures how similar an object is to its own cluster compared to other clusters
Inertia measures how tightly the clusters are packed together
...
I stay up to date with new analytical tools and techniques by attending workshops, online courses, reading research papers, and participating in industry conferences.
Attend workshops and training sessions on new tools and techniques
Take online courses and certifications to learn about the latest advancements
Read research papers and articles to stay informed about cutting-edge methods
Participate in industry conferences ...
I prioritize tasks, use time management tools, set deadlines, and minimize distractions.
Prioritize tasks based on importance and deadlines
Use time management tools like calendars and to-do lists
Set specific deadlines for each task to stay on track
Minimize distractions by setting boundaries and creating a focused work environment
Top trending discussions
I applied via Referral
Recommendation system uses data analysis and machine learning algorithms to suggest items to users based on their preferences.
Collect user data and item data
Analyze data to find patterns and similarities
Use machine learning algorithms to make predictions and suggest items to users
Continuously update and improve the system based on user feedback
Examples: Netflix suggesting movies based on viewing history, Amazon suggest
I applied via Internshala and was interviewed in Dec 2024. There was 1 interview round.
I applied via Approached by Company and was interviewed in Aug 2024. There was 1 interview round.
I applied via LinkedIn and was interviewed before Jan 2024. There was 1 interview round.
The SQL query for calculating a moving average involves using window functions.
Use the OVER clause with the ORDER BY clause to define the window frame for the moving average calculation.
Use the AVG() function to calculate the average within the window frame.
Example: SELECT value, AVG(value) OVER (ORDER BY date ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) AS moving_avg FROM table_name;
DDL is used to define the structure of database objects, while DML is used to manipulate data within those objects.
DDL is used to create, modify, and delete database objects such as tables, indexes, and views.
DML is used to insert, update, retrieve, and delete data within those database objects.
Examples of DDL statements include CREATE TABLE, ALTER INDEX, and DROP VIEW.
Examples of DML statements include INSERT INTO, UP
I applied via Referral
Recommendation system uses data analysis and machine learning algorithms to suggest items to users based on their preferences.
Collect user data and item data
Analyze data to find patterns and similarities
Use machine learning algorithms to make predictions and suggest items to users
Continuously update and improve the system based on user feedback
Examples: Netflix suggesting movies based on viewing history, Amazon suggest
Delete removes specific rows from a table, drop removes entire table structure, and truncate removes all rows from a table.
Delete is a DML command, drop is a DDL command, and truncate is a DDL command.
Delete can be rolled back, drop cannot be rolled back, and truncate cannot be rolled back.
Delete triggers delete triggers, drop triggers drop triggers, and truncate does not trigger any triggers.
Example: DELETE FROM table...
Iterators are objects that allow sequential access to elements in a collection.
Iterators are used to loop through elements in a collection one at a time.
They provide a way to access elements without exposing the underlying data structure.
Iterators have methods like next() to retrieve the next element in the collection.
Examples of iterators include Python's iter() and Java's Iterator interface.
Python coding 45 mins
I applied via LinkedIn and was interviewed in Mar 2024. There were 3 interview rounds.
Assignment containing the python coding and the pandas data analysis and statistics questions
Check if a sentence is a palindrome or not.
Remove all non-alphanumeric characters and convert to lowercase.
Compare the original sentence with its reverse to check for palindrome.
Example: 'A man, a plan, a canal, Panama' is a palindrome sentence.
A simple Python coding assesment - had fibonacci series and just general python questions
Statistical significance refers to the likelihood that a result or relationship is not due to chance.
Statistical significance is a measure of the probability that a relationship between variables is not due to random chance.
Hypothesis testing is a common method to determine statistical significance by comparing observed data to what would be expected by chance.
A p-value is used to determine statistical significance, wi...
based on 1 interview
Interview experience
Supplier Relationship Manager
38
salaries
| ₹4 L/yr - ₹9.6 L/yr |
Software Engineer
18
salaries
| ₹16 L/yr - ₹30 L/yr |
Data Specialist
17
salaries
| ₹3 L/yr - ₹5 L/yr |
Senior Software Engineer
16
salaries
| ₹24.8 L/yr - ₹48.2 L/yr |
Engineering Manager
12
salaries
| ₹30 L/yr - ₹62.1 L/yr |
LendingKart
NeoGrowth Credit
Indifi Technologies
Rubique