i
Urban
Company
Filter interviews by
Assumptions in Linear Regression
Linear relationship between independent and dependent variables
Homoscedasticity (constant variance) of residuals
Independence of residuals
Normal distribution of residuals
No multicollinearity among independent variables
Type I error is rejecting a true null hypothesis, while Type II error is failing to reject a false null hypothesis.
Type I error is also known as a false positive
Type II error is also known as a false negative
Type I error occurs when the significance level is set too high
Type II error occurs when the significance level is set too low
Examples: Type I error - Convicting an innocent person, Type II error - Failing to ...
Logistic Regression formula is used to model the probability of a certain event occurring.
The formula is: P(Y=1) = e^(b0 + b1*X1 + b2*X2 + ... + bn*Xn) / (1 + e^(b0 + b1*X1 + b2*X2 + ... + bn*Xn))
Y is the dependent variable and X1, X2, ..., Xn are the independent variables
b0, b1, b2, ..., bn are the coefficients that need to be estimated
The formula is used to predict the probability of a binary outcome, such as wh...
Metrics used to evaluate Linear Regression
Mean Squared Error (MSE)
Root Mean Squared Error (RMSE)
R-squared (R²)
Adjusted R-squared (Adj R²)
Mean Absolute Error (MAE)
Residual Sum of Squares (RSS)
Akaike Information Criterion (AIC)
Bayesian Information Criterion (BIC)
To improve the performance of Linear Regression, you can consider feature engineering, regularization, and handling outliers.
Perform feature engineering to create new features that capture important information.
Apply regularization techniques like L1 or L2 regularization to prevent overfitting.
Handle outliers by either removing them or using robust regression techniques.
Check for multicollinearity among the indepe...
Overfitting and underfitting are two common problems in machine learning models.
Overfitting occurs when a model is too complex and fits the training data too closely, resulting in poor performance on new data.
Underfitting occurs when a model is too simple and cannot capture the underlying patterns in the data, resulting in poor performance on both training and new data.
Overfitting can be prevented by using regular...
Metrics used to evaluate classification models
Accuracy
Precision
Recall
F1 Score
ROC Curve
Confusion Matrix
Overfitting in decision trees can be handled by pruning, reducing tree depth, increasing dataset size, and using ensemble methods.
Prune the tree to remove unnecessary branches
Reduce tree depth to prevent overfitting
Increase dataset size to improve model generalization
Use ensemble methods like Random Forest to reduce overfitting
Underfitting can be handled by increasing tree depth, adding more features, and reducing...
Overfitting in Linear Regression can be handled by using regularization techniques.
Regularization techniques like Ridge regression and Lasso regression can help in reducing overfitting.
Cross-validation can be used to find the optimal regularization parameter.
Feature selection and dimensionality reduction techniques can also help in reducing overfitting.
Collecting more data can help in reducing overfitting by provi...
Cost function measures the difference between predicted and actual values. Error function measures the average of cost function.
Cost function is used to evaluate the performance of a machine learning model.
It measures the difference between predicted and actual values.
Error function is the average of cost function over the entire dataset.
It is used to optimize the parameters of the model.
Examples of cost functions...
I applied via Referral and was interviewed in Feb 2022. There were 5 interview rounds.
Given time series data of provider, compute hour wise provider wise no of seconds online
Assumptions in Linear Regression
Linear relationship between independent and dependent variables
Homoscedasticity (constant variance) of residuals
Independence of residuals
Normal distribution of residuals
No multicollinearity among independent variables
Overfitting and underfitting are two common problems in machine learning models.
Overfitting occurs when a model is too complex and fits the training data too closely, resulting in poor performance on new data.
Underfitting occurs when a model is too simple and cannot capture the underlying patterns in the data, resulting in poor performance on both training and new data.
Overfitting can be prevented by using regularizati...
To improve the performance of Linear Regression, you can consider feature engineering, regularization, and handling outliers.
Perform feature engineering to create new features that capture important information.
Apply regularization techniques like L1 or L2 regularization to prevent overfitting.
Handle outliers by either removing them or using robust regression techniques.
Check for multicollinearity among the independent...
Metrics used to evaluate Linear Regression
Mean Squared Error (MSE)
Root Mean Squared Error (RMSE)
R-squared (R²)
Adjusted R-squared (Adj R²)
Mean Absolute Error (MAE)
Residual Sum of Squares (RSS)
Akaike Information Criterion (AIC)
Bayesian Information Criterion (BIC)
Cost function measures the difference between predicted and actual values. Error function measures the average of cost function.
Cost function is used to evaluate the performance of a machine learning model.
It measures the difference between predicted and actual values.
Error function is the average of cost function over the entire dataset.
It is used to optimize the parameters of the model.
Examples of cost functions are ...
Overfitting in Linear Regression can be handled by using regularization techniques.
Regularization techniques like Ridge regression and Lasso regression can help in reducing overfitting.
Cross-validation can be used to find the optimal regularization parameter.
Feature selection and dimensionality reduction techniques can also help in reducing overfitting.
Collecting more data can help in reducing overfitting by providing ...
Least Squares Method and Maximum Likelihood are both used to estimate parameters, but differ in their approach.
Least Squares Method minimizes the sum of squared errors between the observed and predicted values.
Maximum Likelihood estimates the parameters that maximize the likelihood of observing the given data.
Least Squares Method assumes that the errors are normally distributed and independent.
Maximum Likelihood does n...
Logistic Regression formula is used to model the probability of a certain event occurring.
The formula is: P(Y=1) = e^(b0 + b1*X1 + b2*X2 + ... + bn*Xn) / (1 + e^(b0 + b1*X1 + b2*X2 + ... + bn*Xn))
Y is the dependent variable and X1, X2, ..., Xn are the independent variables
b0, b1, b2, ..., bn are the coefficients that need to be estimated
The formula is used to predict the probability of a binary outcome, such as whether...
Type I error is rejecting a true null hypothesis, while Type II error is failing to reject a false null hypothesis.
Type I error is also known as a false positive
Type II error is also known as a false negative
Type I error occurs when the significance level is set too high
Type II error occurs when the significance level is set too low
Examples: Type I error - Convicting an innocent person, Type II error - Failing to convi...
Metrics used to evaluate classification models
Accuracy
Precision
Recall
F1 Score
ROC Curve
Confusion Matrix
Overfitting in decision trees can be handled by pruning, reducing tree depth, increasing dataset size, and using ensemble methods.
Prune the tree to remove unnecessary branches
Reduce tree depth to prevent overfitting
Increase dataset size to improve model generalization
Use ensemble methods like Random Forest to reduce overfitting
Underfitting can be handled by increasing tree depth, adding more features, and reducing regu...
Case Study - How do you improve user engagement of Facebook?
Guesstimates - How many people watched the Squid Game series on Netflix
How do you reduce partner churn in UC?
Top trending discussions
I applied via Job Portal
SQL and Python data analysis questions
Matching data in different columns involves comparing the values in the columns and identifying similarities or patterns.
Use string matching techniques like exact match, partial match, or fuzzy matching.
Apply data cleaning and preprocessing techniques to standardize the data before matching.
Utilize advanced algorithms like Levenshtein distance or Jaccard similarity for more complex matching.
Consider using database join...
SQL jobs include database administrator, data analyst, data scientist, business intelligence analyst, and software developer.
Database Administrator
Data Analyst
Data Scientist
Business Intelligence Analyst
Software Developer
I applied via Campus Placement and was interviewed in Nov 2024. There were 4 interview rounds.
Questions from arrays and strings and some aptitude questions
To merge two CSV files, you can use software like Microsoft Excel or programming languages like Python.
Open both CSV files in a software like Microsoft Excel.
Copy the data from one CSV file and paste it into the other CSV file.
Save the merged CSV file with a new name.
Alternatively, you can use programming languages like Python to merge CSV files by reading both files, combining the data, and writing to a new file.
I applied to this company because of its reputation in the industry, opportunities for growth, and company culture.
Reputation in the industry - I have heard great things about the company's innovative projects and successful track record.
Opportunities for growth - The company offers various training programs and career advancement opportunities for employees.
Company culture - I value a positive work environment and the...
I applied via Company Website and was interviewed before Feb 2023. There was 1 interview round.
Types of join in SQL include inner join, left join, right join, and full outer join.
Inner join returns only the matching records from both tables.
Left join returns all records from the left table and the matching records from the right table.
Right join returns all records from the right table and the matching records from the left table.
Full outer join returns all records when there is a match in either the left or rig...
The number of rows after applying join operations depends on the type of join used and the data in the tables being joined.
Inner join retains only the rows that have matching values in both tables
Left join retains all rows from the left table and the matched rows from the right table
Right join retains all rows from the right table and the matched rows from the left table
Full outer join retains all rows when there is a ...
VLOOKUP in Excel is used to search for a value in a table and return a corresponding value, while SQL functions like JOIN and WHERE are used to retrieve data from multiple tables based on specified conditions.
VLOOKUP is specific to Excel and works on a single table, while SQL functions can work on multiple tables.
VLOOKUP requires the table to be sorted in ascending order, while SQL functions do not have this requiremen...
I applied via Company Website and was interviewed before Feb 2023. There were 2 interview rounds.
Simple SQL & Excel test
I was asked a medium-level SQL question that involved the RANK function.
I appeared for an interview in Apr 2025, where I was asked the following questions.
SQL code to identify duplicate records in a table based on specific columns.
Use the GROUP BY clause to group records by the columns you want to check for duplicates.
Apply the HAVING clause to filter groups that have a count greater than 1.
Example SQL query: SELECT column1, column2, COUNT(*) FROM table_name GROUP BY column1, column2 HAVING COUNT(*) > 1;
This query will return all combinations of column1 and column2 th...
I applied via Approached by Company and was interviewed in Apr 2023. There were 6 interview rounds.
Operation questions like vlookups, Hlookups, xlookups etc,.
Reporting questions like pivot tables, typing speed, power pivots, pricing sheet preparation
SQL queries like joins, aggregation functions, filters etc,.
I can prepare a bar chart with product operations data within a few minutes.
I have experience using data visualization tools like Tableau, Power BI, or Excel to quickly create bar charts.
I am proficient in organizing and cleaning data to ensure accurate representation in the chart.
I can customize the chart with relevant labels, colors, and titles to make it visually appealing and easy to understand.
You can merge multiple Excel files together using the 'Consolidate' feature in Excel.
Open Excel and go to the 'Data' tab
Click on 'Consolidate' in the 'Data Tools' group
Select the files you want to merge and choose the desired options
Click 'OK' to merge the files together
Some of the top questions asked at the Urban Company Senior Data Analyst interview -
Category Manager
664
salaries
| ₹6 L/yr - ₹13 L/yr |
Business Development Associate
512
salaries
| ₹2.4 L/yr - ₹6 L/yr |
Beautician
159
salaries
| ₹2.4 L/yr - ₹6.6 L/yr |
AC Technician
150
salaries
| ₹1.9 L/yr - ₹6 L/yr |
Senior Category Manager
143
salaries
| ₹12.8 L/yr - ₹22 L/yr |
Housejoy
Urban Ladder
Udaan
Swiggy