I applied via Job Portal and was interviewed in Jul 2024. There were 2 interview rounds.
I applied via Company Website and was interviewed in Oct 2024. There was 1 interview round.
A Data Warehouse stores structured data for reporting and analysis, a Data Lake stores raw and unstructured data, and tables are basic data structures.
Data Warehouse is used for storing structured data from various sources for reporting and analysis.
Data Lakes store raw and unstructured data in its native format for future processing and analysis.
Tables are basic data structures used to organize and store data in a structured ...
Group by is used to group rows that have the same values into summary rows, while Partition by is used to divide the result set into partitions to which the function is applied separately.
Group by is used with aggregate functions to group rows based on a column or set of columns.
Partition by is used with window functions to divide the result set into partitions.
Group by is used with SELECT statement, while Partition by...
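The difference can be seen by running both clauses on the same rows. Below is a minimal sketch using Python's built-in sqlite3 module; the `emp` table and its rows are hypothetical, and it assumes the bundled SQLite is version 3.25 or newer, since older versions lack window functions.

```python
import sqlite3

# Hypothetical sample data: (department, employee, salary).
rows = [
    ("eng", "asha", 100),
    ("eng", "ravi", 200),
    ("ops", "meera", 50),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (dept TEXT, name TEXT, salary INTEGER)")
conn.executemany("INSERT INTO emp VALUES (?, ?, ?)", rows)

# GROUP BY collapses each department into a single summary row.
grouped = conn.execute(
    "SELECT dept, SUM(salary) FROM emp GROUP BY dept ORDER BY dept"
).fetchall()

# PARTITION BY keeps every row and attaches the department total to it.
partitioned = conn.execute(
    "SELECT name, dept, SUM(salary) OVER (PARTITION BY dept) "
    "FROM emp ORDER BY name"
).fetchall()

conn.close()
print(grouped)      # two rows, one per department
print(partitioned)  # three rows, one per employee
```

The grouped query returns one row per department, while the partitioned query returns one row per employee with the department total repeated alongside it.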
Basic aptitude questions were asked.
I applied via Company Website and was interviewed in Feb 2024. There were 2 interview rounds.
I was interviewed in Feb 2024.
I was interviewed before Mar 2021.
Round duration - 60 minutes
Round difficulty - Easy
Technical Interview round with questions on ML mainly.
What are the assumptions of linear regression model?
There are four assumptions associated with a linear regression model:
1. Linearity: The relationship between X and the mean of Y is linear.
2. Homoscedasticity: The variance of the residuals is the same for any value of X.
3. Independence: Observations are independent of each other.
4. Normality: For any fixed value of X, Y is normally distributed.
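These assumptions are usually checked on the residuals of a fitted model. The sketch below is one way to get them for a single predictor, using the closed-form least squares solution; the function names and the sample data are illustrative assumptions, not part of the interview answer.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def residuals(xs, ys):
    """Observed y minus fitted y for every observation."""
    slope, intercept = fit_line(xs, ys)
    return [y - (slope * x + intercept) for x, y in zip(xs, ys)]


# Hypothetical data that is roughly linear in x.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.1, 4.9, 7.2, 8.8, 11.0]

slope, intercept = fit_line(xs, ys)
res = residuals(xs, ys)
print(slope, intercept)
print(res)
```

Plotting `res` against `xs` is a common way to look for curvature (a linearity problem) or a funnel shape (a homoscedasticity problem); a histogram or Q-Q plot of `res` speaks to normality.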
What problems does multicollinearity cause in regression analysis?
1. The coefficient estimates can swing wildly based on which other independent variables are in the model. The coefficients become very sensitive to small changes in the model.
2. Multicollinearity reduces the precision of the estimated coefficients, which weakens the statistical power of your regression model. You might not be able to trust the p-values to identify independent variables that are statistically significa
What are the different measures used to check the performance of a classification model?
Confusion Matrix : A confusion matrix is a table with two dimensions, "Actual" and "Predicted", whose cells count the True Positives (TP), True Negatives (TN), False Positives (FP) and False Negatives (FN).
Precision : Precision = TP / (TP + FP)
Out of all that were marked as positive, how many are actually truly positive.
Recall/ Sensitivity : Recall = TP / (TP + FN)
Out of all the actual ...
What are the disadvantages of logistic regression?
1. If the number of observations is less than the number of features, Logistic Regression should not be used; otherwise, it may lead to overfitting.
2. It constructs linear boundaries.
3. A major limitation of Logistic Regression is that it assumes a linear relationship between the independent variables and the log-odds of the dependent variable.
4. It can only be used to predict discrete functions. Hence, the dependent variable of Logistic Re
Tip 1 : Prepare the basics of ML, stats and SQL properly.
Tip 2 : Go through all the previous interview experiences from Codestudio and Leetcode.
Tip 3 : Do at least 2 good projects and you must know every bit of them.
Tip 1 : Have at least 2 good projects explained in short with all important points covered.
Tip 2 : Every skill must be mentioned.
Tip 3 : Focus on skills, projects and experiences more.
I applied via Referral and was interviewed before Nov 2020. There were 4 interview rounds.
Assumptions of linear regression
Linear relationship between independent and dependent variables
Homoscedasticity (constant variance) of errors
Independence of errors
Normal distribution of errors
No multicollinearity among independent variables
Multicollinearity in regression analysis affects the accuracy and interpretability of the model.
Multicollinearity occurs when two or more independent variables are highly correlated.
It leads to unstable and unreliable estimates of regression coefficients.
It reduces the precision of the estimates and increases the standard errors.
It makes it difficult to interpret the individual effects of the independent variables.
It c...
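One common way to quantify this is the variance inflation factor (VIF). The sketch below covers only the two-predictor case, where VIF reduces to 1 / (1 - r^2) with r the Pearson correlation between the predictors; the function names and data are illustrative assumptions.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def vif_two_predictors(x1, x2):
    """VIF when the model has exactly two predictors.

    Raises ZeroDivisionError if the predictors are perfectly correlated.
    """
    r = pearson(x1, x2)
    return 1.0 / (1.0 - r * r)


# Hypothetical predictors: x2_near is almost a multiple of x1, x2_weak is not.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
x2_near = [2.1, 3.9, 6.2, 7.8, 10.1]
x2_weak = [2.0, 5.0, 1.0, 4.0, 3.0]

print(vif_two_predictors(x1, x2_near))  # large: strong multicollinearity
print(vif_two_predictors(x1, x2_weak))  # close to 1: little multicollinearity
```

A VIF near 1 means the predictor's coefficient variance is barely inflated; values above roughly 5 to 10 are often treated as a warning sign, though that threshold is a rule of thumb rather than a fixed standard.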
Measures to check performance of classification model
Accuracy
Precision
Recall
F1 Score
ROC Curve
Confusion Matrix
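Most of these measures follow directly from the four confusion matrix counts. A minimal sketch, assuming binary classification and hypothetical counts (the function name is illustrative, and it does not guard against zero denominators):

```python
def classification_metrics(tp, tn, fp, fn):
    """Compute common metrics from binary confusion matrix counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)  # of predicted positives, how many are right
    recall = tp / (tp + fn)     # of actual positives, how many were found
    f1 = 2 * precision * recall / (precision + recall)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical counts for 100 predictions.
m = classification_metrics(tp=40, tn=45, fp=10, fn=5)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

The ROC curve is not covered here because it needs predicted scores at several thresholds rather than a single set of counts.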
Logistic regression assumes a linear relationship between the independent variables and the log-odds of the outcome.
May not perform well with non-linear data
May overfit or underfit the data
May be sensitive to outliers
May require large sample size for stable results
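The linearity limitation shows up in the decision boundary: the predicted probability crosses 0.5 exactly where the weighted sum of the inputs is zero, which is a straight line (or hyperplane). A small sketch with hypothetical, hand-picked weights rather than a fitted model:

```python
import math


def predict_proba(weights, bias, x):
    """Logistic model: sigmoid of a linear combination of the features."""
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical parameters: the boundary is the line x1 - x2 = 0.
weights = [1.0, -1.0]
bias = 0.0

on_boundary = predict_proba(weights, bias, [2.0, 2.0])
positive_side = predict_proba(weights, bias, [3.0, 1.0])
negative_side = predict_proba(weights, bias, [1.0, 3.0])
print(on_boundary, positive_side, negative_side)
```

Any point with x1 equal to x2 gets probability 0.5, so classes that are separated by a curve cannot be split correctly unless non-linear features are added by hand.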