Filter interviews by
Top trending discussions
My friends think of me as reliable, supportive, and always up for a good time.
Reliable - always there when they need help or support
Supportive - willing to listen and offer advice
Fun-loving - enjoys socializing and trying new things
I applied via Recruitment Consultant and was interviewed in Dec 2018. There were 3 interview rounds.
I chose Data Science field because of its potential to solve complex problems and make a positive impact on society.
Fascination with data and its potential to drive insights
Desire to solve complex problems and make a positive impact on society
Opportunity to work with cutting-edge technology and tools
Ability to work in a variety of industries and domains
Examples: Predictive maintenance in manufacturing, fraud detection
Linear Regression is used for predicting continuous numerical values, while Logistic Regression is used for predicting binary categorical values.
Linear Regression predicts a continuous output, while Logistic Regression predicts a binary output.
Linear Regression uses a linear equation to model the relationship between the independent and dependent variables, while Logistic Regression uses a logistic function.
Linear Regr...
Confusion matrix is a table used to evaluate the performance of a classification model.
It is a 2x2 matrix that shows the number of true positives, false positives, true negatives, and false negatives.
It helps in calculating various metrics like accuracy, precision, recall, and F1 score.
It is useful in identifying the strengths and weaknesses of a model and improving its performance.
Example: In a binary classification p...
No, confusion matrix is not used in Linear Regression.
Confusion matrix is used to evaluate classification models.
Linear Regression is a regression model, not a classification model.
Evaluation metrics for Linear Regression include R-squared, Mean Squared Error, etc.
KNN is a non-parametric algorithm used for classification and regression tasks.
KNN stands for K-Nearest Neighbors.
It works by finding the K closest data points to a given test point.
The class or value of the test point is then determined by the majority class or average value of the K neighbors.
KNN can be used for both classification and regression tasks.
It is a simple and easy-to-understand algorithm, but can be compu
Random Forest is an ensemble learning method that builds multiple decision trees and combines their outputs to improve accuracy.
Random Forest is a type of supervised learning algorithm used for classification and regression tasks.
It creates multiple decision trees and combines their outputs to make a final prediction.
Each decision tree is built using a random subset of features and data points to reduce overfitting.
Ran...
I have worked on various projects involving data analysis, machine learning, and predictive modeling.
Developed a predictive model to forecast customer churn for a telecommunications company.
Built a recommendation system using collaborative filtering for an e-commerce platform.
Performed sentiment analysis on social media data to understand customer opinions and preferences.
Implemented a fraud detection system using anom...
I was interviewed before Feb 2024.
I applied via Company Website and was interviewed before Oct 2023. There was 1 interview round.
ML questions alogorithims
I applied via Campus Placement and was interviewed in Feb 2024. There were 2 interview rounds.
Coding test which requires python and few aptitude and technical questions
I applied via Naukri.com and was interviewed in Aug 2024. There were 2 interview rounds.
Handle missing values by imputation, deletion, or using algorithms that can handle missing data.
Impute missing values using mean, median, mode, or predictive modeling
Delete rows or columns with missing values if they are insignificant
Use algorithms like XGBoost, Random Forest, or LightGBM that can handle missing data
Precision measures the accuracy of positive predictions, while recall measures the ability to find all positive instances.
Precision is the ratio of correctly predicted positive observations to the total predicted positives.
Recall is the ratio of correctly predicted positive observations to all actual positives.
Precision is important when the cost of false positives is high, while recall is important when the cost of fa...
I applied via Campus Placement
Aptitude + Technical Questions
Coding Questions of SQL and DSA
To transpose a matrix in Python, use numpy.transpose() or the T attribute.
Use numpy.transpose() function to transpose a matrix.
Alternatively, use the T attribute of a numpy array.
Example: np.transpose(matrix) or matrix.T
NumPy is a library for numerical computing in Python, while Pandas is a data manipulation and analysis tool.
NumPy provides support for large, multi-dimensional arrays and matrices, along with a collection of mathematical functions to operate on these arrays.
Pandas offers data structures like DataFrame for easy data manipulation and analysis, with tools for reading and writing data from various file formats.
Both librari...
Python has various visualization libraries like Matplotlib, Seaborn, Plotly, and Bokeh.
Matplotlib is a widely used library for creating static, interactive, and animated plots.
Seaborn is built on top of Matplotlib and provides a high-level interface for creating attractive and informative statistical graphics.
Plotly is great for creating interactive plots and dashboards.
Bokeh is another interactive visualization librar...
based on 2 reviews
Rating in categories
Senior Data Science Engineer
44
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Scientist
41
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Science Engineer
31
salaries
| ₹0 L/yr - ₹0 L/yr |
Senior Data Scientist
22
salaries
| ₹0 L/yr - ₹0 L/yr |
Lead Data Consultant
16
salaries
| ₹0 L/yr - ₹0 L/yr |
Fractal Analytics
Mu Sigma
Tiger Analytics
LatentView Analytics