Sigmoid
I applied via Naukri.com and was interviewed in Jun 2024. There were 5 interview rounds.
Dropout helps prevent overfitting in neural networks by randomly setting a fraction of input units to zero during training.
Dropout helps in preventing overfitting by reducing the interdependence between neurons
It acts as a regularization technique by randomly setting a fraction of input units to zero during training
Dropout forces the network to learn redundant representations, making it more robust and generalizable
It ...
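The points above can be sketched in a few lines. This is a minimal illustration of "inverted" dropout in plain Python, not any framework's implementation; the function name `dropout` and its parameters are assumptions made for this example.

```python
import random

def dropout(activations, rate, training=True, rng=None):
    """Zero each unit with probability `rate` and rescale the survivors.

    Rescaling by 1 / (1 - rate) keeps the expected activation unchanged,
    so nothing needs to be adjusted at inference time.
    """
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    if not training or rate == 0.0:
        return list(activations)  # dropout is a no-op at inference time
    rng = rng or random.Random()
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]
```

During training each call draws a fresh random mask, which is what discourages neurons from relying on specific partners.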
XGBoost can handle missing values (NaN) by assigning them to a default direction during tree construction.
XGBoost treats NaN values as missing values and learns the best direction to go at each node to handle them
During tree construction, XGBoost sends NaN values in a default direction at each split, choosing the direction that gives the larger gain on the training data
This handling applies to missing values in input features only; the target variable must not contain missing values
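The idea of a learned default direction can be sketched as follows. This is a simplified illustration, not XGBoost's actual code: it evaluates one candidate threshold, omits the constant factor and the complexity penalty from the gain, and the names `leaf_score` and `best_default_direction` are assumptions made for this example.

```python
import math

def leaf_score(grad_sum, hess_sum, reg_lambda=1.0):
    # Structure score of a leaf given summed gradients and Hessians.
    return grad_sum * grad_sum / (hess_sum + reg_lambda)

def best_default_direction(values, grads, hess, threshold, reg_lambda=1.0):
    """Try sending all NaN rows left, then right, and keep the better gain."""
    gl = hl = gr = hr = gm = hm = 0.0
    for v, g, h in zip(values, grads, hess):
        if math.isnan(v):
            gm += g; hm += h          # rows with a missing feature value
        elif v < threshold:
            gl += g; hl += h
        else:
            gr += g; hr += h
    parent = leaf_score(gl + gr + gm, hl + hr + hm, reg_lambda)
    gain_left = (leaf_score(gl + gm, hl + hm, reg_lambda)
                 + leaf_score(gr, hr, reg_lambda) - parent)
    gain_right = (leaf_score(gl, hl, reg_lambda)
                  + leaf_score(gr + gm, hr + hm, reg_lambda) - parent)
    if gain_left >= gain_right:
        return "left", gain_left
    return "right", gain_right
```

In this sketch the rows with missing values end up on whichever side their gradients fit best, which is why no separate imputation step is required for the features.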
Utilize feature engineering techniques like one-hot encoding or target encoding to handle datasets with many categories.
Use feature engineering techniques like one-hot encoding to convert categorical variables into numerical values
Consider using target encoding to encode categorical variables based on the target variable
Apply dimensionality reduction techniques like PCA or LDA to reduce the number of features
Use tree-b...
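A minimal sketch of the encoding ideas above, using only the standard library. The helper names (`group_rare`, `one_hot`, `target_encode`) and the `smoothing` parameter are assumptions for illustration; grouping rare levels into one bucket is an additional common step, not something stated in the answer itself.

```python
from collections import Counter, defaultdict

def group_rare(categories, min_count=2, other="OTHER"):
    """Collapse low-frequency categories into a single bucket."""
    counts = Counter(categories)
    return [c if counts[c] >= min_count else other for c in categories]

def one_hot(categories):
    """Return the sorted levels and one 0/1 row per input value."""
    levels = sorted(set(categories))
    return levels, [[1 if c == lvl else 0 for lvl in levels] for c in categories]

def target_encode(categories, targets, smoothing=1.0):
    """Map each category to a smoothed mean of the target."""
    global_mean = sum(targets) / len(targets)
    sums, counts = defaultdict(float), Counter()
    for c, t in zip(categories, targets):
        sums[c] += t
        counts[c] += 1
    # Smoothing pulls categories with few rows toward the global mean.
    return {c: (sums[c] + smoothing * global_mean) / (counts[c] + smoothing)
            for c in counts}
```

Target encoding should be fitted on training data only (ideally per cross-validation fold), otherwise the encoded feature leaks the label.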
The case study involved building a churn model on an imbalanced dataset. The numerical features were correlated and contained many missing values, and their distributions were highly skewed. The categorical features contained many low-frequency categories. They wanted the final model's performance on a test dataset reported on KPIs of my choice (I chose F1-score).
I was interviewed in May 2024.
Questions based on ML, Python, and data visualization
TF-IDF is a numerical statistic that reflects the importance of a word in a document relative to a collection of documents.
TF-IDF stands for Term Frequency-Inverse Document Frequency
It is used in Natural Language Processing (NLP) to determine the importance of a word in a document
TF-IDF is calculated by multiplying the term frequency (TF) by the inverse document frequency (IDF)
It helps in identifying the most important
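The TF × IDF product described above can be sketched directly. This uses the plain textbook definition with whitespace tokenization; library implementations often use smoothed or normalized variants, and the function name `tf_idf` is an assumption for this example.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per document."""
    tokenized = [d.lower().split() for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each term appear?
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        scores.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores
```

A term that appears in every document gets an IDF of log(1) = 0, so common words such as "the" drop out while rarer terms score higher.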
ML, DL, Python, NLP, Data Visualization
I applied via Naukri.com and was interviewed before Dec 2023. There were 3 interview rounds.
Test of basic Python data structures, including lists, tuples, and dictionaries, as well as loops and conditional statements.
Framework and requirements for chatbot implementation.
My friends think of me as reliable, supportive, and always up for a good time.
Reliable - always there when they need help or support
Supportive - willing to listen and offer advice
Fun-loving - enjoys socializing and trying new things
I applied via Recruitment Consultant and was interviewed in Dec 2018. There were 3 interview rounds.
I chose Data Science field because of its potential to solve complex problems and make a positive impact on society.
Fascination with data and its potential to drive insights
Desire to solve complex problems and make a positive impact on society
Opportunity to work with cutting-edge technology and tools
Ability to work in a variety of industries and domains
Examples: Predictive maintenance in manufacturing, fraud detection
Linear Regression is used for predicting continuous numerical values, while Logistic Regression is used for predicting binary categorical values.
Linear Regression predicts a continuous output, while Logistic Regression predicts a binary output.
Linear Regression uses a linear equation to model the relationship between the independent and dependent variables, while Logistic Regression uses a logistic function.
Linear Regr...
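The difference between the two models shows up in the last step of prediction. The sketch below assumes already-fitted weights (the values a caller passes in are hypothetical) and only contrasts the two output functions.

```python
import math

def linear_predict(x, weights, bias):
    """Linear regression: the weighted sum is the prediction itself."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias

def logistic_predict_proba(x, weights, bias):
    """Logistic regression: the same weighted sum, squashed into (0, 1)."""
    z = linear_predict(x, weights, bias)
    # Numerically stable sigmoid: avoids overflow for large negative z.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)
```

A class label is then obtained by thresholding the probability, commonly at 0.5.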
Confusion matrix is a table used to evaluate the performance of a classification model.
For binary classification it is a 2x2 matrix that shows the number of true positives, false positives, true negatives, and false negatives; with N classes it is an NxN matrix.
It helps in calculating various metrics like accuracy, precision, recall, and F1 score.
It is useful in identifying the strengths and weaknesses of a model and improving its performance.
Example: In a binary classification p...
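The four counts and the metrics derived from them can be computed as below. This is a minimal sketch for the binary case; the function names are assumptions for this example.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, tn, fn) for a binary problem."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive:
            if t == positive:
                tp += 1
            else:
                fp += 1
        elif t == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn

def classification_metrics(tp, fp, tn, fn):
    """Accuracy, precision, recall and F1 (0.0 when a ratio is undefined)."""
    total = tp + fp + tn + fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": (tp + tn) / total, "precision": precision,
            "recall": recall, "f1": f1}
```

On imbalanced data, accuracy can look high while recall on the minority class is poor, which is why precision, recall, and F1 are usually reported alongside it.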
No, confusion matrix is not used in Linear Regression.
Confusion matrix is used to evaluate classification models.
Linear Regression is a regression model, not a classification model.
Evaluation metrics for Linear Regression include R-squared, Mean Squared Error, etc.
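For comparison, the two regression metrics named above can be computed like this. A minimal sketch; it assumes the true values are not all identical, since R-squared is undefined in that case.

```python
def mean_squared_error(y_true, y_pred):
    """Average squared difference between targets and predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def r_squared(y_true, y_pred):
    """1 minus the ratio of residual variation to total variation."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

A model that always predicts the mean of the targets scores an R-squared of 0, and a perfect model scores 1.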
KNN is a non-parametric algorithm used for classification and regression tasks.
KNN stands for K-Nearest Neighbors.
It works by finding the K closest data points to a given test point.
The class or value of the test point is then determined by the majority class or average value of the K neighbors.
KNN can be used for both classification and regression tasks.
It is a simple and easy-to-understand algorithm, but can be compu
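The steps above can be sketched as a brute-force classifier. It assumes Euclidean distance and a simple majority vote; the function name `knn_predict` is an assumption for this example.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Sort all training points by distance to the query: O(n log n) per query.
    neighbors = sorted(zip(train_X, train_y),
                       key=lambda pair: math.dist(pair[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]
```

Every prediction scans the whole training set, which is where the computational cost on large datasets comes from. For regression, the vote would be replaced by the average of the neighbors' values.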
Random Forest is an ensemble learning method that builds multiple decision trees and combines their outputs to improve accuracy.
Random Forest is a type of supervised learning algorithm used for classification and regression tasks.
It creates multiple decision trees and combines their outputs to make a final prediction.
Each decision tree is trained on a bootstrap sample of the data points and considers a random subset of features when splitting, which decorrelates the trees and reduces overfitting.
Ran...
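The bagging-and-voting idea can be sketched with a toy ensemble. Two simplifications are assumed here and are not how a real Random Forest works: each "tree" is a one-split decision stump, and the random feature subset is drawn once per tree rather than at every split. All names are hypothetical.

```python
import random
from collections import Counter

def fit_stump(X, y, feature_ids):
    """Pick the (feature, threshold) with the fewest training errors."""
    best = None
    for f in feature_ids:
        for threshold in sorted({row[f] for row in X}):
            left = [lab for row, lab in zip(X, y) if row[f] <= threshold]
            right = [lab for row, lab in zip(X, y) if row[f] > threshold]
            left_label = Counter(left).most_common(1)[0][0]
            right_label = Counter(right).most_common(1)[0][0] if right else left_label
            errors = (sum(lab != left_label for lab in left)
                      + sum(lab != right_label for lab in right))
            if best is None or errors < best[0]:
                best = (errors, f, threshold, left_label, right_label)
    return best[1:]

def fit_forest(X, y, n_trees=25, max_features=1, seed=0):
    rng = random.Random(seed)
    n_rows, n_features = len(X), len(X[0])
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n_rows) for _ in range(n_rows)]  # bootstrap sample
        feats = rng.sample(range(n_features), max_features)   # random features
        forest.append(fit_stump([X[i] for i in idx], [y[i] for i in idx], feats))
    return forest

def predict_forest(forest, row):
    """Majority vote over all stumps."""
    votes = Counter(left if row[f] <= t else right for f, t, left, right in forest)
    return votes.most_common(1)[0][0]
```

Because every stump sees a different sample and a different feature, their individual mistakes tend not to coincide, and the vote averages them out.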
I have worked on various projects involving data analysis, machine learning, and predictive modeling.
Developed a predictive model to forecast customer churn for a telecommunications company.
Built a recommendation system using collaborative filtering for an e-commerce platform.
Performed sentiment analysis on social media data to understand customer opinions and preferences.
Implemented a fraud detection system using anom...
General aptitude basics
MCQs and basic ML model building
I was approached by the company.
Transformers are a type of neural network architecture that utilizes self-attention mechanisms to process sequential data.
Transformers use self-attention mechanisms to weigh the importance of different input elements, allowing for parallel processing of sequences.
Unlike RNNs and LSTMs, Transformers do not rely on sequential processing, making them more efficient for long-range dependencies.
Transformers have been shown ...
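Self-attention can be sketched as scaled dot-product attention over small lists. This is a minimal single-head illustration without learned projection matrices; in a real Transformer, Q, K, and V are produced from the input by learned linear layers, which are omitted here.

```python
import math

def softmax(xs):
    m = max(xs)                              # subtract the max for stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(Q, K, V):
    """Each output row is a weighted average of the rows of V."""
    d = len(K[0])
    outputs = []
    for q in Q:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        outputs.append([sum(w * v[c] for w, v in zip(weights, V))
                        for c in range(len(V[0]))])
    return outputs
```

Each query row is handled independently of the others, which is what allows all positions in a sequence to be processed in parallel.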
Different types of Attention include self-attention, global attention, and local attention.
Self-attention focuses on relationships within the input sequence itself.
Global attention considers the entire input sequence when making predictions.
Local attention only attends to a subset of the input sequence at a time.
Examples include Transformer's self-attention mechanism, Bahdanau attention, and Luong attention.
GPT and BERT are both transformer models for natural language processing: GPT is a decoder-only generative model, while BERT is an encoder-only model built for language understanding.
GPT is an autoregressive model that predicts the next word in a sentence based on the previous words.
BERT is trained with masked language modeling and considers the context of a word by looking at the entire sentence.
GPT is unidirectional, while BERT is bidirectional.
GPT is better for text generation tasks, while BERT is better for understanding the cont
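The unidirectional versus bidirectional distinction comes down to which positions each token is allowed to attend to. A minimal sketch of the two visibility patterns; the function name `attention_mask` is an assumption for this example.

```python
def attention_mask(seq_len, causal):
    """mask[i][j] is True when position i may attend to position j."""
    return [[(not causal) or j <= i for j in range(seq_len)]
            for i in range(seq_len)]

# GPT-style (causal): each token sees only itself and earlier tokens.
gpt_style = attention_mask(3, causal=True)
# BERT-style (bidirectional): every token sees the whole sequence.
bert_style = attention_mask(3, causal=False)
```

The causal pattern is what makes left-to-right generation possible, while the full pattern gives each token context from both sides.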
Data scientists analyze data to gain insights, machine learning (ML) involves algorithms that improve automatically through experience, and artificial intelligence (AI) refers to machines mimicking human cognitive functions.
Data scientists analyze large amounts of data to uncover patterns and insights.
Machine learning involves developing algorithms that improve automatically through experience.
Artificial intelligence r...
I applied via Naukri.com and was interviewed in Jun 2024. There were 4 interview rounds.
The first round is a coding round with two use cases that need to be solved.