i
IBM
Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards
Filter interviews by
I applied via Job Portal and was interviewed before Feb 2023. There was 1 interview round.
Hyperparameters in random forest are parameters that are set before the learning process begins.
Hyperparameters control the behavior of the random forest algorithm.
They are set by the data scientist and are not learned from the data.
Examples of hyperparameters in random forest include the number of trees, the maximum depth of trees, and the number of features considered at each split.
A QnA system with LLM is a system that uses the Language Model for Information Retrieval and Question Answering.
Preprocess the input question and convert it into a format suitable for the LLM model.
Fine-tune the LLM model on a dataset of question-answer pairs.
Use the fine-tuned model to generate answers for new questions.
Evaluate the performance of the QnA system using metrics like precision, recall, and F1 score.
Itera...
Unit testing is a process of testing individual units of code to ensure they function correctly.
Write test cases for each unit of code
Test inputs, outputs, and edge cases
Use testing frameworks like JUnit or pytest
Automate tests to run regularly
Ensure tests are independent, isolated, and repeatable
I applied via Recruitment Consultant and was interviewed in Sep 2020. There were 3 interview rounds.
I applied via Campus Placement and was interviewed before Sep 2020. There were 3 interview rounds.
I applied via Approached by Company and was interviewed in May 2022. There were 3 interview rounds.
Outliers can be handled by removing, transforming or imputing them. Imbalanced datasets can be handled by resampling techniques. Feature engineering involves creating new features from existing ones.
Outliers can be removed using statistical methods like z-score or IQR.
Outliers can be transformed using techniques like log transformation or box-cox transformation.
Outliers can be imputed using techniques like mean imputat...
I applied via Approached by Company and was interviewed before Feb 2023. There was 1 interview round.
Hypothesis testing is a statistical method used to make inferences about a population based on sample data.
It involves formulating a hypothesis about a population parameter, collecting data, and using statistical tests to determine if the data supports or rejects the hypothesis.
There are two types of hypotheses: null hypothesis (H0) and alternative hypothesis (H1).
Common statistical tests for hypothesis testing include...
Null hypothesis is a statement that there is no significant difference or relationship between variables being studied.
Null hypothesis is typically denoted as H0 in statistical hypothesis testing.
It is the default assumption that there is no effect or relationship.
The alternative hypothesis (Ha) is the opposite of the null hypothesis.
For example, in a study testing a new drug, the null hypothesis would be that the drug...
Supervised learning uses labeled data to train a model, while unsupervised learning uses unlabeled data.
Supervised learning requires labeled data for training
Unsupervised learning does not require labeled data
Examples of supervised learning include classification and regression
Examples of unsupervised learning include clustering and dimensionality reduction
I applied via Company Website and was interviewed before Jan 2020. There was 1 interview round.
I applied via Company Website and was interviewed in Aug 2021. There was 1 interview round.
The choice of ML model depends on the problem, data, and desired outcome.
Consider the problem type: classification, regression, clustering, etc.
Analyze the data: size, quality, features, and target variable.
Evaluate model performance: accuracy, precision, recall, F1-score.
Consider interpretability, scalability, and computational requirements.
Experiment with multiple models: decision trees, SVM, neural networks, etc.
Use...
I applied via Referral and was interviewed in Mar 2021. There were 4 interview rounds.
Data science is the field of extracting insights and knowledge from data using various techniques and tools.
Data science involves collecting, cleaning, and analyzing data to extract insights.
It uses various techniques such as machine learning, statistical modeling, and data visualization.
Data science is used in various fields such as finance, healthcare, and marketing.
Examples of data science applications include fraud...
Python and R are programming languages commonly used in data science and statistical analysis.
Python is a general-purpose language with a large community and many libraries for data manipulation and machine learning.
R is a language specifically designed for statistical computing and graphics, with a wide range of packages for data analysis and visualization.
Both languages are popular choices for data scientists and hav...
The duration of IBM Data Scientist interview process can vary, but typically it takes about less than 2 weeks to complete.
based on 1 interview
Interview experience
based on 61 reviews
Rating in categories
Application Developer
11.9k
salaries
| ₹5.7 L/yr - ₹25.8 L/yr |
Software Engineer
5.6k
salaries
| ₹7.1 L/yr - ₹22.3 L/yr |
Advisory System Analyst
5.2k
salaries
| ₹9.5 L/yr - ₹28.4 L/yr |
Senior Software Engineer
5k
salaries
| ₹8.1 L/yr - ₹32 L/yr |
Software Developer
4.8k
salaries
| ₹11.5 L/yr - ₹32 L/yr |
Oracle
TCS
Cognizant
Accenture