Info Edge
ITC's Agri Business Division Interview Questions and Answers
Q1. 1) explain correlation and convaraince 2) how logistic differ from linear regression
Correlation measures the strength and direction of a linear relationship between two variables, while covariance measures the extent to which two variables change together.
Correlation ranges from -1 to 1, where 1 indicates a perfect positive relationship, -1 indicates a perfect negative relationship, and 0 indicates no relationship.
Covariance can be positive, negative, or zero. A positive covariance indicates that as one variable increases, the other variable also tends to in...read more
Q2. Explain all of decision tree and random forest?
Decision tree is a tree-like model of decisions and their possible consequences, while random forest is an ensemble learning method that builds multiple decision trees and merges them together.
Decision tree is a flowchart-like structure where each internal node represents a decision based on an attribute, each branch represents the outcome of the decision, and each leaf node represents a class label.
Random forest is a collection of decision trees trained on random subsets of ...read more
Q3. What is Linearregression
Linear regression is a statistical method used to model the relationship between a dependent variable and one or more independent variables.
Linear regression is used to predict the value of a dependent variable based on the value of one or more independent variables.
It assumes a linear relationship between the independent and dependent variables.
The goal of linear regression is to find the best-fitting line that minimizes the sum of squared differences between the observed an...read more
Q4. What is random forest
Random forest is an ensemble learning method used for classification and regression tasks.
Random forest is a collection of decision trees that are trained on random subsets of the data.
Each tree in the random forest independently predicts the target variable, and the final prediction is made by averaging the predictions of all trees.
Random forest is robust to overfitting and noisy data, and it can handle large datasets with high dimensionality.
Random forest can be used for bo...read more
Q5. WHat is xgboost
XGBoost is an optimized distributed gradient boosting library designed for efficient and accurate large-scale machine learning.
XGBoost stands for eXtreme Gradient Boosting.
It is a popular machine learning algorithm known for its speed and performance.
XGBoost is used for regression, classification, ranking, and user-defined prediction problems.
It is based on the gradient boosting framework and uses decision trees as base learners.
XGBoost has implementations in various programm...read more
Q6. Describe LSTM and GRU
LSTM and GRU are types of recurrent neural networks used for processing sequential data.
LSTM (Long Short-Term Memory) networks are capable of learning long-term dependencies in data.
GRU (Gated Recurrent Unit) networks are simpler than LSTM and have fewer parameters.
LSTM has three gates (input, output, forget) while GRU has two gates (update, reset).
LSTM is better at capturing long-term dependencies but is more complex, while GRU is simpler and faster to train.
Both LSTM and GR...read more
Q7. Define Hypothesis Testing
Hypothesis testing is a statistical method used to make inferences about a population based on sample data.
Hypothesis testing involves formulating a null hypothesis and an alternative hypothesis.
It aims to determine if there is enough evidence to reject the null hypothesis in favor of the alternative hypothesis.
Common methods of hypothesis testing include t-tests, chi-square tests, and ANOVA.
The p-value is used to determine the significance of the results obtained from hypoth...read more
Interview Process at ITC's Agri Business Division
Top Data Scientist Interview Questions from Similar Companies
Reviews
Interviews
Salaries
Users/Month