Ericsson
10+ Intelligent Medical Billing Solutions Interview Questions and Answers
What are the assumptions of linear regression model?
What problems do multicollinearity in regression analysis cause?
What are different measures used to check performance of classification model?
What are disadvantage of logistics regression?
Q5. What are different measures used to check performance of classification model
Measures to check performance of classification model
Accuracy
Precision
Recall
F1 Score
ROC Curve
Confusion Matrix
Q6. What is effect of multicollinearity ik regression analysis?
Multicollinearity in regression analysis affects the accuracy and interpretability of the model.
Multicollinearity occurs when two or more independent variables are highly correlated.
It leads to unstable and unreliable estimates of regression coefficients.
It reduces the precision of the estimates and increases the standard errors.
It makes it difficult to interpret the individual effects of the independent variables.
It can be detected using correlation matrix, variance inflatio...read more
Q7. isolatn forest work? evalution metrics in laymann tems , pyspark basics , job lib
Isolation Forest is an anomaly detection algorithm that works by isolating outliers in a dataset.
Isolation Forest is an unsupervised machine learning algorithm used for anomaly detection.
It works by randomly selecting a feature and then randomly selecting a split value between the maximum and minimum values of the selected feature.
The number of splits required to isolate an outlier is used as a measure of its abnormality.
Evaluation metrics for Isolation Forest in layman's ter...read more
Q8. What are disadvantage of logistics regression?
Logistic regression assumes linear relationship between independent and dependent variables.
May not perform well with non-linear data
May overfit or underfit the data
May be sensitive to outliers
May require large sample size for stable results
Q9. Difference between Data Warehouse, Data Lakes and Tables
Data Warehouse stores structured data for reporting and analysis, Data Lakes store raw and unstructured data, Tables are basic data structures.
Data Warehouse is used for storing structured data from various sources for reporting and analysis.
Data Lakes store raw and unstructured data in its native format for future processing and analysis.
Tables are basic data structures used to organize and store data in a structured format.
Examples: Data Warehouse - Amazon Redshift, Data La...read more
Q10. Query difference betweeen Group by and Partition by
Group by is used to group rows that have the same values into summary rows, while Partition by is used to divide the result set into partitions to which the function is applied separately.
Group by is used with aggregate functions to group rows based on a column or set of columns.
Partition by is used with window functions to divide the result set into partitions.
Group by is used with SELECT statement, while Partition by is used within the OVER() clause in a window function.
Exa...read more
Q11. State assumptions of linear regression?
Assumptions of linear regression
Linear relationship between independent and dependent variables
Homoscedasticity (constant variance) of errors
Independence of errors
Normal distribution of errors
No multicollinearity among independent variables
More about working at Ericsson
Top HR Questions asked in Intelligent Medical Billing Solutions
Interview Process at Intelligent Medical Billing Solutions
Top Data Scientist Interview Questions from Similar Companies
Reviews
Interviews
Salaries
Users/Month