IBM
10+ EWNS Holidays Interview Questions and Answers
Q1. What are PCA, decision trees, and computer vision?
PCA is a dimensionality reduction technique, a decision tree is a supervised learning algorithm used for classification and regression, and computer vision is a field of study focused on enabling computers to interpret and understand visual information.
PCA reduces the number of variables in a dataset by projecting it onto the directions of greatest variance, retaining the most important information.
Decision trees classify data (or predict numeric values) by applying a sequence of learned rules and conditions.
Computer vision involves using algorithms and techniques to enable computers to interpr...
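The PCA bullet above can be illustrated with a minimal sketch for 2-D data only, using the closed-form eigenvalues of a 2x2 covariance matrix. The function name `pca_first_component` and the sample points are illustrative assumptions, and the sketch assumes the points are not all identical; real projects would normally use a library implementation.

```python
import math

def pca_first_component(points):
    """Return (unit direction, explained-variance ratio) of the first principal component."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    # Sample covariance matrix [[a, b], [b, c]] of the centered data.
    a = sum((p[0] - mean_x) ** 2 for p in points) / (n - 1)
    c = sum((p[1] - mean_y) ** 2 for p in points) / (n - 1)
    b = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points) / (n - 1)
    # Largest eigenvalue of a symmetric 2x2 matrix, in closed form.
    largest = (a + c) / 2 + math.sqrt(((a - c) / 2) ** 2 + b ** 2)
    # Matching eigenvector; fall back to an axis when the covariance is already diagonal.
    if b != 0:
        vx, vy = b, largest - a
    else:
        vx, vy = (1.0, 0.0) if a >= c else (0.0, 1.0)
    norm = math.hypot(vx, vy)
    return (vx / norm, vy / norm), largest / (a + c)

# Points lying on the line y = x: one direction carries all of the variance.
direction, explained = pca_first_component([(1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (4.0, 4.0)])
```

Projecting each centered point onto `direction` gives the reduced, one-variable representation of the data.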
Q2. What advanced SQL queries were used in your project?
Utilized advanced SQL queries for data analysis, aggregation, and reporting in various projects.
Used Common Table Expressions (CTEs) for recursive queries to analyze hierarchical data.
Implemented window functions like ROW_NUMBER() and RANK() for ranking patients based on their treatment outcomes.
Executed complex JOIN operations to merge data from multiple tables, enhancing data insights.
Applied subqueries for filtering datasets, such as finding patients with above-average tes...
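A minimal sketch of two of the techniques listed above, a recursive CTE over hierarchical data and a subquery filter, run against an in-memory SQLite database. The `departments` and `test_results` tables and their rows are hypothetical and are not taken from the project described in the answer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE departments (id INTEGER PRIMARY KEY, name TEXT, parent_id INTEGER);
    INSERT INTO departments VALUES
        (1, 'Hospital', NULL),
        (2, 'Cardiology', 1),
        (3, 'Cardiac ICU', 2),
        (4, 'Radiology', 1);
    CREATE TABLE test_results (patient TEXT, score REAL);
    INSERT INTO test_results VALUES ('A', 50), ('B', 70), ('C', 90);
""")

# Recursive CTE: start at the root row and repeatedly join children onto it,
# recording how deep each department sits in the hierarchy.
hierarchy = conn.execute("""
    WITH RECURSIVE tree(id, name, depth) AS (
        SELECT id, name, 0 FROM departments WHERE parent_id IS NULL
        UNION ALL
        SELECT d.id, d.name, t.depth + 1
        FROM departments AS d
        JOIN tree AS t ON d.parent_id = t.id
    )
    SELECT name, depth FROM tree ORDER BY depth, name
""").fetchall()

# Subquery: keep only the rows whose score is above the overall average.
above_average = conn.execute("""
    SELECT patient FROM test_results
    WHERE score > (SELECT AVG(score) FROM test_results)
    ORDER BY patient
""").fetchall()
```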
Q3. What are hyperparameters in random forest?
Hyperparameters in random forest are parameters that are set before the learning process begins.
Hyperparameters control the behavior of the random forest algorithm.
They are set by the data scientist and are not learned from the data.
Examples of hyperparameters in random forest include the number of trees, the maximum depth of trees, and the number of features considered at each split.
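To make the distinction concrete, here is a toy sketch of a forest of one-split trees (stumps) in plain Python. It is an illustration under simplifying assumptions, not a production random forest: `n_trees` and `max_features` are passed in by the caller (hyperparameters), the maximum depth is fixed at 1, and only the split feature, threshold, and leaf labels are learned from the data. All function names and the training rows are hypothetical.

```python
import random
from collections import Counter

def majority(labels):
    return Counter(labels).most_common(1)[0][0]

def fit_stump(rows, labels, feature_ids):
    """Learn a depth-1 tree: the (feature, threshold) split with the fewest training errors."""
    best = None
    for f in feature_ids:
        values = sorted({row[f] for row in rows})
        # Candidate thresholds are midpoints between consecutive distinct values.
        for low, high in zip(values, values[1:]):
            threshold = (low + high) / 2
            left = [y for row, y in zip(rows, labels) if row[f] <= threshold]
            right = [y for row, y in zip(rows, labels) if row[f] > threshold]
            left_label, right_label = majority(left), majority(right)
            errors = sum(y != left_label for y in left) + sum(y != right_label for y in right)
            if best is None or errors < best[0]:
                best = (errors, f, threshold, left_label, right_label)
    if best is None:  # no feature varies in this sample: always predict the majority class
        label = majority(labels)
        return (feature_ids[0], 0.0, label, label)
    return best[1:]

def fit_forest(rows, labels, n_trees, max_features, seed=0):
    """n_trees and max_features are hyperparameters: chosen by the caller, not learned."""
    rng = random.Random(seed)
    n_features = len(rows[0])
    forest = []
    for _ in range(n_trees):
        sample = [rng.randrange(len(rows)) for _ in rows]          # bootstrap sample of row indices
        feature_ids = rng.sample(range(n_features), max_features)  # random feature subset
        forest.append(fit_stump([rows[i] for i in sample], [labels[i] for i in sample], feature_ids))
    return forest

def predict(forest, row):
    """Each stump votes; the forest returns the most common vote."""
    votes = [left if row[f] <= threshold else right for f, threshold, left, right in forest]
    return majority(votes)

rows = [[0, 7], [1, 7], [2, 7], [8, 7], [9, 7], [10, 7]]
labels = [0, 0, 0, 1, 1, 1]
forest = fit_forest(rows, labels, n_trees=25, max_features=2)
```

Changing `n_trees` or `max_features` changes how the forest is built, but neither value is ever updated by the training loop itself.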
Q4. What is the Python language?
Python is a high-level programming language known for its simplicity and readability.
Python is widely used for web development, data analysis, artificial intelligence, and scientific computing.
It emphasizes code readability and uses indentation, rather than braces, to delimit blocks.
Python has a large standard library and a vibrant community of developers.
Example: print('Hello, World!')
Example: import pandas as pd
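A short sketch of how indentation delimits blocks; the function name `count_even` and the input list are illustrative.

```python
def count_even(numbers):
    """Count the even values in an iterable of integers."""
    total = 0
    for n in numbers:      # the loop body is the indented block below
        if n % 2 == 0:     # the if body is indented one level further
            total += 1
    return total           # dedenting back to this level ends the loop block

print(count_even([1, 2, 3, 4]))  # prints 2
```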
Q5. What are code problems?
Code problems refer to issues or errors in the code that need to be identified and fixed.
Code problems can include syntax errors, logical errors, or performance issues.
Examples of code problems include missing semicolons (in languages that require them), incorrect variable assignments, or inefficient algorithms.
Identifying and resolving code problems is a key skill for data scientists to ensure accurate and efficient data analysis.
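The three kinds of problem named above can each be shown in a few lines. This is an illustrative sketch; every function name here is hypothetical.

```python
def is_valid_syntax(source):
    """Syntax error: the code cannot even be compiled."""
    try:
        compile(source, "<example>", "exec")
        return True
    except SyntaxError:
        return False

def mean_buggy(values):
    # Logical error: divides by a hard-coded 2, so it is only right for two values.
    return sum(values) / 2

def mean_fixed(values):
    return sum(values) / len(values)

def has_duplicates_slow(items):
    # Performance issue: compares every pair of items, O(n^2) comparisons.
    return any(items[i] == items[j]
               for i in range(len(items))
               for j in range(i + 1, len(items)))

def has_duplicates_fast(items):
    # Same answer for hashable items, in roughly O(n) time using a set.
    return len(set(items)) != len(items)
```

The logical error is the hardest of the three to spot, because `mean_buggy` runs without any error message and simply returns the wrong number.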
Q6. What is Python code?
Python code is source text written in the Python language; it is commonly used for data analysis, machine learning, and scientific computing.
Python code is written in a text editor or an integrated development environment (IDE).
Python code is executed by a Python interpreter.
Python code can be used for data manipulation, visualization, and modeling.
Q7. What is the lnderssip
The question seems to be incomplete or misspelled.
It is possible that the interviewer made a mistake while asking the question.
Ask for clarification or context to provide a relevant answer.
Q8. How to build a QnA system with an LLM
A QnA system with an LLM uses a large language model (LLM) to answer questions, optionally combined with information retrieval.
Preprocess the input question and convert it into a format suitable for the LLM.
Fine-tune the LLM on a dataset of question-answer pairs.
Use the fine-tuned model to generate answers for new questions.
Evaluate the performance of the QnA system using metrics like precision, recall, and F1 score.
Iteratively improve the system by refining the training data an...
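The answer does not say how precision, recall, and F1 are computed for generated answers. One common convention in question answering, assumed here, is to score token overlap between the predicted answer and a reference answer. The function name `token_f1` and the example strings are illustrative.

```python
from collections import Counter

def token_f1(predicted, reference):
    """Return (precision, recall, F1) based on tokens shared by the two answers."""
    pred_tokens = predicted.lower().split()
    ref_tokens = reference.lower().split()
    # Multiset intersection: each shared token counts as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / len(pred_tokens)   # share of predicted tokens that are correct
    recall = overlap / len(ref_tokens)       # share of reference tokens that were produced
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

precision, recall, f1 = token_f1("the capital is Paris", "Paris")
```

Here the prediction contains the whole reference (recall 1.0) but three extra tokens (precision 0.25), so F1 penalizes the verbose answer.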
Q9. Write some SQL queries on given situations
Q10. How to design conversational flow
Designing conversational flow involves structuring dialogue for clarity, engagement, and user satisfaction.
Define user goals: Understand what users want to achieve, e.g., booking an appointment.
Map out conversation paths: Create flowcharts to visualize possible dialogues.
Use natural language: Ensure the bot understands and responds in a human-like manner.
Incorporate error handling: Plan for misunderstandings and provide fallback options.
Test and iterate: Continuously refine t...
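The mapped conversation paths and the fallback handling described above can be sketched as a small state machine. The states, prompts, and keywords below are hypothetical and follow the appointment-booking example; a real bot would use intent detection rather than exact keyword matches.

```python
# Each state has a prompt and the user inputs that lead to another state.
FLOW = {
    "start": {"prompt": "Hi! Would you like to book an appointment?",
              "transitions": {"yes": "ask_date", "no": "goodbye"}},
    "ask_date": {"prompt": "Which day works for you?",
                 "transitions": {"monday": "confirm", "tuesday": "confirm"}},
    "confirm": {"prompt": "Your appointment is booked.", "transitions": {}},
    "goodbye": {"prompt": "Okay, goodbye!", "transitions": {}},
}
FALLBACK = "Sorry, I didn't understand that."

def step(state, user_text):
    """Return (next_state, reply). Unknown input keeps the state and repeats the prompt."""
    transitions = FLOW[state]["transitions"]
    key = user_text.strip().lower()
    if key in transitions:
        next_state = transitions[key]
        return next_state, FLOW[next_state]["prompt"]
    # Error handling: stay in the same state and re-ask after the fallback message.
    return state, FALLBACK + " " + FLOW[state]["prompt"]
```

Keeping the flow in a plain dictionary makes it easy to review every path and to add states while testing and iterating.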
Q11. How to do unit testing
Unit testing is the process of testing individual units of code in isolation to ensure they function correctly.
Write test cases for each unit of code.
Test inputs, outputs, and edge cases.
Use testing frameworks like JUnit (Java) or pytest (Python).
Automate tests to run regularly.
Ensure tests are independent, isolated, and repeatable.
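A minimal sketch of these points using Python's built-in `unittest` module. The function under test, `safe_divide`, and the helper `run_tests` are hypothetical examples.

```python
import unittest

def safe_divide(a, b):
    """Divide a by b, returning None instead of raising when b is zero."""
    if b == 0:
        return None
    return a / b

class SafeDivideTests(unittest.TestCase):
    # Each test checks one behavior and shares no state with the others.
    def test_typical_input(self):
        self.assertEqual(safe_divide(6, 3), 2)

    def test_negative_input(self):
        self.assertEqual(safe_divide(-6, 3), -2)

    def test_edge_case_zero_divisor(self):
        self.assertIsNone(safe_divide(1, 0))

def run_tests():
    """Load and run the test case programmatically, returning the result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(SafeDivideTests)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

In practice the same test class is usually run from the command line with `python -m unittest`, and pytest can also collect `unittest.TestCase` classes, which makes it straightforward to automate in a CI job.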
Q12. Rate yourself in Python from 0-5
Q13. Window functions in SQL
Window functions in SQL allow for performing calculations across a set of table rows related to the current row.
Window functions operate on a set of rows defined by an OVER() clause.
They do not change the number of rows returned by a query.
Common window functions include ROW_NUMBER(), RANK(), and SUM().
Example: SELECT name, salary, RANK() OVER (ORDER BY salary DESC) AS salary_rank FROM employees;
PARTITION BY divides the rows into groups, and the window function is computed separately within each group.
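A runnable sketch of partitioned window functions, using Python's `sqlite3` module with an in-memory database. It assumes the bundled SQLite is version 3.25 or newer (the first release with window functions); the `employees` rows are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (name TEXT, department TEXT, salary INTEGER);
    INSERT INTO employees VALUES
        ('Asha', 'Data', 120),
        ('Ben', 'Data', 100),
        ('Chen', 'Data', 100),
        ('Dana', 'Sales', 90),
        ('Eli', 'Sales', 80);
""")

# RANK() restarts in every department; SUM() repeats the department total on each row.
rows = conn.execute("""
    SELECT name,
           department,
           RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS salary_rank,
           SUM(salary) OVER (PARTITION BY department) AS department_total
    FROM employees
    ORDER BY department, salary_rank, name
""").fetchall()
```

The query returns all five rows, unlike a GROUP BY, and the two employees tied on salary share rank 2 within their department.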
Q14. Describe RAG approach
RAG (Retrieval-Augmented Generation) combines retrieval of relevant data with generative models for enhanced information synthesis.
RAG uses a two-step process: retrieval of relevant documents followed by generation of responses based on those documents.
It leverages large language models (LLMs) to generate contextually relevant answers, improving accuracy and relevance.
For example, in a customer support chatbot, RAG can pull information from a knowledge base and generate tailo...
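The two-step process can be sketched without any external services. This is a toy illustration under stated assumptions: simple word overlap stands in for the embedding-based retrieval a real system would typically use, the knowledge base entries are invented, and the final call to a generative model is left out, so the sketch stops at the prompt that such a model would receive.

```python
KNOWLEDGE_BASE = [
    "Refunds are issued within 5 business days of approval.",
    "Password resets are done from the account settings page.",
    "Support is available by chat from 9am to 5pm on weekdays.",
]

def tokenize(text):
    """Lowercase words with surrounding punctuation removed."""
    return {word.strip(".,?!").lower() for word in text.split()}

def retrieve(question, documents, top_k=1):
    """Step 1: rank documents by how many words they share with the question."""
    question_words = tokenize(question)
    ranked = sorted(documents,
                    key=lambda doc: len(question_words & tokenize(doc)),
                    reverse=True)
    return ranked[:top_k]

def build_prompt(question, documents):
    """Step 2: place the retrieved text in the prompt given to the generative model."""
    context = "\n".join(documents)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

question = "How long do refunds take?"
prompt = build_prompt(question, retrieve(question, KNOWLEDGE_BASE))
```

Because the model is asked to answer from the retrieved context, its response can reflect knowledge base content that was never part of its training data.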