IBM
I would rate myself a 4 in Python, with strong skills in data manipulation, analysis, and machine learning applications.
Data Manipulation: Proficient in using libraries like Pandas for data cleaning and transformation, e.g., merging datasets and handling missing values.
Data Visualization: Experienced with Matplotlib and Seaborn for creating insightful visualizations, such as scatter plots and heatmaps.
Machine Lear...
I was approached by the company and was interviewed in Aug 2024. There were 2 interview rounds.
Python is a high-level programming language known for its simplicity and readability.
Python is widely used for web development, data analysis, artificial intelligence, and scientific computing.
It emphasizes code readability and uses indentation to delimit blocks.
Python has a large standard library and a vibrant community of developers.
Example: print('Hello, World!')
Example: import pandas as pd
Code problems refer to issues or errors in the code that need to be identified and fixed.
Code problems can include syntax errors, logical errors, or performance issues.
Examples of code problems include missing semicolons, incorrect variable assignments, or inefficient algorithms.
Identifying and resolving code problems is a key skill for data scientists to ensure accurate and efficient data analysis.
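As an illustration of the categories above, here is a minimal sketch of one logical error and one performance issue, each with a fix. The function names and data are hypothetical and were not part of the interview answer.

```python
# Hypothetical examples of two kinds of code problems and their fixes.

def average_buggy(values):
    # Logical error: divides by len(values) - 1, so the mean comes out too large.
    return sum(values) / (len(values) - 1)


def average_fixed(values):
    # Fix: divide by the number of values and reject empty input explicitly.
    if not values:
        raise ValueError("values must not be empty")
    return sum(values) / len(values)


def common_items_slow(a, b):
    # Performance issue: `x in b` scans the list, so this is O(len(a) * len(b)).
    return [x for x in a if x in b]


def common_items_fast(a, b):
    # Fix: a set gives average O(1) membership checks.
    b_set = set(b)
    return [x for x in a if x in b_set]


print(average_buggy([2, 4, 6]), average_fixed([2, 4, 6]))
print(common_items_fast([1, 2, 3], [2, 3, 4]))
```

Both versions of `common_items` return the same result; only the running time differs, which is why performance problems are easy to miss on small test data.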
Python is a programming language used for data analysis, machine learning, and scientific computing.
Python code is written in a text editor or an integrated development environment (IDE).
Python code is executed using a Python interpreter.
Python code can be used for data manipulation, visualization, and modeling.
The project is a machine learning model to predict customer churn for a telecommunications company.
Developing predictive models using machine learning algorithms
Analyzing customer data to identify patterns and trends
Evaluating model performance and making recommendations for reducing customer churn
The question seems to be incomplete or misspelled.
It is possible that the interviewer made a mistake while asking the question.
Ask for clarification or context to provide a relevant answer.
Choosing the right model depends on data characteristics, problem complexity, and performance metrics.
Model performance: Some models may outperform others based on metrics like accuracy, precision, or recall. For example, Random Forest may perform better than Logistic Regression on complex datasets.
Data characteristics: The nature of the data (e.g., linear vs. non-linear relationships) influences model choice. For inst...
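To make the point about metrics concrete, the sketch below compares two hypothetical models on the same labels. The predictions are made-up illustration data, not results from any real model; the example only shows that two models can tie on accuracy while differing on precision and recall.

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, and recall for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    correct = sum(1 for t, p in pairs if t == p)
    return {
        "accuracy": correct / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Made-up labels and predictions for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
pred_a = [1, 0, 0, 1, 0, 0, 0, 0]  # hypothetical model A: predicts positive rarely
pred_b = [1, 1, 1, 1, 0, 1, 1, 0]  # hypothetical model B: predicts positive often

metrics_a = classification_metrics(y_true, pred_a)
metrics_b = classification_metrics(y_true, pred_b)
print("A:", metrics_a)
print("B:", metrics_b)
```

Which model is "right" then depends on the problem: if missing a positive case is costly, recall matters more; if false alarms are costly, precision does.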
DSA, ML, SQL, stats, DL
I appeared for an interview in Mar 2025, where I was asked the following questions.
RAG (Retrieval-Augmented Generation) combines retrieval of relevant data with generative models for enhanced information synthesis.
RAG uses a two-step process: retrieval of relevant documents followed by generation of responses based on those documents.
It leverages large language models (LLMs) to generate contextually relevant answers, improving accuracy and relevance.
For example, in a customer support chatbot, RAG can...
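The two-step process can be sketched in a few lines. This is a toy illustration under stated assumptions: retrieval is simple word overlap rather than embedding search, the documents are invented, and `generate` is a placeholder standing in for a real LLM call.

```python
import re
from collections import Counter

# Invented support snippets used as the knowledge base.
DOCUMENTS = [
    "Refunds are processed within 5 business days of approval.",
    "You can reset your password from the account settings page.",
    "Support is available by chat from 9am to 5pm on weekdays.",
]


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def retrieve(question, documents, top_k=1):
    """Step 1: rank documents by word overlap with the question."""
    q_counts = Counter(tokenize(question))

    def score(doc):
        d_counts = Counter(tokenize(doc))
        return sum(min(q_counts[w], d_counts[w]) for w in q_counts)

    return sorted(documents, key=score, reverse=True)[:top_k]


def build_prompt(question, context_docs):
    """Step 2a: place the retrieved text in the prompt as grounding context."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}\nAnswer:"
    )


def generate(prompt):
    """Step 2b: placeholder for an LLM call (assumption, no model is invoked)."""
    return f"[an LLM would answer here from a {len(prompt)}-character prompt]"


question = "How do I reset my password?"
docs = retrieve(question, DOCUMENTS)
prompt = build_prompt(question, docs)
print(prompt)
print(generate(prompt))
```

In a production system the overlap score would typically be replaced by a vector similarity search, but the retrieve-then-generate shape stays the same.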
Designing conversational flow involves structuring dialogue for clarity, engagement, and user satisfaction.
Define user goals: Understand what users want to achieve, e.g., booking an appointment.
Map out conversation paths: Create flowcharts to visualize possible dialogues.
Use natural language: Ensure the bot understands and responds in a human-like manner.
Incorporate error handling: Plan for misunderstandings and provid...
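One possible way to express such a flow in code is a small state machine, shown below for the appointment-booking goal. The states, prompts, and keyword matching are hypothetical simplifications; a real bot would use an NLU component instead of substring checks.

```python
DAYS = ("monday", "tuesday", "wednesday", "thursday", "friday")


def next_turn(state, user_text):
    """Return (new_state, bot_reply) for a single user message."""
    text = user_text.strip().lower()
    step = state.get("step", "start")

    if step == "start":
        if "book" in text or "appointment" in text:
            return {"step": "ask_day"}, "Sure. Which day works for you?"
        # Error handling: restate what the bot can do.
        return state, "Sorry, I can only help with booking an appointment."

    if step == "ask_day":
        for day in DAYS:
            if day in text:
                return {"step": "confirm", "day": day}, f"Book for {day.title()}? (yes/no)"
        # Error handling: re-prompt with an example instead of failing.
        return state, "I didn't catch the day. Please name a weekday, e.g. Monday."

    if step == "confirm":
        if text in ("yes", "y"):
            day = state["day"]
            return {"step": "done", "day": day}, f"Done. You are booked for {day.title()}."
        if text in ("no", "n"):
            return {"step": "ask_day"}, "No problem. Which day works for you?"
        return state, "Please answer yes or no."

    return state, "Your appointment is already booked."


state = {}
for message in ["I want to book an appointment", "umm", "Tuesday please", "yes"]:
    state, reply = next_turn(state, message)
    print(f"user: {message}\nbot:  {reply}")
```

Each branch corresponds to one node in the conversation flowchart, and every state has a fallback reply so a misunderstood message never ends the dialogue.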
I applied via Company Website and was interviewed in Nov 2024. There was 1 interview round.
It's really easy if you know Python well.
60-minute HackerRank test with one medium-difficulty MySQL question and one medium-to-slightly-hard Python question.
I appeared for an interview before Mar 2024, where I was asked the following questions.
Utilized advanced SQL queries for data analysis, aggregation, and reporting in various projects.
Used Common Table Expressions (CTEs) for recursive queries to analyze hierarchical data.
Implemented window functions like ROW_NUMBER() and RANK() for ranking patients based on their treatment outcomes.
Executed complex JOIN operations to merge data from multiple tables, enhancing data insights.
Applied subqueries for filtering...
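A recursive CTE over hierarchical data might look like the sketch below. It uses Python's built-in sqlite3 module so it runs without a database server; the `employees` table and its rows are invented for illustration and are not the candidate's actual schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, manager_id INTEGER);
    INSERT INTO employees VALUES
        (1, 'Asha', NULL),
        (2, 'Ben', 1),
        (3, 'Chen', 1),
        (4, 'Dev', 2);
    """
)

# Anchor member: the root (no manager). Recursive member: direct reports
# of rows already in the result, one level deeper each pass.
query = """
WITH RECURSIVE chain(id, name, depth) AS (
    SELECT id, name, 0 FROM employees WHERE manager_id IS NULL
    UNION ALL
    SELECT e.id, e.name, c.depth + 1
    FROM employees AS e
    JOIN chain AS c ON e.manager_id = c.id
)
SELECT name, depth FROM chain ORDER BY depth, name;
"""
rows = conn.execute(query).fetchall()
for name, depth in rows:
    print("  " * depth + name)
```

The same `WITH RECURSIVE` syntax is accepted by MySQL 8.0 and PostgreSQL, so the query carries over with little or no change.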
Window functions in SQL allow for performing calculations across a set of table rows related to the current row.
Window functions operate on a set of rows defined by an OVER() clause.
They do not change the number of rows returned by a query.
Common window functions include ROW_NUMBER(), RANK(), and SUM().
Example: SELECT name, salary, RANK() OVER (ORDER BY salary DESC) AS salary_rank FROM employees;
Window functions can be partit...
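The points above can be checked with a small runnable sketch. It uses Python's built-in sqlite3 module (window functions need SQLite 3.25 or newer) and an invented `employees` table; it contrasts RANK(), which repeats a rank on ties, with ROW_NUMBER(), which does not.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE employees (name TEXT, department TEXT, salary INTEGER);
    INSERT INTO employees VALUES
        ('Asha', 'Sales', 900),
        ('Ben', 'Sales', 900),
        ('Chen', 'Sales', 700),
        ('Dev', 'Ops', 800);
    """
)

# PARTITION BY restarts the ranking for each department.
query = """
SELECT name, department, salary,
       RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS salary_rank,
       ROW_NUMBER() OVER (PARTITION BY department ORDER BY salary DESC, name) AS row_num
FROM employees
ORDER BY department, row_num;
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

All four input rows come back, which illustrates that a window function adds a computed column without collapsing rows the way GROUP BY does.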
SQL queries are essential for data manipulation and retrieval in databases, enabling complex data analysis and reporting.
SELECT Statement: Used to retrieve data from a database. Example: SELECT * FROM employees WHERE department = 'Sales';
JOIN Operations: Combine rows from two or more tables based on a related column. Example: SELECT orders.id, customers.name FROM orders JOIN customers ON orders.customer_id = customers....
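A runnable version of the JOIN idea is sketched below with Python's built-in sqlite3 module. The tables and rows are invented, and because the join condition in the answer above is cut off, joining on `customers.id` is an assumption. The sketch also shows how a LEFT JOIN keeps orders that have no matching customer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER);
    INSERT INTO customers VALUES (1, 'Asha'), (2, 'Ben');
    INSERT INTO orders VALUES (10, 1), (11, 1), (12, 3);
    """
)

# Inner join: only orders whose customer_id matches a customer row.
# The key column customers.id is assumed for this example.
inner_rows = conn.execute(
    """
    SELECT orders.id, customers.name
    FROM orders
    JOIN customers ON orders.customer_id = customers.id
    ORDER BY orders.id
    """
).fetchall()

# Left join: every order, with NULL (None in Python) where no customer matches.
left_rows = conn.execute(
    """
    SELECT orders.id, customers.name
    FROM orders
    LEFT JOIN customers ON orders.customer_id = customers.id
    ORDER BY orders.id
    """
).fetchall()

print(inner_rows)
print(left_rows)
```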
I applied via Company Website and was interviewed in Nov 2023. There was 1 interview round.
I applied via Job Portal and was interviewed before Feb 2023. There was 1 interview round.
Hyperparameters in random forest are parameters that are set before the learning process begins.
Hyperparameters control the behavior of the random forest algorithm.
They are set by the data scientist and are not learned from the data.
Examples of hyperparameters in random forest include the number of trees, the maximum depth of trees, and the number of features considered at each split.
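Because hyperparameters are chosen before training, a common approach is to list candidate values and try each combination. The sketch below only enumerates the combinations; the parameter names follow scikit-learn's RandomForestClassifier (`n_estimators`, `max_depth`, `max_features`), the values are illustrative, and the model fitting step is deliberately left out.

```python
from itertools import product

# Candidate values are illustrative, not recommendations.
param_grid = {
    "n_estimators": [100, 300],     # number of trees
    "max_depth": [None, 10],        # maximum depth of each tree (None = unlimited)
    "max_features": ["sqrt", 0.5],  # features considered at each split
}


def expand_grid(grid):
    """Return one dict per combination of hyperparameter values."""
    keys = list(grid)
    return [dict(zip(keys, combo)) for combo in product(*(grid[k] for k in keys))]


candidates = expand_grid(param_grid)
print(len(candidates), "combinations")
print(candidates[0])
```

Each dict in `candidates` could be passed as keyword arguments to the model constructor and scored with cross-validation to pick the best setting.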
A QnA system with an LLM uses a large language model for information retrieval and question answering.
Preprocess the input question and convert it into a format suitable for the LLM.
Fine-tune the LLM on a dataset of question-answer pairs.
Use the fine-tuned model to generate answers for new questions.
Evaluate the performance of the QnA system using metrics like precision, recall, and F1 score.
Itera...
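For the evaluation step, one common way to apply precision, recall, and F1 to generated answers is to compare tokens in the predicted answer with tokens in a reference answer. The sketch below assumes that token-overlap formulation; the answer does not say which variant was intended, and the example strings are invented.

```python
import re
from collections import Counter


def tokens(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def token_prf(predicted, reference):
    """Token-overlap precision, recall, and F1 between two answer strings."""
    pred, ref = tokens(predicted), tokens(reference)
    # Multiset intersection counts each shared token at most as often as it
    # appears in both answers.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


precision, recall, f1 = token_prf("Paris is the capital", "Paris")
print(precision, recall, f1)
```

A verbose but correct answer scores high recall and low precision here, which is why F1 is usually reported alongside the other two.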
Unit testing is a process of testing individual units of code to ensure they function correctly.
Write test cases for each unit of code
Test inputs, outputs, and edge cases
Use testing frameworks like JUnit or pytest
Automate tests to run regularly
Ensure tests are independent, isolated, and repeatable
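A minimal sketch of these points in pytest style is shown below. The function under test, `normalize_scores`, is a hypothetical example; each test covers one behavior (typical input, empty input, an edge case) and shares no state with the others.

```python
def normalize_scores(scores):
    """Scale a list of numbers to the range [0, 1]."""
    if not scores:
        return []
    low, high = min(scores), max(scores)
    if low == high:
        # Edge case: all values equal, so avoid dividing by zero.
        return [0.0 for _ in scores]
    return [(s - low) / (high - low) for s in scores]


def test_typical_input():
    assert normalize_scores([10, 20, 30]) == [0.0, 0.5, 1.0]


def test_empty_input():
    assert normalize_scores([]) == []


def test_constant_input():
    assert normalize_scores([5, 5]) == [0.0, 0.0]
```

Saved in a file whose name starts with `test_`, these functions are discovered and run automatically by the `pytest` command, which makes it straightforward to run them on every commit.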
I applied via Company Website and was interviewed before Dec 2023. There were 2 interview rounds.
It was a HackerRank test with 2 questions.
The duration of the IBM Data Scientist interview process can vary, but it typically takes less than 2 weeks to complete.
based on 16 interview experiences