Data Analyst

filter-iconFilter interviews by

1000+ Data Analyst Interview Questions and Answers

Updated 11 Mar 2025

Popular Companies

search-icon

Q151. What could be the possible reasons for delay in delivery?

Ans.

Possible reasons for delay in delivery

  • Logistical issues such as traffic or weather

  • Inadequate inventory management

  • Production delays

  • Customs clearance delays

  • Incorrect address or contact information

  • Technical issues with delivery vehicles or equipment

Q152. What do you know about sql commands?

Ans.

SQL commands are used to manipulate and retrieve data from relational databases.

  • SQL stands for Structured Query Language

  • Common SQL commands include SELECT, INSERT, UPDATE, and DELETE

  • SQL commands are used to create, modify, and delete database objects such as tables, views, and indexes

  • Examples: SELECT * FROM customers; INSERT INTO orders (customer_id, order_date) VALUES (1, '2021-01-01'); UPDATE products SET price = 10.99 WHERE id = 1; DELETE FROM customers WHERE id = 1;

Q153. What are dis advantages of data analytics? Difference between data science and data analyst?

Ans.

Disadvantages of data analytics include potential privacy concerns, data quality issues, and the need for skilled professionals. Data science involves more advanced techniques and focuses on predictive modeling.

  • Privacy concerns: Data analytics may involve handling sensitive information, raising privacy issues for individuals or organizations.

  • Data quality issues: Inaccurate or incomplete data can lead to incorrect analysis and decision-making.

  • Skilled professionals required: Da...read more

Q154. What are the algorithms you know in machine learning and their details ?

Ans.

Various machine learning algorithms with brief details

  • Supervised Learning: Linear Regression, Logistic Regression, Support Vector Machines (SVM), Decision Trees, Random Forest

  • Unsupervised Learning: K-means Clustering, Hierarchical Clustering, Principal Component Analysis (PCA)

  • Reinforcement Learning: Q-Learning, Deep Q-Networks (DQN)

  • Neural Networks: Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM)

Are these interview questions helpful?

Q155. What is cte? Uses of cte . Have you applied it before?

Ans.

CTE stands for Common Table Expressions. It is a temporary result set that can be referenced within a SELECT, INSERT, UPDATE, or DELETE statement.

  • CTEs are defined using the WITH keyword in SQL.

  • They help improve readability and maintainability of complex queries.

  • CTEs can be recursive, allowing for hierarchical data querying.

  • Examples: Recursive CTEs for querying organizational hierarchies, CTEs for data transformation before joining tables.

Q156. How to select consecutive data from a column

Ans.

Use SQL query with window functions like ROW_NUMBER() to select consecutive data from a column

  • Use ROW_NUMBER() to assign a unique number to each row based on a specific order

  • Use a self join or subquery to compare the row numbers of consecutive rows

  • Filter the rows where the row numbers are consecutive to select the consecutive data

Share interview questions and help millions of jobseekers 🌟

man-with-laptop

Q157. What are the technical skills that you use to analyse the data?

Ans.

I use technical skills such as programming languages, data visualization tools, statistical analysis, and database management to analyze data.

  • Programming languages (e.g. Python, R, SQL)

  • Data visualization tools (e.g. Tableau, Power BI)

  • Statistical analysis techniques (e.g. regression analysis, hypothesis testing)

  • Database management (e.g. SQL Server, MySQL)

Q158. Do you know SQL, what visualization do you prefer? Why that visualization tool? What are the areas of your expertise in visualization? Do you follow any other practice to ease up your work?

Ans.

Yes, I am proficient in SQL and my preferred visualization tool is Tableau. My expertise lies in creating interactive dashboards and reports.

  • Proficient in SQL and Tableau

  • Expertise in creating interactive dashboards and reports

  • Familiar with other visualization tools like Power BI and QlikView

  • Use best practices like using appropriate chart types and color schemes

  • Regularly update and maintain dashboards to ensure accuracy and relevance

Data Analyst Jobs

Data Analyst 0-5 years
S&P Global Inc.
4.2
Hyderabad / Secunderabad
Data Analyst, Field Oper Return & ReCommerce 5-7 years
Amazon India Software Dev Centre Pvt Ltd
4.1
Hyderabad / Secunderabad
Data Analyst, WW Returns & Recommerce 5-7 years
Amazon India Software Dev Centre Pvt Ltd
4.1
Hyderabad / Secunderabad

Q159. Why ai and machine learning is only marketing term and not a real technology

Ans.

AI and machine learning are real technologies with practical applications in various industries.

  • AI and machine learning have been successfully used in various industries such as healthcare, finance, and transportation to improve efficiency and accuracy.

  • Companies like Google, Amazon, and Facebook heavily rely on AI and machine learning algorithms to enhance their products and services.

  • AI and machine learning technologies have shown significant advancements in natural language ...read more

Q160. How to created flat tables using sql

Ans.

Flat tables can be created using SQL by selecting the desired columns from multiple tables and joining them based on common keys.

  • Identify the tables that need to be included in the flat table

  • Determine the common keys between the tables

  • Use the SELECT statement to choose the desired columns from each table

  • Join the tables using the common keys

  • Apply any necessary filtering or sorting

  • Save the result as a new table or view

Q161. If i'm availabe for three months and what day to day activity would look like.

Ans.

As a data analyst for three months, I would be responsible for analyzing data, creating reports, and presenting insights to stakeholders.

  • Analyze data to identify trends and patterns

  • Create reports and visualizations to communicate insights

  • Collaborate with stakeholders to understand their needs and provide recommendations

  • Use statistical tools and techniques to validate findings

  • Continuously monitor and improve data quality

  • Examples: analyzing sales data to identify top-performing...read more

Q162. Pull dataset from SQL to Power BI and perform the visualization

Ans.

Use Power BI to connect to SQL database, import dataset, and create visualizations

  • Connect Power BI to SQL database

  • Import dataset from SQL into Power BI

  • Create visualizations using the imported data

Q163. What is the difference between Union and Union all ?

Ans.

Union combines the results of two or more SELECT statements, while Union all includes all rows, including duplicates.

  • Union removes duplicate rows from the result set, while Union all includes all rows.

  • Union sorts the result set, while Union all does not.

  • Union is slower than Union all because it performs a distinct operation.

  • Example: SELECT column1 FROM table1 UNION SELECT column1 FROM table2;

  • Example: SELECT column1 FROM table1 UNION ALL SELECT column1 FROM table2;

Q164. How to separate portion before @ and after @ from mail using python?

Ans.

Use Python to separate portion before and after @ in an email address.

  • Use the split() method to separate the email address into two parts based on the @ symbol.

  • Access the parts using index 0 for the portion before @ and index 1 for the portion after @.

  • Example: email = 'example@email.com'; parts = email.split('@'); before = parts[0]; after = parts[1];

Q165. What is the difference between a merge query and an append query?

Ans.

Merge query combines data from two or more tables based on a common column, while append query adds data from one table to the end of another.

  • Merge query combines data from multiple tables based on a common column, creating a new result set.

  • Append query adds the rows from one table to the end of another table, increasing the total number of rows.

  • Merge query is used to combine related data from different sources, while append query is used to add new records to an existing tab...read more

Q166. Write a basic query explaining the order of execution?

Ans.

The order of execution in a query determines the sequence in which operations are performed.

  • 1. The order of execution in a query typically follows the sequence: FROM, WHERE, GROUP BY, HAVING, SELECT, ORDER BY.

  • 2. The FROM clause specifies the tables involved in the query.

  • 3. The WHERE clause filters the rows based on specified conditions.

  • 4. The GROUP BY clause groups the rows that have the same values into summary rows.

  • 5. The HAVING clause filters the groups based on specified ...read more

Q167. How do you prioritize your tasks in a data project?

Ans.

I prioritize tasks in a data project by assessing deadlines, importance, dependencies, and impact on overall project goals.

  • Identify deadlines for each task and prioritize based on urgency

  • Consider the importance of each task in relation to project goals

  • Take into account dependencies between tasks and prioritize accordingly

  • Assess the potential impact of completing each task on the overall project success

Q168. What is the difference between the DELETE and DROP commands in SQL?

Ans.

DELETE is used to remove rows from a table, while DROP is used to remove entire tables from a database.

  • DELETE is a DML command used to remove specific rows from a table based on a condition.

  • DROP is a DDL command used to remove entire tables, views, or databases from a database.

  • DELETE does not remove the table structure, only the data within it.

  • DROP removes the table structure along with all its data, and it cannot be rolled back.

  • Example: DELETE FROM table_name WHERE condition...read more

Q169. What is the difference between UNION and UNION ALL in SQL?

Ans.

UNION combines and removes duplicates, UNION ALL combines without removing duplicates.

  • UNION merges the result sets of two or more SELECT statements, removing duplicates.

  • UNION ALL merges the result sets of two or more SELECT statements, including duplicates.

  • UNION is slower than UNION ALL as it involves removing duplicates.

  • Example: SELECT column1 FROM table1 UNION SELECT column1 FROM table2;

  • Example: SELECT column1 FROM table1 UNION ALL SELECT column1 FROM table2;

Q170. What is the LIKE operator, and how is it used in SQL?

Ans.

The LIKE operator is used in SQL to search for a specified pattern in a column.

  • The LIKE operator is used with the WHERE clause to search for a specified pattern in a column.

  • It allows for the use of wildcards such as % (matches any sequence of characters) and _ (matches any single character).

  • For example, 'SELECT * FROM table_name WHERE column_name LIKE 'a%'' will return all rows where the column starts with 'a'.

Q171. Cross over studies how are they designed and its interpretation

Ans.

Cross over studies are designed to compare two or more treatments on the same group of subjects.

  • Subjects receive one treatment for a certain period of time, then switch to another treatment.

  • The order of treatments is randomized to avoid bias.

  • The washout period between treatments is important to eliminate carryover effects.

  • Data is analyzed using paired t-tests or ANOVA.

  • Interpretation of results should consider carryover effects and treatment order.

  • Example: comparing the effect...read more

Q172. In detail - oncology Cancer stages TNM staging system Cancer Grades Types of cancer treatment Types of cancer

Ans.

Overview of oncology including cancer stages, TNM staging system, cancer grades, types of cancer treatment, and types of cancer.

  • Cancer stages range from 0 to IV, with higher numbers indicating more advanced cancer

  • TNM staging system is used to describe the extent of cancer in a patient's body

  • Cancer grades range from 1 to 4, with higher numbers indicating more abnormal cells

  • Types of cancer treatment include surgery, chemotherapy, radiation therapy, immunotherapy, and targeted t...read more

Q173. what is impairment and explain the process.

Ans.

Impairment is the reduction in the value of an asset due to damage, obsolescence, or other factors.

  • Impairment is a decrease in the value of an asset.

  • It can be caused by physical damage, obsolescence, or changes in market conditions.

  • The impairment process involves assessing the asset's current value and comparing it to its original cost.

  • If the current value is lower, the asset is impaired and its value is adjusted accordingly.

  • Impairment can be temporary or permanent, and can a...read more

Q174. Do you have experience working in cloud environments?

Ans.

Yes, I have experience working in cloud environments.

  • I have worked with AWS, Azure, and Google Cloud Platform.

  • I have experience with cloud-based data storage and processing.

  • I have used cloud-based tools for data visualization and analysis.

  • I am familiar with cloud security and compliance measures.

Q175. How would you retrieve data over than 3 months in dynamic dashboard

Ans.

To retrieve data over 3 months in a dynamic dashboard, use a date range filter and ensure the data source is updated regularly.

  • Create a date range filter in the dashboard to select a time period of over 3 months

  • Ensure the data source is updated regularly to include the required data

  • Use SQL queries or data extraction tools to pull the necessary data for the dashboard

  • Consider automating the data retrieval process to ensure accuracy and efficiency

Q176. What is Dicreat Chart. What is the role of this Chart

Ans.

A Dicreat Chart is a type of chart that displays data points in a discrete manner, typically using bars or columns.

  • Dicreat Charts are used to represent categorical data, where each category is represented by a separate bar or column.

  • They are commonly used in market research, survey data analysis, and comparison of different categories.

  • Examples of Dicreat Charts include bar charts, column charts, and stacked bar charts.

Q177. write a query to retrive all the employee names who have joined in last 30 days

Ans.

Query to retrieve employee names who joined in last 30 days

  • Use the current date and subtract 30 days to get the date 30 days ago

  • Join the employee table with the date joined column to filter employees who joined in the last 30 days

Q178. Which databases have you worked on?

Ans.

I have worked on databases such as MySQL, SQL Server, and MongoDB.

  • MySQL

  • SQL Server

  • MongoDB

Q179. Do you have any technical certification how many programming language do you know

Ans.

Yes, I have technical certifications and I am proficient in multiple programming languages.

  • I have a certification in SQL from Oracle

  • I am proficient in Python, R, and Java

  • I have experience with data visualization tools like Tableau and Power BI

Q180. What is multicollinearity and what are its effects?

Ans.

Multicollinearity is a phenomenon where two or more independent variables in a regression model are highly correlated.

  • It can lead to unstable and unreliable estimates of regression coefficients.

  • It can make it difficult to determine the individual effect of each independent variable on the dependent variable.

  • It can also result in inflated standard errors and p-values, making it difficult to identify statistically significant variables.

  • It can be detected using methods such as c...read more

Q181. Explain a time when you had multiple tasks. How do manage them?

Ans.

I prioritize tasks based on deadlines and importance, utilizing to-do lists and time management techniques.

  • Prioritize tasks based on deadlines and importance

  • Utilize to-do lists to keep track of tasks

  • Use time management techniques such as the Pomodoro technique

  • Delegate tasks when possible to lighten the workload

Q182. How will you optimize ticket closing process

Ans.

I will streamline the process by identifying bottlenecks and implementing automation.

  • Analyze current process flow

  • Identify areas of delay or inefficiency

  • Implement automation tools to reduce manual effort

  • Set up clear communication channels between team members

  • Track progress and adjust as needed

Q183. what is Sql ,what is RDBMS, What are non sql tools, what is schema

Ans.

SQL is a programming language used to manage relational databases. RDBMS is a software used to manage databases. Non-SQL tools are alternatives to SQL. Schema is a blueprint of a database.

  • SQL is used to manage relational databases

  • RDBMS is a software used to manage databases

  • Non-SQL tools are alternatives to SQL, such as NoSQL databases

  • Schema is a blueprint of a database, defining its structure and relationships

Q184. Guestimates question: How many women's drive red car in the Bangalore

Ans.

It is impossible to accurately estimate the number of women driving red cars in Bangalore without specific data.

  • There is no specific data available on the number of women driving red cars in Bangalore.

  • Estimating the number of women driving red cars in Bangalore would require access to vehicle registration data or a survey of car owners.

  • Factors such as car ownership trends among women and the popularity of red cars would also impact the estimate.

Q185. What is the DAX function in Power BI, and how is it used?

Ans.

DAX (Data Analysis Expressions) is a formula language used in Power BI to create custom calculations and aggregations.

  • DAX functions are used to create calculated columns, measures, and calculated tables in Power BI.

  • DAX functions can perform various operations like mathematical calculations, logical comparisons, and text manipulation.

  • Examples of DAX functions include SUM, AVERAGE, CALCULATE, FILTER, and RELATED.

Q186. What is the difference between a measure and a calculated column in Power BI?

Ans.

A measure is a calculation based on the data model, while a calculated column is a new column created in the data model.

  • Measures are dynamic and calculated on the fly, while calculated columns are static and stored in the data model.

  • Measures are used for aggregations and calculations, while calculated columns are used for adding new data columns.

  • Measures are typically used in visualizations, while calculated columns are used for filtering and sorting.

  • Example: A measure could ...read more

Q187. Can you describe the experience with data analysis and its tools?

Ans.

I have extensive experience in data analysis using tools such as Excel, SQL, Python, and Tableau.

  • Proficient in Excel for data cleaning, manipulation, and visualization

  • Strong SQL skills for querying databases and extracting relevant information

  • Experience with Python for statistical analysis and machine learning

  • Familiarity with Tableau for creating interactive dashboards and reports

Q188. How do you determine which variable is important in predictive model ?

Ans.

Variables importance in predictive model is determined using techniques like feature selection, correlation analysis, and machine learning algorithms.

  • Use feature selection techniques like Recursive Feature Elimination (RFE) or SelectKBest to identify important variables.

  • Analyze correlation between variables and target variable to determine importance.

  • Utilize machine learning algorithms like Random Forest or Gradient Boosting to rank variables based on their impact on model pe...read more

Q189. What is your experience with API integration in Thoughtspot?

Ans.

I have extensive experience with API integration in Thoughtspot.

  • Developed custom API integrations to pull data from external sources into Thoughtspot

  • Utilized Thoughtspot's REST API to automate data loading and report generation

  • Worked closely with IT teams to troubleshoot and optimize API connections

Q190. In which situations what type of charts will be used

Ans.

Charts are used to represent data visually. Different types of charts are used based on the type of data and purpose.

  • Line charts for showing trends over time

  • Bar charts for comparing values

  • Pie charts for showing proportions

  • Scatter plots for showing relationships between variables

  • Heat maps for showing density or distribution

  • Histograms for showing frequency distribution

  • Box plots for showing distribution and outliers

  • Gantt charts for showing project timelines

Q191. Suppose you have hundred banks to update. How much time will it take?

Ans.

The time taken to update hundred banks will depend on various factors such as the complexity of updates, resources available, and efficiency of the process.

  • Time taken will depend on the size and complexity of updates needed for each bank.

  • Efficiency of the process and resources available will also impact the time taken.

  • For example, if each bank requires a simple update that can be done quickly, it may take less time compared to complex updates that require more time and resour...read more

Q192. What are the main points that you will look for to confirm any location?

Ans.

To confirm a location, I would look for consistency in multiple data points such as address, coordinates, landmarks, and geotags.

  • Verify the address provided matches with known landmarks or businesses in the area

  • Check the coordinates provided against mapping services like Google Maps

  • Look for geotags or location data associated with photos or social media posts

  • Confirm the location using street view images or satellite imagery

Q193. What visualizations you use for different forms of data and contexts?

Ans.

I use a variety of visualizations such as bar charts, line graphs, scatter plots, and heat maps depending on the type of data and context.

  • Bar charts are useful for comparing categories of data

  • Line graphs are great for showing trends over time

  • Scatter plots help in identifying relationships between variables

  • Heat maps are effective for visualizing large datasets and identifying patterns

Q194. What are the tokens in Postgress

Ans.

Tokens in Postgres are the smallest unit of input that can be processed by the parser.

  • Tokens are used to identify and categorize the different parts of a SQL statement.

  • Examples of tokens include keywords, identifiers, operators, and literals.

  • The parser uses tokens to create a parse tree, which is used to execute the SQL statement.

Q195. What is WASO, TSO and EPS?

Ans.

WASO is Wake After Sleep Onset, TSO is Total Sleep Time, and EPS is Earnings Per Share.

  • WASO is the amount of time spent awake after initially falling asleep.

  • TSO is the total amount of time spent sleeping, including both REM and non-REM sleep.

  • EPS is a financial metric that represents the portion of a company's profit allocated to each outstanding share of common stock.

  • WASO and TSO are commonly used in sleep studies, while EPS is used in financial analysis.

Q196. What methods are used to identify the peak and trough of a signal?

Ans.

Various methods such as visual inspection, mathematical algorithms, and statistical techniques are used to identify the peak and trough of a signal.

  • Visual inspection of the signal waveform to identify the highest point as the peak and the lowest point as the trough.

  • Mathematical algorithms like finding the derivative of the signal and locating the points where it equals zero to identify peaks and troughs.

  • Statistical techniques such as moving average or peak detection algorithm...read more

Q197. How many Customers has purchased same item on the Same day more than onnce

Ans.

To find customers who purchased the same item multiple times on the same day.

  • Identify unique customers who purchased the same item multiple times on the same day

  • Check for duplicate transactions by customer and item on the same day

  • Aggregate the data to count the number of customers who made multiple purchases of the same item on the same day

Q198. Can you explain what a Pivot Table is?

Ans.

A Pivot Table is a data summarization tool used in spreadsheet programs to analyze, summarize, and present data in a tabular format.

  • Pivot tables allow users to reorganize and summarize selected columns and rows of data to obtain desired insights.

  • Users can easily group and filter data, perform calculations, and create visualizations using pivot tables.

  • Pivot tables are commonly used in Excel and other spreadsheet programs for data analysis and reporting.

  • For example, a sales man...read more

Q199. How much flowers were to be accumulated at the end of the day if each person offered 5 flowers a day at the temple.

Ans.

The total number of flowers accumulated at the end of the day can be calculated by multiplying the number of people by the number of flowers each person offers.

  • Multiply the number of people by 5 to get the total number of flowers accumulated

  • For example, if there are 10 people, the total number of flowers accumulated would be 10 * 5 = 50

Q200. star and dimension schema and window function in SQL

Ans.

Star and dimension schema are used in data warehousing to organize data, while window functions in SQL are used for analytical queries.

  • Star schema is a type of schema where a central fact table is connected to multiple dimension tables.

  • Dimension schema is a type of schema where each dimension is represented by a separate table.

  • Window functions in SQL are used to perform calculations across a set of table rows related to the current row.

  • Examples of window functions include ROW...read more

Previous
1
2
3
4
5
6
7
Next
Interview Tips & Stories
Ace your next interview with expert advice and inspiring stories

Interview experiences of popular companies

3.7
 • 10.5k Interviews
3.8
 • 8.2k Interviews
3.6
 • 7.6k Interviews
3.7
 • 5.6k Interviews
3.7
 • 5.6k Interviews
3.7
 • 4.8k Interviews
3.8
 • 2.8k Interviews
3.7
 • 740 Interviews
4.1
 • 277 Interviews
3.4
 • 81 Interviews
View all

Calculate your in-hand salary

Confused about how your in-hand salary is calculated? Enter your annual salary (CTC) and get your in-hand salary

Recently Viewed
COMPANY BENEFITS
Ernst & Young
No Benefits
COMPANY BENEFITS
Ernst & Young
No Benefits
COMPANY BENEFITS
Ernst & Young
No Benefits
JOBS
Accenture
No Jobs
JOBS
Browse jobs
Discover jobs you love
JOBS
Browse jobs
Discover jobs you love
JOBS
Verve Mobile
No Jobs
INTERVIEWS
Genpact
No Interviews
JOBS
Swiss Re
No Jobs
DESIGNATION
Data Analyst Interview Questions
Share an Interview
Stay ahead in your career. Get AmbitionBox app
qr-code
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+

Reviews

4 L+

Interviews

4 Cr+

Salaries

1 Cr+

Users/Month

Contribute to help millions

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

Follow us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter