Senior Data Analyst

100+ Senior Data Analyst Interview Questions and Answers

Updated 16 Dec 2024

Q51. Diffrence between direct connection and live connection?

Ans.

Direct connection is a static connection to a data source, while live connection is a dynamic connection that updates in real-time.

  • Direct connection requires data to be imported into the analysis tool, while live connection accesses data directly from the source.

  • Live connection allows for real-time updates and analysis, while direct connection requires manual refreshing.

  • Direct connection is useful for small datasets or when data is not frequently updated, while live connectio...read more

Q52. How to find null values from a data using sql?

Ans.

To find null values from a data using SQL, use the IS NULL operator.

  • Use the IS NULL operator to check for null values in a specific column.

  • Combine the IS NULL operator with the WHERE clause to filter out null values.

  • Use the COUNT function to count the number of null values in a column.

  • Example: SELECT * FROM table_name WHERE column_name IS NULL;

  • Example: SELECT COUNT(column_name) FROM table_name WHERE column_name IS NULL;

Q53. What is incremental refresh in Power BI?

Ans.

Incremental refresh is a feature in Power BI that allows you to refresh only a portion of your data instead of the entire dataset.

  • Incremental refresh is useful for large datasets that take a long time to refresh.

  • It allows you to refresh only the new or updated data since the last refresh.

  • You can set up incremental refresh by defining a range of values for a column that represents the refresh window.

  • For example, you can set up incremental refresh to only refresh data from the ...read more

Q54. What are the ways to optimize Tableau dashboard?

Ans.

Optimizing Tableau dashboard involves improving performance, usability, and visual appeal.

  • Limit the number of worksheets on a dashboard to improve performance

  • Use filters and parameters effectively to allow for interactivity

  • Optimize data source connections for faster loading times

  • Utilize dashboard actions to create dynamic interactions

  • Design with user experience in mind to ensure clarity and ease of use

Are these interview questions helpful?

Q55. calc retained revived new customer per month basis what are joins in sql what is dense rank

Ans.

To calculate retained, revived, and new customers per month basis, joins in SQL and dense rank are used.

  • To calculate retained, revived, and new customers per month basis, you would typically use SQL queries to join different tables containing customer data.

  • Joins in SQL are used to combine rows from two or more tables based on a related column between them.

  • Dense rank is a function in SQL that assigns a rank to each row within a partition of a result set, with no gaps in the ra...read more

Q56. Write DAX to calculate some values.

Ans.

Calculating values using DAX in Power BI

  • Use SUMX function to calculate sum of values in a column

  • Use CALCULATE function to apply filters and conditions to calculations

  • Use DAX measures to create reusable calculations

  • Example: Calculate total sales for a specific product category

Share interview questions and help millions of jobseekers 🌟

man-with-laptop

Q57. difference in tree map & heat Map in tableau

Ans.

Tree map displays hierarchical data using nested rectangles, while heat map shows data values using color intensity.

  • Tree map visualizes hierarchical data with nested rectangles, where the size of each rectangle represents a measure.

  • Heat map displays data values using color intensity, with darker colors indicating higher values.

  • Tree map is useful for showing proportions within a whole, while heat map is effective for identifying patterns and trends in data.

  • Example: A tree map ...read more

Q58. What is purpose of google analytics

Ans.

Google Analytics is a web analytics service that tracks and reports website traffic.

  • Tracks website traffic and user behavior

  • Provides insights into user demographics and interests

  • Helps optimize website performance and marketing strategies

  • Offers customizable reports and dashboards

  • Integrates with other Google products like AdWords and Search Console

Senior Data Analyst Jobs

Senior Data Analyst 4-8 years
Maersk Global Service Centres India Pvt. Ltd.
4.3
Mumbai
Sr Data Analyst 3-9 years
Target Corporation India Pvt Ltd
4.2
Bangalore / Bengaluru
Sr Data Analyst 4-8 years
Target Corporation India Pvt Ltd
4.2
Bangalore / Bengaluru

Q59. Write sql code to get 2nd order for each customer for each day

Ans.

SQL code to retrieve 2nd order for each customer for each day

  • Use a subquery to rank orders for each customer by date

  • Filter the results to only include the 2nd order for each customer for each day

Q60. Define a metric that can tell the best performing genre on the platform

Ans.

One metric to determine the best performing genre on the platform is the average number of plays per song within each genre.

  • Calculate the average number of plays per song for each genre

  • Compare the averages to identify the genre with the highest average

  • Genres with higher average plays can be considered the best performing on the platform

Q61. Explain the difference between Arithmetic mean, Harmonic mean and Geometric mean?

Ans.

Arithmetic mean is the sum of all values divided by the number of values. Harmonic mean is the reciprocal of the arithmetic mean of the reciprocals of the values. Geometric mean is the nth root of the product of n values.

  • Arithmetic mean is most commonly used and is calculated by summing all values and dividing by the number of values.

  • Harmonic mean is used when dealing with rates or ratios, and is calculated by taking the reciprocal of the arithmetic mean of the reciprocals of...read more

Q62. Explain classification models I used and why

Ans.

I have used logistic regression and decision tree models for classification.

  • Logistic regression is a linear model used for binary classification.

  • Decision tree is a non-linear model used for multi-class classification.

  • Logistic regression is simple and easy to interpret while decision tree can handle non-linear relationships.

  • I chose these models based on the nature of the data and the problem at hand.

Q63. Select * from table where student_name isnull() ;

Ans.

This SQL query selects all rows from a table where the student_name column is null.

  • The query uses the SELECT statement to retrieve data from a table.

  • The * symbol means all columns will be selected.

  • The WHERE clause filters the results to only include rows where student_name is null.

Q64. How to handle large datasets.

Ans.

Handling large datasets involves optimizing storage, processing, and analysis techniques.

  • Use distributed computing frameworks like Hadoop or Spark to process data in parallel.

  • Utilize data compression techniques to reduce storage requirements.

  • Implement indexing and partitioning strategies to improve query performance.

  • Consider using cloud-based storage and computing resources for scalability.

  • Use sampling techniques to work with subsets of data for initial analysis.

Q65. How to use parameters.

Ans.

Parameters are variables that can be passed into a function or procedure to customize its behavior.

  • Parameters are defined when creating a function or procedure.

  • They can be used to pass values into the function or procedure.

  • Parameters can be required or optional, with default values specified.

  • They help make functions more flexible and reusable.

  • Example: def calculate_area(length, width):

Q66. Steps Involved In Any Data Analysis Project

Ans.

Steps involved in a data analysis project include defining objectives, collecting data, cleaning and organizing data, analyzing data, and presenting findings.

  • Define project objectives and goals

  • Collect relevant data sources

  • Clean and organize data to ensure accuracy

  • Perform data analysis using statistical methods and tools

  • Present findings and insights to stakeholders

Q67. What are the types of Filters in Power Bi?

Ans.

Types of Filters in Power Bi include visual-level filters, page-level filters, report-level filters, and drillthrough filters.

  • Visual-level filters apply only to a specific visual on a report page

  • Page-level filters apply to all visuals on a specific report page

  • Report-level filters apply to all visuals on all report pages

  • Drillthrough filters allow users to see details of a specific data point by drilling through to another report page

Q68. Explain feature engineering process in ML modelling

Ans.

Feature engineering is the process of selecting and transforming relevant features from raw data to improve model performance.

  • Identify relevant features based on domain knowledge and data exploration

  • Transform features to improve their quality and relevance

  • Create new features by combining or extracting information from existing features

  • Select the most important features using feature selection techniques

  • Iterate the process to improve model performance

Q69. What are analytical functions explain all

Ans.

Analytical functions are used in SQL to perform calculations across a set of rows related to the current row.

  • Analytical functions operate on a group of rows and return a single result for each row.

  • They can be used to calculate moving averages, running totals, rank, and percentiles.

  • Examples include ROW_NUMBER(), RANK(), DENSE_RANK(), LEAD(), LAG(), SUM(), AVG().

Q70. write a sql query to find total amount of order placed by each customer

Ans.

This query calculates the total amount of orders placed by each customer.

  • Use the GROUP BY clause to group the orders by customer.

  • Use the SUM() function to calculate the total amount of each group.

  • Join the orders table with the customers table to get the customer names.

Q71. DBMS key concepts and explation with examples of use case

Ans.

DBMS key concepts and examples of use cases

  • DBMS stands for Database Management System

  • Key concepts include data modeling, query optimization, transaction management

  • Use cases include online transaction processing (OLTP), data warehousing, customer relationship management (CRM)

  • Examples: MySQL for OLTP, Oracle for data warehousing, Salesforce for CRM

Q72. Create data model and visualizations.

Ans.

Creating data model and visualizations involves structuring data and presenting it visually.

  • Identify the key variables and relationships in the data

  • Design a data model that represents the data structure

  • Select appropriate visualization tools and techniques

  • Create visualizations such as charts, graphs, and dashboards

  • Ensure the visualizations effectively communicate insights from the data

Q73. Power BI optimization techniques.

Ans.

Power BI optimization techniques involve data model optimization, query optimization, and report design best practices.

  • Optimize data model by removing unnecessary columns and relationships

  • Use query folding to push data transformation tasks to the data source

  • Avoid using calculated columns and instead use measures for calculations

  • Limit the use of visuals that require high computational resources

  • Use aggregations and composite models for large datasets

Q74. What is the purpose of javascript

Ans.

JavaScript is a programming language used to create interactive web pages and web applications.

  • JavaScript is used to add interactivity to web pages by manipulating the Document Object Model (DOM).

  • It can be used for form validation, creating animations, and building web-based games.

  • JavaScript can also be used on the server-side with Node.js to build scalable web applications.

  • Popular JavaScript frameworks include React, Angular, and Vue.js.

Q75. Use case study implementation of logics

Ans.

Implementing logics in a use case study to analyze data and derive insights.

  • Identify the problem statement and objectives of the study

  • Collect relevant data and clean it for analysis

  • Apply statistical and machine learning techniques to derive insights

  • Interpret the results and communicate findings to stakeholders

Q76. Write SQL to print the user with the highest restaurant order. There are couple of joins to be used.

Ans.

Use SQL with joins to find user with highest restaurant order

  • Use JOIN to connect user table with order table

  • Group by user and sum the order amounts

  • Order by sum in descending order and limit to 1 result

Q77. Data blending Vs Joins in tableau

Ans.

Data blending combines data from different sources in Tableau, while joins combine data from the same source.

  • Data blending is used when data comes from different sources and cannot be joined directly.

  • Joins are used when data comes from the same source and can be combined based on a common key.

  • Data blending allows for more complex analysis by combining unrelated data sets.

  • Joins are more efficient for combining related data sets with a common key.

  • Example: Data blending can be u...read more

Q78. Significance Of Exploratory Data Analysis

Ans.

Exploratory Data Analysis is crucial for understanding the data, identifying patterns, and making informed decisions.

  • Helps in understanding the distribution of data

  • Identifies outliers and missing values

  • Detects patterns and relationships between variables

  • Assists in selecting appropriate statistical techniques

  • Visualizes data for better insights

  • Example: Using histograms to understand the distribution of customer ages in a dataset

Q79. What are sql aggregate functions?

Ans.

SQL aggregate functions perform calculations on a set of values and return a single value.

  • Aggregate functions include COUNT, SUM, AVG, MAX, and MIN.

  • COUNT returns the number of rows in a table or the number of non-null values in a column.

  • SUM returns the sum of all values in a column.

  • AVG returns the average of all values in a column.

  • MAX returns the highest value in a column.

  • MIN returns the lowest value in a column.

Q80. how sql query work behind the scenes

Ans.

SQL queries work by parsing and executing commands to retrieve data from databases.

  • SQL query is written in a specific syntax to communicate with databases.

  • The query is parsed by the database management system to understand the command.

  • The database engine then executes the query to retrieve the requested data.

  • Queries can involve selecting, updating, inserting, or deleting data from tables.

  • Example: SELECT * FROM table_name WHERE condition;

Q81. CREATE A KIP TO TRACK CUSTOMER SATISFACTION

Ans.

Create a KPI to track customer satisfaction.

  • Define the metrics to measure customer satisfaction, such as Net Promoter Score (NPS) or Customer Satisfaction Score (CSAT).

  • Set a target for the KPI based on industry benchmarks or company goals.

  • Collect feedback from customers through surveys, social media, or customer support interactions.

  • Analyze the data to identify trends and areas for improvement.

  • Regularly report on the KPI to stakeholders and take action to improve customer sat...read more

Q82. Optimization of Service center model design

Ans.

Optimizing service center model design involves analyzing data to identify areas for improvement and implementing changes to increase efficiency and customer satisfaction.

  • Analyze data on customer needs and service center performance

  • Identify areas for improvement, such as wait times or staffing levels

  • Implement changes to increase efficiency and customer satisfaction, such as adding self-service options or adjusting staffing schedules

  • Continuously monitor and evaluate the effect...read more

Q83. Tell us a time you lead a data analytics project

Q84. Cloud technologies and implications

Ans.

Cloud technologies offer scalability, flexibility, and cost savings for data storage and processing.

  • Cloud computing allows for on-demand access to computing resources

  • Data can be stored and processed in the cloud, reducing the need for on-premise infrastructure

  • Cloud technologies offer scalability, allowing for easy expansion or contraction of resources as needed

  • Cloud computing can also offer cost savings compared to traditional on-premise solutions

  • Examples of cloud technologie...read more

Q85. Difference between where and having

Ans.

WHERE is used to filter rows before grouping, HAVING is used to filter groups after grouping.

  • WHERE is used with SELECT statement to filter rows based on a condition

  • HAVING is used with GROUP BY statement to filter groups based on a condition

  • WHERE is applied before grouping, HAVING is applied after grouping

  • Example: SELECT * FROM table WHERE column = 'value'

  • Example: SELECT column, COUNT(*) FROM table GROUP BY column HAVING COUNT(*) > 1

Q86. Joint function, it's application and definition

Ans.

Joint function refers to the movement and stability of joints in the body.

  • Joint function involves the movement of bones at a joint.

  • It also includes the stability provided by ligaments and tendons.

  • Examples include the knee joint allowing bending and straightening of the leg.

  • Another example is the shoulder joint allowing rotation and lifting of the arm.

Q87. 1.Vlookup ? 2. Extract mail id from text entry ?

Ans.

1. Vlookup is a function in Excel used to search for a specific value in a table and return a corresponding value. 2. To extract email id from text entry, we can use a combination of functions like LEFT, RIGHT, FIND, and LEN.

  • Vlookup is used to find a value in a table and return a corresponding value from the same row.

  • Syntax: VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])

  • To extract email id from text entry, we can use LEFT function to find the position of '@...read more

Q88. What do you know about R square?

Ans.

R square is a statistical measure that represents the proportion of the variance for a dependent variable that's explained by an independent variable.

  • R square is also known as the coefficient of determination.

  • It ranges from 0 to 1, with 1 indicating a perfect fit.

  • It is used to evaluate the goodness of fit of a regression model.

  • Higher R square values indicate that the model explains a larger proportion of the variance in the dependent variable.

  • For example, an R square of 0.8 m...read more

Q89. What is the procedure execution of grap

Ans.

The procedure execution of grap involves defining the graph structure, adding nodes and edges, and performing various graph operations.

  • Define the graph structure by creating a graph object

  • Add nodes to the graph using the add_node() method

  • Add edges between nodes using the add_edge() method

  • Perform graph operations such as traversal, shortest path finding, and connectivity checking

Q90. what is sql and how do you rate yourself

Ans.

SQL is a programming language used for managing and manipulating relational databases.

  • SQL stands for Structured Query Language

  • It is used to communicate with databases to retrieve, insert, update, and delete data

  • Common SQL commands include SELECT, INSERT, UPDATE, DELETE

  • Knowledge of SQL is essential for data analysis and data manipulation tasks

Q91. what are strength and weaknesses

Ans.

Strengths include strong analytical skills and attention to detail. Weaknesses include lack of experience with certain software programs.

  • Strength: Strong analytical skills

  • Strength: Attention to detail

  • Weakness: Lack of experience with certain software programs

Q92. how to add file from sharepoint

Ans.

To add a file from SharePoint, you can use the SharePoint API or sync the SharePoint library with your local machine.

  • Access the SharePoint site and navigate to the library where you want to add the file.

  • Click on the 'Upload' button and select the file you want to upload.

  • Alternatively, you can sync the SharePoint library with your local machine using OneDrive or SharePoint Sync.

  • Once synced, you can simply drag and drop the file into the synced folder to upload it to SharePoint...read more

Q93. How to connect data sources in power bi

Ans.

Data sources in Power BI can be connected through various methods including importing data from files, databases, online services, and streaming datasets.

  • Import data from files such as Excel, CSV, XML, JSON, etc.

  • Connect to databases like SQL Server, MySQL, Oracle, etc.

  • Utilize online services connectors for platforms like Google Analytics, Salesforce, Facebook, etc.

  • Set up streaming datasets for real-time data analysis.

Q94. Use of SELECTEDVALUE() fax function?

Ans.

SELECTEDVALUE() is a DAX function used to return the value of a column if it has only one distinct value.

  • Used in Power BI to retrieve the value of a column if it has only one distinct value

  • Returns an error if the column has more than one distinct value

  • Can be used in combination with IF statement to handle errors

  • Example: SELECTEDVALUE('Table'[Column], 'No value found')

  • Example: IF(ISBLANK(SELECTEDVALUE('Table'[Column])), 'No value found', SELECTEDVALUE('Table'[Column]))

Q95. Difference between having and where

Ans.

The main difference between 'having' and 'where' in SQL is that 'having' is used with aggregate functions, while 'where' is used with individual rows.

  • HAVING is used to filter groups of rows based on aggregate functions, such as COUNT, SUM, AVG, etc.

  • WHERE is used to filter individual rows based on specific conditions.

  • HAVING is applied after the GROUP BY clause, while WHERE is applied before.

  • Example: SELECT department, AVG(salary) FROM employees GROUP BY department HAVING AVG(s...read more

Q96. What is Interest coverage ratio

Ans.

Interest coverage ratio measures a company's ability to pay interest on its debt with its earnings.

  • Interest coverage ratio is calculated by dividing a company's earnings before interest and taxes (EBIT) by its interest expenses.

  • A higher interest coverage ratio indicates that a company is more capable of meeting its interest obligations.

  • A lower interest coverage ratio may signal financial distress and an increased risk of default.

  • For example, if a company has EBIT of $1,000,00...read more

Q97. Lod in tableau , Row level security

Ans.

Row level security in Tableau allows users to restrict access to specific rows of data based on user roles or permissions.

  • Row level security in Tableau is implemented by creating user filters based on user roles or permissions.

  • Users can only see data that they have been granted access to, based on the row level security settings.

  • Row level security can be set up using Tableau Server or Tableau Online.

  • Example: A sales manager can only see sales data for their region, while a ma...read more

Q98. find the second largest element in a python array

Ans.

Find the second largest element in a Python array

  • Sort the array in descending order

  • Return the element at index 1, which is the second largest element

Q99. difference between random forest and decision tree

Ans.

Random forest is an ensemble learning method using multiple decision trees to improve accuracy and reduce overfitting.

  • Random forest uses multiple decision trees to make predictions.

  • Decision tree is a single tree structure used for classification and regression.

  • Random forest reduces overfitting by averaging the predictions of multiple trees.

  • Decision tree can be prone to overfitting if not pruned properly.

  • Random forest is more robust and accurate compared to a single decision t...read more

Q100. 2. How to print a graps.

Ans.

To print a graph, you can use data visualization libraries like Matplotlib or Seaborn in Python.

  • Use Matplotlib library in Python to create and display graphs.

  • Import matplotlib.pyplot module to access plotting functions.

  • Use functions like plt.plot() or plt.scatter() to create different types of graphs.

  • Finally, use plt.show() to display the graph on the screen.

Previous
1
2
3
4
Next
Interview Tips & Stories
Ace your next interview with expert advice and inspiring stories

Interview experiences of popular companies

3.7
 • 10k Interviews
3.9
 • 7.8k Interviews
3.8
 • 5.4k Interviews
3.8
 • 4.6k Interviews
3.8
 • 703 Interviews
3.4
 • 485 Interviews
3.9
 • 390 Interviews
4.0
 • 237 Interviews
3.6
 • 208 Interviews
4.0
 • 124 Interviews
View all

Calculate your in-hand salary

Confused about how your in-hand salary is calculated? Enter your annual salary (CTC) and get your in-hand salary

Senior Data Analyst Interview Questions
Share an Interview
Stay ahead in your career. Get AmbitionBox app
qr-code
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+

Reviews

4 L+

Interviews

4 Cr+

Salaries

1 Cr+

Users/Month

Contribute to help millions
Get AmbitionBox app

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

Follow us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter