Senior Data Analyst
100+ Senior Data Analyst Interview Questions and Answers
Q51. Difference between direct connection and live connection?
Direct connection is a static connection to a data source, while live connection is a dynamic connection that updates in real-time.
Direct connection requires data to be imported into the analysis tool, while live connection accesses data directly from the source.
Live connection allows for real-time updates and analysis, while direct connection requires manual refreshing.
Direct connection is useful for small datasets or when data is not frequently updated, while live connectio...
Q52. How to find null values in data using SQL?
To find null values in data using SQL, use the IS NULL operator.
Use the IS NULL operator to check for null values in a specific column.
Combine the IS NULL operator with the WHERE clause to return only the rows that contain null values.
Use the COUNT function to count the number of null values in a column.
Example: SELECT * FROM table_name WHERE column_name IS NULL;
Example: SELECT COUNT(*) FROM table_name WHERE column_name IS NULL; (COUNT(column_name) skips nulls, so it would return 0 here)
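A minimal sketch of the IS NULL filter and a null count, run through Python's built-in sqlite3 module. The students table, its columns, and the sample rows are hypothetical and exist only for illustration.

```python
import sqlite3

# In-memory database with a hypothetical table; all names are illustrative.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (id INTEGER, student_name TEXT)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?)",
    [(1, "Asha"), (2, None), (3, "Ravi"), (4, None)],
)

# Rows where the column is NULL.
null_rows = conn.execute(
    "SELECT id FROM students WHERE student_name IS NULL ORDER BY id"
).fetchall()

# COUNT(*) counts rows; COUNT(student_name) ignores NULLs and would give 0.
null_count = conn.execute(
    "SELECT COUNT(*) FROM students WHERE student_name IS NULL"
).fetchone()[0]

print(null_rows, null_count)
```

Note that a comparison such as student_name = NULL never matches, because NULL compared with anything yields NULL rather than true.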
Q53. What is incremental refresh in Power BI?
Incremental refresh is a feature in Power BI that allows you to refresh only a portion of your data instead of the entire dataset.
Incremental refresh is useful for large datasets that take a long time to refresh.
It allows you to refresh only the new or updated data since the last refresh.
You can set up incremental refresh by defining a range of values for a column that represents the refresh window.
For example, you can set up incremental refresh to only refresh data from the ...
Q54. What are the ways to optimize Tableau dashboard?
Optimizing Tableau dashboard involves improving performance, usability, and visual appeal.
Limit the number of worksheets on a dashboard to improve performance
Use filters and parameters effectively to allow for interactivity
Optimize data source connections for faster loading times
Utilize dashboard actions to create dynamic interactions
Design with user experience in mind to ensure clarity and ease of use
Q55. Calculate retained, revived, and new customers on a monthly basis. What are joins in SQL? What is dense rank?
Retained, revived, and new customers per month are typically calculated with SQL joins and ranking functions such as DENSE_RANK.
To calculate retained, revived, and new customers per month, you would typically join each month's customer activity to activity from earlier months and classify each customer by whether, and how recently, they were active before.
Joins in SQL are used to combine rows from two or more tables based on a related column between them.
Dense rank is a function in SQL that assigns a rank to each row within a partition of a result set, with no gaps in the ra...
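The classification logic can be sketched in plain Python before writing it as SQL. The definitions are assumptions, since the question does not define them: "new" means first month ever seen, "retained" means also active in the previous month, and "revived" means active at some earlier point but not in the previous month. The sample records are made up, and the sketch assumes every calendar month appears in the data.

```python
from collections import defaultdict

# Hypothetical (customer_id, "YYYY-MM") activity records.
orders = [
    ("a", "2024-01"), ("b", "2024-01"),
    ("a", "2024-02"), ("c", "2024-02"),
    ("b", "2024-03"), ("c", "2024-03"),
]


def classify_customers(records):
    """Label each month's active customers as new, retained, or revived."""
    active = defaultdict(set)
    for customer, month in records:
        active[month].add(customer)

    result = {}
    seen = set()      # customers active in any earlier month
    previous = set()  # customers active in the preceding listed month
    for month in sorted(active):
        current = active[month]
        result[month] = {
            "new": sorted(current - seen),
            "retained": sorted(current & previous),
            "revived": sorted((current & seen) - previous),
        }
        seen |= current
        previous = current
    return result


monthly = classify_customers(orders)
for month, groups in monthly.items():
    print(month, groups)
```

In SQL the same idea is usually expressed by self-joining a monthly activity table to the previous month, or by comparing each row with LAG() over the customer's months.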
Q56. Write DAX to calculate some values.
Calculating values using DAX in Power BI
Use the SUMX function to evaluate an expression for each row of a table and sum the results (use SUM to total a single column)
Use CALCULATE function to apply filters and conditions to calculations
Use DAX measures to create reusable calculations
Example: Calculate total sales for a specific product category
Q57. Difference between a tree map and a heat map in Tableau
Tree map displays hierarchical data using nested rectangles, while heat map shows data values using color intensity.
Tree map visualizes hierarchical data with nested rectangles, where the size of each rectangle represents a measure.
Heat map displays data values using color intensity, with darker colors indicating higher values.
Tree map is useful for showing proportions within a whole, while heat map is effective for identifying patterns and trends in data.
Example: A tree map ...
Q58. What is the purpose of Google Analytics?
Google Analytics is a web analytics service that tracks and reports website traffic.
Tracks website traffic and user behavior
Provides insights into user demographics and interests
Helps optimize website performance and marketing strategies
Offers customizable reports and dashboards
Integrates with other Google products like Google Ads (formerly AdWords) and Search Console
Q59. Write SQL code to get the 2nd order for each customer for each day
SQL code to retrieve the 2nd order for each customer for each day
Use a subquery with a window function such as ROW_NUMBER() to number each customer's orders within each day by order time
Filter the results to only include the row numbered 2 for each customer for each day
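A minimal sketch of that approach, run through sqlite3 so it is self-contained. The orders table, its columns, and the rows are hypothetical; window functions need SQLite 3.25 or later, which recent Python builds bundle.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: one row per order with a timestamp.
conn.execute(
    "CREATE TABLE orders (order_id INTEGER, customer_id INTEGER, ordered_at TEXT)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, 10, "2024-05-01 09:00:00"),
        (2, 10, "2024-05-01 12:30:00"),
        (3, 10, "2024-05-01 18:45:00"),
        (4, 11, "2024-05-01 10:15:00"),
        (5, 10, "2024-05-02 08:00:00"),
        (6, 10, "2024-05-02 20:00:00"),
    ],
)

# Number each customer's orders within a day, then keep the second one.
query = """
SELECT order_id, customer_id, order_day
FROM (
    SELECT order_id,
           customer_id,
           DATE(ordered_at) AS order_day,
           ROW_NUMBER() OVER (
               PARTITION BY customer_id, DATE(ordered_at)
               ORDER BY ordered_at
           ) AS rn
    FROM orders
)
WHERE rn = 2
ORDER BY customer_id, order_day
"""
second_orders = conn.execute(query).fetchall()
print(second_orders)
```

Customer 11 has only one order on that day, so it does not appear in the result. ROW_NUMBER() is used rather than RANK() so that two orders with the same timestamp still receive distinct positions.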
Q60. Define a metric that can tell the best performing genre on the platform
One metric to determine the best performing genre on the platform is the average number of plays per song within each genre.
Calculate the average number of plays per song for each genre
Compare the averages to identify the genre with the highest average
Genres with higher average plays can be considered the best performing on the platform
Q61. Explain the difference between Arithmetic mean, Harmonic mean and Geometric mean?
Arithmetic mean is the sum of all values divided by the number of values. Harmonic mean is the reciprocal of the arithmetic mean of the reciprocals of the values. Geometric mean is the nth root of the product of n values.
Arithmetic mean is most commonly used and is calculated by summing all values and dividing by the number of values.
Harmonic mean is used when dealing with rates or ratios, and is calculated by taking the reciprocal of the arithmetic mean of the reciprocals of...
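The three means can be compared on a small made-up sample with Python's statistics module (geometric_mean requires Python 3.8 or later). The values list is an assumption chosen so the results are easy to check by hand.

```python
import statistics

values = [2.0, 4.0, 8.0]  # illustrative sample, all positive

# Sum divided by count: (2 + 4 + 8) / 3
arithmetic = statistics.mean(values)

# nth root of the product: (2 * 4 * 8) ** (1/3)
geometric = statistics.geometric_mean(values)

# Reciprocal of the mean of reciprocals: 3 / (1/2 + 1/4 + 1/8)
harmonic = statistics.harmonic_mean(values)

print(arithmetic, geometric, harmonic)
```

For positive values that are not all equal, the arithmetic mean is the largest of the three and the harmonic mean the smallest.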
Q62. Explain classification models I used and why
I have used logistic regression and decision tree models for classification.
Logistic regression is a linear model commonly used for binary classification.
A decision tree is a non-linear model that handles both binary and multi-class classification.
Logistic regression is simple and easy to interpret, while a decision tree can capture non-linear relationships.
I chose these models based on the nature of the data and the problem at hand.
Q63. Select * from table where student_name isnull() ;
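This SQL query is intended to select all rows from a table where the student_name column is null; as written, isnull() is not standard SQL, and the portable form is WHERE student_name IS NULL.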
The query uses the SELECT statement to retrieve data from a table.
The * symbol means all columns will be selected.
The WHERE clause filters the results to only include rows where student_name is null.
Q64. How to handle large datasets.
Handling large datasets involves optimizing storage, processing, and analysis techniques.
Use distributed computing frameworks like Hadoop or Spark to process data in parallel.
Utilize data compression techniques to reduce storage requirements.
Implement indexing and partitioning strategies to improve query performance.
Consider using cloud-based storage and computing resources for scalability.
Use sampling techniques to work with subsets of data for initial analysis.
Q65. How to use parameters.
Parameters are variables that can be passed into a function or procedure to customize its behavior.
Parameters are defined when creating a function or procedure.
They can be used to pass values into the function or procedure.
Parameters can be required or optional, with default values specified.
They help make functions more flexible and reusable.
Example: def calculate_area(length, width):
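A completed version of the example above, as a small sketch. The function body and the default value for width are assumptions added for illustration; the original shows only the signature.

```python
def calculate_area(length, width=1.0):
    """Return the area of a rectangle; width is optional and defaults to 1.0."""
    return length * width


area_both = calculate_area(3, 4)                     # both parameters passed
area_default = calculate_area(5)                     # width uses its default
area_keyword = calculate_area(length=2, width=2.5)   # passed by keyword

print(area_both, area_default, area_keyword)
```

Giving width a default makes it optional, which is how a single function covers both the required and the optional case described above.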
Q66. Steps Involved In Any Data Analysis Project
Steps involved in a data analysis project include defining objectives, collecting data, cleaning and organizing data, analyzing data, and presenting findings.
Define project objectives and goals
Collect relevant data sources
Clean and organize data to ensure accuracy
Perform data analysis using statistical methods and tools
Present findings and insights to stakeholders
Q67. What are the types of Filters in Power BI?
Types of Filters in Power BI include visual-level filters, page-level filters, report-level filters, and drillthrough filters.
Visual-level filters apply only to a specific visual on a report page
Page-level filters apply to all visuals on a specific report page
Report-level filters apply to all visuals on all report pages
Drillthrough filters allow users to see details of a specific data point by drilling through to another report page
Q68. Explain feature engineering process in ML modelling
Feature engineering is the process of selecting and transforming relevant features from raw data to improve model performance.
Identify relevant features based on domain knowledge and data exploration
Transform features to improve their quality and relevance
Create new features by combining or extracting information from existing features
Select the most important features using feature selection techniques
Iterate the process to improve model performance
Q69. What are analytical functions? Explain all.
Analytical functions are used in SQL to perform calculations across a set of rows related to the current row.
Analytical functions operate on a group of rows and return a single result for each row.
They can be used to calculate moving averages, running totals, rank, and percentiles.
Examples include ROW_NUMBER(), RANK(), DENSE_RANK(), LEAD(), LAG(), SUM(), AVG().
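A short sketch of a few of these functions side by side, run through sqlite3 (window functions need SQLite 3.25 or later). The sales table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (day TEXT, amount INTEGER)")  # hypothetical table
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [
        ("2024-01-01", 100),
        ("2024-01-02", 50),
        ("2024-01-03", 100),
        ("2024-01-04", 70),
    ],
)

# Each window function returns one value per input row; no rows are collapsed.
query = """
SELECT day,
       amount,
       SUM(amount) OVER (ORDER BY day)       AS running_total,
       LAG(amount) OVER (ORDER BY day)       AS previous_amount,
       RANK() OVER (ORDER BY amount DESC)    AS rnk,
       DENSE_RANK() OVER (ORDER BY amount DESC) AS dense_rnk
FROM sales
ORDER BY day
"""
window_rows = conn.execute(query).fetchall()
for row in window_rows:
    print(row)
```

The two rows with amount 100 tie for rank 1. RANK() then skips to 3 for the next amount, while DENSE_RANK() continues with 2, which is the "no gaps" behaviour.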
Q70. Write a SQL query to find the total amount of orders placed by each customer
This query calculates the total amount of orders placed by each customer.
Use the GROUP BY clause to group the orders by customer.
Use the SUM() function to calculate the total amount of each group.
Join the orders table with the customers table to get the customer names.
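One way the query could look, sketched with sqlite3. The customers and orders tables, their columns, and the data are assumptions. A LEFT JOIN is used here so that customers without orders still appear with a total of 0; an inner join would drop them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema for illustration.
conn.execute("CREATE TABLE customers (customer_id INTEGER, name TEXT)")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER, customer_id INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "Asha"), (2, "Ravi"), (3, "Meera")],
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, 1, 250.0), (2, 1, 100.0), (3, 2, 75.5)],
)

# Group by customer and sum the order amounts; COALESCE turns NULL into 0.
query = """
SELECT c.name, COALESCE(SUM(o.amount), 0) AS total_amount
FROM customers AS c
LEFT JOIN orders AS o ON o.customer_id = c.customer_id
GROUP BY c.customer_id, c.name
ORDER BY total_amount DESC, c.name
"""
totals = conn.execute(query).fetchall()
print(totals)
```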
Q71. DBMS key concepts and explanation with examples of use cases
DBMS key concepts and examples of use cases
DBMS stands for Database Management System
Key concepts include data modeling, query optimization, transaction management
Use cases include online transaction processing (OLTP), data warehousing, customer relationship management (CRM)
Examples: MySQL for OLTP, Oracle for data warehousing, Salesforce for CRM
Q72. Create data model and visualizations.
Creating data model and visualizations involves structuring data and presenting it visually.
Identify the key variables and relationships in the data
Design a data model that represents the data structure
Select appropriate visualization tools and techniques
Create visualizations such as charts, graphs, and dashboards
Ensure the visualizations effectively communicate insights from the data
Q73. Power BI optimization techniques.
Power BI optimization techniques involve data model optimization, query optimization, and report design best practices.
Optimize data model by removing unnecessary columns and relationships
Use query folding to push data transformation tasks to the data source
Avoid using calculated columns and instead use measures for calculations
Limit the use of visuals that require high computational resources
Use aggregations and composite models for large datasets
Q74. What is the purpose of JavaScript?
JavaScript is a programming language used to create interactive web pages and web applications.
JavaScript is used to add interactivity to web pages by manipulating the Document Object Model (DOM).
It can be used for form validation, creating animations, and building web-based games.
JavaScript can also be used on the server-side with Node.js to build scalable web applications.
Popular JavaScript frameworks and libraries include React, Angular, and Vue.js.
Q75. Use case study implementation of logics
Implementing logics in a use case study to analyze data and derive insights.
Identify the problem statement and objectives of the study
Collect relevant data and clean it for analysis
Apply statistical and machine learning techniques to derive insights
Interpret the results and communicate findings to stakeholders
Q76. Write SQL to print the user with the highest restaurant order. There are a couple of joins to be used.
Use SQL with joins to find user with highest restaurant order
Use JOIN to connect user table with order table
Group by user and sum the order amounts
Order by sum in descending order and limit to 1 result
Q77. Data blending vs. joins in Tableau
Data blending queries each data source separately and combines the aggregated results in the view, while joins combine tables at the row level before aggregation.
Data blending is used when data comes from different sources and cannot be joined directly, or when the sources are at different levels of detail.
Joins combine tables row by row on a common key; the tables usually come from the same source, although Tableau also supports cross-database joins.
Data blending allows for more complex analysis by combining unrelated data sets.
Joins are more efficient for combining related data sets with a common key.
Example: Data blending can be u...
Q78. Significance Of Exploratory Data Analysis
Exploratory Data Analysis is crucial for understanding the data, identifying patterns, and making informed decisions.
Helps in understanding the distribution of data
Identifies outliers and missing values
Detects patterns and relationships between variables
Assists in selecting appropriate statistical techniques
Visualizes data for better insights
Example: Using histograms to understand the distribution of customer ages in a dataset
Q79. What are sql aggregate functions?
SQL aggregate functions perform calculations on a set of values and return a single value.
Aggregate functions include COUNT, SUM, AVG, MAX, and MIN.
COUNT returns the number of rows in a table or the number of non-null values in a column.
SUM returns the sum of all values in a column.
AVG returns the average of all values in a column.
MAX returns the highest value in a column.
MIN returns the lowest value in a column.
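A quick sketch of the five aggregates on a column that contains a null, run through sqlite3. The scores table and its rows are hypothetical. It shows the difference between COUNT(*) and COUNT(column), and that SUM, AVG, MAX, and MIN skip nulls.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (student TEXT, score INTEGER)")  # hypothetical
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("a", 80), ("b", None), ("c", 100), ("d", 60)],
)

# All aggregates collapse the four rows into a single result row.
aggregates = conn.execute(
    """
    SELECT COUNT(*),      -- all rows, including the NULL score
           COUNT(score),  -- only non-null scores
           SUM(score),
           AVG(score),    -- averaged over non-null scores only
           MAX(score),
           MIN(score)
    FROM scores
    """
).fetchone()
print(aggregates)
```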
Q80. How does a SQL query work behind the scenes?
SQL queries work by parsing and executing commands to retrieve data from databases.
SQL query is written in a specific syntax to communicate with databases.
The query is parsed and validated by the database management system to understand the command.
The query optimizer then chooses an execution plan, and the database engine executes it to retrieve the requested data.
Queries can involve selecting, updating, inserting, or deleting data from tables.
Example: SELECT * FROM table_name WHERE condition;
Q81. Create a KPI to track customer satisfaction
Create a KPI to track customer satisfaction.
Define the metrics to measure customer satisfaction, such as Net Promoter Score (NPS) or Customer Satisfaction Score (CSAT).
Set a target for the KPI based on industry benchmarks or company goals.
Collect feedback from customers through surveys, social media, or customer support interactions.
Analyze the data to identify trends and areas for improvement.
Regularly report on the KPI to stakeholders and take action to improve customer sat...
Q82. Optimization of Service center model design
Optimizing service center model design involves analyzing data to identify areas for improvement and implementing changes to increase efficiency and customer satisfaction.
Analyze data on customer needs and service center performance
Identify areas for improvement, such as wait times or staffing levels
Implement changes to increase efficiency and customer satisfaction, such as adding self-service options or adjusting staffing schedules
Continuously monitor and evaluate the effect...
Q83. Tell us about a time you led a data analytics project
Q84. Cloud technologies and implications
Cloud technologies offer scalability, flexibility, and cost savings for data storage and processing.
Cloud computing allows for on-demand access to computing resources
Data can be stored and processed in the cloud, reducing the need for on-premise infrastructure
Cloud technologies offer scalability, allowing for easy expansion or contraction of resources as needed
Cloud computing can also offer cost savings compared to traditional on-premise solutions
Examples of cloud technologie...
Q85. Difference between where and having
WHERE is used to filter rows before grouping, HAVING is used to filter groups after grouping.
WHERE is used with SELECT statement to filter rows based on a condition
HAVING is used with GROUP BY statement to filter groups based on a condition
WHERE is applied before grouping, HAVING is applied after grouping
Example: SELECT * FROM table WHERE column = 'value'
Example: SELECT column, COUNT(*) FROM table GROUP BY column HAVING COUNT(*) > 1
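Both clauses can appear in the same query, which makes the order of evaluation visible. This is a minimal sketch with sqlite3; the employees table and its rows are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employees (name TEXT, department TEXT, salary INTEGER)"
)  # hypothetical table
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [
        ("A", "Sales", 50000),
        ("B", "Sales", 60000),
        ("C", "IT", 90000),
        ("D", "IT", 40000),
        ("E", "HR", 30000),
    ],
)

# WHERE drops individual rows first (E is removed before any grouping);
# HAVING then drops whole groups based on the aggregate (Sales averages 55000).
query = """
SELECT department, AVG(salary) AS avg_salary
FROM employees
WHERE salary > 35000
GROUP BY department
HAVING AVG(salary) > 60000
ORDER BY department
"""
high_paying = conn.execute(query).fetchall()
print(high_paying)
```

An aggregate such as AVG(salary) cannot be used in WHERE, because the groups do not exist yet at that stage.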
Q86. Joint function, its application and definition
Joint function refers to the movement and stability of joints in the body.
Joint function involves the movement of bones at a joint.
It also includes the stability provided by ligaments and tendons.
Examples include the knee joint allowing bending and straightening of the leg.
Another example is the shoulder joint allowing rotation and lifting of the arm.
Q87. 1. VLOOKUP? 2. Extract mail ID from text entry?
1. VLOOKUP is a function in Excel used to search for a specific value in a table and return a corresponding value. 2. To extract an email ID from a text entry, we can use a combination of functions like LEFT, RIGHT, FIND, and LEN.
VLOOKUP is used to find a value in the first column of a table and return a corresponding value from the same row.
Syntax: VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])
To extract email id from text entry, we can use LEFT function to find the position of '@...
Q88. What do you know about R square?
R square is a statistical measure that represents the proportion of the variance in a dependent variable that is explained by the independent variable(s) in a regression model.
R square is also known as the coefficient of determination.
It ranges from 0 to 1, with 1 indicating a perfect fit.
It is used to evaluate the goodness of fit of a regression model.
Higher R square values indicate that the model explains a larger proportion of the variance in the dependent variable.
For example, an R square of 0.8 m...
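R square can be computed directly from its usual definition, 1 minus the ratio of the residual sum of squares to the total sum of squares. The sketch below uses made-up actual and predicted values; the function name r_squared is an assumption, not a library call.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    # Unexplained variation: squared errors of the model's predictions.
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # Total variation: squared deviations from the mean of the actual values.
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


actual = [1.0, 2.0, 3.0, 4.0]      # illustrative observations
predicted = [1.5, 2.0, 3.0, 3.5]   # illustrative model output

score = r_squared(actual, predicted)
perfect = r_squared(actual, actual)
print(score, perfect)
```

Here SS_res is 0.5 and SS_tot is 5.0, so the model accounts for about 90% of the variance in the actual values.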
Q89. What is the procedure execution of a graph
The procedure for working with a graph involves defining the graph structure, adding nodes and edges, and performing various graph operations.
Define the graph structure by creating a graph object
Add nodes to the graph using the add_node() method
Add edges between nodes using the add_edge() method
Perform graph operations such as traversal, shortest path finding, and connectivity checking
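The steps above can be sketched with a small hand-rolled class. The Graph class, its method names, and the sample nodes are hypothetical and written for this example; graph libraries such as NetworkX expose similarly named add_node and add_edge methods, but this code does not depend on one.

```python
from collections import deque


class Graph:
    """Minimal undirected graph stored as an adjacency dictionary."""

    def __init__(self):
        self.adjacency = {}

    def add_node(self, node):
        self.adjacency.setdefault(node, set())

    def add_edge(self, u, v):
        self.add_node(u)
        self.add_node(v)
        self.adjacency[u].add(v)
        self.adjacency[v].add(u)

    def shortest_path(self, start, goal):
        """Breadth-first search; returns a fewest-edges path or None."""
        if start not in self.adjacency or goal not in self.adjacency:
            return None
        parents = {start: None}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            if node == goal:
                path = []
                while node is not None:
                    path.append(node)
                    node = parents[node]
                return path[::-1]
            for neighbor in sorted(self.adjacency[node]):
                if neighbor not in parents:
                    parents[neighbor] = node
                    queue.append(neighbor)
        return None  # goal is not connected to start


graph = Graph()
for u, v in [("A", "B"), ("B", "C"), ("C", "D"), ("A", "E"), ("E", "D")]:
    graph.add_edge(u, v)
graph.add_node("Z")  # isolated node, unreachable from the rest

path = graph.shortest_path("A", "D")
unreachable = graph.shortest_path("A", "Z")
print(path, unreachable)
```

The same search doubles as a connectivity check: a result of None means the two nodes are not connected.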
Q90. What is SQL and how do you rate yourself?
SQL is a programming language used for managing and manipulating relational databases.
SQL stands for Structured Query Language
It is used to communicate with databases to retrieve, insert, update, and delete data
Common SQL commands include SELECT, INSERT, UPDATE, DELETE
Knowledge of SQL is essential for data analysis and data manipulation tasks
Q91. What are your strengths and weaknesses?
Strengths include strong analytical skills and attention to detail. Weaknesses include lack of experience with certain software programs.
Strength: Strong analytical skills
Strength: Attention to detail
Weakness: Lack of experience with certain software programs
Q92. How to add a file from SharePoint
To add a file from SharePoint, you can use the SharePoint API or sync the SharePoint library with your local machine.
Access the SharePoint site and navigate to the library where you want to add the file.
Click on the 'Upload' button and select the file you want to upload.
Alternatively, you can sync the SharePoint library with your local machine using OneDrive or SharePoint Sync.
Once synced, you can simply drag and drop the file into the synced folder to upload it to SharePoint...
Q93. How to connect data sources in Power BI
Data sources in Power BI can be connected through various methods including importing data from files, databases, online services, and streaming datasets.
Import data from files such as Excel, CSV, XML, JSON, etc.
Connect to databases like SQL Server, MySQL, Oracle, etc.
Utilize online services connectors for platforms like Google Analytics, Salesforce, Facebook, etc.
Set up streaming datasets for real-time data analysis.
Q94. Use of SELECTEDVALUE() DAX function?
SELECTEDVALUE() is a DAX function used to return the value of a column if it has only one distinct value.
Used in Power BI to retrieve the value of a column if it has only one distinct value
Returns the alternate result, or BLANK if none is supplied, when the column has more than one distinct value
Can be used in combination with an IF statement to handle the blank case
Example: SELECTEDVALUE('Table'[Column], "No value found")
Example: IF(ISBLANK(SELECTEDVALUE('Table'[Column])), "No value found", SELECTEDVALUE('Table'[Column]))
Q95. Difference between having and where
The main difference between 'having' and 'where' in SQL is that 'having' is used with aggregate functions, while 'where' is used with individual rows.
HAVING is used to filter groups of rows based on aggregate functions, such as COUNT, SUM, AVG, etc.
WHERE is used to filter individual rows based on specific conditions.
HAVING is applied after the GROUP BY clause, while WHERE is applied before.
Example: SELECT department, AVG(salary) FROM employees GROUP BY department HAVING AVG(s...
Q96. What is Interest coverage ratio
Interest coverage ratio measures a company's ability to pay interest on its debt with its earnings.
Interest coverage ratio is calculated by dividing a company's earnings before interest and taxes (EBIT) by its interest expenses.
A higher interest coverage ratio indicates that a company is more capable of meeting its interest obligations.
A lower interest coverage ratio may signal financial distress and an increased risk of default.
For example, if a company has EBIT of $1,000,00...
Q97. LOD in Tableau, row-level security
Row level security in Tableau allows users to restrict access to specific rows of data based on user roles or permissions.
Row level security in Tableau is implemented by creating user filters based on user roles or permissions.
Users can only see data that they have been granted access to, based on the row level security settings.
Row level security can be set up using Tableau Server or Tableau Online.
Example: A sales manager can only see sales data for their region, while a ma...
Q98. Find the second largest element in a Python array
Find the second largest element in a Python array
Remove duplicates, then sort the distinct values in descending order
Return the element at index 1 of the sorted distinct values, which is the second largest element
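A minimal sketch of the sort-based approach. Duplicates are removed first so that a repeated maximum is not reported as the second largest; returning None for fewer than two distinct values is an assumption about the desired behaviour.

```python
def second_largest(values):
    """Return the second largest distinct value, or None if there is none."""
    distinct = sorted(set(values), reverse=True)
    if len(distinct) < 2:
        return None
    return distinct[1]


result = second_largest([4, 9, 9, 7])
no_result = second_largest([5, 5])
print(result, no_result)
```

Sorting costs O(n log n); a single pass that tracks the two largest values seen so far does the same job in O(n) if the list is large.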
Q99. difference between random forest and decision tree
Random forest is an ensemble learning method using multiple decision trees to improve accuracy and reduce overfitting.
Random forest uses multiple decision trees to make predictions.
Decision tree is a single tree structure used for classification and regression.
Random forest reduces overfitting by averaging the predictions of multiple trees.
Decision tree can be prone to overfitting if not pruned properly.
Random forest is more robust and accurate compared to a single decision t...
Q100. How to print a graph.
To print a graph, you can use data visualization libraries like Matplotlib or Seaborn in Python.
Use Matplotlib library in Python to create and display graphs.
Import matplotlib.pyplot module to access plotting functions.
Use functions like plt.plot() or plt.scatter() to create different types of graphs.
Finally, use plt.show() to display the graph on the screen.