TCS
30+ Unient India Interview Questions and Answers
Q1. Describe the steps for Universe design from scratch?
Universe design involves identifying data sources, defining business requirements, creating a schema, and building the universe.
Identify data sources and gather requirements
Create a schema and map data to it
Build the universe using a tool like SAP BusinessObjects
Test and validate the universe
Publish the universe for end-users to access
Q2. Can you describe the experience with data analysis and its tools?
I have extensive experience in data analysis using tools such as Excel, SQL, Python, and Tableau.
Proficient in Excel for data cleaning, manipulation, and visualization
Strong SQL skills for querying databases and extracting relevant information
Experience with Python for statistical analysis and machine learning
Familiarity with Tableau for creating interactive dashboards and reports
Q3. How do you determine which variable is important in predictive model ?
Variables importance in predictive model is determined using techniques like feature selection, correlation analysis, and machine learning algorithms.
Use feature selection techniques like Recursive Feature Elimination (RFE) or SelectKBest to identify important variables.
Analyze correlation between variables and target variable to determine importance.
Utilize machine learning algorithms like Random Forest or Gradient Boosting to rank variables based on their impact on model pe...read more
Q4. How do you optimize the report when working with large datasets
Optimizing reports with large datasets involves using efficient data processing techniques and visualization tools.
Use data aggregation to summarize large datasets
Utilize indexing and proper data structures for faster retrieval
Implement parallel processing or distributed computing for quicker analysis
Leverage data visualization tools like Tableau or Power BI for easier interpretation
Consider using sampling techniques to work with subsets of data for initial analysis
Q5. Describe one complex report developed in SAP BO?
Developed a complex sales report in SAP BO
Report included multiple data sources and complex calculations
Used advanced visualization techniques to present data
Allowed for drill-down capabilities to analyze sales data at different levels
Included filters and prompts for user customization
Report was automated to run on a regular schedule and distribute to stakeholders
Q6. What is V loopup and H loopup
Vlookup and Hlookup are Excel functions used to search for specific data in a table.
Vlookup searches for a value in the first column of a table and returns a corresponding value in the same row from a specified column.
Hlookup searches for a value in the first row of a table and returns a corresponding value in the same column from a specified row.
Both functions are commonly used in data analysis and can save time when working with large datasets.
Q7. What is difference between array and list
Arrays have fixed size and homogeneous elements, while lists can dynamically resize and have heterogeneous elements.
Arrays have a fixed size, while lists can dynamically resize.
Arrays store elements of the same data type, while lists can store elements of different data types.
Arrays are accessed by index, while lists are accessed by iterating through elements.
Q8. Difference between Power BI Tableau, advantages and limitations of both
Power BI and Tableau are both popular data visualization tools, each with its own advantages and limitations.
Power BI is more user-friendly and integrates well with Microsoft products, while Tableau offers more advanced visualization options.
Power BI is better suited for business users who are already using Microsoft products, while Tableau is preferred by data analysts and data scientists.
Power BI is more cost-effective for small to medium-sized businesses, while Tableau is ...read more
Q9. How do you share the report with users
I share reports with users through email, online portals, presentations, and interactive dashboards.
Send reports via email attachments
Upload reports to online portals for easy access
Present findings in meetings or webinars
Create interactive dashboards for users to explore data
Provide access to reports through secure data sharing platforms
Q10. what is data analytics
Data analytics is the process of examining data sets to draw conclusions about the information they contain.
It involves collecting, cleaning, and transforming data
Applying statistical and mathematical models to analyze the data
Visualizing and presenting the results in a meaningful way
Examples include predicting customer behavior, identifying trends, and optimizing business processes
Q11. What is different b/w dbms and rdms
DBMS is a software system that manages databases, while RDBMS is a type of DBMS that stores data in a structured format using tables.
DBMS stands for Database Management System, while RDBMS stands for Relational Database Management System.
RDBMS stores data in a structured format using tables with rows and columns, while DBMS may or may not use a structured format.
RDBMS enforces ACID properties (Atomicity, Consistency, Isolation, Durability) for database transactions, ensuring ...read more
Q12. What is implicit and explicit measures
Implicit and explicit measures are two types of assessments used in data analysis to gather information.
Implicit measures are unconscious assessments that are not easily controlled or influenced by the individual being assessed.
Explicit measures are conscious assessments that are directly stated or easily articulated by the individual being assessed.
Implicit measures are often used in psychology to assess attitudes or biases that individuals may not be aware of.
Explicit measu...read more
Q13. What is RLS in Power BI
RLS in Power BI stands for Row-Level Security, which allows users to restrict access to certain rows of data based on their role or permissions.
RLS helps in controlling access to data at the row level
It allows users to define security roles and rules to restrict data access
Users can create filters based on roles to limit data visibility
For example, a manager can only see data related to their department using RLS
Q14. Explain the functions in SQL and Explain OOPS.
SQL functions are used to perform operations on data stored in a database. OOPS stands for Object-Oriented Programming System.
SQL functions are used to manipulate data in a database, such as SUM, AVG, COUNT, etc.
OOPS is a programming paradigm based on the concept of objects, which can contain data in the form of fields and code in the form of procedures.
Examples of SQL functions: SELECT COUNT(*) FROM table_name; SELECT AVG(column_name) FROM table_name;
Examples of OOPS concept...read more
Q15. how to manage data
Data management involves organizing, storing, protecting, and maintaining data to ensure its accuracy and accessibility.
Identify the type of data and its source
Establish a data governance framework
Implement data security measures
Use data management tools and software
Regularly backup and archive data
Ensure compliance with data regulations
Monitor data quality and integrity
Provide access to authorized users only
Q16. Types of join left right self cross
Types of joins in SQL include left, right, self, and cross joins.
Left join: Returns all records from the left table and the matched records from the right table.
Right join: Returns all records from the right table and the matched records from the left table.
Self join: Joins a table with itself.
Cross join: Returns the Cartesian product of the two tables.
Q17. explain some of the ML algorithms
Machine learning algorithms are models that learn patterns from data to make predictions or decisions.
Supervised learning algorithms: Decision Trees, Random Forest, Support Vector Machines
Unsupervised learning algorithms: K-means clustering, Hierarchical clustering, Principal Component Analysis
Reinforcement learning algorithms: Q-learning, Deep Q Networks
Neural network algorithms: Convolutional Neural Networks, Recurrent Neural Networks
Q18. explain the flow of data analysis
Data analysis involves collecting, cleaning, processing, analyzing, and interpreting data to extract insights and make informed decisions.
Collect data from various sources
Clean and preprocess data to remove errors and inconsistencies
Perform statistical analysis and data mining techniques
Visualize data using charts, graphs, and dashboards
Interpret results and draw conclusions
Communicate findings to stakeholders
Q19. What is Dax how it can be used
DAX (Data Analysis Expressions) is a formula language used in Power BI, Excel, and other Microsoft tools for data analysis and calculations.
DAX is used to create custom calculations, measures, and columns in Power BI reports.
It can be used to perform complex calculations, such as aggregations, comparisons, and filtering.
DAX functions can be used to manipulate data, perform time intelligence calculations, and create advanced visualizations.
For example, you can use DAX to calcu...read more
Q20. Joins and different clause in SQL
Joins and different clause in SQL
Joins are used to combine rows from two or more tables based on a related column between them
Common types of joins include INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN
Clauses like WHERE, GROUP BY, HAVING, and ORDER BY are used to filter, group, and sort data in SQL queries
Q21. why we use analytics
Analytics helps us make data-driven decisions and improve business outcomes.
Analytics provides insights into customer behavior and preferences.
It helps identify trends and patterns in data that can be used to optimize business processes.
Analytics can also be used to measure the effectiveness of marketing campaigns and other initiatives.
It enables businesses to make data-driven decisions that lead to improved outcomes and increased profitability.
For example, a retailer can use...read more
Q22. Difference between power bi and tableau
Power BI is more user-friendly and integrates well with Microsoft products, while Tableau offers more advanced visualization capabilities.
Power BI is more user-friendly and integrates well with Microsoft products
Tableau offers more advanced visualization capabilities
Power BI is better suited for users who are familiar with Microsoft products
Tableau is preferred by users who require more complex data visualization and analysis
Power BI is often used for quick and simple data an...read more
Q23. what is long term goal
My long term goal is to become a data science manager and lead a team of analysts to drive impactful business decisions through data-driven insights.
Continue developing technical skills in data analysis and machine learning
Gain leadership experience through project management and team collaboration
Obtain advanced degrees or certifications in data science or related fields
Work towards becoming a subject matter expert in a specific industry or domain
Q24. Cte in sql and su queries
Common table expressions (CTE) in SQL are temporary result sets that can be referenced within a SELECT, INSERT, UPDATE, or DELETE statement.
CTEs are defined using the WITH keyword followed by a name and a query.
They can be recursive, allowing a query to reference itself.
CTEs improve readability and maintainability of complex queries.
Example: WITH cte AS (SELECT * FROM table_name) SELECT * FROM cte;
Subqueries (SU queries) are queries nested within another query.
They can be use...read more
Q25. What is Array ?
An array is a data structure that stores a collection of elements of the same type in a contiguous memory location.
Arrays have a fixed size determined at the time of declaration.
Elements in an array can be accessed using their index.
Arrays can store primitive data types or objects.
Example: String[] names = {"Alice", "Bob", "Charlie"};
Q26. What is Compiler ?
A compiler is a software program that translates code written in a high-level programming language into machine code that can be executed by a computer.
Converts high-level programming language code into machine code
Checks for syntax errors and other issues in the code
Generates executable files that can be run on a computer
Examples: GCC (GNU Compiler Collection), Clang, Microsoft Visual C++ Compiler
Q27. What is database
A database is a structured collection of data that is organized and stored for easy access, management, and retrieval.
A database is used to store and manage large amounts of data.
It allows for efficient retrieval and manipulation of data.
Examples of databases include MySQL, Oracle, and MongoDB.
Q28. Write a procedure to read data
Procedure to read data involves defining data source, selecting appropriate tools, importing data, and cleaning/preprocessing data.
Identify the data source (e.g. CSV file, database, API)
Select appropriate tools for reading data (e.g. Python pandas library, SQL queries)
Import data into chosen tool (e.g. read_csv() function in pandas)
Clean and preprocess data as needed (e.g. handling missing values, removing duplicates)
Q29. What is Dax equation
DAX (Data Analysis Expressions) is a formula language used in Power BI, Excel, and other Microsoft tools for data analysis and calculations.
DAX formulas are used to create calculated columns, measures, and calculated tables in Power BI and Excel.
DAX functions can be used for aggregation, filtering, and manipulating data.
Examples of DAX functions include SUM, AVERAGE, CALCULATE, FILTER, and RELATED.
DAX formulas can reference columns, tables, and other measures in the data mode...read more
Q30. spark and its benefits
Spark is a fast and powerful big data processing framework that offers benefits like speed, ease of use, and versatility.
Spark is known for its speed, as it can process data up to 100 times faster than traditional Hadoop MapReduce.
It offers ease of use with high-level APIs in Java, Scala, Python, and SQL, making it accessible to a wide range of users.
Spark is versatile, supporting various workloads such as batch processing, real-time streaming, machine learning, and graph pro...read more
Q31. huddles of data engineer
Huddles of data engineers refer to collaborative meetings or discussions among data engineers to share insights, solve problems, and make decisions.
Huddles are typically informal and can be scheduled or ad-hoc.
They provide a platform for data engineers to brainstorm, troubleshoot, and exchange ideas.
Huddles may involve reviewing code, discussing data pipelines, or addressing technical challenges.
Effective huddles promote teamwork, communication, and knowledge sharing within t...read more
Q32. Tools and technologies used
I have experience using tools such as SQL, Python, Excel, Tableau, and R for data analysis.
SQL
Python
Excel
Tableau
R
Q33. CGPA received in college
My CGPA in college was 3.8 out of 4.0.
My CGPA was 3.8 out of 4.0
I graduated with honors
I consistently maintained a high GPA throughout college
Q34. advantages of sql
SQL allows for efficient data retrieval, manipulation, and analysis in relational databases.
SQL is widely used in querying databases to retrieve specific data.
It allows for data manipulation, such as adding, updating, and deleting records.
SQL can perform complex data analysis tasks, such as aggregations and joins.
It provides a standardized language for interacting with relational databases.
SQL is essential for generating reports and visualizations based on database data.
Q35. advantages of python
Python is a versatile programming language known for its simplicity, readability, and vast library support.
Easy to learn and use, making it ideal for beginners and experienced programmers alike
Extensive library support for data analysis, machine learning, web development, and more
Strong community support with active forums and resources for problem-solving
Cross-platform compatibility allows for seamless integration with different operating systems
Versatile applications in var...read more
More about working at TCS
Top HR Questions asked in Unient India
Interview Process at Unient India
Top Data Analyst Interview Questions from Similar Companies
Reviews
Interviews
Salaries
Users/Month