i
Proftware
Filter interviews by
I applied via Company Website and was interviewed in Dec 2022. There were 4 interview rounds.
Utilize data visualization, statistical analysis, and machine learning techniques to identify patterns and correlations in large datasets for strategic decision making.
Perform exploratory data analysis to understand the structure and relationships within the dataset
Utilize data visualization techniques such as scatter plots, histograms, and heatmaps to identify patterns and correlations
Conduct statistical analysis incl...
To analyze a complex dataset, start by understanding the data, cleaning and structuring it, performing exploratory data analysis, applying statistical methods, and creating visualizations for insights.
Understand the business objectives and goals to align the analysis with company's growth strategy.
Clean and structure the dataset by identifying and handling missing values, outliers, and inconsistencies.
Perform explorato...
Encountered a complex data analysis problem and successfully navigated through it
Encountered a data set with missing values and outliers
Utilized data cleaning techniques such as imputation and outlier detection
Applied statistical analysis and machine learning algorithms to identify patterns and trends
Visualized the data using tools like Tableau for better understanding
Collaborated with domain experts to gain insights a
Top trending discussions
posted on 24 Oct 2024
I applied via Referral and was interviewed in Sep 2024. There was 1 interview round.
I applied via Company Website and was interviewed in Mar 2024. There were 2 interview rounds.
Good questions. 3 SQL case studies at end
I applied via Approached by Company and was interviewed in Aug 2024. There were 3 interview rounds.
AB testing is a method used to compare two versions of a webpage or app to determine which one performs better.
AB testing involves creating two versions (A and B) of a webpage or app with one differing element
Users are randomly assigned to either version A or B to measure performance metrics
The version that performs better in terms of the desired outcome is selected for implementation
Example: Testing two different call...
It was a classification problem
I have worked on projects involving data analysis, visualization, and predictive modeling.
Developed predictive models using machine learning algorithms
Performed data cleaning and preprocessing to ensure data quality
Created interactive dashboards for data visualization
Collaborated with cross-functional teams to derive insights from data
I applied via campus placement at Sastra University and was interviewed in Mar 2024. There were 2 interview rounds.
Well designed to test the aptitude competence of the candidate.
I applied via Company Website and was interviewed in Jul 2021. There was 1 interview round.
Complex SQL scenarios and their results
Using subqueries to filter data
Joining multiple tables with complex conditions
Using window functions to calculate running totals
Pivoting data to transform rows into columns
Using recursive queries to traverse hierarchical data
I applied via Company Website and was interviewed before May 2020. There were 3 interview rounds.
I applied via Naukri.com and was interviewed before May 2021. There were 2 interview rounds.
posted on 17 Dec 2024
I applied via Instahyre and was interviewed in Nov 2024. There was 1 interview round.
Use SQL query to count number of reportees for each manager and filter out those with atleast 5 reportees.
Write a SQL query to count number of reportees for each manager using GROUP BY clause
Add HAVING clause to filter out managers with atleast 5 reportees
Example: SELECT managerId, COUNT(id) AS num_reportees FROM table_name GROUP BY managerId HAVING num_reportees >= 5
Use libraries like pandas and dask to efficiently manage large datasets in Python.
Use pandas library for data manipulation and analysis.
Use dask library for parallel computing and out-of-core processing.
Optimize memory usage by loading data in chunks or using data types efficiently.
Consider using cloud services like AWS S3 or Google BigQuery for storing and processing large datasets.
Some commonly used Python libraries for Data Analysts are Pandas, NumPy, Matplotlib, and Scikit-learn.
Pandas - used for data manipulation and analysis
NumPy - used for numerical computing and working with arrays
Matplotlib - used for data visualization
Scikit-learn - used for machine learning and data mining
based on 1 review
Rating in categories
Maxgen Technologies
VDart
ANR Software Private Limited
Magic Edtech