Filter interviews by
I applied via Approached by Company and was interviewed before Nov 2023. There was 1 interview round.
Top trending discussions
I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.
Spark performance problems can arise due to inefficient code, data skew, resource constraints, and improper configuration.
Inefficient code can lead to slow performance, such as using collect() on large datasets.
Data skew can cause uneven distribution of data across partitions, impacting processing time.
Resource constraints like insufficient memory or CPU can result in slow Spark jobs.
Improper configuration settings, su...
I applied via Naukri.com and was interviewed in Aug 2024. There was 1 interview round.
Build a pipeline to connect http source and bring data in adls
Set up a data ingestion tool like Apache NiFi or Azure Data Factory to pull data from the http source
Transform the data as needed using tools like Apache Spark or Azure Databricks
Store the data in Azure Data Lake Storage (ADLS) for further processing and analysis
Dimension tables store descriptive attributes while fact tables store quantitative data.
Dimension tables contain attributes that describe the business entities, such as customer, product, or time.
Fact tables contain quantitative data or metrics that are typically numeric and additive, such as sales revenue or quantity sold.
Dimension tables are denormalized and have a one-to-many relationship with fact tables.
Fact table...
Use a self join to find employees with 2 managers in SQL query.
Use a self join on the employee table to join the table with itself.
Count the number of distinct managers for each employee.
Filter the results to only include employees with 2 managers.
I applied via LinkedIn and was interviewed in Jul 2024. There were 2 interview rounds.
It was pair programming round where we need to attempt a couple of Spark Scenario along with the Interviewer. You will have a boiler plate code with some functionalities to be filled up. You will be assessed on writing clean and extensible code and test cases.
ALL() ignores all filters in the query context, while ALLSELECTED() ignores only filters on columns in the visual.
ALL() removes all filters from the specified column or table.
ALLSELECTED() removes filters from the specified column or table, but keeps filters on other columns in the visual.
Example: ALL('Table') would remove all filters on the 'Table' in the query context.
Example: ALLSELECTED('Column') would remove filte...
COUNT() counts only numeric values, while COUNTA() counts all non-empty cells.
COUNT() counts only cells with numerical values.
COUNTA() counts all non-empty cells, including text and errors.
Example: COUNT(A1:A5) will count only cells with numbers, while COUNTA(A1:A5) will count all non-empty cells.
Developed a predictive model to forecast sales based on historical data
Collected and cleaned historical sales data
Performed exploratory data analysis to identify trends and patterns
Built and trained a machine learning model using regression techniques
Evaluated model performance using metrics like RMSE and MAE
Cache is used for temporary storage of data in memory, while persist is used for saving data to disk for long-term storage.
Cache is typically faster as it stores data in memory for quick access.
Persist saves data to disk for durability and long-term storage.
Cache is often used for temporary data that can be recomputed if lost, while persist is used for important data that needs to be retained.
Examples: Using cache for ...
Reverse a sentence using Python
Split the sentence into words using split() method
Reverse the list of words using list slicing
Join the reversed list of words back into a sentence using join() method
Basic Apititude Questions
WHERE is used to filter rows before grouping, HAVING is used to filter groups after grouping.
WHERE is used with SELECT statement to filter rows based on a condition
HAVING is used with GROUP BY statement to filter groups based on a condition
WHERE is applied before data is grouped, HAVING is applied after data is grouped
Example: SELECT * FROM table WHERE column = 'value'
Example: SELECT column, COUNT(*) FROM table GROUP B
Row level filters are applied before aggregations, while aggregate filters are applied after aggregations.
Row level filters are used to filter individual rows of data before any aggregations are performed.
Aggregate filters are used to filter the aggregated results after the data has been grouped and summarized.
Row level filters are typically applied using WHERE clause in SQL, while aggregate filters are applied using H...
posted on 2 Jan 2025
Basic python questions
I was interviewed in Dec 2023.
Medical knowledge based questions
based on 1 interview
Interview experience
based on 1 review
Rating in categories
Associate Recruiter
152
salaries
| ₹2.2 L/yr - ₹5.3 L/yr |
Technical Recruiter
148
salaries
| ₹2.5 L/yr - ₹6.7 L/yr |
Software Engineer
80
salaries
| ₹2.5 L/yr - ₹12.2 L/yr |
Senior Technical Recruiter
77
salaries
| ₹3 L/yr - ₹7 L/yr |
HR Executive
65
salaries
| ₹2 L/yr - ₹5.3 L/yr |
TCS
Infosys
Wipro
HCLTech