i
Tiger Analytics
Filter interviews by
Clear (1)
Some questions related to the File Handling, NLP Pipeline, Model Deployments
posted on 23 Jul 2023
I applied via Recruitment Consulltant and was interviewed in Jun 2023. There were 4 interview rounds.
I applied via Job Portal and was interviewed in Mar 2024. There were 3 interview rounds.
Spark cluster sizing depends on workload, data size, memory requirements, and processing speed.
Consider the size of the data being processed
Take into account the memory requirements of the Spark jobs
Factor in the processing speed needed for the workload
Scale the cluster based on the number of nodes and cores required
Monitor performance and adjust cluster size as needed
Implement a pipeline based on given conditions and data requirement
posted on 21 Mar 2022
I applied via Naukri.com and was interviewed in Sep 2021. There were 3 interview rounds.
Questions related to cloud types, ADF activities, advanced SQL, and basic OOPs concepts.
Types of cloud include public, private, and hybrid
ADF activities include data ingestion, transformation, and loading
Advanced SQL includes window functions, subqueries, and joins
Basic OOPs concepts include encapsulation, inheritance, and polymorphism
posted on 7 Jan 2025
I applied via Approached by Company and was interviewed before Jan 2024. There were 3 interview rounds.
Basics of SQL, Python
Experience based questions, SQL and Python
I have worked on projects involving building data pipelines, optimizing data storage, and developing machine learning models.
Built data pipelines using Apache Spark and Airflow
Optimized data storage by implementing partitioning and indexing strategies
Developed machine learning models for predictive analytics
posted on 27 Mar 2024
I applied via Approached by Company and was interviewed in Sep 2023. There were 2 interview rounds.
Use SQL query with subquery to find nth highest salary
Use ORDER BY and LIMIT to get the nth highest salary
Use a subquery to exclude the top n-1 salaries before selecting the nth highest salary
Databricks is a unified data analytics platform that includes components like Databricks Workspace, Databricks Runtime, and Databricks Delta.
Databricks Workspace: Collaborative environment for data science and engineering teams.
Databricks Runtime: Optimized Apache Spark cluster for data processing.
Databricks Delta: Unified data management system for data lakes.
To read a JSON file, use a programming language's built-in functions or libraries to parse the file and extract the data.
Use a programming language like Python, Java, or JavaScript to read the JSON file.
Import libraries like json in Python or json-simple in Java to parse the JSON data.
Use functions like json.load() in Python to load the JSON file and convert it into a dictionary or object.
Access the data in the JSON fi...
To find the second highest salary in SQL, use the MAX function with a subquery or the LIMIT clause.
Use the MAX function with a subquery to find the highest salary first, then use a WHERE clause to exclude it and find the second highest salary.
Alternatively, use the LIMIT clause to select the second highest salary directly.
Make sure to handle cases where there may be ties for the highest salary.
Spark cluster configuration involves setting up memory, cores, and other parameters for optimal performance.
Specify the number of executors and executor memory
Set the number of cores per executor
Adjust the driver memory based on the application requirements
Configure shuffle partitions for efficient data processing
Enable dynamic allocation for better resource utilization
based on 1 interview
Interview experience
Senior Analyst
524
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Scientist
487
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Engineer
471
salaries
| ₹0 L/yr - ₹0 L/yr |
Senior Software Engineer
379
salaries
| ₹0 L/yr - ₹0 L/yr |
Data Analyst
237
salaries
| ₹0 L/yr - ₹0 L/yr |
Fractal Analytics
Mu Sigma
LatentView Analytics
AbsolutData