Filter interviews by
Top trending discussions
I applied via Recruitment Consulltant and was interviewed in Dec 2024. There was 1 interview round.
SQL order of execution determines the sequence in which different clauses are processed in a query.
SQL query is parsed and validated first
Next, the query optimizer creates an execution plan
Execution plan includes steps like table scans, index scans, joins, etc.
Finally, the query is executed and results are returned
I was interviewed in Sep 2024.
They asked various questions on guestimates and few SQL Questions
Databricks is a unified analytics platform that provides a collaborative environment for data scientists, engineers, and analysts.
Databricks is built on top of Apache Spark, providing a unified platform for data engineering, data science, and business analytics.
Internals of Databricks include a cluster manager, job scheduler, and workspace for collaboration.
Optimization techniques in Databricks include query optimizati...
I applied via Approached by Company and was interviewed in Apr 2024. There was 1 interview round.
I have handled terabytes of data in my POCs, including data from various sources and formats.
Handled terabytes of data in POCs
Worked with data from various sources and formats
Used tools like Hadoop, Spark, and SQL for data processing
Repartition is used for increasing partitions for parallelism, while coalesce is used for decreasing partitions to reduce shuffling.
Repartition is used when there is a need for more partitions to increase parallelism.
Coalesce is used when there are too many partitions and need to reduce them to avoid shuffling.
Example: Repartition can be used before a join operation to evenly distribute data across partitions for bette...
Designing/configuring a cluster for 10 petabytes of data involves considerations for storage capacity, processing power, network bandwidth, and fault tolerance.
Consider using a distributed file system like HDFS or object storage like Amazon S3 to store and manage the large volume of data.
Implement a scalable processing framework like Apache Spark or Hadoop to efficiently process and analyze the data in parallel.
Utilize...
I applied via Job Portal and was interviewed in Aug 2024. There were 3 interview rounds.
Its mandatory test even for experience people
I applied via Company Website and was interviewed in Feb 2024. There were 4 interview rounds.
Cyclic linked lists are linked lists where the last node points back to the first node, creating a loop.
Cyclic linked lists have no NULL pointers, making it difficult to determine the end of the list.
They can be used to efficiently represent circular data structures like a round-robin scheduling algorithm.
Detecting cycles in a linked list can be done using Floyd's cycle-finding algorithm.
Real world problem: Predicting customer churn in a subscription-based service
Collect and analyze customer data such as usage patterns, demographics, and interactions
Use machine learning algorithms to identify factors leading to churn
Implement targeted retention strategies based on the analysis
Monitor and evaluate the effectiveness of the strategies over time
Asked me two string array question one was to reverse a string without any pre build function and second one was a medium question to print the number and the count of it into the next level of the tree
I manage my work by prioritizing tasks, setting goals, staying organized, and communicating effectively with team members.
Prioritize tasks based on deadlines and importance
Set clear goals and milestones to track progress
Stay organized with tools like project management software
Communicate effectively with team members to ensure alignment and collaboration
Technician
4
salaries
| ₹1.8 L/yr - ₹2.5 L/yr |
Softwaretest Engineer
4
salaries
| ₹2 L/yr - ₹5.6 L/yr |
Software Engineer
3
salaries
| ₹4.1 L/yr - ₹9.8 L/yr |
Senior Analyst
3
salaries
| ₹17 L/yr - ₹20 L/yr |
Process Associate
3
salaries
| ₹2.2 L/yr - ₹2.5 L/yr |
Infosys
TCS
Wipro
HCLTech