Filter interviews by
I applied via Naukri.com and was interviewed in Nov 2019. There were 6 interview rounds.
I applied via Approached by Company
Top trending discussions
I applied via Company Website and was interviewed in Jun 2024. There was 1 interview round.
Data migration is the process of transferring data from one system to another.
Data migration involves transferring data from one storage system to another.
It can also involve moving data from one format to another.
Data migration is often necessary when upgrading systems or consolidating data.
Examples include migrating data from an old CRM system to a new one, or moving data from on-premises servers to the cloud.
Agile methodologies within data migration involve iterative and incremental processes to efficiently move data from one system to another.
Scrum: Breaks down the data migration process into smaller tasks called sprints, with regular meetings to track progress.
Kanban: Visualizes the data migration workflow on a board, allowing for continuous delivery of data.
Lean: Focuses on minimizing waste and maximizing value during t...
I applied via Walk-in and was interviewed in May 2024. There were 2 interview rounds.
It was good,Who will pass the test they will get into interview otherwise they are not selected for next round.
I cleared my Aptitude test.
I was interviewed in Aug 2021.
Big Data refers to large volumes of data that cannot be processed using traditional methods.
Big Data involves processing and analyzing large volumes of data
It includes structured, unstructured, and semi-structured data
Examples include social media data, sensor data, and financial transactions
I applied via Walk-in and was interviewed in Dec 2024. There were 5 interview rounds.
Given task Statics standard deviations Attrition Average of given table values and Given graph economi graph and poverty graph base on that need to gave answers 30 qustion and 60 min time duration
I was interviewed in Dec 2024.
Work done from all assigments
I was interviewed in Dec 2024.
I was interviewed in Dec 2024.
I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.
Optimizing SQL queries involves using indexes, avoiding unnecessary joins, and optimizing the query structure.
Use indexes on columns frequently used in WHERE clauses
Avoid using SELECT * and only retrieve necessary columns
Optimize joins by using INNER JOIN instead of OUTER JOIN when possible
Use EXPLAIN to analyze query performance and make necessary adjustments
Performance optimization in Spark involves tuning configurations, optimizing code, and utilizing caching.
Tune Spark configurations such as executor memory, number of executors, and shuffle partitions.
Optimize code by reducing unnecessary shuffles, using efficient transformations, and avoiding unnecessary data movements.
Utilize caching to store intermediate results in memory and avoid recomputation.
Example: In my projec...
SparkContext is the main entry point for Spark functionality, while SparkSession is the entry point for Spark SQL.
SparkContext is the entry point for low-level API functionality in Spark.
SparkSession is the entry point for Spark SQL functionality.
SparkContext is used to create RDDs (Resilient Distributed Datasets) in Spark.
SparkSession provides a unified entry point for reading data from various sources and performing
When a spark job is submitted, various steps are executed at the backend to process the job.
The job is submitted to the Spark driver program.
The driver program communicates with the cluster manager to request resources.
The cluster manager allocates resources (CPU, memory) to the job.
The driver program creates DAG (Directed Acyclic Graph) of the job stages and tasks.
Tasks are then scheduled and executed on worker nodes ...
Calculate second highest salary using SQL and pyspark
Use SQL query with ORDER BY and LIMIT to get the second highest salary
In pyspark, use orderBy() and take() functions to achieve the same result
The two types of modes for Spark architecture are standalone mode and cluster mode.
Standalone mode: Spark runs on a single machine with a single JVM and is suitable for development and testing.
Cluster mode: Spark runs on a cluster of machines managed by a cluster manager like YARN or Mesos for production workloads.
Client mode is better for very less latency due to direct communication with the cluster.
Client mode allows direct communication with the cluster, reducing latency.
Standalone mode requires an additional layer of communication, increasing latency.
Client mode is preferred for real-time applications where low latency is crucial.
Software Developer
4
salaries
| ₹6.5 L/yr - ₹8.8 L/yr |
Senior IOS Developer
3
salaries
| ₹5.5 L/yr - ₹5.5 L/yr |
TCS
Accenture
Wipro
Cognizant