i
Altimetrik
Filter interviews by
I applied via Recruitment Consulltant and was interviewed in Sep 2024. There were 2 interview rounds.
Accumulators are shared variables that are updated by worker nodes and can be used for aggregating information across tasks.
Accumulators are used for implementing counters and sums in Spark.
They are only updated by worker nodes and are read-only by the driver program.
Accumulators are useful for debugging and monitoring purposes.
Example: counting the number of errors encountered during processing.
Spark architecture is a distributed computing framework that consists of a driver program, cluster manager, and worker nodes.
Spark architecture includes a driver program that manages the execution of the Spark application.
It also includes a cluster manager that allocates resources and schedules tasks on worker nodes.
Worker nodes are responsible for executing the tasks and storing data in memory or disk.
Spark architectu...
Query to find duplicate data using SQL
Use GROUP BY and HAVING clause to identify duplicate records
Select columns to check for duplicates
Use COUNT() function to count occurrences of each record
Pub/sub is a messaging pattern where senders (publishers) of messages do not program the messages to be sent directly to specific receivers (subscribers).
Pub/sub stands for publish/subscribe.
Publishers send messages to a topic, and subscribers receive messages from that topic.
It allows for decoupling of components in a system, enabling scalability and flexibility.
Examples include Apache Kafka, Google Cloud Pub/Sub, and
I have used services like BigQuery, Dataflow, Pub/Sub, and Cloud Storage in GCP.
BigQuery for data warehousing and analytics
Dataflow for real-time data processing
Pub/Sub for messaging and event ingestion
Cloud Storage for storing data and files
I applied via Naukri.com and was interviewed in Dec 2024. There were 4 interview rounds.
NA kjwnoi wniowe nfiow flmi
NA fklwmoiwef,m ionfwno njnwfeio onfwp
I applied via Recruitment Consulltant and was interviewed in Apr 2024. There was 1 interview round.
Hackerearth, advanced sql queries on joins, string literals, sql conceptual MCQ
Designing ETL flow in Google Cloud Platform (GCP) involves defining data sources, transformation processes, and loading destinations.
Identify data sources and extract data using GCP services like Cloud Storage, BigQuery, or Cloud SQL.
Transform data using tools like Dataflow or Dataprep to clean, enrich, and aggregate data.
Load transformed data into target destinations such as BigQuery, Cloud Storage, or other databases...
Altimetrik interview questions for designations
I applied via LinkedIn and was interviewed before Jan 2024. There were 2 interview rounds.
Assesment it was from codility i guess
Get interview-ready with Top Altimetrik Interview Questions
I applied via Naukri.com and was interviewed in Mar 2024. There was 1 interview round.
MCQ questions related to pyspark and big data technologies.
Use a dictionary to find duplicates in a list of strings in Python.
Create an empty dictionary to store the count of each string in the list.
Iterate through the list and update the count in the dictionary for each string.
Print out the strings that have a count greater than 1 to find duplicates.
I was interviewed before May 2023.
Conceptual round with some pyspark coding
Sql join related question and pyspark coding
Top trending discussions
2 Interview rounds
based on 15 reviews
Rating in categories
Senior Software Engineer
1.2k
salaries
| ₹9 L/yr - ₹35 L/yr |
Staff Engineer
832
salaries
| ₹10.9 L/yr - ₹40 L/yr |
Senior Engineer
618
salaries
| ₹9 L/yr - ₹30 L/yr |
Software Engineer
310
salaries
| ₹4.7 L/yr - ₹18.5 L/yr |
Senior Staff Engineer
214
salaries
| ₹15 L/yr - ₹43 L/yr |
Accenture
Persistent Systems
Mphasis
LTIMindtree