Publicis Sapient
I applied via Company Website and was interviewed in Sep 2024. There were 2 interview rounds.
Platform - HackerRank
Duration - 2 Hours
Topics - Spark and SQL
Common file formats used in data storage include CSV, JSON, Parquet, Avro, and ORC. Parquet, being columnar, typically compresses well and suits analytical queries.
CSV (Comma-Separated Values) - simple and widely used, but not efficient for large datasets
JSON (JavaScript Object Notation) - human-readable and easy to parse, but can be inefficient for storage
Parquet - columnar storage format that is highly efficient for compression and query performance
Avro - efficie...
Python program to find the most repeating substring in a list of words.
Iterate through each word in the list
Generate all possible substrings for each word
Count the occurrences of each substring using a dictionary
Find the substring with the highest count
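The steps above can be sketched as follows. This is a minimal sketch, not the answer given in the interview: the function name and the `min_len` parameter are assumptions, added because single characters would otherwise almost always win. Ties are broken by preferring the longer substring, then alphabetical order, which is also an assumption.

```python
from collections import Counter


def most_repeated_substring(words, min_len=2):
    """Return (substring, count) for the substring seen most often across all words.

    min_len is an assumed parameter: substrings shorter than this are ignored.
    Returns None when no substring of at least min_len characters exists.
    """
    counts = Counter()
    for word in words:
        n = len(word)
        # Generate every substring of the word with length >= min_len
        for start in range(n):
            for end in range(start + min_len, n + 1):
                counts[word[start:end]] += 1
    if not counts:
        return None
    # Highest count first, then longer substring, then alphabetical order
    return min(counts.items(), key=lambda kv: (-kv[1], -len(kv[0]), kv[0]))
```

Every occurrence is counted, including repeats inside the same word, so the cost grows quadratically with word length.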
I applied via LinkedIn and was interviewed in Nov 2024. There was 1 interview round.
I applied via LinkedIn and was interviewed in Nov 2024. There were 2 interview rounds.
Questions related to Spring Boot/Junit/Messaging Queue/Microservices
The Observer Pattern is a behavioral design pattern where an object (subject) maintains a list of dependents (observers) that are notified of any state changes.
Create an interface for the observer with methods like update()
Create a subject class with methods to add, remove, and notify observers
Implement concrete observer classes that implement the observer interface
When the state of the subject changes, notify all obse
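The structure described above can be sketched in a few lines. The interview round was Java-focused; this sketch uses Python only to keep the examples on this page in one language, and the class names (`Observer`, `Subject`, `RecordingObserver`) and the `set_state` method are illustrative assumptions.

```python
from abc import ABC, abstractmethod


class Observer(ABC):
    """Observer interface: anything that wants to be told about state changes."""

    @abstractmethod
    def update(self, state):
        ...


class Subject:
    """Keeps a list of observers and notifies them when its state changes."""

    def __init__(self):
        self._observers = []
        self._state = None

    def attach(self, observer):
        if observer not in self._observers:
            self._observers.append(observer)

    def detach(self, observer):
        self._observers.remove(observer)

    def notify(self):
        # Iterate over a copy so observers may detach themselves during update()
        for observer in list(self._observers):
            observer.update(self._state)

    def set_state(self, state):
        self._state = state
        self.notify()


class RecordingObserver(Observer):
    """Concrete observer that simply records every state it is sent."""

    def __init__(self):
        self.received = []

    def update(self, state):
        self.received.append(state)
```

The subject only depends on the `update()` method, so new observer types can be added without changing `Subject`.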
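The Fork/Join Framework (in java.util.concurrent since Java 7) parallelizes work by recursively splitting a task into smaller subtasks.
ForkJoinPool is the executor that runs ForkJoinTasks, using work stealing to balance load across its worker threads.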
ForkJoinTask class represents a task that can be forked and joined.
Use fork() method to asynchronously execute a subtask.
Use join() method to wait for the result of a subtask.
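The fork-then-join idea can be illustrated with a loose Python analogue. This is not the Java API: it assumes `concurrent.futures.ThreadPoolExecutor` as a stand-in, where `submit()` plays the role of `fork()` and `result()` plays the role of `join()`. It splits the input once into fixed chunks rather than recursively, and it has no work stealing. The function name and parameters are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def parallel_sum(numbers, chunk_size=1000, max_workers=4):
    """Sum a list by splitting it into chunks that are summed concurrently."""
    # Split the work into independent subtasks
    chunks = [numbers[i:i + chunk_size] for i in range(0, len(numbers), chunk_size)]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # "fork": schedule each subtask without waiting for it
        futures = [pool.submit(sum, chunk) for chunk in chunks]
        # "join": wait for every subtask and combine the partial results
        return sum(future.result() for future in futures)
```

In CPython, threads do not speed up CPU-bound work like this because of the global interpreter lock, so the sketch shows the control flow only.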
Fetch cart items and display them
Create a function to fetch cart items from database
Display the items in the cart on the user interface
Handle empty cart scenarios
Consider pagination for a large number of items
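A minimal sketch of these points, assuming a SQLite table named `cart_items` with `id`, `user_id`, `name`, and `quantity` columns. The table, the column names, and both function names are hypothetical; the original question did not specify a database or UI layer, so the display step is reduced to building a string.

```python
import sqlite3


def fetch_cart_items(conn, user_id, page=1, page_size=10):
    """Fetch one page of a user's cart items from the (assumed) cart_items table."""
    offset = (page - 1) * page_size
    rows = conn.execute(
        "SELECT name, quantity FROM cart_items "
        "WHERE user_id = ? ORDER BY id LIMIT ? OFFSET ?",
        (user_id, page_size, offset),
    ).fetchall()
    return [{"name": name, "quantity": quantity} for name, quantity in rows]


def render_cart(items):
    """Turn cart items into display text, with a message for the empty cart."""
    if not items:
        return "Your cart is empty."
    return "\n".join(f"{item['name']} x{item['quantity']}" for item in items)
```

`LIMIT`/`OFFSET` paging is the simplest option; for very large carts, keyset paging on `id` avoids scanning skipped rows.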
Fetch characters and their movies
Use an API such as IMDb or The Movie Database to fetch movie data
Parse the data to extract characters and their associated movies
Store the characters and movies in an array of strings
I was interviewed in Sep 2024.
There were MCQs divided into sections such as Design Patterns and Java 8.
NER training using deep learning
I approach assignments by breaking them down into smaller tasks, setting deadlines, and regularly checking progress.
Break down the assignment into smaller tasks to make it more manageable
Set deadlines for each task to stay on track
Regularly check progress to ensure everything is on schedule
Seek feedback from colleagues or supervisors to improve the quality of work
I applied via Approached by Company and was interviewed in Sep 2024. There were 3 interview rounds.
It was about PySpark, for a data engineer position.
I applied via Recruitment Consultant and was interviewed in Jul 2024. There were 2 interview rounds.
If a job fails in the pipeline and the data processing cycle is already over, the result can be incomplete or inaccurate data.
Incomplete data may affect downstream processes and analysis
Data quality may be compromised if errors are not addressed
Monitoring and alerting systems should be in place to detect and handle failures
Re-running the failed job or implementing error handling mechanisms can help prevent issues in the future
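Re-running with error handling might look like the sketch below. It assumes the job is an idempotent Python callable, so running it twice does no harm; `run_with_retries`, its parameters, and the `pipeline` logger name are hypothetical. In a real pipeline the logged error would typically feed an alerting system.

```python
import logging
import time

logger = logging.getLogger("pipeline")


def run_with_retries(job, max_attempts=3, delay_seconds=0.0):
    """Run job(), retrying on any exception up to max_attempts times.

    Assumes job is idempotent. The last failure is re-raised so the
    scheduler or monitoring layer still sees the job as failed.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return job()
        except Exception:
            # Log with traceback so the failure is visible to monitoring
            logger.exception("attempt %d of %d failed", attempt, max_attempts)
            if attempt == max_attempts:
                raise
            time.sleep(delay_seconds)
```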
Repartition can increase or decrease the number of partitions in a DataFrame, while coalesce only reduces the number of partitions and avoids a full shuffle.
Repartition involves a full shuffle of the data across the cluster, which can be expensive.
Coalesce minimizes data movement by merging existing partitions rather than redistributing every row.
Repartition is typically used when increasing parallelism or evenly distributing data, while coalesce is used for reduc...
SQL query to get the city1, city2, and distance values from a table with repeating city1 and city2 values
Use a self join on the table to match city1 and city2
Calculate the distance between the cities using an appropriate formula
Consider using a subquery if needed
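One reading of this question is that the table stores each pair twice, once as (A, B) and once as (B, A), and each pair should be returned once. The sketch below takes that reading, which is an assumption, and also assumes the distance is already stored in a `distance` column rather than computed. The `routes` table, the function name, and the sample values are hypothetical; SQLite is used only so the query can be run from Python.

```python
import sqlite3

# Self join: pair each row with its mirrored row (city1 and city2 swapped).
# Keep a row if it has no mirror, or if it is the alphabetically ordered copy.
DEDUP_SQL = """
SELECT DISTINCT a.city1, a.city2, a.distance
FROM routes AS a
LEFT JOIN routes AS b
  ON a.city1 = b.city2 AND a.city2 = b.city1
WHERE b.city1 IS NULL OR a.city1 < a.city2
ORDER BY a.city1, a.city2
"""


def unique_city_pairs(rows):
    """Load rows into an in-memory routes table and return each pair once."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE routes (city1 TEXT, city2 TEXT, distance INTEGER)")
    conn.executemany("INSERT INTO routes VALUES (?, ?, ?)", rows)
    result = conn.execute(DEDUP_SQL).fetchall()
    conn.close()
    return result
```

For example, `unique_city_pairs([("A", "B", 100), ("B", "A", 100), ("D", "C", 250)])` keeps the ("A", "B") row, drops its mirror, and keeps ("D", "C") because it has no mirror.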
Data partitioning in a pipeline involves dividing data into smaller chunks for processing and analysis.
Data can be partitioned based on a specific key or attribute, such as date, location, or customer ID.
Partitioning helps distribute data processing tasks across multiple nodes or servers for parallel processing.
Common partitioning techniques include range partitioning, hash partitioning, and list partitioning.
Example: ...
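The three techniques named above can be sketched as functions that map a record's key to a partition. This is a minimal illustration rather than how any particular engine implements it; the function names, the `default` bucket, and the use of CRC32 as the hash are assumptions.

```python
import zlib
from bisect import bisect_right


def hash_partition(key, num_partitions):
    """Hash partitioning: spread keys evenly across a fixed number of partitions."""
    # crc32 is stable across runs, unlike the built-in hash() for strings
    return zlib.crc32(str(key).encode("utf-8")) % num_partitions


def range_partition(value, boundaries):
    """Range partitioning: boundaries must be sorted.

    Values below boundaries[0] go to partition 0, values in
    [boundaries[0], boundaries[1]) go to partition 1, and so on.
    """
    return bisect_right(boundaries, value)


def list_partition(value, mapping, default="other"):
    """List partitioning: an explicit value-to-partition lookup table."""
    return mapping.get(value, default)
```

Range partitioning also works on ISO-formatted date strings, since they sort chronologically: `range_partition("2024-09-15", ["2024-07-01", "2024-10-01"])` returns 1.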
The duration of the Publicis Sapient interview process can vary, but it typically takes less than 2 weeks to complete.