Integral Ad Science
I applied via Recruitment Consultant and was interviewed before Mar 2022. There were 6 interview rounds.
It was an attitude and English round; the main technical round starts only after you pass it.
You will be asked coding questions and SQL queries.
I applied via Naukri.com and was interviewed in Nov 2022. There were 4 interview rounds.
How would you feel if you won an argument with your manager on a particular matter?
I applied via LinkedIn and was interviewed in Apr 2023. There were 3 interview rounds.
Logical reasoning questions were asked.
Budgeting involves creating a financial plan for a specific period of time.
Identify the time period for the budget
Determine the sources of income
Estimate expenses and allocate funds accordingly
Regularly monitor and adjust the budget as needed
I applied via Naukri.com and was interviewed before Jun 2023. There were 2 interview rounds.
I have worked with various technologies including Java, Python, SQL, and AWS.
Java
Python
SQL
AWS
DSA, Python, and Java programming.
I applied via Referral and was interviewed in Nov 2021. There was 1 interview round.
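In Spark, coalesce reduces the number of partitions while avoiding a full shuffle, whereas repartition shuffles data across nodes. (The SQL function COALESCE, which returns the first non-null value among its arguments, is a different thing.)
Coalesce reduces the number of partitions to the requested target; it cannot increase the count.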
Repartition increases or decreases the number of partitions.
Coalesce is a narrow transformation while repartition is a wide transformation.
Coalesce is used to optimize data for queries while repartition is used to balance data acros...
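The difference can be pictured with a small plain-Python model. This is only a sketch, not Spark's API: the `coalesce` and `repartition` functions below are hypothetical stand-ins that treat a dataset as a list of partitions, and the round-robin redistribution is an assumption chosen for simplicity.

```python
from typing import List

Partitions = List[List[int]]

def coalesce(partitions: Partitions, n: int) -> Partitions:
    """Merge whole partitions into n groups; no record is redistributed individually."""
    if n >= len(partitions):
        # In this model, coalesce never increases the partition count.
        return [list(p) for p in partitions]
    merged: Partitions = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Adjacent source partitions land in the same target partition.
        merged[i * n // len(partitions)].extend(part)
    return merged

def repartition(partitions: Partitions, n: int) -> Partitions:
    """Redistribute every record (a full shuffle); round-robin is assumed here."""
    out: Partitions = [[] for _ in range(n)]
    records = [r for part in partitions for r in part]
    for i, rec in enumerate(records):
        out[i % n].append(rec)
    return out

skewed = [[1, 2, 3, 4, 5, 6], [7], [8], [9]]
print(coalesce(skewed, 2))     # partitions merged as-is, skew remains
print(repartition(skewed, 2))  # records spread evenly
print(len(repartition(skewed, 6)))  # repartition can also increase the count
```

In this model the coalesced result stays skewed because whole partitions are merged, while the repartitioned result is balanced at the cost of touching every record.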
Optimizing joins involves selecting appropriate join types, indexing tables, and minimizing data movement.
Choose the appropriate join type based on the size and structure of the tables being joined
Index the tables on the join columns to speed up the join process
Minimize data movement by selecting only the necessary columns and filtering rows before joining
Consider using denormalization or materialized views to precompu
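As a rough illustration of the indexing and filter-before-join points, here is a minimal sketch using Python's built-in sqlite3 module. The `customers` and `orders` tables, their columns, and the index name are made up for the example, and the plan that SQLite prints will differ from what other databases or Spark would choose.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, country TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    -- Index the join column so matching orders can be looked up instead of scanned.
    CREATE INDEX idx_orders_customer ON orders (customer_id);
""")
conn.executemany("INSERT INTO customers VALUES (?, ?, ?)",
                 [(1, "Asha", "IN"), (2, "Ben", "US"), (3, "Chen", "IN")])
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)",
                 [(10, 1, 250.0), (11, 2, 90.0), (12, 3, 40.0), (13, 1, 60.0)])

# Select only the needed columns and filter customers before joining.
query = """
    SELECT c.name, SUM(o.amount) AS total
    FROM (SELECT id, name FROM customers WHERE country = 'IN') AS c
    JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.id, c.name
    ORDER BY c.name
"""
totals = conn.execute(query).fetchall()
print(totals)

# Inspect how SQLite plans the join; the exact wording varies by SQLite version.
for row in conn.execute("EXPLAIN QUERY PLAN " + query):
    print(row)
```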
RDD is a low-level distributed data structure while DataFrame is a high-level structured data abstraction.
Both are immutable; an RDD carries no schema, while a DataFrame is structured and has a schema
DataFrames are optimized by the Catalyst query optimizer and suit SQL-style queries; both RDDs and DataFrames can be cached in memory
RDDs are more flexible and can be used for complex data processing tasks
DataFrames are easier to use and provide a more concise syntax for data manipulation
RDDs are the...
I applied via LinkedIn and was interviewed before Oct 2022. There were 4 interview rounds.
Basic DSA questions in Python, plus data engineering and SQL questions.
Basic DSA questions in Python and data engineering questions.
I applied via Recruitment Consultant and was interviewed before Mar 2022. There were 3 interview rounds.
A UCAT test assessing verbal reasoning, quantitative skills, etc.
Next was a one-on-one round with the Manager where we talked about my role and what they expected from me.
Topic was Ad Tech. It was an open book exam.
Spark Context is the entry point to any Spark functionality while Spark Session is a unified entry point for Spark 2.0+.
Spark Context is the old entry point to Spark functionality.
Spark Session is a unified entry point for Spark 2.0+.
Spark Context is used to create RDDs, accumulators and broadcast variables.
Spark Session is used to create DataFrames, execute SQL queries and read data from external sources.
Repartitioning can increase or decrease the number of partitions, while Coalesce only reduces it.
Repartitioning shuffles data across the cluster and can be used to increase parallelism.
Coalesce merges partitions without shuffling data and can be used to reduce overhead.
Repartitioning is expensive and should be used sparingly.
Coalesce is faster but may not be as effective as repartitioning in increasing parallelism.
Both can be used to optimize da
SQL query to find the second highest salary
Use ORDER BY and LIMIT to select the second highest salary
Use subquery to select the maximum salary and exclude it from the result set
Handle cases where there are ties for the highest salary
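Both approaches can be tried out with Python's built-in sqlite3 module. This is a minimal sketch: the `employees` table and its rows are invented for the example, and the `LIMIT ... OFFSET` syntax shown is the SQLite form, which some other databases spell differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
# Two employees tie for the highest salary (120).
conn.executemany("INSERT INTO employees VALUES (?, ?)",
                 [("A", 90), ("B", 120), ("C", 120), ("D", 75)])

# Approach 1: ORDER BY + LIMIT; DISTINCT collapses the tie for the top salary.
by_limit = conn.execute(
    "SELECT DISTINCT salary FROM employees ORDER BY salary DESC LIMIT 1 OFFSET 1"
).fetchone()

# Approach 2: subquery that excludes the maximum, then takes the maximum of the rest.
by_subquery = conn.execute(
    "SELECT MAX(salary) FROM employees "
    "WHERE salary < (SELECT MAX(salary) FROM employees)"
).fetchone()

print(by_limit, by_subquery)
```

Without DISTINCT, the first approach would return 120 again because of the tie. When there is no second distinct salary, the first approach returns no row while the second returns a single NULL, so callers need to handle both cases.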
Spark is a distributed computing engine that processes large datasets in parallel across a cluster of computers.
Spark uses a master-slave architecture with a driver program that coordinates tasks across worker nodes.
Data is stored in Resilient Distributed Datasets (RDDs) that can be cached in memory for faster processing.
Spark supports multiple programming languages including Java, Scala, and Python.
Spark can be used f...
Broadcast Join is a technique used in distributed computing to optimize join operations.
Broadcast Join is used when one table is small enough to fit in memory of all nodes in a cluster.
The smaller table is broadcast to all nodes in the cluster, so the larger table does not have to be shuffled across the network.
Broadcast Join is faster than other join techniques when used appropriately.
Example: Joining a small reference table with a large fact table in a data
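The idea can be sketched in plain Python without any Spark API. Everything here is hypothetical: `fact_partitions` stands in for a large table split across nodes, `country_by_id` for the small reference table, and `join_partition` for the per-node join work.

```python
from typing import Dict, List, Tuple

# Hypothetical data: a large fact table split across partitions,
# and a small reference table that fits in memory everywhere.
fact_partitions: List[List[Tuple[int, float]]] = [
    [(1, 10.0), (2, 5.0)],
    [(1, 7.5), (3, 2.0)],
]
country_by_id: Dict[int, str] = {1: "IN", 2: "US"}

def join_partition(rows: List[Tuple[int, float]],
                   lookup: Dict[int, str]) -> List[Tuple[int, float, str]]:
    """Inner join done locally: each partition probes its own copy of the lookup."""
    return [(key, amount, lookup[key]) for key, amount in rows if key in lookup]

# "Broadcast": every partition gets a full copy of the small table,
# so the fact rows never move between partitions.
joined = [join_partition(part, dict(country_by_id)) for part in fact_partitions]
print(joined)
```

The fact row with key 3 is dropped because it has no match in the reference table, as in any inner join.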
based on 7 interviews
Interview experience
based on 18 reviews
Rating in categories
Software Engineer (23 salaries): ₹8.4 L/yr - ₹18.5 L/yr
Associate Software Engineer (8 salaries): ₹7 L/yr - ₹12 L/yr
Data Engineer (7 salaries): ₹9.6 L/yr - ₹22 L/yr
Senior Software Engineer (5 salaries): ₹14.8 L/yr - ₹28 L/yr
Senior Integration Manager (4 salaries): ₹45 L/yr - ₹60 L/yr