I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.
Initially, I began with a medium-level question on LeetCode regarding the longest unique substring. Following that, I was asked about fundamental concepts in data engineering and two SQL questions: one to find the second highest employee salary and the other to determine the highest marks of students in each section.
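The exact statement of the first question is not given. Assuming it was the standard "longest substring without repeating characters" problem, a sliding-window approach might look like the sketch below. The function name is illustrative.

```python
def longest_unique_substring(s: str) -> int:
    """Return the length of the longest substring of s with no repeated characters."""
    last_seen = {}  # character -> index of its most recent occurrence
    start = 0       # left edge of the current window
    best = 0
    for i, ch in enumerate(s):
        # If ch already occurs inside the window, move the left edge past it.
        if ch in last_seen and last_seen[ch] >= start:
            start = last_seen[ch] + 1
        last_seen[ch] = i
        best = max(best, i - start + 1)
    return best
```

Each character is visited once, so the running time is linear in the length of the string.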
Here are all the questions:
You have a dataset with employees, salaries, bonuses, and departments. How
would you calculate the total compensation for each department while
considering that bonuses are applied only to salaries above $50,000?
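One way to read this question is that an employee's bonus counts toward total compensation only when that employee's salary is above $50,000. Under that assumption, a minimal plain-Python sketch follows; the field names and sample rows are hypothetical.

```python
from collections import defaultdict

BONUS_SALARY_THRESHOLD = 50_000  # threshold taken from the question

def total_compensation_by_department(rows):
    """Sum salary (plus bonus when salary is above the threshold) per department."""
    totals = defaultdict(float)
    for row in rows:
        # Assumption: the bonus is ignored unless salary is strictly above the threshold.
        bonus = row["bonus"] if row["salary"] > BONUS_SALARY_THRESHOLD else 0
        totals[row["department"]] += row["salary"] + bonus
    return dict(totals)

# Hypothetical sample data, for illustration only.
employees = [
    {"emp_id": 1, "department": "Eng", "salary": 60_000, "bonus": 5_000},
    {"emp_id": 2, "department": "Eng", "salary": 40_000, "bonus": 3_000},
    {"emp_id": 3, "department": "Ops", "salary": 50_000, "bonus": 2_000},
]
result = total_compensation_by_department(employees)
```

The same conditional sum can be written in SQL or PySpark with a case/when expression inside the aggregate.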
Second-highest salary in each department:
select empId, department, salary
from (
  select empId, department, salary,
         dense_rank() over (partition by department order by salary desc) as rnk
  from employees_table
) ranked
where rnk = 2
In a streaming job processing event data, how would you handle late-arriving
data to ensure it's included in calculations but doesn't delay real-time insights?
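A common answer is event-time windowing with a watermark: results are emitted immediately and updated when late events arrive, but events older than an allowed lateness are diverted rather than held up. The sketch below simulates that idea in plain Python. The class name, the window size, and the lateness bound are all assumptions chosen for illustration; it is not any particular engine's API.

```python
from collections import defaultdict

WINDOW_SECONDS = 60     # tumbling window size (assumed)
ALLOWED_LATENESS = 120  # how far behind the newest event time a window stays open (assumed)

class WindowedSum:
    """Running sum per event-time window, with a watermark for late data."""

    def __init__(self):
        self.sums = defaultdict(float)  # window start -> running sum
        self.max_event_time = 0
        self.too_late = []              # events routed to a side output

    def process(self, event_time, value):
        # The watermark trails the newest event time by the allowed lateness.
        self.max_event_time = max(self.max_event_time, event_time)
        watermark = self.max_event_time - ALLOWED_LATENESS
        window_start = event_time - event_time % WINDOW_SECONDS
        if window_start + WINDOW_SECONDS <= watermark:
            # The window is already closed: keep the event aside, do not block.
            self.too_late.append((event_time, value))
            return None
        # On-time or tolerably late: update and emit the refreshed window result.
        self.sums[window_start] += value
        return window_start, self.sums[window_start]
```

Every accepted event returns an updated result right away, so late data refines earlier output instead of delaying it. Events in `too_late` can be reprocessed in a separate batch job.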
Write PySpark logic to calculate the daily average sales amount for each product
category from a sales dataset spanning years. Consider that the df contains
'category', 'sales_amount', 'sale_date'
sales_df = sales_df.groupBy("category", "sale_date").agg({"sales_amount": "avg"}).withColumnRenamed("avg(sales_amount)", "Avg_sales_amount")
It was a basic Python question on string manipulation.
PySpark is the Python API for a distributed computing framework that processes large datasets across a cluster.
PySpark is built on top of Apache Spark, a fast and general-purpose cluster computing system.
It uses a master-worker architecture: a driver program communicates with a cluster manager to obtain resources and distributes tasks to executors running on worker nodes.
PySpark supports various data sources such as HDFS, Apache Hive, and Apache H...
Low-level design of a streaming data processing system
Use a distributed system like Apache Kafka for real-time data streaming
Implement data processing pipelines using technologies like Apache Spark or Flink
Ensure fault tolerance and scalability by using a microservices architecture
Leverage cloud services like AWS Kinesis or Google Cloud Dataflow for managed streaming solutions
A happy number is a number that eventually reaches 1 when it is repeatedly replaced by the sum of the squares of its digits.
Create a function to determine if a number is happy by repeatedly squaring its digits and summing them until the result is 1 or a cycle is detected.
Use a set to keep track of seen numbers to detect cycles.
Example: For number 19, the process would be 1^2 + 9^2 = 82, 8^2 + 2^2 = 68, 6^2 + 8^2 = 100, 1^2 + 0^2 + 0^2 = 1
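The steps above can be sketched in a few lines of Python. The function name is illustrative, and the input is assumed to be a positive integer.

```python
def is_happy(n: int) -> bool:
    """Return True if n reaches 1 under repeated sum-of-squared-digits."""
    seen = set()  # values already visited, used to detect a cycle
    while n != 1 and n not in seen:
        seen.add(n)
        # Replace n with the sum of the squares of its decimal digits.
        n = sum(int(digit) ** 2 for digit in str(n))
    return n == 1
```

The loop ends either at 1 or at the first repeated value, which means the sequence has entered a cycle that never reaches 1.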
The Transformer is a neural network architecture commonly used in natural language processing tasks.
Utilizes a self-attention mechanism to weigh the importance of different words in a sentence
Consists of encoder and decoder layers for tasks like machine translation
Introduced in the paper 'Attention Is All You Need' by Vaswani et al.
Popular models built on it include BERT, GPT, and Transformer-XL
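To make the self-attention point concrete, here is a minimal plain-Python sketch of scaled dot-product attention. It is a simplification: the learned query, key, and value projections and the multiple heads of a real Transformer are omitted, and the token vectors are made-up toy values.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(queries, keys, values):
    """Scaled dot-product attention; returns (outputs, attention weights)."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Toy token vectors (hypothetical), used as queries, keys, and values alike.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens, tokens, tokens)
```

Each row of `weights` sums to 1 and shows how strongly one token attends to every token in the sequence, including itself.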
I applied via LinkedIn and was interviewed in Feb 2024. There were 2 interview rounds.
Aidetic Software interview questions for popular designations
I applied via Naukri.com and was interviewed in Feb 2024. There were 2 interview rounds.
I was asked some list coding questions.
I applied via Job Portal and was interviewed in Mar 2022. There were 3 interview rounds.
DSA
I applied via Referral and was interviewed before Sep 2022. There were 3 interview rounds.
Code for parsing a triangle
Use a loop to iterate through each line of the triangle
Split each line into an array of numbers
Store the parsed numbers in a 2D array or a list of lists
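The input format is not specified in the question. Assuming the triangle arrives as text with one row per line and whitespace-separated integers, the steps above might look like this; the function name is illustrative.

```python
def parse_triangle(text: str) -> list:
    """Parse a text triangle into a list of rows, each a list of ints."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split the row on whitespace and convert each token to an integer.
        rows.append([int(token) for token in line.split()])
    return rows

triangle = parse_triangle("3\n7 4\n2 4 6\n")
```

A list of lists keeps each row at its natural length, which suits follow-up problems such as finding a maximum path sum.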
An ASCII value is the numeric code assigned to a character. Uppercase and lowercase letters have separate codes.
The uppercase letters A to Z have ASCII values 65 to 90.
The lowercase letters a to z have ASCII values 97 to 122.
For example, the ASCII value of 'A' is 65 and the ASCII value of 'a' is 97.
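In Python, the built-in `ord` and `chr` functions convert between characters and these codes. The helper below is an illustrative sketch that uses the fixed gap of 32 between the uppercase and lowercase ranges.

```python
CASE_OFFSET = ord("a") - ord("A")  # 97 - 65 = 32

def swap_case_ascii(text: str) -> str:
    """Swap the case of ASCII letters using their numeric codes."""
    result = []
    for ch in text:
        code = ord(ch)
        if 65 <= code <= 90:       # uppercase A-Z
            result.append(chr(code + CASE_OFFSET))
        elif 97 <= code <= 122:    # lowercase a-z
            result.append(chr(code - CASE_OFFSET))
        else:
            result.append(ch)      # leave other characters untouched
    return "".join(result)
```

For example, `swap_case_ascii("Hello")` yields `"hELLO"`.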
I applied via Campus Placement and was interviewed before Sep 2020. There were 3 interview rounds.
I applied via Job Portal and was interviewed before May 2023. There were 2 interview rounds.
Coding and AI related Questions
| Designation | Salaries reported | Salary range |
| Data Scientist | 6 | ₹6 L/yr - ₹14.1 L/yr |
| Senior Data Scientist | 5 | ₹18 L/yr - ₹20 L/yr |
| Software Developer | 4 | ₹5.5 L/yr - ₹13 L/yr |
| Backend Developer | 4 | ₹5.5 L/yr - ₹10.8 L/yr |
| QA Engineer | 4 | ₹8 L/yr - ₹10.5 L/yr |