Filter interviews by
I applied via LinkedIn and was interviewed in Aug 2023. There were 2 interview rounds.
Normalization in SQL is the process of organizing data in a database to reduce redundancy and improve data integrity.
1NF (First Normal Form) - Each column in a table must contain atomic values, and there should be no repeating groups.
2NF (Second Normal Form) - Table should be in 1NF and all non-key attributes are fully functional dependent on the primary key.
3NF (Third Normal Form) - Table should be in 2NF and there sh...
Alter is used to modify the structure of a table, while update is used to modify the data in a table.
Alter is used to add, remove, or modify columns in a table.
Update is used to change the values of existing records in a table.
Alter can change the structure of a table, such as adding a new column or changing the data type of a column.
Update is used to modify the data in a table, such as changing the value of a specific
Use left join for computationally efficient way to find customer names from customer profile and transaction tables.
Use left join to combine customer profile and transaction tables based on customer id
Left join will include all customers from profile table even if they don't have transactions
Subquery may be less efficient as it has to be executed for each row in the result set
Using self join to analyze customer behavior in an e-commerce platform.
Identifying patterns in customer purchase history
Analyzing customer preferences based on past purchases
Segmenting customers based on their buying behavior
Use SQL query with window function to rank members by transaction amount in each city.
Use SQL query with PARTITION BY clause to group members by city
Use ORDER BY clause to rank members by transaction amount
Select the second highest member for each city
CTE is a temporary result set that can be referenced within a SELECT, INSERT, UPDATE, or DELETE statement. It is different from a Stored Procedure as it is only available for the duration of the query.
CTE stands for Common Table Expression and is defined using the WITH keyword.
CTEs are mainly used for recursive queries, complex joins, and simplifying complex queries.
CTEs are not stored in the database like Stored Proce...
List comprehension is a concise way to create lists in Python by applying an expression to each item in an iterable.
Syntax: [expression for item in iterable]
Can include conditionals: [expression for item in iterable if condition]
Example: squares = [x**2 for x in range(10)]
Lambda function is a serverless computing service that runs code in response to events and automatically manages the computing resources required.
Lambda functions are event-driven and can be triggered by various AWS services such as S3, DynamoDB, API Gateway, etc.
They are written in languages like Python, Node.js, Java, etc.
Lambda functions are scalable and cost-effective as you only pay for the compute time you consum...
A generator function is a function that can pause and resume its execution, allowing it to yield multiple values over time.
Generator functions are defined using the 'function*' syntax in JavaScript.
They use the 'yield' keyword to return values one at a time.
Generators can be iterated over using a 'for...of' loop.
They are useful for generating sequences of values lazily, improving memory efficiency.
Transformation in pyspark is lazy evaluation while Actions trigger execution of transformations.
Transformations are operations that are not executed immediately but create a plan for execution.
Actions are operations that trigger the execution of transformations and return results.
Examples of transformations include map, filter, and reduceByKey.
Examples of actions include collect, count, and saveAsTextFile.
Map applies a function to each element in a collection and returns a new collection. Flatmap applies a function that returns a collection to each element and flattens the result.
Map transforms each element in a collection using a function and returns a new collection.
Flatmap applies a function that returns a collection to each element and flattens the result into a single collection.
Map does not flatten nested collecti...
Broadcast Variables are read-only shared variables that are cached on each machine in a cluster for efficient data distribution.
Broadcast Variables are used to efficiently distribute large read-only datasets to all nodes in a Spark cluster.
They are useful for tasks like joining a small lookup table with a large dataset.
Broadcast variables are cached in memory on each machine to avoid unnecessary data shuffling during c
Top trending discussions
I applied via Campus Placement
I applied via Walk-in and was interviewed before May 2021. There were 2 interview rounds.
I appeared for an interview in Oct 2016.
I cannot provide investment advice, but here are five companies that have shown strong financial performance in recent years.
Apple - consistently high revenue and profit margins
Amazon - dominant player in e-commerce and cloud computing
Microsoft - strong growth in cloud computing and enterprise software
Alphabet (Google) - diversified revenue streams and strong advertising business
Visa - dominant player in the payments i
The Brexit vote could have both positive and negative effects on the Indian economy.
Positive effects: Increased trade opportunities with the UK, potential for attracting foreign investments from companies relocating from the UK.
Negative effects: Uncertainty in global markets leading to volatility in exchange rates, potential decline in exports to the UK.
Example: Indian IT companies may face challenges due to stricter i...
I have worked on various databases including MySQL, Oracle, and MongoDB.
MySQL
Oracle
MongoDB
Different types of join are inner join, left join, right join, and full outer join.
Inner join returns only the matching rows from both tables.
Left join returns all the rows from the left table and matching rows from the right table.
Right join returns all the rows from the right table and matching rows from the left table.
Full outer join returns all the rows from both tables, with NULL values in the columns where there ...
I applied via Campus Placement
1hr test , basic apti questions .
American express discussion and economics
I applied via Campus Placement and was interviewed in Jun 2024. There was 1 interview round.
A p-value is a measure used in statistical hypothesis testing to determine the strength of evidence against the null hypothesis.
A p-value is the probability of obtaining results as extreme as the observed results, assuming the null hypothesis is true.
A p-value is compared to a significance level (usually 0.05) to determine if the null hypothesis should be rejected.
A p-value less than the significance level indicates st
The output of a**2 is the square of the value of a.
The output is the value of a multiplied by itself
For example, if a = 3, then the output would be 9 (3*3)
append() adds elements to a single DataFrame, while concat() combines multiple DataFrames.
append() is a method used to add rows to a DataFrame.
concat() is a function used to combine multiple DataFrames along a particular axis.
append() modifies the original DataFrame, while concat() returns a new DataFrame.
Example: df1.append(df2) vs pd.concat([df1, df2])
I appeared for an interview before Feb 2024.
I applied via Company Website and was interviewed before Feb 2023. There were 3 interview rounds.
Case statements in SQL allow for conditional logic and can be used to perform different actions based on specified conditions.
Case statements are used to evaluate a set of conditions and return a result based on the first condition that is met.
They can be used in SELECT, WHERE, and ORDER BY clauses.
Case statements can include multiple conditions and can also have an ELSE clause to handle cases where none of the conditi...
Some of the top questions asked at the TransOrg Analytics Data Engineer interview for experienced candidates -
based on 1 interview
Interview experience
based on 1 review
Rating in categories
3-5 Yrs
₹ 13-12 LPA
3-5 Yrs
₹ 13-12 LPA
Data Analyst
47
salaries
| ₹6 L/yr - ₹20 L/yr |
Analyst
41
salaries
| ₹7 L/yr - ₹13.2 L/yr |
Analytics Specialist
34
salaries
| ₹7.5 L/yr - ₹21 L/yr |
Data Scientist
22
salaries
| ₹8 L/yr - ₹20 L/yr |
Data Science Analyst
12
salaries
| ₹8 L/yr - ₹12 L/yr |
JPMorgan Chase & Co.
Wells Fargo
Citicorp
HSBC Group