Filter interviews by
Informatica is a data integration tool used for ETL (Extract, Transform, Load) processes in data engineering.
Informatica is used for extracting data from various sources like databases, flat files, etc.
It can transform the data according to business rules and load it into a target data warehouse or database.
Informatica provides a visual interface for designing ETL workflows and monitoring data integration processes.
It ...
Datastage is an ETL tool used for extracting, transforming, and loading data from various sources to a target destination.
Datastage is part of the IBM Information Server suite.
It provides a graphical interface to design and run data integration jobs.
Datastage supports parallel processing for high performance.
It can connect to a variety of data sources such as databases, flat files, and web services.
Datastage jobs can b...
I applied via Approached by Company and was interviewed in Sep 2024. There was 1 interview round.
I appeared for an interview before Jun 2024, where I was asked the following questions.
I applied via Naukri.com and was interviewed before Nov 2023. There was 1 interview round.
Bigquery architecture is a distributed, serverless, highly scalable, and cost-effective data warehouse designed for large-scale data analytics.
Bigquery uses a distributed architecture to store and query data across multiple servers for high performance.
It is serverless, meaning users do not need to manage any infrastructure and can focus on analyzing data.
Bigquery is highly scalable, allowing users to easily scale up o...
Data ingestion is the process of collecting, importing, and processing data from various sources into a storage system.
Data ingestion involves extracting data from different sources such as databases, APIs, files, and streaming platforms.
The extracted data is then transformed and loaded into a data warehouse, data lake, or other storage systems for analysis.
Common tools used for data ingestion include Apache Kafka, Apa...
I applied via Recruitment Consulltant and was interviewed before Nov 2023. There was 1 interview round.
SQL joins combine rows from two or more tables based on related columns, enabling complex queries and data analysis.
INNER JOIN: Returns records with matching values in both tables. Example: SELECT * FROM A INNER JOIN B ON A.id = B.id;
LEFT JOIN: Returns all records from the left table and matched records from the right table. Example: SELECT * FROM A LEFT JOIN B ON A.id = B.id;
RIGHT JOIN: Returns all records from the ri...
Top trending discussions
Databricks is a unified data analytics platform that includes components like Databricks Workspace, Databricks Runtime, and Databricks Delta.
Databricks Workspace: Collaborative environment for data science and engineering teams.
Databricks Runtime: Optimized Apache Spark cluster for data processing.
Databricks Delta: Unified data management system for data lakes.
To read a JSON file, use a programming language's built-in functions or libraries to parse the file and extract the data.
Use a programming language like Python, Java, or JavaScript to read the JSON file.
Import libraries like json in Python or json-simple in Java to parse the JSON data.
Use functions like json.load() in Python to load the JSON file and convert it into a dictionary or object.
Access the data in the JSON fi...
To find the second highest salary in SQL, use the MAX function with a subquery or the LIMIT clause.
Use the MAX function with a subquery to find the highest salary first, then use a WHERE clause to exclude it and find the second highest salary.
Alternatively, use the LIMIT clause to select the second highest salary directly.
Make sure to handle cases where there may be ties for the highest salary.
Spark cluster configuration involves setting up memory, cores, and other parameters for optimal performance.
Specify the number of executors and executor memory
Set the number of cores per executor
Adjust the driver memory based on the application requirements
Configure shuffle partitions for efficient data processing
Enable dynamic allocation for better resource utilization
I applied via Referral and was interviewed in Jul 2024. There were 2 interview rounds.
1hour,Time speed distance
1hour,sql,python,algebra,Average
based on 11 interview experiences
Difficulty level
Duration
based on 25 reviews
Rating in categories
Data Engineer
205
salaries
| ₹4.5 L/yr - ₹10.7 L/yr |
Engineer 1
172
salaries
| ₹4 L/yr - ₹9.1 L/yr |
L2 Engineer
142
salaries
| ₹6.2 L/yr - ₹14 L/yr |
Technical Lead
101
salaries
| ₹20.3 L/yr - ₹35 L/yr |
Associate Engineer
98
salaries
| ₹2.7 L/yr - ₹6 L/yr |
Tekwissen
Softenger
XcelServ Solutions
Capital Numbers Infotech