Infovision
I was interviewed in Jan 2024.
Window functions in SQL are used to perform calculations across a set of table rows related to the current row.
Window functions are used to calculate values based on a set of rows related to the current row.
They allow for ranking, aggregation, and other calculations within a specific window of rows.
Common window functions include ROW_NUMBER(), RANK(), DENSE_RANK(), and SUM() OVER().
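The functions listed above can be compared side by side in a small sketch. It uses Python's built-in sqlite3 module and assumes the bundled SQLite is version 3.25 or newer, the first release with window functions. The `sales` table, its columns, and the sample rows are hypothetical.

```python
import sqlite3

# Hypothetical sales data; table and column names are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, rep TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("East", "Asha", 300),
        ("East", "Ben", 300),
        ("East", "Chen", 100),
        ("West", "Dev", 200),
    ],
)

# Each window is partitioned by region, so every calculation restarts per region.
query = """
SELECT region, rep, amount,
       ROW_NUMBER() OVER (PARTITION BY region ORDER BY amount DESC, rep) AS row_num,
       RANK()       OVER (PARTITION BY region ORDER BY amount DESC)      AS rnk,
       DENSE_RANK() OVER (PARTITION BY region ORDER BY amount DESC)      AS dense_rnk,
       SUM(amount)  OVER (PARTITION BY region)                           AS region_total
FROM sales
ORDER BY region, row_num
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

The two tied East rows share rank 1. RANK() then skips to 3 for the next row, DENSE_RANK() continues with 2, and ROW_NUMBER() stays unique throughout.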
To perform a full load in SSIS, you can use the Data Flow Task with a source and destination component.
Create a Data Flow Task in the Control Flow tab of the SSIS package.
Add a source component to extract data from the source system.
Add a destination component to load data into the destination system.
Map the columns from the source to the destination.
Run the package to execute the full load.
Incremental load in ADF and SSIS involves identifying new or updated data and loading only those changes.
In ADF, use watermark columns to track the last loaded value and filter data based on this value
In SSIS, use CDC (Change Data Capture) components or custom scripts to identify new or updated data
Both ADF and SSIS support incremental loading by comparing source and target data to determine changes
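The watermark idea can be sketched outside ADF or SSIS as well. The example below is a minimal illustration using Python's sqlite3 module, not an ADF pipeline. The table names (`source_orders`, `target_orders`, `watermark`), the `modified_at` column, and the `incremental_load` helper are all assumptions made for the demo.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE source_orders (id INTEGER PRIMARY KEY, amount INTEGER, modified_at TEXT);
CREATE TABLE target_orders (id INTEGER PRIMARY KEY, amount INTEGER, modified_at TEXT);
CREATE TABLE watermark (table_name TEXT PRIMARY KEY, last_value TEXT);
INSERT INTO watermark VALUES ('orders', '1900-01-01T00:00:00');
""")

def incremental_load(conn):
    """Copy rows changed since the stored watermark, then advance the watermark."""
    (last_value,) = conn.execute(
        "SELECT last_value FROM watermark WHERE table_name = 'orders'"
    ).fetchone()
    changed = conn.execute(
        "SELECT id, amount, modified_at FROM source_orders WHERE modified_at > ?",
        (last_value,),
    ).fetchall()
    # INSERT OR REPLACE acts as an upsert on the primary key.
    conn.executemany("INSERT OR REPLACE INTO target_orders VALUES (?, ?, ?)", changed)
    if changed:
        new_mark = max(row[2] for row in changed)  # ISO timestamps sort as text
        conn.execute(
            "UPDATE watermark SET last_value = ? WHERE table_name = 'orders'",
            (new_mark,),
        )
    conn.commit()
    return len(changed)

conn.execute("INSERT INTO source_orders VALUES (1, 10, '2024-01-01T00:00:00')")
conn.execute("INSERT INTO source_orders VALUES (2, 20, '2024-01-02T00:00:00')")
first_run = incremental_load(conn)   # both rows are new

conn.execute(
    "UPDATE source_orders SET amount = 25, modified_at = '2024-01-03T00:00:00' WHERE id = 2"
)
second_run = incremental_load(conn)  # only the updated row is picked up
```

The second run touches a single row because the filter compares `modified_at` against the value saved at the end of the first run.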
Linked Service connects to external data sources, while Dataset represents the data within the data store.
Linked Service is used to connect to external data sources like databases, APIs, and file systems.
Dataset represents the data within the data store and can be used for data processing and analysis.
Linked Service defines the connection information and credentials needed to access external data sources.
Dataset define...
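The relationship between the two can be seen in their JSON definitions. The sketch below writes simplified versions as Python dictionaries. The names, the container, the file name, and the connection string placeholder are hypothetical, and real definitions usually carry more properties.

```python
import json

# Simplified sketch of ADF JSON definitions; all names and values are placeholders.
linked_service = {
    "name": "ExampleBlobLinkedService",
    "properties": {
        "type": "AzureBlobStorage",
        # Connection details live on the linked service, not on the dataset.
        "typeProperties": {"connectionString": "<connection string placeholder>"},
    },
}

dataset = {
    "name": "ExampleCsvDataset",
    "properties": {
        "type": "DelimitedText",
        # The dataset points at a linked service by name.
        "linkedServiceName": {
            "referenceName": linked_service["name"],
            "type": "LinkedServiceReference",
        },
        # The dataset describes where the data sits and what shape it has.
        "typeProperties": {
            "location": {
                "type": "AzureBlobStorageLocation",
                "container": "example-container",
                "fileName": "orders.csv",
            },
            "firstRowAsHeader": True,
        },
    },
}

print(json.dumps(dataset, indent=2))
```

Several datasets can reference the same linked service, which keeps credentials in one place.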
Integration Runtime (IR) is a compute infrastructure that provides data integration capabilities across different network environments.
IR is used in Azure Data Factory to provide data integration capabilities
There are three types of IR: Azure, Self-hosted, and Azure-SSIS
Azure IR is fully managed by Microsoft and is used for data movement in the cloud
Self-hosted IR allows data movement between on-premises and cloud data...
Use a single activity with dynamic parameterization and loops to copy data.
Use a loop to iterate through the data source and destination locations.
Parameterize the source and destination locations to dynamically copy data.
Utilize a scripting language like Python or PowerShell to implement the logic.
Example: Use a Python script with a loop to copy files from one folder to another.
Example: Use PowerShell script with dyna...
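The Python example mentioned above could look like the following minimal sketch. The `copy_files` helper, the folder names, and the `*.csv` pattern are assumptions for illustration. The source and destination are parameters, so the same function serves any pair of folders.

```python
import shutil
import tempfile
from pathlib import Path

def copy_files(source_dir: Path, dest_dir: Path, pattern: str = "*") -> list[str]:
    """Copy every file matching `pattern` from source_dir into dest_dir."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    for path in sorted(source_dir.glob(pattern)):
        if path.is_file():
            shutil.copy2(path, dest_dir / path.name)  # copy2 also keeps timestamps
            copied.append(path.name)
    return copied

# Demo with throwaway folders so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "in"
    dst = Path(tmp) / "out"
    src.mkdir()
    for name in ("a.csv", "b.csv", "notes.txt"):
        (src / name).write_text("demo")
    copied = copy_files(src, dst, pattern="*.csv")
    dest_names = sorted(p.name for p in dst.iterdir())
    print(copied)
```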
Snowflake is a cloud-based data warehousing platform that allows for easy and scalable data storage and analysis.
Snowflake is a fully managed service that works on a pay-as-you-go model.
It separates storage and compute resources, allowing for better scalability and cost-effectiveness.
Snowflake supports SQL queries and has built-in support for semi-structured data like JSON and XML.
It provides features like automatic sc...
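Delete removes rows one by one and accepts a WHERE clause, while Truncate removes all rows at once; whether Truncate can be rolled back depends on the database.
Delete is a DML command, while Truncate is a DDL command.
Delete can be rolled back inside a transaction. Truncate can also be rolled back inside an explicit transaction in SQL Server and PostgreSQL, but it commits implicitly in Oracle and MySQL.
Delete fires DELETE triggers (once per statement in SQL Server, per row where row-level triggers are defined in other databases), while Truncate does not fire DELETE triggers.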
Delete is slower as it removes rows one by one, while Truncate is faster as it...
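The Delete half of the comparison can be shown with a short sketch using Python's sqlite3 module. SQLite has no TRUNCATE statement, so only the DELETE behavior is demonstrated. The `staff` table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staff (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO staff (name) VALUES (?)", [("Asha",), ("Ben",), ("Chen",)])
conn.commit()

# DELETE is DML: it accepts a WHERE clause and runs inside a transaction.
conn.execute("DELETE FROM staff WHERE name = 'Ben'")
during = conn.execute("SELECT COUNT(*) FROM staff").fetchone()[0]

conn.rollback()  # undo the uncommitted delete
after = conn.execute("SELECT COUNT(*) FROM staff").fetchone()[0]

print(during, after)
```

The row count drops while the transaction is open and returns to its original value after the rollback.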
To insert non-duplicate data into a target table, you can use methods like using a unique constraint, using a merge statement, or using a temporary table.
Use a unique constraint on the target table to prevent duplicate entries.
Use a merge statement to insert data into the target table only if it does not already exist.
Use a temporary table to store the new data, then insert only the non-duplicate records into the target table.
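The temporary-table approach, backed by a unique constraint, can be sketched with Python's sqlite3 module. SQLite has no MERGE statement, so this sketch uses INSERT ... SELECT with NOT EXISTS instead. The `target` and `staging` tables and the sample emails are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE target (email TEXT PRIMARY KEY, name TEXT);  -- primary key enforces uniqueness
CREATE TEMP TABLE staging (email TEXT, name TEXT);
INSERT INTO target VALUES ('a@example.com', 'Asha');
INSERT INTO staging VALUES
    ('a@example.com', 'Asha'),
    ('b@example.com', 'Ben'),
    ('b@example.com', 'Ben');
""")

# Insert only rows that are missing from target.
# DISTINCT removes duplicates that exist within the staging data itself.
conn.execute("""
INSERT INTO target (email, name)
SELECT DISTINCT s.email, s.name
FROM staging AS s
WHERE NOT EXISTS (SELECT 1 FROM target AS t WHERE t.email = s.email)
""")
conn.commit()

emails = [row[0] for row in conn.execute("SELECT email FROM target ORDER BY email")]
print(emails)
```

The primary key is a safety net: if the filter were ever wrong, the insert would fail instead of silently storing a duplicate.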
Identifying long running queries in SQL
Monitor query execution times using tools like SQL Server Profiler or Performance Monitor
Check system views like sys.dm_exec_requests or sys.dm_exec_query_stats for query durations
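Once a slow query has been identified, a hint like OPTION (RECOMPILE) can force recompilation, which may help when a cached plan suits the current parameters poorly; the hint is a tuning aid, not a way to find long-running queries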
I was interviewed in Oct 2024.
Designing an ADF pipeline for data processing
Identify data sources and destinations
Define data transformations and processing steps
Consider scheduling and monitoring requirements
Utilize ADF activities like Copy Data, Data Flow, and Databricks
Implement error handling and logging mechanisms
Discussing expected and current salary for negotiation purposes.
Be honest about your current salary and provide a realistic expectation for your desired salary.
Highlight your skills and experience that justify your desired salary.
Be open to negotiation and willing to discuss other benefits besides salary.
Research industry standards and salary ranges for similar positions to support your negotiation.
Focus on the value y...
I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.
posted on 16 Dec 2024
I applied via Recruitment Consultant and was interviewed in Nov 2024. There were 2 interview rounds.
Mostly verbal and maths questions.
DDL stands for Data Definition Language and is used to define the structure of database objects. DML stands for Data Manipulation Language and is used to manipulate data within the database.
DDL is used to create, modify, and delete database objects such as tables, indexes, and views
DML is used to insert, update, delete, and retrieve data from the database
Examples of DDL statements include CREATE TABLE, ALTER TABLE, DRO...
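The split between the two command families can be seen in a short sketch using Python's sqlite3 module. The `employees` table and its values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL: define and change the structure of database objects.
conn.execute("CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("ALTER TABLE employees ADD COLUMN salary INTEGER")

# DML: insert, update, read, and delete the rows themselves.
conn.execute("INSERT INTO employees (name, salary) VALUES ('Asha', 50000)")
conn.execute("UPDATE employees SET salary = 55000 WHERE name = 'Asha'")
rows = conn.execute("SELECT name, salary FROM employees").fetchall()
conn.execute("DELETE FROM employees WHERE name = 'Asha'")
remaining = conn.execute("SELECT COUNT(*) FROM employees").fetchone()[0]
conn.commit()

# DDL again: remove the object entirely.
conn.execute("DROP TABLE employees")
tables = conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'").fetchall()

print(rows, remaining, tables)
```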
1- SQL and 10 Spark questions.
posted on 8 Nov 2024
I applied via Campus Placement
The aptitude test had a few quant and verbal questions, then SQL MCQs and 3 coding questions.
There is no one 'better' coding language, as it depends on the specific use case and requirements.
The best coding language depends on the project requirements, team expertise, and ecosystem support.
For data engineering, languages like Python, Scala, and SQL are commonly used for their data processing capabilities.
Python is popular for its simplicity and extensive libraries like Pandas and NumPy, while Scala is known fo...
I applied via Company Website and was interviewed in Jul 2024. There was 1 interview round.
Pods are the smallest deployable units in Kubernetes, consisting of one or more containers.
Pods are used to run and manage containers in Kubernetes
Each pod has its own unique IP address within the Kubernetes cluster
Pods can contain multiple containers that share resources and are scheduled together
Pods are ephemeral and can be easily created, destroyed, or replicated
Pods can be managed and scaled using Kubernetes contr
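A Pod with more than one container can be sketched as a manifest. The example below builds one as a Python dictionary and prints it as JSON, a format kubectl accepts alongside YAML. The pod name, container names, images, and paths are hypothetical.

```python
import json

# Hypothetical two-container Pod; names, images, and paths are illustrative.
pod_manifest = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "web-with-sidecar", "labels": {"app": "web"}},
    "spec": {
        # An emptyDir volume lives as long as the Pod and is shared by its containers.
        "volumes": [{"name": "shared-logs", "emptyDir": {}}],
        "containers": [
            {
                "name": "web",
                "image": "nginx:1.25",
                "volumeMounts": [{"name": "shared-logs", "mountPath": "/var/log/nginx"}],
            },
            {
                "name": "log-reader",
                "image": "busybox:1.36",
                "command": ["sh", "-c", "tail -F /logs/access.log"],
                "volumeMounts": [{"name": "shared-logs", "mountPath": "/logs"}],
            },
        ],
    },
}

manifest_json = json.dumps(pod_manifest, indent=2)
print(manifest_json)
```

Both containers are scheduled together onto the same node, share the Pod's IP address, and see the same `shared-logs` volume.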
| Role | Salaries reported | Salary range |
| --- | --- | --- |
| Softwaretest Engineer | 309 | ₹2.5 L/yr - ₹6.2 L/yr |
| Senior Software Engineer | 291 | ₹6 L/yr - ₹21.7 L/yr |
| Software Engineer | 245 | ₹2.8 L/yr - ₹10 L/yr |
| Technical Lead | 176 | ₹10.3 L/yr - ₹35 L/yr |
| Software Developer | 143 | ₹3 L/yr - ₹10.2 L/yr |