i
Concentrix Catalyst
Filter interviews by
Spark performance tuning methods involve optimizing resource allocation, data partitioning, and caching.
Optimize resource allocation by adjusting memory and CPU settings in Spark configurations.
Partition data effectively to distribute work evenly across nodes.
Utilize caching to store intermediate results in memory for faster access.
Use broadcast variables for small lookup tables to reduce shuffle operations.
Monitor and...
Use Pyspark to remove regex characters from column values
Use the regexp_replace function in Pyspark to remove regex characters from column values
Specify the regex pattern to match and the replacement string
Apply the regexp_replace function to the desired column in the DataFrame
I have experience working as a Data Engineer at XYZ Company for 2 years.
Developed ETL pipelines to extract, transform, and load data from various sources
Optimized database performance and implemented data quality checks
Collaborated with cross-functional teams to design and implement data solutions
Top trending discussions
I applied via Naukri.com and was interviewed in Jul 2024. There were 2 interview rounds.
There are three types of integration runtime: Self-hosted, Azure, and SSIS
Self-hosted integration runtime is installed on a local machine or a virtual machine within an on-premises network
Azure integration runtime is managed by Azure Data Factory and runs in the Azure cloud
SSIS integration runtime is used to run SQL Server Integration Services packages in Azure Data Factory
There are two types of triggers in Azure Data Factory: Schedule-based triggers and Event-based triggers.
Schedule-based triggers are based on a time schedule and can be set to run at specific intervals.
Event-based triggers are triggered by events such as the completion of a pipeline run or the arrival of new data.
Triggers can be used to automate the execution of pipelines in Azure Data Factory.
I chose Accenture for its reputation, global presence, and opportunities for growth.
Accenture is a renowned company known for its innovative solutions and cutting-edge technology.
The global presence of Accenture provides opportunities to work on diverse projects and collaborate with experts from around the world.
Accenture offers ample opportunities for career growth and development through training programs and mentors...
I applied via LinkedIn and was interviewed before Apr 2023. There were 2 interview rounds.
MCQ based technical questions they have asked. Covering most of the basic abinitio components
I applied via Campus Placement and was interviewed before Aug 2022. There were 5 interview rounds.
Aptitude Questions related to Basic Quanitative aptitue, psuedo code snippets,Computer Fundamental Questions, Related to Operating System
There are 3 coding question
1.Easy (Related to Arrays)
2.Medium(String related questions)
3.Medium(Stack related questions)
I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.
I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.
Different types of joins available in Databricks include inner join, outer join, left join, right join, and cross join.
Inner join: Returns only the rows that have matching values in both tables.
Outer join: Returns all rows when there is a match in either table.
Left join: Returns all rows from the left table and the matched rows from the right table.
Right join: Returns all rows from the right table and the matched rows ...
Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.
Use redundant components to ensure continuous data flow
Implement monitoring tools to detect failures and bottlenecks
Set up automated alerts for immediate response to issues
Design error handling mechanisms to gracefully handle failures
Use checkpoints and retries to ensure data integrity
AutoLoader is a feature in data engineering that automatically loads data from various sources into a data warehouse or database.
Automates the process of loading data from different sources
Reduces manual effort and human error
Can be scheduled to run at specific intervals
Examples: Apache Nifi, AWS Glue
To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.
Use Azure SDKs for programming languages like Python, Java, C#, etc.
Utilize REST APIs to interact with Azure services programmatically.
Access and manage services through the Azure Portal.
Leverage Azure CLI for command-line interface interactions.
Automate tasks using Azure PowerShell scripts.
Linked Services are connections to external data sources or destinations in Azure Data Factory.
Linked Services define the connection information needed to connect to external data sources or destinations.
They can be used in Data Factory pipelines to read from or write to external systems.
Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.
I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.
I applied via Campus Placement and was interviewed in Dec 2024. There were 4 interview rounds.
Two coding questions related to matrices and heaps.
I applied via Naukri.com and was interviewed in Dec 2024. There were 4 interview rounds.
NA kjwnoi wniowe nfiow flmi
NA fklwmoiwef,m ionfwno njnwfeio onfwp
based on 1 interview
Interview experience
Senior Software Engineer
155
salaries
| ₹9.8 L/yr - ₹31 L/yr |
Software Engineer
129
salaries
| ₹6.9 L/yr - ₹25 L/yr |
Software Engineer Level 1
41
salaries
| ₹7 L/yr - ₹24.5 L/yr |
Software Engineer2
29
salaries
| ₹16 L/yr - ₹32 L/yr |
Senior Associate
24
salaries
| ₹6.5 L/yr - ₹21 L/yr |
TCS
Wipro
Infosys
HCLTech