Helm360
I applied via Company Website and was interviewed in Jul 2021. There were 4 interview rounds.
SQL is a domain-specific language used to define, manage, and query data in relational databases.
SQL stands for Structured Query Language
It is used to create, modify, and query databases
Common SQL commands include SELECT, INSERT, UPDATE, and DELETE
SQL is used in various industries including finance, healthcare, and e-commerce
Joins are used in SQL to combine data from two or more tables based on a related column.
INNER JOIN returns only the matching rows from both tables
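LEFT JOIN returns all rows from the left table and the matching rows from the right table, with NULLs where there is no match
RIGHT JOIN returns all rows from the right table and the matching rows from the left table, with NULLs where there is no match
FULL OUTER JOIN returns all rows from both tables, matched where possible and filled with NULLs otherwise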
CROSS JOIN returns the Cartesian product of both tables
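The join types above can be tried out with Python's built-in sqlite3 module. This is a minimal sketch: the employees and departments tables and their rows are invented for illustration, and only INNER, LEFT, and CROSS JOIN are shown because older SQLite builds do not support RIGHT or FULL OUTER JOIN.

```python
import sqlite3

# Hypothetical sample tables, invented for this illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (name TEXT, dept_id INTEGER);
    CREATE TABLE departments (dept_id INTEGER, dept_name TEXT);
    INSERT INTO employees VALUES ('Asha', 1), ('Ben', 2), ('Chen', NULL);
    INSERT INTO departments VALUES (1, 'Finance'), (2, 'IT'), (3, 'Legal');
""")

def run(sql):
    """Execute a query and return all rows as a list of tuples."""
    return conn.execute(sql).fetchall()

# INNER JOIN: only employees whose dept_id matches a department.
inner_rows = run("""
    SELECT e.name, d.dept_name
    FROM employees e INNER JOIN departments d ON e.dept_id = d.dept_id
    ORDER BY e.name
""")

# LEFT JOIN: every employee, with NULL (None) where no department matches.
left_rows = run("""
    SELECT e.name, d.dept_name
    FROM employees e LEFT JOIN departments d ON e.dept_id = d.dept_id
    ORDER BY e.name
""")

# CROSS JOIN: every employee paired with every department (3 x 3 rows).
cross_rows = run(
    "SELECT e.name, d.dept_name FROM employees e CROSS JOIN departments d"
)

print(inner_rows)
print(left_rows)
print(len(cross_rows))
```

Chen has no department, so the row appears only in the LEFT JOIN result, and Legal has no employees, so it appears only in the CROSS JOIN pairs.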
Developed a predictive model to forecast sales for a retail company
Used historical sales data to train the model
Incorporated external factors such as weather and holidays
Evaluated model performance using metrics like RMSE and MAE
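For reference, the two metrics named above can be computed as follows. This is a small sketch with made-up actual and predicted sales figures, not data from the project described.

```python
import math

def mae(actual, predicted):
    """Mean Absolute Error: average size of the errors."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root Mean Squared Error: penalises large errors more than MAE does."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))

# Hypothetical values, invented for illustration.
actual = [100.0, 150.0, 200.0, 250.0]
predicted = [110.0, 140.0, 220.0, 250.0]

mae_value = mae(actual, predicted)
rmse_value = rmse(actual, predicted)
print(mae_value, rmse_value)
```

Here the single 20-unit miss pulls RMSE above MAE, which is why the two are usually reported together.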
I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.
The aptitude test session assesses mathematical and logical reasoning abilities.
Vlookup is a function in Excel used to search for a value in a table and return a corresponding value from another column.
Vlookup stands for 'Vertical Lookup'
It is commonly used in Excel to search for a value in the leftmost column of a table and return a value in the same row from a specified column
Syntax: =VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])
Example: =VLOOKUP(A2, B2:D10, 3, FALSE) - searc...
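The exact-match behaviour (range_lookup set to FALSE) can be sketched in Python as a scan of the leftmost column. The function name, the product table, and its values are hypothetical; the sketch also compares text case-sensitively, whereas Excel's match is case-insensitive.

```python
def vlookup(lookup_value, table, col_index_num):
    """Exact-match lookup, like VLOOKUP(..., FALSE).

    Scans the first column of `table` (a list of rows) and returns the
    value at the 1-based column `col_index_num` of the first matching row.
    """
    for row in table:
        if row[0] == lookup_value:
            return row[col_index_num - 1]
    return None  # Excel would display #N/A here

# Hypothetical table: product id, name, price.
products = [
    ("P1", "Pen", 10),
    ("P2", "Book", 50),
    ("P3", "Bag", 300),
]

price = vlookup("P2", products, 3)
missing = vlookup("P9", products, 2)
print(price, missing)
```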
My day in my previous organization involved analyzing large datasets, creating reports, and presenting findings to stakeholders.
Reviewing and cleaning large datasets to ensure accuracy
Creating visualizations and reports to communicate insights
Collaborating with team members to identify trends and patterns
Presenting findings to stakeholders in meetings or presentations
I possess strong technical skills in data analysis, including proficiency in programming languages, statistical analysis, and data visualization tools.
Proficient in programming languages such as Python, R, SQL
Skilled in statistical analysis and data modeling techniques
Experience with data visualization tools like Tableau, Power BI
Knowledge of machine learning algorithms and techniques
A Pivot Table is a data summarization tool used in spreadsheet programs to analyze, summarize, and present data in a tabular format.
Pivot tables allow users to reorganize and summarize selected columns and rows of data to obtain desired insights.
Users can easily group and filter data, perform calculations, and create visualizations using pivot tables.
Pivot tables are commonly used in Excel and other spreadsheet program...
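What a pivot table computes can be sketched in plain Python: pick one field for the rows, one for the columns, and aggregate a value field in each cell. The pivot_sum helper and the sales records below are invented for illustration, and only a SUM aggregation is shown.

```python
from collections import defaultdict

def pivot_sum(rows, row_key, col_key, value_key):
    """Sum `value_key` for every (row_key, col_key) combination."""
    table = defaultdict(lambda: defaultdict(float))
    for record in rows:
        table[record[row_key]][record[col_key]] += record[value_key]
    # Convert nested defaultdicts to plain dicts for easy printing.
    return {rk: dict(cols) for rk, cols in table.items()}

# Hypothetical source data.
sales = [
    {"region": "North", "quarter": "Q1", "amount": 100},
    {"region": "North", "quarter": "Q2", "amount": 150},
    {"region": "South", "quarter": "Q1", "amount": 80},
    {"region": "North", "quarter": "Q1", "amount": 20},
]

summary = pivot_sum(sales, "region", "quarter", "amount")
print(summary)
```

Regions become the row labels, quarters the column labels, and the two North/Q1 records are summed into one cell.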
To find the highest-paid employee in each department, we need to group employees by department and then select the employee with the highest salary in each group.
Group employees by department
Find the employee with the highest salary in each group
Retrieve the employee's name, salary, and department name
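One way to express these steps is a join against a grouped subquery, shown here through Python's sqlite3 module. The table names, columns, and rows are assumptions made for the example. A window function such as RANK() would work as well, and if two employees tie for the top salary this query returns both.

```python
import sqlite3

# Hypothetical schema and data, invented for this illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE departments (dept_id INTEGER PRIMARY KEY, dept_name TEXT);
    CREATE TABLE employees (name TEXT, salary INTEGER, dept_id INTEGER);
    INSERT INTO departments VALUES (1, 'Finance'), (2, 'IT');
    INSERT INTO employees VALUES
        ('Asha', 900, 1), ('Ben', 700, 1), ('Chen', 800, 2), ('Dev', 950, 2);
""")

query = """
    SELECT e.name, e.salary, d.dept_name
    FROM employees e
    JOIN departments d ON d.dept_id = e.dept_id
    JOIN (
        -- Step 1 and 2: group by department and find the top salary.
        SELECT dept_id, MAX(salary) AS max_salary
        FROM employees
        GROUP BY dept_id
    ) m ON m.dept_id = e.dept_id AND m.max_salary = e.salary
    ORDER BY d.dept_name
"""

# Step 3: retrieve name, salary, and department name.
top_paid = conn.execute(query).fetchall()
print(top_paid)
```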
The aptitude test lasts 30 minutes and focuses on topics relevant to data engineering, including Spark, SQL, Azure, and PySpark.
The coding test is a one-hour examination on PySpark.
I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.
posted on 31 Dec 2024
Apache Spark architecture includes a cluster manager, worker nodes, and driver program.
Apache Spark architecture consists of a cluster manager, which allocates resources across applications.
Worker nodes host executors, which run tasks and keep data in memory or on disk.
The driver program schedules tasks on the executors and communicates with the cluster manager.
Spark applications run as independent sets of processes on a cluster, coordinated by the SparkCon...
reduceByKey is used to aggregate values by key, while groupByKey is used to collect all values for each key.
reduceByKey is a transformation that merges the values of each key using an associative and commutative function. It takes no 'zero value'; that parameter belongs to foldByKey and aggregateByKey.
groupByKey is a transformation that groups the values of each key and returns a dataset of (key, values) pairs.
reduceByKey is more efficient for aggregating data as it reduces the data before shuffling, while groupByKey shu...
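The effect of reducing before the shuffle can be illustrated without a cluster. The following is a plain-Python simulation, not Spark code: the two functions and the partition data are hypothetical stand-ins that only count how many records would cross the shuffle in each style.

```python
from collections import defaultdict

def shuffle_group_by_key(partitions):
    """groupByKey style: every (key, value) record crosses the shuffle."""
    shuffled = [pair for part in partitions for pair in part]
    grouped = defaultdict(list)
    for key, value in shuffled:
        grouped[key].append(value)
    return dict(grouped), len(shuffled)

def shuffle_reduce_by_key(partitions, func):
    """reduceByKey style: combine inside each partition, then shuffle."""
    shuffled = []
    for part in partitions:
        local = {}
        for key, value in part:
            local[key] = func(local[key], value) if key in local else value
        shuffled.extend(local.items())  # one record per key per partition
    result = {}
    for key, value in shuffled:
        result[key] = func(result[key], value) if key in result else value
    return result, len(shuffled)

# Two simulated partitions of (word, 1) pairs.
partitions = [
    [("a", 1), ("b", 1), ("a", 1)],
    [("a", 1), ("b", 1), ("b", 1)],
]

grouped, grouped_shuffled = shuffle_group_by_key(partitions)
sums_from_groups = {key: sum(values) for key, values in grouped.items()}
reduced, reduced_shuffled = shuffle_reduce_by_key(partitions, lambda x, y: x + y)

print(sums_from_groups, grouped_shuffled)
print(reduced, reduced_shuffled)
```

Both approaches give the same totals, but the grouped version moves all six records while the reduced version moves four, and the gap widens as keys repeat more often within a partition.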
RDD is a low-level abstraction representing a distributed collection of objects, while DataFrame is a higher-level abstraction representing a distributed collection of data organized into named columns.
RDD is more suitable for unstructured data and low-level transformations, while DataFrame is more suitable for structured data and high-level abstractions.
DataFrames provide optimizations like query optimization and code...
The different modes of execution in Apache Spark include local mode, standalone mode, YARN mode, and Mesos mode.
Local mode: Spark runs on a single machine, with the driver and executor sharing one JVM.
Standalone mode: Spark runs on a cluster managed by a standalone cluster manager.
YARN mode: Spark runs on a Hadoop cluster using YARN as the resource manager.
Mesos mode: Spark runs on a Mesos cluster with Mesos as the resource manager.
I applied via Recruitment Consultant and was interviewed in Nov 2024. There were 2 interview rounds.
Different types of joins available in Databricks include inner join, outer join, left join, right join, and cross join.
Inner join: Returns only the rows that have matching values in both tables.
Outer join (full outer): Returns all rows from both tables, with NULLs where one side has no match.
Left join: Returns all rows from the left table and the matched rows from the right table.
Right join: Returns all rows from the right table and the matched rows ...
Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.
Use redundant components to ensure continuous data flow
Implement monitoring tools to detect failures and bottlenecks
Set up automated alerts for immediate response to issues
Design error handling mechanisms to gracefully handle failures
Use checkpoints and retries to ensure data integrity
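The last point, checkpoints and retries, can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the checkpoint is an in-memory dict (a real pipeline would persist it to durable storage), and the handler simply fails once to simulate a transient error.

```python
import time

def run_with_retries(step, max_attempts=3, base_delay=0.0):
    """Call `step`, retrying with exponential backoff on any exception."""
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except Exception:
            if attempt == max_attempts:
                raise  # give up and surface the failure
            time.sleep(base_delay * 2 ** (attempt - 1))

def process_batches(batches, handler, checkpoint):
    """Process batches in order, skipping those the checkpoint marks done."""
    for index, batch in enumerate(batches):
        if index <= checkpoint["last_done"]:
            continue  # already processed in an earlier run
        run_with_retries(lambda: handler(batch))
        checkpoint["last_done"] = index  # record progress after success

# Simulated handler that fails on its second call only.
calls = {"count": 0}
processed = []

def flaky_handler(batch):
    calls["count"] += 1
    if calls["count"] == 2:
        raise RuntimeError("transient failure")
    processed.append(batch)

checkpoint = {"last_done": -1}
process_batches(["b0", "b1", "b2"], flaky_handler, checkpoint)
print(processed, checkpoint, calls)
```

Because progress is recorded only after a batch succeeds, rerunning process_batches with the same checkpoint skips finished batches instead of reprocessing them.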
Auto Loader is a Databricks feature that incrementally ingests new data files from cloud storage as they arrive.
Automates the detection and loading of new files from cloud object storage
Reduces manual effort and human error
Can run continuously as a stream or be triggered at scheduled intervals
Exposed through the Structured Streaming source named cloudFiles; tools such as Apache NiFi and AWS Glue serve a comparable ingestion role outside Databricks
To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.
Use Azure SDKs for programming languages like Python, Java, C#, etc.
Utilize REST APIs to interact with Azure services programmatically.
Access and manage services through the Azure Portal.
Leverage Azure CLI for command-line interface interactions.
Automate tasks using Azure PowerShell scripts.
Linked Services are connections to external data sources or destinations in Azure Data Factory.
Linked Services define the connection information needed to connect to external data sources or destinations.
They can be used in Data Factory pipelines to read from or write to external systems.
Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.
I applied via Recruitment Consultant and was interviewed in Nov 2024. There were 2 interview rounds.
I have a background in data analysis with experience in using tools like Python, SQL, and Tableau.
I have a degree in Statistics and have worked as a Data Analyst for 3 years.
My daily activities include cleaning and analyzing data, creating visualizations, and presenting insights to stakeholders.
I use Python for data manipulation and analysis, SQL for querying databases, and Tableau for creating interactive dashboards.
I...
Advanced Excel and Power BI are tools used for data analysis and visualization, both for internal company reporting and for client work.
Advanced Excel allows for complex data manipulation, analysis, and visualization using features like pivot tables, macros, and VBA programming.
Power BI is a business analytics tool that provides interactive visualizations and business intelligence capabilities, connecting to various data sources.
These tools are u...
I have extensive experience in using Advanced Excel and Power BI for data analysis projects.
Created complex formulas and macros in Excel to automate data processing tasks
Designed interactive dashboards in Power BI to visualize and analyze data trends
Integrated data from multiple sources into Power BI for comprehensive analysis
Used Power Query and Power Pivot in Excel to manipulate and analyze large datasets
Provided dat...
Credit and operations concepts are closely tied to KYC procedures and client data privacy.
Credit refers to the extension of money or resources to a client based on their financial history and ability to repay.
Operations involve the day-to-day processes and procedures within a financial institution to ensure smooth functioning.
KYC procedures are used to verify the identity of clients to prevent fraud and money laundering.
Pri...
I applied via Job Fair and was interviewed in Nov 2024. There were 2 interview rounds.
I was interviewed in Dec 2024.
Software Engineer | 41 salaries | ₹3.2 L/yr - ₹8.8 L/yr
Senior Software Engineer | 27 salaries | ₹7 L/yr - ₹16 L/yr
QA Engineer | 26 salaries | ₹5.5 L/yr - ₹8.5 L/yr
Associate Software Engineer | 17 salaries | ₹3.5 L/yr - ₹6.6 L/yr
Msbi Consultant | 9 salaries | ₹3.5 L/yr - ₹5 L/yr