i
Birlasoft
Filter interviews by
Driver node is the node in Spark that manages the execution of a Spark application, while Executors are the nodes that actually perform the computation.
Driver node coordinates tasks and schedules work across Executors
Executors are responsible for executing tasks assigned by the Driver node
Driver node maintains information about the Spark application and distributes tasks to Executors
Executors run computations and store
Top trending discussions
posted on 10 Sep 2024
I applied via Job Portal and was interviewed in Aug 2024. There were 2 interview rounds.
Sql queries and knowledge of different syntaxes
Filter students with marks greater than 80 in all subjects
Iterate through each student's marks in all subjects
Check if all marks are greater than 80 for a student
Return the student if all marks are greater than 80
Syntax to define schema of a file for loading
Use CREATE EXTERNAL TABLE statement in SQL
Specify column names and data types in the schema definition
Example: CREATE EXTERNAL TABLE MyTable (col1 INT, col2 STRING) USING CSV
posted on 8 Nov 2024
Terraform is an open-source infrastructure as code software tool created by HashiCorp.
Terraform allows users to define and provision infrastructure using a declarative configuration language.
It supports multiple cloud providers such as AWS, Azure, Google Cloud, and more.
Terraform uses 'terraform plan' to create an execution plan and 'terraform apply' to apply the changes.
It helps in automating the creation, modificatio...
Azure DevOps is a set of development tools provided by Microsoft to help teams collaborate and deliver high-quality software.
Azure DevOps includes services such as Azure Repos, Azure Pipelines, Azure Boards, Azure Artifacts, and Azure Test Plans.
It allows for version control, continuous integration/continuous deployment (CI/CD), project management, and testing.
Teams can plan, build, test, and deploy applications using ...
CI/CD pipelines automate the process of building, testing, and deploying code changes.
CI/CD stands for Continuous Integration/Continuous Deployment
Automates the process of integrating code changes into a shared repository and deploying them to production
Helps in detecting and fixing integration errors early in the development process
Enables faster delivery of software updates and improvements
Popular tools for CI/CD pip...
Docker is a platform for developing, shipping, and running applications in containers. Kubernetes is a container orchestration tool for managing containerized applications across a cluster of nodes.
Docker allows developers to package applications and their dependencies into containers for easy deployment.
Kubernetes automates the deployment, scaling, and management of containerized applications.
Docker containers are lig...
I applied via Approached by Company and was interviewed in Sep 2024. There was 1 interview round.
Microservices are a software development technique where applications are composed of small, independent services that communicate with each other.
Microservices are small, independent services that focus on specific business functions.
Each microservice runs its own process and communicates with other services through APIs.
Microservices can be developed, deployed, and scaled independently.
Examples of companies using mic
Containerisation is a method of packaging, distributing, and running applications in isolated environments called containers.
Containers are lightweight, portable, and efficient units that encapsulate an application and its dependencies.
They provide a consistent environment for applications to run across different platforms.
Popular containerization platforms include Docker and Kubernetes.
Containerisation helps in achiev...
posted on 4 Sep 2024
I applied via LinkedIn and was interviewed in Aug 2024. There were 2 interview rounds.
posted on 13 Aug 2024
I applied via Walk-in and was interviewed in Jul 2024. There was 1 interview round.
Performance tuning in Azure Data Factory involves optimizing data flows and activities to improve efficiency and reduce processing time.
Identify bottlenecks in data flows and activities
Optimize data partitioning and distribution
Use appropriate data integration patterns
Leverage caching and parallel processing
Monitor and analyze performance metrics
posted on 12 Sep 2024
I was interviewed in Mar 2024.
Types of clusters in Databricks include Standard, High Concurrency, and Single Node clusters.
Standard cluster: Suitable for running single jobs or workflows.
High Concurrency cluster: Designed for multiple users running concurrent jobs.
Single Node cluster: Used for development and testing purposes.
To find the highest 2nd salary in the employee table, you need to use SQL query with the ORDER BY and LIMIT clauses.
Use SQL query with ORDER BY clause to sort salaries in descending order
Use LIMIT 1,1 to get the second highest salary
Use SQL query with GROUP BY and HAVING clause to find duplicate rows.
Use GROUP BY to group rows with same values
Use HAVING COUNT(*) > 1 to filter out duplicate rows
Example: SELECT column1, column2, COUNT(*) FROM table_name GROUP BY column1, column2 HAVING COUNT(*) > 1
Azure Synapse is a cloud-based analytics service that brings together big data and data warehousing.
Azure Synapse integrates big data and data warehousing capabilities in a single service
It allows for data ingestion, preparation, management, and serving for BI and machine learning
Supports both serverless and provisioned resources for data processing
Offers integration with Azure Machine Learning, Power BI, and Azure Dat
Software Developer
1.5k
salaries
| ₹2.5 L/yr - ₹11 L/yr |
Senior Software Engineer
1.3k
salaries
| ₹5.5 L/yr - ₹20 L/yr |
Software Engineer
1.1k
salaries
| ₹2.5 L/yr - ₹10.5 L/yr |
Technical Lead
1.1k
salaries
| ₹10 L/yr - ₹33.8 L/yr |
Technical Specialist
1.1k
salaries
| ₹7.7 L/yr - ₹28.5 L/yr |
Accenture
Cognizant
Capgemini
Wipro