Filter interviews by
Currently working on developing a real-time data processing pipeline for a financial services company.
Designing and implementing data ingestion processes using Apache Kafka
Building data processing workflows with Apache Spark
Optimizing data storage and retrieval with Apache Hadoop
Collaborating with data scientists to integrate machine learning models into the pipeline
Top trending discussions
I applied via Approached by Company and was interviewed in Nov 2024. There was 1 interview round.
I applied via Job Portal and was interviewed in May 2024. There was 1 interview round.
List is an ordered collection of elements with duplicates allowed, while set is an unordered collection of unique elements.
List maintains the order of elements, while set does not guarantee any specific order.
List allows duplicate elements, while set does not allow duplicates.
Example: List - [1, 2, 3, 1], Set - {1, 2, 3}
Optimizing PySpark involves tuning configurations, using efficient transformations/actions, and leveraging caching.
Tune PySpark configurations for optimal performance (e.g. adjusting memory settings, parallelism)
Use efficient transformations/actions to minimize unnecessary data shuffling (e.g. using narrow transformations like map instead of wide transformations like groupByKey)
Leverage caching to persist intermediate
SOLID principles are a set of five design principles in object-oriented programming to make software designs more understandable, flexible, and maintainable.
S - Single Responsibility Principle: A class should have only one reason to change.
O - Open/Closed Principle: Software entities should be open for extension but closed for modification.
L - Liskov Substitution Principle: Objects of a superclass should be replaceable...
I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.
Various data warehousing techniques include dimensional modeling, star schema, snowflake schema, and data vault.
Dimensional modeling involves organizing data into facts and dimensions to facilitate easy querying and analysis.
Star schema is a type of dimensional modeling where a central fact table is connected to multiple dimension tables.
Snowflake schema is an extension of star schema where dimension tables are normali...
My analytics work has helped the organization make data-driven decisions, improve operational efficiency, and identify new opportunities for growth.
Developed data models and algorithms to optimize business processes
Generated insights from large datasets to drive strategic decision-making
Identified trends and patterns to improve customer experience and retention
Implemented data governance policies to ensure data quality
I would respond in various situations by remaining calm, assessing the situation, and providing a thoughtful and strategic solution.
Remain calm and composed
Assess the situation thoroughly
Provide a thoughtful and strategic solution
Communicate effectively with all parties involved
Both career and team are important, but ultimately career growth should be prioritized.
Career growth is essential for personal development and achieving professional goals.
A strong team can support career growth by providing mentorship, collaboration, and opportunities for learning.
Balancing career and team dynamics is key to long-term success in any role.
posted on 28 Oct 2024
DBA stands for Database Administrator. The architecture of DBA involves managing and maintaining databases to ensure data integrity and security.
DBA is responsible for installing, configuring, and upgrading database software.
They monitor database performance and troubleshoot issues.
DBA designs and implements backup and recovery strategies to prevent data loss.
They also manage user access and security permissions within...
Maintaining the database involves regular monitoring, performance tuning, applying patches, and ensuring backups are taken regularly.
Regularly monitor database performance and usage
Perform routine maintenance tasks such as applying patches and updates
Take regular backups to ensure data integrity and disaster recovery
Implement security measures to protect the database from unauthorized access
Optimize database performanc
posted on 18 Jun 2024
I applied via Naukri.com and was interviewed in May 2024. There was 1 interview round.
posted on 4 Jun 2024
posted on 16 Oct 2023
I applied via campus placement at St Francis Institute of Technology, Mumbai and was interviewed in Apr 2023. There were 5 interview rounds.
Basic Apptitude. SQL & logic in
Very easy topic. You just need to have fluent English.
They took two GD as many people were selected
posted on 17 Sep 2024
I applied via Referral and was interviewed before Sep 2023. There was 1 interview round.
A database is a structured collection of data that is organized in a way that allows for easy access, management, and retrieval.
A database is used to store and organize data in a structured format.
It allows for efficient retrieval and manipulation of data.
Examples of databases include MySQL, Oracle, SQL Server, and MongoDB.
MySQL is an open-source relational database management system that is popular for its ease of use, flexibility, and scalability.
MySQL is an open-source RDBMS, allowing users to freely use, modify, and distribute the software.
It is known for its ease of use, making it accessible for beginners and experts alike.
MySQL is highly flexible, supporting various data types and storage engines.
It is popular for its scalability, ...
To retrieve a set of data in a database, you can use SQL queries to specify the data you want to retrieve.
Use SELECT statement to specify the columns you want to retrieve
Use FROM clause to specify the table where the data is stored
Use WHERE clause to specify any conditions for filtering the data
Use ORDER BY clause to specify the order in which the data should be returned
Use LIMIT clause to specify the maximum number of
To know disk utilization in Linux, use commands like df, du, and iostat.
Use 'df' command to display disk space usage on Linux filesystems.
Use 'du' command to estimate file space usage.
Use 'iostat' command to monitor system input/output device loading.
Check disk utilization with tools like 'iotop' and 'atop'.
posted on 2 Jun 2023
I applied via Referral and was interviewed before Jun 2022. There were 4 interview rounds.
Interview experience
based on 2 reviews
Rating in categories
GL Accountant
187
salaries
| ₹3.6 L/yr - ₹10.1 L/yr |
Financial Analyst
126
salaries
| ₹3.6 L/yr - ₹9.5 L/yr |
Financial Associate
90
salaries
| ₹3 L/yr - ₹6.5 L/yr |
Data Engineer
74
salaries
| ₹8.9 L/yr - ₹37.2 L/yr |
Software Engineer
55
salaries
| ₹6 L/yr - ₹22 L/yr |
Accenture
IBM
TCS
Wipro