i
TCS
Filter interviews by
I have utilized optimization techniques such as indexing, caching, and parallel processing in my projects.
Implemented indexing on large datasets to improve query performance
Utilized caching to store frequently accessed data and reduce load times
Implemented parallel processing to speed up data processing tasks
Lineage tracks the history of data transformations, while DAG is a graph structure with nodes representing tasks and edges representing dependencies.
Lineage focuses on the history of data transformations, showing how data has been derived or modified.
DAG is a graph structure where nodes represent tasks and edges represent dependencies between tasks.
Lineage helps in understanding the data flow and ensuring data qua...
Cache is temporary storage used to store frequently accessed data for quick retrieval, while persistence refers to storing data permanently.
Cache is temporary and volatile, while persistence is permanent and non-volatile
Cache is typically faster to access than persistence
Examples of cache include browser cache, CPU cache, and in-memory cache systems like Redis
Examples of persistence include databases like MySQL, P...
Tuples are immutable and fixed in size, while lists are mutable and can change in size.
Tuples are created using parentheses, while lists are created using square brackets.
Tuples are faster than lists for iteration and accessing elements.
Tuples are used for heterogeneous data types, while lists are used for homogeneous data types.
What people are saying about TCS
External tables are stored outside the database while internal tables are stored within the database.
External tables are created using the LOCATION clause to specify the data location.
Internal tables are created using the CREATE TABLE statement.
External tables can be accessed by multiple databases while internal tables are specific to a single database.
External tables are not managed by the database and can be del...
I use a combination of programming languages, tools, and frameworks to analyze and process large datasets.
Utilize programming languages like Python, Java, or Scala for data processing
Leverage tools like Hadoop, Spark, or Kafka for distributed computing
Implement frameworks like MapReduce or Apache Flink for data analysis
Use SQL or NoSQL databases for data storage and retrieval
Implemented a real-time data processing system using Apache Kafka and Spark for analyzing customer behavior.
Developed data pipelines to ingest, process, and analyze large volumes of data
Utilized Apache Kafka for real-time data streaming
Implemented machine learning algorithms for predictive analytics
Optimized data storage and retrieval for faster query performance
Hive metastore is a central repository that stores metadata for Hive tables, including schema and location.
Hive metastore is used to manage metadata for Hive tables.
It stores information about the schema, location, and other attributes of tables.
The metastore can be configured to use different databases, such as MySQL or PostgreSQL.
It allows for sharing metadata across multiple Hive instances.
The metastore can be ...
Spark architecture is a distributed computing framework that consists of a cluster manager, a distributed storage system, and a processing engine.
Spark architecture is based on a master-slave architecture.
The cluster manager is responsible for managing the resources of the cluster.
The distributed storage system is used to store data across the cluster.
The processing engine is responsible for executing the tasks on...
I applied via Job Portal and was interviewed in Sep 2023. There were 3 interview rounds.
Easy only so prepare well that's it
I use a combination of programming languages, tools, and frameworks to analyze and process large datasets.
Utilize programming languages like Python, Java, or Scala for data processing
Leverage tools like Hadoop, Spark, or Kafka for distributed computing
Implement frameworks like MapReduce or Apache Flink for data analysis
Use SQL or NoSQL databases for data storage and retrieval
Implemented a real-time data processing system using Apache Kafka and Spark for analyzing customer behavior.
Developed data pipelines to ingest, process, and analyze large volumes of data
Utilized Apache Kafka for real-time data streaming
Implemented machine learning algorithms for predictive analytics
Optimized data storage and retrieval for faster query performance
External tables are stored outside the database while internal tables are stored within the database.
External tables are created using the LOCATION clause to specify the data location.
Internal tables are created using the CREATE TABLE statement.
External tables can be accessed by multiple databases while internal tables are specific to a single database.
External tables are not managed by the database and can be deleted ...
I expect a competitive salary based on my skills, experience, and industry standards for a Big Data Engineer.
Based on my research, the average salary for a Big Data Engineer in this region is between $100,000 and $130,000.
I have over 5 years of experience in data engineering, which positions me for a salary on the higher end of that range.
I am also open to discussing additional benefits such as bonuses, stock options, ...
I applied via Referral and was interviewed before Jan 2024. There was 1 interview round.
Lineage tracks the history of data transformations, while DAG is a graph structure with nodes representing tasks and edges representing dependencies.
Lineage focuses on the history of data transformations, showing how data has been derived or modified.
DAG is a graph structure where nodes represent tasks and edges represent dependencies between tasks.
Lineage helps in understanding the data flow and ensuring data quality ...
I have utilized optimization techniques such as indexing, caching, and parallel processing in my projects.
Implemented indexing on large datasets to improve query performance
Utilized caching to store frequently accessed data and reduce load times
Implemented parallel processing to speed up data processing tasks
Cache is temporary storage used to store frequently accessed data for quick retrieval, while persistence refers to storing data permanently.
Cache is temporary and volatile, while persistence is permanent and non-volatile
Cache is typically faster to access than persistence
Examples of cache include browser cache, CPU cache, and in-memory cache systems like Redis
Examples of persistence include databases like MySQL, Postgr...
I applied via Naukri.com and was interviewed in Sep 2022. There were 2 interview rounds.
Hive metastore is a central repository that stores metadata for Hive tables, including schema and location.
Hive metastore is used to manage metadata for Hive tables.
It stores information about the schema, location, and other attributes of tables.
The metastore can be configured to use different databases, such as MySQL or PostgreSQL.
It allows for sharing metadata across multiple Hive instances.
The metastore can be acces...
Spark architecture is a distributed computing framework that consists of a cluster manager, a distributed storage system, and a processing engine.
Spark architecture is based on a master-slave architecture.
The cluster manager is responsible for managing the resources of the cluster.
The distributed storage system is used to store data across the cluster.
The processing engine is responsible for executing the tasks on the ...
I applied via Campus Placement and was interviewed before Aug 2023. There were 4 interview rounds.
General maths and English
Basic program coding
I applied via Walk-in and was interviewed before May 2023. There was 1 interview round.
Tuples are immutable and fixed in size, while lists are mutable and can change in size.
Tuples are created using parentheses, while lists are created using square brackets.
Tuples are faster than lists for iteration and accessing elements.
Tuples are used for heterogeneous data types, while lists are used for homogeneous data types.
I applied via Company Website and was interviewed before Feb 2020. There was 1 interview round.
I applied via Job Portal and was interviewed before Dec 2019. There was 1 interview round.
I applied via Naukri.com and was interviewed in Aug 2018. There was 0 interview round.
Some of the top questions asked at the TCS Big Data Engineer interview -
based on 6 interview experiences
Difficulty level
Duration
based on 56 reviews
Rating in categories
System Engineer
1.1L
salaries
| ₹3.9 L/yr - ₹8.3 L/yr |
IT Analyst
65.4k
salaries
| ₹7.8 L/yr - ₹14.5 L/yr |
AST Consultant
53.7k
salaries
| ₹12 L/yr - ₹20.6 L/yr |
Assistant System Engineer
33.2k
salaries
| ₹2.4 L/yr - ₹6.3 L/yr |
Associate Consultant
33.1k
salaries
| ₹16.2 L/yr - ₹28.1 L/yr |
Amazon
Wipro
Infosys
Accenture