Cognizant
I applied via Naukri.com and was interviewed in Nov 2023. There was 1 interview round.
GCP BigQuery is a serverless, highly scalable, and cost-effective data warehouse for analyzing big data sets.
BigQuery is a fully managed, petabyte-scale data warehouse that enables super-fast SQL queries using the processing power of Google's infrastructure.
BigQuery's architecture separates storage (Colossus) from compute (the Dremel execution engine), with a SQL interface on top.
Cloud Composer is a managed workflow orchestration service that helps you create, s...
The GCP services used in our project include BigQuery, Dataflow, Pub/Sub, and Cloud Storage.
BigQuery for data warehousing and analytics
Dataflow for real-time data processing
Pub/Sub for messaging and event ingestion
Cloud Storage for storing data and files
Cloud Functions are event-driven functions that run in response to cloud events.
Serverless functions that automatically scale based on demand
Can be triggered by events from various cloud services
Supports multiple programming languages like Node.js, Python, etc.
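As a rough illustration of the event-driven model described above, the sketch below is shaped like a Pub/Sub-triggered background function using the `(event, context)` signature. The function name `handle_pubsub_event` and the greeting logic are made up for this example; the only assumption about the platform is that the message payload arrives base64-encoded under `event["data"]`.

```python
import base64


def handle_pubsub_event(event, context=None):
    """Decode a Pub/Sub-style event and return a greeting.

    `event` mimics the dict passed to a Pub/Sub-triggered background
    function: the message payload is base64-encoded under "data".
    The function name and the greeting are hypothetical.
    """
    raw = event.get("data")
    name = base64.b64decode(raw).decode("utf-8") if raw else "world"
    message = f"Hello, {name}!"
    print(message)  # printed output ends up in the function's logs
    return message


if __name__ == "__main__":
    # Local smoke test with a hand-built event; no cloud resources needed.
    fake_event = {"data": base64.b64encode(b"data engineer").decode("ascii")}
    handle_pubsub_event(fake_event)
```

Because the handler is a plain function, it can be exercised locally with a hand-built event before it is deployed.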
To run a job every hour in Airflow, give the DAG a cron schedule interval.
Define a DAG (Directed Acyclic Graph) in Airflow
Set the schedule_interval parameter to '0 * * * *' (minute 0 of every hour); the preset '@hourly' is equivalent
Example: schedule_interval='0 * * * *' (newer Airflow releases name this parameter schedule)
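The steps above can be sketched as follows, assuming Airflow 2.x is installed where the DAG is built. The DAG id `hourly_example`, the task, and the start date are placeholders, and `cron_fields` is a small hypothetical helper added only to show what each part of the cron expression means.

```python
from datetime import datetime

HOURLY_CRON = "0 * * * *"  # minute 0 of every hour, every day


def cron_fields(expression):
    """Split a five-field cron expression into named parts."""
    names = ("minute", "hour", "day_of_month", "month", "day_of_week")
    parts = expression.split()
    if len(parts) != len(names):
        raise ValueError(f"expected 5 cron fields, got {len(parts)}")
    return dict(zip(names, parts))


def build_hourly_dag():
    """Build a DAG that runs once per hour (assumes Airflow 2.x)."""
    # Imported lazily so cron_fields works without Airflow installed.
    from airflow import DAG
    from airflow.operators.bash import BashOperator

    with DAG(
        dag_id="hourly_example",          # hypothetical DAG name
        start_date=datetime(2024, 1, 1),  # placeholder start date
        schedule_interval=HOURLY_CRON,
        catchup=False,                    # do not backfill past hours
    ) as dag:
        BashOperator(task_id="say_hello", bash_command="echo hello")
    return dag


# In a real DAG file, assign the result at module level so the
# scheduler can discover it:  dag = build_hourly_dag()
```

Setting `catchup=False` is a design choice for this sketch: without it, Airflow would schedule a run for every hour between the start date and now.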
Use Python's slicing feature to display a string in reverse order.
Use string slicing with a step of -1 to reverse the string.
Example: 'hello'[::-1] evaluates to 'olleh'.
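A minimal sketch of the slicing approach, wrapped in a function so it can be reused; the name `reverse_string` is just an illustrative choice.

```python
def reverse_string(text):
    """Return `text` reversed, using a slice with a step of -1."""
    return text[::-1]


if __name__ == "__main__":
    print(reverse_string("hello"))  # olleh
```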
Pub/Sub is a messaging service that allows communication between independent applications.
Pub/Sub is used for real-time messaging and event-driven systems.
It is commonly used for data ingestion, streaming analytics, and event-driven architectures.
Examples of Pub/Sub services include Google Cloud Pub/Sub, Apache Kafka, and Amazon SNS/SQS.
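To make the decoupling idea concrete, here is a toy in-memory publish/subscribe broker. It is not the Google Cloud Pub/Sub client API, and unlike a real messaging service it delivers messages synchronously and keeps nothing durable; the class and method names are invented for this sketch.

```python
from collections import defaultdict


class InMemoryBroker:
    """Toy publish/subscribe broker; not the Cloud Pub/Sub client API."""

    def __init__(self):
        self._subscribers = defaultdict(list)  # topic name -> callbacks

    def subscribe(self, topic, callback):
        """Register `callback` to receive every message sent to `topic`."""
        self._subscribers[topic].append(callback)

    def publish(self, topic, message):
        """Deliver `message` to all subscribers and return how many got it."""
        # The publisher only knows the topic, never the subscribers.
        callbacks = self._subscribers[topic]
        for callback in callbacks:
            callback(message)
        return len(callbacks)


if __name__ == "__main__":
    broker = InMemoryBroker()
    received = []
    broker.subscribe("orders", received.append)
    broker.subscribe("orders", lambda m: print("audit:", m))
    broker.publish("orders", {"order_id": 1})
    print(received)
```

The point of the pattern is visible in `publish`: the sender addresses a topic, so subscribers can be added or removed without changing the publishing code.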
I applied via Naukri.com and was interviewed in Apr 2022. There were 3 interview rounds.
BigQuery is a cloud-based data warehousing tool used for analyzing large datasets quickly. Pub/Sub is a messaging service, Dataflow is a data processing tool, and Cloud Storage is a scalable object storage service.
BigQuery is used for analyzing large datasets quickly
Pub/Sub is a messaging service used for asynchronous communication between applications
Dataflow is a data processing tool used for batch and stream processin...
I applied via Naukri.com and was interviewed in Nov 2024. There were 2 interview rounds.
Developed a data pipeline to ingest, process, and analyze customer feedback data for a retail company.
Used Google Cloud Platform services like BigQuery, Dataflow, and Pub/Sub for data processing.
Implemented data cleansing and transformation techniques to ensure data quality.
Created visualizations and dashboards using tools like Data Studio for stakeholders to easily interpret the data.
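Cloud Storage offers several storage classes that trade access cost against storage cost.
Standard Storage: for frequently accessed data; no minimum storage duration
Nearline Storage: for data accessed less than about once a month; 30-day minimum storage duration
Coldline Storage: for data accessed about once a quarter or less; 90-day minimum storage duration
Archive Storage: for long-term retention of data accessed less than once a year; 365-day minimum storage duration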
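The classes above can be summarized in a small lookup. The minimum storage durations are the documented ones; the `suggest_storage_class` helper and its thresholds are only a rule of thumb invented for this sketch, not an official selection algorithm.

```python
# Minimum storage duration (days) per Cloud Storage class.
MIN_STORAGE_DAYS = {
    "STANDARD": 0,
    "NEARLINE": 30,
    "COLDLINE": 90,
    "ARCHIVE": 365,
}


def suggest_storage_class(days_between_accesses):
    """Suggest a class from the typical gap between reads (rule of thumb)."""
    if days_between_accesses >= 365:
        return "ARCHIVE"
    if days_between_accesses >= 90:
        return "COLDLINE"
    if days_between_accesses >= 30:
        return "NEARLINE"
    return "STANDARD"


if __name__ == "__main__":
    for gap in (1, 45, 120, 400):
        chosen = suggest_storage_class(gap)
        print(gap, chosen, MIN_STORAGE_DAYS[chosen])
```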
SQL optimization techniques focus on improving query performance by reducing execution time and resource usage.
Use indexes to speed up data retrieval
Avoid using SELECT * and instead specify only the columns needed
Optimize joins by using appropriate join types and conditions
Limit the use of subqueries and instead use JOINs where possible
Use EXPLAIN to analyze query execution plans and identify bottlenecks
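A few of these techniques can be tried locally. The sketch below uses SQLite from the Python standard library purely as a stand-in engine (other databases expose query plans differently), with a made-up `customers` table. It selects a single column instead of `SELECT *` and compares the query plan before and after adding an index.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return the plan detail strings SQLite reports for `sql`."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]  # the detail text is the last column


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, email TEXT)"
)
conn.executemany(
    "INSERT INTO customers (name, email) VALUES (?, ?)",
    [(f"user{i}", f"user{i}@example.com") for i in range(1000)],
)

# Select only the needed column instead of SELECT *.
sql = "SELECT name FROM customers WHERE email = ?"
params = ("user42@example.com",)

plan_before = query_plan(conn, sql, params)  # no index yet: table scan
conn.execute("CREATE INDEX idx_customers_email ON customers (email)")
plan_after = query_plan(conn, sql, params)   # the new index is used

print(plan_before)
print(plan_after)
```

The exact wording of the plan text varies between SQLite versions, so the useful signal is whether the plan mentions a scan or names the index.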
I applied via LinkedIn and was interviewed in Oct 2024. There were 2 interview rounds.
I have experience working on projects involving data processing, transformation, and analysis using GCP services like BigQuery, Dataflow, and Dataproc.
Utilized BigQuery for storing and querying large datasets
Implemented data pipelines using Dataflow for real-time data processing
Utilized Dataproc for running Apache Spark and Hadoop clusters for data processing
Worked on data ingestion and transformation using Cloud Stora
I applied via Naukri.com and was interviewed in Oct 2024. There was 1 interview round.
IAM is Identity and Access Management, SA is Service Account, BigQuery is a data warehouse, QlikSense is a data visualization tool, GitHub is a version control system, Spark is a distributed computing framework, Airflow is a workflow automation tool, Bigtable is a NoSQL database, Cloud Composer is a managed workflow orchestration service, Pub/Sub is a messaging service.
IAM is used to manage access to resources in Googl...
I applied via Naukri.com and was interviewed in Jun 2024. There was 1 interview round.
Check if a string is a palindrome or not
Compare the string with its reverse to check for palindrome
Ignore spaces and punctuation marks when comparing
Examples: 'racecar' is a palindrome, 'hello' is not
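A minimal sketch of the comparison described above. Dropping every non-alphanumeric character covers spaces and punctuation; lowercasing the text as well is an extra assumption of this example.

```python
def is_palindrome(text):
    """Check for a palindrome, ignoring case, spaces, and punctuation."""
    cleaned = [ch.lower() for ch in text if ch.isalnum()]
    return cleaned == cleaned[::-1]


if __name__ == "__main__":
    for sample in ("racecar", "hello", "A man, a plan, a canal: Panama"):
        print(sample, is_palindrome(sample))
```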
Use Python to create a GCS bucket
Import the necessary libraries like google.cloud.storage
Authenticate using service account credentials
Use the library functions to create a new bucket
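The steps above might look like the following, assuming the `google-cloud-storage` package is installed and credentials are available, for example through the `GOOGLE_APPLICATION_CREDENTIALS` environment variable pointing at a service account key file. The `create_bucket` wrapper and its optional `client` argument are choices made for this sketch, not part of the library.

```python
def create_bucket(bucket_name, location="US", client=None):
    """Create a Cloud Storage bucket and return it.

    Assumes google-cloud-storage is installed and credentials are
    configured. Pass `client` to reuse an existing storage client.
    """
    if client is None:
        # Imported lazily so the module loads without the dependency.
        from google.cloud import storage

        client = storage.Client()
    return client.create_bucket(bucket_name, location=location)
```

A call such as `create_bucket("my-example-bucket", location="US")` would create the bucket; the name here is a placeholder, and real bucket names must be globally unique. Accepting a client as a parameter also makes the function easy to test with a stub.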
Python code to trigger a Dataflow job from a Cloud Function
Use the googleapiclient library to interact with the Dataflow API
Authenticate using service account credentials
Submit a job to Dataflow using the projects.locations.templates.launch endpoint
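One way this could look, assuming `google-api-python-client` is installed and the function's service account is allowed to launch Dataflow jobs. Every project, bucket, template path, and parameter name below is a placeholder, and `trigger_dataflow` is a hypothetical entry point in the background-function style.

```python
def build_launch_body(job_name, parameters, temp_location):
    """Build the request body for templates.launch."""
    return {
        "jobName": job_name,
        "parameters": dict(parameters),
        "environment": {"tempLocation": temp_location},
    }


def launch_dataflow_template(project_id, region, template_gcs_path, body):
    """Call projects.locations.templates.launch and return the response."""
    # Imported lazily so build_launch_body works without the dependency.
    from googleapiclient.discovery import build

    service = build("dataflow", "v1b3", cache_discovery=False)
    request = service.projects().locations().templates().launch(
        projectId=project_id,
        location=region,
        gcsPath=template_gcs_path,
        body=body,
    )
    return request.execute()


def trigger_dataflow(event, context):
    """Hypothetical Cloud Function entry point (all values are placeholders)."""
    body = build_launch_body(
        job_name="example-job",
        parameters={"inputFile": "gs://example-bucket/input.txt"},
        temp_location="gs://example-bucket/tmp",
    )
    return launch_dataflow_template(
        "example-project",
        "us-central1",
        "gs://example-bucket/templates/example-template",
        body,
    )
```

Keeping the request body in its own function means the payload can be checked without calling the Dataflow API.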
I applied via Company Website and was interviewed in Sep 2023. There were 3 interview rounds.
BigQuery is a managed data warehouse service for analyzing large datasets and running complex queries, while SQL is the query language used to query databases, including BigQuery itself.
BigQuery is used for analyzing large datasets quickly and efficiently
SQL is a query language used to retrieve specific data from databases
BigQuery can handle petabytes of data, making it ideal for big data analysis
SQL can be used to perform operations like filtering, sorting, and aggregating data
I applied via Company Website and was interviewed before Mar 2023. There were 2 interview rounds.
SQL joins are used to combine rows from two or more tables based on a related column between them.
SQL joins are used to retrieve data from multiple tables based on a related column between them
Types of SQL joins include INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN
In BigQuery, joins can be performed using standard SQL syntax
Example: SELECT * FROM table1 INNER JOIN table2 ON table1.column = table2.column
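To show how the join type changes the result, this sketch runs an INNER JOIN and a LEFT JOIN with SQLite from the Python standard library. SQLite is only a stand-in here, and the `customers` and `orders` tables with their rows are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY, customer_id INTEGER, amount INTEGER
    );
    INSERT INTO customers VALUES (1, 'Asha'), (2, 'Ravi'), (3, 'Meena');
    INSERT INTO orders VALUES (10, 1, 250), (11, 1, 100), (12, 2, 75);
    """
)

# INNER JOIN keeps only customers that have at least one order.
inner_rows = conn.execute(
    """
    SELECT c.name, o.amount
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.id
    ORDER BY o.id
    """
).fetchall()

# LEFT JOIN keeps every customer; missing orders show up as NULL (None).
left_rows = conn.execute(
    """
    SELECT c.name, o.amount
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.id
    ORDER BY c.id, o.id
    """
).fetchall()

print(inner_rows)
print(left_rows)
```

The customer without orders appears only in the LEFT JOIN result, paired with `None` for the amount.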
I applied via LinkedIn and was interviewed before Nov 2021. There were 3 interview rounds.