Cognizant is rated 3.8, based on 47.9k reviews.

Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards


Cognizant Pyspark Developer Interview Questions, Process, and Tips

Updated 30 Dec 2024

Top Cognizant Pyspark Developer Interview Questions and Answers

  • Q1. What is the difference between coalesce and repartition, as well as between cache and persist?
  • Q2. What is the SQL query to find the second highest rank in a dataset?
  • Q3. What is the SQL code for calculating year-on-year growth percentage with year-wise grouping?

Cognizant Pyspark Developer Interview Experiences

2 interviews found

Interview experience: 4 (Good)
Difficulty level: Moderate
Process duration: Less than 2 weeks
Result: No response

I applied via Walk-in and was interviewed in Nov 2024. There were 3 interview rounds.

Round 1 - One-on-one 

(2 Questions)

  • Q1. What are the optimization techniques used in Apache Spark?
  • Ans. 

    Optimization techniques in Apache Spark improve performance and efficiency.

    • Partitioning data to distribute work evenly

    • Caching frequently accessed data in memory

    • Using broadcast variables for small lookup tables

    • Optimizing shuffle operations by reducing data movement

    • Applying predicate pushdown to filter data early

  • Answered by AI
  • Q2. What is the difference between coalesce and repartition, as well as between cache and persist?
  • Ans. 

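    Coalesce reduces the number of partitions without a full shuffle, while repartition can increase or decrease the number of partitions and always performs a full shuffle. cache() is persist() with the default storage level, while persist() also accepts an explicit storage level (for example MEMORY_ONLY or MEMORY_AND_DISK).

    • Coalesce merges existing partitions to reduce their number without a full shuffle, while repartition redistributes all rows across the requested number of partitions, which may be higher or lower than before.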

    • Coalesce is more efficient when reducing partitions as it avoids shuffling...

  • Answered by AI
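
The difference between merging partitions and redistributing rows can be illustrated with a toy model in plain Python. This is only a sketch under stated assumptions: partitions are modelled as lists of rows, and the functions `coalesce` and `repartition` are hypothetical stand-ins that borrow Spark's names; they are not the Spark API and do not reproduce Spark's actual placement logic.

```python
def coalesce(partitions, n):
    """Toy coalesce: merge whole neighbouring partitions, never split one.

    Like Spark's coalesce without a shuffle, it cannot increase the count.
    """
    if n >= len(partitions):
        return [list(p) for p in partitions]
    merged = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Each existing partition is appended, intact, to one target group.
        merged[i * n // len(partitions)].extend(part)
    return merged


def repartition(partitions, n):
    """Toy repartition: a full shuffle that spreads every row round-robin."""
    rows = [row for part in partitions for row in part]
    out = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        out[i % n].append(row)
    return out


# One large partition and three small ones (skewed input).
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9]]
print([len(p) for p in coalesce(parts, 2)])     # sizes stay skewed
print([len(p) for p in repartition(parts, 2)])  # sizes are balanced
```

In this model the merged result keeps the skew of the input because rows never leave the partition they started in, while the full shuffle balances partition sizes at the cost of moving every row.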
Round 2 - One-on-one 

(2 Questions)

  • Q1. What is the SQL query to find the second highest rank in a dataset?
  • Ans. 

    SQL query to find the second highest rank in a dataset

    • Use the ORDER BY clause to sort the ranks in descending order

    • Use the LIMIT and OFFSET clauses to skip the highest rank and retrieve the second highest rank

    • Use DISTINCT so that a tie for the highest rank is not returned as the second highest

    • Example: SELECT DISTINCT rank FROM dataset ORDER BY rank DESC LIMIT 1 OFFSET 1

  • Answered by AI
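
A minimal way to try the query is against an in-memory SQLite database from Python. The table name `dataset` and column `rank` come from the example in the answer; the helper `second_highest` and the sample values are assumptions made for illustration, and LIMIT/OFFSET syntax varies between SQL dialects.

```python
import sqlite3


def second_highest(values):
    """Return the second highest distinct rank, or None if there is none."""
    conn = sqlite3.connect(":memory:")
    conn.execute('CREATE TABLE dataset ("rank" INTEGER)')
    conn.executemany('INSERT INTO dataset ("rank") VALUES (?)',
                     [(v,) for v in values])
    # DISTINCT collapses ties, so OFFSET 1 skips the top value exactly once.
    row = conn.execute(
        'SELECT DISTINCT "rank" FROM dataset '
        'ORDER BY "rank" DESC LIMIT 1 OFFSET 1'
    ).fetchone()
    conn.close()
    return row[0] if row else None


print(second_highest([10, 30, 30, 20]))
```

Without DISTINCT, the sample input above would return 30 again, because the top value appears twice.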
  • Q2. What is the SQL code for calculating year-on-year growth percentage with year-wise grouping?
  • Ans. 

    SQL for calculating year-on-year growth percentage with year-wise grouping.

    • Group by year and aggregate to get one total per year

    • Use the LAG function, ordered by year, to get the previous year's total

    • Calculate the growth percentage using the formula: ((current year value - previous year value) / previous year value) * 100

  • Answered by AI
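
The steps above can be sketched with SQLite from Python. The table `sales`, its columns `year` and `amount`, and the helper `yoy_growth` are hypothetical names chosen for this example; window functions such as LAG require SQLite 3.25 or later, and other databases may need small syntax changes.

```python
import sqlite3


def yoy_growth(sales):
    """sales: iterable of (year, amount). Returns (year, total, growth_pct) rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (year INTEGER, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", sales)
    query = """
        WITH yearly AS (              -- one total per year
            SELECT year, SUM(amount) AS total
            FROM sales
            GROUP BY year
        )
        SELECT year,
               total,
               ROUND((total - LAG(total) OVER (ORDER BY year)) * 100.0
                     / LAG(total) OVER (ORDER BY year), 2) AS growth_pct
        FROM yearly
        ORDER BY year
    """
    rows = conn.execute(query).fetchall()
    conn.close()
    return rows


for row in yoy_growth([(2021, 100), (2021, 100), (2022, 250), (2023, 200)]):
    print(row)
```

The first year has no previous value, so LAG returns NULL there and the growth percentage comes back as None rather than a number.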
Round 3 - One-on-one 

(2 Questions)

  • Q1. What tools are used to connect Google Cloud Platform (GCP) with Apache Spark?
  • Ans. 

    To connect Google Cloud Platform with Apache Spark, tools like Dataproc, Cloud Storage, and BigQuery can be used.

    • Use Google Cloud Dataproc to create managed Spark and Hadoop clusters on GCP.

    • Store data in Google Cloud Storage and access it from Spark applications.

    • Utilize Google BigQuery for querying and analyzing large datasets directly from Spark.

  • Answered by AI
  • Q2. What is the process to orchestrate code in Google Cloud Platform (GCP)?
  • Ans. 

    Orchestrating code in GCP typically means scheduling and managing workflows with Cloud Composer; services such as Cloud Dataflow run the processing steps that a workflow triggers.

    • Use Cloud Composer to create, schedule, and monitor workflows using Apache Airflow

    • Utilize Cloud Dataflow for real-time data processing and batch processing tasks

    • Use Cloud Functions for event-driven serverless functions

    • Leverage Cloud Scheduler for job scheduling

    • Integrate with other GCP services like BigQ...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Cognizant Pyspark Developer interview:
  • sql
  • spark
  • python
  • Cloud
Interview preparation tips for other job seekers - It is essential to prepare thoroughly before the interview.

Interview experience: 5 (Excellent)
Difficulty level: not specified
Process duration: not specified
Result: not specified
Round 1 - Technical 

(2 Questions)

  • Q1. What is the difference between coalesce and repartition in data processing?
  • Ans. 

    Coalesce reduces the number of partitions without shuffling data, while repartition reshuffles data to create a specific number of partitions.

    • Coalesce is used to reduce the number of partitions without shuffling data

    • Repartition is used to increase or decrease the number of partitions by shuffling data

    • Coalesce is more efficient when reducing partitions as it avoids shuffling

    • Repartition is useful when you need to explici...

  • Answered by AI
  • Q2. What is the difference between a DataFrame and an RDD (Resilient Distributed Dataset)?
  • Ans. 

    DataFrame is a higher-level abstraction built on top of RDD, providing more structure and optimization capabilities.

    • DataFrames are distributed collections of data organized into named columns, similar to tables in a relational database.

    • RDDs are lower-level abstractions representing a collection of objects distributed across a cluster, with no inherent structure.

    • DataFrames provide optimizations like query optimization a...

  • Answered by AI


Interview questions from similar companies

Interview experience: 3 (Average)
Difficulty level: not specified
Process duration: not specified
Result: not specified
Round 1 - Technical 

(2 Questions)

  • Q1. Why is Spark used?
  • Ans. 

    Spark is used for big data processing due to its speed, scalability, and ease of use.

    • Spark is used for processing large volumes of data quickly and efficiently.

    • It offers in-memory processing which makes it faster than traditional MapReduce.

    • Spark provides a wide range of libraries for diverse tasks like SQL, streaming, machine learning, and graph processing.

    • It can run on various platforms like Hadoop, Kubernetes, and st...

  • Answered by AI
  • Q2. What are RDDs and DataFrames?
  • Ans. 

    RDDs and DataFrames are data structures in Apache Spark for processing and analyzing large datasets.

    • RDDs (Resilient Distributed Datasets) are the fundamental data structure of Spark, representing a collection of elements that can be operated on in parallel.

    • DataFrames are distributed collections of data organized into named columns, similar to a table in a relational database.

    • DataFrames are built on top of RDDs, providi...

  • Answered by AI


Interview experience: 4 (Good)
Difficulty level: Moderate
Process duration: Less than 2 weeks
Result: No response

I applied via Naukri.com and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - Coding Test 

1. Find duplicate records
2. Find the second and third highest salaries
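
The source does not say whether these tasks were set in SQL or PySpark. As one possible approach, both can be sketched in SQL run through SQLite from Python. The `employees` table, its `name` and `salary` columns, and the helper `run_checks` are assumptions made for this example; DENSE_RANK requires SQLite 3.25 or later.

```python
import sqlite3


def run_checks(rows):
    """rows: iterable of (name, salary). Returns (duplicates, second_and_third)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
    conn.executemany("INSERT INTO employees VALUES (?, ?)", rows)

    # Duplicates: identical (name, salary) rows that occur more than once.
    duplicates = conn.execute("""
        SELECT name, salary, COUNT(*) AS copies
        FROM employees
        GROUP BY name, salary
        HAVING COUNT(*) > 1
        ORDER BY name
    """).fetchall()

    # DENSE_RANK gives tied salaries the same rank and leaves no gaps.
    second_and_third = conn.execute("""
        SELECT DISTINCT salary, rnk
        FROM (
            SELECT salary,
                   DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
            FROM employees
        )
        WHERE rnk IN (2, 3)
        ORDER BY rnk
    """).fetchall()
    conn.close()
    return duplicates, second_and_third


sample = [("asha", 90), ("asha", 90), ("ben", 80),
          ("cara", 80), ("dev", 70), ("eli", 60)]
print(run_checks(sample))
```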

Interview experience: 4 (Good)
Difficulty level: Moderate
Process duration: 2-4 weeks
Result: No response

I applied via Naukri.com and was interviewed in Jan 2024. There were 2 interview rounds.

Round 1 - Coding Test 

Basic Python coding: lists, dicts, generators, etc.

Round 2 - HR 

(1 Question)

  • Q1. Salary negotiation

Interview Preparation Tips

Topics to prepare for DXC Technology Pyspark Developer interview:
  • Python
  • Spark
  • RDD
  • SQL
Interview preparation tips for other job seekers - Code well

Interview experience: 3 (Average)
Difficulty level: not specified
Process duration: not specified
Result: not specified
Round 1 - Technical 

(1 Question)

  • Q1. Conceptual questions

Interview experience: 3 (Average)
Difficulty level: not specified
Process duration: not specified
Result: not specified
Round 1 - Technical 

(1 Question)

  • Q1. Basic SQL and Python Questions

Interview experience: 5 (Excellent)
Difficulty level: Easy
Process duration: Less than 2 weeks
Result: Selected

I applied via Company Website and was interviewed in Dec 2024. There were 5 interview rounds.

Round 1 - Aptitude Test 

An aptitude test is an assessment that measures a candidate's mental aptitude or ability.

Round 2 - Technical 

(3 Questions)

  • Q1. Can you briefly tell us about yourself?
  • Q2. What do you consider to be your strongest points?
  • Q3. Which programming languages do you use regularly in your work?
Round 3 - Group Discussion 

A group discussion is a structured conversation in which participants share their ideas, perspectives, and solutions on a specific topic.

Round 4 - HR 

(2 Questions)

  • Q1. What motivates you to seek employment at Wipro?
  • Q2. What are your long-term career goals?
Round 5 - Assignment 

Assignment details refer to the specifics of an assignment, including any documents and/or emails or subsequent correspondence between the company and the client.

Interview Preparation Tips

Topics to prepare for Wipro Software Developer interview:
  • Your academic achievement
  • Responsibility related to your p
  • The way you address mistakes at
Interview preparation tips for other job seekers - Reflect on the activities you enjoy and what brings you happiness, while also taking into account your skills and strengths.

Interview experience: 3 (Average)
Difficulty level: Moderate
Process duration: Less than 2 weeks
Result: No response

I applied via Walk-in and was interviewed in Oct 2024. There were 2 interview rounds.

Round 1 - HR 

(3 Questions)

  • Q1. 2-way and 3-way matching
  • Ans. 

    2-way and 3-way matching are processes used in accounting to ensure accuracy in financial transactions.

    • 2 way matching involves matching the invoice with the purchase order

    • 3 way matching involves matching the invoice with the purchase order and the receiving report

    • Helps in detecting errors or discrepancies in the billing process

  • Answered by AI
  • Q2. Accounts payable and accounts receivable
  • Q3. Some general questions
Round 2 - Technical 

(7 Questions)

  • Q1. T-code to review a PO
  • Ans. 

    ME23N

    • ME23N is the T code used to review purchase orders in SAP

    • It allows users to view details of a specific purchase order, such as vendor information, item details, and delivery status

  • Answered by AI
  • Q2. If you find a mismatch in quantity, whom do you contact?
  • Ans. 

    Contact the inventory manager or supervisor for resolution.

    • Contact inventory manager or supervisor immediately

    • Verify the discrepancy with relevant documentation

    • Discuss potential causes of the mismatch and work towards resolution

  • Answered by AI
  • Q3. Vendor ledger entry
  • Q4. Where do you see the GRN?
  • Ans. 

    GRN stands for Goods Receipt Note, which is a document used in the procurement process to confirm the receipt of goods.

    • GRN is used in the procurement process to verify that the goods ordered have been received in good condition.

    • It includes details such as the quantity, quality, and condition of the goods received.

    • GRN is an important document for inventory management and accounting purposes.

    • It is typically generated by ...

  • Answered by AI
  • Q5. First, a self-introduction
  • Q6. How do you learn, and where do you learn from?
  • Q7. You have theoretical knowledge but no practical experience; how can you do the job?
  • Ans. 

    I would seek hands-on experience through internships, workshops, or online tutorials to bridge the gap between theory and practice.

    • Seek internships or part-time jobs in the field to gain practical experience

    • Participate in workshops or training sessions to apply theoretical knowledge in real-life scenarios

    • Utilize online tutorials or courses to practice and implement theoretical concepts

    • Collaborate with professionals in ...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Listen carefully to what the interviewer is asking and answer only what is needed; otherwise stay silent. My big mistake was saying more than what they asked.

Interview experience: 3 (Average)
Difficulty level: Moderate
Process duration: Less than 2 weeks
Result: Not Selected

I applied via Walk-in and was interviewed in Aug 2024. There were 3 interview rounds.

Round 1 - Technical 

(6 Questions)

  • Q1. Introduce yourself.
  • Ans. 

    I am a Senior Software Developer with 8 years of experience in full-stack development, specializing in Java and Angular.

    • 8 years of experience in full-stack development

    • Specialize in Java and Angular

    • Strong problem-solving skills

    • Experience with Agile methodologies

    • Excellent communication and teamwork abilities

  • Answered by AI
  • Q2. What is the use of the yield() method in threading?
  • Ans. 

    In Java, Thread.yield() hints to the scheduler that the current thread is willing to give up the CPU so that other threads can run; the scheduler is free to ignore the hint.

    • Allows other threads to run by giving up the CPU temporarily

    • Useful for preventing one thread from dominating the CPU

    • Can be used to implement cooperative multitasking

  • Answered by AI
  • Q3. What is a hash collision and how can it be handled?
  • Ans. 

    Hash collision occurs when two different inputs produce the same hash value. It can be handled by using techniques like chaining or open addressing.

    • Hash collision is a common issue in hash tables where different keys produce the same hash value.

    • One way to handle hash collisions is by using chaining, where each bucket in the hash table stores a linked list of key-value pairs with the same hash value.

    • Another way to handl...

  • Answered by AI
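
Chaining, as described in the answer, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the class `ChainedHashTable` and its methods are hypothetical names, and each bucket is a Python list of key-value pairs standing in for the linked list mentioned above.

```python
class ChainedHashTable:
    """Hash table that resolves collisions by chaining entries per bucket."""

    def __init__(self, num_buckets=4):
        self._buckets = [[] for _ in range(num_buckets)]

    def _index(self, key):
        # Different keys can map to the same index: that is a collision.
        return hash(key) % len(self._buckets)

    def put(self, key, value):
        bucket = self._buckets[self._index(key)]
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                bucket[i] = (key, value)  # overwrite an existing key
                return
        bucket.append((key, value))       # chain a new entry

    def get(self, key, default=None):
        for existing_key, value in self._buckets[self._index(key)]:
            if existing_key == key:
                return value
        return default


table = ChainedHashTable(num_buckets=4)
table.put(1, "a")
table.put(5, "b")  # 1 and 5 both land in bucket 1 when there are 4 buckets
print(table.get(1), table.get(5))
```

Lookups compare keys inside the bucket, so two colliding keys still return their own values; the cost is a linear scan of the chain.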
  • Q4. What is the use of method references instead of lambda expressions in Java 8?
  • Ans. 

    Method references provide a more concise way to refer to methods by name instead of using lambda expressions.

    • Method references can make code more readable and maintainable by reducing boilerplate code.

    • They can be used to refer to static methods, instance methods, and constructors.

    • Example: list.forEach(System.out::println) is equivalent to list.forEach(item -> System.out.println(item)).

  • Answered by AI
  • Q5. What is vertical and horizontal scaling in micro services?
  • Ans. 

    Vertical scaling increases the capacity of a single server, while horizontal scaling adds more servers to distribute the load.

    • Vertical scaling involves increasing the resources of a single server, such as CPU, RAM, or storage.

    • Horizontal scaling involves adding more servers to distribute the load, often done through containerization or virtualization.

    • Vertical scaling is limited by the capacity of a single server, while ...

  • Answered by AI
  • Q6. Write a Java program to capitalize the first letter of each word in the string "i am a java programmer".
  • Ans. 

    Java program to capitalize first letter of each word in a given string

    • Split the input string by space to get individual words

    • Iterate through each word and capitalize the first letter

    • Join the words back together to form the final capitalized string

  • Answered by AI
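
The three steps in the answer are language-independent. The sketch below shows them in Python, to match the other examples on this page, rather than in the Java that the interviewer asked for; the function name `capitalize_words` is an assumption made for illustration.

```python
def capitalize_words(text):
    """Upper-case the first letter of each space-separated word."""
    words = text.split(" ")                           # 1. split into words
    capped = [w[:1].upper() + w[1:] for w in words]   # 2. capitalize first letter
    return " ".join(capped)                           # 3. join back together


print(capitalize_words("i am a java programmer"))
```

Slicing with `w[:1]` rather than indexing with `w[0]` keeps the function safe for empty words, which appear when the input contains consecutive spaces.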
Round 2 - Managerial round 

(4 Questions)

  • Q1. How does the escalation matrix work in case of production defects?
  • Ans. 

    Escalation matrix defines the process for escalating production defects based on severity and impact.

    • Escalation matrix typically includes levels of escalation, responsible parties at each level, and criteria for escalating defects.

    • For example, a Level 1 escalation may involve the development team, while a Level 2 escalation may involve senior management.

    • Criteria for escalation may include severity of the defect, impact...

  • Answered by AI
  • Q2. What is a recent challenging task you have completed in your current project?
  • Ans. 

    Implemented a real-time data synchronization feature between multiple servers.

    • Designed a custom protocol for efficient data transfer

    • Implemented server-side logic to handle data conflicts

    • Utilized websockets for real-time communication

  • Answered by AI
  • Q3. Do you have the ability to handle tasks individually?
  • Ans. 

    Yes, I have the ability to handle tasks individually.

    • I have a proven track record of successfully completing projects on my own.

    • I am self-motivated and can prioritize tasks effectively.

    • I have strong problem-solving skills which allow me to tackle challenges independently.

  • Answered by AI
  • Q4. Is it okay to give the deliverables under pressure?
  • Ans. 

    Yes, it is okay to give deliverables under pressure as long as quality is not compromised.

    • Meeting deadlines is important in software development.

    • Pressure can sometimes lead to increased focus and productivity.

    • Communication with stakeholders about realistic timelines is key.

    • Prioritizing tasks and managing time effectively can help in delivering under pressure.

  • Answered by AI
Round 3 - HR 

(4 Questions)

  • Q1. Introduce yourself?
  • Ans. 

    I am a Senior Software Developer with 8 years of experience in developing web applications using various technologies.

    • Experienced in full stack development

    • Proficient in languages like Java, JavaScript, and Python

    • Skilled in using frameworks like Spring, React, and Django

    • Strong understanding of database management systems

    • Familiar with Agile development methodologies

  • Answered by AI
  • Q2. Why are you leaving your current company?
  • Ans. 

    Seeking new challenges and growth opportunities.

    • Desire for career advancement

    • Looking for new challenges

    • Seeking better work-life balance

    • Company restructuring or changes in management

  • Answered by AI
  • Q3. What is your current CTC and expected CTC?
  • Ans. 

    My current CTC is $100,000 and my expected CTC is $120,000.

    • Current CTC: $100,000

    • Expected CTC: $120,000

  • Answered by AI
  • Q4. Do you have all the relevant documents of previous companies?
  • Ans. 

    Yes, I have all relevant documents from previous companies.

    • I have copies of offer letters, employment contracts, and performance reviews.

    • I also have any relevant certifications or training records.

    • I can provide references from previous employers if needed.

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Please prepare thoroughly on Java 8, multithreading, and microservices.


Cognizant Interview FAQs

How many rounds are there in Cognizant Pyspark Developer interview?
Cognizant interview process usually has 2 rounds. The most common rounds in the Cognizant interview process are One-on-one Round and Technical.
How to prepare for Cognizant Pyspark Developer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Cognizant. The most common topics and skills that interviewers at Cognizant expect are Pyspark, Spark, Python, Big Data and Life.


People are getting interviews through

  • Walk-in: 100% (based on 1 Cognizant interview)

Low confidence: the data is based on a small number of responses received from the candidates.

Cognizant Pyspark Developer Salary

₹4 L/yr - ₹11.1 L/yr, based on 24 salaries; 11% less than the average Pyspark Developer salary in India.

Cognizant Pyspark Developer Reviews and Ratings

Overall: 4.0/5, based on 1 review

Rating in categories

  • Skill development: 3.0
  • Work-Life balance: 3.0
  • Salary & Benefits: 3.0
  • Job Security: 3.0
  • Company culture: 3.0
  • Promotions/Appraisal: 3.0
  • Work Satisfaction: 3.0