Upload Button Icon Add office photos
filter salaries All Filters

107 Capgemini Engineering Jobs

PySpark/Databricks Engineer - Big Data Technologies (2-13 yrs)

2-13 years

PySpark/Databricks Engineer - Big Data Technologies (2-13 yrs)

Capgemini Engineering

posted 4d ago

Job Role Insights

Flexible timing

Job Description

Job : PySpark/Databricks Engineer

Open for Multiple Locations with WFO and WFH

Job Description :

We are looking for a PySpark solutions developer and data engineer that is able to design and build solutions for one of our Fortune 500 Client programs, which aims to build a data standardized and curation-based Hadoop cluster

This high visibility, fast-paced key initiative will integrate data across internal and external sources, provide analytical insights, and integrate with the customer s critical systems

Key Responsibilities :

- Ability to design, build and unit test applications on Spark framework on Python.

- Build PySpark based applications for both batch and streaming requirements, which will require in-depth knowledge on majority of Hadoop and NoSQL databases as well.

- Develop and execute data pipeline testing processes and validate business rules and policies.

- Optimize performance of the built Spark applications in Hadoop using configurations around Spark Context, Spark-SQL, Data Frame, and Pair RDDs.

- Optimize performance for data access requirements by choosing the appropriate native Hadoop file formats (Avro, Parquet, ORC etc) and compression codec respectively.

- Ability to design build real-time applications using Apache Kafka Spark Streaming

- Build integrated solutions leveraging Unix shell scripting, RDBMS, Hive, HDFS File System, HDFS File Types, HDFS compression codec.

- Build data tokenization libraries and integrate with Hive Spark for column-level obfuscation

- Experience in processing large amounts of structured and unstructured data, including integrating data from multiple sources.

- Create and maintain integration and regression testing framework on Jenkins integrated with BitBucket and/or GIT repositories

- Participate in the agile development process, and document and communicate issues and bugs relative to data standards in scrum meetings

- Work collaboratively with onsite and offshore team.

- Develop review technical documentation for artifacts delivered.

- Ability to solve complex data-driven scenarios and triage towards defects and production issues

- Ability to learn-unlearn-relearn concepts with an open and analytical mindset

- Participate in code release and production deployment.

- Challenge and inspire team members to achieve business results in a fast paced and quickly changing environment

- BE/B.Tech/ B.Sc. in Computer Science/Statistics, Econometrics from an accredited college or university.

- Minimum 3 years of extensive experience in design, build and deployment of PySpark-based applications.

- Expertise in handling complex large-scale Big Data environments preferably (20Tb+).

- Minimum 3 years of experience in the following: HIVE, YARN, HDFS preferably on Hortonworks Data Platform.

- Good implementation experience of OOPS concepts.

- Hands-on experience writing complex SQL queries, exporting, and importing large amounts of data using utilities.

- Ability to build abstracted, modularized reusable code components.

- Hands-on experience in generating/parsing XML, JSON documents, and REST API request/responses


Functional Areas: Other

Read full job description

Capgemini Engineering Interview Questions & Tips

Prepare for Capgemini Engineering Engineer roles with real interview advice

People are getting interviews at Capgemini Engineering through

(based on 201 Capgemini Engineering interviews)
Job Portal
Campus Placement
Company Website
Referral
Walkin
Recruitment Consultant
42%
13%
12%
10%
6%
3%
14% candidates got the interview through other sources.
High Confidence
?
High Confidence means the data is based on a large number of responses received from the candidates.

What people at Capgemini Engineering are saying

3.7
 Rating based on 34 Engineer reviews

Likes

Worst company ever no salary hike,no permotion,no leave, rotational shift,manager are not supportive

Dislikes

Worst company ever no salary hike,no permotion,no leave, rotational shift,manager not supportive

Read 34 reviews

Engineer salary at Capgemini Engineering

reported by 330 employees with 2-7 years exp.
₹2.9 L/yr - ₹10.1 L/yr
7% more than the average Engineer Salary in India
View more details

What Capgemini Engineering employees are saying about work life

based on 2.1k employees
80%
88%
72%
81%
Flexible timing
Monday to Friday
No travel
Day Shift
View more insights

Capgemini Engineering Benefits

Cafeteria
Work From Home
Health Insurance
Gymnasium
Soft Skill Training
Job Training +6 more
View more benefits

Compare Capgemini Engineering with

TCS

3.7
Compare

Infosys

3.7
Compare

Wipro

3.7
Compare

HCLTech

3.6
Compare

Tech Mahindra

3.6
Compare

Accenture

3.9
Compare

IBM

4.1
Compare

LTIMindtree

3.7
Compare

Teleperformance

3.9
Compare

Ericsson

4.2
Compare

FIS

3.9
Compare

KPMG India

3.5
Compare

Oracle

3.7
Compare

JLL

4.2
Compare

Standard Chartered

3.8
Compare

Bosch Global Software Technologies

4.0
Compare

Nagarro

4.0
Compare

IQVIA

3.9
Compare

Optum

4.0
Compare

Bosch

4.2
Compare

Similar Jobs for you

Senior Big Data Engineer at AEXONIC TECHNOLOGIES PRIVATE LIMITED

5-8 Yrs

₹ 10-26 LPA

Senior Big Data Engineer at AEXONIC TECHNOLOGIES PRIVATE LIMITED

5-8 Yrs

₹ 10-22 LPA

AWS Data Engineer at AppSierra Solutions Pvt Ltd

Bangalore / Bengaluru, Hyderabad / Secunderabad

5-12 Yrs

₹ 15-60 LPA

Senior Big Data Engineer at Yogy HR Solutions

6-10 Yrs

₹ 20-24 LPA

Platform Architect at O9 SOLUTIONS MANAGEMENT INDIA PRIVATE LIMITED

1-2 Yrs

₹ 28-35 LPA

Engineer at Delta Systech India Pvt Ltd.

Chennai

6-11 Yrs

₹ 8-24 LPA

Data Engineer 3 at Anzyglobal

Bangalore / Bengaluru

5-8 Yrs

₹ 15-20 LPA

Pyspark Developer at TQuanta Technologies Pvt. Ltd.

6-15 Yrs

₹ 12-30 LPA

Azure Data Engineer at Juniper Consultancy Services

Bangalore / Bengaluru

8-12 Yrs

₹ 11-25 LPA

Pyspark Developer at C2E Consultancy

7-12 Yrs

₹ 20-36 LPA

Pascal/Delphi Developer (4-12 yrs)

4-12 Yrs

1d ago·via hirist.com

Data Analyst - SQL/SSIS (5-20 yrs)

5-20 Yrs

1d ago·via hirist.com

Database Administrator (3-10 yrs)

3-10 Yrs

1d ago·via hirist.com

DevOps Engineer - AWS/Azure (1-7 yrs)

1-7 Yrs

2d ago·via hirist.com

IT Support Engineer (3-13 yrs)

3-13 Yrs

2d ago·via hirist.com

Software Engineer - C++ (1-18 yrs)

1-18 Yrs

2d ago·via hirist.com
write
Share an Interview