Upload Button Icon Add office photos
filter salaries All Filters

45 Recro Jobs

Recro - Big Data Developer - Hadoop/PySpark (5-7 yrs)

5-7 years

Recro - Big Data Developer - Hadoop/PySpark (5-7 yrs)

Recro

posted 4d ago

Job Role Insights

Flexible timing

Job Description

We are seeking a Big Data Engineer (Python & PySpark) who will be responsible for developing and optimizing data pipelines using Python and PySpark, along with handling large datasets in distributed systems.

The ideal candidate will have hands-on experience with Apache Spark, Hadoop, Hive, Kafka, and cloud-based solutions, particularly on Google Cloud Platform (GCP).

This position requires someone who is highly technical, can design scalable data architectures, and is capable of performance tuning to meet business requirements.

The successful candidate will also work on cloud-based solutions, data transformation, and work closely with other teams to optimize big data applications.

Key Responsibilities :

- Design and develop scalable data pipelines using Python and PySpark to handle large volumes of structured and unstructured data.

- Integrate data from diverse sources into data processing workflows to ensure data availability for analytics and reporting.

- Build and optimize Apache Spark jobs for data transformation, aggregation, and processing at scale.

- Tune the performance of Spark jobs and manage resource allocation to ensure efficient processing across large datasets.

- Work with Big Data tools including Hadoop, HDFS, Hive, and Kafka to manage and process large datasets in distributed systems.

- Utilize Kafka for stream processing and ensure that data pipelines handle both batch and real-time data efficiently.

- Implement cloud-based big data solutions using Google Cloud Platform (GCP), particularly Google Cloud Dataproc, BigQuery, and other relevant GCP services.

- Optimize cloud data storage, processing, and computing resources to ensure cost-effective scaling.

- Conduct performance tuning for Big Data applications and ensure the scalability and reliability of data systems.

- Troubleshoot issues related to the performance, efficiency, and quality of the data pipelines and applications.

- Work closely with Data Scientists, Data Analysts, and other engineers to ensure seamless integration of big data applications.

- Collaborate with cross-functional teams to understand business requirements and translate them into scalable data solutions.

Skills & Qualifications :

Technical Skills :

- Strong hands-on experience with Python for building data processing pipelines and PySpark for working with distributed data systems.

- Proficiency in using PySpark RDDs and DataFrames to perform large-scale data transformations and aggregations.

- In-depth knowledge of Apache Spark (RDDs, DataFrames, tuning, etc.) for distributed data processing.

- Strong experience with Hadoop, HDFS, Hive, and Kafka for managing and processing large datasets in a distributed environment.

- Hands-on experience with Google Cloud Platform (GCP) services, including Google Cloud Dataproc, BigQuery, and Cloud Storage.

- Familiarity with cloud-based data infrastructure, data storage, and processing solutions for large-scale applications.

- Strong background in handling large-scale, distributed systems, ensuring the reliable processing of massive datasets across clusters.

- Knowledge of partitioning, shuffling, and data storage strategies to optimize distributed data jobs.

- Experience in performance tuning for distributed data applications, ensuring efficiency in both batch and stream processing jobs.

- Familiarity with optimizing resource usage in cloud environments to reduce processing time and cost


Functional Areas: Software/Testing/Networking

Read full job description

Prepare for Big Data Developer roles with real interview advice

People are getting interviews at Recro through

(based on 10 Recro interviews)
Job Portal
Referral
70%
20%
10% candidates got the interview through other sources.
Moderate Confidence
?
Moderate Confidence means the data is based on a sufficient number of responses received from the candidates

What people at Recro are saying

What Recro employees are saying about work life

based on 34 employees
96%
97%
71%
Flexible timing
Monday to Friday
No travel
View more insights

Recro Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food +6 more
View more benefits

Compare Recro with

TCS

3.7
Compare

Infosys

3.7
Compare

Wipro

3.7
Compare

HCLTech

3.5
Compare

Tech Mahindra

3.6
Compare

Cognizant

3.8
Compare

Accenture

3.9
Compare

Capgemini

3.8
Compare

IBM

4.1
Compare

LTIMindtree

3.9
Compare

Udaan

4.0
Compare

Swiggy

3.8
Compare

CARS24

3.6
Compare

BlackBuck

3.8
Compare

Ninjacart

4.0
Compare

Blinkit

3.8
Compare

Rivigo

3.9
Compare

Meesho

3.7
Compare

Paisabazaar.com

3.5
Compare

Tata 1mg

3.7
Compare

Similar Jobs for you

Big Data Engineer at IT

5-8 Yrs

₹ 15-20 LPA

Big Data Developer at ACS Consultants

Metros

6-12 Yrs

₹ 15-33 LPA

Big Data Engineer at SGS Technical Services Pvt. Ltd

Pune

3-5 Yrs

₹ 9-24 LPA

Big Data Engineer at Spruce IT Pvt. Ltd.

5-6 Yrs

₹ 12-18 LPA

Big Data Engineer at Corner Tree Consulting P Ltd

Pune

4-10 Yrs

₹ 22-28 LPA

Big Data Engineer at NucleusTeq Consulting Pvt. Ltd.

Indore

5-10 Yrs

₹ 15-25 LPA

Big Data Engineer at Lakshya Software Technologies Private Limited

5-8 Yrs

₹ 15-24 LPA

Big Data Developer at Hirexa Solutions

Chennai

6-7 Yrs

₹ 19-36 LPA

Senior Data Engineer at Recro

Bangalore / Bengaluru

5-8 Yrs

₹ 15-25 LPA

Big Data Engineer at Zimetrics Technologies Private Limited

4-6 Yrs

₹ 16-25 LPA

write
Share an Interview