Upload Button Icon Add office photos
filter salaries All Filters

44 Bluebyte Technologies Jobs

Data Engineer - Spark/Hadoop (3-8 yrs)

3-8 years

Data Engineer - Spark/Hadoop (3-8 yrs)

Bluebyte Technologies

posted 3d ago

Job Description

Job Description :


Main Skills : Apache Airflow, Java, Maven, SQL, GCP services like Big Query, Cloud Composer, Data Proc, Data Flow

Design & Implement Data Pipelines :

- Develop, implement, and maintain scalable data pipelines using Google Cloud Dataflow and Apache Beam.

- Ensure the pipelines can process large-scale data efficiently with proper data validation, transformation, and loading.

Cloud Infrastructure & GCP Services :

- Leverage a variety of GCP services including BigQuery, Cloud Storage, Pub/Sub, Cloud Functions, and Cloud Composer to build, deploy, and manage data workflows.

- Utilize Google Cloud SDK and other cloud tools for managing cloud resources and automating workflows.

Optimize Data Flow & Performance :

- Monitor and optimize pipeline performance to ensure that data processing is cost-effective and efficient, meeting service-level agreements (SLAs).

- Troubleshoot and resolve issues related to data quality, pipeline execution failures, and performance bottlenecks.

Data Quality & Transformation :

- Implement data validation and cleaning techniques to ensure the accuracy and consistency of data throughout the pipeline.

- Develop transformation logic to process structured, semi-structured, and unstructured data from various sources.

Collaboration & Documentation :

- Collaborate with data scientists, analysts, and other stakeholders to ensure data flows meet the analytical needs of the business.

- Maintain clear documentation for data pipeline designs, architecture, and operational procedures.

Automation & CI/CD :

- Implement automation strategies for pipeline deployment, testing, and monitoring using CI/CD tools such as Cloud Build, Jenkins, or GitLab CI.

Security & Compliance :

- Follow best practices for securing data and ensuring compliance with industry regulations, including encryption, access control, and auditing.

Reporting & Monitoring :

- Implement monitoring and alerting for data pipelines using tools such as Google Stackdriver, Cloud Monitoring, and Cloud Logging.

- Generate reports on pipeline health, data quality, and performance for internal stakeholders.

Required Skills and Qualifications :

Experience :

- 3+ years of experience in data engineering or cloud engineering, specifically working with Google Cloud Platform (GCP).

- Proficiency in building data pipelines using Google Dataflow, Apache Beam, or similar tools.

- Strong experience with BigQuery, Cloud Storage, Pub/Sub, and Cloud Functions for data processing and management.

Technical Skills :


- Expertise in SQL and scripting languages (e.g., Python, Java, Scala).

- Experience with distributed data processing and big data technologies such as Apache Hadoop, Spark, or Kafka.

- Understanding of data modeling, ETL processes, and data warehousing.

- Familiarity with cloud security concepts, including IAM roles, encryption, and network security in GCP.

Soft Skills :


- Strong analytical and problem-solving abilities.

- Excellent communication skills for collaborating with cross-functional teams.

- Ability to manage multiple projects and priorities in a fast-paced environment.

Education :


- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.

- Relevant certifications, such as Google Cloud Professional Data Engineer, are a plus.


Functional Areas: Software/Testing/Networking

Read full job description

Compare Bluebyte Technologies with

TCS

3.7
Compare

Accenture

3.9
Compare

Cognizant

3.8
Compare

Wipro

3.7
Compare

Capgemini

3.8
Compare

HDFC Bank

3.9
Compare

ICICI Bank

4.0
Compare

Infosys

3.7
Compare

HCLTech

3.6
Compare

Tech Mahindra

3.6
Compare

Genpact

3.9
Compare

Teleperformance

3.9
Compare

Concentrix Corporation

3.8
Compare

Axis Bank

3.8
Compare

Amazon

4.1
Compare

Jio

3.9
Compare

Reliance Retail

3.9
Compare

IBM

4.1
Compare

iEnergizer

4.7
Compare

HDB Financial Services

4.0
Compare

Similar Jobs for you

Data Engineer at Fractal31 Pvt Ltd

3-10 Yrs

₹ 10-20 LPA

Data Engineer at Emperen Technologies

5-16 Yrs

₹ 15-30 LPA

Data Engineer at Zelarsoft Pvt.Ltd.

7-11 Yrs

₹ 15-30 LPA

Data Engineer at Codersbrain India Private Limited

Bangalore / Bengaluru

5-7 Yrs

₹ 25-30 LPA

Data Engineer at Novo Tree Minds

Mumbai

2-5 Yrs

₹ 10-25 LPA

Data Engineer at Softpath Technologies LLC

4-6 Yrs

₹ 11-28 LPA

Big Data Engineer at DamcoSoft Pvt Ltd

6-10 Yrs

₹ 12-30 LPA

Data Engineer at Tekgence India Private Limited

Chennai, Pune

5-11 Yrs

₹ 15-30 LPA

Senior Data Engineer at Global KPO

Gurgaon / Gurugram

5-8 Yrs

₹ 25-32 LPA

Data Engineer at Acrocede Technologies Private Limited

Bangalore / Bengaluru

5-10 Yrs

₹ 15-30 LPA

Data Engineer - Spark/Hadoop (3-8 yrs)

3-8 Yrs

3d ago·via hirist.com

Snowflake Administrator - SQL/Python (10-12 yrs)

10-12 Yrs

2d ago·via hirist.com

Data Validation Specialist - Snowflake (4-5 yrs)

4-5 Yrs

5d ago·via hirist.com

Full Stack Developer (5-7 yrs)

5-7 Yrs

9d ago·via hirist.com

Manager - Logistics - BFS (4-5 yrs)

4-5 Yrs

10d ago·via iimjobs.com

Project Manager - Credit Card (8-10 yrs)

8-10 Yrs

12d ago·via iimjobs.com

Generative AI Architect - NLP (12-13 yrs)

12-13 Yrs

13d ago·via hirist.com

Salesforce Developer - Apex/Visual Force (10-12 yrs)

10-12 Yrs

20d ago·via hirist.com
write
Share an Interview