Upload Button Icon Add office photos
filter salaries All Filters

9 Sagclay Jobs

Data Engineer

5-10 years

Chennai

1 vacancy

Data Engineer

Sagclay

posted 11d ago

Job Description

Development:
Design, build, and maintain robust, scalable, and high-performance data pipelines to ingest, process, and store large volumes of structured and unstructured data.
Utilize Apache Spark within Databricks to process big data efficiently, leveraging distributed computing to process large datasets in parallel.
Integrate data from a variety of internal and external sources, including databases, APIs, cloud storage, and real-time streaming data.
Data Integration & Storage:
Implement and maintain data lakes and warehouses, using technologies like Databricks, Azure Synapse, Redshift, BigQuery to store and retrieve data.
Design and implement data models, schemas, and architecture for efficient querying and storage.
Data Transformation & Optimization:
Leverage Databricks and Apache Spark to perform data transformations at scale, ensuring data is cleaned, transformed, and optimized for analytics.
Write and optimize Spark SQL, PySpark, and Scala code to process large datasets in real-time and batch jobs.
Work on ETL processes to extract, transform, and load data from various sources into cloud-based data environments.
Big Data Tools & Technologies:
Utilize cloud-based big data platforms (e.g., AWS, Azure, Google Cloud) in conjunction with Databricks for distributed data processing and storage.
Implement and maintain data pipelines using Apache Kafka, Apache Flink, and other data streaming technologies for real-time data processing.
Collaboration & Stakeholder Engagement:
Work with data scientists, data analysts, and business stakeholders to define data requirements and deliver solutions that align with business objectives.
Collaborate with cloud engineers, data architects, and other teams to ensure smooth integration and data flow between systems.
Monitoring & Automation:
Build and implement monitoring solutions for data pipelines, ensuring consistent performance, identifying issues, and optimizing workflows.
Automate data ingestion, transformation, and validation processes to reduce manual intervention and increase efficiency.
Document data pipeline processes, architectures, and data models to ensure clarity and maintainability.
Adhere to best practices in data engineering, software development, version control, and code review.
Required Skills & Qualifications:
Education: Bachelors degree in Computer Science, Engineering, Data Science, or a related field (or equivalent experience).

Technical Skills:
Apache Spark: Strong hands-on experience with Spark, specifically within Databricks (PySpark, Scala, Spark SQL).
Experience working with cloud-based platforms such as AWS, Azure, or Google Cloud, particularly in the context of big data processing and storage.
Proficiency in SQL and experience with cloud data warehouses (e.g., Redshift, BigQuery, Snowflake).
Strong programming skills in Python, Scala, or Java.
Big Data & Cloud Technologies:
Experience with distributed computing concepts and scalable data processing architectures.
Familiarity with data lake architectures and frameworks (e.g., AWS S3, Azure Data Lake).
Data Engineering Concepts:
Strong understanding of ETL processes, data modeling, and database design.
Experience with batch and real-time data processing techniques.
Familiarity with data quality, data governance, and privacy regulations.
Problem Solving & Analytical Skills:
Strong troubleshooting skills for resolving issues in data pipelines and performance optimization.
Ability to work with large, complex datasets, and perform data wrangling and cleaning.

Employment Type: Full Time, Permanent

Read full job description

Compare Sagclay with

TCS

3.7
Compare

Accenture

3.9
Compare

Wipro

3.7
Compare

Cognizant

3.8
Compare

Capgemini

3.7
Compare

HDFC Bank

3.9
Compare

ICICI Bank

4.0
Compare

Infosys

3.6
Compare

HCLTech

3.5
Compare

Tech Mahindra

3.5
Compare

Genpact

3.8
Compare

Teleperformance

3.9
Compare

Concentrix Corporation

3.8
Compare

Axis Bank

3.8
Compare

Amazon

4.1
Compare

Jio

3.9
Compare

Reliance Retail

3.9
Compare

IBM

4.0
Compare

iEnergizer

4.6
Compare

LTIMindtree

3.8
Compare

Similar Jobs for you

Data Engineer at TransOrg

Gurgaon / Gurugram, Bangalore / Bengaluru + 1

3-5 Yrs

₹ 10-20 LPA

Azure Data Engineer at Tiger Analytics

Hyderabad / Secunderabad, Chennai + 1

5-9 Yrs

₹ 15-30 LPA

Lead Data Engineer at Atyeti

Pune

5-9 Yrs

₹ 25-40 LPA

Data Engineer at Decision Minds

Bangalore / Bengaluru

7-12 Yrs

₹ 11-20 LPA

Big Data Architect at Tiger Analytics

Pune, Chennai + 1

9-14 Yrs

₹ 30-40 LPA

Big Data Architect at Tiger Analytics

Gurgaon / Gurugram, Chennai + 1

11-18 Yrs

₹ 30-45 LPA

Azure Data Engineer at Vaco Binary Semantics

Hyderabad / Secunderabad, Gurgaon / Gurugram + 1

12-15 Yrs

₹ 25-40 LPA

Data Engineer at Agilisium

Hyderabad / Secunderabad, Chennai

4-8 Yrs

₹ 15-25 LPA

Senior Data Engineer at Tiger Analytics

Hyderabad / Secunderabad, Chennai

6-10 Yrs

₹ 10-20 LPA

Data Engineer at Hankersystems India

Kochi, Kolkata + 1

3-5 Yrs

₹ 40-45 LPA

Data Engineer

5-10 Yrs

Chennai

11d ago·via naukri.com

Salesforce Developer - LWC/Vlocity (3-7 yrs)

3-7 Yrs

11d ago·via hirist.com

Salesforce Developer

5-10 Yrs

Hyderabad / Secunderabad, Chennai, Bangalore / Bengaluru

11d ago·via naukri.com

Java Developer

5-10 Yrs

Chennai

11d ago·via naukri.com

Accounts Payable Professional

1-6 Yrs

Chennai

11d ago·via naukri.com

Mainframe Developer - JCL/COBOL (5-7 yrs)

5-7 Yrs

1mon ago·via hirist.com

Director - Data Science (10-12 yrs)

10-12 Yrs

2mon ago·via hirist.com

Embedded C Developer

5-10 Yrs

Chennai, Bangalore / Bengaluru

5mon ago·via naukri.com

DevOPS Lead

3-8 Yrs

Chennai

5mon ago·via naukri.com
write
Share an Interview