Upload Button Icon Add office photos
filter salaries All Filters

12 Caspex Corp Jobs

Big Data Engineer @ Mumbai

6-11 years

Hyderabad / Secunderabad

1 vacancy

Big Data Engineer @ Mumbai

Caspex Corp

posted 2d ago

Job Role Insights

Flexible timing

Job Description

Job Title: Product Engineer - Big Data

Location: Mumbai, India

Type: Full-Time

Job Summary:

As a Product Engineer - Big Data, you will be responsible for designing, building, and optimizing large-scale data processing pipelines using the latest Big Data technologies. You will collaborate with cross-functional teams, including data scientists, analysts, and product managers, to ensure data is easily accessible, secure, and reliable. Your focus will be on delivering high-quality, scalable solutions for data storage, ingestion, and analysis, while also driving continuous improvements across the data lifecycle.

Key Responsibilities:

  • ETL Pipeline Development & Optimization
    Design and implement complex end-to-end ETL pipelines to handle large-scale data ingestion and processing. Leverage AWS services like EMR, Glue, S3, MSK (Managed Streaming for Kafka), DMS (Database Migration Service), Athena, and EC2 to streamline data workflows and ensure high availability and reliability.
  • Big Data Processing
    Develop and optimize real-time and batch data processing systems using Apache Flink, PySpark, and Apache Kafka. Ensure the data is processed in a fault-tolerant manner, with strong focus on scalability and performance. Work with Apache Hudi for managing datasets and enabling incremental data processing and updates.
  • Data Modeling & Warehousing
    Design and implement data warehouse solutions that support both analytical and operational use cases. Model complex datasets into optimized data structures, ensuring high performance, easy access, and query efficiency for internal stakeholders.
  • Cloud Infrastructure Development
    Build scalable cloud-based data infrastructure leveraging AWS tools. Ensure data pipelines are resilient and adaptable to changes in data volume and variety, with a focus on minimizing costs and maximizing efficiency using services like Managed Apache Airflow for orchestration and EC2 for compute resources.
  • Data Analysis & Insights
    Work closely with business teams and data scientists to understand data needs and deliver high-quality datasets. Conduct in-depth analysis to derive insights from the data, identifying key trends, patterns, and anomalies that can drive business decisions. Present findings in a clear and actionable format.
  • Real-time & Batch Data Integration
    Enable seamless integration of both real-time streaming & batch data from systems like AWS MSK. Ensure consistency in data ingestion and processing across different formats and sources, providing a unified view of the data ecosystem.
  • CI/CD & Automation
    Utilize Jenkins to establish and maintain continuous integration and delivery pipelines. Implement automated testing and deployment workflows, ensuring that new features and updates are seamlessly integrated into production environments without disruptions.
  • Data Security & Compliance
    Collaborate with security teams to ensure that data pipelines comply with organizational and regulatory standards, including GDPR, HIPAA, or any other relevant compliance frameworks. Implement data governance frameworks to ensure data integrity, security, and traceability throughout the data lifecycle.
  • Collaboration & Cross-Functional Work
    Partner with other engineers, data scientists, product managers, and business stakeholders to understand data requirements and deliver scalable solutions. Collaborate in agile teams, participate in sprint planning, and contribute to architectural discussions.
  • Troubleshooting & Performance Tuning
    Identify and resolve performance bottlenecks in data pipelines. Ensure optimal performance through proactive monitoring, tuning, and applying best practices for data ingestion and storage.

Skills & Qualifications:

Must-Have Skills:

  1. AWS Expertise:
    Hands-on experience with core AWS services related to Big Data, including but not limited to EMR, Managed Apache Airflow, Glue, S3, DMS, MSK, Athena, and EC2. Deep understanding of cloud-native data architecture.
  2. Big Data Technologies:
    Proficiency in PySpark and SQL for data transformations and analysis. Experience working with large-scale data processing frameworks such as Apache Flink and Kafka.
  3. Data Frameworks:
    Strong knowledge of Apache Hudi for data lake operations, including CDC (Change Data Capture) and incremental data processing.
  4. Database Modeling & Data Warehousing:
    Expertise in designing and implementing scalable data models for OLAP and OLTP systems. Solid understanding of data warehousing best practices and techniques.
  5. ETL Pipeline Development:
    Proven experience in building robust, scalable ETL pipelines for processing both real-time and batch data across various platforms.
  6. Data Analysis & Insights:
    Ability to conduct complex data analysis to extract valuable business insights. Strong problem-solving skills with a data-driven approach to decision-making.
  7. CI/CD & Automation:
    Basic to intermediate knowledge of CI/CD pipelines using Jenkins or similar tools to automate deployment and monitoring of data pipelines.

Preferred Skills:

  • Experience with containerization and orchestration tools like Docker and Kubernetes for data infrastructure deployment.
  • Familiarity with data governance frameworks and tools to ensure compliance and security.
  • Knowledge of monitoring tools such as AWS CloudWatch, Splunk or Prometheus to keep track of the health and performance of data systems.

Employment Type: Full Time, Permanent

Read full job description

Prepare for Big Data Engineer roles with real interview advice

What people at Caspex Corp are saying

5.0
 Rating based on 1 Big Data Engineer review

Likes

This company is very good and work culture also very nice

  • Salary - Excellent
  • +6 more
Dislikes

No I don't have anything to dislike

Read 1 Big Data Engineer review

Big Data Engineer salary at Caspex Corp

reported by 8 employees
₹4 L/yr - ₹7.5 L/yr
49% less than the average Big Data Engineer Salary in India
View more details

What Caspex Corp employees are saying about work life

based on 44 employees
66%
64%
100%
60%
Flexible timing
Monday to Friday
No travel
Night Shift
View more insights

Caspex Corp Benefits

Submitted by Company
Benefits / 401(K)S
Learning Management
Workers Comp
Submitted by Employees
Work From Home
Free Transport
Child care
Gymnasium
Cafeteria
Free Food +6 more
View more benefits

Compare Caspex Corp with

TCS

3.7
Compare

Infosys

3.6
Compare

Wipro

3.7
Compare

HCLTech

3.5
Compare

Tech Mahindra

3.5
Compare

LTIMindtree

3.8
Compare

Mphasis

3.4
Compare

Hexaware Technologies

3.6
Compare

Persistent Systems

3.5
Compare

Accel Frontline

3.9
Compare

VHS Consulting

3.7
Compare

IVTL Infoview Technologies

3.6
Compare

Apex CoVantage

3.3
Compare

DynPro

3.8
Compare

Avontix

3.9
Compare

Dataflow Group

3.1
Compare

Mol Information Processing Services India

4.0
Compare

Starmark Software

3.5
Compare

Flatworld Mortgage Processing

3.4
Compare

Continuum Managed Services

4.0
Compare

Similar Jobs for you

Site Reliability Engineer at Caspex Corp

Hyderabad / Secunderabad

6-10 Yrs

₹ 20-30 LPA

Big Data Engineer at Iridium Cloud Systems

Delhi/Ncr

7-12 Yrs

₹ 20-35 LPA

Big Data Engineer at GSPANN

Bangalore / Bengaluru

6-11 Yrs

₹ 15-30 LPA

Data Engineer at Statusneo Technology Consulting

Noida, Gurgaon / Gurugram + 1

4-9 Yrs

₹ 20-30 LPA

Big Data Engineer at Spruce IT Pvt. Ltd.

5-6 Yrs

₹ 12-18 LPA

Big Data Engineer at Changeleaders

5-8 Yrs

₹ 18-30 LPA

Big Data Engineer at Valueleaf

7-9 Yrs

₹ 25-30 LPA

Big Data Engineer at InfoCepts

Pune, Chennai + 1

5-10 Yrs

₹ 7-17 LPA

Big Data Engineer at Corner Tree Consulting P Ltd

4-10 Yrs

₹ 22-28 LPA

Big Data Engineer at BRISKWIN IT SOLUTIONS PRIVATE LIMITED

6-9 Yrs

₹ 19-24 LPA

Big Data Engineer @ Mumbai

6-11 Yrs

Hyderabad / Secunderabad

2d ago·via naukri.com

Sr AWS Cloud Engineer

6-11 Yrs

Hyderabad / Secunderabad

3d ago·via naukri.com

Full Stack Engineer (NodeJS, VueJS, JavaScript / TypeScript)

5-10 Yrs

₹ 20 - 25L/yr

Chennai

3d ago·via naukri.com

Java Developer

6-11 Yrs

Hyderabad / Secunderabad

3d ago·via naukri.com

Site Reliability Engineer with AWS

6-10 Yrs

Hyderabad / Secunderabad

3d ago·via naukri.com

Senior Full Stack Developer

7-12 Yrs

Hyderabad / Secunderabad

4d ago·via naukri.com

Full Stack Java Developer - Immediate Joiner

8-12 Yrs

₹ 20 - 30L/yr

Hyderabad / Secunderabad

5d ago·via naukri.com

Payroll Specialist

4-9 Yrs

₹ 4 - 8L/yr

Chennai

16d ago·via naukri.com

C# Developer with JavaScript

5-10 Yrs

₹ 15 - 30L/yr

Chennai

23d ago·via naukri.com

US HR Senior Specialist Onboarding and benefits

10-20 Yrs

₹ 8 - 18L/yr

Bangalore / Bengaluru

23d ago·via naukri.com
write
Share an Interview