Data Engineer - Python/PySpark (4-6 yrs)
Maiora
What You Will Do:
- Own the documentation, design, development, and architecture of Hadoop applications.
- Apply at least 3 years of hands-on experience with Big Data technologies such as Impala, Hive, Hadoop, Spark, Spark Streaming, and Kafka.
- Demonstrate excellent programming skills in Python.
- Work with stream-processing systems such as Storm and Spark Streaming.
- Work with relational SQL and NoSQL databases, including Vertica.
- Work with cloud services.
- Use the Cloudera Hadoop distribution, shell scripting, and Superset, with hands-on cluster management.
- Development: Create and maintain scalable big data applications using Python, Spark, Hive, and Impala.
- Data Pipelines: Develop and optimize data processing pipelines to handle large datasets.
- Integration: Implement data ingestion, transformation, and loading processes.
- Collaboration: Work with data scientists and analysts to meet data requirements.
- Quality Control: Ensure data quality, integrity, and security.
- Performance: Monitor and troubleshoot performance issues to improve efficiency.
- Documentation: Participate in code reviews, testing, and documentation.
- Learning: Stay updated with industry trends and advancements in big data technologies.
Requirements:
- Bachelor's or Master's degree in Computer Science, IT, or related field.
- At least 3 years in a Big Data Developer role.
- Proficiency in Python.
- Strong experience with Apache Spark.
- Hands-on experience with Hive and Impala.
- Familiarity with Hadoop, HDFS, Kafka, and other big data tools.
- Knowledge of data modeling, ETL processes, and data warehousing concepts.
Soft Skills:
- Excellent problem-solving, communication, and teamwork skills.
Responsibilities:
- Collaborate with a dynamic team in a fast-paced environment to develop and maintain Python-based applications.
- Write clean, scalable, and well-documented code.
- Design and implement software solutions, ensuring high performance and responsiveness.
- Optimize code for maximum efficiency and maintainability.
- Collaborate with cross-functional teams to define, design, and ship new features.
- Contribute to the entire software development lifecycle, from concept to deployment.
- Troubleshoot, debug, and address software defects and issues.
- Stay updated on industry best practices and emerging technologies.
Required Skills:
- Strong proficiency in Python and PySpark.
- Experience writing SQL queries and scripts.
- Experience creating ETL flows and data orchestration.
- Experience working with file formats: CSV, Excel, Parquet.
- Good to have: working experience with Databricks and Spark Server.
- Good to have: working experience with Power BI and Tableau.
- Knowledge of database systems: MySQL, PostgreSQL, Oracle DB, MSSQL.
- Familiarity with version control systems, particularly Git.
- Exposure to DevOps practices and tools.
- Exposure to cloud services, particularly AWS.
- Experience managing Apache Airflow.
Qualifications:
- Bachelor's degree in Computer Science or a related field.
- Strong problem-solving and algorithmic thinking.
- Ability to work collaboratively in a team-oriented environment.
Functional Areas: Software/Testing/Networking