Design, develop, and implement scalable and efficient data pipelines in the cloud using Python, SQL, and relevant technologies.
Build and maintain data infrastructure on platforms such as AWS, leveraging services like EMR, Redshift, and others.
Collaborate with data scientists, analysts, and other stakeholders to understand their requirements and provide the necessary data solutions.
Develop and optimize ETL (Extract, Transform, Load) processes to ensure the accuracy, completeness, and timeliness of data.
Create and maintain data models, schemas, and database structures using PostgreSQL and other relevant database technologies.
Experience reporting tools such as Superset, (good to have : Domo, or Tableau, Quicksight) to develop visually appealing and insightful data visualizations and dashboards.
Monitor and optimize the performance and scalability of data systems, ensuring high availability and reliability.
Implement and maintain data security and privacy measures to protect sensitive information.
Collaborate with the engineering team to integrate data solutions into existing applications or build new applications as required.
Stay up-to-date with industry trends, emerging technologies, and best practices in data engineering
Qualifications:
Bachelors or masters degree in Computer Science, Engineering, or a related field.
Strong proficiency in Python and SQL for data manipulation, analysis, and scripting.
Extensive experience with cloud platforms, particularly AWS, and working knowledge of services like EMR, Redshift, and S3.
Solid understanding of data warehousing concepts and experience with relational databases like PostgreSQL.
Familiarity with data visualization and reporting tools such as Superset, Domo, or Tableau.
Experience with building and maintaining data pipelines using tools like Airflow.
Knowledge of Python web frameworks like Flask or Django for building data-driven applications.
Strong problem-solving and analytical skills, with a keen attention to detail.
Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
Proven ability to work in a fast-paced environment, prioritize tasks, and meet deadlines.