Work closely with Software Engineers, Data Scientists, Data Operations, and Business Analysts to meet the company s data storage, access, and analysis needs.
Develop and implement processes for data cleaning, ensuring data quality and integrity.
Collaborate with the Development and Operations teams to deploy and maintain clustered computing across multi-cloud environments.
Monitor, maintain, and enhance existing data pipelines to ensure reliability and data integrity.
Maintain documentation of data infrastructure, processes, and data dictionaries.
Work independently on logically complex tasks with some external dependencies while tracking and reliably pushing the work through the process.
Take ownership of the codebase they work in and contribute to any required improvements.
Solicit feedback from peers, teammates, and managers to identify improvement areas and take steps to learn and grow.
Follow defined engineering processes and share new tools or processes to help the team be more collaborative, effective, or efficient.
Stay updated with industry trends and emerging technologies to improve data infrastructure and processes continuously.
Basic Qualifications :
Bachelor s degree in computer science or a related field
2-3 years of relevant engineering experience
Some relevant experience in Data Engineering space
Concerned with the success of their team
Respectful towards teammates regardless of their abilities
Able to work in a highly collaborative software development environment
Curious to understand the works problem space and why.
Passionate about testing, code quality, and continuous integration
Persistent while facing roadblocks, dispatching them efficiently, and pulling in others as necessary.
Comfortable with source control, especially git
Experience with cloud platforms - AWS or GCP
Necessary Experience on:
SQL, and Query Optimization.
Scala/Python.
Hands-on experience with Apache Spark or similar big data tools.
Preferred Qualifications:
Passionate about solving problems for a fast-paced FinTech company
We would be delighted to hire someone who has knowledge or some experience with:
Experience with databases like Redshift, Postgres, or similar tools.
Understanding of data warehousing concepts and methodologies
Orchestration tools like Airflow etc.
Familiarity with DevOps tools such as K8s, GitHub actions, or Docker
Familiar with Agile methodologies such as Scrum or Kanban