Data Engineer - ETL/Python (3-6 yrs)
Whitetable
posted 5d ago
Company Overview :
An end-to-end investment business focused on managing capital to build and grow a diversified portfolio. The company specializes in :
- Sourcing funds and identifying high-growth sectors through thorough market research.
- Executing investments via equity, debt, or hybrid structures after rigorous due diligence.
- Providing strategic guidance and operational expertise to portfolio companies for accelerated growth.
- Monitoring portfolio performance and planning exits through IPOs, mergers, or acquisitions to maximize returns.
- Aligning investments with ESG principles to ensure sustainable growth, transparency, compliance, and ethical practices for long-term value creation.
About the Role :
As a Data Engineer for the Data Science team, you will play a pivotal role in enriching and maintaining the organization's central repository of datasets. This repository serves as the backbone for advanced data analytics and machine learning applications, enabling actionable insights from financial and market data.
You will work closely with cross-functional teams to design and implement robust ETL pipelines that automate data updates and ensure accessibility across the organization. This is a critical role requiring technical expertise in building scalable data pipelines, ensuring data quality, and supporting data analytics and reporting infrastructure for business growth.
Key Responsibilities :
1. ETL Development :
- Design, develop, and maintain efficient ETL processes for handling datasets of varying scale.
- Implement and optimize data transformation and validation processes to ensure data accuracy and consistency.
- Collaborate with cross-functional teams to gather data requirements and translate business logic into ETL workflows.
2. Data Pipeline Architecture :
- Architect, build, and maintain scalable and high-performance data pipelines to enable seamless data flow.
- Evaluate and implement modern technologies to enhance the efficiency and reliability of data pipelines.
- Build web-scraping pipelines to source sector-specific datasets on an ad hoc basis.
3. Data Modeling :
- Design and implement data models to support analytics and reporting needs across teams.
- Optimize database structures to enhance performance and scalability.
4. Data Quality and Governance :
- Develop and implement data quality checks and governance processes to ensure data integrity.
- Collaborate with stakeholders to define and enforce data quality standards across the organization.
5. Documentation and Communication :
- Maintain detailed documentation of ETL processes, data models, and other key workflows.
- Effectively communicate complex technical concepts to non-technical stakeholders and business users.
6. Cross-Functional Collaboration :
- Work closely with the Quant team and developers to design and optimize data pipelines.
- Collaborate with external stakeholders to understand business requirements and translate them into technical solutions.
Essential Requirements :
Basic Qualifications :
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Proven experience as a Data Engineer with expertise in ETL techniques (minimum 2 years).
- 3-6 years of strong programming experience in languages such as Python, Java, or Scala.
- Hands-on experience in web scraping to extract and transform data from publicly available web sources.
- Proficiency with cloud-based data platforms such as AWS, Azure, or GCP.
- Strong knowledge of SQL and experience with relational and non-relational databases.
- Deep understanding of data warehousing concepts and architectures.
- Familiarity with big data technologies like Hadoop, Spark, and Kafka.
- Experience with data modeling tools and techniques.
- Excellent problem-solving, analytical, and communication skills.
Preferred Qualifications :
- Master's degree in Computer Science or Data Science.
- Knowledge of data streaming and real-time processing frameworks.
- Familiarity with data governance and security best practices.
Functional Areas: Software/Testing/Networking