Professionals hailing from the world's best universities, business schools, and engineering institutes, including Harvard, Yale, Carnegie Mellon, Duke, Georgia Tech, Indian Institute of Management (IIM), and Indian Institute of Technology (IIT).
Job Title: Lead Data Scientist
Job Location: Pune
Job summary: HiLabs is looking for a highly motivated and skilled Lead/Sr. Data Scientist focused on the application of emerging technologies. Candidates must be well versed in Python, Scala, Spark, SQL, and the AWS platform. Individuals who join the new Evolutionary Platform team should continually strive to advance AI/ML excellence and technology innovation. The mission is to power the next generation of digital products and services through innovation, collaboration, and transparency. You will be a technology leader and doer who enjoys working in a dynamic, fast-paced environment.
Responsibilities:
Leverage AI/ML techniques and solutions to identify and mathematically interpret complex healthcare problems.
Full-stack development of data pipelines involving Big Data.
Design and develop robust application/data pipelines using Python, Scala, Spark, and SQL.
Lead a team of data scientists, developers, and clinicians to strategize, design, and evaluate AI-based solutions to healthcare problems.
Increase efficiency and improve the quality of solutions offered.
Manage the complete ETL pipeline development process from conception to deployment.
Collaborate with and guide the team on writing, building, and deploying data software.
Follow best design and development practices to ensure high-quality code.
Design, build, and maintain efficient, secure, reusable, and reliable code.
Perform code reviews, testing, and debugging.
Desired Profile:
Bachelor's or Master's degree in Computer Science, Mathematics, or any other quantitative discipline from a premier/Tier 1 institution
5 to 7 years of experience in developing robust ETL data pipelines and implementing advanced AI/ML algorithms (GenAI is a plus).
Strong experience working with technologies like Python, Scala, Spark, Apache Solr, MySQL, Airflow, AWS, etc.
Experience working with relational databases like MySQL, SQL Server, Oracle, etc.
Good understanding of large system architecture and design
Strong understanding of the core concepts of machine learning and the math behind it.
Experience working in an AWS/Azure cloud environment
Experience using version control tools and code repositories such as Git and Bitbucket
Experience using tools like Maven, Jenkins, and Jira
Experience working in an Agile software delivery environment, with exposure to continuous integration and continuous delivery tools
Great collaboration and interpersonal skills
Ability to work with team members and lead by example in code, feature development, and knowledge sharing