16 Dataflow Group Jobs
DataFlow - Lead Data Scientist - Computer Vision (5-8 yrs)
Dataflow Group
posted 16hr ago
Flexible timing
Key skills for the job
About us :
The DataFlow Group is a leading global provider of specialized Primary Source Verification (PSV) solutions, and background screening and immigration compliance services. The DataFlow Group partners with clients across the public and private sectors to assist them in mitigating potential risk by exposing fraudulent :
Education Degrees :
- Employment Certificates
- Practice Licences
- Work Permits
- Passports
Job Description :
As a Lead Data Scientist, you will be responsible for overseeing the development of data models, analytics systems, and algorithms to derive actionable insights and optimize business processes. You will collaborate with cross-functional teams such as engineering, product, and business stakeholders to deliver data-driven solutions that improve business outcomes
Key Responsibilities :
1. Model Selection & Training :
- Lead the process of selecting and training state-of-the-art machine learning models tailored to specific computer vision tasks, including object detection, image segmentation, and anomaly detection.
- Apply advanced techniques for model fine-tuning and optimization to achieve the best performance for various applications.
- Leverage your expertise in YOLO, ViT (Vision Transformers), and other popular architectures to develop high-performing models.
2. Computer Vision & Image Processing :
- Develop and implement computer vision algorithms for tasks such as object detection, image segmentation and anomaly detection.
- Use tools like OpenCV for image pre-processing, augmentation, and other necessary transformations for robust model training.
3. Model Deployment & Serving :
- Deploy machine learning models in production environments using TensorFlow or PyTorch Serving to create highly scalable and performant inference pipelines.
- Ensure smooth integration of models with production systems and optimise them for latency, throughput, and memory efficiency.
4. MLOps & Data Pipelines :
- Build and maintain end-to-end machine learning pipelines, from data ingestion and processing to model inference and monitoring.
- Apply best practices in ML Ops for versioning, tracking experiments, automating workflows, and managing deployments.
- Work with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes) for model serving.
5. SQL & Data Management :
- Write optimised and efficient SQL queries to extract and manipulate large datasets for model training and evaluation.
- Analyse structured and unstructured data to generate insights and features for model development.
6. Model Evaluation & Performance Metrics :
Use statistical methods and metrics to evaluate model performance, ensuring high accuracy, precision, recall, F1 score, etc., on real-world tasks.
Educational qualifications & skills required :
Qualifications :
- Bachelors or Masters. in Computer Science, Machine Learning, Artificial Intelligence, or related field.
- 5+ years of hands-on experience in data science and machine learning, with a focus on computer vision.
Technical know-how :
- Strong expertise in computer vision tasks, including object detection (YOLO), image segmentation, and image classification.
- Knowledge of using tools like LabelMe to create image annotations required for training data labelling.
- In-depth knowledge of deep learning frameworks such as TensorFlow, PyTorch, and Keras.
- Familiarity with Vision Transformer (ViT) models and their applications.
- Proficiency with OpenCV for image pre-processing, augmentation, and other related tasks.
- Experience in building and deploying machine learning models using TensorFlow Serving, PyTorch Serving, or similar tools.
- Strong MLOps experience, with hands-on knowledge of creating and maintaining data pipelines, automating workflows, and managing model deployments using tools like Docker, Kubernetes, and cloud platforms (AWS, GCP, Azure).
Data Handling :
- Expertise in writing optimised SQL queries for large-scale data processing and analysis.
- Ability to work with large, complex datasets and efficiently query, clean, and process them for training and inference purposes.
Model Evaluation :
- Strong understanding of statistical methods and metrics to evaluate machine learning model performance. Experience with A/B testing, model validation techniques, and performance benchmarking.
Good to have :
- Experience with large-scale distributed machine learning systems.
- Knowledge of advanced techniques such as self-supervised learning, transfer learning, or reinforcement learning.
- Familiarity with data versioning and experiment tracking tools like MLflow or DVC.
- Experience in integrating machine learning models into production environments for real-time inference.
Functional Areas: Other
Read full job descriptionPrepare for Lead Data Scientist roles with real interview advice
12-14 Yrs
10-15 Yrs