The Data Engineering team builds and optimizes systems spanning data ingestion, processing, storage optimization and more. We work closely with engineers and the product team to build highly scalable systems that tackle real-world data problems and provide our customers with accurate, real-time, fault tolerant solutions to their ever-growing data needs. We support various OLTP and analytics environments, including our Advanced Analytics and Digital Experience Management products.
We are looking for skilled engineers experienced with building and optimizing cloud-scale distributed systems to develop our next-generation ingestion, processing and storage solutions. You will work closely with other engineers and the product team to build highly scalable systems that tackle real-world data problems. Our customers depend on us to provide accurate, real-time and fault tolerant solutions to their ever growing data needs. This is a hands-on, impactful role that will help lead development, validation, publishing and maintenance of logical and physical data models that support various OLTP and analytics environments.
You will be part of a growing team of renowned industry experts in the exciting space of Data and Cloud Analytics
Your contributions will have a major impact on our global customer-base and across the industry through our market-leading products
You will solve complex, interesting challenges, and improve the depth and breadth of your technical and business skills.
What you will be doing
Lead the design, development, and deployment of AI/ML models for threat detection, anomaly detection, and predictive analytics in cloud and network security.
Architect and implement scalable data pipelines for processing large-scale datasets from logs, network traffic, and cloud environments.
Apply MLOps best practices to deploy and monitor machine learning models in production.
Collaborate with cloud architects and security analysts to develop cloud-native security solutions leveraging platforms like AWS, Azure, or GCP.
Build and optimize Retrieval-Augmented Generation (RAG) systems by integrating large language models (LLMs) with vector databases for real-time, context-aware applications.
Analyze network traffic, log data, and other telemetry to identify and mitigate cybersecurity threats.
Ensure data quality, integrity, and compliance with GDPR, HIPAA, or SOC 2 standards.
Drive innovation by integrating the latest AI/ML techniques into security products and services.
Mentor junior engineers and provide technical leadership across projects.
Required skills and experience
AI/ML Expertise
Proficiency in advanced machine learning techniques, including neural networks (eg, CNNs, Transformers) and anomaly detection.
Experience with AI frameworks like TensorFlow, PyTorch, and Scikit-learn .
Strong understanding of MLOps practices and tools (eg, MLflow, Kubeflow).
Experience building and deploying Retrieval-Augmented Generation (RAG) systems, including integration with LLMs and vector databases.
Data Engineering
Expertise designing and optimizing ETL/ELT pipelines for large-scale data processing.
Hands-on experience with big data technologies (eg, Apache Spark, Kafka, Flink).
Proficiency in working with relational and non-relational databases, including ClickHouse and BigQuery .
Familiarity with vector databases such as Pinecone and PGVector and their application in RAG systems.
Experience with cloud-native data tools like AWS Glue, BigQuery, or Snowflake.
Cloud and Security Knowledge
Strong understanding of cloud platforms (AWS, Azure, GCP ) and their services.
Experience with network security concepts, extended detection and response, and threat modeling.
Software Engineering
Proficiency in Python, Java, or Scala for data and ML solution development.
Expertise in scalable system design and performance optimization for high-throughput applications.
Leadership and Collaboration
Proven ability to lead cross-functional teams and mentor engineers.
Strong communication skills to present complex technical concepts to stakeholders.
Education
BSCS or equivalent required, MSCS or equivalent strongly preferred