
Python Developer (Web Scraping, LLMs, Cloud, AI/ML)

2-4 years

Mumbai

2 vacancies

Global E

posted 15d ago

Job Description

Role & responsibilities

  • Develop web scraping solutions using Python, Selenium, and other scraping tools to collect data from various websites, handling both semi-structured data (e.g., JSON, XML) and unstructured data (e.g., PDFs, text documents).
  • Build and maintain ETL pipelines using AWS Glue, PySpark, and Pandas to clean, transform, and load scraped data for analytics, ensuring data integrity and high performance.
  • Manage AWS Glue Catalog, Glue Jobs, and Workflows to orchestrate and automate cloud-based data processing for large-scale scraping operations.
  • Implement Large Language Models (LLMs) such as GPT, BERT, and T5, along with Retrieval-Augmented Generation (RAG), for advanced data extraction, text summarization, and entity recognition from unstructured sources.
    • Apply LLMs for NLP tasks such as text classification, document parsing, and content generation across different data formats like PDFs and text documents.
    • Use OCR technologies and models for extracting text from scanned documents and PDFs.
  • Design and maintain cloud-based workflows using AWS EC2, Lambda, S3, CloudWatch, and Athena to manage, optimize, and scale scraping operations efficiently.
  • Work with semi-structured (e.g., XML, JSON) and structured data to build robust ingestion and processing workflows.
  • Utilize cloud-native tools and compression strategies like Snappy and Gzip to optimize performance for large-scale datasets.
  • Automate and enhance scraping workflows by integrating LLMs and AI/ML for smarter data analysis, classification, and reporting.


Preferred candidate profile

We are seeking an experienced Python Developer with a strong background in web scraping, Large Language Models (LLMs), and cloud-based ETL workflows to join our dynamic engineering team. This position will involve developing and maintaining sophisticated data extraction pipelines that process unstructured, semi-structured, and structured data. The ideal candidate will have a proven track record of utilizing AI/ML models, particularly LLMs, to streamline and optimize data workflows.



Preferred Qualifications:

  • Advanced degree in Computer Science, Data Science, Engineering, or related field.
  • Experience with Docker, Kubernetes, or other containerization technologies.
  • Knowledge of distributed computing and parallel processing.
  • Experience with NLP (Natural Language Processing) tasks and tools such as spaCy, Hugging Face Transformers, or similar.
  • Familiarity with data pipelines and orchestration tools like Apache Airflow, Prefect, or similar.
  • Contributions to open-source projects or personal GitHub repositories showcasing your expertise.





Employment Type: Full Time, Permanent

Python Developer salary at Global E

reported by 4 employees
₹3 L/yr - ₹4.5 L/yr
36% less than the average Python Developer Salary in India

Global E Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food +6 more

Compare Global E with

  • Udaan: 4.0
  • Swiggy: 3.8
  • CARS24: 3.6
  • BlackBuck: 3.8
  • Blinkit: 3.7
  • Ninjacart: 4.0
  • Rivigo: 3.9
  • Meesho: 3.7
  • Paisabazaar.com: 3.5
  • Tata 1mg: 3.7
  • Wheelseye Technology: 3.7
  • Urban Company: 3.4
  • PharmEasy: 3.7
  • Zepto: 3.5
  • Stanza Living: 3.0
  • Rebel Foods: 3.7
  • Dunzo: 3.4
  • Porter: 4.1
  • Cogoport: 2.8
  • ShareChat: 3.7

Similar Jobs for you

  • Engineer at RazorThink | Pune, Chennai + 1 | 3-7 Yrs | ₹ 10-12 LPA
  • Software Engineer at Wing Assistant | Mumbai | 2-3 Yrs | ₹ 4-5 LPA
  • Software Engineer at Wing Assistant | Bangalore / Bengaluru | 2-3 Yrs | ₹ 4-5 LPA
  • Software Engineer at Wing Assistant | New Delhi | 2-3 Yrs | ₹ 4-5 LPA
  • Python Software Developer at Strategia Advisor | Mumbai | 2-4 Yrs | ₹ 6-7 LPA
  • Python Developer at Sigma Solve, Inc. | Ahmedabad, Gujarat | 3-5 Yrs | ₹ 10-15 LPA
  • Python Developer at Sigma Solve, Inc. | Ahmedabad | 3-5 Yrs | ₹ 10-11 LPA
  • Developer at Technostacks Infotech Pvt. Ltd. | Ahmedabad | 2-6 Yrs | ₹ 4-8 LPA
  • Developer at High Tech Infosystems | Jabalpur | 3-7 Yrs | ₹ 8-12 LPA
  • Developer at Ergobite | Pune | 3-8 Yrs | ₹ 5-10 LPA

Python Developer (Web Scraping, LLMs, Cloud, AI/ML) | Mumbai | 2-4 Yrs | 15d ago · via naukri.com