1 Global E Python Developer Job

Python Developer (Web Scraping, LLMs, Cloud, AI/ML)

2-4 years

Mumbai

2 vacancies

Global E

posted 30d ago

Job Description

Role & responsibilities

  • Develop web scraping solutions using Python, Selenium, and other scraping tools to collect data from various websites, handling both semi-structured data (e.g., JSON, XML) and unstructured data (e.g., PDFs, text documents).
  • Build and maintain ETL pipelines using AWS Glue, PySpark, and Pandas to clean, transform, and load scraped data for analytics, ensuring data integrity and high performance.
  • Manage AWS Glue Catalog, Glue Jobs, and Workflows to orchestrate and automate cloud-based data processing for large-scale scraping operations.
  • Implement Large Language Models (LLMs) such as GPT, BERT, and T5, together with Retrieval-Augmented Generation (RAG), for advanced data extraction, text summarization, and entity recognition from unstructured sources.
    • Apply LLMs for NLP tasks such as text classification, document parsing, and content generation across different data formats like PDFs and text documents.
    • Use OCR technologies and models for extracting text from scanned documents and PDFs.
  • Design and maintain cloud-based workflows using AWS EC2, Lambda, S3, CloudWatch, and Athena to manage, optimize, and scale scraping operations efficiently.
  • Work with semi-structured (e.g., XML, JSON) and structured data to build robust ingestion and processing workflows.
  • Utilize cloud-native tools and compression strategies like Snappy and Gzip to optimize performance for large-scale datasets.
  • Automate and enhance scraping workflows by integrating LLMs and AI/ML for smarter data analysis, classification, and reporting.
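
As a rough illustration of the ingestion and compression responsibilities listed above, the sketch below normalizes JSON and XML payloads into a common record shape and stores them as gzip-compressed JSON Lines. It uses only the Python standard library. The field names (`id`, `title`), the `<item>` element layout, and all function names are assumptions made for this example; they do not describe Global E's actual pipeline, and Snappy is omitted because it requires a third-party package.

```python
import gzip
import json
import xml.etree.ElementTree as ET


def records_from_json(text):
    """Parse a JSON array of objects into a list of normalized dicts."""
    data = json.loads(text)
    return [{"id": str(item["id"]), "title": item["title"]} for item in data]


def records_from_xml(text):
    """Parse <item id="..."><title>...</title></item> elements into dicts."""
    root = ET.fromstring(text)
    return [
        {"id": item.get("id"), "title": item.findtext("title", default="")}
        for item in root.iter("item")
    ]


def to_gzipped_jsonl(records):
    """Serialize records as JSON Lines and compress the result with gzip."""
    lines = "\n".join(json.dumps(r, sort_keys=True) for r in records)
    return gzip.compress(lines.encode("utf-8"))


def from_gzipped_jsonl(blob):
    """Inverse of to_gzipped_jsonl, used here to check the round trip."""
    text = gzip.decompress(blob).decode("utf-8")
    return [json.loads(line) for line in text.splitlines() if line]


# Hypothetical sample payloads standing in for scraped content.
json_payload = '[{"id": 1, "title": "Alpha"}]'
xml_payload = '<items><item id="2"><title>Beta</title></item></items>'

records = records_from_json(json_payload) + records_from_xml(xml_payload)
blob = to_gzipped_jsonl(records)
```

In a real workflow the compressed bytes would more likely be written to object storage such as S3 and queried from there; this sketch keeps everything in memory to show only the normalize-then-compress step.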


Preferred candidate profile

We are seeking an experienced Python Developer with a strong background in web scraping, Large Language Models (LLMs), and cloud-based ETL workflows to join our dynamic engineering team. This position will involve developing and maintaining sophisticated data extraction pipelines that process unstructured, semi-structured, and structured data. The ideal candidate will have a proven track record of utilizing AI/ML models, particularly LLMs, to streamline and optimize data workflows.



Preferred Qualifications:

  • Advanced degree in Computer Science, Data Science, Engineering, or related field.
  • Experience with Docker, Kubernetes, or other containerization technologies.
  • Knowledge of distributed computing and parallel processing.
  • Experience with NLP (Natural Language Processing) tasks and tools such as spaCy, Hugging Face Transformers, or similar.
  • Familiarity with data pipelines and orchestration tools like Apache Airflow, Prefect, or similar.
  • Contributions to open-source projects or personal GitHub repositories showcasing your expertise.





Employment Type: Full Time, Permanent

Python Developer salary at Global E

reported by 4 employees
₹3 L/yr - ₹4.5 L/yr
36% less than the average Python Developer Salary in India

Global E Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food +6 more

Python Developer (Web Scraping, LLMs, Cloud, AI/ML)

2-4 Yrs

Mumbai

30d ago·via naukri.com