Derive Management Solutions Pvt Ltd

2 Derive Management Solutions Pvt Ltd Jobs

Senior Web Scraping Developer (5-7 yrs)

Derive Management Solutions Pvt Ltd

posted 6d ago

Job Description

About the Job

Role Overview :

- We are seeking a highly skilled and experienced Senior Web Scraping Developer to lead the design, development, and maintenance of sophisticated web scraping solutions.


- You will be instrumental in building robust, scalable, and efficient data extraction pipelines, tackling complex anti-scraping measures, and ensuring the delivery of high-quality, actionable data.


- This role requires a deep understanding of web technologies, strong problem-solving skills, and the ability to work independently and collaboratively within a dynamic team.

Key Responsibilities :

Advanced Scraping Development :

- Design and implement advanced web scraping solutions using Python, JavaScript, and related frameworks (Scrapy, Selenium, Puppeteer, Beautiful Soup).

- Develop and maintain parsers to extract structured data from diverse and complex websites.

- Implement sophisticated techniques for handling dynamic content, including AJAX, JavaScript rendering, and single-page applications (SPAs).

- Optimize scraping processes for speed, reliability, and minimal resource consumption.

Anti-Crawling and Security :

- Develop and implement robust countermeasures against anti-crawling defenses, including IP rotation, proxy management, CAPTCHA solving, and user-agent manipulation.

- Analyze and circumvent advanced anti-scraping techniques, such as honeypots, bot detection, and rate limiting.

- Ensure compliance with website terms of service and ethical scraping practices.

- Implement and manage browser fingerprinting avoidance.

Data Pipeline and Storage :

- Design, develop, and maintain efficient data pipelines for processing, transforming, and storing scraped data.

- Optimize database schemas and queries for large-scale data storage and retrieval (PostgreSQL, MySQL, NoSQL databases).

- Implement data validation and quality control measures to ensure data accuracy and consistency.

- Implement data versioning and change tracking.

Backend and Cloud Integration :

- Develop and maintain backend APIs using frameworks like Flask, FastAPI, Django, or Node.js to expose scraped data.

- Integrate web scraping solutions with cloud platforms (Azure) for scalability, reliability, and cost-effectiveness.

- Implement serverless functions and containerization (Docker, Kubernetes) for efficient deployment and management.

- Utilize message queues (RabbitMQ, Kafka) for distributed scraping tasks.

Data Analysis and ML/NLP :

- Utilize data processing libraries (NumPy, Pandas) for data cleaning, transformation, and analysis.

- Implement machine learning and natural language processing techniques for data enrichment, sentiment analysis, and information extraction.

- Develop and maintain data visualization tools and dashboards to present scraped data insights.

- Develop data quality reports and implement anomaly detection.

Testing and Debugging :

- Conduct thorough testing of scraping scripts and data pipelines to ensure reliability and accuracy.

- Utilize API testing tools (Postman, MITM proxies, browser DevTools) for debugging and troubleshooting.

- Implement logging and monitoring systems to track scraping performance and identify issues.

- Implement unit testing, integration testing, and end-to-end testing.

Collaboration and Communication :

- Work closely with data scientists, analysts, and other stakeholders to understand data requirements and deliver solutions.

- Document technical specifications, code, and processes.

- Mentor junior developers and share knowledge of web scraping best practices.

- Participate in code reviews.

Research and Development :

- Stay up to date with the newest techniques in web scraping and anti-scraping countermeasures.

- Research and implement new tools and technologies to improve scraping efficiency and effectiveness.

Required Skills :

- Expert proficiency in Python and JavaScript.

- Extensive experience with web scraping frameworks and libraries (Scrapy, Selenium, Puppeteer, Beautiful Soup).

- Deep understanding of HTTP/HTTPS protocols, APIs, and web technologies.

- Strong knowledge of database systems (SQL, NoSQL) and data warehousing concepts.

- Proven experience with cloud platforms (Azure, AWS, GCP) and containerization technologies (Docker, Kubernetes).

- Advanced knowledge of anti-crawling techniques and security best practices.

- Experience with data processing and analysis libraries (NumPy, Pandas, Scikit-learn).

- Familiarity with machine learning and natural language processing concepts.

- Proficiency in API testing and debugging tools.

- Strong understanding of network protocols and web browser internals.

- Experience with version control systems (Git).

- Experience with CI/CD pipelines.

Qualifications :

- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.

- 5+ years of professional experience in web scraping, data extraction, or related areas.

- Proven track record of developing and deploying large-scale web scraping solutions.

- Strong problem-solving and analytical skills.

- Excellent communication and collaboration skills.


Functional Areas: Other

Derive Management Solutions Pvt Ltd Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food

Compare Derive Management Solutions Pvt Ltd with

E-Solutions (3.4)
Anri Solutions HR Services (3.4)
Crescendo Global (4.1)
Morpheus Human Consulting (4.7)
Ubique Systems (2.9)
Right Advisors (4.7)
Ethos HR Management & Projects (4.1)
WSNE Consulting (3.3)
Sarthee Consultancy (4.3)
Addrec Solutions (4.2)
R2R Consults (4.0)
Arnold Consulting (3.3)
Unison International Consulting (4.3)
Flipcarbon Integrated Solutions (4.3)
EliteRecruitments (3.5)
Zodiac HR Consultants (3.3)
Genesis HR Services (3.0)
KK Associates (4.1)
Unisis Engineering (2.8)
Jombay (3.4)

Similar Jobs

Artificial Intelligence Engineer at Truelancer.com, 5-7 Yrs, ₹ 15-21 LPA
Senior Big Data Engineer at Live Connections Placements Private Limited, 6-12 Yrs, ₹ 25-35 LPA
ML Engineer at Anlage Infotech (I) Pvt. Ltd., 3-5 Yrs, ₹ 14-19 LPA
Backend Python Developer at infoplus, 3-5 Yrs, ₹ 12-15 LPA
Data Science at V-ALIGN TECHNOLOGIES PRIVATE LIMITED, 3-5 Yrs, ₹ 15-18 LPA
Artificial Intelligence Engineer at JetSynthesys, 4-5 Yrs, ₹ 12-15 LPA
Data Scientist at IndiaLends, 4-5 Yrs, ₹ 10-15 LPA
Machine Learning Engineer at Pro Integrate Consulting, 4-5 Yrs, ₹ 12-15 LPA
Python Developer Lead at ShopClues.com, 5-8 Yrs, ₹ 10-18 LPA
Backend Python Developer at Netlabs Global IT Services Pvt. Ltd., 3-5 Yrs, ₹ 12-15 LPA

Derive Management Solutions Pvt Ltd Bangalore / Bengaluru Office Location

Headquarters: 10th floor, RMZ Latitude Commercial Building, Bellary Rd, Hebbal, Bengaluru, Karnataka 560024

Senior Web Scraping Developer (5-7 yrs), posted 6d ago via hirist.com
Senior iOS Developer - Objective C/SWIFT (7-9 yrs), posted 4mon ago via hirist.com