Data Engineer - Web Crawling & Scraping (2-5 yrs)
Awign Enterprises
posted 24d ago
Flexible timing
Experience : 2+ years
Location : Pune
Duration : Permanent Opportunity
The successful candidate will be intelligent, accomplished, and energetic, as demonstrated by their professional credentials, and passionate about working with data. This position requires creative, proactive critical thinking and an insatiable appetite for exploring new technologies related to real-time web automation and scraping. Candidates with additional experience in document and image analysis, business analytics, financial data analysis, or risk management will be preferred.
Responsibilities :
- Work closely with the product team to fetch real-time data and design complex scraping flows that extract information from multiple sources.
- Independently research new data sources and document scraping methods, infrastructure requirements, and scaling and monitoring strategies.
- Continuously optimize the products for performance and cost per transaction.
- Acquire, clean, standardize, transform, structure, and store data (a minimal fetch-clean-store sketch follows this list).
- Develop modules to extract data from documents and identify entities and relationships.
- Perform exploratory analysis on datasets to identify potential insights.
- Help other team members with optimizing data models and analytics.
- Maintain data integrity and consistency across multiple databases and applications.
- Build dashboards and visualizations to convey status, changes, and analysis of data.
- Research and learn new frameworks, languages, and technologies as needed.
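As an illustration of the fetch-clean-store flow mentioned above, here is a minimal sketch in Python. The URL, CSS selector, database file, and table name are hypothetical placeholders; a real flow would add retries, scheduling, and monitoring.

import sqlite3

import requests
from bs4 import BeautifulSoup

# Fetch: pull the page (the URL is a placeholder for a real data source).
resp = requests.get("https://example.com/news", timeout=10)
resp.raise_for_status()

# Clean and standardize: parse the HTML, strip whitespace, drop empty rows.
soup = BeautifulSoup(resp.text, "html.parser")
headlines = [h.get_text(strip=True) for h in soup.select("h2.headline")]
headlines = [h for h in headlines if h]

# Store: persist the structured records (the table name is illustrative).
conn = sqlite3.connect("scraped.db")
conn.execute("CREATE TABLE IF NOT EXISTS headlines (text TEXT)")
conn.executemany("INSERT INTO headlines (text) VALUES (?)", [(h,) for h in headlines])
conn.commit()
conn.close()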
Requirements :
- Expertise in web crawling and scraping (e.g., Scrapy, Selenium, BeautifulSoup); a minimal spider sketch follows this list.
- Working knowledge of Airflow.
- Experience working with page models, JS rendering, pop-ups, tabs, IP proxies, and CAPTCHAs.
- Knowledge of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB/DynamoDB, Neo4j).
- Knowledge of API development using frameworks like Flask.
- Knowledge of machine learning libraries/frameworks is essential.
- Demonstrated experience in self-directed, primary-source research.
- Experience extracting, cleaning, and structuring data from unstructured or semi-structured sources such as PDFs, text files, and log files.
- Proficiency in Python is a must.
- Good to have: knowledge of serverless/container technologies for scraping, such as Docker, Cloud Functions, Google Cloud Run, or similar.
- Good to have: knowledge of search frameworks such as Elasticsearch.
- Good to have: knowledge of queue systems such as Kafka and RabbitMQ for data pipelines.
- Familiarity with data visualization tools and libraries (e.g., D3.js, Seaborn).
- Experience with GCP or AWS.
- Professional engineering habits, including TDD, design patterns, code comments, design documentation, and version control (Git).
- Bachelor's degree in a related field, or equivalent self-study and demonstrated technical proficiency.
- Bonus : Knowledge of image processing and OCR for data extraction (e.g., Tesseract); a short OCR sketch follows this list.
- Bonus : Knowledge of text analysis using NLP frameworks.
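For the web-crawling requirement above, a minimal Scrapy spider might look like the sketch below. The start URL and CSS selectors are assumptions for a hypothetical listing page, not a real target site.

import scrapy

class ListingSpider(scrapy.Spider):
    name = "listing"
    # Placeholder start URL; point this at an approved data source.
    start_urls = ["https://example.com/listings"]

    def parse(self, response):
        # Yield one item per listing row; the CSS selectors are assumptions.
        for row in response.css("div.listing"):
            yield {
                "title": row.css("h2::text").get(),
                "price": row.css("span.price::text").get(),
            }
        # Follow pagination until no "next" link remains.
        next_page = response.css("a.next::attr(href)").get()
        if next_page is not None:
            yield response.follow(next_page, callback=self.parse)

Run it with a command like `scrapy runspider listing_spider.py -o items.json`; JS-heavy pages would additionally need a rendering layer such as scrapy-playwright or Selenium.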
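For the OCR bonus item, a minimal sketch using pytesseract (a Python wrapper around Tesseract) could look like this. It assumes Tesseract is installed locally, and "scan.png" is a placeholder file name.

from PIL import Image
import pytesseract

# Load the scanned page (the file name is a placeholder).
image = Image.open("scan.png")
# Run Tesseract over the image and return the recognized text.
text = pytesseract.image_to_string(image)
print(text)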
Functional Areas: Software/Testing/Networking