4+ Web Scraping Jobs in Bangalore (Bengaluru) | Web Scraping Job openings in Bangalore (Bengaluru)
Apply to 4+ Web Scraping Jobs in Bangalore (Bengaluru) on CutShort.io. Explore the latest Web Scraping Job opportunities across top companies like Google, Amazon & Adobe.

Web Scraping engineer
Python web scraping will be responsible for efficient web scraping/web crawling and parsing. The candidate should have demonstrated experience in web scraping and data extraction along with the ability to communicate effectively and adhere to set deadlines.
Responsibilities:
- Develop and maintain a service that extracts website data using scrapers and APIs across multiple sophisticated websites.
- Extract structured/unstructured data and manipulate data through text processing, image processing, regular expressions etc.
- Writing reusable, testable, and efficient code
- Seeking a Python Developer to develop and maintain web scraping solutions using BeautifulSoup, Scrapy, and Selenium.
- Responsibilities include handling dynamic content, proxies, CAPTCHAs, data extraction, optimization, and ensuring data accuracy.
- Implement and maintain robust, full-stack applications for web crawlers.
- Troubleshoot, debug, and improve existing web crawlers and data extraction systems.
- Utilize tools such as Scrapy and the Spider tool to enhance data crawling capabilities.
- Requirements:
- 0.5-2 years of work experience in Python-based web scraping
- Sound understanding and knowledge of Python and good experience in any of the web crawling tools like requests, scrapy, BeautifulSoup, Selenium etc.
- Strong interpersonal, verbal, and written communication skills in English
We are seeking an experienced Web Scraping Engineer to data extraction efforts for our enterprise clients. In this role, you will be tasked with creating and maintaining robust, large-scale scraping systems for gathering structured data.
Responsibilities:
Develop and optimize custom web scraping tools and workflows.
Integrate scraping systems with data storage solutions like SQL and NoSQL databases.
Troubleshoot and resolve scraping challenges, including CAPTCHAs, rate limiting, and IP blocking.
Provide technical guidance on scraping best practices and standards.
Skills Required:
Expert in Python and scraping libraries such as Scrapy and BeautifulSoup.
Deep understanding of web scraping techniques and challenges (CAPTCHAs, anti-bot measures).
Experience with cloud platforms (AWS, Google Cloud).
Strong background in databases and data storage systems (SQL, MongoDB).
- Does analytics to extract insights from raw historical data of the organization.
- Generates usable training dataset for any/all MV projects with the help of Annotators, if needed.
- Analyses user trends, and identifies their biggest bottlenecks in Hammoq Workflow.
- Tests the short/long term impact of productized MV models on those trends.
- Skills - Numpy, Pandas, SPARK, APACHE SPARK, PYSPARK, ETL mandatory.

● Good communication and collaboration skills with 4-7 years of experience.
● Ability to code and script with strong grasp of CS fundamentals, excellent problem solving abilities.
● Comfort with frequent, incremental code testing and deployment, Data management skills
● Good understanding of RDBMS
● Experience in building Data pipelines and processing large datasets .
● Knowledge of building Web Scraping and data mining is a plus.
● Working knowledge of open source tools such as mysql, Solr, ElasticSearch, Cassandra ( data stores )
would be a plus.
● Expert in Python programming
Role and responsibilities
● Inclined towards working in a start-up environment.
● Comfort with frequent, incremental code testing and deployment, Data management skills
● Design and Build robust and scalable data engineering solutions for structured and unstructured data for
delivering business insights, reporting and analytics.
● Expertise in troubleshooting, debugging, data completeness and quality issues and scaling overall
system performance.
● Build robust API ’s that powers our delivery points (Dashboards, Visualizations and other integrations).