
6+ Beautiful Soup Jobs in India

Apply to 6+ Beautiful Soup Jobs on CutShort.io. Find your next job, effortlessly. Browse Beautiful Soup Jobs and apply today!

Hypersonix Inc

at Hypersonix Inc

2 candid answers
1 product
Posted by Reshika Mendiratta
Remote only
8yrs+
Up to ₹30L / yr (Varies)
Web Scraping
Python
Selenium
HTML/CSS
XPath
+2 more

About the Company

Hypersonix.ai is disrupting the e-commerce space with AI, ML, and advanced decision-making capabilities to drive real-time business insights. Built from the ground up using modern technologies, Hypersonix simplifies data consumption for customers across various industry verticals. We are seeking a well-rounded, hands-on product leader to help manage key capabilities and features in our platform.


Position Overview

We are seeking a highly skilled Web Scraping Architect to join our team. The successful candidate will be responsible for designing, implementing, and maintaining web scraping processes to gather data from various online sources efficiently and accurately. As a Web Scraping Specialist, you will play a crucial role in collecting data for competitor analysis and other business intelligence purposes.


Responsibilities

  • Scalability/Performance: Lead and provide expertise in scraping e-commerce marketplaces at scale.
  • Data Source Identification: Identify relevant websites and online sources from which data needs to be scraped. Collaborate with the team to understand data requirements and objectives.
  • Web Scraping Design: Develop and implement effective web scraping strategies to extract data from targeted websites. This includes selecting appropriate tools, libraries, or frameworks for the task.
  • Data Extraction: Create and maintain web scraping scripts or programs to extract the required data. Ensure the code is optimized, reliable, and can handle changes in the website's structure.
  • Data Cleansing and Validation: Cleanse and validate the collected data to eliminate errors, inconsistencies, and duplicates. Ensure data integrity and accuracy throughout the process.
  • Monitoring and Maintenance: Continuously monitor and maintain the web scraping processes. Address any issues that arise due to website changes, data format modifications, or anti-scraping mechanisms.
  • Scalability and Performance: Optimize web scraping procedures for efficiency and scalability, especially when dealing with a large volume of data or multiple data sources.
  • Compliance and Legal Considerations: Stay up-to-date with legal and ethical considerations related to web scraping, including website terms of service, copyright, and privacy regulations.
  • Documentation: Maintain detailed documentation of web scraping processes, data sources, and methodologies. Create clear and concise instructions for others to follow.
  • Collaboration: Collaborate with other teams such as data analysts, developers, and business stakeholders to understand data requirements and deliver insights effectively.
  • Security: Implement security measures to ensure the confidentiality and protection of sensitive data throughout the scraping process.
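
For illustration only, the "Data Cleansing and Validation" responsibility above might look something like the following minimal Python sketch. The `Product` record, its field names (`url`, `title`, `price`), and the validation rules are hypothetical assumptions made for this example and are not taken from the posting; a real pipeline would define its own schema and rules.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Product:
    """Hypothetical shape of one cleaned record (an assumption for this sketch)."""
    url: str
    title: str
    price: float


def clean_record(raw: dict) -> Optional[Product]:
    """Normalize one raw scraped row; return None if it fails validation."""
    url = (raw.get("url") or "").strip()
    # Collapse runs of whitespace left over from HTML formatting.
    title = " ".join((raw.get("title") or "").split())
    # Strip thousands separators and a leading currency symbol.
    price_text = (raw.get("price") or "").replace(",", "").strip().lstrip("$₹")
    if not url.startswith(("http://", "https://")) or not title:
        return None
    try:
        price = float(price_text)
    except ValueError:
        return None
    if price < 0:
        return None
    return Product(url=url, title=title, price=price)


def cleanse(rows: Iterable[dict]) -> list[Product]:
    """Validate rows and drop duplicates, keeping the first row seen per URL."""
    seen: set[str] = set()
    cleaned: list[Product] = []
    for raw in rows:
        record = clean_record(raw)
        if record is None or record.url in seen:
            continue
        seen.add(record.url)
        cleaned.append(record)
    return cleaned
```

Deduplicating on the URL is only one possible choice of key; a product ID or a normalized title could serve equally well depending on the source.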


Requirements

  • Proven experience of 7+ years as a Web Scraping Specialist or similar role, with a track record of successful web scraping projects
  • Expertise in handling dynamic content, user-agent rotation, bypassing CAPTCHAs, rate limits, and use of proxy services
  • Knowledge of browser fingerprinting
  • Has leadership experience
  • Proficiency in Python and in libraries and tools commonly used for web scraping, such as BeautifulSoup, Scrapy, or Selenium
  • Strong knowledge of HTML, CSS, XPath, and other web technologies relevant to web scraping and coding
  • Knowledge and experience in best-in-class data storage and retrieval for large volumes of scraped data
  • Understanding of web scraping best practices, including handling dynamic content, user-agent rotation, and IP address management
  • Attention to detail and ability to handle and process large volumes of data accurately
  • Familiarity with data cleansing techniques and data validation processes
  • Good communication skills and ability to collaborate effectively with cross-functional teams
  • Knowledge of web scraping ethics, legal considerations, and compliance with website terms of service
  • Strong problem-solving skills and adaptability to changing web environments


Preferred Qualifications

  • Bachelor’s degree in Computer Science, Data Science, Information Technology, or related fields
  • Experience with cloud-based solutions and distributed web scraping systems
  • Familiarity with APIs and data extraction from non-public sources
  • Knowledge of machine learning techniques for data extraction and natural language processing is desired but not mandatory
  • Prior experience in handling large-scale data projects and working with big data frameworks
  • Understanding of various data formats such as JSON, XML, CSV, etc.
  • Experience with version control systems like Git
Gmware Pvt Ltd
Bengaluru (Bangalore)
0 - 2 yrs
₹3L - ₹4L / yr
Web Scraping
Web crawling
scrapy
Beautiful Soup

We are seeking an experienced Web Scraping Engineer to lead data extraction efforts for our enterprise clients. In this role, you will be tasked with creating and maintaining robust, large-scale scraping systems for gathering structured data.


Responsibilities:


Develop and optimize custom web scraping tools and workflows.

Integrate scraping systems with data storage solutions like SQL and NoSQL databases.

Troubleshoot and resolve scraping challenges, including CAPTCHAs, rate limiting, and IP blocking.

Provide technical guidance on scraping best practices and standards.


Skills Required:


Expert in Python and scraping libraries such as Scrapy and BeautifulSoup.

Deep understanding of web scraping techniques and challenges (CAPTCHAs, anti-bot measures).

Experience with cloud platforms (AWS, Google Cloud).

Strong background in databases and data storage systems (SQL, MongoDB).

InstaClipapp

at InstaClipapp

2 candid answers
Posted by Rishikesh Chougule
Remote only
2 - 5 yrs
$23K - $27K / yr
Python
Django
Flask
Beautiful Soup

Profile: Backend API developer

Should be well versed in APIs for social media scraping.


Full time remote job


OJCommerce

at OJCommerce

3 recruiters
Posted by Rajalakshmi N
Chennai
2 - 5 yrs
₹7L - ₹12L / yr
Beautiful Soup
Web Scraping
Python
Selenium

Role : Web Scraping Engineer

Experience : 2 to 3 Years

Job Location : Chennai

About OJ Commerce: 


OJ Commerce (OJC), a rapidly expanding and profitable online retailer, is headquartered in Florida, USA, with a fully-functional office in Chennai, India. We deliver exceptional value to our customers by harnessing cutting-edge technology, fostering innovation, and establishing strategic brand partnerships to enable a seamless, enjoyable shopping experience featuring high-quality products at unbeatable prices. Our advanced, data-driven system streamlines operations with minimal human intervention.

Our extensive product portfolio encompasses over a million SKUs and more than 2,500 brands across eight primary categories. With a robust presence on major platforms such as Amazon, Walmart, Wayfair, Home Depot, and eBay, we directly serve consumers in the United States.

As we continue to forge new partner relationships, our flagship website, www.ojcommerce.com, has rapidly emerged as a top-performing e-commerce channel, catering to millions of customers annually.

Job Summary:

We are seeking a Web Scraping Engineer and Data Extraction Specialist who will play a crucial role in our data acquisition and management processes. The ideal candidate will be proficient in developing and maintaining efficient web crawlers capable of extracting data from large websites and storing it in a database. Strong expertise in Python, web crawling, and data extraction, along with familiarity with popular crawling tools and modules, is essential. Additionally, the candidate should demonstrate the ability to effectively utilize API tools for testing and retrieving data from various sources. Join our team and contribute to our data-driven success!


Responsibilities:


  • Develop and maintain web crawlers in Python.
  • Crawl large websites and extract data.
  • Store data in a database.
  • Analyze and report on data.
  • Work with other engineers to develop and improve our web crawling infrastructure.
  • Stay up to date on the latest crawling tools and techniques.
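
As a rough illustration of the first three responsibilities above (crawl, extract, store), here is a minimal Python sketch that parses links out of an HTML page and stores them in SQLite. It uses only the standard library (`html.parser` and `sqlite3`) so that it stays self-contained; in practice, a library named in this posting such as Beautiful Soup or Scrapy would likely handle the parsing. The `LinkExtractor` class, the `store_links` function, and the `links` table are hypothetical names chosen for this example, and the page is passed in as a string rather than fetched over the network.

```python
import sqlite3
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, text) pairs for every <a href=...> element in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.links: list[tuple[str, str]] = []
        self._href = None          # href of the <a> currently being read
        self._text: list[str] = []  # text fragments inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def store_links(conn: sqlite3.Connection, page_html: str) -> int:
    """Extract links from page_html, store them, and return the table's row count."""
    parser = LinkExtractor()
    parser.feed(page_html)
    parser.close()
    conn.execute(
        "CREATE TABLE IF NOT EXISTS links (href TEXT PRIMARY KEY, text TEXT)"
    )
    # The primary key plus INSERT OR IGNORE keeps the first row seen per href.
    conn.executemany(
        "INSERT OR IGNORE INTO links (href, text) VALUES (?, ?)", parser.links
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM links").fetchone()[0]
```

A caller would open a connection with `sqlite3.connect(...)` and call `store_links` once per downloaded page; making `href` the primary key means re-crawling a page does not create duplicate rows.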



Required Skills and Qualifications:


  • Bachelor's degree in computer science or a related field.
  • 2-3 years of experience with Python and web crawling.
  • Familiarity with tools / modules such as Scrapy, Selenium, Requests, Beautiful Soup, etc.
  • Familiarity with API tools such as Postman or equivalent.
  • Working knowledge of SQL.
  • Experience with web crawling and data extraction.
  • Strong problem-solving and analytical skills.
  • Ability to work independently and as part of a team.
  • Excellent communication and documentation skills.


What we Offer

• Competitive salary

• Medical Benefits/Accident Cover

• Flexi Office Working Hours

• Fast-paced start-up

Our client provides data solutions for fraud prevention

Agency job
Remote only
4 - 10 yrs
₹10L - ₹20L / yr
Web Scraping
Web crawling
Selenium
Selenium Web driver
Beautiful Soup
+6 more
  • Manage individual project priorities, deadlines, and deliverables

  • Gather and process raw data at scale from the web (including writing scripts, web scraping, calling/creating APIs, etc.)

  • Develop frameworks for automating and maintaining a constant flow of data from multiple sources

  • Identify, analyze, design, and implement internal process improvements

  • Design and implement tooling upgrades to increase stability and data quality

  • Help the team fix issues that occur in test and production environments

  • Automate software development processes, including build, deploy, and test

  • Manage and guide the team members

REQUIRED QUALIFICATIONS:

  • 4+ years of web crawling / scraping experience is a must

  • Strong knowledge of scraping frameworks such as Scrapy, Beautiful Soup, HTQL, Jsoup, Web-Harvest, and others

  • Excellent verbal, written, and interpersonal communication skills in English

  • Good to have: experience with complex crawling (such as CAPTCHAs, mobile OTP-based crawling, and bypassing proxies)

  • Sound knowledge of bot management techniques

  • Experience in various data extraction methods (such as data extraction from PDF files, web pages, etc.)

  • Good understanding of HTML DOM, CSS, JavaScript, and RESTful web services

  • Good to have: understanding of AWS

  • Experience with Linux

  • Experience with Java / Python

Pentoz Technology

at Pentoz Technology

2 recruiters
Posted by Yasodhamma Yasodhamma
Bengaluru (Bangalore)
2 - 9 yrs
₹2L - ₹10L / yr
Python
Django
NumPy
pandas
Beautiful Soup
+1 more
  • In-depth knowledge of core Python and Django for building end-to-end applications.
  • Experience in web technologies: HTML, CSS, JavaScript.

  • Databases: SQL Server / Postgres / NoSQL databases.

  • Good understanding of algorithms and data structures.
  • Knowledge of ORM (Object Relational Mapper) libraries.

  • Experience in integrating multiple data sources and databases into one system.

  • Knowledge of REST / SOAP APIs.
  • Knowledge of version control tools like Git.
  • Experience with various cloud technologies.

 

Why apply via Cutshort?
Connect with actual hiring teams and get their fast response. No spam.