https://github.com/thunchanokbow/extract-data-from-website

beautifulsoup python3 selenium selenium-webdriver


Host: GitHub
URL: https://github.com/thunchanokbow/extract-data-from-website
Owner: thunchanokbow
Created: 2023-10-25T02:33:08.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2023-11-02T04:18:43.000Z (over 1 year ago)
Last Synced: 2025-01-09T09:58:45.473Z (6 months ago)
Topics: beautifulsoup, python3, selenium, selenium-webdriver
Language: Python
Homepage:
Size: 3.91 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

Extract Data from Indeed Website
============

The code is a Python script that uses BeautifulSoup to extract job titles, company names, locations, and salaries from Indeed job listings. The script first parses **the HTML source code** of an Indeed job search page with BeautifulSoup.
Once the HTML is parsed, the script uses a search method to **find all of** the elements on the page that contain the individual **job listings**.
**After the script has processed** all of the job listings on the page, **it checks whether there is a next button**. If there is one and it is enabled, the script clicks it to navigate to the next page of job listings. **If a popup's close button is enabled, the script clicks it to close the popup**.
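
The flow described above could be sketched roughly as follows. This is not the repository's actual code: the sample markup, the class names (`job-card`, `job-title`, `company`, `location`, `salary`), the two CSS selectors, and the function names `parse_listings` and `scrape_all_pages` are all assumptions made for illustration, and Indeed's real markup will differ. The pagination half assumes a Selenium WebDriver, as suggested by the repository topics.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a job search results page.
SAMPLE_HTML = """
<div class="job-card">
  <h2 class="job-title">Data Analyst</h2>
  <span class="company">Acme Co</span>
  <div class="location">Bangkok</div>
  <div class="salary">$50,000 a year</div>
</div>
<div class="job-card">
  <h2 class="job-title">Python Developer</h2>
  <span class="company">Example Ltd</span>
  <div class="location">Remote</div>
</div>
"""

# Hypothetical selectors; the real page needs its own.
NEXT_SELECTOR = "a.next-page"
CLOSE_SELECTOR = "button.popup-close"


def parse_listings(html):
    """Return one dict per job card found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for card in soup.find_all("div", class_="job-card"):

        def text_of(css_class):
            # Listings often omit fields (salary especially), so
            # return None instead of failing on a missing element.
            node = card.find(class_=css_class)
            return node.get_text(strip=True) if node else None

        rows.append({
            "title": text_of("job-title"),
            "company": text_of("company"),
            "location": text_of("location"),
            "salary": text_of("salary"),
        })
    return rows


def scrape_all_pages(driver, max_pages=5):
    """Parse each results page, then follow the next button while enabled."""
    # Imported here so the parsing half runs without Selenium installed.
    from selenium.webdriver.common.by import By

    results = []
    for _ in range(max_pages):
        results.extend(parse_listings(driver.page_source))

        next_buttons = driver.find_elements(By.CSS_SELECTOR, NEXT_SELECTOR)
        if not next_buttons or not next_buttons[0].is_enabled():
            break
        next_buttons[0].click()
        # A real script would wait for the new page to load here.

        # Dismiss a popup if its close button is present and enabled.
        for button in driver.find_elements(By.CSS_SELECTOR, CLOSE_SELECTOR):
            if button.is_displayed() and button.is_enabled():
                button.click()
    return results
```

Calling `parse_listings(SAMPLE_HTML)` exercises the parsing half without a browser; `scrape_all_pages` needs a live driver that has already loaded a search results page. The `max_pages` cap is an added safeguard against endless pagination, not something the README mentions.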
