Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/thunchanokbow/extract-data-from-website
https://github.com/thunchanokbow/extract-data-from-website
beautifulsoup python3 selenium selenium-webdriver
Last synced: 11 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/thunchanokbow/extract-data-from-website
- Owner: thunchanokbow
- Created: 2023-10-25T02:33:08.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-11-02T04:18:43.000Z (about 1 year ago)
- Last Synced: 2024-11-11T10:36:21.634Z (2 months ago)
- Topics: beautifulsoup, python3, selenium, selenium-webdriver
- Language: Python
- Homepage:
- Size: 3.91 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Extract Data from Indeed Website
============The code is a Python script that uses BeautifulSoup to extract job titles, company names, locations, and salaries from Indeed job listings. The script works by first parsing **the HTML source code** of a Indeed job search page using BeautifulSoup.
Once the HTML is parsed, the script uses method to **find all of** the elements on the page These elements contain the individual **job listings**.
**After the script has processed** all of the job listings on the page, **it checks to see if there is a next button**. If there is a next button and it is enabled, the script clicks the button to navigate to the next page of job listings. **If the close button is enabled, then close the popup**.