https://github.com/nick-peter-marcus/job-scraper-email-alert
Python script for job alerts and email notification
https://github.com/nick-peter-marcus/job-scraper-email-alert
email-notification python selenium-webdriver webscraping
Last synced: 4 months ago
JSON representation
Python script for job alerts and email notification
- Host: GitHub
- URL: https://github.com/nick-peter-marcus/job-scraper-email-alert
- Owner: nick-peter-marcus
- License: mit
- Created: 2023-05-04T01:19:22.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-04-03T21:10:11.000Z (7 months ago)
- Last Synced: 2025-04-30T22:49:20.646Z (6 months ago)
- Topics: email-notification, python, selenium-webdriver, webscraping
- Language: Python
- Homepage:
- Size: 129 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Python script for job alerts and email notification.
I built this Python script to get email notifications whenever new job ads have been posted on career sites relevant to me.
There is one central, modular script, job_scrape.py, calling individual functions in /websites/ each scraping a different career site.
Each scraper module works as follows:
1. Parse* webpages containing current job listings.
2. Process and store job details in a dictionary.
3. Compare current postings with those from last execution.
4. Extract new postings and, if applicable, filter on relevant criteria (e.g. location).
5. Return a dictionary containing details of the filtered postings, or None when there are no new jobs.
These results are then joined as formatted email texts (MIMEMultipart class) in job_scrape.py. A secured SMTP connection will be started and the email will be send.
This script is executed every 24h as a scheduled task on PythonAnywhere.
*Note:
For static webpages, the BeautifulSoup package is used to scrape and parse HTML-documents.
For dynamic webpages, Selenium's WebDriver is utilized, initiating a headless browser to capture rendered data.