https://github.com/ines-om/webscraping-ao3
WebScraping Project using BeautifulSoup4 - made for an internship at a social media startup 🍉
archiveofourown csv-files email-marketing fandom fanfiction lxml python python-script user user-management users webscraping webscrapper
Last synced: over 1 year ago
- Host: GitHub
- URL: https://github.com/ines-om/webscraping-ao3
- Owner: ines-om
- Created: 2022-02-04T12:14:01.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-05-02T21:29:25.000Z (about 4 years ago)
- Last Synced: 2024-08-27T00:22:31.338Z (almost 2 years ago)
- Topics: archiveofourown, csv-files, email-marketing, fandom, fanfiction, lxml, python, python-script, user, user-management, users, webscraping, webscrapper
- Homepage:
- Size: 26.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
README
# WebScraping-ao3
I'm currently involved in a project that aims to give fandom creatives a new platform where they can truly develop their craft.
To do this, contacting said creatives is essential. We decided to scrape users' public emails from their profiles.
Profiles were selected based on their presence in the "recent" section of Archive of Our Own pages.
Other relevant information to be scraped:
- tags of the works they publish (a.k.a. the fandoms they're involved with), covering both characters and ships
- their date of birth (only a possibility for now, but the app being advertised *is* Gen Z-centric)
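As a rough illustration of the tag extraction described above, here is a minimal sketch using BeautifulSoup4. It parses a small inline HTML sample rather than a live page. The markup (`li.characters`, `li.relationships`, `a.tag`), the `SAMPLE_HTML` string, and the `extract_tags` helper are assumptions made for this example, not code taken from the project. It uses the standard-library `html.parser` backend so it runs without `lxml`.

```python
from bs4 import BeautifulSoup

# Hypothetical markup, assumed for illustration only; the real page
# structure may differ and should be checked before relying on it.
SAMPLE_HTML = """
<ul class="tags commas">
  <li class="relationships"><a class="tag" href="#">Character A/Character B</a></li>
  <li class="characters"><a class="tag" href="#">Character A</a></li>
  <li class="characters"><a class="tag" href="#">Character B</a></li>
  <li class="freeforms"><a class="tag" href="#">Fluff</a></li>
</ul>
"""


def extract_tags(html):
    """Return character and ship tags found in an HTML fragment."""
    soup = BeautifulSoup(html, "html.parser")

    def texts(css_class):
        # Collect the link text of every tag inside <li class="css_class">.
        return [a.get_text(strip=True) for a in soup.select(f"li.{css_class} a.tag")]

    return {
        "characters": texts("characters"),
        "ships": texts("relationships"),
    }


tags = extract_tags(SAMPLE_HTML)
print(tags)
```

Returning a plain dictionary keeps the result easy to flatten into a row later on.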
The data is stored in a CSV file.
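A minimal sketch of how such records could be written with Python's standard `csv` module. The column names (`username`, `email`, `characters`, `ships`), the sample row, and the `write_rows` helper are hypothetical and only illustrate one possible layout; the project's actual schema may differ.

```python
import csv
import io

# Hypothetical column layout, assumed for this example.
FIELDNAMES = ["username", "email", "characters", "ships"]


def write_rows(rows, out):
    """Write one CSV line per record to the file-like object `out`."""
    writer = csv.DictWriter(out, fieldnames=FIELDNAMES)
    writer.writeheader()
    for row in rows:
        writer.writerow({
            "username": row["username"],
            "email": row["email"],
            # Multi-valued fields are joined so each record stays on one line.
            "characters": "; ".join(row["characters"]),
            "ships": "; ".join(row["ships"]),
        })


# Placeholder record, not real data.
rows = [
    {
        "username": "example_user",
        "email": "user@example.com",
        "characters": ["Character A", "Character B"],
        "ships": ["Character A/Character B"],
    }
]

buffer = io.StringIO()
write_rows(rows, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To write to disk instead of an in-memory buffer, pass a file opened with `open("users.csv", "w", newline="", encoding="utf-8")`; the `newline=""` argument is what the `csv` module documentation recommends to avoid blank lines on some platforms.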