https://github.com/radical-data/arte-util-scraper
Software to scrape the Arte Útil archive.
https://github.com/radical-data/arte-util-scraper
beautifulsoup beautifulsoup4 pandas pandas-dataframe python scraper scraping-python scraping-websites
Last synced: about 2 months ago
JSON representation
Software to scrape the Arte Útil archive.
- Host: GitHub
- URL: https://github.com/radical-data/arte-util-scraper
- Owner: radical-data
- Created: 2022-06-04T14:00:37.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-06-04T14:06:26.000Z (about 4 years ago)
- Last Synced: 2025-02-26T07:23:59.481Z (over 1 year ago)
- Topics: beautifulsoup, beautifulsoup4, pandas, pandas-dataframe, python, scraper, scraping-python, scraping-websites
- Language: Python
- Homepage:
- Size: 2.93 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Arte Útil Scraper
## About the Project
This software scrapes the [Arte Útil archive](https://www.arte-util.org/projects/).
Arte Útil is a project that has inspired Radical Data's work, showing the possibilities of art as a tool within society. As the website hosting the archive has a limited number of possible interactions, we wanted an easier way to explore the projects in the archive.
We built this scraper to collect all of the projects' data into a spreadsheet and download all of the images.
## Getting Started
1. Install Python
1. Install `pandas` and `lxml`
2. Navigate to this folder: `cd ./arte-util-scraper`
3. Run `python scrape-archive.py`
## Other thoughts
Though I'm sure we would have done it anyway, Arte Útil encourages people to create alternative presentations of the Archive.
> Users are invited to download, print and build their own archive.
> _[Source](https://www.arte-util.org/tools/structure/)_