https://github.com/thisisparker/78_sampler
Bot powering @78_sampler, posting video clips of digitized 78 records from the Internet Archive
https://github.com/thisisparker/78_sampler
Last synced: 11 months ago
JSON representation
Bot powering @78_sampler, posting video clips of digitized 78 records from the Internet Archive
- Host: GitHub
- URL: https://github.com/thisisparker/78_sampler
- Owner: thisisparker
- Created: 2017-08-09T17:37:03.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2023-10-03T23:47:45.000Z (over 2 years ago)
- Last Synced: 2025-07-18T07:42:20.475Z (11 months ago)
- Language: Python
- Size: 17.5 MB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# @78_sampler
This bot picks a random 78 record from the Great 78 Project, downloads the image and audio, and renders together a 140-second clip suitable for posting to Twitter. Then it uploads that video with a link to the tune. Follow it at [@78_sampler](https://twitter.com/78_sampler).
The `internetarchive` module required by this script also provides a command line interface that can be used to generate the `georgeblood.txt` file. The [George Blood collection at the Internet Archive](https://archive.org/details/georgeblood) is continually being expanded, so I run the following command on occasion:
```
ia search collection:georgeblood --itemlist > georgeblood.txt
```
Note that the collection currently contains over 300,000 items, so it is normal for that command to take some time to run.
The collection contains some items I never want the bot to tweet; I maintain a file called `exclude.txt` with a series of strings that should not appear. I apply the filter in line with the request with a simple grep:
```
ia search collection:georgeblood --itemlist | grep -iv -f exclude.txt > georgeblood.txt
```