Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/noracodes/temphael
A Tumblr-scraping text post bot
https://github.com/noracodes/temphael
bot markov-chain scraping tumblr
Last synced: 24 days ago
JSON representation
A Tumblr-scraping text post bot
- Host: GitHub
- URL: https://github.com/noracodes/temphael
- Owner: NoraCodes
- Created: 2017-04-20T17:26:43.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-09-17T01:23:50.000Z (about 7 years ago)
- Last Synced: 2024-10-04T12:25:59.414Z (about 1 month ago)
- Topics: bot, markov-chain, scraping, tumblr
- Language: Python
- Size: 6.84 KB
- Stars: 14
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Geneva Bot
Scrape Tumblr blogs for a corpus, convert it into a Markov probablility matrix,
and generate text posts in the style of the original blog.This process consists of two scripts, `tscrape.py` and `genevabot.py`. The
first scrapes a Tumblr blog for text and tags (because lots of great content is
included in Tumblr tags!) and creates a PyMarkovChain probability database from
that data. The second simply reconsititues the probability database into memory
and generates some sentances from it.Here is a sample usage:
```
13:03:39: leo [~/Projects/genevabot]
$ ./tscrape.py clientsfromhell 1 10 --notags
Scraping page 1
[...]
Scraping page 10
Generating database...
Dumping database to clientsfromhell.posts.markov13:03:59: leo [~/Projects/genevabot]
$ ./genevabot.py clientsfromhell.posts.markov 10
Perfect for everything from invitations to logos, the 35+ unique fonts – for example, an idea
logo, and brand image for my sister’s cake
It’s a Samsung iPhone
Me: I removed all the details
do it for free, and still
By making it easy to make bold and professional designs,
Thanks for the time to actually review said contract, I was
After this phone call, I received some revisions from said customer that came in to work with us can pay our full fees and is standard across
I sent it over – along with a
What will we do
```