https://github.com/gamallo/mastodon-scrape
Scrapper of toots from Mastodon
https://github.com/gamallo/mastodon-scrape
mastodon-bot scraper scraper-python
Last synced: 3 months ago
JSON representation
Scrapper of toots from Mastodon
- Host: GitHub
- URL: https://github.com/gamallo/mastodon-scrape
- Owner: gamallo
- License: gpl-3.0
- Created: 2023-11-21T19:42:23.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-03-19T16:40:38.000Z (almost 2 years ago)
- Last Synced: 2025-07-27T00:31:06.001Z (5 months ago)
- Topics: mastodon-bot, scraper, scraper-python
- Language: Python
- Homepage:
- Size: 27.3 KB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# mastodon-scrape
Scraping toots from Mastodon (instance `mastodon.social`) without any authentification
## Requeriments:
* Bash and Python3
* Python modules: pandas, requests, json (use `pip install`)
## How to use:
* Module `timeline`: this module uses hashtags to retrieve toots over time, starting with the present and looking backwards. You need to specify a hashtag/keyword (for instance `-k putin`) and a specific period of time: for instance: `-s 2023-11-18 -t 2023-11-19`. This is a possible query:
```./mastodon -m timeline -k putin -s 2023-11-18 -t 2023-11-19```
* With the module `timeline', you can use language selection (pt, en, gl, es, ...):
```./mastodon -m timeline -k putin -s 2023-11-18 -t 2023-11-19 -l en```
* Module `search`: retrieve toots using a keyword. It only allows you to extract 40 toots for each search:
```./mastodon -m search -k trump```
## Output
The search gives a text plain file (`content.txt`) in directory `output`. Metadata of all retrieved toots are stored in directory `temp`.