https://github.com/gamallo/mastodon-scrape

Scraper of toots from Mastodon
mastodon-bot scraper scraper-python

Host: GitHub
URL: https://github.com/gamallo/mastodon-scrape
Owner: gamallo
License: gpl-3.0
Created: 2023-11-21T19:42:23.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-03-19T16:40:38.000Z (almost 2 years ago)
Last Synced: 2025-07-27T00:31:06.001Z (5 months ago)
Topics: mastodon-bot, scraper, scraper-python
Language: Python
Homepage:
Size: 27.3 KB
Stars: 1
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# mastodon-scrape

Scraping toots from Mastodon (instance `mastodon.social`) without any authentication

## Requirements:
* Bash and Python3
* Python modules: pandas and requests (install with `pip install pandas requests`); `json` is part of the Python standard library and needs no installation

## How to use:

* Module `timeline`: retrieves toots for a hashtag, starting from the present and walking backwards in time. Specify a hashtag/keyword (for instance `-k putin`) and a time period (for instance `-s 2023-11-18 -t 2023-11-19`). A possible query:

```./mastodon -m timeline -k putin -s 2023-11-18 -t 2023-11-19```

* With the module `timeline`, you can also select a language (pt, en, gl, es, ...) with `-l`:

```./mastodon -m timeline -k putin -s 2023-11-18 -t 2023-11-19 -l en```

* Module `search`: retrieves toots matching a keyword. Each search returns at most 40 toots:

```./mastodon -m search -k trump```
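To illustrate the backwards walk that the `timeline` module describes, here is a minimal Python sketch. It is not taken from this repository: it assumes the public hashtag endpoint of the Mastodon REST API (`/api/v1/timelines/tag/<hashtag>` with the `limit` and `max_id` parameters), treats the two dates as an inclusive period, and uses only the standard library. The names `fetch_tag_page` and `toots_in_period` are hypothetical.

```python
import json
import urllib.parse
import urllib.request
from datetime import date, datetime

# Assumption: the instance named in this README and the public Mastodon REST API.
INSTANCE = "https://mastodon.social"


def fetch_tag_page(hashtag, max_id=None, limit=40):
    """Fetch one page of public toots for a hashtag, newest first."""
    params = {"limit": limit}
    if max_id is not None:
        params["max_id"] = max_id  # only toots older than this ID
    url = (f"{INSTANCE}/api/v1/timelines/tag/{urllib.parse.quote(hashtag)}"
           f"?{urllib.parse.urlencode(params)}")
    with urllib.request.urlopen(url, timeout=30) as resp:
        return json.load(resp)


def toots_in_period(hashtag, start, end, lang=None, fetch_page=fetch_tag_page):
    """Yield toots created between start and end (inclusive), newest first."""
    max_id = None
    while True:
        page = fetch_page(hashtag, max_id)
        if not page:
            return  # no older toots left
        for toot in page:
            created = datetime.fromisoformat(
                toot["created_at"].replace("Z", "+00:00")).date()
            if created < start:
                # Simplification: assumes pages are in chronological order.
                return
            if created <= end and (lang is None or toot.get("language") == lang):
                yield toot
        max_id = page[-1]["id"]  # continue with toots older than the last one


if __name__ == "__main__":
    period = (date(2023, 11, 18), date(2023, 11, 19))
    for toot in toots_in_period("putin", *period, lang="en"):
        print(toot["created_at"], toot["url"])
```

Passing `fetch_page` as a parameter keeps the paging logic testable without network access; a real run should also pause between requests to respect the instance's rate limits.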

## Output

The search writes a plain-text file (`content.txt`) to the `output` directory. Metadata for all retrieved toots is stored in the `temp` directory.
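The Mastodon API returns each toot's `content` field as HTML, so producing a plain-text file requires stripping the markup. The sketch below shows one way this could be done with the standard library. It is an illustration, not the repository's code: the helper names, the one-toot-per-line layout and the metadata file name `toots.json` are all assumptions.

```python
import json
from html.parser import HTMLParser
from pathlib import Path


class _TextExtractor(HTMLParser):
    """Collect the text of an HTML fragment, turning <br> and </p> into spaces."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "br":
            self.parts.append(" ")

    def handle_endtag(self, tag):
        if tag == "p":
            self.parts.append(" ")

    def handle_data(self, data):
        self.parts.append(data)


def toot_text(html_content):
    """Return the toot's HTML content as a single line of plain text."""
    parser = _TextExtractor()
    parser.feed(html_content)
    parser.close()
    return " ".join("".join(parser.parts).split())  # collapse whitespace


def save_toots(toots, out_dir="output", meta_dir="temp"):
    """Write one plain-text line per toot and keep the raw metadata as JSON."""
    out, meta = Path(out_dir), Path(meta_dir)
    out.mkdir(parents=True, exist_ok=True)
    meta.mkdir(parents=True, exist_ok=True)
    with open(out / "content.txt", "w", encoding="utf-8") as fh:
        for toot in toots:
            fh.write(toot_text(toot["content"]) + "\n")
    # Hypothetical file name; the README only says metadata goes to `temp`.
    with open(meta / "toots.json", "w", encoding="utf-8") as fh:
        json.dump(toots, fh, ensure_ascii=False, indent=2)
```

For example, `save_toots([{"id": "1", "content": "<p>Hello<br />world</p>"}])` would create `output/content.txt` containing the line `Hello world`.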
