Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/erfannoury/pcrawler
Persian Twitter crawler
https://github.com/erfannoury/pcrawler
Last synced: 3 months ago
JSON representation
Persian Twitter crawler
- Host: GitHub
- URL: https://github.com/erfannoury/pcrawler
- Owner: erfannoury
- License: gpl-3.0
- Created: 2017-12-29T20:53:55.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-02-13T23:42:31.000Z (over 6 years ago)
- Last Synced: 2024-06-28T11:31:47.562Z (5 months ago)
- Language: Python
- Size: 30.7 MB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PCrawler
This is a fork of the awesome [Trenditter](https://github.com/trenditter/trenditter) project, modified to crawl and preserve tweets in Persian.
## TODO
- [ ] Better language identification to differentate between Persian and Arabic tweets, because at the moment, the Twitter language detection cannot with a high accuracy differentiate between Persian and Arabic tweets.
- [ ] Find Twitter bots and remove contributions (tweets, retweets) from them.