Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ftkurt/kurdish-twitter-data
Kurdish twitter data repository for Kurmanji and Sorani dialects
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/ftkurt/kurdish-twitter-data
- Owner: ftkurt
- License: gpl-3.0
- Created: 2020-06-07T13:39:26.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2020-06-07T21:27:15.000Z (over 4 years ago)
- Last Synced: 2024-01-28T23:08:40.078Z (9 months ago)
- Topics: datasets, kurdish, kurdish-language-library, twitter
- Size: 3.83 MB
- Stars: 4
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-kurdish - A Twitter dataset
README
# kurdish-twitter-data
Kurdish Twitter data repository for the Kurmanji and Sorani dialects.

This dataset includes a total of 29011 Kurmanji and 29010 Sorani tweets.
- Each line contains the text of one tweet
- No repeated content; each text entry is unique
- User-ID mentions and URLs are replaced by USER_ID and URL, respectively
- Any newline characters are removed, which is what allows the one-tweet-per-line rule above
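
The conventions listed above could be reproduced or checked with a short script. The following is a minimal sketch, not the author's actual preprocessing: the regular expressions for mentions and URLs, the function names, and the file path passed to `load_tweets` are all assumptions, since the README does not document how the replacement was done or how the data files are named.

```python
import re

# Assumed patterns: the README does not state the exact rules that were used.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")


def normalize_tweet(text: str) -> str:
    """Apply the placeholder and single-line conventions to one raw tweet."""
    text = URL_RE.sub("URL", text)
    text = MENTION_RE.sub("USER_ID", text)
    # Collapse newlines and other whitespace runs so the tweet fits on one line.
    return " ".join(text.split())


def build_lines(raw_tweets):
    """Normalize tweets, dropping empty results and repeated texts."""
    seen = set()
    lines = []
    for raw in raw_tweets:
        line = normalize_tweet(raw)
        if line and line not in seen:
            seen.add(line)
            lines.append(line)
    return lines


def load_tweets(path):
    """Read a one-tweet-per-line file (hypothetical path) into a list."""
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle if line.strip()]
```

With a local copy of one of the data files, `len(set(load_tweets(path))) == len(load_tweets(path))` would confirm the uniqueness claim for that file.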