Projects in Awesome Lists tagged with text-datasets
A curated list of projects in awesome lists tagged with text-datasets .
https://github.com/EmilHvitfeldt/textdata
Download, parse, store, and load text datasets instead of storing it in packages
Last synced: 30 Jul 2025
https://github.com/emilhvitfeldt/textdata
Download, parse, store, and load text datasets instead of storing it in packages
Last synced: 10 Apr 2025
https://github.com/nuhmanpk/Webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets
Last synced: 08 Jul 2025
https://github.com/nuhmanpk/webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets
Last synced: 21 Mar 2025
https://github.com/hsankesara/the-tweets-of-wisdom
A dataset which contains 30k+ so called "self-help" tweets from 100+ authors.
nlp text-data text-datasets tweepy tweets
Last synced: 13 Oct 2025
https://github.com/nevmenandr/nazirov-texts-dataset
Датасет с текстами Р. Г. Назирова
Last synced: 03 Jan 2026
https://github.com/infinitode/duplipy
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.
ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting
Last synced: 15 Apr 2026