An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with text-datasets

A curated list of projects in awesome lists tagged with text-datasets .

https://github.com/EmilHvitfeldt/textdata

Download, parse, store, and load text datasets instead of storing it in packages

r rstats text-datasets

Last synced: 30 Jul 2025

https://github.com/emilhvitfeldt/textdata

Download, parse, store, and load text datasets instead of storing it in packages

r rstats text-datasets

Last synced: 10 Apr 2025

https://github.com/nuhmanpk/Webtrench

A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code

audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets

Last synced: 08 Jul 2025

https://github.com/nuhmanpk/webtrench

A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code

audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets

Last synced: 21 Mar 2025

https://github.com/hsankesara/the-tweets-of-wisdom

A dataset which contains 30k+ so called "self-help" tweets from 100+ authors.

nlp text-data text-datasets tweepy tweets

Last synced: 13 Oct 2025

https://github.com/nevmenandr/nazirov-texts-dataset

Датасет с текстами Р. Г. Назирова

dataset text-datasets

Last synced: 03 Jan 2026

https://github.com/infinitode/duplipy

DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.

ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting

Last synced: 15 Apr 2026