Projects in Awesome Lists by AndyTheFactory
A curated list of projects in awesome lists by AndyTheFactory .
https://github.com/andythefactory/newspaper4k
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
articles articles-data crawler datasets-preparation news newspaper3k python requests scraper scraping
Last synced: 15 May 2025
https://github.com/AndyTheFactory/newspaper4k
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
articles articles-data crawler datasets-preparation news newspaper3k python requests scraper scraping
Last synced: 14 Mar 2025
https://github.com/andythefactory/romanian-nlp-datasets
A list of Romanian NLP Datasets
nlp nlp-data nlp-dataset nlp-datasets nlp-resources romanian romanian-language
Last synced: 18 Feb 2025
https://github.com/andythefactory/article-extraction-dataset
Article title, authors, date and body extraction dataset.
article-extractor corpus corpus-builder corpus-tools dataset datasets html-to-markdown html2text news news-aggregator news-crawler readability scraping scraping-websites text-cleaning text-extraction text-mining text-preprocessing web-scraping
Last synced: 18 Feb 2025
https://github.com/andythefactory/ro-diacritics
Python package for Romanian diacritics restoration
bert diacritics diacritics-removal diacritics-restoration nlp romanian romanian-bert romanian-diacritics-restoration romanian-language transformers transformers-models
Last synced: 12 Apr 2025
https://github.com/andythefactory/ninox-api
An API wrapper for the ninox db api (https://docs.ninox.com/en/api/public-cloud-apis)
api api-client api-rest api-wrapper api-wrappers database database-connector ninox python python-3 python3
Last synced: 12 Mar 2025
https://github.com/andythefactory/fakenewsdataset
a consolidated and cleaned up fake news dataset classified in the following categories: reliable, unreliable, political, bias, fake, conspiracy, rumor clickbait, junk science, satire, hate
datasets deep-learning deep-neural-networks disinformation fake-news fake-news-analysis fake-news-articles fake-news-challenge fake-news-classification fake-news-dataset fake-news-detection misinformation
Last synced: 18 Feb 2025
https://github.com/andythefactory/jsdb
Fork of JSDB a javascript based scripting engine - originated from http://www.jsdb.org
Last synced: 18 Feb 2025
https://github.com/andythefactory/ro-paraphrase-bible
Romanian paraphrase corpus based on different translations/versions of the bible
Last synced: 18 Feb 2025
https://github.com/andythefactory/gzip_ranged_simple_httpserver
SimpleHTTPServer with support for Range requests and GZip Compressing
Last synced: 18 Feb 2025
https://github.com/andythefactory/keyla
Automatically exported from code.google.com/p/keyla
Last synced: 18 Feb 2025
https://github.com/andythefactory/delphichromiumembedded
Automatically exported from code.google.com/p/delphichromiumembedded
Last synced: 18 Feb 2025
https://github.com/andythefactory/germeval2019
UPB contribuition to GermEval 2019
Last synced: 18 Feb 2025
https://github.com/andythefactory/thephpfactory.com
Repository for thephpfactory.com website
Last synced: 26 Mar 2025