Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with scraped-data
A curated list of projects in awesome lists tagged with scraped-data .
https://github.com/CUNY-CL/wikipron
Massively multilingual pronunciation mining
computational-linguistics g2p language linguistics nlp phonetics phonology pronunciation python-api scraped-data speech
Last synced: 04 Nov 2024
https://github.com/warifp/shopee-scrape
Shopee Scrape is a tool that functions to collect data - the data needed, such as finding data from photos, prices, names, store locations and others.
curl curl-functions curl-library curlphp indonesia marketplace php php-library scrape scrape-images scrape-websites scraped-data scraper scraper-engine shopee shopee-api
Last synced: 28 Oct 2024
https://github.com/swader/diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
ai artificial-intelligence bot crawl crawling diffbot machine-learning nlp php scrape scraped-data scraper scraping
Last synced: 20 Dec 2024
https://github.com/racinmat/mal-analysis
github repo for MyAnimeList analysis. Also links to the MAL dataset.
analysis anime crawling data-science kaggle-dataset mal scraped-data
Last synced: 06 Nov 2024
https://github.com/benjaminvdb/DBRD
110k Dutch Book Reviews Dataset for Sentiment Analysis
dataset dataset-creation dutch nlp nlp-machine-learning python python3 scraped-data scraper
Last synced: 17 Nov 2024
https://github.com/fernandod1/producthunt-scraper
Producthunt.com famous website scraper script. Scrap all offers and save in spreadsheet excel file.
crawler crawling crawling-sites data-mining datamining producthunt producthunt-api producthunt-users python python-script python3 scrape scraped-data scraper scraper-engine scraping scraping-bot scraping-python scraping-tool scraping-websites
Last synced: 12 Nov 2024
https://github.com/faheel/file-extensions
JSON collection of scraped file extensions, along with their description and type, from FileInfo.com
file-extensions fileinfo json python3 scraped-data scraper website-scraper
Last synced: 12 Oct 2024
https://github.com/harshcasper/blind-app-reviews
Scraped reviews of over 25 companies from the Blind App ⚡️
blind-app company-reviews dataset nlp scrape scraped-data text-mining webscraping
Last synced: 08 Nov 2024
https://github.com/palahsu/youtubescraper
Scraping YouTube Video Description and Video Likes and Comments and Times and Replies! It's Automatically Extracting Data from Video.
scraped-data scraper scraping youtube youtube-api youtube-api-v3 youtube-data-analysis youtube-data-api-v3 youtube-data-scraping youtube-dl youtube-downloader youtube-scraper
Last synced: 10 Nov 2024
https://github.com/malina/metascraper
Metascraper is a Crystal library for web scraping.
Last synced: 26 Oct 2024
https://github.com/ephellon/game-store-catalog
Catalog of PlayStation, Xbox, and Nintendo games
catalog game games nintendo playstation playstore psn scraped-data store xbox
Last synced: 07 Nov 2024
https://github.com/dbritto-dev/udacity-cloud-devops-engineer-capstone
Capstone Project for Cloud DevOps Engineer on Udacity
capstone covid19 eks eksctl flask jenkins kubernetes pipeline scraped-data udacity-devops-nanodegree
Last synced: 15 Dec 2024
https://github.com/emibcn/covid-data
Store and serve daily collected data from https://dadescovid.org for sibling app at https://emibcn.github.io/covid/
backend cache charts covid-data generalitat-de-catalunya github-actions github-page github-pages hacktoberfest json json-objects scraped-data scraper storage workflow
Last synced: 14 Nov 2024
https://github.com/derrmru/whats-in-the-news
Data Visualisation of News Content
data-visualization nlp react scraped-data
Last synced: 05 Nov 2024
https://github.com/wurstbroteater/hometemp
Fetch apartment data, online data, visualize it, analyse it and send it via email.
apartment-management-system data-visualization raspberry-pi scraped-data temperature temperature-monitoring temperature-sensor
Last synced: 16 Dec 2024
https://github.com/fitzwilliammuseum/thresholds
An archive repository for thresholds.org.uk
cambridge-museums poetry scraped-data wordpress
Last synced: 14 Nov 2024
https://github.com/bhavyac16/flairifyme
FlairifyMe is a Reddit Flair Detector for r/india subreddit, that takes a post's URL as user input and predicts the flair for the post using a model generated by Logistic Regression.
flair-prediction flask hacktoberfest linear-svm logistic-regression naive-bayes-classifier nltk praw-reddit reddit-flair-detector scikit-learn scraped-data subreddit text-classification
Last synced: 11 Nov 2024