Projects in Awesome Lists tagged with data-gathering
A curated list of projects in awesome lists tagged with data-gathering.
https://github.com/fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
cc-news ccnews commoncrawl crawler data-gathering elasticsearch extract-articles extract-information extractor json news news-archive news-articles news-crawler news-extractor news-scraper news-websites nlp python roberta
Last synced: 13 May 2025
https://github.com/cacti/cacti
Cacti™
cacti cacti-community cacti-framework data-gathering graph graph-templating network-discovery php rrd-files rrdtool
Last synced: 14 May 2025
https://github.com/vil/h4x-tools
Open source toolkit for scraping, OSINT and more.
data-gathering dirbuster email-osint h4x-tools hacking hacking-tool hacktools igscraper ip-scanner linux osint phone-number port-scanner python python-script python3 tools webhook-spammer webscraping websearch
Last synced: 08 Apr 2025
https://github.com/arup-group/social-data
Code and data for eviction and housing analysis in the US
burdened-households compare-counties cost county-level covid-19 data-analysis data-gathering database eviction evictions fred housing preventing-evictions python-scripts rent risk
Last synced: 12 Apr 2025
https://github.com/shadawck/glit
Retrieve the email addresses of all users related to a git repository, a git user, or a git organization
data-gathering email email-osint information-gathering multithreaded osint osint-tool rust
Last synced: 07 Mar 2026
https://github.com/viralvaghela/jwiki
Java tool to get Wikipedia data
data-gathering java javatool javawikipeda wikipedia wikipedia-api wikipedia-scraper
Last synced: 26 Oct 2025
https://github.com/sondosaabed/oil-vs-bigtech-stock-investigation
💹📈 Investigating oil market prices alongside stock market prices from the start of 2001 to the end of 2023. 💰📉
advanced-data-wrangling api data-analyst-nanodegree data-assessment data-gathering data-science python requests wrangling-data
Last synced: 09 Apr 2025
https://github.com/dbrennand/twitter-stream-bot-data-gatherer
An application to watch the Twitter stream and send accounts to the Botometer API for analysis. The results are stored in a SQLite database.
bot-detection botometer data-gathering twitter-api twitter-bot-detection twitter-streaming-api
Last synced: 11 Apr 2025
https://github.com/darsan-in/job-crawler
The Job Crawler is an integral component of the Job RAID project, designed to automatically scrape and collect data from various job listing websites. This crawler enables Job RAID to aggregate comprehensive job listings, ensuring that users have access to up-to-date and relevant job opportunities.
automated-job-listings crawler-integration data-extraction data-gathering job-aggregator job-crawler job-data job-data-collection job-data-miner job-listing-crawler job-portal-scraping job-scraping job-scraping-tool job-search-automation job-search-engine multi-site-job-scraping real-time-job-data scraping-jobs web-crawler web-scraping
Last synced: 14 Feb 2026
https://github.com/iwansal64/instaf1nder-py
An open source Instagram profile lookup.
data-gathering linux osint-tool python
Last synced: 28 Apr 2026
https://github.com/ibrahimceyisakar/hotel-finder
Hotel finder system in Python, including data gathering, analysis, and visualization.
data-analysis data-gathering data-visualization pandas plotly python selenium streamlit
Last synced: 06 May 2026
https://github.com/aadityasikder/Object-Detection-with-raspberry-pi-implementing-TinyML-models
Repository for Raspberry Pi-based object detection with TinyML models like TensorFlow Lite, PyTorch Nano, including data gathering, mAP evaluation, and image data preparation in Jupyter notebooks.
data-gathering datacleaning dataprocessing image-preparation object-detection pytorch-nano raspberry-pi-4 tensorflow-lite tinyml
Last synced: 16 Dec 2025
https://github.com/abhi18av/acm-icpc-problems-aggregator
acm-icpc browser-automation clojure data-gathering
Last synced: 23 Feb 2025
https://github.com/shafaq-aslam/data-gathering
A hands-on collection of notebooks exploring multiple data-gathering techniques, from reading CSV, Excel, JSON, and SQL files to exporting data in various formats and fetching real-time data through APIs. This repository documents my complete learning journey through data ingestion, preparation, and extraction for data analysis workflows.
api data-analysis data-export data-gathering data-import data-science jupyter-notebook machine-learning pandas python python3
Last synced: 21 May 2026
https://github.com/iabdullah215/reconpro
This tool performs comprehensive security reconnaissance on a given domain, gathering information such as subdomains, SSL certificate details, open ports, HTTP headers, WHOIS data, and more. It generates a detailed JSON report of the findings for further analysis.
data-gathering open-source osint web-scraping
Last synced: 30 Jul 2025
https://github.com/aqueeqazam/web-scraping-for-data-gathering-and-mining
Web scraping is used by data mining experts and hackers to imitate conventional browsers and visit websites by following their hypertext structure. They then extract HTML content and data according to predetermined settings and store the data in local databases.
data-gathering data-mining web-mining web-scraping
Last synced: 29 May 2026
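The scraping flow described in the entry above (follow a page's hypertext structure, extract content according to fixed rules, store it in a local database) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not code from the listed repository. The class name `LinkAndTitleParser`, the function `store_page`, the table layout, and the sample HTML are all assumptions made for the example, and the page is an inline string rather than a fetched response.

```python
import sqlite3
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkAndTitleParser(HTMLParser):
    """Collect the page title and every hyperlink target from an HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, href))

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def store_page(conn, url, html):
    """Parse one page and store its title and outgoing links (hypothetical schema)."""
    parser = LinkAndTitleParser(url)
    parser.feed(html)
    conn.execute("CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, title TEXT)")
    conn.execute("CREATE TABLE IF NOT EXISTS links (source TEXT, target TEXT)")
    conn.execute("INSERT OR REPLACE INTO pages VALUES (?, ?)", (url, parser.title.strip()))
    conn.executemany("INSERT INTO links VALUES (?, ?)", [(url, t) for t in parser.links])
    conn.commit()
    return parser.links


# Hypothetical sample page standing in for a fetched response body.
SAMPLE_HTML = """
<html><head><title>Example listing</title></head>
<body><a href="/jobs/1">Job 1</a> <a href="https://example.org/about">About</a></body></html>
"""

conn = sqlite3.connect(":memory:")
found = store_page(conn, "https://example.com/index.html", SAMPLE_HTML)
```

In a real crawler the links returned by `store_page` would feed a queue of pages to visit next. That step is left out here, as are fetching, rate limiting, and robots.txt handling.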
https://github.com/ankitmishralive/machinelearning
Continuously deepening my understanding of and expertise in machine learning through ongoing education and hands-on practical experience.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 22 Mar 2025
https://github.com/grctest/boinc_scripts
automation boinc data-gathering gridcoin scripts server-status statistics team
Last synced: 26 Mar 2025
https://github.com/grip-on-software/data-gathering
Modules used to gather data from different data sources in software development processes
data-gathering software-development-process
Last synced: 24 Jan 2026
https://github.com/ankitgmishra/machinelearning
Continuously deepening my understanding of and expertise in machine learning through ongoing education and hands-on practical experience.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 03 May 2026