data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/rclement/romain-clement.net
Freelance Software Engineer & Trainer
data freelancer machine-learning mkdocs mkdocs-material python
Last synced: 21 Mar 2025
https://github.com/bohnacker/data-manipulation
Some Javascript and Python scripts to manipulate (large) CSV files and JSON data.
data data-mining data-structures javascript python
Last synced: 18 May 2026
https://github.com/gappeah/apocalypse-food-prep-report
This PowerBI project focuses on visualising data for Apocalypse Food Prep, a company specialising in emergency food supplies. The dataset consists of various CSV files containing information on customers, locations, products, sales, sales teams, and state regions.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 25 Feb 2025
https://github.com/andygol/yamap
Yamap Ain't Map – deployment of OSM infrastructure project inspired by osm-seed
api data extract geo-data map openstreetmap osm
Last synced: 24 Jun 2025
https://github.com/jujuadams/ini-to-json
JSON+buffer replacement for native GameMaker INI functions.
data gamemaker gamemaker-studio-2 gms2 ini json save
Last synced: 21 Jul 2025
https://github.com/newrelic-experimental/newrelic-java-camel
Instrumentation of the New Relic Java Agent for the Camel framework
camel camel-jms data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data
Last synced: 10 Apr 2025
https://github.com/swaymm7/open-source-prompt-library
Here is where I store all my useful prompts
chatgpt-prompt data data-analytics data-engineering deepseek gpt ios llm macos prompt prompts prompts-template swift-package-manager tracker
Last synced: 16 Jul 2025
https://github.com/yessasvini23/machine_learning_specialization_deeplearning.ai
Contains all course modules, exercises and notes of ML Specialization by Andrew Ng, Stanford Un. and DeepLearning.ai in Coursera
andrew-ng andrew-ng-course andrew-ng-machine-learning classification data data-science deep-learning machine-learning machine-learning-algorithms neural-network nlp-machine-learning regression rnn-tensorflow
Last synced: 18 May 2026
https://github.com/umitkaanusta/smol-elt
a smol elt (not etl) pipeline for smol tasks
analytics automation aws aws-sns data data-engineering data-pipeline elt etl google-sheets pandas pipeline python spreadsheet web-scraping
Last synced: 10 May 2026
https://github.com/strmprivacy/docs
With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.
data documentation docusaurus privacy privacy-enhancing-technologies
Last synced: 12 Jul 2025
https://github.com/aa-sikkkk/twitterdatamining
A Simple Script to mine data from X/Twitter
Last synced: 24 Jan 2026
https://github.com/varletjs/ruler-factory
A flexible, chainable validation rule factory for typeScript/javaScript.
chainable data factory form javascript rules typescript validation validator
Last synced: 12 Sep 2025
https://github.com/jewelzufo/free-tech-learning
A collection of free Tech Courses with Credly Credentials
ai cisco courses credly cybersecurity data data-science ibm ibm-cloud ibm-watson learning-resources tech
Last synced: 16 Feb 2026
https://github.com/csengupta1101/dig-student-files
This Repository will contain all student submissions at one place.
data datascience education machine-learning python students visualization
Last synced: 17 Jul 2025
https://github.com/vijishmadhavan/parse-clip
A simple CLIP based project for combining images from multiple datasets.
clip data datacleaning dataexploration dataset fastai image python
Last synced: 14 May 2026
https://github.com/olajideolagunju/gcp_mage_data_pipeline
An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.
automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders
Last synced: 07 Mar 2025
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 03 Apr 2025
https://github.com/njraladdin/newspapers-com-scraper
A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.
archive data newspapers scraper scraper-api scraping
Last synced: 06 Apr 2025
https://github.com/gianlucatruda/project_sleep
A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.
data experiment matplotlib python quantified science self sleep visualization
Last synced: 03 Apr 2025
https://github.com/oliver021/ecmalinq
The linq runtime and support to typescript/javascript ecosystem
collection data iterable iteration javascript library linq linq-expressions nodejs query stream stream-data structure typescript
Last synced: 13 May 2025
https://github.com/nalgeon/nalgeon.github.io
Everything about SQLite, Python, open data and awesome software
Last synced: 14 Jul 2025
https://github.com/arverma/data-engineer-interview-experience
My interview experience with the companies I interviewed with
big-data data data-engineer data-engineering engineering interview interview-practice interview-preparation interview-questions python3 spark sql
Last synced: 19 May 2026
https://github.com/stefanbohacek/fediverse-explorations
Exploring the fediverse through data, studies, and polls.
data data-visualization fediverse mastodon social-media
Last synced: 12 Apr 2025
https://github.com/psfried/dgen
Generate evil test data
csv data data-generation data-generator language testing-tools
Last synced: 18 Mar 2025
https://github.com/datawookie/data-diaspora
Various datasets used in tutorials and workshops.
Last synced: 20 Mar 2025
https://github.com/imagodata/filter_mate
FilterMate is a Qgis plugin, an everyday companion that allows you to easily filter your vector layers
data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database
Last synced: 29 Apr 2026
https://github.com/cmudig/mosaic-profiler
A data profiler built with Mosaic
Last synced: 25 Oct 2025
https://github.com/sneels/parkds
Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)
cross-domain data database datasource datasources javascript source
Last synced: 24 Feb 2026
https://github.com/0xdir/htcds_dart
Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.
data humanitarian schema standards
Last synced: 24 Oct 2025
https://github.com/ryanmorr/fastmap
Accelerated hash maps
data hashmap javascript map performance
Last synced: 10 Oct 2025
https://github.com/udityamerit/python-librearies-for-data-science
Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data
beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow
Last synced: 06 Feb 2026
https://github.com/tomdoestech/website-scraping-example
data node-js nodejs scraping scraping-websites
Last synced: 16 Mar 2025
https://github.com/t3v/t3v_datamapper
The data mapper extension of TYPO3voilà.
data database datamapper extension laravel mapper t3v typo3 typo3-cms-extension typo3-extension typo3voila
Last synced: 27 Jan 2026
https://github.com/legopitstop/addons
All legopitstop's Bedrock add-ons in one place.
add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 06 Feb 2026
https://github.com/binarybardakshat/suryanayan
Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.
Last synced: 10 Oct 2025
https://github.com/geopython/pygeoapi-examples
Example pygeoapi deployment patterns and configurations
api data geospatial ogc ogc-api osgeo pygeoapi
Last synced: 11 Oct 2025
https://github.com/fcakyon/earth2-scraper
Up-to-date earth2.io data
data earth earth2 earth2io javascript json json-api prices-per-tile python scraper
Last synced: 09 May 2026
https://github.com/cicerops/monitoring-check-grafana
Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.
data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale
Last synced: 08 May 2026
https://github.com/chompfoods/sdk-csharp
C# SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp csharp csharp-sdk data database dll food grocery ingredients nuget nutrition raw recipes recipes-api restsharp sdk swagger
Last synced: 06 May 2026
https://github.com/ahmedshahriar/eda_basketball
basketball basketball-stats data data-science data-visualization pandas python python3 streamlit
Last synced: 04 May 2026
https://github.com/jderstd/spec
A standard for JSON responses
data error jder json response specification structure
Last synced: 13 May 2026
https://github.com/deveel/kista
Implementations of the repository pattern for .NET to support the domain-driven modeling
clean-architechture csharp data dotnet-core dotnetcore efcore entity entity-framework entity-manager layered-architecture mongodb repository repository-manager repository-pattern
Last synced: 08 Jun 2026
https://github.com/paladique/azuresample-guestbook
Guestbook using MySQL and Cosmos DB on Azure
cosmosdb data mysql spa websockets
Last synced: 30 Apr 2026
https://github.com/anicolaspp/mapr-data-gen
Data generator for MapR Data Platform
data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark
Last synced: 29 Apr 2026
https://github.com/tooleks/laravel-presenter
The Laravel Presenter Composer Package
collection composer data entity laravel mapper mapping php presenter representation view
Last synced: 28 Apr 2026
https://github.com/yazaabed/at-who-angular
wrapper for At.js that add mentions autocomplete to your application with angular component for using it on any AngularJS projects
angular-components angularjs autocomplete components data modules webpack wrapper
Last synced: 28 Apr 2026
https://github.com/luminovrym/pbo-biodata
Simulasi Cara Input Data dengan OOP
Last synced: 18 Jun 2026
https://github.com/mongodb-developer/rocket-analytics
Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.
data federation lucene lucenesearch mongodb s3 search sql
Last synced: 28 Apr 2026
https://github.com/justjavac/deno_data_dir
Returns the path to the user's data directory.
data deno deno-module deno-modules directory
Last synced: 27 Apr 2026
https://github.com/kanugurajesh/firebase-data
Adding data to firebase store
data firebase firebase-database python
Last synced: 27 Apr 2026
https://github.com/anthonykrivonos/ts-algo-masterclass
👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.
algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript
Last synced: 11 May 2026
https://github.com/mukhopadhyay/opendata
Open Data ❤️
data data-science datasets deep-learning kaggle kaggle-dataset machine-learning open-source opendata
Last synced: 25 Apr 2026
https://github.com/joamag/pandas
Loads of pandas data from China with awesome data
data data-analysis jupyter notebook pandas
Last synced: 25 Apr 2026
https://github.com/corentinb/txtoredis
:fire: Push each line of a text file, to a Redis set
data datascience dataset go golang redis set
Last synced: 24 Apr 2026
https://github.com/d2hydro/fewspy
A Python API for the Deltares FEWS PI REST Web Service
data geopandas hydrology hydrometrics pandas python
Last synced: 23 Apr 2026
https://github.com/healthyregions/oeps
Opioid Environment Policy Scan - data explorer and backend management
data data-visualization public-health
Last synced: 21 Apr 2026
https://github.com/adanos-software/free-ticker-database
Free global stock & ETF ticker reference database - 50k+ tickers, 66 exchanges, 81 countries
Last synced: 10 May 2026
https://github.com/lilingxi01/bloark
Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.
architecture bloark data revision-based
Last synced: 05 Apr 2026
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/d8a-tech/d8a
A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.
bigquery clickhouse data ga4 tracker
Last synced: 10 Apr 2026
https://github.com/robertmyles/riscobrasil
An R package to download 'Brazil Risk' data :chart_with_upwards_trend:
Last synced: 08 Apr 2025
https://github.com/ahmetfurkandemir/sahibinden-data-engineering-technical-case-study
Sahibinden.com Data Engineering Technical Case Study
case-study data data-engineering debezium docker flink kafka mongodb mysql pyflink pyspark python sahibinden spark
Last synced: 03 Mar 2026
https://github.com/muhammadibrahim313/datavue
"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.
analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit
Last synced: 10 Apr 2025
https://github.com/louisbrulenaudet/legalkit-pipeline
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python
Last synced: 17 Mar 2025
https://github.com/geo2france/odema-dashboard
Tableaux de bord thématiques Odema
application client-side dashboard data echarts maplibre odema react waste
Last synced: 05 Feb 2026
https://github.com/abrudz/parsing
Dyalog APL expressions to parse common and unusual data formats from text files
apl csv data data-format dyalog-apl dyalogapl parsing
Last synced: 20 Mar 2026
https://github.com/woctezuma/steam-reviews-data
Data available to compute statistics of Steam reviews.
Last synced: 19 Mar 2026
https://github.com/huangcongqing/ranking-list
数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data
Last synced: 15 Feb 2026
https://github.com/platob/yggdrasil
arrow data databricks pandas polars spark sql
Last synced: 02 Jun 2026
https://github.com/gadenbuie/crantrack
Hourly snapshots of CRAN's incoming packages folder
Last synced: 12 Mar 2026
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/fforres/webpack-plugin-dx-metrics
Webpack plugin to track webpack behaviour in datadog
data datadog developer-experience typescript visualization webpack
Last synced: 13 Feb 2026
https://github.com/onaio/gisida-react
React Dashboard library for Gisida.
dashboard data gisida map react visualization
Last synced: 28 Apr 2025
https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation
Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.
colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats
Last synced: 11 Feb 2026
https://github.com/enes9103/039_react_task_tracker-json_server
api axios-react css3 data javascript json-server react reactjs responsive todoapp
Last synced: 11 Feb 2026
https://github.com/rikurauhala/insights
Visualize your coding journey
cypress data data-visualization github github-api javascript material-ui octokit react statistics typescript vite
Last synced: 11 Feb 2026
https://github.com/bluegreen-labs/oneflux_containers
Containerized (docker) versions of the ONEFlux processing pipeline
data ecosystem fluxes micrometeorology processing
Last synced: 07 Oct 2025