data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/ncgl-git/eriparse
Python code to parse the cost-of-living HTML from erieri.com, i.e. https://www.erieri.com/cost-of-living/united-states/illinois/chicago
cost-of-living crime crime-data data economic-research-institute erieri webscraper
Last synced: 14 Jan 2026
https://github.com/vincentlaucsb/csv-data
A curated repository of real and fake CSV data for use in testing suites
Last synced: 08 Mar 2026
https://github.com/stdlib-js/array-base-fancy-slice-assign
Assign element values from a broadcasted input array to corresponding elements in an output array.
array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types
Last synced: 06 Oct 2025
https://github.com/wangshouh/cryptofinancedata
An ipynb file containing data acquisition of futures, options and other financial derivatives
Last synced: 05 Oct 2025
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/nikoshet/rust-dms-cdc-operator
The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.
aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation
Last synced: 18 Jan 2026
https://github.com/mewmix/drivehound
magic file signatures + python drive recovery magic
data disk file-signatures harddrive python recovery recovery-tool
Last synced: 08 Oct 2025
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 29 Apr 2026
https://github.com/definetlynotai/vulnscan_data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 11 Oct 2025
https://github.com/strata/data
Tools to help you read data from a range of different data providers.
Last synced: 27 Jan 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/mccarthy-m-g/alda
An R data package for the book "Applied longitudinal data analysis: Modeling change and event occurrence" by Singer and Willett (2003).
data growth-curves longitudinal-data mixed-models nonlinear-mixed-models r r-package structural-equation-modeling survival-analysis time-to-event
Last synced: 19 Jan 2026
https://github.com/iamgmujtaba/github-python-daily-trending
This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file
data data-scraping github-actions tranding tranding-bot
Last synced: 13 Oct 2025
https://github.com/tberey/social-stocks
A Graphical Data and Analysis Tool
data data-analysis data-science data-stream data-visualization database javascript mysql mysql-database node nodejs rest rest-api social-stocks stock-market stocks ticker-data tickers trends typescript
Last synced: 21 Jan 2026
https://github.com/geocollections/turvas
Database of peat geology
data data-visualization database estonia geology mineral-resources peat
Last synced: 05 Feb 2026
https://github.com/lahcenezzara/whatsapp-scraping-python
WhatsApp Scraping Python
automation data python scraping selenium whatsapp
Last synced: 05 Feb 2026
https://github.com/stdlib-js/ndarray-base-dtypes2signatures
Transform a list of array argument data types into a list of signatures.
api array base data dtype dtypes interface javascript multidimensional ndarray node node-js nodejs sig signatures stdlib types utilities utility utils
Last synced: 14 Apr 2026
https://github.com/potreic/etl-fashion-trend-analysis
✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊
airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends
Last synced: 27 Jan 2026
https://github.com/nicolasbizzozzero/datagenerator
Randomly generate various commonly used data
data data-generation data-generator data-science
Last synced: 18 Oct 2025
https://github.com/jaldekoa/fiscaldataapi
A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 27 Jan 2026
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/purarue/git_doc_history
copy/track file history in git, with python bindings to traverse and extract history/files/lines at some date
Last synced: 17 May 2026
https://github.com/farzai/geonames-php
This package provides a simple way to download Geonames data and format it for friendly use.
countries country-codes data geography geonames
Last synced: 24 Oct 2025
https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights
Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.
analytics data excel sas sasprogramming statistical-analysis
Last synced: 24 Feb 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/maccccd/wsoa3029a_2444372
This website serves an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js , a JavaScript library used to create interactive data visualizations. The visualizations in here were used to provide insights on two types of cybersecurity attacks: Phishing & Ransomware.
d3js data hacking visualization
Last synced: 24 Jan 2026
https://github.com/sefakcmn00/tensorflow_machine_learning_simple-
Artificial Neural Network(ANN) Perceptron
data mathplotlib pandas pandas-dataframe pandas-python sklearn tensorflow-examples tensorflow2
Last synced: 06 Feb 2026
https://github.com/ariqf1/learn_data
Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.
Last synced: 15 Apr 2026
https://github.com/fairspec/fairspec-typescript
Fairspec TypeScript is a fast data management framework built on top of the Fairspec standard and Polars DataFrames
ckan csv data dataframe dataset excel fair json ods polars quality schema sqlite table typescript validation zenodo
Last synced: 09 Feb 2026
https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2
Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.
data data-analysis deloitte excel forensic-analysis
Last synced: 06 Feb 2026
https://github.com/qbicsoftware/research-data-management
Documentation about the life science research data management at QBiC
data data-management data-stewardship documentation hacktoberfest life-science management metadata rdm reasearch-data-management
Last synced: 30 Jan 2026
https://github.com/tee8z/noaa-oracle
NOAA data oracle, queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style
data duckdb-wasm noaa-weather parquet-files sql weather
Last synced: 17 Feb 2026
https://github.com/dandre3000/matrix
Matrix library
algebra array data data-structure math matrix vector
Last synced: 01 Feb 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/cworld1/novel-data
The data repository of novel analysis
Last synced: 01 Feb 2026
https://github.com/garcane/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 09 Feb 2026
https://github.com/3squared/smoulder
Smoulder is a really good data pipe
composition data facade-pattern forge-framework object-oriented
Last synced: 25 Apr 2026
https://github.com/lmuffato/project-mongodb-dataflights-trybe
Projeto MongoDB Dataflights - Projeto avaliativo da Trybe do Bloco 23: Introdução ao MongoDB
back-end crud data database filter mongo mongodb query trybe-projects
Last synced: 16 Apr 2026
https://github.com/ismail-mouyahada/lodscroljs-library
LodScrolJS Documentation LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks
data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll
Last synced: 13 Feb 2026
https://github.com/garcane/beverage-sales-analytics
This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/nikhilash45/power-bi-vsualisation-of-joins
In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.
business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization
Last synced: 19 Mar 2026
https://github.com/blacksujit/shikshamitra
Shiksha Mitra is an innovative MVP designed to reshape the way students learn through gamification. Our platform transforms the traditional approach to education by making learning engaging, interactive, and rewarding. As an MVP, Shiksha Mitra focuses on delivering core features that showcase the value of gamified learning,
ai data gamified-learning hackathon lms ml mlflow mlops mlops-workflow mvp pipeline platforn
Last synced: 28 Feb 2026
https://github.com/neomutt/sample-data
📚 Lists of things. Useful for developing and testing.
Last synced: 19 Mar 2026
https://github.com/mvicens/sporscor
TypeScript API to manage sport data getting scoreboards and statistics
api-client data score scoreboards sport statistics typescript
Last synced: 16 Feb 2026
https://github.com/huseyincenik/tableau
This repository contains Tableau visualizations and related resources for my project.
analytics api bianalyst business-analytics business-intelligence business-solutions dashboard data data-analysis data-science data-structures dataanalysis dataset datavisualization drilldown interactive-visualizations tableau tableau-dashboards viz
Last synced: 19 Mar 2026
https://github.com/eugenedakin/polyalphabeticcipher
PolyAlphabetic Cipher
data decryption encryption polyalphabetic polyalphabetic-cipher polyalphabetic-crypto polyalphabetic-substitution xojo
Last synced: 19 Mar 2026
https://github.com/docusign/extension-app-data-io-reference-implementation
Extension App for Data IO Reference Implementation for the Docusign IAM Platform
Last synced: 02 Mar 2026
https://github.com/agnosticeng/cli
Agnostic magic is now at your fingertips.
cli clickhouse data datalake datalakehouse
Last synced: 03 Mar 2026
https://github.com/aiwithqasim/recommendationengines
Recommendations Engines with IBM a project of DataScientist Nanodegree on Udacity. For this project i will analyze the interactions that users have with articles on the IBM Watson Studio platform, and make recommendations to them about new articles you think they will like.
data data-manging data-science ibm ipython-notebook normalization python3
Last synced: 18 Apr 2026
https://github.com/azmisahin/azmisahin-software-web-package-storage-nodejs-javascript-v1
storage container logical partitions management.
conventional-commits data dev-container docker library linux module nodejs package partitions queue redis storage
Last synced: 05 Apr 2026
https://github.com/csheldonhess/reporting-on-congress
What has Congress passed and not passed, lately?
civic-data congress data government government-data propublica propublica-congress-api
Last synced: 20 Apr 2026
https://github.com/ktbarrett/scdil
simple configuration and data interchange language
configuration data json python yaml
Last synced: 20 Apr 2026
https://github.com/garciparedes/r-examples
Set of awesome R Examples
data data-science garciparedes r statistics university-of-valladolid
Last synced: 20 Apr 2026
https://github.com/jinsyin/dataorigin
数据之源 | A data source management framework
Last synced: 21 Apr 2026
https://github.com/stefen-taime/myubereats_datapipeline
Building a Modern Uber Eats Data Pipeline
airflow api data datawarehouse mongodb pipeline powerbi snowflake
Last synced: 22 Apr 2026
https://github.com/howtoquitvivek/ai-crop-yeild-prediction
AI-driven crop yield prediction and agricultural optimization system (SIH 2025)
2025 2026 ai crop-yeild data minor-project ml predcition python science sih
Last synced: 23 Apr 2026
https://github.com/sebastianbrzustowicz/collision-detection-ai
Python + TensorFlow. Repository for training a machine learning model for collision detection with an accelerometer sensor data and TensorFlow.
accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow
Last synced: 24 Apr 2026
https://github.com/andygol/osm-diff-state
CLI tool to search OSM diff state files
custom data openstreetmap planet replication
Last synced: 24 Apr 2026
https://github.com/aidenellis/connectmp
🍰 ConnectMP - An easy way to share data between Processes in Python.
aidenellis connectmp data data-sharing multiprocessing process sharing
Last synced: 27 Apr 2026
https://github.com/jtpio/data-playground
Experiments using public APIs and data
Last synced: 28 Apr 2026
https://github.com/ahmetcansolak/developer-insights
New project of ClubRockers from Sarıyer Hills
bitbucket data data-science data-visualization github python3
Last synced: 28 Apr 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/chompfoods/stub-asp-net-core
ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server
Last synced: 30 Apr 2026
https://github.com/timclicks/dataclerk
zero fuss data logging over HTTP
actix-web command-line data logging rust sqlite sqlite3 utility
Last synced: 30 Apr 2026
https://github.com/chompfoods/sdk-php
PHP SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients php raw recipe-api recipes sdk
Last synced: 30 Apr 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/lucien-loua/libgn
Manipulate geographical and administrative data about Guinea.
Last synced: 08 Jun 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/liuliqiang/laueagle
YAML/JSON Lints and Converters
converter data formater json linter python serialization yaml
Last synced: 02 May 2026
https://github.com/arif-miad/titanic-analysis
artificial-intelligence data data-science deep-neural-networks
Last synced: 09 Jun 2026
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/hasnocool/war_thunder_data_scraper
A web scraping tool designed to extract valuable data from War Thunder, a popular online game.
data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war
Last synced: 06 May 2026
https://github.com/satur-io/estoraje
Estoraje is the simplest distributed system for key-value storage in less than 800 lines of code. It is temporary consistent, high available, lightweight, scalable and gives a good performance.
data database distributed go golang key-value performance training
Last synced: 07 May 2026
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 May 2026