data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/zalweny26/open_data_unipa
Progetto per l'esame di Laboratorio di Algoritmi 23-24, UniPa, Informatica L-31
Last synced: 26 Apr 2026
https://github.com/karthikmprakash/github_repos_scraper
A tool to extract names of github repos of any user
automation bs4 data github python repositories requests webscraping
Last synced: 27 Apr 2026
https://github.com/saulojoab/crato-ce-json
Nesse repositório irei armazenar todos os bairros (e mais informações, no futuro) de Crato-CE em JSON.
data database geolocation json json-api localization
Last synced: 28 Apr 2026
https://github.com/rdjarbeng/rdjarbeng
Richard Djarbeng's github profile-computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to website below
data jekyll machine-learning ruby website
Last synced: 28 Apr 2026
https://github.com/jackosheadev/databasetechproject
This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction
Last synced: 18 May 2026
https://github.com/quarylabs/quary_basketball_analysis_duckdb
An example analysis
analytics data duckdb engineering quary
Last synced: 29 Apr 2026
https://github.com/aidanjuma/ankideckextractor
A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.
anki apkg cli data decompression extraction flashcards learning python zip
Last synced: 29 Apr 2026
https://github.com/chrnthnkmutt/theartofstatistic_python
This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/chompfoods/stub-asp-net-core
ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server
Last synced: 30 Apr 2026
https://github.com/divanny/academixbackend
🧑🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.
backend csharp data net webapi
Last synced: 07 Jun 2026
https://github.com/gdhhgnbnvbn/f1-2025-ai-predict
fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.
agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe
Last synced: 01 May 2026
https://github.com/ggeop/multiple-fields-management
Fields management from/to different data sources. :bulb:
data data-engineering data-organization data-retrieval data-science pandas python
Last synced: 01 May 2026
https://github.com/stefanbohacek/fediverse-account-analyzer
bots botsinspace data dataviz fediverse mastodon
Last synced: 02 May 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/unicef/magicbox-download-shapefiles
Downloads shapefiles for each country from gadm.org and unzips them.
data data-science docker downloads-shapefiles emergency-response gadm geospatial geospatial-data humanitarian javascript magicbox nodejs shapefile unicef
Last synced: 02 May 2026
https://github.com/ishaansathaye/data40x-1_2_3
Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab
capstone-prep data data-science ethics lab python
Last synced: 04 May 2026
https://github.com/raghavendranhp/credit_card_fraud_detection
This repository contains code for a credit card fraud detection model using autoencoders and logistic regression, achieving 95.3% accuracy.
anomaly-detection autoencoder-neural-network credit-card-fraud data keras logistic-regression machine-learning preprocessing tensorflow
Last synced: 04 May 2026
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/satur-io/estoraje
Estoraje is the simplest distributed system for key-value storage in less than 800 lines of code. It is temporary consistent, high available, lightweight, scalable and gives a good performance.
data database distributed go golang key-value performance training
Last synced: 07 May 2026
https://github.com/yash22222/sync-intern-s-ml-tasks
SYNC INTERN'S Machine Learning internship will offer you to enhance your skills by doing real-life example projects. This internship will increase your knowledge in the field of data and algorithms to understand how a machine learns.
bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns
Last synced: 07 May 2026
https://github.com/chompfoods/stub-jaxrs-resteasy
JAX-RS RESTEasy server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients jax-rs jax-rs-server nutrition raw recipe-api recipes resteasy server server-stub stub stub-server
Last synced: 08 May 2026
https://github.com/keanteng/nextjs-directory
🌐A Draft Website For Data Catalogue Using NextJs
catalogue climate-change css data directory html javascript nextjs website
Last synced: 09 May 2026
https://github.com/lmuffato/project-mysql-vocabulary-booster-trybe
Projeto mysql vocabulary booster - Projeto avaliativo da Trybe do Bloco 20: Funções SQL, Joins e Subqueries
back-end crud data database mysql mysqlworkbench query sql trybe-projects
Last synced: 10 May 2026
https://github.com/dimitryzub/walmart-stores-coffee-analysis
Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡
analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn
Last synced: 10 May 2026
https://github.com/petrosdemetrakopoulos/ethairballoons.py
A strictly typed ORM library for Ethereum blockchain.
blockchain dao dapp data database ethereum ethereum-blockchain library orm python smart-contracts web3
Last synced: 11 May 2026
https://github.com/suryavamsi-p/conflict-nlp-topic-modeling-sentiment-analysis-using-llms
Extracts insights from 26K+ protest events using BERTopic, Top2Vec, and LLMs for real-world applications like crisis monitoring, policy research, and social unrest analysis.
all-mpnet-base-v2 bertopic conflict-data data data-science lda llama2 llms machine-learning mistral-7b nlp nltk protest-analysis pyldavis python3 top2vec topic-modeling transformers visualization
Last synced: 11 May 2026
https://github.com/scarblase/russian-military-losses-analysis
This repository provides an in-depth analysis of Russian equipment losses using PySpark and data visualization techniques.
data data-science data-visualization jyputer-notebook matplotlib pyspark python3 seaborn seaborn-plots ukraine ukraine-invasion
Last synced: 12 May 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/svetlanam/twitter-ads
Get data about campaigns from Twitter Ads API
api data keboola keboola-extractor twitter twitter-ads twitter-api
Last synced: 12 Jun 2026
https://github.com/fairspec/fairspec-standard
Fairspec is a data exchange format compatible with DataCite for metadata and JSON Schema for structured data
ckan csv data dataset excel fair fairspec json ods polars python quality schema sqlite table typescript validation zenodo
Last synced: 16 Jun 2026
https://github.com/cdcgov/importsurvey
Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)
data import r sas survey survey-data
Last synced: 19 Jun 2026
https://github.com/williamwutq/bllist
Durable, crash-safe, checksummed block-based linked list allocators stored in a single file
data data-storage data-structure database file-based linkedlist
Last synced: 25 Jun 2026
https://github.com/seabbs/estzoonotictb
Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden
bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb
Last synced: 28 Feb 2026
https://github.com/pradeep221b/turbofan_predictive_maintenance
An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.
data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels
Last synced: 04 Oct 2025
https://github.com/zediculz/block
Block is a data structure/collection that uses Blockchain principle in managing data.
Last synced: 05 Oct 2025
https://github.com/dylanhogg/cloud-products
A package for getting cloud products and product descriptions from a cloud provider website.
aws cloud-products crawler data text-processing
Last synced: 05 Oct 2025
https://github.com/DefinetlyNotAI/VulnScan_Data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 17 Aug 2025
https://github.com/freddy03h/immutable-data-structure
Normalize and Merge your application's data store using Immutable.JS objects
Last synced: 05 Oct 2025
https://github.com/vincentneo/sgtidetimings
Scraped SG NEA tide timings table into machine-readable JSON files!
data github-actions github-pages gov html-tables-to-json javascript json nodejs sg singapore singapore-data-analysis tide webscraping
Last synced: 10 Apr 2026
https://github.com/rambodrahmani/covid19-behind-the-numbers
COVID-19: Behind the Numbers.
apriori-algorithm apriori-algorithm-python clustering clustering-algorithm clustering-analysis covid covid-19 covid19-data data data-mining data-science datamining fpgrowth machine-learning machine-learning-algorithms python python-machine-learning
Last synced: 20 Aug 2025
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/jerryfzhang/rockets
A Node + React App that displays space launch missions around the world.
bootstrap data expressjs less momentjs nodejs react reactjs reactstrap
Last synced: 10 Apr 2026
https://github.com/grkndev/twitcher
A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.
api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user
Last synced: 09 Mar 2026
https://github.com/aymane-maghouti/mobile-data-hive-insights
This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,
apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi
Last synced: 09 Mar 2026
https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data
This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.
aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python
Last synced: 03 Jan 2026
https://github.com/xdrokra/road-accident-analytics
A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023
analytics data design italy javascript
Last synced: 30 Aug 2025
https://github.com/tatey/list_of_baby_names
A list of baby names given to tiny humans in Ruby
Last synced: 11 Nov 2025
https://github.com/ukplab/pragtag2023
Code and data for the PragTag-2023 Shared Task
argument-mining data peer-review pragmatics shared-task
Last synced: 18 Jun 2025
https://github.com/nafisalawalidris/sales-performance-dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business
Last synced: 03 Feb 2026
https://github.com/marcelo-earth/h5n8-data
🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.
csv data h5n8 h5n8-cases h5n8-virus russia
Last synced: 19 Jan 2026
https://github.com/snimmagadda1/stack-exchange-dump-to-mysql
Batch pipeline to import Stack Exchange XML data dumps to relational DB
batch data mysql spring-batch stackoverflow
Last synced: 30 Mar 2025
https://github.com/ngambip/priscilla
About my work and Experience
accounting analytics data finance-management
Last synced: 03 Feb 2026
https://github.com/gappeah/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 25 Feb 2025
https://github.com/devsujay19/knowledgebase
My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.
data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel
Last synced: 10 Apr 2026
https://github.com/stdlib-js/array-base-to-accessor-array
Convert an array-like object to a minimal array-like object supporting the accessor protocol.
accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper
Last synced: 04 Jan 2026
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 10 May 2026
https://github.com/neelravi/data-management
A data management plan for computational chemists/physicists and material scientists for a FAIR storage of raw data
data dmp fair management workflows
Last synced: 16 Jan 2026
https://github.com/milandjurdjevic/discriminalizer
.NET library designed for seamless JSON deserialization of objects with complex discrimination requirements, built on top of System.Text.Json.
data deserialization dotnet json
Last synced: 15 Apr 2025
https://github.com/stdlib-js/datasets-herndon-venus-semidiameters
Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.
astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus
Last synced: 09 Oct 2025
https://github.com/ilejuxepwaduzd/structured-data-extractor
🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.
cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data
Last synced: 10 Apr 2026
https://github.com/qeeqbox/data-states
Data states refer to structured and unstructured data divided into three categories (At Rest, In Use, and In Transit)
data data-state infosecsimplified qeeqbox
Last synced: 10 Mar 2026
https://github.com/exoticknight/juhe
simple way to analyze complex data in one chain call
aggregation aggregator analysis data statistic typescript
Last synced: 21 May 2026
https://github.com/rremple/intervalidus
For all your interval-based data needs.
Last synced: 21 Feb 2026
https://github.com/bilalmehrban/data-log-monitor
A simple yet elegant desktop c# application based on 3 Tier architecture, designed to have a look at the logs stored in the database using Nlog or other logging framework's.
csharp data desktop-app logging
Last synced: 14 Mar 2025
https://github.com/ndohvich/ndohvich
Je suis un grand fan de l'analyse des données avev PYTHON
anaconda arduino data github jypyter keras machine-learning machine-learning-algorithms numpy pandas python scikit-learn sql tensorflow visual-studio-code visualization-dashboard
Last synced: 11 Apr 2026
https://github.com/jayantur13/kountry
Node module variant of the Country API
api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn
Last synced: 26 Jan 2026
https://github.com/mews-labs/dataframe-memory
This tools aims to provide simple solution to save memory when using pandas' data frame.
data data-science memory-usage pandas-dataframe python3
Last synced: 22 May 2026
https://github.com/brianali-codes/github-searcher
A website for API experimentation that users the github Api to search for different users and some of their (public) information
Last synced: 21 May 2026
https://github.com/vatshayan/list-of-animals-data-classification-
Classification & Visualization of List of Animals Data set using Machine Learning Algorithm
animal-behavior animal-data animals artificial-intelligence classification data data-analysis data-mining data-science data-visualization dataset jupyter-notebook machine-learning python supervised-learning
Last synced: 17 May 2026
https://github.com/yukti-09/extracting-data-from-twitter
Data From Twitter!
data data-mining extracting-data timeline tweepy tweets twitter
Last synced: 11 Oct 2025
https://github.com/mrbisquit/weathercollector
Open-Source weather station data collector
collector customisable data modular opensource weather weather-forecast weather-station
Last synced: 16 Jan 2026
https://github.com/rohancyberops/rp1
This project performs an analysis of Starbucks (SBUX) stock returns using R. The analysis includes both simple returns and continuously compounded returns (CC returns) for a period of one month. It also calculates the growth of $1 invested in SBUX and provides visual insights through various plots.
analysis cc data r rlanguage sbux
Last synced: 15 Mar 2025
https://github.com/apfirebolt/data-structures-and-algorithms-in-python
Data Structure and Algorithms in Python
algorithms data data-structures python python3 tkinter-gui
Last synced: 15 Mar 2025
https://github.com/ucd-cws/nitrates-cv
california centralvalley data frep groundwater model nitrates
Last synced: 16 Jan 2026
https://github.com/ahmadjamil888/facial-recognition-ai-model
A facial recognition AI model powered by CNN , and trained by thousands of images.
ai cnn data data-science facial facial-recognition recognition
Last synced: 30 Jun 2025
https://github.com/jimut123/web-crawller
A web crawler which crawls through the whole internet
beautifulsoup collector data databases glance internet link links mining python3 scrapping-python web-crawler
Last synced: 16 Jan 2026
https://github.com/antononcube/raku-data-cryptocurrencies
Raku package of cryptocurrency data retrieval.
Last synced: 02 Apr 2025
https://github.com/ishanoshada/matplot3dex
A Matplotlib 3D Extension package for enhanced data visualization
data data-science matplotlib python-packages scikit-learn
Last synced: 05 Jan 2026
https://github.com/nesterenko-kv/object-id
ObjectIDs are a special type of identifier mainly used in MongoDB to uniquely identify documents within a collection. They consist of a 12-byte binary value that includes a timestamp, a machine identifier, a process identifier, and a counter.
c-sharp data id net object-id unique-identifier
Last synced: 16 May 2025
https://github.com/sbdk-dev/sbdk.dev
A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.
ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer
Last synced: 27 May 2026
https://github.com/spine-tools/metreload
Python application for downloading meteorological reanalysis data
Last synced: 01 Jul 2025
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/cintia0528/data_analytics_and_visualization-sql_tableau
Evaluate Magist as a strategic partner for Eniac's Brazilian expansion. Use SQL to analyze growth, tech accessory sales potential, delivery times, and customer satisfaction in Magist's database.
data dataanalysis datavisualization sql strategy tableau
Last synced: 31 Mar 2025
https://github.com/idea2app/public-meta-data
HTTP API for Public Meta Data, written in TypeScript & designed for CDN.
api cdn data http meta public typescript
Last synced: 15 Mar 2025
https://github.com/mtingers/opacify
Opacify reads a file and builds a manifest of external sources to rebuild said file.
backup data obfuscation python
Last synced: 18 May 2026
https://github.com/dataship/beam
Get collimate'd data into Frame, in Node or the Browser
column-store data data-science
Last synced: 27 Apr 2026