data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/cdapio/website
CDAP IO website
analytics applications cdap cdapio data data-analytics data-integration hugo integration metadata oss rules-engine
Last synced: 18 Jun 2025
https://github.com/raghavendranhp/credit_card_fraud_detection
This repository contains code for a credit card fraud detection model using autoencoders and logistic regression, achieving 95.3% accuracy.
anomaly-detection autoencoder-neural-network credit-card-fraud data keras logistic-regression machine-learning preprocessing tensorflow
Last synced: 04 May 2026
https://github.com/jayantur13/kountry
Node module variant of the Country API
api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn
Last synced: 26 Jan 2026
https://github.com/ariqf1/learn_data
Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.
Last synced: 15 Apr 2026
https://github.com/ncgl-git/eriparse
Python code to parse the cost-of-living HTML from erieri.com, i.e. https://www.erieri.com/cost-of-living/united-states/illinois/chicago
cost-of-living crime crime-data data economic-research-institute erieri webscraper
Last synced: 14 Jan 2026
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/ishaansathaye/data40x-1_2_3
Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab
capstone-prep data data-science ethics lab python
Last synced: 04 May 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model for your database from its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/simranjeet97/datastructures_algoritms_python
Data Structures and Algorithms using Python
algorithms arrays arrays-and-strings coding data data-science data-structures datastructures-python hashing interview-preparation interview-questions linked-list python stacks stacks-as-an-array
Last synced: 09 Apr 2026
https://github.com/unicef/magicbox-download-shapefiles
Downloads shapefiles for each country from gadm.org and unzips them.
data data-science docker downloads-shapefiles emergency-response gadm geospatial geospatial-data humanitarian javascript magicbox nodejs shapefile unicef
Last synced: 02 May 2026
https://github.com/aiwithqasim/competitive-programming
A collection of all the material I have worked through, and will keep adding to, in order to sharpen my programming skills and become a competitive programmer.
c-plus-plus code data java programming structured-data
Last synced: 20 May 2026
https://github.com/themost-framework/memory
MOST Web Framework in-memory data adapter for testing environments
Last synced: 01 Jul 2026
https://github.com/CheeseWithSauce/HadithsJSONFormat
Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.
api arabic data dev free hadith islam islamic muslim open-source quran sunnah
Last synced: 24 Feb 2026
https://github.com/arif-miad/titanic-analysis
artificial-intelligence data data-science deep-neural-networks
Last synced: 09 Jun 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
Please see the README below.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/jcasbin/jcasbin-menu-permission
Casbin Menu Permission Example (Based on jCasbin)
abac acl auth authorization authz casbin data go java jcasbin menu permission rbac spring springboot
Last synced: 11 Jul 2025
https://github.com/ngambip/priscilla
About my work and experience
accounting analytics data finance-management
Last synced: 03 Feb 2026
https://github.com/stefanbohacek/fediverse-account-analyzer
bots botsinspace data dataviz fediverse mastodon
Last synced: 02 May 2026
https://github.com/GiveMePseudonyms/PiVisualisations
A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.
data data-visualization pi pygame python self-organising-criticality tkinter
Last synced: 08 Apr 2025
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/danielgiljam/orbit-utils
A collection of utility packages for Orbit.js.
data inference orbit orbitjs schema synchronization type typescript validation zod
Last synced: 01 May 2026
https://github.com/open-i18n/data-unicode-math
Git mirror for Unicode Support for Mathematics data
data i18n internationalization math mathematics open-i18n unicode unicode-consortium unicode-data
Last synced: 11 Mar 2026
https://github.com/s-raza/csvio
Wrapper for conveniently processing CSV files
csv data file processing wrapper
Last synced: 14 Jan 2026
https://github.com/ddeutils/ddedocs
📖 Data Developer & Engineer Documents and Hands-On
blogs data data-engineering documents hands-on
Last synced: 08 Aug 2025
https://github.com/eshaagarwa/hr-analytics-project
Explore our HR Analytics Dashboard, a powerful Power BI project designed for HR managers and leaders. It analyzes essential KPIs such as Employee Count, Attrition Rate, and Job Satisfaction across various demographics.
dashboard data data-visualization dataanylasis ms-excel ms-excel-data-analytics powerbi statistics
Last synced: 23 Jan 2026
https://github.com/wangshouh/cryptofinancedata
An ipynb file for acquiring data on futures, options, and other financial derivatives
Last synced: 05 Oct 2025
https://github.com/athari22/house_sales_in_king_count_usa
This project performs a data analysis for a Real Estate Investment Trust that would like to start investing in residential real estate.
analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library
Last synced: 01 May 2026
https://github.com/epsoft/deep-learning-for-structured-data
Deep Learning for structured data
concatenate data data-learning dense farsi input load-penguins pandas persian structure structured-data subtract tensorflow
Last synced: 01 May 2026
https://github.com/helins/ex.clj
Java exceptions as Clojure data
clojure data exception java java-exceptions
Last synced: 12 Dec 2025
https://github.com/igorskyflyer/npm-adblock-header-extract
✂️ Parse and extract ad-block filter list headers with ease. Works on strings or files, trims whitespace, and returns clean metadata for tooling and automation. 📃
adblock back-end biome data filter header igorskyflyer javascript js metadata node nodejs npm string ts typescript utility
Last synced: 11 Mar 2026
https://github.com/divanny/academixbackend
🧑‍🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.
backend csharp data net webapi
Last synced: 07 Jun 2026
https://github.com/thechibuzornwachukwu/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 16 Nov 2025
https://github.com/shubham14p3/python-word-cloud
Simple Python application to create a word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Records data on students who have committed violations, ready to be logged, disciplined, and sanctioned.
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/rayenfathallah/students_analysis
This project contains an analysis of the different factors affecting students' performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.
analysis d3 data education javascript python students
Last synced: 12 Apr 2026
https://github.com/idea2app/public-meta-data
HTTP API for Public Meta Data, written in TypeScript & designed for CDN.
api cdn data http meta public typescript
Last synced: 15 Mar 2025
https://github.com/dev-owdenmag/dataflow-manager
A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.
data data-management data-management-platform data-visualization python
Last synced: 30 Mar 2025
https://github.com/miniql/miniql-express-mongodb-example
A MiniQL example for querying a MongoDB database through an Express REST API.
data database mongodb query query-language
Last synced: 19 Apr 2026
https://github.com/timclicks/dataclerk
zero fuss data logging over HTTP
actix-web command-line data logging rust sqlite sqlite3 utility
Last synced: 30 Apr 2026
https://github.com/hyperversal-blocks/averveil
Averveil is OpenSea for Data.
blockchain data golang iot privacy zero-knowledge zkp
Last synced: 14 Jan 2026
https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data
This is an end-to-end ML project that aims to develop a classification model for assigning a given customer profile to one of two risk categories (safe or not safe). The final classifier used for this project is the CatBoost classifier. Deployed on AWS.
aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python
Last synced: 03 Jan 2026
https://github.com/freebirdscrew/datascience_crash_course
A data science crash course that explains each and every process in data science.
dash data datascience datascience-crash-course datascience-machinelearning datascientist datasets freebirdscrew matplotlib numpy numpy-library pandas plotly plotly-python python python3 simranjeet simranjeetsingh
Last synced: 29 Apr 2026
https://github.com/soenneker/soenneker.quark.table
A native Blazor table component.
blazor blazorlibrary csharp data dotnet html quark quarktable table tables
Last synced: 13 Aug 2025
https://github.com/kevinsames/microsoft-fabric-data-platform-template
A GitHub starter repository for building modern Data Engineering, ML, and AI solutions on Microsoft Fabric. Includes medallion architecture (Bronze → Silver → Gold), Spark Notebooks, dbt, MLflow, GitHub Actions CI/CD, and arc42-based documentation.
data dbt fabric microsoft python spark
Last synced: 29 Apr 2026
https://github.com/stdlib-js/array-filled-by
Create a filled array according to a provided callback function.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Mar 2026
https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali
Manual for the service information repository of the local government pilots on employment
data drupal drupal-9 unemployment
Last synced: 24 Jan 2026
https://github.com/chrnthnkmutt/theartofstatistic_python
Python visualizations based on David Spiegelhalter's book The Art of Statistics.
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm taking the Data Structures and Algorithms I (C949) class in school and decided to write up all of these searching algorithms, sorting algorithms, structures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/azevedontc/datapulse
DataPulse
automation brazil cli data data-analysis matplotlib meteorology open-meteo pandas prevision pycharm python python3 reports venv weather
Last synced: 29 Apr 2026
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/the-aerospace-corporation/pivt
PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.
analytics dashboards data devops jenkins pipeline python splunk visualization
Last synced: 29 Apr 2026
https://github.com/connectomicslab/cmtklib-data
Datalad dataset that stores all data resources of the cmtklib module of Connectome Mapper 3 (https://github.com/connectomicslab/connectomemapper3).
brain data parcellation resources software
Last synced: 16 Jan 2026
https://github.com/jackosheadev/databasetechproject
This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction
Last synced: 18 May 2026
https://github.com/rafaelfloressouza/Covid-19-Dashboard
Python web application to display COVID19 data from the world using Plotly and Dash
bootstrap covid-19 css data datavisualization plotly-dash python3
Last synced: 10 Mar 2025
https://github.com/tushar2704/insurance-cross-sell
This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.
data datascience h20automl machine-learning mlflow python streamlit-tushar2704
Last synced: 08 Oct 2025
https://github.com/cintia0528/data_science-ab_testing
Conduct a 5-way AB Test on Montana State University Library's website, comparing the original "Interact" button with new versions ("Learn," "Help," "Connect," "Services") to boost user engagement.
abtesting bonferroni chisquare-test data data-science datacleaning datavisualization hypothesis-testing mde statistics
Last synced: 31 Mar 2025
https://github.com/nikoshet/rust-dms-cdc-operator
The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.
aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation
Last synced: 18 Jan 2026
https://github.com/rdjarbeng/rdjarbeng
Richard Djarbeng's GitHub profile: computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to the website below.
data jekyll machine-learning ruby website
Last synced: 28 Apr 2026
https://github.com/saulojoab/crato-ce-json
In this repository I will store all the neighborhoods (and more information, in the future) of Crato-CE in JSON.
data database geolocation json json-api localization
Last synced: 28 Apr 2026
https://github.com/jtpio/data-playground
Experiments using public APIs and data
Last synced: 28 Apr 2026
https://github.com/ryanjoy0000/yt-notifier
YouTube Notifier (Telegram Bot) - A real-time data processing pipeline
data go kafka-streams real-time telegram-api youtube-api
Last synced: 14 Jan 2026
https://github.com/horisystems/uk_ev_data_analysis
Analysis of Electric Vehicle charging infrastructure in the United Kingdom.
data data-science electric-vehicles ev python uk united-kingdom
Last synced: 12 Jan 2026
https://github.com/mewmix/drivehound
magic file signatures + python drive recovery magic
data disk file-signatures harddrive python recovery recovery-tool
Last synced: 08 Oct 2025
https://github.com/karthikmprakash/github_repos_scraper
A tool to extract the names of any user's GitHub repos
automation bs4 data github python repositories requests webscraping
Last synced: 27 Apr 2026
https://github.com/jorgeatgu/casa-caida-bot
Twitter bot about depopulation in Aragón
aragon bot data data-viz despoblacion twitter-bot
Last synced: 11 Aug 2025
https://github.com/patrikmasiar/algorythm-of-the-night
Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻‍💻
algorithms data interview-questions logic logic-programming math mathematics science
Last synced: 27 Oct 2025
https://github.com/zalweny26/open_data_unipa
Project for the Algorithms Laboratory exam 23-24, UniPa, Computer Science L-31
Last synced: 26 Apr 2026
https://github.com/ahmad-ali-rafique/pyviznotebook
PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.
analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization
Last synced: 06 Jun 2026
https://github.com/chriseaton/sample-database
A long-term supported sample dataset for file and database unit testing and validation. Simple, straight-forward, raw data shared across formats.
data database examples flat-file samples schema unit-testing
Last synced: 25 Apr 2026
https://github.com/sbdk-dev/sbdk.dev
A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.
ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer
Last synced: 27 May 2026
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer
Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.
analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob
Last synced: 17 Apr 2026
https://github.com/yessasvini23/accenture_-social-buzz-data-analytics-virtual-programme-forage
Accenture Data Analytics and Visualization - Virtual Internship
accenture content data dataanalytics excel forge socialbuzz
Last synced: 18 Jan 2026
https://github.com/ginga1402/chinook_database
Microsoft SQL Server Management Studio
business-query data sql-server
Last synced: 30 Mar 2025
https://github.com/bredalis/functionalprogrammingpython
💻 Functional Programming in Python
data functional-programming functions programing programming-language python structured-data
Last synced: 06 Jun 2026
https://github.com/yord/klp-core
A plugin with basic operations for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 24 Apr 2026
https://github.com/sebastianbrzustowicz/collision-detection-ai
Python + TensorFlow. Repository for training a machine learning model for collision detection from accelerometer sensor data.
accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow
Last synced: 24 Apr 2026
https://github.com/mihaiconstantin/lavot
A `React` application that allows users to indicate how votes will be redistributed among candidates for the second round of Romanian presidential elections.
data data-visualization elections react sankey typescript
Last synced: 06 Feb 2026
https://github.com/danish-foundation-models/dfm-processing
Toolkit for processing data in the Danish Foundation Models project.
Last synced: 02 Jul 2025