data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/v-mayya/python-sales-data-analysis
Group project with another team member held by CFG to conduct spreadsheet data analysis of fake sales data using Python
analysis data matplotlib numpy python
Last synced: 29 Apr 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/agavitalis/sample-c-codes
A collection of small projects I carried out on audino as an electronic engineering student despite felling in love with website development.
ageteller atm binary data gpcalculator logging
Last synced: 09 Apr 2025
https://github.com/chrnthnkmutt/theartofstatistic_python
This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/freebirdscrew/datascience_crash_course
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data datascience datascience-crash-course datascience-machinelearning datascientist datasets freebirdscrew matplotlib numpy numpy-library pandas plotly plotly-python python python3 simranjeet simranjeetsingh
Last synced: 29 Apr 2026
https://github.com/zoekelepiri/ota_observatory
A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.
backend data database spring-boot
Last synced: 06 Feb 2026
https://github.com/CheeseWithSauce/HadithsJSONFormat
Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.
api arabic data dev free hadith islam islamic muslim open-source quran sunnah
Last synced: 24 Feb 2026
https://github.com/ariqf1/learn_data
Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.
Last synced: 15 Apr 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/mccarthy-m-g/alda
An R data package for the book "Applied longitudinal data analysis: Modeling change and event occurrence" by Singer and Willett (2003).
data growth-curves longitudinal-data mixed-models nonlinear-mixed-models r r-package structural-equation-modeling survival-analysis time-to-event
Last synced: 19 Jan 2026
https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico
Software em Python para varrer arquivos PDF e extrair parâmetros diretamente para arquivo Excel
analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3
Last synced: 01 May 2026
https://github.com/fairspec/fairspec-typescript
Fairspec TypeScript is a fast data management framework built on top of the Fairspec standard and Polars DataFrames
ckan csv data dataframe dataset excel fair json ods polars quality schema sqlite table typescript validation zenodo
Last synced: 09 Feb 2026
https://github.com/lucien-loua/libgn
Manipulate geographical and administrative data about Guinea.
Last synced: 08 Jun 2026
https://github.com/alejo1630/titanic_kaggle
This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.
data data-science jupyter-notebook notebook python titanic-survival-prediction
Last synced: 03 May 2026
https://github.com/gdhhgnbnvbn/f1-2025-ai-predict
fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.
agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe
Last synced: 01 May 2026
https://github.com/epsoft/deep-learning-for-structured-data
Deep Learning for structured data
concatenate data data-learning dense farsi input load-penguins pandas persian structure structured-data subtract tensorflow
Last synced: 01 May 2026
https://github.com/syed-bilal-haider-engineer/interview_questions
Interview Questions
data database interview-questions javascript oop operating-system reactjs structure technical
Last synced: 01 May 2026
https://github.com/gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
benchmark benchmark-datasets clustering data dataset datasets machine-learning
Last synced: 16 Mar 2025
https://github.com/ggeop/multiple-fields-management
Fields management from/to different data sources. :bulb:
data data-engineering data-organization data-retrieval data-science pandas python
Last synced: 01 May 2026
https://github.com/itu-helper/data-updater
Periodically scrapes data related to ITU to be used by anyone. This data powers the ITU Helper web sites.
data istanbul-technical-university scraper selenium-python
Last synced: 29 Jan 2026
https://github.com/qbicsoftware/research-data-management
Documentation about the life science research data management at QBiC
data data-management data-stewardship documentation hacktoberfest life-science management metadata rdm reasearch-data-management
Last synced: 30 Jan 2026
https://github.com/jinsyin/datagovernance
公众号:「数据之道」
data data-governance datagovernance governance
Last synced: 30 Jan 2026
https://github.com/jahilldev/immutable-parsejs
Parse a JS object or array/map into an Immutable collection. Makes use of ImmutableJs List, and Record primitives.
data immutablejs javascript json nodejs parse typescript
Last synced: 13 Apr 2026
https://github.com/sandk21/etude_eau_potable_monde
Etude sur l'accès à l'eau dans le monde - Tableaux de bord avec Tableau
analysis data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/arif-miad/titanic-analysis
artificial-intelligence data data-science deep-neural-networks
Last synced: 09 Jun 2026
https://github.com/genert/metis
Asynchronous data sender library
analytics asynchronous data dependency-free typescript
Last synced: 27 Jan 2026
https://github.com/kirkalyn13/portfolio-dashboard-site
Portfolio Site; Initially a Service Provider Metrics Dashboard using React.
dashboard data data-visualization react
Last synced: 15 Apr 2026
https://github.com/shogunbanik18/budgetify
End-to-End Budget Analysis enables effective budgeting through detailed analysis and strategic planning
analysis data data-engineering data-exploration databricks databricks-notebooks etl etl-process python3
Last synced: 09 Jun 2026
https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer
Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.
convert converter data data-science deserialize json lightweight powershell serializer
Last synced: 04 May 2026
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/perceptronv/miscellaneous
A huge variety of materials, mostly training data for AI. Not a lot of source code yet.
data gan machine-learning nlp text-generation
Last synced: 04 May 2026
https://github.com/kucingkode/dmerge
Small javascript library to help you merge same formatted data in a string
cithak data data-merge javascript library lightweight lightweight-javascript-library merge open-source
Last synced: 04 May 2026
https://github.com/kenmwaura1/nuvo-data-cleaning-functions
Collection of scripts and functions to clean and preprocess data using Nuvo SDK.
Last synced: 04 May 2026
https://github.com/eve-ning/osumania_data
processed osu!mania data from osu!API
Last synced: 24 Feb 2026
https://github.com/bredalis/datastructure
📚 Estructuras de Datos en Python
algorithms data data-structure python
Last synced: 12 Apr 2026
https://github.com/nfaltir/dataxplorer
🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.
data data-science python streamlit
Last synced: 05 May 2026
https://github.com/tylerben/data-spring
Easily generate a dummy dataset based on a provided config
data data-spring datagenerator fake-data generator javascript typescript
Last synced: 27 May 2026
https://github.com/danielrosehill/monetised-ghg-emissions
Calculating monetised GHG emissions for various companies based upon disclosure data
data sustainability sustainability-data
Last synced: 07 Sep 2025
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/cworld1/novel-data
The data repository of novel analysis
Last synced: 01 Feb 2026
https://github.com/bredalis/numpy
✨ Library to work with arrays ✨
arrays data matrix numpy numpy-arrays numpy-library python
Last synced: 06 May 2026
https://github.com/hasnocool/war_thunder_data_scraper
A web scraping tool designed to extract valuable data from War Thunder, a popular online game.
data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war
Last synced: 06 May 2026
https://github.com/6km/islamic-data-repository
مستودع البيانات الإسلامية - قائمة بالموارد التي قد تفيد المبرمجين في تطوير التطبيقات ومواقع الويب.
data fonts hadeeth json quran quran-json
Last synced: 06 May 2026
https://github.com/player29879/neum-ai
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors
Last synced: 18 Apr 2026
https://github.com/xtao-org/tree-annotation
What is TAO
annotation data intercommunication json notation s-expressions simplicity syntax tao tree tree-annotation universal xml
Last synced: 25 May 2026
https://github.com/montanaz0r/imdb-ratings-auto-inserter
A Python script that enables auto-inserting movie ratings into the IMDB profile.
data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping
Last synced: 07 May 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus Data-Analysis with Live Data Streaming from the Website and Made a DASH Web-App at Last.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/desininja/data-engineer-interview-questions
This repository contains all the Data Engineer Interview Questions asked by interviewers.
data data-engineer-interview-questions
Last synced: 31 Mar 2025
https://github.com/ompreetham/dcn-network-traffic-anomaly-detection
Data Communication Networks - Network Traffic Anomaly Detection
anomaly anomaly-detection communication data dcn keras learning machine machine-learning network pandas presentation project python scikit-learn tensorflow traffic
Last synced: 08 Apr 2026
https://github.com/xpotify/scraper
Scraper designed for Xpotify's client to gather information from websites🌟
axios cheerio data javascript scraper webscraper
Last synced: 07 Jul 2025
https://github.com/augustoarraes/corais
App Python de Monitoramento de vida marinha de Recife de Corais 🪸
coral data iot matplotlib pandas python streamlit
Last synced: 07 May 2026
https://github.com/raymondcm/strawberrydata
Tool suite for fast multi-camera strawberry data collection project. The standards document houses cross compatibility/purpose implementation details.
camera cpp data intel multi-camera
Last synced: 08 Feb 2026
https://github.com/athul64/powerbi
Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.
data data-an data-visualization dax dax-expression powerbi
Last synced: 23 Feb 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive streamlit covid 19 Dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 May 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materiály ku kurzu Jazyk SQL 1 pre Analytikov a Dátových Vedcov
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/jeanmanguy/milk-sci-fi
Census of every mention of milk in sci-fi works.
Last synced: 26 Feb 2026
https://github.com/n0nag0n/flee-intercom
For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.
again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year
Last synced: 09 May 2026
https://github.com/pharo-ai/data-preprocessing
Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.
data pharo pharo-smalltalk preprocessing smalltalk
Last synced: 09 Feb 2026
https://github.com/danielbello7/nosql-json-database
Simple and quick database to help development process and speed
data database json json-database models nosql nosql-database nosql-json-database schema
Last synced: 09 May 2026
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/bastianolea/comisarias_chile
Base de datos con las comisarías, retenes, tenencias y otras instalaciones de Carabineros
Last synced: 23 Jun 2025
https://github.com/codenoid/alodokter.com-database
a Alodokter.com Database, collected by Hofesh Bot (Scrapper)
alodokter data extraction hofesh
Last synced: 18 Mar 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/stdlib-js/ndarray-base-dtypes2signatures
Transform a list of array argument data types into a list of signatures.
api array base data dtype dtypes interface javascript multidimensional ndarray node node-js nodejs sig signatures stdlib types utilities utility utils
Last synced: 14 Apr 2026
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 13 Apr 2026
https://github.com/devathul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 09 May 2026
https://github.com/jhpoelen/bats
self-documenting data publication on Bat (Chiroptera) specimen
biodiversity data natural-history-collections provenance specimen
Last synced: 18 Mar 2026
https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis
This project aims to explore the Walmart Sales data to understand top performing branches and products, sales trend of of different products, customer behaviour. The aims is to study how sales strategies can be improved and optimized. The dataset was obtained from the Kaggle
data database mysql sql walmart
Last synced: 24 Feb 2026
https://github.com/kooltuoehias/fi-insider
cloudflare cloudflare-pages data finance insider react sweden
Last synced: 29 Jun 2026
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/itrauco/robots
ai, machine learning, and robots...
ai artificial-intelligence automation big-data cloud cloud-engineering data data-engineering data-science data-science-projects m machine machine-learning ml prompts robots
Last synced: 11 Jun 2026
https://github.com/williamwutq/bllist
Durable, crash-safe, checksummed block-based linked list allocators stored in a single file
data data-storage data-structure database file-based linkedlist
Last synced: 25 Jun 2026
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026