data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/joeyism/py-cifar10
This library was created to allow an easy usage of CIFAR 10 DATA. This is a wrapper around the instructions givn on the CIFAR 10 site
cifar cifar-10 cifar10 data machine-learning machinelearning
Last synced: 30 Jul 2025
https://github.com/asuozzo/medicare-data-analysis
An analysis of Medicare Part D data in Vermont
Last synced: 04 May 2026
https://github.com/millengustavo/salarios-data-science
Aplicativo Streamlit de exploração dos dados da Pesquisa de mercado de Data Science feita pelo Data Hackers
brasil brazil ciencia-de-dados data data-science heroku salarios salary
Last synced: 07 Oct 2025
https://github.com/visenger/prada
Profiling Datasets
cleaning data dataset profiling
Last synced: 24 Aug 2025
https://github.com/derrickbaruga7/python-data-analysis
This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.
analytics data data-science visualization
Last synced: 31 Jul 2025
https://github.com/flowsynx/plugin-postgresql
FlowSynx plugin to interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.
data database flowsynx postgresql postgresql-database sql
Last synced: 09 May 2026
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study-A better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/ajsalemo/python-pandas-datalib
Testing and experimenting with some simple Pandas functionality using Flask to serve the parsed data.
csv data flask json pandas pandas-dataframe pandas-series python tabular tabular-data terminal
Last synced: 09 Apr 2026
https://github.com/tonykipkemboi/ens_subgraph_data
Query On-Chain Data from Subgraphs by The Graph Protocol using Python
data subgraphs thegraphprotocol web3
Last synced: 17 Sep 2025
https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint
A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT
bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs
Last synced: 25 Sep 2025
https://github.com/jimbrig/jimstaskviews
CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com
cran data docs rstats shiny-app submodules task-views
Last synced: 06 Mar 2026
https://github.com/v6ntage/sql-sales_data-analytics-project
This repository contains a SQL scripts demonstration analytical techniques.
analytics business-analytics data data-analysis database query sql sql-server
Last synced: 12 Apr 2026
https://github.com/DataHerb/dataherb-flora
DataHerb Flora: The core of DataHerb
data data-mining data-science datascience dataset datasets
Last synced: 08 May 2025
https://github.com/ate329/nsl-kdd-feature-extractor
Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.
cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset
Last synced: 30 Oct 2025
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026
https://github.com/fjc0k/vue-merge-data
Intelligently merge data for Vue render functions.
data merge-data render-functions vue
Last synced: 17 May 2026
https://github.com/mikebairdrocks/fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 17 May 2026
https://github.com/prioritizr/prioritizrdata
Conservation planning data sets
Last synced: 19 Jul 2025
https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning
The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/
airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn
Last synced: 30 Dec 2025
https://github.com/inzhenerka/scooters_data_uploader
Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех
Last synced: 04 May 2026
https://github.com/am-i-groot/summer-intern-iitguwahati-spml
Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.
algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing
Last synced: 17 May 2026
https://github.com/sevmardi/data-mining-hacks
Hacks in Data Mining
data data-mining data-mining-algorithms python3
Last synced: 18 Jul 2025
https://github.com/muhammad-fiaz/ason
ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.
adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3
Last synced: 02 Feb 2026
https://github.com/saboye/web-scraping-with-python
A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.
beautifulsoup csv data data-harvesting data-mining python request web webscraping
Last synced: 18 Jul 2025
https://github.com/giscience/measures-rest-oshdb-docker
Scripts for starting measures for geospatial datasets in docker container, using the OSHDB
data dggs docker geospatial mesure openstreetmap rest
Last synced: 18 Apr 2026
https://github.com/yourdataarchitect/french-realestate-data-pipeline
This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.
airlfow data datanalysis datapipeline market-intelligence real-estate
Last synced: 31 Dec 2025
https://github.com/coderooz/hr-dashboard
The goal of this project is to create a power bi dashboard to showcase the attrition data within the company.
Last synced: 07 Jan 2026
https://github.com/webianks/anotech-android
Android application which deals on various anomalous behaviour that occur on server data.
Last synced: 13 Apr 2025
https://github.com/alexdonh/adonis-cache
Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!
adonis-framework adonisjs cache data dependency redis storing
Last synced: 15 May 2026
https://github.com/amethyst-php/value
amethyst amethyst-package api data laravel value
Last synced: 17 May 2026
https://github.com/pyrustic/jayson
Intuitive interaction with JSON files [DEPRECATED, check the project Shared]
Last synced: 17 May 2026
https://github.com/hemangsharma/bookingdataanalysisreport
The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.
analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard
Last synced: 14 May 2026
https://github.com/fliplet/fliplet-widget-data-source-query
Data Source Query Provider
Last synced: 11 Apr 2025
https://github.com/samharrison7/datamapper
Making mapping between datasets as simple as possible.
data data-mapper data-mapping data-science data-structures
Last synced: 17 Mar 2025
https://github.com/srindot/average_flightdata_collection_fwuav
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 18 Sep 2025
https://github.com/a-poor/taro
A package for repeatable rectangular data transformations in Python.
data data-science data-transformation pipeline pypi-package python
Last synced: 13 Oct 2025
https://github.com/sourceduty/text_file_metadata
📄 Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/encoreshao/data-science
Data analyze examples, using Jupyter notebook and Python!!!
data dataanalysis encore jupyter-notebook
Last synced: 29 Mar 2025
https://github.com/kylepw/multistack
Example of multiple stacks in one array.
algorithms array data data-structures python stack
Last synced: 17 Mar 2025
https://github.com/sharoonjoseph321/social_media_eda
Data Analysis on social media apps ,using pandas, python, matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/pulgamecanica/d3examples
https://www.oreilly.com/library/view/d3-for-the/9781492046783/
d3 d3-visualization d3js d3v4 data javascript
Last synced: 19 May 2026
https://github.com/kameronbrooks/datalys2-reporting
Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.
data data-visualization html react
Last synced: 08 Apr 2026
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/farovictor/mongodbloader
This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.
Last synced: 15 May 2026
https://github.com/robsteranium/user2022-ldf-talk
Slides from my useR! 2022 talk about the Linked-Data Frames package
data data-frame linked-data r rdf
Last synced: 19 Apr 2025
https://github.com/shahules786/titanic-analysis
different analysis of titanic accident (data from kaggle)
Last synced: 26 Jun 2025
https://github.com/sofyan48/wahoo
Data stream library with kinesis
aws data data-stream event kinesis stream
Last synced: 14 May 2026
https://github.com/jigyasag18/financial-risk-analysis-project
The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics
data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis
Last synced: 06 Mar 2026
https://github.com/ayush1999/data-mining
data mining natural-language-processing
Last synced: 10 Sep 2025
https://github.com/stdlib-js/array-base-index-of-same-value
Return the index of the first element which equals a provided search element according to the same value algorithm.
array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types
Last synced: 15 May 2026
https://github.com/jankapunkt/meteor-reactive-data-structures
Collection of verious reactive data sructures for MeteorJS
data data-structures graph linked-list list meteor meteorjs queue reactive reactivity stack tree
Last synced: 17 May 2026
https://github.com/mawiegand/automatic-point-label-placement-data
Test instances for the automatic point label placement problem.
data datastructures generator javascript labeling problem ruby
Last synced: 16 May 2026
https://github.com/toluwaa-o/stears-lite-overview
Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.
africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview
Last synced: 14 May 2026
https://github.com/dina-hosny/sequence-trigger-pair-for-all-schema-tables-plsql
A PLSQL script that creates Sequence Trigger Pair for all Schema's Tables
data oracle plsql sequence sequencetrigger sql toad trigger
Last synced: 06 Mar 2026
https://github.com/tpetzoldt/datasets
teaching data sets
data data-analysis-in-r teaching-materials
Last synced: 16 Feb 2026
https://github.com/amarlearning/exploring-the-evolution-of-linux
Data Analysis about the development of the Linux operating system by exploring its Git repository history.
cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux
Last synced: 12 May 2026
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/lord3008/instances-of-data-analysis
This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.
Last synced: 03 Mar 2025
https://github.com/francois-lenne/portofolio_flenne_streamlit
portofolio francois lenne using streamlit
data portofolio python slack-api streamlit
Last synced: 15 May 2026
https://github.com/eyluldursun/data-science-project
This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.
data data-science obesity r rmarkdown
Last synced: 26 Jun 2025
https://github.com/sourceduty/cults_3d
🔢 Software concept for additional statistics from Python for Cults design data .csv files.
3d 3d-model 3d-model-software 3d-modelling account account-management concept cults cults-3d data idea sourceduty
Last synced: 08 Aug 2025
https://github.com/muhammadadilnaeem/data-science-materials
This repository will contain basic source code and materials related to Data science.
artificial-intelligence artificial-neural-networks calculus data data-science deep-learning deep-neural-networks machine-learning machine-learning-algorithms mathematics nlp-machine-learning projects statistics
Last synced: 07 May 2025
https://github.com/solrikk/vargen
VarGen (Variation Generator) is a user-friendly desktop application designed to simplify the creation of product variations from CSV files.
csv-files csv-format csv-parser data data-engineering excel excelparser python
Last synced: 29 Mar 2025
https://github.com/jhwa426/database
SQL, MSSQL, MongoDB Database
data data-warehouse data-wrangling database datamodeling entity-relationship-diagram normalization sql sqlite3 ssms
Last synced: 06 Apr 2025
https://github.com/mysociety/sync-ep-to-jkan
Syncs EveryPolitician data to mySociety's data portal.
data everypolitician jkan politicians
Last synced: 27 Jul 2025
https://github.com/amethyst-php/opening-hour
amethyst amethyst-package api data laravel opening-hour
Last synced: 19 May 2026
https://github.com/gunn/covid-19-scripts
Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries
covid-19 data geography typescript
Last synced: 17 May 2026
https://github.com/pedelriomarron/spanish-api-covid19
Data from Spain of COVID-19 (by Datadista) as a service
api covid-19 covid-19-spain data now spain zeit
Last synced: 12 Mar 2025
https://github.com/ericgio/history-of-jazz
Data and visualizations based on Ted Gioia's "The History of Jazz"
Last synced: 28 Mar 2025
https://github.com/amethyst-php/warehouse
amethyst amethyst-package api data laravel warehosue
Last synced: 19 May 2026
https://github.com/akashlogics/street-data-tracking
Detect, Track and Count number of persons walking across the path(s) making use of YOLO. This Python project tracks people moving across predefined street zones
analysis data excel newdataset object-detection opencv python python3 yolo
Last synced: 19 May 2026
https://github.com/amethyst-php/recipe
amethyst amethyst-package api data laravel recipe
Last synced: 19 May 2026
https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm
The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.
algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script
Last synced: 17 May 2026
https://github.com/buildinamsterdam/contentful-graphql
Contentful GraphQL connection
Last synced: 05 Jan 2026
https://github.com/madihanazir/ds-using-c
Basic insights into Data Structures (inspired by Abdul Bari course but in C language)
data self-learning structures-in-c
Last synced: 17 Mar 2025
https://github.com/dan149/uselesscontentcreator
Useless Content Creator (UCC) is a fake content generator, text, html and pdf files.
content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3
Last synced: 03 Apr 2025
https://github.com/brunosalerno/osm_data
Ruby objects for dealing with OSM data, and generating XML files
Last synced: 21 Apr 2026
https://github.com/garcane/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 29 Oct 2025
https://github.com/ezeparziale/analisis-uso-bicicletas-caba
:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.
data data-science data-visualization
Last synced: 14 Mar 2025
https://github.com/ezeparziale/analisis-data-delitos
:gun: Analsis de delitos de CABA
Last synced: 14 Mar 2025
https://github.com/official-imvoiid/multifetch
A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection
aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows
Last synced: 19 May 2026
https://github.com/thedevreda/jadaerospace
A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/webdevcave/collections-php
A PHP library for managing collections of data with support for nested keys.
array collection data helper library nested-keys package php utility utility-classes
Last synced: 28 Jun 2026
https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning
Best Master’s Programs in Machine Learning (ML) for 2021 These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning
ai api data education project school
Last synced: 17 Jun 2025
https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial
Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI
dagster data data-orchestration kedro luigi mageai prefect
Last synced: 18 Apr 2026