data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/real-veersandhu/cia-country-comparison
Data analysis system on the CIA World Factbook
Last synced: 25 Feb 2025
https://github.com/simranjeet97/datascience_crashcourse
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data data-science data-science-crash-course data-structures data-visualization datascience-machinelearning datasciencecoursera datascienceproject instagram matplotlib numpy pandas telegram tutorials youtube
Last synced: 08 Apr 2026
https://github.com/aboualine/sql-formation
Library Management System Database: A MySQL project with tables, triggers, stored procedures, and views for managing books, members, and borrowings. Includes sample data for testing. Ideal for learning SQL or building a library app.
data database library-management-system mysql sql system
Last synced: 18 Apr 2026
https://github.com/marxmit7/kaggle
Kaggle competitions
data kaggle kaggle-competition
Last synced: 19 May 2026
https://github.com/luminati-io/pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 17 Mar 2025
https://github.com/erinaldi/bmn2-lattice
Data analysis of lattice Monte Carlo simulations of quantum matrix models.
data data-science data-visualisation lattice
Last synced: 27 Mar 2025
https://github.com/danieljdufour/easy-file-saver
Very Easily Save a File
csv data download file file-saver javascript js json save
Last synced: 21 Apr 2026
https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days
I will upload my daily Python DSA problems solved on GeeksforGeeks and post it here!
algorithms-and-data-structures and data data-structures dsa python python3 structure
Last synced: 08 May 2025
https://github.com/denko5/sales-analysis
A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.
africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql
Last synced: 24 Jan 2026
https://github.com/agustinmusanti/sqlchallenge-2
This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.
Last synced: 03 Apr 2025
https://github.com/LisaKey/convert-csv-to-sav
We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.
analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations
Last synced: 03 Mar 2025
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/aditya172926/blockchain_indexers
Indexers to fetch data from blockchain events and transactions data with their parameters
Last synced: 02 Aug 2025
https://github.com/shgysk8zer0/schema
A PHP implementation of schema.org structured data objects
data microdata schema seo structured-data
Last synced: 24 Jun 2025
https://github.com/sandravizz/global_inequality_story
Dataviz Project about Global Inequality
data data-visualization inequality
Last synced: 03 Jul 2025
https://github.com/kevinsames/spark-fuse
spark-fuse is an open-source toolkit for PySpark — providing utilities, connectors, and tools to fuse your data workflows together.
data databricks fabric pyspark python spark
Last synced: 08 May 2026
https://github.com/jimbrig/jimstaskviews
CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com
cran data docs rstats shiny-app submodules task-views
Last synced: 06 Mar 2026
https://github.com/giscience/measures-rest-oshdb-docker
Scripts for starting measures for geospatial datasets in docker container, using the OSHDB
data dggs docker geospatial mesure openstreetmap rest
Last synced: 18 Apr 2026
https://github.com/Greatwoman23/Market-Basket-Analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 04 May 2025
https://github.com/saboye/web-scraping-with-python
A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.
beautifulsoup csv data data-harvesting data-mining python request web webscraping
Last synced: 18 Jul 2025
https://github.com/dostuffthatmatters/circadian-scp-upload
Resumable, interruptible, SCP upload client for any files or directories generated day by day
checksum daily data directories files library python scp ssh synchronization time-series upload utilities
Last synced: 24 Jun 2025
https://github.com/sevmardi/data-mining-hacks
Hacks in Data Mining
data data-mining data-mining-algorithms python3
Last synced: 18 Jul 2025
https://github.com/r-mahesh45/hr---resume-text-classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 12 Sep 2025
https://github.com/lmuffato/project-restaurant-orders-trybe
Projeto restaurant orders - Projeto avaliativo da Trybe do Bloco 36: Estrutura de Dados I: Arrays, Hashmaps e Sets
array array-set csv data data-analysis hashmap python set trybe trybe-projects
Last synced: 13 Sep 2025
https://github.com/incubrain/awesome-maharashtra-data
A collection of datasets specific to Maharashtra, India. WIP
ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi
Last synced: 23 May 2026
https://github.com/am-i-groot/summer-intern-iitguwahati-spml
Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.
algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing
Last synced: 17 May 2026
https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning
The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/
airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn
Last synced: 30 Dec 2025
https://github.com/lstedmanfalls/betterself
Python / Django eHealth web app for behavior change programs
ajax bcrypt behavioral-sciences data django ehealth hashing javascript jquery likes login-registration motivational-quotes python salting sqlite users web-design
Last synced: 06 Apr 2026
https://github.com/aleklukanen/chapterhousedb-example-app
An example application using the ChapterhouseDB processing engine
arrow data database event golang parquet processing stream
Last synced: 18 Apr 2026
https://github.com/stefanbohacek/dataviz-projects
My dataviz projects.
data data-visualization dataviz
Last synced: 08 Jul 2025
https://github.com/lennart080/esp8266-tinyconfig
Esp8266 library to store configuration data
arduino arduino-ide arduino-library config configuration credential-storage credentials data data-config esp8266 esp8266-arduino iot platformio platformio-library
Last synced: 03 May 2026
https://github.com/whatheheckisthis/pwc_project-
Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver
case-study data data-science dataanalytics databases datavisualization powerbi virtual
Last synced: 05 Apr 2025
https://github.com/bastianolea/censo_viviendas
Censo de Viviendas procesado con R para disponibilizarlo con códigos/nombres de comunas, regiones, y etiquetas de sus variables. En formato original (6,5 millones de filas) y en conteo por comunas.
chile comunas data poblacion rural
Last synced: 30 Oct 2025
https://github.com/dineshpinto/geist-finance-subgraph
Subgraph for the Geist Finance protocol on the Fantom blockchain.
assemblyscript blockchain data fantom graphql typescript
Last synced: 17 May 2026
https://github.com/linas/archeo
File Recovery, Integrity and Archive Management
corruption data monitoring recovery
Last synced: 29 Mar 2025
https://github.com/jvrck/australianpayphones
Get Australian payphone data in GeoJSON format.
australia data geojson geojson-data scraper
Last synced: 04 Apr 2025
https://github.com/gbburleigh/quick-seeders
Generate realistic test data quickly with Quick-Seeders, a Python library offering a wide range of data types and schema definitions. Control data variance, probabilities, and output formats, including SQL. Simplify your data seeding process and improve testing efficiency.
data dataset faker generator python seeder sql test
Last synced: 03 Apr 2025
https://github.com/prioritizr/prioritizrdata
Conservation planning data sets
Last synced: 19 Jul 2025
https://github.com/suyashkumar/deeplesion-gcp-loader
Get the DeepLesion CT Image data set into a GCP Storage Bucket
bucket data data-loader data-loading data-science deep-learning deep-lesion deeplesion gcp gcp-bucket loader storage
Last synced: 04 Apr 2025
https://github.com/qeeqbox/data-security
Safeguarding your personal information (How your info is protected)
data data-security infosecsimplified qeeqbox security
Last synced: 19 Mar 2026
https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian
I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively
data data-visualization tableau
Last synced: 17 Feb 2026
https://github.com/reiiyuki/once-data-manager
Once Data Manager is temporary data management utility kit for Unity.
data manager playerprefs preference scene temporary unity
Last synced: 17 May 2026
https://github.com/epogrebnyak/business-conditions-digest-2017
Replicate illustration from Business Conditions Digest
Last synced: 22 Mar 2025
https://github.com/muhammad-fiaz/ason
ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.
adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3
Last synced: 02 Feb 2026
https://github.com/yernaz-togizbayev/microsoft_store_data-analysis
Microsoft Store
data data-analysis data-visualization jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/cainmi/easy-pull-from-repository
A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now
data html javascript python schema
Last synced: 04 Apr 2025
https://github.com/artcc/coredatagenericmodule
Core Data generic module for persist encrypted object
core coredata coredata-model data data-generic database encrypted encrypted-data encryption entity identifier persist protocol swift
Last synced: 08 May 2026
https://github.com/ate329/nsl-kdd-feature-extractor
Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.
cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset
Last synced: 30 Oct 2025
https://github.com/DataHerb/dataherb-flora
DataHerb Flora: The core of DataHerb
data data-mining data-science datascience dataset datasets
Last synced: 08 May 2025
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026
https://github.com/wamphlett/smart-data-objects
An easy solution for capturing and validating data into usable DTO's
data dto forms php php7 validation
Last synced: 17 May 2026
https://github.com/cliffano/volothamp
Random D&D stuffs my son and I dabble with
data dungeons-and-dragons info little-godzilla
Last synced: 06 Apr 2025
https://github.com/fjc0k/vue-merge-data
Intelligently merge data for Vue render functions.
data merge-data render-functions vue
Last synced: 17 May 2026
https://github.com/mikebairdrocks/fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 17 May 2026
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 17 Jul 2025
https://github.com/dsietz/datadot
data multicast plugin-manager plugins rust-lang
Last synced: 07 Sep 2025
https://github.com/amethyst-php/contract
amethyst amethyst-package api contract data laravel
Last synced: 20 May 2026
https://github.com/thomd/git-scrape-hacker-news
scrape hacker news metadata for data analysis
data data-science git-scraping hacker-news
Last synced: 16 Sep 2025
https://github.com/chibuzordev/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 31 Oct 2025
https://github.com/chompfoods/sdk-typescript-fetch
Fetch TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database fetch food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 03 May 2026
https://github.com/tomasfarias/pipeline
A simple data pipeline done as a challenge project
Last synced: 29 Mar 2025
https://github.com/qeeqbox/data-lifecycle-management
Data Lifecycle Management (DLM) is a policy-based model for managing data in an organization
data data-lifecycle-management infosecsimplified lifecycle management qeeqbox
Last synced: 07 Mar 2026
https://github.com/stdlib-js/array-base-any-by-right
Test whether at least one element in an array passes a test implemented by a predicate function, while iterating from right to left.
any array data generic javascript node node-js nodejs predicate some stdlib structure test types validate
Last synced: 14 Apr 2025
https://github.com/inzhenerka/scooters_data_uploader
Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех
Last synced: 04 May 2026
https://github.com/jonsafari/toy-data
Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks
data language-data machine-translation nlp sanity-checks toy-data
Last synced: 06 Nov 2025
https://github.com/lamiaaali/depi-graduation-project
SkinCare Sentiment Analysis Reviews
analytics azure azure-data-factory azure-data-lake azure-databricks azure-synapse-analytics data data-analytics data-engineering machine-learning pyspark python sql ssms unsupervised-learning
Last synced: 03 Feb 2026
https://github.com/epsoft/dataset-generator
dataset generator
data dataset dataset-generation matplotlib matplotlib-figures tensorflow tensorflow-datasets
Last synced: 18 May 2026
https://github.com/cpanse/tartare
raw file collection recorded on Thermo Fisher Scientific mass spectrometers for extented unit testing
bioconductor blob data r unittesting
Last synced: 03 Apr 2025
https://github.com/andrianllmm/wika-data
Philippine language resources.
data language low-resource-languages parser philippines scraper
Last synced: 17 Jul 2025
https://github.com/owengombas/genyus
🐍 Lyrics analysis with genius.com, Python and Jupyter Notebooks
api data data-science genius jupyter-notebook lyrics python statistics
Last synced: 20 May 2026
https://github.com/yash22222/tsf-grip-tasks
The Sparks Foundation Data Science & Business Analytics Internship Tasks
buisness-intelligence business-analytics data data-science data-science-projects data-structures grip gripjune23 internship internship-task machine-learning projects python simple-linear-regression the-sparks-foundation tsf
Last synced: 27 Apr 2026
https://github.com/finnspartronics/orpheus
A took for looking at FRC (First Robotics Competition) scouting data
data first-robotics-competition scouting scouting-data spartronics
Last synced: 28 Mar 2025
https://github.com/parzibyte/cifrar-descifrar-php
Cifrar y descifrar datos con PHP usando la librería php-encryption; cifrar con clave general o con claves generadas por contraseñas de usuarios
crypto data decrypt encryption password php security
Last synced: 20 May 2026
https://github.com/sstendahl/giscan
Simple tool to read and analyze existing GISAXS data
cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray
Last synced: 30 Jun 2026
https://github.com/davedupplaw/jquery.bargraph
Moving, sliding bargraph display for jQuery
barchart bargraphs data javascript javascript-library jquery jquery-library jquery-plugin jquery-widgets realtime scrolling visualization
Last synced: 17 May 2026
https://github.com/seafloor-geodesy/gnatss-test-data
Repository to host test data for GNATSS software
Last synced: 06 Apr 2026
https://github.com/kerlossony/nested-formdata
Nested-FormData is a Function designed to handle nested form data structures in a simplified and efficient way. It helps in managing complex form data, making it easier to work with forms that require hierarchical data
data forms javascript nested-structures nextjs reactjs typescript
Last synced: 08 Mar 2026
https://github.com/makosai/covid19datachart
A basic chart for checking corona data. Written in a single HTML file for convenience. Grab the single file and run it anywhere. Or visit the webpage.
chart chartjs corona coronavirus coronavirus-analysis covid-19 covid-2019 covid19 covid19-data data data-analysis datasets
Last synced: 23 Feb 2026