data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model
With this Python notebook algorithm, you can use SPSS Model notebook to build machine learning pipelines that you can use to iterate rapidly during the model building process in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research that's easily understood by any member of your team with Hypothesis definition.
anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test
Last synced: 22 Apr 2026
https://github.com/rbcavi/factorio-mod-data
The modpacke data for factorio-viewer
data factorio factorio-data factorio-mod-data
Last synced: 23 Apr 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/elcarrillo/structpy
StructPy is a Python-based command-line tool designed for academics and scientists to manage data projects effectively. It simplifies workflows by creating structured project directories, generating timestamped filenames, validating datasets, and backing up projects seamlessly.
command-line-tool data database file-structure organization python science-tool
Last synced: 24 Apr 2026
https://github.com/coryson/osm-mla-finder
Python script to locate institutions employing Medical Laboratory Assistants in Germany, developed for BTZ – Berufliche Bildung Köln GmbH. It uses OpenStreetMap, SerpAPI, and web scraping to find and verify relevant labs, clinics, and diagnostic centers.
beautifulsoup data openstreetmap osm python scraping serpapi webscraping
Last synced: 24 Apr 2026
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/rubix982/product-quality-classification
This is an implementation for the CIKM AnalytiCup 2017, around the topic of "Product Title Quality". The goal is to take SKUs and rank its title's clarity and conciseness. Referenced papers are attached to this repository. And as such, the aim is to craft ensemble models that either try to replicate results or find new methods for classification.
data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp
Last synced: 25 Apr 2026
https://github.com/xjwllmsx/hacker-news-engagement
Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.
data data-analysis jupyter python
Last synced: 25 Apr 2026
https://github.com/mlkav/tri-hita-karana
Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.
bali data data-science g20 science
Last synced: 06 Jun 2026
https://github.com/shwetajanwekar/prediction-with-regression
prediction with regression for salary_hike and delivery time dataset
data data-science datset exploratory-data-analysis matplotlib pandas plot prediction r2-score seaborn sns
Last synced: 25 Apr 2026
https://github.com/f-ssemwanga/pandas-numpy-repo
This repo has extensive work I have done on Pandas and NumPy Modules during the advanced programming Module
cleaning-data-in-python data numpy-arrays pandas visualization
Last synced: 27 Apr 2026
https://github.com/tsbarr/citi-bikes-challenge
Citibikes NYC Data Analysis: Uncover insights from over a decade of ride data. Jupyter notebook for data aggregation/cleaning & Tableau dashboards for interactive visualization.
data data-visualization pandas-python python tableau
Last synced: 27 Apr 2026
https://github.com/fatihemres/africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 27 Apr 2026
https://github.com/demkeys/lazydatatransfer
Lazy method to transfer upto 64kb of data over the network using UDP
data data-trans network python transfer udp
Last synced: 07 Jun 2026
https://github.com/amethyst-php/subscription
amethyst amethyst-package api data laravel subscription
Last synced: 27 Apr 2026
https://github.com/vatshayan/b.tech-project-cancer-predication-system
Cancer Prediction System Project Developed through a Machine learning approach.
btech btechfinalyear cancer collegeproject csv data data-science data-structures datas datasets final-project finalyear india machinelearning project python python-3
Last synced: 07 Jun 2026
https://github.com/schenkd/tweetminer
Data Miner for Twitter Streaming API
data dataminer datamining java twitter twitter-api twitter4j
Last synced: 07 Jun 2026
https://github.com/santiagoenriquega/custom_database
Python-based database library for database management, indexing, transactions, and constraints, showcasing foundational database concepts.
data data-engineering database database-design python
Last synced: 27 Apr 2026
https://github.com/chompfoods/stub-inflector
Inflector server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery inflector ingredients nutrition raw recipe-api recipes server stub stub-inflector stub-server
Last synced: 27 Apr 2026
https://github.com/drkane/area-profiles
Produce UK area profiles based on various data sources
dash-plotly data flask statistics uk
Last synced: 27 Apr 2026
https://github.com/gngdb/llamass
LLAMASS is an arbitrary collection of tools I've put together to deal with motion data
Last synced: 28 Apr 2026
https://github.com/oguzhanfatihkucuk/data-analytics-project-kafka-spark
The data in this project was collected in a database using Apache Kafka and processed with Apache Spark Streaming. The project aims to create a forecasting model and analyze sales forecasts per customer.
big-data data data-visualization hadoop kafka ml mlpipeline plt pyhton spark
Last synced: 28 Apr 2026
https://github.com/leonardomusini/mbe-growth-nexus-converter
Python tool to convert laboratory text files into NeXus files for Molecular Beam Epitaxy (MBE) data.
data data-engineering nexus python
Last synced: 28 Apr 2026
https://github.com/hoijui/osh-dir-std
Open Source Hardware directory standard(s)
data fchh interfacer-project-eu interfacer-project-eu-wp4-3 oseg specification standard
Last synced: 28 Apr 2026
https://github.com/delonnewman/relational
Relational programming for Ruby
csv csv-import data data-analysis database export json relational relational-algebra relational-database relational-model relational-programming reporting reports ruby yaml
Last synced: 28 Apr 2026
https://github.com/kingsley-ezenwaka/medical-data-visualizer
A data analysis project that investigates a dataset of anonymous patients' medical information, and explores the relationship between cardiac disease, body measurements, blood markers, and lifestyle choices.
analysis data matplotlib numpy pandas seaborn
Last synced: 28 Apr 2026
https://github.com/epomatti/az-e2e-data-eng-proj
Data engineering with Azure services
azure data data-engineering databricks datafactory datalake lake synapse terraform
Last synced: 28 Apr 2026
https://github.com/codegeekr/test_datasciencestarter
test Data Science Starter
analytics data data-science data-visualization machine-learning python science starter-kit statistics test
Last synced: 28 Apr 2026
https://github.com/n-ce/localstorage-data-interchange-manager
Implementation of local storage data interchange using map data structure.
data export import javascript js-maps json localstorage
Last synced: 28 Apr 2026
https://github.com/moderrek/periodic-table
Periodic Table with clickable elements to see details.
chemical chemistry data element elements generator html javascipt javascript json periodic-table pure-javascript table vanilla-html vanilla-javascript
Last synced: 28 Apr 2026
https://github.com/sgbasaraner/cs50
my cs50 solutions
algorithms c cs50 cs50x data harvard python structures
Last synced: 29 Apr 2026
https://github.com/howz1t/ptypes
This package provides useful data types for use in PHP.
badges composer computer-science data data-structures data-types packagist php types
Last synced: 29 Apr 2026
https://github.com/mtalhaofc/nutrition_system
A simple AI-powered web app built using Streamlit that provides personalized weekly meal plans and nutrition recommendations based on user demographics, health goals, and nutritional preferences.
cosine-similarity data data-science food machine-learning model nutrition pandas python streamlit
Last synced: 29 Apr 2026
https://github.com/iammahesh123/spring-annotations-demo
This project serves as a demonstration of various annotations used in the Spring Framework.
autowire bean component configuration controller data document postmapping repository requestmapping scope service spring
Last synced: 29 Apr 2026
https://github.com/sn0wfree/factor_table
an universal connector for all kind data source and manage all kind data as factor type by one package
connector data database factor
Last synced: 29 Apr 2026
https://github.com/shoaib1522/data-aggregator-tool-in-python
This all are the illustration of the things used in " Data Aggregation Tool " as a scenario of Data Science Engineer written in Document(PDF)
data data-science dataaggregation lists python-script python3 sets-python tuples
Last synced: 29 Apr 2026
https://github.com/barkintopcu/apple-stock-prediction-edu
The purpose of this project is to demonstrate time series analysis techniques using real-world stock data, without offering any form of financial advice or investment suggestion.
data deep-learning forecasting machine-learning python
Last synced: 29 Apr 2026
https://github.com/chandansoren/financial-budget-analysis
Financial budget for 2021
Last synced: 29 Apr 2026
https://github.com/koltyakov/pgcopy
🐘 PostgreSQL data migration tool
cli data database golang migration postgresql sync
Last synced: 29 Apr 2026
https://github.com/ozgrozer/electron-store-data
A Node.js module to store Electron data in the computer
Last synced: 29 Apr 2026
https://github.com/ipstack/wizard
Wizard for create ipstack databases
composer data geo geoip id-database info ip ipstack ipstack-wizard php wizard
Last synced: 29 Apr 2026
https://github.com/devcsrj/docparsr-jvm
JVM client for https://github.com/axa-group/Parsr
data document extraction nlp ocr pdf
Last synced: 08 Jun 2026
https://github.com/wireservice/workbench-lookup
A port of `agate-lookup` to Workbench
data journalism lookup workbench
Last synced: 08 Jun 2026
https://github.com/patrickdavies100/pipeline38
An application to automate the creation and execution of SQL queries.
data pandas-dataframe pipeline postgresql psycopg2 sqlalchemy
Last synced: 30 Apr 2026
https://github.com/abhinav330/instagram-influencers-analysis
This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.
data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/lamouchi-bayrem/data-matrix-scanner
A dual-interface tool that leverages AI to **detect and decode QR codes and Data Matrix codes** from images using computer vision
data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp
Last synced: 30 Apr 2026
https://github.com/dhruvsrikanth/superconductor-regression-kaggle-challenge
Kaggle challenge based on superconductor dataset.
data data-science jupyter-notebook kaggle kaggle-challenge kaggle-competition lasso-regression linear-regression machine-learning python random-forest regression sklearn support-vector-regression
Last synced: 30 Apr 2026
https://github.com/onekiloparsec/arcsecond-swift
The swift client for interacting with the server-side RESTful resources of arcsecond.io.
arcsecond astro-library astronomy data django swift swift-3
Last synced: 30 Apr 2026
https://github.com/priyam-hub/covid-19-data-analysis
Explore COVID19 case numbers and deaths related to Coronavirus outbreak 2019/2020 in Pandas and in Jupyter notebook
analysis data data-visualization jupyter-notebook machine-learning python
Last synced: 08 Jun 2026
https://github.com/mmaithani/kaggle-projects
Collection of all the resources from competition, kernal And data section also all the magic code i have been using to get most of out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/raphcodec/rand-org-generator
Rand-Org-Generator attempts mimic real company structures. The dummy data generated by this project is intended to be used in analytics projects or web projects.
data duckdb factory-boy faker org-chart polars python3
Last synced: 30 Apr 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/dnut/json-match-finder
Python application used to match listings against openings via authenticated JSON API access.
data data-structures data-wrangling database json-api python-application python-modules
Last synced: 01 May 2026
https://github.com/benmizrahi/reactivejs
microservices event bus for async/sync communications
Last synced: 01 May 2026
https://github.com/syedzaheerabbas/jamboree-education-linear-regression
Using data from Jamboree, this project explores the relationship between applicant profiles (GRE, TOEFL, GPA, etc.) and their chances of admission to Ivy League graduate programs. Linear regression, Ridge, and Lasso regression are employed to build predictive models and identify key factors.
data eda linear-regression python visualization
Last synced: 01 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/chompfoods/sdk-kotlin
Kotlin SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food foods grocery ingredients kotlin nutrition raw recipe-api recipes sdk sdk-kotlin
Last synced: 01 May 2026
https://github.com/fatihemres/fruits
Fruit Details app by SwiftUI. Using data, models, animation and practically onboarding usage.
animations data models onboarding swift swiftui
Last synced: 01 May 2026
https://github.com/gabrielf7/relogiohd
:watch: Relógio com Horário e Data
clock css data horario html javascript relogio relogio-hd relogio-javascript watch
Last synced: 01 May 2026
https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support
RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.
ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit
Last synced: 01 May 2026
https://github.com/nel-zi/climainsights
Developed an automated ETL pipeline using Apache Airflow and Python to collect, process, and store weather data from multiple cities via Weatherstack API. Implemented data cleaning, orchestration, and error handling to ensure accuracy and scalability.
airflow apache-spark data data-engineering engineering etl-pipeline
Last synced: 01 May 2026
https://github.com/sorairolake/japanese-era-dataset
日本の元号のデータセット / Dataset of the Japanese era
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/eshitakundu/disease-outbreak-predictor
Disease Outbreak Predictor: A Streamlit-based web application for predicting diabetes, heart disease, and Parkinson's disease using machine learning models.
data data-science disease-prediction healthcare-application jupyter-notebook machinelearning ml notebook prediction python streamlit streamlit-webapp
Last synced: 01 May 2026
https://github.com/gcoronelc/cepsuni-disbd-64505
Taller de Modelamiento de de Base de Datos con Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/lurenss/healthypandas
A library that takes row output from the export of the Iphone Health app and produce pandas dataframes.
Last synced: 02 May 2026
https://github.com/rbreeze/dashboard
My personal health dashboard, with daily stats on food and sleep. Undergone several redesigns since 2015.
css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization
Last synced: 02 May 2026
https://github.com/gcoronelc/ucv_gdi-1_202302-a2
Taller de Gestión de Datos e Información I con Gustavo Coronel.
data data-science database databases machine-learning machinelearning oracle sql sql-server
Last synced: 02 May 2026
https://github.com/hafs96/prediction_consommation-de-carburant
Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/radekbednarik/covid-czech-data-api
Library to make it easy to work with REST API of official Czech Covid data.
api covid-19 data deno library typescript
Last synced: 02 May 2026
https://github.com/jesuscc1993/data-cleaner-extension
Clears browser data in a single click.
application-data chrome chrome-extension data
Last synced: 02 May 2026
https://github.com/viniddev/active_finance
Nesse projeto busquei solucionar um problema corriqueiro que é a dificuldade de se manter atualizado sobre as variações do mercado de ações e fundos imobiliários. Usei selenium webdriver para buscar informações e uma API do Telegram para enviar relatórios para o usuário
automation data data-analisis rpa selenium-webdriver telegram-bot
Last synced: 03 May 2026
https://github.com/anyantudre/associate-data-scientist-track
Materials for the Associate Data Scientist in Python track on DataCamp.
data data-science experimental-design hypothesis-testing machine-learning matplotlib-pyplot pandas python regression sampling seaborn statistics statsmodels unsupervised-learning
Last synced: 03 May 2026
https://github.com/amethyst-php/price-rule
amethyst amethyst-package api data laravel price price-rule rule
Last synced: 03 May 2026
https://github.com/tn3w/moviedb-json
A JSON library with 981,530 films.
data database db json movie movie-database movies
Last synced: 03 May 2026
https://github.com/arnavk-09/phishing-detection
🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI
csv data fastapi flask python scikit-learn
Last synced: 03 May 2026
https://github.com/ghufranbarcha/company-account-analyzer
This project is a Streamlit application designed to visualize and analyze client data. It includes interactive features for exploring client-specific metrics, generating plots, and viewing distribution charts.
data data-science pandas streamlit visualization
Last synced: 03 May 2026
https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning
Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page
data data-visualization dataanalytics excel postgresql powerbi python3
Last synced: 03 May 2026
https://github.com/qrailibs/dataflow
✨ Data processing in Node.js made multithreaded and type-safe.
data dataprocessing multithread node
Last synced: 04 May 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/maxwelllzh/gis-tutorial-
Tutorials for Columbia University GIS Club
Last synced: 04 May 2026
https://github.com/rabeal21/tea
Generate random TEA wallet addresses in bulk with this simple utility. Perfect for testing and exploring the TEA blockchain. 🌱💻
bucklescript bucklescript-tea chinese-translation cli data earlgrey educators hacking ios-automation ios-test ocaml peer-evaluations php red-team teachyourselfcs test-framework translation tui
Last synced: 04 May 2026
https://github.com/dimitryzub/russo-ukraine-war-prediction-losses
Highlights rusian losses with predictions based on historic data from Ministry Defence of Ukraine 🐱👤
data dataanalysis dataanalytics matplotlib pandas prophet python
Last synced: 04 May 2026
https://github.com/sjg/my-search-story
My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI
data docker generative-ai google-cloud-platform google-cloud-run nodejs
Last synced: 04 May 2026
https://github.com/jdanielgoh/cobertura-campanias
En una democracia ¿caben todas las voces? Proyecto para visualizar el monitoreo de radio y TV que realiza el INE de las candidaturas presidenciales 2024
d3js data datavisualization vue
Last synced: 09 Jun 2026
https://github.com/gabya06/twitter_models
Repository used for twitter impression models
data data-science impressions machinelearning python ridge-regression sklearn twitter
Last synced: 04 May 2026
https://github.com/farhad2415/job_scraper
Job Site Based Job Scraping with python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 05 May 2026
https://github.com/kasunjayasanka/simple-backend-database-data-retrieval
Simple HTML form with inserting and retrieving data from Firebase Realtime Database
bootstrap css3 data firebase firebase-realtime-database html5 insert-data javascript retrieve-data
Last synced: 05 May 2026