data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/powersyang/visualization
data visualization templates 数据可视化模板
Last synced: 24 Mar 2025
https://github.com/lorenzobloise/client_satisfaction_classification
Jupyter notebook in which satisfaction from clients reviewing European hotels is analyzed using Python libraries such as pandas, numpy and scikit-learn. Various classification models are trained and tested to predict client satisfaction.
classification data data-mining jupyter jupyter-notebook machine-learning pandas python
Last synced: 21 Feb 2026
https://github.com/OneMoreDavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 28 Oct 2025
https://github.com/afnanenayet/kaggle-titanic
The classic Kaggle Titanic data science challenge
backprop backpropagation classification classifier data forest kaggle layer learn mlp multi numpy pandas perceptron random science scikit sklearn titanic
Last synced: 12 Apr 2026
https://github.com/etmendz/mendz.data
Provides tools and guidance for creating data access contexts and repositories.
context data datasettings entity-framework mendz paginginfo repository resultinfo
Last synced: 11 Jun 2025
https://github.com/emanoelcampos/power-bi-fundamentals
Datacamp's Power BI Fundamentals Skill Track
data data-analyst data-analyst-power-bi datacamp power-bi powerbi
Last synced: 24 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/woctezuma/hidden-gems-data
Data available to compute regional rankings of hidden gems.
data hidden-gems steam steam-reviews
Last synced: 06 Feb 2026
https://github.com/shibbbbs/fastapi_project
A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs
data dataanalysis fastapi pandas python
Last synced: 06 May 2026
https://github.com/filiprokita/foldertoiso
Python script that converts a specified folder into an ISO.
automation command-line-interface command-line-tool compression cross-platform data file-system folder-to-iso iso iso-image iso-tool python python-cli python-script python3 shutil utility
Last synced: 24 Mar 2025
https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication
StreetML leverages ML learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.
catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression
Last synced: 08 Apr 2025
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 28 Jan 2026
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/rishabhmathur06/data_analysis-netflix
data data-analytics data-science matplotlib-pyplot numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/purarue/scramble-history
parses rubiks cube scramble history/solve time from cstimer.net, cubers.io, twistytimer -- merges them together giving you uniform averages/data/graphs
cstimer cubing data rubiks-cube speedsolving
Last synced: 11 Jun 2025
https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
Last synced: 14 Jan 2026
https://github.com/abhibisht89/data-visualization
data matplotlib pandas ploty python visualization
Last synced: 06 May 2026
https://github.com/chrisabruce/scrapling-rs
Adaptive web scraping, built in Rust. A high-performance port of Python Scrapling.
ai ai-scraping automation crawler crawling crawling-rust data data-extraction mcp mcp-server playwright rust-lang scraping selectors stealth web-scraper web-scraping web-scraping-rust webscraping xpath
Last synced: 26 Jun 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/maxisoft/yahoo-finance-data-downloader
Automate downloading historical and recent stock data from Yahoo Finance.
data stock-market yahoo-finance
Last synced: 29 Jan 2026
https://github.com/steveanik/kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine
Last synced: 06 Jan 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/alextanhongpin/node-github-api
:page_with_curl: sample github api queries with nodejs for scraping purposes
Last synced: 06 May 2026
https://github.com/spatialcurrent/go-counter
Simple library and command line program for generating frequency distributions.
Last synced: 29 Jan 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/aimin-nur/data-analyst
Sebuah project Data Analyst (Mechine Learning) untuk melakukan analisa harga mobil bekas Ford berdasarkan dataset yang sudah ada, serta mengetahui apa saja feature atau kolom yang mempengaruhi harga mobil bekas Ford.
analytics data mechine-learing visualization
Last synced: 29 Jan 2026
https://github.com/apoorv74/njdg-stats
Tracking data from the National Judicial Data Grid's (NJDG) district courts portal
data git-scraping judiciary law
Last synced: 29 Jan 2026
https://github.com/irsol/udacity-data-foundations-nd
data data-analysis data-visualization exel sql udacity udacity-data udacity-nanodegree
Last synced: 05 Mar 2026
https://github.com/tpltnt/wir_vs_virus_hackathon_projects
A list of all projects / challenges for the WirVsVirus hackathon as CSV
coronavirus csv data hackathon raw-data
Last synced: 29 Jan 2026
https://github.com/thedragoncode/training-data-for-ai
Training data for the neural network
ai data flood meaningless neural-network neural-networks nn obscene politics spam toxic training
Last synced: 29 Jan 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/sushmashreeps/data-science-with-python
This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine
cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization
Last synced: 14 Apr 2026
https://github.com/dfsp-spirit/neuroimaging_testdata
Contains test data for unit tests, used in developing neuroimaging software. Ignore this. Licenses in the individual archives.
Last synced: 25 Feb 2026
https://github.com/chenxingqiang/modeling_tabular_data
# modeling_tabular_data | Keywords: modeling_tabular_data focusing on modeling_tabular_data.
Last synced: 30 Jan 2026
https://github.com/canadaluke888/ttb2
TerminalTableBuilder 2
c17 csv data database datasets datautils json ncurses ods spreadsheet sqlite3 tables terminal terminaltablebuilder terminaltablebuilder2 ttb ttb2 ttbx xlsx
Last synced: 10 Apr 2026
https://github.com/makcymal/silvera
My researches on ML and statistics, optimization methods, CS algoritms and numerical methods
algorithms data data-structures machine-learning numerical-methods statistics
Last synced: 01 Apr 2025
https://github.com/themost-framework/cache
MOST Web Framework Caching Module
Last synced: 12 Feb 2026
https://github.com/bearaujus/bdatamatrix
Structured Tabular Data Management in Go
Last synced: 30 Jan 2026
https://github.com/abendayan/orm
Lightweight orm
cli dao data database database-management javascript mysql node node-js nodejs orm ormius ormius-cli schema
Last synced: 25 Feb 2026
https://github.com/pchaparro/search-engine
Full stack search-engine created from youtube videos obtained using "web-scraping"
data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website
Last synced: 17 Apr 2026
https://github.com/chompfoods/stub-scala-akka-http-server
Scala Akka HTTP server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
akka api branded chomp data database food grocery ingredients raw recipe-api recipes scala server stub stub-server
Last synced: 15 Apr 2026
https://github.com/trollmii/bunnybase
An efficient data managing system
bunnybase data data-science data-structures database datascience python python3
Last synced: 22 Apr 2025
https://github.com/brianlesko/postresql-docker
Run a postgreSQL server hosted in a docker container, and start a webUI for basic querying
basics container containerization containers data data-science docker postgres postgresql sql template
Last synced: 31 Jan 2026
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/tadiusfrank2001/pythonprojects
Compilation of Some Fun Introduction to Python Lab Coding Projects introducing the foundamentals of data science, databases, and pythonlibraries
data data-science databases gamedesign python pythonlibrarires sorting-algorithms sqlite string-manipulation
Last synced: 06 May 2026
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/takshshah-16/library_management
Library Management using complex SQL
business-analytics data data-analytics joins stored-procedures
Last synced: 31 Jan 2026
https://github.com/denisecase/dc-texter
Send a text message using Python
alerts data python sms-messages streaming
Last synced: 08 Feb 2026
https://github.com/opendatach/alds
a colaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office
awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata
Last synced: 31 Jan 2026
https://github.com/azmag/spm-dashboard
System Performance Measures are a selection of criteria used by Department of Housing and Urban Development (HUD) to evaluate how local Continua of Care are performing.
Last synced: 31 Jan 2026
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/ludwing-mj/manipulacion_ej
Ejercicio utilizado en la seccion numero ocho del manual para ejemplificar las herramientas proporcionadas por el tydyverse para la manipulacion de datos.
data manipulate-data package r
Last synced: 01 Apr 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/darkogamerz/dhis2heat
A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/rorovic/rorovic.github.io
my github blog
code data datawarehouse devops realtime
Last synced: 01 Feb 2026
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/nits2612/data-science-projects
Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.
data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning
Last synced: 01 Feb 2026
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/aniruddha-biswas/shield-insurance-business-insights
Shield Insurance Business Insights
data data-visualization dataanalysis excel mysql powerbi sql
Last synced: 01 Apr 2025
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/ralzz/dibimbing_datascience
This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.
data data-science eda google-colab kaggle pandas python
Last synced: 06 May 2026
https://github.com/anuraganalog/twitter-data-analysis
My internship work during the 2020 summer
analysis data eda exploratory-data-analysis jupyter-notebook nlp spotle textblob twitter wordcloud
Last synced: 20 May 2026
https://github.com/ms140569/loki-example-store
Testdata for loki password manager
Last synced: 26 Feb 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/beriberikix/senml-zephyr
A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr
codec data iot senml sensor zephyr-rtos
Last synced: 24 Mar 2025
https://github.com/mehmetkahya0/earthquake-tracker
Earthquake Tracker, A real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.
ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web
Last synced: 15 Apr 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/jigyasag18/movie-recommendation-system-project
This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.
data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp
Last synced: 28 May 2026
https://github.com/justinhennis1/hackathon24
Hofstra's Hacknology Competition 2024 - Team Null Pointers
data data-analysis data-science data-visualization data-visualization-python dataanalysis dataanalytics traveling web webapplication
Last synced: 21 Aug 2025
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/michaelfromyeg/lyrics
Lyric-store and API hosted on Git.
Last synced: 08 Feb 2026
https://github.com/giscience/measures-rest-oshdb-app
A frontend for providing measures for geospatial datasets, using the OSHDB
data dggs geospatial measure openstreetmap rest
Last synced: 20 Apr 2026
https://github.com/juanpablodiaz/beertv
A Next.js Full Stack app to displays funny Beer TV Ads
api-routes data next tailwindcss
Last synced: 07 May 2026
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/matt-dray/draytasets
:1234::disguised_face: Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/sauravsrivastav/githubreposearcher
GitHub Repo Searcher 🔍 is a Streamlit web application designed to help you search for GitHub repositories based on a query and view the results in a tabular format. You can also download the results in CSV or Excel format for further analysis. 📊📈
data data-export excel github-api python repository-searcher streamlit webapp
Last synced: 20 Jan 2026
https://github.com/manishjanky/wrangle-weratedogs-dataset
A data wrangling project done ad part of Udacity DAND
data data-wrangling twitter udacity udacity-data-analyst-nanodegree udacity-nanodegree weratedogs
Last synced: 15 Apr 2026
https://github.com/cpietsch/breitband
developer repo of breitband-berlin
d3js data threejs visualization
Last synced: 02 May 2026
https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021
In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.
data data-analysis data-science data-visualization
Last synced: 23 Mar 2025
https://github.com/metapsy-project/data-panic-psyctr
Database of psychotherapy for panic disorder compared to control conditions
Last synced: 18 Mar 2026