data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial
Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI
dagster data data-orchestration kedro luigi mageai prefect
Last synced: 18 Apr 2026
https://github.com/emanoelcampos/power-bi-fundamentals
Datacamp's Power BI Fundamentals Skill Track
data data-analyst data-analyst-power-bi datacamp power-bi powerbi
Last synced: 24 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 28 Jan 2026
https://github.com/ezeparziale/analisis-uso-bicicletas-caba
:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.
data data-science data-visualization
Last synced: 14 Mar 2025
https://github.com/buildinamsterdam/contentful-graphql
Contentful GraphQL connection
Last synced: 05 Jan 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/amethyst-php/recipe
amethyst amethyst-package api data laravel recipe
Last synced: 19 May 2026
https://github.com/spatialcurrent/go-pipe
go-pipe is a simple library for piping objects from iterators to writers.
big-data bigdata concurrency data
Last synced: 29 Jan 2026
https://github.com/spatialcurrent/go-counter
Simple library and command line program for generating frequency distributions.
Last synced: 29 Jan 2026
https://github.com/apoorv74/njdg-stats
Tracking data from the National Judicial Data Grid's (NJDG) district courts portal
data git-scraping judiciary law
Last synced: 29 Jan 2026
https://github.com/tpltnt/wir_vs_virus_hackathon_projects
A list of all projects / challenges for the WirVsVirus hackathon as CSV
coronavirus csv data hackathon raw-data
Last synced: 29 Jan 2026
https://github.com/chenxingqiang/modeling_tabular_data
# modeling_tabular_data | Keywords: modeling_tabular_data focusing on modeling_tabular_data.
Last synced: 30 Jan 2026
https://github.com/rosacarla/databases
Bases de dados utilizados em atividades práticas do MBA Data Analytics do IGTI.
Last synced: 19 Mar 2026
https://github.com/bearaujus/bdatamatrix
Structured Tabular Data Management in Go
Last synced: 30 Jan 2026
https://github.com/jhwa426/database
SQL, MSSQL, MongoDB Database
data data-warehouse data-wrangling database datamodeling entity-relationship-diagram normalization sql sqlite3 ssms
Last synced: 06 Apr 2025
https://github.com/opdev1004/totjs
Not totally new but a file format for managing human readable data in a file. JS version.
data data-storage data-store database database-management hacktoberfest hactoberfest-accepted nodejs
Last synced: 31 Jan 2026
https://github.com/nits2612/data-science-projects
Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.
data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning
Last synced: 01 Feb 2026
https://github.com/drostlab/biodbretrievr
Retrieve and efficiently index entire biological sequence databases
biological-data biological-sequences data databasestoring retrieval
Last synced: 26 Feb 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/shahules786/titanic-analysis
different analysis of titanic accident (data from kaggle)
Last synced: 26 Jun 2025
https://github.com/matt-dray/draytasets
:1234::disguised_face: Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/fnu-ankit/nyc_parking_violation
data dataengineering dbt githubactions python
Last synced: 16 Apr 2026
https://github.com/enescidem/twitter-topic-modeling
Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.
data data-science machine-learning nlp topic-modeling twitter x
Last synced: 10 Feb 2026
https://github.com/os-climate/data-requests
This repo is used to track issues related to new Data Requests
Last synced: 27 Feb 2026
https://github.com/encoreshao/data-science
Data analyze examples, using Jupyter notebook and Python!!!
data dataanalysis encore jupyter-notebook
Last synced: 29 Mar 2025
https://github.com/sweta-kaundilya/power-bi-learning-projects
This repository contains completed exercises while learning Power BI
data datavisualization dax powerbi powerquery
Last synced: 27 Feb 2026
https://github.com/anandanraju/power_bi_dashboard_projects
The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.
amazon dashboard data data-visualization healthcare powerbi project
Last synced: 11 Feb 2026
https://github.com/vianneymi/amplifai
Amplifai is a package that allows you to transform your raw unstructured text into structured data in a few lines of codes.
data data-mining extraction langchain llm pydantic
Last synced: 27 Feb 2026
https://github.com/pawamoy/keycut-data
Keyboard shortcuts data stored in YAML files
Last synced: 12 Feb 2026
https://github.com/foundationallm/.github
A platform accelerating delivery of secure, trustworthy enterprise copilots.
agent ai data enterprise generative-ai large-language-model llm ml tool
Last synced: 12 Feb 2026
https://github.com/sumaiyyaf/british-airline-dashboard
This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.
analysis dashboard data tableau visualization
Last synced: 13 Feb 2026
https://github.com/j0a0m4/olympics
Final Project for Data Engineering Accelerated LATAM
Last synced: 13 Feb 2026
https://github.com/imartinezl/madrid-challenge
Madrid Route Optimization Challenge 🚚♻️🚚
challenge city data optimization routing-algorithm traffic
Last synced: 28 Feb 2026
https://github.com/tgorka/amplify-datastore-rxjs
RxJs Subjects to work with AWS Amplify and Amplify Datastore.
amplify amplifydatastore angular aws awsamplify data datastore fetch graphql graphql-client ionic rxjs scroll typescript
Last synced: 14 Feb 2026
https://github.com/dawidolko/datafusion-app-python
Project as part of the Data Warehousing subject.
academic-project data dataprocessing extraction gui loading project pysimplegui python transformation
Last synced: 15 Feb 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/gourab337/karnataka-health-visualizer
Visualizer for Karnataka's district-wise healthcare info built using PHP
Last synced: 19 Mar 2026
https://github.com/reshmaaiman/liver-patient-prediction
Liver Disease Prediction
data data-science data-visualization dataanalysis jupyter-notebook numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/j2kun/terrorism-usa-post-9-11
A copy of the terror data published by NewAmerica
data politics terrorism transparency
Last synced: 02 Mar 2026
https://github.com/anuppm9917/data-processing-and-csv-to-json-using-python-project
This project guides you through processing data from CSV to JSON format using Python. You'll learn to cleanse, validate, and transform data with pandas, numpy, csv, and json libraries, ensuring it's ready for POS system integration. This will help improve data integrity and streamline integration.
csv-files data data-analysis data-cleaning data-collection data-transformation data-validation python3 transformation
Last synced: 16 Apr 2026
https://github.com/amethyst-php/consume-rule
amethyst amethyst-package api consume-rule data laravel
Last synced: 19 May 2026
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/ashakoen/bls-data-extract
This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.
Last synced: 01 Apr 2026
https://github.com/ksimicevic/discord-message-analyzer
Analyzing discord messages in Jupyter notebook
analysis data discord messages
Last synced: 16 Apr 2026
https://github.com/udhaya2823/microsoft---classifying-cybersecurity-incidents-with-machine_learning
🚨Microsoft: Classifying Cybersecurity Incidents with Machine Learning🔐 This project leverages the power of Machine Learning to classify cybersecurity incidents 🚨, improving the efficiency of Security Operation Centers (SOCs) at Microsoft. We train a model to predict incident grades, helping analysts prioritize threats with precision🎯.
classification data feature-engineering iqr-method machine-learning matplotlib model-evaluation modelselection predictive-modeling python sklearn
Last synced: 17 Apr 2026
https://github.com/sweta-kaundilya/911-calls-capstone-project
For this capstone project we will be analyzing some 911 call data from Kaggle.
data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/djdhairya/black-friday-sale
csv data data-analytics data-science data-visualization visualization
Last synced: 30 Oct 2025
https://github.com/jwszolek/accelerated-data-generator
Ultra-fast random data generator. It gives you an ability to generate almost 1M of rows in around second.
bash csv data data-generator generator shell
Last synced: 02 Apr 2026
https://github.com/opengeoshub/vdownload
A Powerful Geospatial Data Downloader
Last synced: 19 May 2026
https://github.com/squareslab/frameworkstudytranscripts
archived data human-study zackc
Last synced: 06 Mar 2026
https://github.com/ashfaqalizardariofficial/databasehelper
A C# database helper library to connect with the database server and perform actions insert, update, delete, select data and select multiple data from the database.
ashfaq-ali-zardari ashfaq-ali-zardari-official data database delete helper insert ms-sql-server multiple select-data server sql-server update
Last synced: 02 Apr 2026
https://github.com/nika2811/new-york-city-taxi-fare-prediction
About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff
data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost
Last synced: 06 Apr 2025
https://github.com/cnr-ibba/smarter-repository
SMARTER Data Repository
bootstrap5 data django repository smarter
Last synced: 03 Apr 2026
https://github.com/madhuresh2011/50-days-sql-challenge
Start a 50days-sql-challenge journey to SQL mastery and transform how we interact with data!
consistency data data-analytics database problem-solving query question-answering real-world-data sql
Last synced: 03 Jun 2026
https://github.com/shsiddhant/womens-wc
ML project to predict match outcomes for Women's Cricket World Cup 2025.
cricket-prediction data feature-engineering postgresql python
Last synced: 04 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/bhavanachitragar/layoff_analysis
This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.
data dataanalysis streamlit streamlit-webapp
Last synced: 18 Apr 2026
https://github.com/zurd46/zurdsynthdatagen
This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).
data data-structures dataset electron json jsonl nodejs openai synthetic
Last synced: 04 Apr 2026
https://github.com/rd-uk/rduk-data-pg
PostgreSQL Data Provider implementation for rduk-data
Last synced: 18 Apr 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/codbex/codbex-hestia-data-sample
Sample data for codbex-hestia
Last synced: 05 Apr 2026
https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly
Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool
blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly
Last synced: 18 Apr 2026
https://github.com/phelipe-sempreboni/certificates
Tutorial intended for information about my licenses and certificates acquired over time.
certificate certificates certification course data database datascience licences license-management marketing marketing-analytics python sql
Last synced: 16 May 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/crypt596-rubykz/metaai-data-explorer-scraping-tool
MetaAI data explorer tool
api-research automation data explorer html-parsing metaai playwright python rate-limiting scraping
Last synced: 20 Apr 2026
https://github.com/arda-guler/binmotion
Convert ANY data to a video file. Sister project of binGallery.
data data-visualization proof-of-concept video
Last synced: 04 Jun 2026
https://github.com/fastpix/android-data-kaltura
This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard
analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics
Last synced: 21 Apr 2026
https://github.com/critocrito/data-scores-map
Data scores in the UK web app.
algorithmic-decision-making data data-investigation data-scores investigation
Last synced: 21 Apr 2026
https://github.com/stefen-taime/llm-rag-mtl-public-hospital
Ce projet développe un modèle de type Retrieve-Augment-Generate (RAG) pour répondre aux questions en utilisant les données publiques des avis laissés sur Google pour des hôpitaux à Montréal
data google-reviews hopital hospital hub ia llm montreal open-source quebec rag
Last synced: 21 Apr 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/howwohmm/fetchgram
era-adjusted Instagram content intelligence — scrape any public profile, OCR every image, measure what actually works. free, local, no API keys.
analytics cli content-strategy data instagram ocr python scraper
Last synced: 06 Jun 2026
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/desininja/weather-data-etl-pipeline
ETL pipeline using Apache Airflow
apache-airflow aws cicd dags data data-engineering etl glue-job mwaa pyspark redshift
Last synced: 25 Apr 2026
https://github.com/mlkav/tri-hita-karana
Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.
bali data data-science g20 science
Last synced: 06 Jun 2026
https://github.com/shwetajanwekar/prediction-with-regression
prediction with regression for salary_hike and delivery time dataset
data data-science datset exploratory-data-analysis matplotlib pandas plot prediction r2-score seaborn sns
Last synced: 25 Apr 2026
https://github.com/anuraganalog/blog
Data Science Blog
anuraganalog blog data science
Last synced: 26 Apr 2026
https://github.com/sagarkhese40/prediction-with-binomial-logistic-regression
bank data excel logistic-regression python
Last synced: 26 Apr 2026
https://github.com/f-ssemwanga/pandas-numpy-repo
This repo has extensive work I have done on Pandas and NumPy Modules during the advanced programming Module
cleaning-data-in-python data numpy-arrays pandas visualization
Last synced: 27 Apr 2026
https://github.com/ioanzicu/batch_loading_one-to-many_data_model
Unesco Batch Loading One-to-Many Data using Django
Last synced: 27 Apr 2026
https://github.com/yuweaec/project-scidatapipeline
A comprehensive toolkit for processing, simulating, and analyzing scientific data, integrating Python, Fortran, and Jupyter notebooks for seamless workflows.
analysis data pipeline processing scientific simulation
Last synced: 27 Apr 2026
https://github.com/gurpreet0022/crop-fertilizers-recommendation-system-using-ml-
This repository is a part of AICTE - Shell Internship on 'Green Skills using AI technologies' Cycle 3.
data datapreprocessing datavisualization jupyter-notebook machine-learning python
Last synced: 27 Apr 2026