data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/chenxingqiang/modeling_tabular_data
# modeling_tabular_data | Keywords: modeling_tabular_data focusing on modeling_tabular_data.
Last synced: 30 Jan 2026
https://github.com/robwiederstein/covid-19-ky
Monitor US covid-19 cases w/ Johns Hopkins data
data data-visualization leaflet plotly r shell
Last synced: 02 May 2026
https://github.com/rosacarla/databases
Bases de dados utilizados em atividades práticas do MBA Data Analytics do IGTI.
Last synced: 19 Mar 2026
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/vedikasnehil/my-data-science-projects
This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.
data data-science deep-learning machine-learning matplotlib numpy python sql visualization
Last synced: 10 Apr 2026
https://github.com/saritaphd/predicting-performance-of-students---complete-ml-project-with-deployment-using-aws
Student performance analysis with deployment (End to end ML project)
aws data data-science deployment jupyter-notebook machine-learning python visualization
Last synced: 10 Apr 2026
https://github.com/bearaujus/bdatamatrix
Structured Tabular Data Management in Go
Last synced: 30 Jan 2026
https://github.com/srindot/average_flightdata_collection_fwuaav
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 18 Aug 2025
https://github.com/abendayan/orm
Lightweight orm
cli dao data database database-management javascript mysql node node-js nodejs orm ormius ormius-cli schema
Last synced: 25 Feb 2026
https://github.com/amethyst-php/project
amethyst amethyst-package api data laravel project
Last synced: 15 Apr 2026
https://github.com/lut-ful/pizza-sales-report
This Pizza Sales Report provides valuable insights into sales performance through detailed analysis and visualizations. By leveraging Power BI and SQL Server
data data-wrangling microsoft-sql-server power-bi power-bi-dax python
Last synced: 30 Jan 2026
https://github.com/chompfoods/stub-scala-akka-http-server
Scala Akka HTTP server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
akka api branded chomp data database food grocery ingredients raw recipe-api recipes scala server stub stub-server
Last synced: 15 Apr 2026
https://github.com/sirmaxx/log_manager
log manager services for microservices
data fastapi logging microservice mongodb
Last synced: 09 Apr 2026
https://github.com/nia-cloud-official/influx-agents
Influx-CRD is a web application designed to facilitate data collection, recovery, and distribution for agents uploading data to a centralized database. It provides an intuitive interface for managing data collection from various sources, recovering lost or corrupted data.
broker collection data data- influx influx-agent
Last synced: 30 Jul 2025
https://github.com/brianlesko/postresql-docker
Run a postgreSQL server hosted in a docker container, and start a webUI for basic querying
basics container containerization containers data data-science docker postgres postgresql sql template
Last synced: 31 Jan 2026
https://github.com/greedchikara/dsajs
Data Structures and Algorithms written in Javascript
Last synced: 09 Apr 2026
https://github.com/e-kotov/albofr-data-archive
Tiger Mosquito Colonisation in France data
aedes-albopictus colonisation data france tiger-mosquito
Last synced: 23 May 2026
https://github.com/molinsagustin/cinedata
# CineData Trabajo práctico grupal para la materia Ingeniería de Datos I en la Universidad Argentina de la Empresa. El mismo consistió en el desarrollo de una base de datos relacional en Microsoft SQL Server Managment Studio utilizando metodología Ágil SCRUM, que se utilizó desde el relevamiento de requisitos hasta la implementación final.
agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver
Last synced: 28 Feb 2026
https://github.com/guardias-eu/reasin
Interface to the European Alien Species Information Network API
api biodiversity biodiversity-data biodiversity-informatics data invasive-species oscibio r r-package
Last synced: 04 Oct 2025
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/sulujulianto/population-data-retrieval-and-analysis
I created a simple program that can be used to search for global population data or population data from various countries using Python.
Last synced: 09 Mar 2026
https://github.com/sunnahboy/checkfake_true_news
Building data structures using Linked lists and arrays and find best algorithms for implementing a system for detecting Fake News
algorithms data level low programming structure
Last synced: 28 Feb 2026
https://github.com/tgorka/amplify-datastore-rxjs
RxJs Subjects to work with AWS Amplify and Amplify Datastore.
amplify amplifydatastore angular aws awsamplify data datastore fetch graphql graphql-client ionic rxjs scroll typescript
Last synced: 14 Feb 2026
https://github.com/rijkvanzanten/ds-fa-1
The first final assignment for the data structures class
assignment data final map now parsons structures thenewschool
Last synced: 04 Oct 2025
https://github.com/ekoepplin/dbt-bigquery-core
How to get data to BigQuery (or duckDB) and setup dbt tests for SODA cloud monitoring
bigquery data data-quality dbt dlt duckdb gcp soda
Last synced: 06 May 2026
https://github.com/vedantwalia/mymusicvisualisationproject
data datavisualisation json jupyter-notebook pandas python xml xml-parser
Last synced: 09 Apr 2026
https://github.com/farhad2415/Job_Scraper
Job Site Based Job Scrapping with python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 15 Aug 2025
https://github.com/0xhericles/spamdetector
:email: A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 02 May 2026
https://github.com/twilighty-abhi/locust-data-visualiser
Locust Data Visualiser
Last synced: 15 Aug 2025
https://github.com/rbreeze/dashboard
My personal health dashboard, with daily stats on food and sleep. Undergone several redesigns since 2015.
css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization
Last synced: 02 May 2026
https://github.com/seqeralabs/ffq-api
A minimal wrapper to make ffq searches available via a REST API.
api data fastq fetch-fastq ffq genomics
Last synced: 15 Aug 2025
https://github.com/supremkc05/global-job-market-analytics
Scrape jobs from websites like Indeed/LinkedIn, extract skills using NLP, then visualize hiring trends.
beautifulsoup data machine-learning nlp pandas scrapping
Last synced: 14 Aug 2025
https://github.com/gcoronelc/ucv_gdi-1_202302-a2
Taller de Gestión de Datos e Información I con Gustavo Coronel.
data data-science database databases machine-learning machinelearning oracle sql sql-server
Last synced: 02 May 2026
https://github.com/leoBitto/CloudForge
Data foundry
airflow data data-engineering django docker docker-compose grafana postgresql prometheus
Last synced: 14 Aug 2025
https://github.com/itsachrafmansari/moroccan-real-estate-analysis
Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.
api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping
Last synced: 13 Aug 2025
https://github.com/mtwn105/phonepe-pulse-plus
An API on top of PhonePe Pulse Data APIs
cors data data-science express finance hacktoberfest heroku javascript nodejs phonepe pulse
Last synced: 09 Apr 2026
https://github.com/xljones/bugsnag-exporter
Export Bugsnag project, error, and event data easily from a command line call which automatically handles pagination, and API backoffs
bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh
Last synced: 06 May 2026
https://github.com/dawidolko/datafusion-app-python
Project as part of the Data Warehousing subject.
academic-project data dataprocessing extraction gui loading project pysimplegui python transformation
Last synced: 15 Feb 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/jleung51/foundations-dags
Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.
data data-engineering etl extract housing load pipeline transform
Last synced: 04 Oct 2025
https://github.com/mochsyahrizal/jkfkjabar_studycase
First Data Analytics Study Case
Last synced: 15 Feb 2026
https://github.com/bocchilorenzo/hugginginfo
Unofficial library to retrieve information from the HuggingFace website.
Last synced: 03 Apr 2026
https://github.com/panodata/tikray
A compact data transformation engine.
data data-transformation data-transformation-pipeline data-transformer jmes jmespath jq jqlang json json-pointer json-transform json-transformation json-translate json-translator transformation transon
Last synced: 04 Oct 2025
https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction
This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.
analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/neptun-software/neptun.data.generators
Send scraped data from neptun-scraper to CHATGPT to generate training data for NEPTUN.AI.
Last synced: 30 Jul 2025
https://github.com/arnocan/yapydata
The yapydata provides miscellaneous low-level Python data access APIs.
data datastructures ini json properties python python2 python3 xml yaml
Last synced: 16 Feb 2026
https://github.com/hafs96/prediction_consommation-de-carburant
Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/adri6336/payvis-android
An app that enables people working by the hour to keep track of how much they've earned.
android android-application app clock data data-visualization database finances financial-data json money money-management monitoring paycheck-records productivity records records-management time-worked work worktime
Last synced: 09 Apr 2026
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/jacopodl/jcollections
Common data structures for the C language
c collections data data-structures jcollections
Last synced: 30 Jul 2025
https://github.com/kadirlofca/unity-csvmaker
Quick and easy way to create and export .csv files from Unity.
Last synced: 09 Apr 2026
https://github.com/ddofer/ddofer.github.io
Dan's Blog
blog cv data data-science machine-learning
Last synced: 12 Aug 2025
https://github.com/corneliustanui/personal_blogdown_website
This repo contains source files for my personal Blogdown-based website.
analyis analytics blog blogdown blogdown-sites data data-science hugo hugo-theme netlify personal-website rbind statistics web website
Last synced: 13 Feb 2026
https://github.com/mubashirsidiki/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
analytics azure big-data data dataengineering devops pipeline
Last synced: 02 May 2026
https://github.com/radekbednarik/covid-czech-data-api
Library to make it easy to work with REST API of official Czech Covid data.
api covid-19 data deno library typescript
Last synced: 02 May 2026
https://github.com/amethyst-php/cycle
amethyst amethyst-package api cycle data laravel
Last synced: 17 May 2026
https://github.com/s1dewalker/electric-future
Visual Analysis: Future of Automotive Industry
data data-visualization machine-learning python3 regression-analysis tableau
Last synced: 02 May 2026
https://github.com/reshmaaiman/liver-patient-prediction
Liver Disease Prediction
data data-science data-visualization dataanalysis jupyter-notebook numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/soenneker/soenneker.attributes.mapto
A C# attribute for generic data mapping translation
attributes columns csharp data datatables dotnet mapping mapto maptoattribute object
Last synced: 02 Mar 2026
https://github.com/vidupriya/aws-glue--data-copy
The function for copying data like CSV, Parquet, avro etc., from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket
aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark
Last synced: 02 May 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/jesuscc1993/data-cleaner-extension
Clears browser data in a single click.
application-data chrome chrome-extension data
Last synced: 02 May 2026
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 18 May 2026
https://github.com/mcraiha/datagensharp
C# managed library for generating data
Last synced: 11 Aug 2025
https://github.com/anuppm9917/data-processing-and-csv-to-json-using-python-project
This project guides you through processing data from CSV to JSON format using Python. You'll learn to cleanse, validate, and transform data with pandas, numpy, csv, and json libraries, ensuring it's ready for POS system integration. This will help improve data integrity and streamline integration.
csv-files data data-analysis data-cleaning data-collection data-transformation data-validation python3 transformation
Last synced: 16 Apr 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/0xhericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 09 Feb 2026
https://github.com/hupili/djworkshop-cuc2018
data data-journalism data-visualization
Last synced: 27 Mar 2026
https://github.com/lab5e/loadabledata
Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.
async data loadable state typescript ui
Last synced: 07 May 2026
https://github.com/coderjolly/spotify-api-data-analysis
The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.
airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql
Last synced: 27 Mar 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/wyattowalsh/proxywhirl
rotating proxy system
data data-extraction dataextraction proxy proxy-checker proxy-list proxy-scraper proxy-server proxypool python python3 rotating-proxy sqlite sqlite3 web-data-extraction
Last synced: 03 Mar 2026
https://github.com/ahmad-ali-rafique/heart-disease-detection-model
A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.
artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model
Last synced: 11 Aug 2025
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/chocoscoding/fakeapi
A fake API with nice functionalities for testing
api data express fetch fetch-api frontend javascript js json json-api json-server nodejs testing typescript
Last synced: 09 Apr 2026
https://github.com/srindot/fwuav-average-flight-data-collection
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 10 Aug 2025
https://github.com/viniddev/active_finance
Nesse projeto busquei solucionar um problema corriqueiro que é a dificuldade de se manter atualizado sobre as variações do mercado de ações e fundos imobiliários. Usei selenium webdriver para buscar informações e uma API do Telegram para enviar relatórios para o usuário
automation data data-analisis rpa selenium-webdriver telegram-bot
Last synced: 03 May 2026
https://github.com/ineelhere/langchain-chat-with-your-data
LangChain Chat with Your Data course from DeepLearning.AI and LangChain
chatapplication chatgpt data deeplearning-ai deeplearning-notebooks jupyter-notebooks langchain langchain-python openai-api opensource personalised-learning python3
Last synced: 16 Apr 2026
https://github.com/prakashpandey16/sql_data_warehouse_project
Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.
cleaning-data data data-engineering data-science database etl-pipeline sqlserver
Last synced: 03 May 2026
https://github.com/ometman/vet-clinic
This is a database project for vetinary data management for animals, owners, clinic employees and visits; and applicable to any data management need. It uses Postgresql, a relational database management system. It allows storing, updating and querying.
data database normalization postgresql postgresql-database queries sql sql-server-database tables transactions
Last synced: 13 May 2026
https://github.com/colesmcintosh/colesmcintosh.github.io
My portfolio site :)
ai automation data llms open-source
Last synced: 04 Mar 2026
https://github.com/antoineaugusti/youtubers-tips
Collecting data about tips given to Youtubers
data economy youtube youtubers
Last synced: 03 May 2026
https://github.com/amethyst-php/price-rule
amethyst amethyst-package api data laravel price price-rule rule
Last synced: 03 May 2026
https://github.com/fabsdevx/files-to-database-loader-handout
Data Engineering project for learning purposes. Credits to itversity
csv data data-engineering database json pandas python
Last synced: 09 Apr 2026
https://github.com/jillmpla/kaggle_notebooks
Kaggle-based data analysis, data science, and data visualization.
data data-science data-visualization kaggle machine-learning
Last synced: 16 Apr 2026
https://github.com/0xkibh/datamining-algo
This repository consist data mining algorithm implementation example in python
apriori-algorithm data datamining fp-growth python
Last synced: 19 May 2026
https://github.com/chubek/pyramid-dashboard
A Dashboard to Show Data Made Using Plotly Dash
dash data docker ml plotly plotly-dash python
Last synced: 19 May 2026
https://github.com/iv4n-ga6l/functional-dataprocessing-pipeline
A functional data processing pipeline that accepts an input file, allows specifying both input and output formats, applies specified transformations, and produces a resulting output file.
csv data datapreprocessing excel json pandas parquet pipeline python
Last synced: 06 May 2026
https://github.com/ashamethedestroyer/data-structures
Dedication of all Data Structures Creation 🛠
cpp data data-structures implementation implementation-of-data-structures structure structured-data
Last synced: 23 May 2026