data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/eradical/analytics-unibody
Ansible role that sets up a farm of analytics collectors based on nginx
analytics ansible ansible-role big-data collectors data nginx
Last synced: 06 May 2026
https://github.com/montanaz0r/imdb-ratings-auto-inserter
A Python script that enables auto-inserting movie ratings into the IMDB profile.
data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping
Last synced: 07 May 2026
https://github.com/yash22222/sync-intern-s-ml-tasks
SYNC INTERN'S Machine Learning internship offers you the chance to enhance your skills through real-life example projects. This internship will increase your knowledge of data and algorithms so that you understand how a machine learns.
bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns
Last synced: 07 May 2026
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 May 2026
https://github.com/favarettorm/bd_universidade
BD_UNIVERSIDADE V01 - Fictional university database for teaching purposes
data database dataset mariadb mariadb-database mariadb-mysql mysql mysql-database scripts sql university
Last synced: 08 May 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materials for the course SQL Language 1 for Analysts and Data Scientists
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/n0nag0n/flee-intercom
For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.
again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year
Last synced: 09 May 2026
https://github.com/alechash/rndmzr
Randomizer is a random data generator.
data data-science random random-generation random-number-generators
Last synced: 10 Jun 2026
https://github.com/suryavamsi-p/conflict-nlp-topic-modeling-sentiment-analysis-using-llms
Extracts insights from 26K+ protest events using BERTopic, Top2Vec, and LLMs for real-world applications like crisis monitoring, policy research, and social unrest analysis.
all-mpnet-base-v2 bertopic conflict-data data data-science lda llama2 llms machine-learning mistral-7b nlp nltk protest-analysis pyldavis python3 top2vec topic-modeling transformers visualization
Last synced: 11 May 2026
https://github.com/scarblase/russian-military-losses-analysis
This repository provides an in-depth analysis of Russian equipment losses using PySpark and data visualization techniques.
data data-science data-visualization jyputer-notebook matplotlib pyspark python3 seaborn seaborn-plots ukraine ukraine-invasion
Last synced: 12 May 2026
https://github.com/itrauco/robots
ai, machine learning, and robots...
ai artificial-intelligence automation big-data cloud cloud-engineering data data-engineering data-science data-science-projects m machine machine-learning ml prompts robots
Last synced: 11 Jun 2026
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026
https://github.com/dmitriiweb/tr-data-getter
Tool to get market data from bitstamp.ne
Last synced: 14 May 2026
https://github.com/erwan-simon/aws-serverless-notebook-platform
A self-hosted, serverless platform offering an intuitive UI to manage, schedule, and execute Jupyter notebooks on AWS.
aws data docker notebook python serverless terraform webapp
Last synced: 13 Jun 2026
https://github.com/fairspec/fairspec-standard
Fairspec is a data exchange format compatible with DataCite for metadata and JSON Schema for structured data
ckan csv data dataset excel fair fairspec json ods polars python quality schema sqlite table typescript validation zenodo
Last synced: 16 Jun 2026
https://github.com/geo-y20/coursera-managment-system
ML and Data Science-based recommendation system
course coursera data data-science data-visualization datacleaning machine-learning mean-square-error recommendation-system
Last synced: 19 Jun 2026
https://github.com/bredalis/functions
Functions in Python 🐍
algorithms data functions porgraming programming-language python
Last synced: 19 Jun 2026
https://github.com/oneblack333/pizza_sales_analysis
The project involves transforming raw pizza sales data into actionable business intelligence through analysis and visualization. This enables pizza business owners to make data-driven decisions on inventory, staffing, and marketing, ultimately improving performance and profitability.
data data-structures data-visualization excel mysql powerbi
Last synced: 20 Jun 2026
https://github.com/jorgeatgu/pqnvl
candi-DATOS
candi-datos data data-viz elections elections-spain poletika political spain
Last synced: 20 Jun 2026
https://github.com/CentralFloridaAttorney/ComfyUI-ZMongo
An easy-to-use database framework and parameter library for ComfyUI. Centralize node presets, capture workflow logic, manage structured image collections, and build document-driven text automation pipelines on an offline Local File Store or BusinessProcessApplications.com.
api comfy comfy-ui comfyui comfyui-custom-node comfyui-custom-nodes comfyui-manager comfyui-node comfyui-nodes comfyui-workflow data database
Last synced: 21 Jun 2026
https://github.com/datadotworld/dw-jupyter-contents
Jupyter ContentsManager implementation for data.world
data data-analysis data-science dwstruct-t50-public-projects jupyter jupyter-notebook jupyterlab reference-implementation
Last synced: 22 Jun 2026
https://github.com/brianali-codes/github-searcher
A website for API experimentation that uses the GitHub API to search for different users and some of their (public) information
Last synced: 21 May 2026
https://github.com/vatshayan/list-of-animals-data-classification-
Classification & Visualization of List of Animals Data set using Machine Learning Algorithm
animal-behavior animal-data animals artificial-intelligence classification data data-analysis data-mining data-science data-visualization dataset jupyter-notebook machine-learning python supervised-learning
Last synced: 17 May 2026
https://github.com/psyteachr/sdg-data
Data relevant to the UN Sustainable Development Goals
Last synced: 09 Oct 2025
https://github.com/vaxdata22/foresight-institution
This is a Data Analysis case study done on the Foresight Institution dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-processing data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 28 May 2026
https://github.com/entropyorg/p5-data-testimage
:notebook::camera: interface for retrieving test images
Last synced: 29 May 2026
https://github.com/miozilla/fraudfinder
fraudfinder :mag_right::smiling_imp::suspect: : Historical Payment Transactions # Fraud Detection # EDA # Feature Store # Model Registry
analysis data exploratory feature-store fraud-detection
Last synced: 29 Aug 2025
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/gianlucatruda/qs-analyser
A quantified self data analysis script in Python 3.
data experiment matplotlib matrix optimization productivity python quantified quantified-self science self
Last synced: 10 Oct 2025
https://github.com/j-sephb-lt-n/joes_giant_toolbox
A large collection of general python functions and classes that I use in my daily work
ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping
Last synced: 10 Oct 2025
https://github.com/loggdme/kyro
Collection of utilities and examples for creating efficient data pipelines in Go with parallel queues, rate limiters, and much more.
Last synced: 14 Jan 2026
https://github.com/team-hydrogen/nasa-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 25 Mar 2025
https://github.com/lorenzobloise/client_satisfaction_classification
Jupyter notebook in which satisfaction from clients reviewing European hotels is analyzed using Python libraries such as pandas, numpy and scikit-learn. Various classification models are trained and tested to predict client satisfaction.
classification data data-mining jupyter jupyter-notebook machine-learning pandas python
Last synced: 21 Feb 2026
https://github.com/jatin-mehra119/paris_housing_price-kaggle-
Paris Housing Price Kaggle Competition
data data-visualization kaggle-competition machine-learning numpy pandas predictive-modeling scikit-learn
Last synced: 29 Apr 2026
https://github.com/aldro61/mmit-data
The data used in the Maximum Margin Interval Trees paper
data machine-learning machine-learning-algorithms reproducible-research
Last synced: 19 Feb 2026
https://github.com/afnanenayet/kaggle-titanic
The classic Kaggle Titanic data science challenge
backprop backpropagation classification classifier data forest kaggle layer learn mlp multi numpy pandas perceptron random science scikit sklearn titanic
Last synced: 12 Apr 2026
https://github.com/mapaor/horaris-rodalies
Website that uses the Rodalies de Catalunya API to show timetables in a more fun way
adif api ave barcelona bordils catalunya dades data distancia generalitat girona horaris md r11 regional renfe rodalies sants tren viajes
Last synced: 16 May 2026
https://github.com/sourceduty/text_file_metadata
📄 Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/srvanderplas/statistical_atlas
Framed Charts and the Statistical Atlas of 1870
census data ggplot2 graphics r statistics visualization
Last synced: 29 May 2026
https://github.com/sonwaneshivani/data-science-learning
Basics of Python
css data data-science deep-learning flask gen-ai html ml nlp
Last synced: 02 May 2026
https://github.com/equinor/sumo-wrapper-python
Thin Python wrapper for interacting with the Sumo API
analytics data fmu python subsurface sumo
Last synced: 19 Jan 2026
https://github.com/sourceduty/data_metrics
📈 Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/rishabhmathur06/data_analysis-netflix
data data-analytics data-science matplotlib-pyplot numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/jacoblincool/moodle-export
A streamlined library for retrieving data from Moodle.
Last synced: 07 May 2025
https://github.com/sourceduty/digital_brand_footprint
🔗 Expert in finding and analyzing branded websites and social media links.
analytics artificial-intelligence business business-footprint businesses chatgpt company concept data link openai social-media tool url website
Last synced: 16 Aug 2025
https://github.com/radekbednarik/covid-czech-data-api
Library to make it easy to work with REST API of official Czech Covid data.
api covid-19 data deno library typescript
Last synced: 02 May 2026
https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
Last synced: 14 Jan 2026
https://github.com/drzax/light-up-brisbane
Where, what and why various public places in Brisbane are lit up.
Last synced: 19 Jan 2026
https://github.com/pixlcrashr/stwhh-mensa
Better STWHH Mensa menu data / interface / notifier
api crawler data food studierendenwerk-hamburg university website
Last synced: 07 Aug 2025
https://github.com/adilsaid64/real-time-data-monitoring
Exploring what a real-time data drift monitoring solution could look like within MLOps
data datadrift grafana machine-learning mlops mlops-workflow prometheus python software-engineering
Last synced: 04 Aug 2025
https://github.com/jesuscc1993/data-cleaner-extension
Clears browser data in a single click.
application-data chrome chrome-extension data
Last synced: 02 May 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/prakashpandey16/sql_data_warehouse_project
Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.
cleaning-data data data-engineering data-science database etl-pipeline sqlserver
Last synced: 03 May 2026
https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms
The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.
data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier
Last synced: 01 May 2026
https://github.com/bilgehangecici/datatypeconverter
Converting integer and floating numbers to appropriate bit-level representation.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/mamskie/visdat
Google Colab
colab-notebook data visualization
Last synced: 03 Aug 2025
https://github.com/soenneker/soenneker.data.email.disposables
Simply adds a list of compiled disposable/temporary email domains, updated daily (if available)
csharp data disposable disposables domain dotnet email mailinator
Last synced: 29 May 2026
https://github.com/gman-au/white-knight-neo4j
Neo4j implementation of White Knight data abstraction library
abstractions data datastore dotnet neo4j repository-pattern specification-pattern
Last synced: 20 Jan 2026
https://github.com/smaug6739/sidonie
📦 Sidonie is a prototype module for manipulating JSON data.
data database javascript json module typescript
Last synced: 03 May 2026
https://github.com/2022-04-11588/data-fakes
🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.
data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing
Last synced: 11 Apr 2026
https://github.com/tn3w/moviedb-json
A JSON library with 981,530 films.
data database db json movie movie-database movies
Last synced: 03 May 2026
https://github.com/alextanhongpin/node-github-api
:page_with_curl: Sample GitHub API queries with Node.js for scraping purposes
Last synced: 06 May 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/charlenry/python_data_science
My hands-on lab notebooks on Python for Data Science
analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation
Last synced: 25 Jun 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/j-sephb-lt-n/personal-projects
A history of my personal projects and professional development
ai api auth cloud data llms personal-development web
Last synced: 24 Jan 2026
https://github.com/sushmashreeps/data-science-with-python
This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine
cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization
Last synced: 14 Apr 2026
https://github.com/ghufranbarcha/company-account-analyzer
This project is a Streamlit application designed to visualize and analyze client data. It includes interactive features for exploring client-specific metrics, generating plots, and viewing distribution charts.
data data-science pandas streamlit visualization
Last synced: 03 May 2026
https://github.com/saboye/sales-performance-analysis
A dashboard that presents monthly sales performance by product segment and product category to help clients identify the segments and categories that have met or exceeded their sales targets, as well as those that have not.
dashboard data data-science eda tableau visualization
Last synced: 27 Jan 2026
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/qrailibs/dataflow
✨ Data processing in Node.js made multithreaded and type-safe.
data dataprocessing multithread node
Last synced: 04 May 2026
https://github.com/meokullu/colorizenumber
ColorizeNumber - Bodrum Papatya: visualizes numeric data as colors, producing an image.
color colorize colors data data-visualization visualization vizualize-data
Last synced: 01 Jun 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/erencelik/binance-public-data-node
Node.js downloader and unzipper script for Binance Public Data
binance data downloader nodejs public script
Last synced: 15 May 2026
https://github.com/inist-cnrs/ws-data
Models and data for the web services
Last synced: 03 Sep 2025
https://github.com/octoenergy/tentaclio-snowflake
A Python project containing all the dependencies for the Snowflake tentaclio schema.
Last synced: 20 Oct 2025
https://github.com/alimghmi/bdlc
Bloomberg API integration, handling data requests, processing, and SQL database insertion.
api-client bloomberg data data-processing financial-data oauth2 python sql-database transformation
Last synced: 10 Jun 2026
https://github.com/thedevreda/jadaerospace
A real-life project showing how to improve aircraft parts sales and help sellers focus on more effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/plurid/defocus
Apophatic User Content Resolution [Desearch Concept]
Last synced: 08 Nov 2025
https://github.com/smeltier/data-structures-c
This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.
algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c
Last synced: 16 May 2025