data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/ispyhumanfly/prowler
Query the web, extract data from the results, and transform that data into a format you can use.
ai analytics business cryptocurrency data extract-data machine-learning mining scraping web
Last synced: 06 Sep 2025
https://github.com/tylerben/data-spring
Easily generate a dummy dataset based on a provided config
data data-spring datagenerator fake-data generator javascript typescript
Last synced: 27 May 2026
https://github.com/stdlib-js/array-zero-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 08 Jan 2026
https://github.com/luminati-io/Crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/Twitter-X-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 09 Apr 2025
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/jrcichra/ingestd
HTTP server that easily ingests data into a database
data gin hacktoberfest ingest ingestion restful-api
Last synced: 28 Apr 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive streamlit covid 19 Dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/izaaccoding36/dados-dinamicos
Esse repositório apresenta um site criado com API para a criação de gráficos, relatando o uso de redes sociais em uma escala global
api data redes-sociais social-media website
Last synced: 26 Mar 2025
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/danielrosehill/monetised-ghg-emissions
Calculating monetised GHG emissions for various companies based upon disclosure data
data sustainability sustainability-data
Last synced: 07 Sep 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 01 Mar 2025
https://github.com/unownone/spenddy-link
Simple Privacy Friendly chrome extension to track your spends and more!
Last synced: 12 Mar 2026
https://github.com/ncgl-git/eriparse
Python code to parse the cost-of-living HTML from erieri.com, i.e. https://www.erieri.com/cost-of-living/united-states/illinois/chicago
cost-of-living crime crime-data data economic-research-institute erieri webscraper
Last synced: 14 Jan 2026
https://github.com/bunnysunny24/bluepulse
A Smart Water Management System
data data-processing data-visualization firebase iot machine-learning mysql-database reactjs
Last synced: 17 Mar 2025
https://github.com/s-raza/csvio
Wrapper for conveniently processing CSV files
csv data file processing wrapper
Last synced: 14 Jan 2026
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/tushar2704/insurance-cross-sell
This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.
data datascience h20automl machine-learning mlflow python streamlit-tushar2704
Last synced: 08 Oct 2025
https://github.com/jakakokosar/bioinformatics-serverfiles
Knowledge base for Orange3-bioinformatics add-on
bioinformatics data dictybase gene genesets go homologene markergenes ncbi serverfiles
Last synced: 16 Apr 2026
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/andygol/osm-diff-state
CLI tool to search OSM diff state files
custom data openstreetmap planet replication
Last synced: 24 Apr 2026
https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis
Compltete AnalysisData Cleaning,processing and data analysis with interactive dashboard
analysis data data-visualization datacleaning excel excel-vba
Last synced: 19 Mar 2026
https://github.com/sap-samples/security-research-codegraphsmote
Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.
augmentation data detection learning machine research sample security vulnerability
Last synced: 07 Jun 2026
https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview
In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data
data data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/aidenellis/connectmp
🍰 ConnectMP - An easy way to share data between Processes in Python.
aidenellis connectmp data data-sharing multiprocessing process sharing
Last synced: 27 Apr 2026
https://github.com/m0nica/datalogues-outdated
Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll
data pelican pelican-blog pelican-theme
Last synced: 28 Feb 2026
https://github.com/howtoquitvivek/ai-crop-yeild-prediction
AI-driven crop yield prediction and agricultural optimization system (SIH 2025)
2025 2026 ai crop-yeild data minor-project ml predcition python science sih
Last synced: 23 Apr 2026
https://github.com/mikeintoshsystems/dhis2heat
A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 28 Apr 2026
https://github.com/ahmetcansolak/developer-insights
New project of ClubRockers from Sarıyer Hills
bitbucket data data-science data-visualization github python3
Last synced: 28 Apr 2026
https://github.com/ismail-mouyahada/lodscroljs-library
LodScrolJS Documentation LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks
data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll
Last synced: 13 Feb 2026
https://github.com/atesbazi/dataimitator
Generates random data for your needs.
clojure clojure-library data fake fake-data random-data random-data-generation random-generation
Last synced: 08 Feb 2026
https://github.com/colour-science/colour-checker-detection-tests-datasets
Colour - Checker Detection - Tests Datasets
color color-checker color-science color-space color-spaces colorspace colorspaces colour colour-checker colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets raw
Last synced: 19 Mar 2026
https://github.com/the-aerospace-corporation/pivt
PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.
analytics dashboards data devops jenkins pipeline python splunk visualization
Last synced: 29 Apr 2026
https://github.com/quarylabs/quary_basketball_analysis_duckdb
An example analysis
analytics data duckdb engineering quary
Last synced: 29 Apr 2026
https://github.com/obsidianplusplus/5e_play_cs-go
Python工具,分析你在5EPlay的CS:GO比赛数据。抓取、分析、筛选并导出。 | Python tool to analyze your 5EPlay CS:GO match data. Fetches, analyzes, filters, and exports.
5eplay analysis api automation csgo data esports excel json match pandas performance player python reporting scraping stats team
Last synced: 13 Feb 2026
https://github.com/frictionlessdata/extensiondp
Extension DP (Data Package Extension Template) is a Git repository template for rapid Data Package extension development
data datapackage exchange extension format
Last synced: 13 Feb 2026
https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-
Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.
analysis analytics data excel sales
Last synced: 14 Feb 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/blacksujit/shikshamitra
Shiksha Mitra is an innovative MVP designed to reshape the way students learn through gamification. Our platform transforms the traditional approach to education by making learning engaging, interactive, and rewarding. As an MVP, Shiksha Mitra focuses on delivering core features that showcase the value of gamified learning,
ai data gamified-learning hackathon lms ml mlflow mlops mlops-workflow mvp pipeline platforn
Last synced: 28 Feb 2026
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2009
Data for the Quantitative Single-Neuron Modeling Competition (2009).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology-data model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 25 Feb 2026
https://github.com/chrnthnkmutt/theartofstatistic_python
This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/garcane/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 19 Mar 2026
https://github.com/chompfoods/sdk-php
PHP SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients php raw recipe-api recipes sdk
Last synced: 30 Apr 2026
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/neomutt/sample-data
📚 Lists of things. Useful for developing and testing.
Last synced: 19 Mar 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/m-rishab/stock_trend-analysis-power-bi-project-
In this project, I've harnessed the robust capabilities of Power BI to analyse, visualize, and uncover the story behind HUL's stock performance.
data datavisualization datavisualization-project powerbi
Last synced: 19 Mar 2026
https://github.com/divanny/academixbackend
🧑🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.
backend csharp data net webapi
Last synced: 07 Jun 2026
https://github.com/epsoft/deep-learning-for-structured-data
Deep Learning for structured data
concatenate data data-learning dense farsi input load-penguins pandas persian structure structured-data subtract tensorflow
Last synced: 01 May 2026
https://github.com/colour-science/colour-checker-detection-examples-datasets
Colour - Checker Detection - Examples Datasets
color color-checker color-science color-space color-spaces colorspace colorspaces colour colour-checker colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets raw
Last synced: 19 Mar 2026
https://github.com/efler/microservice-data-bus
Data bus based on Apache Kafka and consisting of separate components [copied from own private repos]
data data-bus deduplication enrichment filtering kafka microservice mongodb postgresql redis
Last synced: 16 Apr 2026
https://github.com/skywardai/paper_gallery
Papers gallery for using LLMs ability over dataset
ai data data-science llm medicine neural-network research security
Last synced: 19 Mar 2026
https://github.com/tee8z/noaa-oracle
NOAA data oracle, queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style
data duckdb-wasm noaa-weather parquet-files sql weather
Last synced: 17 Feb 2026
https://github.com/meineglock20/listtotabledisplay
The List to Table Formatter for .NET is a versatile library designed to convert lists of objects into well-formatted table displays . Ideal for web applications and console applications - including log files and word documents.
asp-net asp-net-core console csharp data display dotnet formatter html list logging netstandard20 object-list presentation razor-pages table table-formatter text-table text-to-table utility
Last synced: 04 Mar 2026
https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer
Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.
convert converter data data-science deserialize json lightweight powershell serializer
Last synced: 04 May 2026
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/raghavendranhp/credit_card_fraud_detection
This repository contains code for a credit card fraud detection model using autoencoders and logistic regression, achieving 95.3% accuracy.
anomaly-detection autoencoder-neural-network credit-card-fraud data keras logistic-regression machine-learning preprocessing tensorflow
Last synced: 04 May 2026
https://github.com/stdlib-js/slice-ctor
Slice constructor.
constructor constructors ctor ctors data javascript node node-js nodejs python slice stdlib structure types
Last synced: 16 Apr 2026
https://github.com/jeanmanguy/milk-sci-fi
Census of every mention of milk in sci-fi works.
Last synced: 26 Feb 2026
https://github.com/eradical/analytics-unibody
Ansible role that sets up a farm of analytics collectors based on nginx
analytics ansible ansible-role big-data collectors data nginx
Last synced: 06 May 2026
https://github.com/ajityadav2621/datadoom
Currently working on backend, and as user interaction has been done so updated also deployed for reference. will be adding up many things.
Last synced: 09 Feb 2026
https://github.com/pharo-ai/data-preprocessing
Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.
data pharo pharo-smalltalk preprocessing smalltalk
Last synced: 09 Feb 2026
https://github.com/dandre3000/matrix
Matrix library
algebra array data data-structure math matrix vector
Last synced: 01 Feb 2026
https://github.com/mg380/ibm-applied-data-science-capstone
This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization
capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql
Last synced: 05 Mar 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/palewire/nyc-hpd-bronx-lead-paint-violations
Download and process housing code lead paint violations in the Bronx from NYC Open Data
bronx data data-journalism news nyc python
Last synced: 02 Apr 2026
https://github.com/themuhd/world-cup-analysis
Analysis of The FIFA World cup from its inception to the recently completed tournament in 2023
data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python
Last synced: 08 May 2026