data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/east-empire-trading-company/eetc-data-client
Client library for retrieving data managed by EETC Data Hub.
client-library data data-science finance library python
Last synced: 31 May 2026
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 29 Apr 2026
https://github.com/definetlynotai/vulnscan_data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 11 Oct 2025
https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint
A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT
bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs
Last synced: 25 Sep 2025
https://github.com/strata/data
Tools to help you read data from a range of different data providers.
Last synced: 27 Jan 2026
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/agavitalis/sample-c-codes
A collection of small projects I carried out on audino as an electronic engineering student despite felling in love with website development.
ageteller atm binary data gpcalculator logging
Last synced: 09 Apr 2025
https://github.com/shawnduong/pacman-digest
Generate a digest of package space usage for Linux systems using pacman.
Last synced: 13 May 2026
https://github.com/himel-sarder/web-scraping-it-jobs-dataset
This project is a Python-based web scraping tool that collects job listings from TimesJobs for IT-related positions. It extracts job titles, company names, locations, and experience requirements, and saves the data into a CSV file. The tool uses BeautifulSoup and Pandas for web scraping and data manipulation.
data datascience dataset kaggle-dataset machine-learning machinelearning ml web-scraping
Last synced: 22 Feb 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/diegoperea20/own_dataset_segmentation_yolov8
Segmentacion y detection de objetos con propio dataset usando YOLOV8 , en el que se utiliza un dataset propio de una moneda de 200 pesos colombianos del año 2023.
coins colombia data opencv own python segmentation tensorflow yolov8
Last synced: 12 Apr 2026
https://github.com/tonykipkemboi/ens_subgraph_data
Query On-Chain Data from Subgraphs by The Graph Protocol using Python
data subgraphs thegraphprotocol web3
Last synced: 17 Sep 2025
https://github.com/ajsalemo/python-pandas-datalib
Testing and experimenting with some simple Pandas functionality using Flask to serve the parsed data.
csv data flask json pandas pandas-dataframe pandas-series python tabular tabular-data terminal
Last synced: 09 Apr 2026
https://github.com/vincentneo/sgtidetimings
Scraped SG NEA tide timings table into machine-readable JSON files!
data github-actions github-pages gov html-tables-to-json javascript json nodejs sg singapore singapore-data-analysis tide webscraping
Last synced: 10 Apr 2026
https://github.com/themost-framework/memory
MOST Web Framework in-memory data adapter for testing environments
Last synced: 06 Mar 2025
https://github.com/malvfr/zap
Fill your database with fake data.
cli csv data database generator hacktoberfest mock node populate populate-database seed sql
Last synced: 21 Jan 2026
https://github.com/chandraprakash-bathula/keywords_prediction-machine-learning-integration
Keywords Prediction Model Built the Model By: Data Cleaning Removing Stopwords Constructing Word2vec Advancing to TF-IDF Weighted Word2vec.
algori artifici data machine-learning tf-idf weighted-word2vec word2vec
Last synced: 08 Nov 2025
https://github.com/jrmedd/emojinal
An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.
data emojis sentiment user-research ux
Last synced: 19 Jan 2026
https://github.com/mccarthy-m-g/alda
An R data package for the book "Applied longitudinal data analysis: Modeling change and event occurrence" by Singer and Willett (2003).
data growth-curves longitudinal-data mixed-models nonlinear-mixed-models r r-package structural-equation-modeling survival-analysis time-to-event
Last synced: 19 Jan 2026
https://github.com/yasenstar/powerbi_tutorial
Base on "PowerBI Tutorial" book, provide step by step video demo on learning and mastering Power BI tool
analytics data microsoft powerbi tutorial visualization
Last synced: 07 Jan 2026
https://github.com/iguptashubham/walmart-eda
Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further,.
analysis data dataset eda mysql portfolio-project python sql
Last synced: 10 Apr 2026
https://github.com/danieljdufour/rle-serializers
Serialize and Deserialize Run Length Encoding
cloud-optimized compression csv data deserializer run-length run-length-decoding run-length-encoding serializer
Last synced: 24 Sep 2025
https://github.com/stdlib-js/datasets-herndon-venus-semidiameters
Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.
astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus
Last synced: 09 Oct 2025
https://github.com/simonbernarding/ml_project_simonbernarding
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science flight-delay-prediction machine-learning machinelearning prediction
Last synced: 12 Oct 2025
https://github.com/sefakcmn00/tensorflow_car_price_analysis
In this project, after extracting the data sets as csv, we tried to represent the car prices graphically and schematically by using data analysis and data visualization methods. We checked the connection of the car prices we analyzed with other data, then we created a 4-layer and 12-neuron system.
data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow
Last synced: 14 Apr 2026
https://github.com/akhi07rx/f1-statistics-dashboard
A comprehensive command-line tool for analyzing Formula 1 race data using the FastF1 library.
akhi07rx cli cli-tools data f1 f1-score f1cli f1dashboard f1stats fastf1 formula1 opensource race race-analytics
Last synced: 23 May 2026
https://github.com/vatshayan/list-of-animals-data-classification-
Classification & Visualization of List of Animals Data set using Machine Learning Algorithm
animal-behavior animal-data animals artificial-intelligence classification data data-analysis data-mining data-science data-visualization dataset jupyter-notebook machine-learning python supervised-learning
Last synced: 17 May 2026
https://github.com/genert/metis
Asynchronous data sender library
analytics asynchronous data dependency-free typescript
Last synced: 27 Jan 2026
https://github.com/castdrian/kdapi
A TypeScript library that scrapes K-pop idol and group information from online sources to create comprehensive JSON datasets.
api data kpop scraper typescript
Last synced: 15 May 2025
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study-A better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 13 Apr 2026
https://github.com/anobaka/insidecollector
这是一个介于Excel和纯记录工具之间的软件,您可以自由创建各种列表,然后将其以各种规则关联起来,并且可以创建自定义视图帮助您更好地理解数据。
collection data excel-like list list-manager table
Last synced: 19 Jan 2026
https://github.com/free-domains/data
A simple website which visualises domain data.
data data-visualisation data-visualiser data-visualization data-visualizer free-domains
Last synced: 18 Apr 2025
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/coqui123/tradegpt
TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FASTAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹
analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript
Last synced: 11 Apr 2026
https://github.com/tberey/social-stocks
A Graphical Data and Analysis Tool
data data-analysis data-science data-stream data-visualization database javascript mysql mysql-database node nodejs rest rest-api social-stocks stock-market stocks ticker-data tickers trends typescript
Last synced: 21 Jan 2026
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/connectaman/deepseek-ocr-multigpu-infer
Efficient multi-GPU OCR inference framework leveraging parallel processes for accelerated token throughput and faster batch processing. Designed for scalable, high-performance optical character recognition workloads using PyTorch. Supports dynamic GPU assignment, optimized resource utilization, and easy integration for large-scale image datasets.
agentic-extraction data deepseek document-parser extraction extractor gpu image-parser llm multigpu nvidia ocr parallel-computing parser pdf-parser vlm
Last synced: 22 Jan 2026
https://github.com/flowsynx/plugin-postgresql
FlowSynx plugin to interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.
data database flowsynx postgresql postgresql-database sql
Last synced: 09 May 2026
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 31 Jul 2025
https://github.com/twistezo/ts-dto-mapper
DTO (Data Transfer Object) to Object Model transformer
data dto map mapper model object transfer transform transformer typescript
Last synced: 05 Feb 2026
https://github.com/player29879/neum-ai
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors
Last synced: 18 Apr 2026
https://github.com/bukalapak/bukadata
Data supplier plugin for populating design with real data.
data plugin sketch sketch-plugin
Last synced: 05 Jul 2025
https://github.com/richardschoen/sshnetibmi
This .Net/.Net Core class library is used to interface with existing IBM i database, program calls, CL commands, service programs and data queues via the PASE based xmlservice-cli PASE command program or regular qsh/bash commands. qsh/bash commands can be used to interface with any qsh/pase based utilities such as the IBM i db2util utility
as400 cl command csharp data db2 ddm dotnet drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg xmlservice xmlservice-cli
Last synced: 04 Feb 2026
https://github.com/derrickbaruga7/python-data-analysis
This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.
analytics data data-science visualization
Last synced: 31 Jul 2025
https://github.com/linx-software/file-import-to-rest-api
Import a CSV file and make the data available via a REST API.
Last synced: 19 Mar 2026
https://github.com/nightroman/farnet.fsharp.data
FSharp.Data package for FarNet.FSharpFar
Last synced: 27 Apr 2026
https://github.com/iamlucianojr/laravel-api-query-handler
:flashlight: This Laravel package helps to handle a query request properly
api collection data eloquent handler l5x laravel query
Last synced: 28 Apr 2026
https://github.com/jtpio/data-playground
Experiments using public APIs and data
Last synced: 28 Apr 2026
https://github.com/ahmetcansolak/developer-insights
New project of ClubRockers from Sarıyer Hills
bitbucket data data-science data-visualization github python3
Last synced: 28 Apr 2026
https://github.com/eugenedakin/polyalphabeticcipher
PolyAlphabetic Cipher
data decryption encryption polyalphabetic polyalphabetic-cipher polyalphabetic-crypto polyalphabetic-substitution xojo
Last synced: 19 Mar 2026
https://github.com/jackosheadev/databasetechproject
This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction
Last synced: 18 May 2026
https://github.com/skywardai/paper_gallery
Papers gallery for using LLMs ability over dataset
ai data data-science llm medicine neural-network research security
Last synced: 19 Mar 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/agnosticeng/cli
Agnostic magic is now at your fingertips.
cli clickhouse data datalake datalakehouse
Last synced: 03 Mar 2026
https://github.com/garcane/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 09 Feb 2026
https://github.com/aidanjuma/ankideckextractor
A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.
anki apkg cli data decompression extraction flashcards learning python zip
Last synced: 29 Apr 2026
https://github.com/azevedontc/datapulse
DataPulse
automation brazil cli data data-analysis matplotlib meteorology open-meteo pandas prevision pycharm python python3 reports venv weather
Last synced: 29 Apr 2026
https://github.com/mchenryspagg/hng-hire-data-model
The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.
dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema
Last synced: 11 Feb 2026
https://github.com/skygenesisenterprise/aether-account
Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.
account data javascript nextjs platform service sso-service typescript user-interface
Last synced: 16 Apr 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/chrnthnkmutt/theartofstatistic_python
This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/ismailarilik/react-covid-maps
A global maps application aims to display COVID-19 statistics by countries, written with React
covid-19 data global maps react statistics
Last synced: 16 Apr 2026
https://github.com/chompfoods/stub-asp-net-core
ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server
Last synced: 30 Apr 2026
https://github.com/ajityadav2621/datadoom
Currently working on backend, and as user interaction has been done so updated also deployed for reference. will be adding up many things.
Last synced: 09 Feb 2026
https://github.com/stdlib-js/slice-ctor
Slice constructor.
constructor constructors ctor ctors data javascript node node-js nodejs python slice stdlib structure types
Last synced: 16 Apr 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/ewertondrigues02/engenharia-de-dados
Varios Projetos de Engenharia de Dados usando principais ferramentas como: Airflow, Snowflake, dbt, Postrgres, Looker Studio, Power BI
airflow analise-exploratoria analytics aws-ec2 dados data dbt-cloud engenharia-de-dados looker-studio postgres pyspark python3 snowflake spark
Last synced: 16 Apr 2026
https://github.com/ozlerhakan/eda
Exploratory Data Analysis Samples
data data-analysis data-virtualization eda exploratory-data-analysis matplotlib plotly python seaborn
Last synced: 16 Apr 2026
https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico
Software em Python para varrer arquivos PDF e extrair parâmetros diretamente para arquivo Excel
analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3
Last synced: 01 May 2026
https://github.com/lucien-loua/libgn
Manipulate geographical and administrative data about Guinea.
Last synced: 08 Jun 2026
https://github.com/divanny/academixbackend
🧑🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.
backend csharp data net webapi
Last synced: 07 Jun 2026
https://github.com/3squared/smoulder
Smoulder is a really good data pipe
composition data facade-pattern forge-framework object-oriented
Last synced: 25 Apr 2026
https://github.com/gdhhgnbnvbn/f1-2025-ai-predict
fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.
agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe
Last synced: 01 May 2026
https://github.com/athari22/house_sales_in_king_count_usa
The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.
analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library
Last synced: 01 May 2026
https://github.com/danielgiljam/orbit-utils
A collection of utility packages for Orbit.js.
data inference orbit orbitjs schema synchronization type typescript validation zod
Last synced: 01 May 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/stefanbohacek/fediverse-account-analyzer
bots botsinspace data dataviz fediverse mastodon
Last synced: 02 May 2026
https://github.com/liuliqiang/laueagle
YAML/JSON Lints and Converters
converter data formater json linter python serialization yaml
Last synced: 02 May 2026
https://github.com/unicef/magicbox-download-shapefiles
Downloads shapefiles for each country from gadm.org and unzips them.
data data-science docker downloads-shapefiles emergency-response gadm geospatial geospatial-data humanitarian javascript magicbox nodejs shapefile unicef
Last synced: 02 May 2026
https://github.com/lmuffato/project-mongodb-dataflights-trybe
Projeto MongoDB Dataflights - Projeto avaliativo da Trybe do Bloco 23: Introdução ao MongoDB
back-end crud data database filter mongo mongodb query trybe-projects
Last synced: 16 Apr 2026
https://github.com/aniketkkajania/wassupanalyzer
WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.
data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat
Last synced: 25 Feb 2026
https://github.com/atesbazi/dataimitator
Generates random data for your needs.
clojure clojure-library data fake fake-data random-data random-data-generation random-generation
Last synced: 08 Feb 2026
https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis
Compltete AnalysisData Cleaning,processing and data analysis with interactive dashboard
analysis data data-visualization datacleaning excel excel-vba
Last synced: 19 Mar 2026