data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/danish-foundation-models/dfm-processing
Toolkit for processing data in the Danish Foundation Models project.
Last synced: 02 Jul 2025
https://github.com/the-aerospace-corporation/pivt
PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.
analytics dashboards data devops jenkins pipeline python splunk visualization
Last synced: 29 Apr 2026
https://github.com/vapourismo/binary-io
Read and write values of types that implement Binary from and to Handles
data haskell haskell-library io parsing
Last synced: 28 Mar 2025
https://github.com/chompfoods/stub-jaxrs-resteasy
JAX-RS RESTEasy server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients jax-rs jax-rs-server nutrition raw recipe-api recipes resteasy server server-stub stub stub-server
Last synced: 08 May 2026
https://github.com/abdul-rafay19/youngdevinterns_machine-learning_tasks
This internship offers hands-on exposure to real-world Machine Learning applications — from data visualization and preprocessing to model development, evaluation, and deployment. It focuses on real ML workflows, problem-solving, neural networks, and hyperparameter tuning — all within a collaborative, remote, and growth-oriented environment.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data data-visualization internship machine-learning machine-learning-algorithms machinelearning ml model model-development neural-network preprocessing programming-language python task tasks youngdevintern
Last synced: 29 Apr 2026
https://github.com/andyduke/data_processor_cli
Flexible command-line data processor.
command-line command-line-tool converter csv data data-structures json template toml xml yaml
Last synced: 08 May 2026
https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis
Complete analysis: data cleaning, processing, and data analysis with an interactive dashboard
analysis data data-visualization datacleaning excel excel-vba
Last synced: 19 Mar 2026
https://gitlab.com/Native-Coder/d3-react-component
This is a dead-simple React component that makes D3 implementation a breeze.
chart component d3 data react vis visualization viz
Last synced: 24 Jan 2026
https://github.com/gbowne1/jsonhelix
This is an X11 GUI JSON application for editing, debugging, and converting JSON, schemas, and API data.
api data gui gui-application json x11
Last synced: 10 Jun 2025
https://github.com/hamzacham/data_set_projet-4
analysis analytics data data-science datawarehouse sas sql sql-server
Last synced: 24 Mar 2025
https://github.com/yord/klp-json
A JSON plugin for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 29 Apr 2026
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 13 Apr 2026
https://github.com/benmaier/boarding_school_sir
Fit SIR dynamics to the prevalence curve of an H1N1 outbreak at a British boarding school in 1978.
boarding data disease epidemiology modeling school spreading
Last synced: 31 Mar 2025
https://github.com/jayantur13/kountry
Node module variant of the Country API
api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn
Last synced: 26 Jan 2026
https://github.com/cdapio/website
CDAP IO website
analytics applications cdap cdapio data data-analytics data-integration hugo integration metadata oss rules-engine
Last synced: 18 Jun 2025
https://github.com/zonggen/data-structure
Course notes on data structures and analysis (CSC263)
Last synced: 23 Mar 2025
https://github.com/yash22222/sync-intern-s-ml-tasks
SYNC INTERN'S Machine Learning internship offers you the chance to enhance your skills through real-life example projects. This internship will increase your knowledge of data and algorithms so you can understand how a machine learns.
bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns
Last synced: 07 May 2026
https://github.com/ewertondrigues02/engenharia-de-dados
Various data engineering projects using major tools such as Airflow, Snowflake, dbt, Postgres, Looker Studio, and Power BI
airflow analise-exploratoria analytics aws-ec2 dados data dbt-cloud engenharia-de-dados looker-studio postgres pyspark python3 snowflake spark
Last synced: 16 Apr 2026
https://github.com/fiddlydigital/fastmap
A simple 2D map that is optimized for speed.
Last synced: 23 Oct 2025
https://github.com/definetlynotai/vulnscan_data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 11 Oct 2025
https://github.com/free-domains/data
A simple website which visualises domain data.
data data-visualisation data-visualiser data-visualization data-visualizer free-domains
Last synced: 18 Apr 2025
https://github.com/bredalis/exceptions
Examples of exceptions 🚫
algotithms coding data exceptions language-programing python
Last synced: 04 Mar 2025
https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights
Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.
analytics data excel sas sasprogramming statistical-analysis
Last synced: 24 Feb 2026
https://github.com/rodekruis/510-data-catalog
A CKAN-based data catalog portal for 510
Last synced: 23 Jan 2026
https://github.com/satur-io/estoraje
Estoraje is the simplest distributed system for key-value storage in fewer than 800 lines of code. It is temporarily consistent, highly available, lightweight, and scalable, and it performs well.
data database distributed go golang key-value performance training
Last synced: 07 May 2026
https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali
Manual for the service information repository of the municipal employment trials
data drupal drupal-9 unemployment
Last synced: 24 Jan 2026
https://github.com/saroshfarhan/kaggle-playground-s4e12
Kaggle competition first attempt
analytics data data-analysis-python data-science
Last synced: 12 Oct 2025
https://github.com/imahdimir/githubdata
A very simple Python package to easily download from and manage a GitHub "Data Repository"
data data-repository python-package
Last synced: 23 Jan 2026
https://github.com/fritzrehde/asciibar
A CLI tool to print percentages as ASCII bar charts
cli data percentage visualization
Last synced: 31 Oct 2025
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/sefakcmn00/tensorflow_car_price_analysis
In this project, after extracting the data sets as CSV, we tried to represent car prices graphically and schematically using data analysis and data visualization methods. We checked how the car prices we analyzed relate to other data, then created a 4-layer, 12-neuron system.
data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow
Last synced: 14 Apr 2026
https://github.com/yasenstar/powerbi_tutorial
Based on the "PowerBI Tutorial" book; provides step-by-step video demos for learning and mastering the Power BI tool
analytics data microsoft powerbi tutorial visualization
Last synced: 07 Jan 2026
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/dicook/tutorial_effective_data_plots
Materials for WOMBAT 2024 tutorial
data graphics inference statistics tidyverse visualisation
Last synced: 23 Jan 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus data analysis with live data streaming from the website, finishing with a Dash web app.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/maccccd/wsoa3029a_2444372
This website serves as an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js, a JavaScript library used to create interactive data visualizations. The visualizations here provide insights into two types of cybersecurity attacks: phishing and ransomware.
d3js data hacking visualization
Last synced: 24 Jan 2026
https://github.com/jhpoelen/rats
Self-replicating data publication related to rat (Rattus sp.) specimens.
biodiversity data natural-history-collections provenance
Last synced: 18 Mar 2026
https://github.com/exoticknight/juhe
simple way to analyze complex data in one chain call
aggregation aggregator analysis data statistic typescript
Last synced: 21 May 2026
https://github.com/diegoperea20/own_dataset_segmentation_yolov8
Object segmentation and detection using YOLOv8 with a custom dataset of a 2023 Colombian 200-peso coin.
coins colombia data opencv own python segmentation tensorflow yolov8
Last synced: 12 Apr 2026
https://github.com/quasilyte/phpcorpus
A collection of various PHP code; useful for PHP tool writers to get some insight into what "real-world" PHP code looks like
analysis corpus data php php-corpus
Last synced: 04 Jul 2025
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/athari22/house_sales_in_king_count_usa
The idea of the project is to perform a data analysis for a Real Estate Investment Trust that would like to start investing in residential real estate.
analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library
Last synced: 01 May 2026
https://github.com/qeeqbox/data-states
Data states refer to structured and unstructured data divided into three categories (At Rest, In Use, and In Transit)
data data-state infosecsimplified qeeqbox
Last synced: 10 Mar 2026
https://github.com/iamgmujtaba/github-python-daily-trending
This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file
data data-scraping github-actions tranding tranding-bot
Last synced: 13 Oct 2025
https://github.com/montanaz0r/imdb-ratings-auto-inserter
A Python script that enables auto-inserting movie ratings into the IMDB profile.
data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping
Last synced: 07 May 2026
https://github.com/tberey/social-stocks
A Graphical Data and Analysis Tool
data data-analysis data-science data-stream data-visualization database javascript mysql mysql-database node nodejs rest rest-api social-stocks stock-market stocks ticker-data tickers trends typescript
Last synced: 21 Jan 2026
https://github.com/connectaman/deepseek-ocr-multigpu-infer
Efficient multi-GPU OCR inference framework leveraging parallel processes for accelerated token throughput and faster batch processing. Designed for scalable, high-performance optical character recognition workloads using PyTorch. Supports dynamic GPU assignment, optimized resource utilization, and easy integration for large-scale image datasets.
agentic-extraction data deepseek document-parser extraction extractor gpu image-parser llm multigpu nvidia ocr parallel-computing parser pdf-parser vlm
Last synced: 22 Jan 2026
https://github.com/danielbello7/nosql-json-database
Simple and quick database to help development process and speed
data database json json-database models nosql nosql-database nosql-json-database schema
Last synced: 09 May 2026
https://github.com/bilalmehrban/data-log-monitor
A simple yet elegant desktop C# application based on a 3-tier architecture, designed for viewing logs stored in the database using NLog or other logging frameworks.
csharp data desktop-app logging
Last synced: 14 Mar 2025
https://github.com/geocollections/turvas
Database of peat geology
data data-visualization database estonia geology mineral-resources peat
Last synced: 05 Feb 2026
https://github.com/3squared/smoulder
Smoulder is a really good data pipe
composition data facade-pattern forge-framework object-oriented
Last synced: 25 Apr 2026
https://github.com/ompreetham/dcn-network-traffic-anomaly-detection
Data Communication Networks - Network Traffic Anomaly Detection
anomaly anomaly-detection communication data dcn keras learning machine machine-learning network pandas presentation project python scikit-learn tensorflow traffic
Last synced: 08 Apr 2026
https://github.com/jeanmanguy/milk-sci-fi
Census of every mention of milk in sci-fi works.
Last synced: 26 Feb 2026
https://github.com/athul64/powerbi
Financial Reports Dashboard: this repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains monthly and annual reports, allowing users to switch between the two views to analyze data at different intervals.
data data-an data-visualization dax dax-expression powerbi
Last synced: 23 Feb 2026
https://github.com/ggeop/multiple-fields-management
Fields management from/to different data sources. 💡
data data-engineering data-organization data-retrieval data-science pandas python
Last synced: 01 May 2026
https://github.com/dominhduy09/my-links
All of the links and websites I have been creating, saved in one place
data database link linked-list linktree list save storage website
Last synced: 03 May 2026
https://github.com/lisakey/datacamp-data-analyst-python-sql-projects
Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.
analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali
Last synced: 19 Apr 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/raymondcm/strawberrydata
Tool suite for fast multi-camera strawberry data collection project. The standards document houses cross compatibility/purpose implementation details.
camera cpp data intel multi-camera
Last synced: 08 Feb 2026
https://github.com/rremple/intervalidus
For all your interval-based data needs.
Last synced: 21 Feb 2026
https://github.com/atesbazi/dataimitator
Generates random data for your needs.
clojure clojure-library data fake fake-data random-data random-data-generation random-generation
Last synced: 08 Feb 2026
https://github.com/himel-sarder/web-scraping-it-jobs-dataset
This project is a Python-based web scraping tool that collects job listings from TimesJobs for IT-related positions. It extracts job titles, company names, locations, and experience requirements, and saves the data into a CSV file. The tool uses BeautifulSoup and Pandas for web scraping and data manipulation.
data datascience dataset kaggle-dataset machine-learning machinelearning ml web-scraping
Last synced: 22 Feb 2026
https://github.com/agavitalis/sample-c-codes
A collection of small projects I carried out on Arduino as an electronic engineering student, despite falling in love with website development.
ageteller atm binary data gpcalculator logging
Last synced: 09 Apr 2025
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 27 Jan 2026
https://github.com/stdlib-js/ndarray-base-dtypes2signatures
Transform a list of array argument data types into a list of signatures.
api array base data dtype dtypes interface javascript multidimensional ndarray node node-js nodejs sig signatures stdlib types utilities utility utils
Last synced: 14 Apr 2026
https://github.com/SAP-archive/signavio-qualtrics-di
Set up an SAP Data Intelligence data pipeline to connect Qualtrics survey data to SAP Signavio Process Intelligence via the Ingestion API.
data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio
Last synced: 09 May 2025
https://github.com/hasnocool/war_thunder_data_scraper
A web scraping tool designed to extract valuable data from War Thunder, a popular online game.
data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war
Last synced: 06 May 2026
https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis
This project explores the Walmart Sales data to understand top-performing branches and products, sales trends of different products, and customer behaviour. The aim is to study how sales strategies can be improved and optimized. The dataset was obtained from Kaggle.
data database mysql sql walmart
Last synced: 24 Feb 2026
https://github.com/bredalis/numpy
✨ Library to work with arrays ✨
arrays data matrix numpy numpy-arrays numpy-library python
Last synced: 06 May 2026
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/stdlib-js/array-float32
Float32Array.
array data float float32 float32array ieee754 javascript node node-js nodejs single single-precision stdlib structure typed typed-array types
Last synced: 14 Jan 2026
https://github.com/codeforafrica/ckanext-followy
[ARCHIVED] A CKAN extension to show the datasets a user is following.
ckan ckan-extension ckanext-followy data dataset followy-extension open-data
Last synced: 16 Mar 2025
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/stdlib-js/ndarray-base-to-reversed
Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view
Last synced: 12 Apr 2026
https://github.com/ibilalkayy/covid-tracking-app
This repository contains the code of a COVID tracking app that shows COVID-19 data on Google Maps.
Last synced: 14 Oct 2025
https://github.com/cworld1/novel-data
The data repository of novel analysis
Last synced: 01 Feb 2026
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/astrid-project/cb-manager
APIs to interact with the Context Broker's database. Through a REST Interface, it exposes data and events stored in the internal storage system in a structured way. It provides uniform access to the capabilities of monitoring agents.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 30 Jun 2025