data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/jimbrig/jimstaskviews
CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com
cran data docs rstats shiny-app submodules task-views
Last synced: 06 Mar 2026
https://github.com/vvipjain/ev-data-analysis
EV Data Analysis
data data-analysis data-visualisation tableau tableau-public
Last synced: 16 Feb 2026
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/glassflow/pipelines-push-action
This Github Action lets you automate GlassFlow pipelines deployments as code
data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing
Last synced: 19 May 2026
https://github.com/codenoid/storial.co-database
a Storial.co Database, collected by Hofesh Bot (Scrapper)
Last synced: 28 Mar 2025
https://github.com/glaucopater/covid19-vaccinations
Covid19 Vaccination Statistics
charts covid-19 data echarts italia react statistics vaccini
Last synced: 27 Mar 2025
https://github.com/nia-cloud-official/influx
Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.
data find people-search search-engine
Last synced: 27 Jun 2025
https://github.com/realabbas/instagram-user-meta-data
Instagram User Meta Data 📷 can be fetched using this script in an easy to use JSON Object for displaying Instagram Cards.
data instagram javascript metadata nodejs profile user xray
Last synced: 10 May 2026
https://github.com/stdlib-js/array-nans
Create an array filled with NaNs and having a specified length.
array complex128 complex128array complex64array data float32array float64array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types vector
Last synced: 06 Mar 2026
https://github.com/jebin1999/livestock-production-monitoring-
Livestock production Monitoring
data datascience livestock livestock-monitor r shiny shiny-apps shiny-r shinydashboard
Last synced: 05 Nov 2025
https://github.com/hoaihuongbk/lakeops
A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.
data data-operations dataengineering datalake
Last synced: 07 Mar 2026
https://github.com/novecento99/nuvolino
air cloud data ikea iot pm pm25 sensor vindstyrka
Last synced: 13 Jul 2025
https://github.com/mheadd/SamDotNet
:office: A C# wrapper for the SAM.gov API.
api business client data gov-api government
Last synced: 30 Apr 2025
https://github.com/abhaysingh71/india-censes-data-analysis
This repo is a india censes data analysis in many domains
data data-science data-visualization dataanalysis streamlit
Last synced: 15 May 2026
https://github.com/avto-dev/data-migrations-laravel
Package for database data migrations
data database laravel migrations package
Last synced: 12 Jul 2025
https://github.com/katerynazakharova/common-ml
Creating this lib for ML tasks, because I'm bored of copy-pasting the same functions for different projects.
data data-processing deep-learning lib machi
Last synced: 26 Mar 2025
https://github.com/alpheustangs/jder
A standardized structure for JSON responses
api data error json response specification structure
Last synced: 26 Mar 2025
https://github.com/raigu/ordered-lists-sync
Library for synchronizing ordered data with the minimum of insert and delete operations. Suitable for lage data sets in isolated environments
data lists ordering sync syncrhonization update
Last synced: 12 Jan 2026
https://github.com/yash22222/tsf-grip-tasks
The Sparks Foundation Data Science & Business Analytics Internship Tasks
buisness-intelligence business-analytics data data-science data-science-projects data-structures grip gripjune23 internship internship-task machine-learning projects python simple-linear-regression the-sparks-foundation tsf
Last synced: 27 Apr 2026
https://github.com/gappeah/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 25 Feb 2025
https://github.com/m-muecke/isocountry
R package containing ISO codes for countries and currencies
country-codes currency-codes data iso-3166-1 iso-4217 r r-package
Last synced: 20 Mar 2025
https://github.com/gappeah/beverage-sales-analytics
This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 25 Feb 2025
https://github.com/williamzebrowski/assistant-api
OpenAI Assistant API integrated with Elasticsearch, Logstash & Kibana
ai chatapp chatgpt conversational-ai data elasticsearch kibana llm-inference llms openai rag
Last synced: 16 Feb 2026
https://github.com/gappeah/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 11 Jun 2025
https://github.com/r-mahesh45/hr---resume-text-classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 12 Sep 2025
https://github.com/speakeasy-sdks/fivetran-python-sdk
Python SDK for accessing Fivetran API.
api connector data fivetran fivetran-connector python sdk
Last synced: 01 Jul 2025
https://github.com/hamzacham/data_set_projet-3
analysis data project rstudio visualization
Last synced: 29 Oct 2025
https://github.com/soulyma/web_crawler
A focused web crawler to extract and structure Arabic content from web pages. Designed for researchers, data analysts, and developers working on Arabic language datasets.
beautifulsoup4 crawler csv data json python structured-data
Last synced: 15 May 2026
https://github.com/wahyuwsslah/salary_prediction-aiml
Salary Prediction using Machine Learning with 3 Models. Linear Regression, Decision Tree, Random Forest
ai analytics data data-science datascience machine-learning python python3
Last synced: 19 May 2026
https://github.com/mundra-ankur/msw_ai_pipeline
Municipal solid waste (MSW) characterization, AI and Data pipeline to charcterize solid waste in real time into diffrent buckets using Yolo
artificial-intelligence data datapipeline solid-waste-segregation yolo
Last synced: 11 Apr 2025
https://github.com/benji-lewis/archivord
An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.
archive data data-mining discord discord-bot typescript
Last synced: 16 May 2026
https://github.com/erictleung/2017-new-coder-survey
:beginner: Code to help clean and format the 2017 New Coder Survey by freeCodeCamp
coder-survey data data-cleaning dplyr freecodecamp
Last synced: 03 Apr 2025
https://github.com/real-veersandhu/cia-country-comparison
Data analysis system on the CIA World Factbook
Last synced: 25 Feb 2025
https://github.com/sermetpekin/perse
Perse is an experimental Python package that combines some of the most widely-used functionalities from the powerhouse libraries Pandas, Polars, and DuckDB into a single, unified DataFrame object. The goal of Perse is to provide a streamlined and efficient interface, leveraging the strengths of these libraries to create a versatile data handling.
data data-science data-structures duckdb pandas polars
Last synced: 09 May 2026
https://github.com/rafalwrzeszcz-wrzasqpl/pl.wrzasq.commons
General-purpose data structures and routines.
aws data data-structures library rust
Last synced: 10 Apr 2025
https://github.com/luminati-io/pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 17 Mar 2025
https://github.com/simranjeet97/datascience_crashcourse
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data data-science data-science-crash-course data-structures data-visualization datascience-machinelearning datasciencecoursera datascienceproject instagram matplotlib numpy pandas telegram tutorials youtube
Last synced: 08 Apr 2026
https://github.com/sevmardi/data-mining-hacks
Hacks in Data Mining
data data-mining data-mining-algorithms python3
Last synced: 18 Jul 2025
https://github.com/amazingtest/data4test
测试数据构造生成器,you can get useful data here for software testing
data test-automation testdata testdatabuilder testing testing-tools
Last synced: 16 Jan 2026
https://github.com/qetdr/names-genders
Surnames, genders, and gender probabilities data extraction script and dataset
Last synced: 01 May 2026
https://github.com/hoangsonww/fred-banking-data-analysis
💸 AI-powered banking data explorer that combines FRED API insights with vector search, regression analysis, and interactive chat via OpenAI, Claude, and Gemini. Built with TypeScript, React, and Express for seamless full-stack performance.
anthropic chartjs claude-ai data data-analysis data-analytics data-science data-visualization fred fred-api gemini google-generative-ai logistic-regression multiple-regression openai pinecone react regression typescript vector-database
Last synced: 09 Apr 2025
https://github.com/jrdnbradford/google-sheet-color-sort
Google Sheet-bound script that assists with sorting Google Sheet rows by background fill color
data excel google-apps google-apps-script google-sheet google-sheets javascript microsoft-excel sort-rows
Last synced: 14 Apr 2025
https://github.com/benjaminr/udacity-data-engineering
Data Engineering
data dataengineering python udacity
Last synced: 14 May 2026
https://github.com/patelabhi574/hotel_reservation_analysis
Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.
data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server
Last synced: 19 Feb 2026
https://github.com/finnspartronics/orpheus
A took for looking at FRC (First Robotics Competition) scouting data
data first-robotics-competition scouting scouting-data spartronics
Last synced: 28 Mar 2025
https://github.com/nichtich/wikidata-taxonomy-examples
Extract classifications from Wikidata
coli-conc data knowledge-organization wikidata
Last synced: 12 Jul 2025
https://github.com/antoineaugusti/antennes-free
Historique des antennes relais Free Mobile en maintenance ou en panne
data free-mobile free-mobile-operator mobile-networks
Last synced: 30 Jul 2025
https://github.com/swarchal/morar
Processing phenotypic screening data
biology data data-analysis drug-discovery hts phenotypic
Last synced: 19 Jun 2025
https://github.com/stonecharioteer/renfield
Synchronize and Search through Hard Drives
catalogue data search storage synchronization
Last synced: 09 Feb 2026
https://github.com/warlock/tck
Data Type Checker
ajax browser data javascript nodejs type-checking types validation
Last synced: 19 May 2026
https://github.com/elazar/pycopyql
Exports a subset of data from a relational database.
data database export relational tool utility
Last synced: 16 May 2026
https://github.com/sandravizz/global_inequality_story
Dataviz Project about Global Inequality
data data-visualization inequality
Last synced: 03 Jul 2025
https://github.com/newrelic-experimental/newrelic-java-sap-bi
Instrumentation for SAP PI/PO Server
bi data instrumentation java newrelic nrlabs nrlabs-data nrlabs-odp observability-data sap sap-pi sap-po
Last synced: 03 Mar 2025
https://github.com/erinaldi/bmn2-lattice
Data analysis of lattice Monte Carlo simulations of quantum matrix models.
data data-science data-visualisation lattice
Last synced: 27 Mar 2025
https://github.com/kevinsames/spark-fuse
spark-fuse is an open-source toolkit for PySpark — providing utilities, connectors, and tools to fuse your data workflows together.
data databricks fabric pyspark python spark
Last synced: 08 May 2026
https://github.com/mvuorre/psyarxivdb
Datasette serving PsyArXiv preprint metadata
data datasette open-science preprints psyarxiv
Last synced: 14 May 2026
https://github.com/LisaKey/convert-csv-to-sav
We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.
analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations
Last synced: 03 Mar 2025
https://github.com/diddypod/crop-data-comparer
A Python script to compare crop data over years
comparison crop data openpyxl python
Last synced: 28 Jun 2026
https://github.com/rrwen/slides-covid19-geosocial-db
Presentation titled "A Real-time Geo-social Media Database for Large-scale Coronavirus Disease 2019 (COVID-19) Research" for my second research seminar at Ryerson University
covid covid-19 covid19 data database disease geo gis index media ncov-2019 ncov19 postgres postgresql presentation research seminar slides social virus
Last synced: 18 May 2026
https://github.com/tillahoffmann/idxhound
🐶 Track indices across one or more numpy selections.
data numpy scientific-computing
Last synced: 14 May 2026
https://github.com/mmaithani/singapore-residents-data-eda
The data contains Population by ethnicity, age and gender for the country of Singapore from the year 1957 to 2018
data data-visualization ethnicity kaggle-dataset python singapore singapore-residents-data
Last synced: 16 Apr 2026
https://github.com/luminati-io/crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 17 Mar 2025
https://github.com/thomd/git-scrape-hacker-news
scrape hacker news metadata for data analysis
data data-science git-scraping hacker-news
Last synced: 16 Sep 2025
https://github.com/margostino/job-pulse
PoC to analyse the hiring market
data golang mongodb visualization
Last synced: 16 May 2026
https://github.com/stdlib-js/array-base-any-by-right
Test whether at least one element in an array passes a test implemented by a predicate function, while iterating from right to left.
any array data generic javascript node node-js nodejs predicate some stdlib structure test types validate
Last synced: 14 Apr 2025
https://github.com/lmuffato/project-mysql-one-for-all-trybe
Projeto mysql one for all - Projeto avaliativo da Trybe do Bloco 21: Normalização e Modelagem de Banco de Dados
back-end data database database-modeling mysql mysqlworkbench query sql trybe-projects
Last synced: 08 May 2026
https://github.com/stdlib-js/ndarray-slice-assign
Assign element values from a broadcasted input ndarray to corresponding elements in an output ndarray view.
assign assignment copy data javascript matrix ndarray node node-js nodejs set setitem slice stdlib structure types vector view
Last synced: 11 Apr 2025
https://github.com/jonsafari/toy-data
Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks
data language-data machine-translation nlp sanity-checks toy-data
Last synced: 06 Nov 2025
https://github.com/stdlib-js/ndarray-base-reverse-dimension
Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view
Last synced: 07 Mar 2026
https://github.com/wreedb/tree-sitter-god
A tree-sitter grammar for God
data data-serialization file-format tree-sitter tree-sitter-grammar
Last synced: 16 May 2026
https://github.com/epogrebnyak/business-conditions-digest-2017
Replicate illustration from Business Conditions Digest
Last synced: 22 Mar 2025
https://github.com/vishwagauravin/screener-scraper-pro
Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.
data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks
Last synced: 18 Feb 2026
https://github.com/danieljdufour/fast-bin
Quickly Convert an Array of Numbers into their Minimal Binary Representations
array binarize binary bits data nbits numbers unbinarize
Last synced: 13 Apr 2025
https://github.com/michellepellon/jobx
A modern, powerful job scraper for LinkedIn, Indeed and beyond.
compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper
Last synced: 17 Jan 2026
https://github.com/MikeBairdRocks/Fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 02 Apr 2025
https://github.com/discindo/natochak
Analysis of bicycle accidents in Macedonia using Rmarkdown and ggplot2
Last synced: 19 Feb 2026
https://github.com/cobluestars/dataherd-raika
"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."
big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience
Last synced: 02 Jan 2026
https://github.com/ajitharunai/covid-tracker-using-python
Covid-Tracker-Using-Python
data datavisualization python python3 pythonapplications
Last synced: 25 Jun 2025
https://github.com/danieljdufour/easy-file-saver
Very Easily Save a File
csv data download file file-saver javascript js json save
Last synced: 21 Apr 2026
https://github.com/ayushverma135/accenture-data-analytics-and-visualization
This program provided practical experience in advising a hypothetical social media client as a Data Analyst at Accenture. The simulation involved cleaning, modeling, and analyzing multiple datasets, culminating in the creation of a PowerPoint deck and video presentation to communicate key insights.
accenture analytics data data-visualization forage presentation
Last synced: 19 Sep 2025
https://github.com/2kabhishek/pokemon-stats
Gotta stat 'em all 🖲🐭
d3 data emoji pokemon rollup statistics
Last synced: 14 May 2026
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026