data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/macsual/dotgov-jamaica-domains
A listing of .gov.jm domains.
Last synced: 03 Jan 2026
https://github.com/financejs/discord-bot
A Discord Bot Used In Financejs Discord Server
data discord discord-bot discordjs-bot finance financejs financial
Last synced: 13 Apr 2026
https://github.com/jongirard/unique_names_generator
A Unique Names Generator built in Elixir
data data-generator elixir elixir-lang fake-data name-generator phoenix seed
Last synced: 21 Oct 2025
https://github.com/instaclustr/cassandra-parquet-transformer
Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar
analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation
Last synced: 29 Aug 2025
https://github.com/mmaithani/loan-approvel-ml-model-with-insights
This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided
data data-science loan-prediction-analysis machine-learning visualization
Last synced: 16 Aug 2025
https://github.com/squareslab/probabilisticmodel_saner2018
Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018
code data mausotog published replication
Last synced: 26 Oct 2025
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/bastgau/snow-revoke-privileges
Script designed to simplify the management of permissions in your Snowflake databases.
data database dba dev-container python snowflake
Last synced: 20 Apr 2025
https://github.com/everythings-gonna-be-alright/amazing-clickhouse-connector
Quick recording of analytics data
analytics clickhouse data k8s kubernetes
Last synced: 04 Jan 2026
https://github.com/natylaza89/covid19-il
Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. β 19K+ Downloadsβ
api covid covid19 covid19-data data israel pandas python
Last synced: 13 Apr 2026
https://github.com/luminati-io/Pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 09 Apr 2025
https://github.com/pommes-public/pommesdata
A full-featured transparent data preparation routine from raw data to POMMES model inputs
data opensource power raw-data transparent
Last synced: 07 Oct 2025
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/junkwaxhero/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 24 Apr 2025
https://github.com/stdlib-js/array-typed-float-ctors
Floating-point typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 24 Apr 2025
https://github.com/programmer-rd-ai/open-images-v6
Open-Images-V6
ai data dataset dl images ml object-detection open open-images programming python v6
Last synced: 03 Aug 2025
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/thyringer/cast
CLI tool for reading strings or complex data sets from CSV files to output them in other text formats.
csv-converter data data-preprocessing python python3 sql-builder
Last synced: 02 Feb 2026
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
π¦π΅End-to-end ChatGPT Users' Tweets Data Pipeline with Python π, Hive π, and Power BI π
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/andrew-johnson-4/misspeller
Take correctly spelled words and return common spelling mistakes
common-mistakes data language natural nlp processing rust
Last synced: 30 Apr 2025
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/mmabiaa/data-structure-and-algorithms-java
Data structures and algorithms in java
algorithms algorithms-and-data-structures data data-structure-and-algorithm data-structures data-structures-algorithms data-structures-and-algorithms datastructures dsa dsa-learning-series dsa-practice java
Last synced: 09 Apr 2026
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/espoirmur/balobi_nini
An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.
Last synced: 24 Aug 2025
https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python
This Python project is made by me, Python project for improving python skills.
card data data-generator employee python
Last synced: 03 Feb 2026
https://github.com/yashmistry-24/ytcomment-iq
YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.
analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube
Last synced: 15 Feb 2026
https://github.com/Nazaniiin/EDA_QualityofRedWine
:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.
charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization
Last synced: 30 Jul 2025
https://github.com/zalweny26/tools
Just a bunch of tools made in TypeScript.
algorithms data dimensionality distances helpers reduction sortings structures tools utils
Last synced: 03 Feb 2026
https://github.com/hasnocool/war_thunder_camouflage_scraper
A concurrent web scraper designed to collect camouflage information from war thunder aircrafts.
asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web
Last synced: 04 Jan 2026
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI statesβloading, error, empty, and normalβallowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chaseβs publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/mollybeach/cherryether
CherryEther: Typescript Staking Deposits Ethereum Transactions
blockchain data data-science ethereum typescripts
Last synced: 21 May 2026
https://github.com/imranhsayed/programming-in-c
Programming in C
array c c-programming circular-linked-list cprogramming data data-structures-and-algorithms file-handling linked-list pointers
Last synced: 28 Jan 2026
https://github.com/karashiiro/lodestone-id-time
Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.
data ffxiv ffxiv-character lodestone
Last synced: 30 Jun 2025
https://github.com/doctorlai/hex-viewer
Simple File Viewer in HEX
application data files hacktoberfest hex-viewer hexeditor hexidecimal web-app
Last synced: 09 Oct 2025
https://github.com/askaniy/celestialocationsmaker
Tool for making Celestia location files
celestia data geology locations mapping planetary-science space
Last synced: 14 Mar 2025
https://github.com/xtrendence/comp2001-coursework
Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.
api asp-net csharp data dataset linked-data php restful restful-api
Last synced: 13 Apr 2026
https://github.com/steelcake/cherry-pipelines
A collection of pipelines built with cherry
blockchain clickhouse data pipeline pyhton
Last synced: 09 Mar 2026
https://github.com/stdlib-js/utils-compact-adjacency-matrix
Compact adjacency matrix.
adjacency dag data data-structure data-structures graph javascript matrix node node-js nodejs stdlib structure topological toposort tsort util utilities utility utils
Last synced: 15 Apr 2026
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/mawburn/across-a-thousand-dead-worlds-data
Across a Thousand Dead Worlds Data
Last synced: 21 Apr 2026
https://github.com/sadcenter/messenger
Data messaging system between servers using popular messaging brokers
Last synced: 06 Aug 2025
https://github.com/farovictor/mongodbextractor
This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.
Last synced: 14 Jan 2026
https://github.com/ium101/files-and-folders-lister-z
Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.
application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows
Last synced: 09 Oct 2025
https://github.com/lxcoding06/e-gereja
Website CRUD untuk Gereja, untuk mengatur data jemaat, data kematian, data pernikahan dan data baptis
data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan
Last synced: 15 May 2025
https://github.com/bkamapantula/india-pc-nfhs4
Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.
data government health india nfhs nfhs4
Last synced: 19 Mar 2026
https://github.com/dantesc03/uberpool-case-study
This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.
analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization
Last synced: 16 Apr 2026
https://github.com/mo-karbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 02 Aug 2025
https://github.com/nrennie/londonmarathon
R package containing data relating to London Marathon.
Last synced: 02 Apr 2025
https://github.com/j1sk1ss/dateapppc.exmpl
ΠΡΠΎΡΡΠΎΠ΅ Π½Π°ΡΠΈΠ²Π½ΠΎΠ΅ ΠΏΡΠΈΠ»ΠΎΠΆΠ΅Π½ΠΈΠ΅ Π΄Π»Ρ Windows Ρ Π΄Π΅ΠΌΠΎΠ½ΡΡΡΠ°ΡΠΈΠ΅ΠΉ ΠΠΠ ΠΈ SQL Π±Π°Π· Π΄Π°Π½Π½ΡΡ Π½Π° ΠΏΡΠΈΠΌΠ΅ΡΠ΅ ΠΏΡΠΈΠ»ΠΎΠΆΠ΅Π½ΠΈΡ Π΄Π»Ρ Π·Π½Π°ΠΊΠΎΠΌΡΡΠ².
data oop-principles parsing pgadmin4 sql wpf
Last synced: 11 Apr 2026
https://github.com/guslovesmath/top_tech_sp_500_forecasting
Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's Percent change.
arima-forecasting arima-model data data-science forecasting vector-autoregression
Last synced: 14 Mar 2025
https://github.com/planarnetwork/feeds.planar.network
GTFS feeds for bus, train and plane
data feeds gtfs transit transportation
Last synced: 11 Feb 2026
https://github.com/nixhantb/data-structures-and-algorithms-in-java-
Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting
algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming
Last synced: 05 Jul 2025
https://github.com/thomas-nyanumba/r-programming-air-pollution_disease-project
Personal R Programming Project
aggregate-functions boxplot-visualization data dpylr ggplot2 leftjoin linear-regression patchwork powerquery r readxl scatter-plot tidyr visualization
Last synced: 25 Mar 2025
https://github.com/jimut123/scrapers
All Scrapers that I'll build
bs4 data python3 real-time-visualisations scrapers scrapy wget
Last synced: 16 Jan 2026
https://github.com/mujadded/facebook_scrapper
The fcebook scrapper gem that dont need the api
data data-mining facebook ruby-gem scrapper selenium-webdriver
Last synced: 28 Oct 2025
https://github.com/keosariel/nairagazer-clustered-news
Providing clustered News data specifically Nigeria news. In hindsight this repo contain nigeria news and it's coverage. Data is from Nairagazer
ai data data-science news nigeria nigerian-data python
Last synced: 30 Aug 2025
https://github.com/stdlib-js/array-ones-like
Create an array filled with ones and having the same length and data type as a provided array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 05 Jan 2026
https://github.com/uk-ipop/open-data-pipeline
A pipeline for processing, enhancing, and sharing open datasets.
actions automation data python
Last synced: 25 May 2026
https://github.com/ryanmorr/typed
Statically typed properties for object literals
data javascript object properties statically-typed
Last synced: 12 Jun 2026
https://github.com/ingmarboeschen/jatsdecoderevaluation
Evaluation data and code
Last synced: 04 Feb 2026
https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network
Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 10 Oct 2025
https://github.com/rikvdh/zabuffer
Zero-Allocation buffer handling in C
buffer c clib data embedded memory string zero-allocation
Last synced: 03 Mar 2025
https://github.com/nononoexe/setariaviridis
πΎ Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/xxczaki/parsify-plugin-covid19
Parsify plugin, that adds COVID 19-related variables π¦
confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables
Last synced: 13 Mar 2026
https://github.com/kvstore-io/sdk-java
api data java sdk sdk-java serverless storage
Last synced: 14 Jan 2026
https://github.com/erwan-simon/aws-data-platform-framework
A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.
aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module
Last synced: 23 May 2026
https://github.com/joaocarmo/react-very-simple-data-table
When all you want is a table
Last synced: 06 Mar 2025
https://github.com/purarue/listenbrainz_export
Export your scrobbling history from ListenBrainz
data data-export music scrobbling
Last synced: 24 Jan 2026
https://github.com/yashika-malhotra/cardioflex-treadmill-analysis-using-descriptive-statistics-probability
Description Analysis and Visualization on CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.
colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/marcuwynu23/phaddress
Data API of Regions,Provinces, CityMunicipalities, and Barangay of the Philippines
address address-data-api api barangay city data geolocation municipalities provinces
Last synced: 14 Feb 2026
https://github.com/fabriciopsouza/covid-19-demographic-social-dataset
A social demographic dataset for analysis of the COVID-19 pandemic.
alteryx coronavirus coronavirus-analysis coronavirus-dataset covid-19 covid19 covid19-data data data-science dataset enrichment-analysis timeseries timeseries-analysis timeseries-clustering timeseries-covid-19 timeseries-database timeseries-segmentation timeseriesclassification
Last synced: 31 May 2026
https://github.com/automators-com/datamaker-js
The official Node.js / Typescript library for the DataMaker API
data javascript nodejs typescript
Last synced: 11 Oct 2025
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/deepwaterpaladin/statscanpy
Basic package for querying & downloading StatsCan data by table name.
Last synced: 16 Jan 2026
https://github.com/vikashpr/18cse301j_ra2011003010737
This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.
css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization
Last synced: 03 May 2026
https://github.com/kocyigitkim/realtime.io
Real time data streaming & socket programming library
data realtime socket streaming
Last synced: 29 Jul 2025
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/simoneas02/data-science
π A planning study to become a data scientist and to improve my current skills. π€πΌπ»
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003
US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 12 Oct 2025
https://github.com/bradlindblad/quotableoffice
Repo for the quotable office R Shiny app
data datascience golem-apps r shiny shiny-apps text text-mining
Last synced: 26 May 2026
https://github.com/frnt-end/weather-app-react
:atom_symbol: React project - Fetch and Toggle display of current weather in Berlin, Paris, New York & London (tabs) - using axios for API fetch. Watch DEMO π https://Frnt-End.github.io/Weather-App-React π
api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp
Last synced: 18 Feb 2026
https://github.com/ngambip/diabetes_factors_2024
Exploring BMI Categories and Health Factors.
dashboards data datacleaning dax-languague powerbi sql sqlstudio tsql visualization
Last synced: 03 Mar 2026
https://github.com/weisscharlesj/data_scicompforchem
Zipped data for SciCompforChem book for easy download
chemistry chemistry-education data data-visualization python
Last synced: 07 Nov 2025
https://github.com/cgossain/genericresultscontroller
A generic NSFetchedResultsController replacement for iOS, written in Swift.
api client connector controller coredata data database fetch firebase firebase-firestore firebase-realtime-database generic ios mongodb nsfetchedresultscontroller results source swift-generics tableview ui
Last synced: 19 Feb 2026
https://github.com/yetnt/ump
These utils are useless
area data distance factorization factors gcd-calculator javascript math mean median mode numbers pattern prime range rate ratio temprature temprature-converter volume
Last synced: 03 Feb 2026