data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/qrailibs/dataflow
✨ Data processing in Node.js made multithreaded and type-safe.
data dataprocessing multithread node
Last synced: 04 May 2026
https://github.com/maxwelllzh/gis-tutorial-
Tutorials for Columbia University GIS Club
Last synced: 04 May 2026
https://github.com/dimitryzub/russo-ukraine-war-prediction-losses
Highlights rusian losses with predictions based on historic data from Ministry Defence of Ukraine 🐱👤
data dataanalysis dataanalytics matplotlib pandas prophet python
Last synced: 04 May 2026
https://github.com/sjg/my-search-story
My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI
data docker generative-ai google-cloud-platform google-cloud-run nodejs
Last synced: 04 May 2026
https://github.com/a-poor/datatransform.jl
A package for defining (and performing) tabular-data transformations with JSON.
data data-science data-transformation etl feature-engineering json julia julia-package tabular-data
Last synced: 05 May 2026
https://github.com/edjoukou/pizza-sales-report
A data analysis project using SQL with MySQL database
analysis data mysql powerbi visualization
Last synced: 05 May 2026
https://github.com/muthupillai1204/diwali_sales_analysis
The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.
data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization
Last synced: 05 May 2026
https://github.com/sohomm/predict-insurance-charges
A predictive model to estimate the insurance charges based on a client's attributes, such as age and health factors. It offers a practical application of ml in business, enabling more accurate pricing models and helping companies manage risk while delivering personalized pricing strategies to clients.
administration algorithm bot data decision-trees download easy finance github java machine-learning management model neural-network nlp prediction project science trading university
Last synced: 05 May 2026
https://github.com/shibbbbs/fastapi_project
A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs
data dataanalysis fastapi pandas python
Last synced: 06 May 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/ralzz/dibimbing_datascience
This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.
data data-science eda google-colab kaggle pandas python
Last synced: 06 May 2026
https://github.com/shantanujpk/bigdatacloud
Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.
big-data data jupyter-notebook pipeline pyspark python spark sparksql
Last synced: 07 May 2026
https://github.com/lab5e/loadabledata
Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.
async data loadable state typescript ui
Last synced: 07 May 2026
https://github.com/hudson-newey/data-miner
A simple data miner that collects information from an API and stores it in a file
api api-client big-data bigdata data logger logging
Last synced: 10 Jun 2026
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/zsvoboda/olympics
Self service analytics of 120 years of Olympics data
analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports
Last synced: 08 May 2026
https://github.com/writetome51/public-data-container-interface
Just a TypeScript interface with 1 property: 'data'
container data interface typescript
Last synced: 15 May 2026
https://github.com/chompfoods/sdk-typescript-angular
Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 09 May 2026
https://github.com/tupizz/python-data-manipulation
Data manipulation and visualization with Python 2.x
Last synced: 09 May 2026
https://github.com/caiorss/julia-box-docker
Docker that provides a development environment for Julia language, Octave, Python, R (Rlang) with a Jupyter Notebook; Jupyter QtConsole and so on.
data datascience deveops docker julia jupyter octave python rlang scientific
Last synced: 09 May 2026
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/beeracs/llama
Run Llama models in your web browser using JavaScript and WebAssembly. Explore light and dark modes easily. 🌐🐱👤
ai data fine-tuning framework gpt langchain large-language-models llama3 llamaindex llm lora machine-learning nlp peft qlora qwen rlhf vllm
Last synced: 10 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis
In this project, I will be implementing Principal Component Analysis (PCA) from scratch on an ecological footprint consummation database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.
data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis
Last synced: 10 May 2026
https://github.com/alimghmi/bdlc
Bloomberg API integration, handling data requests, processing, and SQL database insertion.
api-client bloomberg data data-processing financial-data oauth2 python sql-database transformation
Last synced: 10 Jun 2026
https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series
Consultas SQL para el análisis de la popularidad de series y géneros series de la base de datos NetflixDB.
business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql
Last synced: 12 May 2026
https://github.com/miniql/notebook-example
An example of MiniQL in a JavaScript Notebook
comma-separated-values csv data data-analysis data-science graphql javascript notebook query query-language
Last synced: 13 May 2026
https://github.com/m0nica/datalogues-refresh
:bar_chart: Programming blog focused on data with an emphasis on exploration in Python.
data jekyll python technical-writing
Last synced: 14 May 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building Data Warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/shashwat9kumar/trends_in_a_country_on_twitter
Finding trending topics in each country on twitter and visualizing them in a WordCloud
data data-visualization trends tweepy twitter-api wordcloud
Last synced: 13 Jun 2026
https://github.com/word2vect/beijing-pm2.5-data-process
Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2
Last synced: 15 Jun 2026
https://github.com/ayushman0511/data-analytics-project1
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics busine data data-anal data-enginee data-sci data-scien database datascien query reporting sql sql-query sql-server window-func
Last synced: 17 Jun 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/dushansenadheera/web_scraper
web scraper using Python along with BeautifulSoup and Selenium
beautifulsoup data python selenium web-scraping
Last synced: 19 Jun 2026
https://github.com/rylan12/apscores
A quick way to visualize how the AP score distributions have changed from year to year.
advanced-placement analysis ap-exam data scores
Last synced: 19 Jun 2026
https://github.com/supunlakmal/coronavirus-covid-19-status
Covid 19 cases and death count for each country in a json file.
coronavirus count country covid-19 covid-data covid19 data data-science data-visualization geographical geographical-information-system json
Last synced: 21 Jun 2026
https://github.com/anburocky3/cbse-schools-data
Fetch CBSE Schools in seconds and use it for your data projects
cbse data data-analysis data-science grabber nextjs
Last synced: 24 Jun 2026
https://github.com/europanite/gundam-forest
Random Forest Data Analysis of Kill In Action rate for every personnel in GUNDAM world, like in Titanic.
data data-analysis data-science data-visualization death-rate death-rates gundam-model gundam-series gundom-forest jupyter jupyter-notebook kia kill-in-action python random-forest titanic titanic-kaggle titanic-survival titanic-survival-prediction
Last synced: 29 Jun 2026
https://github.com/RedInfinityPro/ScientificSharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 30 Sep 2025
https://github.com/ramonrsv/f1_data
Provides consolidated access to various sources of Formula 1 information and data, including event schedules, session results, timing and telemetry data, as well as historical information about drivers, constructors, circuits, etc.
Last synced: 07 Apr 2026
https://github.com/yvandana/pwc-power-bi-job-simulation
Projects pursued during my Job Simulation
dashboard data dataanalysis powerbi pwc-forage-switzerland
Last synced: 06 Mar 2026
https://github.com/lorinczakos/sql-projects
This is a collection of my SQL scripts that I wrote and were approved through my course with GoIT Romania Data Analyst course
bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode
Last synced: 16 May 2026
https://github.com/the-tech-idea/beep.winform.sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 08 Jul 2025
https://github.com/dolanmiu/mclaren-task
A front end assessment task for Mclaren
angular data observable observables rxjs
Last synced: 16 May 2026
https://github.com/ashishsingh789/hr_analysis_dashboard
The HR Analyst Dashboard is an interactive Power BI tool that provides insights into HR metrics sourced from Excel. It focuses on data cleaning, transformation, and visualization, enabling stakeholders to explore key indicators like employee demographics and performance through intuitive charts.
dashboard data dataanalysis datacleaning powerbi-desktop visualization
Last synced: 06 Mar 2026
https://github.com/dhi13man/rca_ace
RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).
analytics data enterprise open-source python python3 rca
Last synced: 10 Sep 2025
https://github.com/katahiromz/comp_decomp
data compressor/decompressor
bzip2 compress compressor cxx data decompress decompressor lzma uncompress zlib
Last synced: 10 Jul 2025
https://github.com/skygenesisenterprise/api-service
The Official Sky Genesis Enterprise API Service Ecosystem
api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket
Last synced: 31 Dec 2025
https://github.com/shubhamsoni98/survey-data-analysis
Surey Data Analysis
analysis dashboards data data-mining data-visualization dataanalysis datacleaning datascience datasets insights pivot-tables pivotanalysis
Last synced: 07 Mar 2026
https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc
Virtual Work Experience in Data Analysis at STC
analysis data data-visualization misk stc
Last synced: 20 Jun 2025
https://github.com/halyusa16/basic-sql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 16 May 2026
https://github.com/denisecase/cintel-04-reactive
Interactive analytics, reactive app built with Shiny for Python
analytics bokeh data flights interactive mtcars penguins python relationships shiny
Last synced: 20 Jun 2025
https://github.com/bho0920/crime-data-analysis-eu
Crime Data Analysis for Self-Defense Tool Market Entry in the EU.
data data-analysis sql sqlite tableau
Last synced: 21 Jun 2025
https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java
Hash Table is a data structure which stores data in an associative manner. In a hash table, data is stored in an array format, where each data value has its own unique index value. Access of data becomes very fast if we know the index of the desired data. Thus, it becomes a data structure in which insertion and search operations are very fast irrespective of the size of the data. Hash Table uses an array as a storage medium and uses hash technique to generate an index where an element is to be inserted or is to be located from.
arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra
Last synced: 12 Apr 2025
https://github.com/webobite/fact-chatbot
A Fact chatbot is a project in which it read a txt file which consist all facts ahead of time and answer the user with some useful information regarding the same on the basis of facts provided in text file.
chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp
Last synced: 04 May 2026
https://github.com/canadaluke888/speedtable
Ultra-fast terminal table renderer written in C
c data datasets fast python python-wrapper python3 tables
Last synced: 01 Mar 2026
https://github.com/sakan811/gachascope
Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.
data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves
Last synced: 04 May 2026
https://github.com/austinv11/pypeline
A simple data pipeline builder for Python 3+
data leveldb pypeline python python3 stream-processing
Last synced: 20 Aug 2025
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/mradkov/secure-data-exchange
Elliptic Curve Diffie-Hellman secure data exchange via smart contracts on Aeternity blockchain
aeternity data exchange key-exchange smart-contracts sophia
Last synced: 22 Jun 2025
https://github.com/sibeux/redesigned-broccoli
Repositori untuk menyimpan data file musik
data data-center nasrulwahabi sibeux
Last synced: 24 Jan 2026
https://github.com/uttori/uttori-data-tools
Tools for working with binary data.
Last synced: 17 Feb 2026
https://github.com/aliaksandr-master/unipipeline
simple way to build the declarative and destributed data pipelines with python
Last synced: 11 Jul 2025
https://github.com/thetacom/byteclasses
A Python package to manage and interact with binary data in a simple and structured manner.
binary-data bytes data dataclasses package python python3
Last synced: 11 Jul 2025
https://github.com/jensostertag-archive/charts.js
A JavaScript Plugin to draw Charts to visualize Data and Statistics on Websites
charts data javascript statistics webapplication
Last synced: 22 Jun 2025
https://github.com/fintech-lsi/fintech-credit-risk-prediction
This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.
data fintech machine-learning model prediction risk
Last synced: 12 Oct 2025
https://github.com/1sumer/mass-mail-automation
Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.
data oops-in-python python smtp-server tkinter
Last synced: 20 Aug 2025
https://github.com/nanis/unitedat
Unify data sets which consist of separate files with a common header repeated in each one.
Last synced: 12 Apr 2025
https://github.com/uzinfocom-org/archive
📦 | Archived projects that aren't used anymore
archive archive-data data notused
Last synced: 01 Sep 2025
https://github.com/allanotieno254/spss-nutrition-research
This repository contains the results of statistical analyses performed in IBM SPSS Statistics on a child nutrition dataset.
data data-preprocessing dataanalysis spss
Last synced: 17 Feb 2026
https://github.com/ppmim/papi4k_old2
PAPI: the PANIC data reduction pipeline
data near-infrared pipeline processing
Last synced: 23 Jun 2025
https://github.com/gsmithun4/expressjs-field-validator
Plugin for validating JSON request, middleware for expressjs
data express-js expressjs json-request middleware nodejs request rest-api validation
Last synced: 06 Mar 2026
https://github.com/darshjasani/claims-analysis
This repository contains a comprehensive analysis of claims data, detailing the workflow from data preprocessing to model evaluation. The goal of this analysis is to build predictive models to improve claims prediction and management.
analysis data linear machine-learning python
Last synced: 16 May 2026
https://github.com/mvuorre/osfdatasette
Harvest, wrangle, and serve preprint data from OSF API with Datasette
data datasette open-science preprints
Last synced: 11 Apr 2025
https://github.com/sap-samples/sap-bdc-explore-hyperscaler-data
The repository contains detailed steps to integrate external hyperscaler data sources to SAP Datasphere in the SAP Business Data Cloud per the Open data ecosystem integration principles .
aws azure business cloud data databricks datasphere gcp hyperscalers sap
Last synced: 16 May 2026
https://github.com/dimaa1608/azurecontent
AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.
azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization
Last synced: 10 Apr 2025
https://github.com/ournet/news-data
Ournet news data package
data news news-data news-storage ournet storage
Last synced: 04 Apr 2025
https://github.com/ournet/quotes-data
Ournet quotes data package
data ournet ournet-quotes quotes
Last synced: 04 Apr 2025
https://github.com/stdlib-js/array-base-banded-filled2d-by
Create a filled two-dimensional banded nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 19 May 2026
https://github.com/cmdrvl/profile
profile manages column-scoping configurations for report tools — defining which columns to include, key alignment, and normalization rules for rvl, compare, and shape.
cli configuration csv data data-quality open-source ops rust tooling
Last synced: 07 Mar 2026
https://github.com/priyanshubiswas-tech/farmlab-report-and-case-study-iot
This project was developed through live interviews and case studies with farmers in the year 2023 to address key agricultural challenges. The device provides real-time farm insights for better decision-making. Future plans include a digital portal, increased range, more sensors, and improved design. Open to collaboration!
arduino-ide c case case-study data data-analysis iot iot-device serialization
Last synced: 15 Jul 2025
https://github.com/germanpaul12/flights-data-sky-scraper-api
Sky Scraper - Python app for searching flight information using the Sky Scrapper API.
data flights flights-api scraping
Last synced: 15 Jul 2025
https://github.com/null-none/py-fear-and-greed
Fear & Greed Index
data fear-and-greed python trading
Last synced: 16 Jul 2025
https://github.com/birjemin/wxgameod
wxgame 开放数据 weixin 微信小游戏 关系链数据
data interactive-data relation user-storage
Last synced: 16 Jul 2025
https://github.com/shoaib1522/database-systems
📚💾 Master the fundamentals of database systems with this all-in-one lab repository, featuring ERD design diagrams 🧠🗺️, Oracle SQL 🌐📝, relational schema practice, and complete PowerPoint lectures 🖥️📑. Perfect for revision, exams, or quick reference! 💡📘
data database database-management databases databases-course db dbms-project erd notes oracle oracle-database sql
Last synced: 21 Aug 2025
https://github.com/istinnew/cook-me-up
[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.
bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning
Last synced: 16 May 2026
https://github.com/qubitpi/wiktionary-data
Wiktionary data in simple parsable formats hosted on 🤗 Datasets
ancient-greek data german huggingface huggingface-datasets language latin natural-language-processing nlp old-persian python wiktionary wiktionary-data
Last synced: 17 Jul 2025