data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/pcpp94/elexon_pipeline_gb_demand
Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.
data electricity elexon gb octopusenergy power powerdata pypsa uk
Last synced: 12 Jul 2025
https://github.com/phtrempe/l2a
A small project showing an example of applied machine learning in Python 3: it uses the Keras library with its TensorFlow backend to train a neural network model to add two integers.
applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/devbigboy/iti-database
This course covers the following topics: joins, normalization, aggregate functions, GROUP BY, ORDER BY, SELECT, ranking functions, and built-in functions.
analytics data data-analytics mssql-database sql sql-server
Last synced: 03 Nov 2025
https://github.com/gabboraron/datacamp_projects
Here you can find my DataCamp Projects
data datacamp datacamp-projects
Last synced: 14 Jun 2026
https://github.com/wciesialka/top-names
A Python module for scraping the list of top first names in the United States.
Last synced: 08 Jun 2026
https://github.com/fridex/real-estate
My machine learning in real estate
data machine-learning real-estate
Last synced: 27 Jun 2025
https://github.com/kiing-dom/data-structures-algorithms
data structures and algorithms
algorithms-and-data-structures data data-structures java leetcode
Last synced: 09 Aug 2025
https://github.com/echang1802/normandy
Normandy is a Python framework for data pipelines whose main objective is to standardize your team's code and provide a data treatment methodology flexible to your team's needs.
analytics business-intelligence data dataengineering datascience etl pipeline
Last synced: 11 Mar 2026
https://github.com/colour-science/colour-streamlit-tm-30-18
Generates the "ANSI/IES TM-30-18 Colour Rendition Report" using Colour and Streamlit
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets streamlit
Last synced: 29 Jun 2026
https://github.com/radekbednarik/att
Python wrapper for calling Apitalks API.
api-wrapper apitalks data python3 rest-api wrapper
Last synced: 05 Apr 2025
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/iliyasalve/cyclistic_case_study
Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"
bike-sharing data data-analysis data-visualisation r
Last synced: 06 Apr 2025
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/yvandana/brain-tumor-detection-and-classification
Bachelor's Major Project - Presented at ICMISC 2022
2d-cnn brain-tumor-classification brain-tumor-detection cnn-model data data-augmentation keras-tensorflow sklearn-metrics
Last synced: 16 Jun 2025
https://github.com/himanshub16/lekhpal
Monitor and catalog Twitter feed matching your desired keywords
analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api
Last synced: 14 May 2026
https://github.com/axafrance/azureml-to-openshift-talk
Scale your AI development: from dev on AzureML to prod on OpenShift in one click
ai axa azureml data learn ml openshift raise-the-bar talk
Last synced: 16 Feb 2026
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of projects completed using Python and SQL.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/shamaz332/ecomrace-data-analysis-in-datascience
data data-science matplotlib pandas
Last synced: 15 May 2026
https://github.com/jmcph4/rpdb
rpdb
automation data database dataset db real-estate rpdata sql
Last synced: 12 Apr 2025
https://github.com/peternaydenov/data-pool
Data layer for node apps and single page applications
Last synced: 29 Apr 2025
https://github.com/4ment/aiv-rate-heterogeneity
Avian influenza virus data sets
Last synced: 24 Jan 2026
https://github.com/lisakey/lisakey
I am passionate about Python and SQL for data analysis, and I actively develop projects in these languages.
analysis analyst data dataanalysis dataanalyst java python sql
Last synced: 02 May 2026
https://github.com/vojtech-dobes/php-conformance
constraint data input normalization php sanitization schema validation
Last synced: 23 Jul 2025
https://github.com/purarue/blizzard_gdpr_parser
Parses date-related information from my Blizzard GDPR export.
blizzard data gdpr webscraping
Last synced: 06 Apr 2025
https://github.com/nxank4/an-augment
A Python library for advanced and novel data augmentation, combining traditional techniques like cropping and blurring with state-of-the-art generative AI methods such as style transfer, image inpainting, and latent space interpolation. It boosts data diversity for robust machine learning applications.
computer-vision data data-augmentation data-augmentation-strategies data-augmentation-techniques generative-ai image image-processing synthetic-data
Last synced: 10 Mar 2026
https://github.com/renebentes/2808
Course 2808 - Entity Framework Fundamentals
Last synced: 27 Jun 2025
https://github.com/lakshyakumar266/jee-dpp-manager-app
DPP manager app for JEE preparing Students
data expo javascript management react-native
Last synced: 07 May 2026
https://github.com/mai-space/design-concept-sharing-recipes
Concept for a framework based on state-of-the-art technology and libraries for secure data sharing and online collaboration, with a focus on the UX and UI of said framework
concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories
Last synced: 14 May 2025
https://github.com/stuffbymax/game-dependencies-db
data database game games-list json mit-license
Last synced: 15 May 2026
https://github.com/jph5396/sumomodel
Data models related to sumo wrestling.
Last synced: 17 Jan 2026
https://github.com/nouraalgohary/data-scientist-with-python
This repo comprises my solutions for the tasks assigned in the course.
data data-science data-visualization datacamp datacamp-course datacamp-data-science datacamp-exercises datacamp-solutions-python datascience python
Last synced: 15 Jun 2025
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED - A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/zeh237/superstore-data-analytics
A Flask-based data analytics project built on the Superstore dataset using Flask, pandas, SQL, and Python.
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/mapi-developer/dapo
Simple, zero-dependency tabular data manipulation and analysis for Python.
Last synced: 06 Mar 2026
https://github.com/piazzai/chess-variants
Analysis of Lichess variant games
analysis chess chess-variant chess-variants data data-mining data-science data-visualization lichess lichess-database logistic-regression logit-model pgn r r-code r-scripts regression regression-analysis shell shell-scripting
Last synced: 15 May 2026
https://github.com/miss-mhv/data-analysis-for-social-buzz
In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.
Last synced: 14 May 2026
https://github.com/canadaluke888/terminaltablebuilder
Build and edit tabular data all from the terminal.
cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables
Last synced: 20 Apr 2026
https://github.com/heitang/fcu-courseapi
Feng Chia University: usage guide for the course search system API
Last synced: 27 Jul 2025
https://github.com/ssiarhei115/cv-dbase-analysis
Analysis of a HeadHunter CV database
analysis cv data data-science resume
Last synced: 09 Apr 2025
https://github.com/rrwen/poster-gisci-osmol
Conference poster and short paper titled "Outlier Detection in OpenStreetMap Data using the RandomForest Algorithm and Variable Contributions" for the GIScience Conference in 2016
2016 algorithm conference contribution data detection forest gis giscience learn machine open openstreetmap osm outlier paper poster random short variable
Last synced: 03 Apr 2025
https://github.com/parmsam/rweekly.data
R package containing data on Rweekly posts
Last synced: 21 May 2026
https://github.com/rrwen/geohoods-to
Geospatial dataset of 1000+ aggregated variables for neighbourhoods in Toronto, ON, CA
csv data dataset geo geojson gis neighborhood neighborhoods neighbourhood neighbourhoods open open-data toronto toronto-open-data
Last synced: 25 Jun 2025
https://github.com/codehard8/web-scrapping
This repository provides a web scraping project built with BeautifulSoup, along with related files.
beutifulsoup data houses-for-sale python3 requests-library-python webscraping
Last synced: 01 Jul 2025
https://github.com/rajlabmssm/echodata
echoverse module: Example data.
data echoverse fine-mapping genomics gwas qtl
Last synced: 17 Jan 2026
https://github.com/jonprice99/regional-election-analysis
An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.
data data-visualization election-analysis election-data pandas python
Last synced: 05 May 2026
https://github.com/abshek7/big-data
A repository for documenting the learning related to theory and practical notes of big data computing.
big-data data data-engineering mapreduce pyspark
Last synced: 15 Jun 2025
https://github.com/ahmad-mtr/prjkt_exam_schedule_test
I hate scrolling through a list of 300+ courses in my uni exam schedule, so I'm creating this. This is a test, btw :)
Last synced: 11 Apr 2025
https://github.com/yassin522/health-insurance-cross-sell-prediction
Prediction of Vehicles Health Insurance
data data-analysis data-science machine-learning plotly python
Last synced: 15 May 2026
https://github.com/badawy403/egy.list
A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.
city data egypt javascript nodejs npm package
Last synced: 08 Mar 2026
https://github.com/skygenesisenterprise/aether-calendar
Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications calendar capacitorjs data javascript linux macos nextjs typescript windows
Last synced: 12 Apr 2026
https://github.com/Greatwoman23/Sentiment-Analysis-on-Amazon-Products-Review
Sentiment_Analysis_On_Amazon_Product_Review
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 04 May 2025
https://github.com/mysociety/sync-ep-to-jkan
Syncs EveryPolitician data to mySociety's data portal.
data everypolitician jkan politicians
Last synced: 27 Jul 2025
https://github.com/indhra/cats-ijcnn-data-2004
CATS IJCNN Data 2004 Competition of Artificial Time Series
2004 artificial cats data ijcnn time-series
Last synced: 22 Mar 2025
https://github.com/hivesolutions/crossline
Simple event piping and storing infrastructure
Last synced: 15 May 2026
https://github.com/GAMELEIRA/studies-database
This repository aims to hold any and all scripts for learning and practicing SQL and NoSQL database management. The project consolidates the main fundamentals and principles, along with practice exercises and project development.
data database mongodb mssql mysql nosql sql
Last synced: 03 May 2025
https://github.com/engineeringmadness/gaming-ai-analytics
Using Databricks to analyze game reviews from Steam web store
data databricks llama pyspark semantic-layer
Last synced: 15 May 2026
https://github.com/ioboi/obloc-data
Scrape guest counter of O'BLOC
Last synced: 04 Nov 2025
https://github.com/manifoldfinance/honte
reference data and metrics for sushiswap proposal
Last synced: 18 May 2026
https://github.com/prernarohra/todo-webapp
Simple Todo App for practice.
axios css data fastapi html json python typescript
Last synced: 06 Apr 2026
https://github.com/soenneker/soenneker.timezones.data
Provides TimeZone geometry
csharp data dotnet geometry lookup polygons timezone timezones timezonesdata
Last synced: 30 May 2026
https://github.com/gunn/covid-19-scripts
Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries
covid-19 data geography typescript
Last synced: 17 May 2026
https://github.com/theanujsinha01/data-analytics-portal-
Data Analytics Portal: a web-based data analytics tool built using Streamlit, Pandas, and Plotly. Supports CSV and Excel uploads (up to 200MB) for data exploration. Features include statistical summaries, group-by aggregation, and frequency counts. Integrates interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.
Last synced: 28 Apr 2026
https://github.com/dms-codes/scrape_tripsantai
Trip Santai Tour Data Scraper. This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.
beautifulsoup4 data python requests scraper webscraper
Last synced: 21 May 2026
https://github.com/bfontaine/datatools
📐 Some scripts I use to work with data
Last synced: 23 Jul 2025
https://github.com/rameshaditya/dynamic-hybrid-data-grid
Facilitates faster read-and-write of large ordered collections of data.
algorithms data data-structures storage
Last synced: 30 Jun 2026
https://github.com/shailu2004/azure_big_data_project
This project demonstrates a comprehensive Azure Data Engineering workflow using multiple Azure resources to process and analyze an e-commerce dataset. The dataset consists of 8 files containing details about customers, payments, orders, and other key information
ai azure cloud data data-engineering
Last synced: 08 Jul 2025
https://github.com/truongnhatbui/automatidata
Automatidata
data data-analysis data-science data-visualization python tableau
Last synced: 08 Jul 2025
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/omari-kd/recommendation-system-analysis-and-modelling
This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.
data data-science data-science-in-r machine-learning-algorithms recommendation-system
Last synced: 08 Jan 2026
https://github.com/j-hagedorn/locals
🌐 A collection of tidied, neighborhood-level public datasets
address-dataset census-data census-tract data neighborhood social-sciences
Last synced: 03 Feb 2026
https://github.com/ressuman/next-blog-1-project
Next.js with TypeScript: Fetching Data and Setting Up Routes. This project demonstrates my first experience with Next.js using TypeScript. It involves fetching posts from the JSON Placeholder dummy API, setting up pages, and linking routes.
api-rest data html-css-javascript jsx nextjs14 routing typescript
Last synced: 15 May 2026
https://github.com/rubidev68/citadelai-community
Community version of citadelai.app
ai ai-assistant chatbot chatbot-framework data knowledge-management silo-digital
Last synced: 03 Feb 2026
https://github.com/ims94/ballerina-tsv-querying
An example Ballerina project that queries TSV data using Ballerina's language-integrated queries
ballerina ballerina-lang data olympics query sql
Last synced: 03 Feb 2026
https://github.com/lut-ful/e-commerce-sales-report
This dashboard provides a visual analysis of e-commerce sales data
data data-analytics data-science data-visualization power-bi statics
Last synced: 28 Jun 2025
https://github.com/jun-labs/json-handling
JSON data handling examples.
data gson jackson json json-object
Last synced: 15 May 2026
https://github.com/xylambda/data-structures-algorithms
This repository provides implementations of popular algorithms and abstract data types using Java.
algorithm algorithms array arraylist avl-tree data data-structures graph heap iterative java linked list netbeans queue recursive set stack tree
Last synced: 30 Jun 2026
https://github.com/vedantwalia/google-data-analytics-capstone-case-study
This is a repository of my work on data analysis as a part of the Google Data Analytics Capstone
bigquery data data-viz datavisualization-project divvy-bikes google googledataanalytics sql tableau tableau-public
Last synced: 02 Jan 2026
https://github.com/kashyap-prabhat/sigma
A Scala library for probability and statistics formulas, including rules for probability calculations.
data formulas library mathematics probability scala statistics
Last synced: 30 Jun 2026
https://github.com/chompfoods/stub-jaxrs-jersey
JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server
Last synced: 02 May 2026
https://github.com/interzoid/typescript-examples
Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
angular api cloud data database matching nodejs quality typescript
Last synced: 12 Jan 2026
https://github.com/interzoid/php-examples
Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
api cloud data database php quality
Last synced: 12 Jan 2026
https://github.com/cody-scott/arclint
A flexible tool to validate and improve your data in ArcGIS using regex and other methods
arcgis arcgispro data lint regex validation
Last synced: 14 May 2025
https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning
This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.
credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects
Last synced: 02 Sep 2025
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/ntnn/dataparse
Parsing, transforming and unmarshalling data.
data data-parser data-parsing data-transformation golang golang-lib
Last synced: 30 Jun 2026
https://github.com/rickstaa/ai-compute-visualizer
A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.
Last synced: 28 Jun 2025
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/ressuman/csv-writer-project
CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.
Last synced: 15 May 2026