An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/pcpp94/elexon_pipeline_gb_demand

Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.

data electricity elexon gb octopusenergy power powerdata pypsa uk

Last synced: 12 Jul 2025

https://github.com/phtrempe/l2a

This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.

applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow

Last synced: 05 May 2026

https://github.com/devbigboy/iti-database

This course will cover the following Topics: joins, Normalization, Aggregate function, Group By, Order By, Select, Ranking Functions, Built-In Functions

analytics data data-analytics mssql-database sql sql-server

Last synced: 03 Nov 2025

https://github.com/gabboraron/datacamp_projects

Here you can find my DataCamp Projects

data datacamp datacamp-projects

Last synced: 14 Jun 2026

https://github.com/wciesialka/top-names

A Python module for scraping the list of top first names in the United States.

data python python3

Last synced: 08 Jun 2026

https://github.com/fridex/real-estate

My machine learning in real estate

data machine-learning real-estate

Last synced: 27 Jun 2025

https://github.com/echang1802/normandy

Normandy is a python framework for data pipelines, which main objective is standardizing your team code and provide a data treatment methodology flexible to your team needs.

analytics business-intelligence data dataengineering datascience etl pipeline

Last synced: 11 Mar 2026

https://github.com/radekbednarik/att

Python wrapper for calling Apitalks API.

api-wrapper apitalks data python3 rest-api wrapper

Last synced: 05 Apr 2025

https://github.com/csmith0651/ormy

A simple python ORM.

data database python

Last synced: 13 May 2026

https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql

This project involves cleaning a dataset containing information about layoffs from companies around the world.

data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql

Last synced: 08 Jun 2026

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/himanshub16/lekhpal

Monitor and catalog Twitter feed matching your desired keywords

analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api

Last synced: 14 May 2026

https://github.com/axafrance/azureml-to-openshift-talk

Scale your dev IA: From dev AzureML to prod OpenShift in one click

ai axa azureml data learn ml openshift raise-the-bar talk

Last synced: 16 Feb 2026

https://github.com/azaz9026/loan_approval_prediction

Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio

data data-analysis data-visualization eda machine-learning numpy pandas python statistics

Last synced: 06 Apr 2026

https://github.com/peternaydenov/data-pool

Data layer for node apps and single page applications

cache data store

Last synced: 29 Apr 2025

https://github.com/4ment/aiv-rate-heterogeneity

Avian influenza virus data sets

data influenza

Last synced: 24 Jan 2026

https://github.com/thesfinox/sql-simple-backup

Simple script to backup data in a MySQL database and store it in a WebDAV server.

backup bash data mysql script sql webdav

Last synced: 18 Apr 2026

https://github.com/lisakey/lisakey

I am passionate about Python 🐍 and SQL πŸ—ƒοΈ for data analysis πŸ“Š, and I actively develop projects in these languages.

analysis analyst data dataanalysis dataanalyst java python sql

Last synced: 02 May 2026

https://github.com/purarue/blizzard_gdpr_parser

Parses date-related information from my blizzard GDPR export.

blizzard data gdpr webscraping

Last synced: 06 Apr 2025

https://github.com/purarue/hpi-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 06 Apr 2025

https://github.com/nxank4/an-augment

A Python library for advanced and novel data augmentation, combining traditional techniques like cropping and blurring with state-of-the-art generative AI methods such as style transfer, image inpainting, and latent space interpolation. It boosts data diversity for robust machine learning applications.

computer-vision data data-augmentation data-augmentation-strategies data-augmentation-techniques generative-ai image image-processing synthetic-data

Last synced: 10 Mar 2026

https://github.com/renebentes/2808

Curso 2808 - Fundamentos do Entity Framework

course csharp data ef-core

Last synced: 27 Jun 2025

https://github.com/lakshyakumar266/jee-dpp-manager-app

DPP manager app for JEE preparing Students

data expo javascript management react-native

Last synced: 07 May 2026

https://github.com/mai-space/design-concept-sharing-recipes

πŸ–ΌοΈ Concept for a framework based on state of the art technology and libaries for secure data sharing and online collaboration, as well as focus on the ux and ui of said framework

concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories

Last synced: 14 May 2025

https://github.com/jph5396/sumomodel

A data models related to sumo wrestling.

data go sumo

Last synced: 17 Jan 2026

https://github.com/gagolews/clustering-data-v0

Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]

clustering data dataset machine-learning

Last synced: 15 Sep 2025

https://github.com/zeh237/superstore-data-analytics

This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python

analytics data data-analysis data-science data-visualization flask python superstore

Last synced: 04 May 2025

https://github.com/mapi-developer/dapo

Simple, zero-dependency tabular data manipulation and analysis for Python.

dapo data python

Last synced: 06 Mar 2026

https://github.com/miss-mhv/data-analysis-for-social-buzz

In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.

data jupyter-notebook python

Last synced: 14 May 2026

https://github.com/canadaluke888/terminaltablebuilder

Build and edit tabular data all from the terminal.

cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables

Last synced: 20 Apr 2026

https://github.com/heitang/fcu-courseapi

ι€’η”²ε€§ε­ΈοΌšθͺ²η¨‹ζͺ’η΄’η³»η΅± API 使用θͺͺ明

api data fcu project

Last synced: 27 Jul 2025

https://github.com/ssiarhei115/cv-dbase-analysis

HeadHunter CVs data base analysis

analysis cv data data-science resume

Last synced: 09 Apr 2025

https://github.com/rrwen/poster-gisci-osmol

Conference poster and short paper titled "Outlier Detection in OpenStreetMap Data using the RandomForest Algorithm and Variable Contributions" for the GIScience Conference in 2016

2016 algorithm conference contribution data detection forest gis giscience learn machine open openstreetmap osm outlier paper poster random short variable

Last synced: 03 Apr 2025

https://github.com/parmsam/rweekly.data

R package containing data on Rweekly posts

data package rweekly

Last synced: 21 May 2026

https://github.com/rrwen/geohoods-to

Geospatial dataset of 1000+ aggregated variables for neighbourhoods in Toronto, ON, CA

csv data dataset geo geojson gis neighborhood neighborhoods neighbourhood neighbourhoods open open-data toronto toronto-open-data

Last synced: 25 Jun 2025

https://github.com/codehard8/web-scrapping

In this repository we have provide a web scrapping project through beautifulSoup and related files

beutifulsoup data houses-for-sale python3 requests-library-python webscraping

Last synced: 01 Jul 2025

https://github.com/rajlabmssm/echodata

echoverse module: Example data.

data echoverse fine-mapping genomics gwas qtl

Last synced: 17 Jan 2026

https://github.com/jitsasmal/customer-purches-behavior-and-shopping-analysis

Create dashboard to analyse the data based to total product sales, terget, revenue, state and season wize analyse to show the current treand the data.

analytics dashboard data etl powerbi

Last synced: 14 Feb 2026

https://github.com/jonprice99/regional-election-analysis

An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.

data data-visualization election-analysis election-data pandas python

Last synced: 05 May 2026

https://github.com/abshek7/big-data

A repository for documenting the learning related to theory and practical notes of big data computing.

big-data data data-engineering mapreduce pyspark

Last synced: 15 Jun 2025

https://github.com/ahmad-mtr/prjkt_exam_schedule_test

I hate scrolling in a list of 300+ courses of my Uni exam schedule, so I'm creating this. this's a test btw :)

data strings-manipulation

Last synced: 11 Apr 2025

https://github.com/badawy403/egy.list

A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.

city data egypt javascript nodejs npm package

Last synced: 08 Mar 2026

https://github.com/skygenesisenterprise/aether-calendar

Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications calendar capacitorjs data javascript linux macos nextjs typescript windows

Last synced: 12 Apr 2026

https://github.com/mysociety/sync-ep-to-jkan

Syncs EveryPolitician data to mySociety's data portal.

data everypolitician jkan politicians

Last synced: 27 Jul 2025

https://github.com/indhra/cats-ijcnn-data-2004

CATS IJCNN Data 2004 Competition of Artificial Time Series

2004 artificial cats data ijcnn time-series

Last synced: 22 Mar 2025

https://github.com/hivesolutions/crossline

Simple event pipping and storing infra-structure

counter data opencv warehouse

Last synced: 15 May 2026

https://github.com/GAMELEIRA/studies-database

Esse repositΓ³rio tΓͺm como objetivo alocar todo e qualquer script para aprender e praticar gerenciamento de banco de dados SQL e NoSQL. Nesse projeto, serΓ£o consolidados os principais fundamentos e princΓ­pios, alΓ©m da prΓ‘tica de exercΓ­cios e desenvolvimento de projetos.

data database mongodb mssql mysql nosql sql

Last synced: 03 May 2025

https://github.com/dcmox/moxymapper

Data mapping made easy

data json mapper

Last synced: 15 May 2026

https://github.com/engineeringmadness/gaming-ai-analytics

Using Databricks to analyze game reviews from Steam web store

data databricks llama pyspark semantic-layer

Last synced: 15 May 2026

https://github.com/ioboi/obloc-data

Scrape guest counter of O'BLOC πŸ§—β€β™€οΈ

data scraping

Last synced: 04 Nov 2025

https://github.com/manifoldfinance/honte

reference data and metrics for sushiswap proposal

data ethereum sushi sushiswap

Last synced: 18 May 2026

https://github.com/prernarohra/todo-webapp

Simple Todo App for practice.

axios css data fastapi html json python typescript

Last synced: 06 Apr 2026

https://github.com/gunn/covid-19-scripts

Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries

covid-19 data geography typescript

Last synced: 17 May 2026

https://github.com/rrwen/twitter2return

Module for extracting Twitter data using option objects

access api data extract geo get location media oauth object option post rest return sample social stream token tweet twitter

Last synced: 03 Apr 2025

https://github.com/theanujsinha01/data-analytics-portal-

Data Analytics Portal Built a web-based data analytics tool using Streamlit, Pandas, and Plotly. Supported CSV and Excel uploads (up to 200MB) for data exploration. Features included statistical summaries, group-by aggregation, and frequency counts. Integrated interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.

analytics data portal

Last synced: 28 Apr 2026

https://github.com/dms-codes/scrape_tripsantai

Trip Santai Tour Data Scraper This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.

beautifulsoup4 data python requests scraper webscraper

Last synced: 21 May 2026

https://github.com/bfontaine/datatools

:triangular_ruler: Some scripts I use to work with data

data ruby script

Last synced: 23 Jul 2025

https://github.com/rameshaditya/dynamic-hybrid-data-grid

Facilitates faster read-and-write of large ordered collections of data.

algorithms data data-structures storage

Last synced: 30 Jun 2026

https://github.com/shailu2004/azure_big_data_project

This project demonstrates a comprehensive Azure Data Engineering workflow using multiple Azure resources to process and analyze an e-commerce dataset. The dataset consists of 8 files containing details about customers, payments, orders, and other key information

ai azure cloud data data-engineering

Last synced: 08 Jul 2025

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/omari-kd/recommendation-system-analysis-and-modelling

This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.

data data-science data-science-in-r machine-learning-algorithms recommendation-system

Last synced: 08 Jan 2026

https://github.com/j-hagedorn/locals

:globe_with_meridians: A collection of tidied, neighborhood-level public datasets

address-dataset census-data census-tract data neighborhood social-sciences

Last synced: 03 Feb 2026

https://github.com/ressuman/next-blog-1-project

Next.js with TypeScript: Fetching Data and Setting Up Routes. This project demonstrates my first experience with Next.js using TypeScript. It involves fetching posts from the JSON Placeholder dummy API, setting up pages, and linking routes.

api-rest data html-css-javascript jsx nextjs14 routing typescript

Last synced: 15 May 2026

https://github.com/ims94/ballerina-tsv-querying

An example Ballerina project to query tsv data using Ballerina language integrated queries

ballerina ballerina-lang data olympics query sql

Last synced: 03 Feb 2026

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/jun-labs/json-handling

πŸ” Json 데이터 핸듀링 예제.

data gson jackson json json-object

Last synced: 15 May 2026

https://github.com/zanuarts/datamining

Repo Matkul Data Mining

data data-mining

Last synced: 14 Mar 2025

https://github.com/xylambda/data-structures-algorithms

This repository provides implementations of popular algorithms and abstract data types using JAVA.

algorithm algorithms array arraylist avl-tree data data-structures graph heap iterative java linked list netbeans queue recursive set stack tree

Last synced: 30 Jun 2026

https://github.com/vedantwalia/google-data-analytics-capstone-case-study

This is a repository of my work on data analysis as a part of the Google Data Analytics Capstone

bigquery data data-viz datavisualization-project divvy-bikes google googledataanalytics sql tableau tableau-public

Last synced: 02 Jan 2026

https://github.com/kashyap-prabhat/sigma

A Scala library for probability and statistics formulas, including rules for probability calculations.

data formulas library mathematics probability scala statistics

Last synced: 30 Jun 2026

https://github.com/chompfoods/stub-jaxrs-jersey

JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server

Last synced: 02 May 2026

https://github.com/interzoid/typescript-examples

Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.

angular api cloud data database matching nodejs quality typescript

Last synced: 12 Jan 2026

https://github.com/interzoid/php-examples

Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.

api cloud data database php quality

Last synced: 12 Jan 2026

https://github.com/cody-scott/arclint

A flexible tool to validate and improve your data in ArcGIS using regex and other methods

arcgis arcgispro data lint regex validation

Last synced: 14 May 2025

https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning

This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.

credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects

Last synced: 02 Sep 2025

https://github.com/ntnn/dataparse

Parsing, transforming and unmarshalling data.

data data-parser data-parsing data-transformation golang golang-lib

Last synced: 30 Jun 2026

https://github.com/rickstaa/ai-compute-visualizer

A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.

ai data livepeer streamlit

Last synced: 28 Jun 2025

https://github.com/ressuman/csv-writer-project

CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.

data javascript typescript

Last synced: 15 May 2026