An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/joeyism/py-cifar10

This library was created to allow easy use of the CIFAR-10 data. It is a wrapper around the instructions given on the CIFAR-10 site.

cifar cifar-10 cifar10 data machine-learning machinelearning

Last synced: 30 Jul 2025

https://github.com/asuozzo/medicare-data-analysis

An analysis of Medicare Part D data in Vermont

data python

Last synced: 04 May 2026

https://github.com/millengustavo/salarios-data-science

Streamlit app for exploring the data from the Data Science market survey conducted by Data Hackers

brasil brazil ciencia-de-dados data data-science heroku salarios salary

Last synced: 07 Oct 2025

https://github.com/visenger/prada

Profiling Datasets

cleaning data dataset profiling

Last synced: 24 Aug 2025

https://github.com/derrickbaruga7/python-data-analysis

This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.

analytics data data-science visualization

Last synced: 31 Jul 2025

https://github.com/flowsynx/plugin-postgresql

FlowSynx plugin that interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.

data database flowsynx postgresql postgresql-database sql

Last synced: 09 May 2026

https://github.com/ajsalemo/python-pandas-datalib

Testing and experimenting with some simple Pandas functionality using Flask to serve the parsed data.

csv data flask json pandas pandas-dataframe pandas-series python tabular tabular-data terminal

Last synced: 09 Apr 2026

https://github.com/tonykipkemboi/ens_subgraph_data

Query On-Chain Data from Subgraphs by The Graph Protocol using Python

data subgraphs thegraphprotocol web3

Last synced: 17 Sep 2025

https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint

A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT

bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs

Last synced: 25 Sep 2025

https://github.com/jimbrig/jimstaskviews

CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com

cran data docs rstats shiny-app submodules task-views

Last synced: 06 Mar 2026

https://github.com/v6ntage/sql-sales_data-analytics-project

This repository contains SQL scripts demonstrating analytical techniques.

analytics business-analytics data data-analysis database query sql sql-server

Last synced: 12 Apr 2026

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/ate329/nsl-kdd-feature-extractor

Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.

cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset

Last synced: 30 Oct 2025

https://github.com/cont-limno/lagosus-reservoir

Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.

data module

Last synced: 17 Jan 2026

https://github.com/fjc0k/vue-merge-data

Intelligently merge data for Vue render functions.

data merge-data render-functions vue

Last synced: 17 May 2026

https://github.com/mikebairdrocks/fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 17 May 2026

https://github.com/snegovoy98/data-storage

This is a test version of data storage

data of storage test version

Last synced: 19 Jul 2025

https://github.com/prioritizr/prioritizrdata

Conservation planning data sets

data r spatial-data

Last synced: 19 Jul 2025

https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning

The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more.

airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn

Last synced: 30 Dec 2025

https://github.com/inzhenerka/scooters_data_uploader

Loading data into PostgreSQL as part of the dbt course from Инженерка.Тех

data dbt postgresql

Last synced: 04 May 2026

https://github.com/am-i-groot/summer-intern-iitguwahati-spml

Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.

algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing

Last synced: 17 May 2026

https://github.com/muhammad-fiaz/ason

ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.

adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3

Last synced: 02 Feb 2026

https://github.com/desilinguist/hanukkah-of-data-2022

My solutions to Hanukkah of Data 2022

2022 data hanukkah pandas python

Last synced: 17 May 2026

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/giscience/measures-rest-oshdb-docker

Scripts for starting measures for geospatial datasets in a Docker container, using the OSHDB

data dggs docker geospatial mesure openstreetmap rest

Last synced: 18 Apr 2026

https://github.com/yourdataarchitect/french-realestate-data-pipeline

This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.

airlfow data datanalysis datapipeline market-intelligence real-estate

Last synced: 31 Dec 2025

https://github.com/coderooz/hr-dashboard

The goal of this project is to create a Power BI dashboard to showcase the attrition data within the company.

data data-analytics power-bi

Last synced: 07 Jan 2026

https://github.com/webianks/anotech-android

Android application that deals with various anomalous behaviours that occur in server data.

anomaly-detection data server

Last synced: 13 Apr 2025

https://github.com/alexdonh/adonis-cache

Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!

adonis-framework adonisjs cache data dependency redis storing

Last synced: 15 May 2026

https://github.com/pyrustic/jayson

Intuitive interaction with JSON files [DEPRECATED, check the project Shared]

data json pyrustic python

Last synced: 17 May 2026

https://github.com/stdlib-js/array-base-fill-by

Fill all elements within a portion of an array according to a callback function.

accessor array data fill generic javascript map node node-js nodejs set stdlib structure transform typed types

Last synced: 14 May 2026
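
As an illustration of the description above, here is a minimal Python sketch of filling a portion of a list according to a callback. The function name `fill_by`, the half-open `[start, end)` range, and the `callback(value, index)` signature are assumptions made for this example; they are not taken from the stdlib-js package, which is a JavaScript library with its own API.

```python
def fill_by(items, start, end, callback):
    """Overwrite items[start:end] in place using callback(value, index).

    Hypothetical helper for illustration only; indices outside the
    list are clamped rather than treated as errors.
    """
    first = max(start, 0)
    stop = min(end, len(items))
    for i in range(first, stop):
        # Each new value is computed from the old value and its index.
        items[i] = callback(items[i], i)
    return items


values = [1, 2, 3, 4]
fill_by(values, 1, 3, lambda value, index: value * 10)
```

Mutating the list in place mirrors how a fill operation usually behaves; returning the same list is only a convenience for chaining.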

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/fliplet/fliplet-widget-data-source-query

Data Source Query Provider

data provider widget

Last synced: 11 Apr 2025

https://github.com/boettiger-lab/taxadb-cache

Cache for taxadb files

data

Last synced: 19 May 2026

https://github.com/samharrison7/datamapper

Making mapping between datasets as simple as possible.

data data-mapper data-mapping data-science data-structures

Last synced: 17 Mar 2025

https://github.com/srindot/average_flightdata_collection_fwuav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Sep 2025

https://github.com/a-poor/taro

A package for repeatable rectangular data transformations in Python.

data data-science data-transformation pipeline pypi-package python

Last synced: 13 Oct 2025

https://github.com/sourceduty/text_file_metadata

📄 Extract metadata from .txt files and record the metadata in .txt files.

data datascience metadata metafile practice sourceduty

Last synced: 08 Aug 2025

https://github.com/encoreshao/data-science

Data analysis examples using Jupyter Notebook and Python.

data dataanalysis encore jupyter-notebook

Last synced: 29 Mar 2025

https://github.com/kylepw/multistack

Example of multiple stacks in one array.

algorithms array data data-structures python stack

Last synced: 17 Mar 2025
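
The idea named in the description above, several stacks sharing one array, can be sketched as follows. This assumes a fixed-capacity layout in which a single Python list is divided into equal regions, one per stack. The class name `MultiStack` and its methods are hypothetical and are not taken from the repository.

```python
class MultiStack:
    """Several fixed-capacity stacks stored in one backing list (illustrative only)."""

    def __init__(self, num_stacks: int, capacity: int) -> None:
        self.capacity = capacity
        # One shared list; stack k owns slots [k * capacity, (k + 1) * capacity).
        self.items = [None] * (num_stacks * capacity)
        self.sizes = [0] * num_stacks

    def _top_index(self, stack: int) -> int:
        # Position of the current top element of the given stack.
        return stack * self.capacity + self.sizes[stack] - 1

    def push(self, stack: int, value) -> None:
        if self.sizes[stack] == self.capacity:
            raise OverflowError(f"stack {stack} is full")
        self.sizes[stack] += 1
        self.items[self._top_index(stack)] = value

    def pop(self, stack: int):
        if self.sizes[stack] == 0:
            raise IndexError(f"stack {stack} is empty")
        index = self._top_index(stack)
        value = self.items[index]
        self.items[index] = None  # clear the slot so it can be reused
        self.sizes[stack] -= 1
        return value
```

Equal-sized regions keep the index arithmetic simple, at the cost of one stack being able to fill up while others still have free slots.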

https://github.com/sharoonjoseph321/social_media_eda

Data analysis of social media apps using pandas, Python, and matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 03 Mar 2025

https://github.com/pulgamecanica/d3examples

https://www.oreilly.com/library/view/d3-for-the/9781492046783/

d3 d3-visualization d3js d3v4 data javascript

Last synced: 19 May 2026

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/farovictor/mongodbloader

This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.

data go mongodb pipeline

Last synced: 15 May 2026

https://github.com/josemartinezrdev/logisticadb

Logistica Database

data ddl diagrama dml mysql sql

Last synced: 09 Jul 2025

https://github.com/robsteranium/user2022-ldf-talk

Slides from my useR! 2022 talk about the Linked-Data Frames package

data data-frame linked-data r rdf

Last synced: 19 Apr 2025

https://github.com/shahules786/titanic-analysis

Various analyses of the Titanic accident (data from Kaggle)

analyze data titanic-kaggle

Last synced: 26 Jun 2025

https://github.com/sofyan48/wahoo

Data stream library with kinesis

aws data data-stream event kinesis stream

Last synced: 14 May 2026

https://github.com/jigyasag18/financial-risk-analysis-project

The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from a SQL database, the dashboard tracks key metrics

data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis

Last synced: 06 Mar 2026

https://github.com/henryssondaniel/teacup-java-report-file

Report Teacup data to a file

data file logs reports teacup

Last synced: 22 Jul 2025

https://github.com/stdlib-js/array-base-index-of-same-value

Return the index of the first element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 15 May 2026
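
The "same value" comparison mentioned above differs from ordinary equality in two cases: NaN is considered equal to NaN, and positive zero is distinguished from negative zero. The following Python sketch illustrates that behaviour for floats. The function names are hypothetical, and the handling of non-float values (plain `==`) is an assumption of this example, not a description of the stdlib-js implementation.

```python
import math


def same_value(a, b) -> bool:
    """Compare two values, treating NaN as equal to NaN and 0.0 as distinct from -0.0."""
    if isinstance(a, float) and isinstance(b, float):
        if math.isnan(a) and math.isnan(b):
            return True
        if a == 0.0 and b == 0.0:
            # copysign exposes the sign bit that == ignores for zeros.
            return math.copysign(1.0, a) == math.copysign(1.0, b)
    # Assumption: everything else falls back to ordinary equality.
    return a == b


def index_of_same_value(items, search) -> int:
    """Return the index of the first element that is the same value as search, or -1."""
    for i, item in enumerate(items):
        if same_value(item, search):
            return i
    return -1
```

By contrast, Python's built-in `list.index` relies on identity and `==`, so it does not distinguish `0.0` from `-0.0`.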

https://github.com/mawiegand/automatic-point-label-placement-data

Test instances for the automatic point label placement problem.

data datastructures generator javascript labeling problem ruby

Last synced: 16 May 2026

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/dina-hosny/sequence-trigger-pair-for-all-schema-tables-plsql

A PLSQL script that creates Sequence Trigger Pair for all Schema's Tables

data oracle plsql sequence sequencetrigger sql toad trigger

Last synced: 06 Mar 2026

https://github.com/amarlearning/exploring-the-evolution-of-linux

Data Analysis about the development of the Linux operating system by exploring its Git repository history.

cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux

Last synced: 12 May 2026

https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration

This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.

covid-19 data data-exploration dataset sql

Last synced: 19 Feb 2026

https://github.com/lord3008/instances-of-data-analysis

This repository shows my data analysis work across various projects. I feel data analysis is the key to investigating a solution. Furthermore, it points the way towards model building.

data data-analysis

Last synced: 03 Mar 2025

https://github.com/francois-lenne/portofolio_flenne_streamlit

Portfolio of Francois Lenne, built with Streamlit

data portofolio python slack-api streamlit

Last synced: 15 May 2026

https://github.com/eyluldursun/data-science-project

This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.

data data-science obesity r rmarkdown

Last synced: 26 Jun 2025

https://github.com/sourceduty/cults_3d

🔢 Software concept for producing additional statistics in Python from Cults design data .csv files.

3d 3d-model 3d-model-software 3d-modelling account account-management concept cults cults-3d data idea sourceduty

Last synced: 08 Aug 2025

https://github.com/heitang/fcu-courseapi

Feng Chia University: course search system API usage guide

api data fcu project

Last synced: 27 Jul 2025

https://github.com/solrikk/vargen

VarGen (Variation Generator) is a user-friendly desktop application designed to simplify the creation of product variations from CSV files.

csv-files csv-format csv-parser data data-engineering excel excelparser python

Last synced: 29 Mar 2025

https://github.com/mysociety/sync-ep-to-jkan

Syncs EveryPolitician data to mySociety's data portal.

data everypolitician jkan politicians

Last synced: 27 Jul 2025

https://github.com/gunn/covid-19-scripts

Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries

covid-19 data geography typescript

Last synced: 17 May 2026

https://github.com/pedelriomarron/spanish-api-covid19

Data from Spain of COVID-19 (by Datadista) as a service

api covid-19 covid-19-spain data now spain zeit

Last synced: 12 Mar 2025

https://github.com/ericgio/history-of-jazz

Data and visualizations based on Ted Gioia's "The History of Jazz"

data jazz

Last synced: 28 Mar 2025

https://github.com/akashlogics/street-data-tracking

Detect, track, and count the number of people walking across the path(s) using YOLO. This Python project tracks people moving across predefined street zones.

analysis data excel newdataset object-detection opencv python python3 yolo

Last synced: 19 May 2026

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/buildinamsterdam/contentful-graphql

Contentful GraphQL connection

contentful data graphql

Last synced: 05 Jan 2026

https://github.com/madihanazir/ds-using-c

Basic insights into data structures (inspired by Abdul Bari's course, but in C)

data self-learning structures-in-c

Last synced: 17 Mar 2025

https://github.com/dan149/uselesscontentcreator

Useless Content Creator (UCC) is a fake content generator for text, HTML, and PDF files.

content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3

Last synced: 03 Apr 2025

https://github.com/brunosalerno/osm_data

Ruby objects for dealing with OSM data, and generating XML files

data openstreetmap ruby xml

Last synced: 21 Apr 2026

https://github.com/garcane/layoffs-exploratory-data-analysis

This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.

data dataanalysis eda mysql sql

Last synced: 29 Oct 2025

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Analysis of how the pandemic affected bicycle use in CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/ezeparziale/analisis-data-delitos

:gun: Analysis of crimes in CABA

data data-science

Last synced: 14 Mar 2025

https://github.com/official-imvoiid/multifetch

A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection

aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows

Last synced: 19 May 2026

https://github.com/biril/audio-test-data

Audio data to use for testing

audio data mpeg test

Last synced: 11 Jan 2026

https://github.com/thedevreda/jadaerospace

A real-life project showing how to improve aircraft parts sales and help sellers focus on more effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025

https://github.com/webdevcave/collections-php

A PHP library for managing collections of data with support for nested keys.

array collection data helper library nested-keys package php utility utility-classes

Last synced: 28 Jun 2026

https://github.com/bodfdaf/api

api data service provider

api data detail instagram lazada shopee tiktok video

Last synced: 11 Mar 2025

https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning

Best Master’s Programs in Machine Learning (ML) for 2021. These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning.

ai api data education project school

Last synced: 17 Jun 2025

https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial

Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI

dagster data data-orchestration kedro luigi mageai prefect

Last synced: 18 Apr 2026