An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/rickyarians/practical-statistic-car-emission

Practical Statistic Project- Car Emission in Canada - 2022

data data-science dataanalysis r rmarkdown rpubs statistics

Last synced: 22 May 2026

https://github.com/iamyourdre/naive-bayes-classifier-js

Naive Bayes classifier developed with MySQL, ExpressJS, and NodeJS by @iamyourdre.

backend data data-science expressjs javascript mysql naive-bayes naive-bayes-algorithm naive-bayes-classifier nodejs

Last synced: 08 Apr 2026

https://github.com/iyashwantsaini/tweetify_

Twitter Data Collection, Analysis Tool

collection data twitter twitter-sentiment-analysis

Last synced: 08 Mar 2026

https://github.com/pooja-manjunatha/nyc_parking_violations_dbt

This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations

data data-analysis data-engineering dbt duckdb python sql

Last synced: 14 May 2026

https://github.com/mobinx/easymeet-js

EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything

data meeting react realtime screensharing streaming-video webrtc zoom

Last synced: 03 Jan 2026

https://github.com/merrill007/sql-data-warehouse-project

The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.

architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver

Last synced: 22 Mar 2025

https://github.com/valyaevgeorgiy/r_basic

Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.

coding data data-analysis r rstudio

Last synced: 12 Dec 2025

https://github.com/richelbilderbeek/heyahmama

Data about the Flemish/Dutch band K3

band data k3 package r r-lang r-language

Last synced: 22 May 2026

https://github.com/charlieroth/exoexplo

Exploring NASA Exoplanet Archive Data

data exoplanets julia nasa

Last synced: 03 Apr 2025

https://github.com/kirkalyn13/xyz-books-pipeline

XYZ Books Pipeline to check and update incoming ISBNs from newly added books from the CRUD UI, and record new data to a CSV file.

api csv data go http rabbitmq

Last synced: 05 Mar 2025

https://github.com/push-protocol/push-google-bigquery

The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis

bigquery data push push-notifications web3

Last synced: 26 Mar 2025

https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023

Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.

cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill

Last synced: 16 Aug 2025

https://github.com/RedInfinityPro/ScientificSharp

Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.

componentmodel cryptography data drawing forms generic linq system tasks text

Last synced: 30 Sep 2025

https://github.com/realbxnnie/accountservice

A Simple DataStoreService wrapper with session backuping and session locking.

data lua luau roblox

Last synced: 29 Jul 2025

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/8hrsk/ranger

Package for generating fake userdata to work with.

data factory faker generator npm

Last synced: 30 Apr 2026

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign

A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla December 2022. Helping dr.Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data anlysis is done using ROOT by Cern. Both C++ based. Job distributors to run the sim are coded in pearl

analysis architecture-design cplusplus data oop oop-principles pearl simulations

Last synced: 05 Sep 2025

https://github.com/alex0x4b/akutils

High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).

data dataframe pandas python

Last synced: 08 Mar 2026

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/ethenkem/PyGraphSurvey

A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc

data

Last synced: 30 Apr 2025

https://github.com/jonathanstowe/databulous

Abstraction for tabular data

data perl6 table tabular

Last synced: 02 Apr 2025

https://github.com/thesfinox/fit-the-data

Data analysis using Wolfram Mathematica

analysis data data-analysis lab mathematica wolfram wolfram-mathematica

Last synced: 24 Jan 2026

https://github.com/patrikcze/meshtatic_data

Meshtastic Data Transfer - Trying some stupid thing, like transferring files over LORA network.

data meshtastic meshtastic-python

Last synced: 03 Feb 2026

https://github.com/erictleung/tidytuesdays

:chart_with_upwards_trend: My attempts at #tidytuesday

data data-science data-visualization r rstats tables tidytuesday tidyverse

Last synced: 19 Sep 2025

https://github.com/krescruz/pegaso-data

Utilerías para el analisis de datos del Proveedor de Certificación de Factura Pegaso

cfdi-mexico data pac sat-gob

Last synced: 29 Apr 2026

https://github.com/pyrustic/litedao

Intuitive interaction with SQLite database

auto-init dao data database database-access library lightweight pyrustic python sql sqlite

Last synced: 09 May 2026

https://github.com/mattpap/pycon-2017-bokeh

Bokeh tutorial at PyCon.PL 2017

bokeh data tutorial visualization

Last synced: 17 Mar 2025

https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling

Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.

classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier

Last synced: 01 Jun 2026

https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling

Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.

data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis

Last synced: 05 Mar 2025

https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset

This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.

arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting

Last synced: 23 Jun 2025

https://github.com/par7133/xsltmaster

Dynamically load data from multiple XML/XSLT in webpages

data dynamic load webpages xml xslt

Last synced: 02 Mar 2025

https://github.com/maulanakavaldo/tri-hita-karana

Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.

bali data data-science g20 science

Last synced: 02 Mar 2025

https://github.com/bala-1409/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning

Last synced: 26 Apr 2026

https://github.com/bala-1409/loan-classification-data-science-projects

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization

Last synced: 22 Mar 2025

https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions

Included in Serious SQL Virtual apprenticeship program, this repository contains solutions for all eight different case studies crafted by Danny Ma. For more information please visit: https://8weeksqlchallenge.com/

8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql

Last synced: 07 Apr 2025

https://github.com/gui-sitton/prepaid

In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 22 May 2026

https://github.com/domarps/grad-project-reports

Write-ups of a few key semester-long projects I have worked during my Masters

circuit data deeplearning graph-algorithms matlab question-answering

Last synced: 26 Mar 2025

https://github.com/jor-/measurements

Python functions to handle, statistically analyze and plot measurement data.

data measurements python

Last synced: 17 Mar 2025

https://github.com/shubhamsoni98/classification-with-random-forest---2

Fraud detection is a critical task for financial institutions and businesses. This document outlines the end-to-end process of predicting fraudulent activities using a Random Forest model. The process includes data preparation, exploration, model training, and evaluation.

algorithms anaconda data data-science dataflow feature-engineering jupyter-notebook machine-learning model modeltraining prediction python random-forest sql visualization

Last synced: 20 Jan 2026

https://github.com/moons-14/datapot

Incorporate and serve all information.

ai aiogram api data infomation news newspaper rss video

Last synced: 04 Jan 2026

https://github.com/khansasafira19/sk-cool-storytelling

Source Code for Data Storytelling with HTML5

data html5 javascript storytelling

Last synced: 13 May 2026

https://github.com/amethyst-php/setting

Give the user the ability to configure his own settings

amethyst amethyst-package api data laravel setting

Last synced: 19 May 2026

https://github.com/gregoritsch3/project_excel_dataanalysis_carsales

An Excel Data Analysis project based on a vehicle vendor's car sales data from 2014 and 2015 showcasing data cleaning and formatting, DAX, pivot tables and charts, timelines, slicers, an interactive Dashboard, descriptive Statistics and more.

analysis dashboard data excel sales statistics

Last synced: 01 Feb 2026

https://github.com/amethyst-php/price

Define prices and attach them to any model

amethyst amethyst-package api data laravel price

Last synced: 17 May 2026

https://github.com/lordzintick/spellcaster-api-1.21.4

A server-side Fabric mod to load JSON spell files from datapacks.

api api-server data fabric fabric-mod fabricmc json magic minecraft-mod server-side small spells

Last synced: 08 May 2026

https://github.com/ayresgneto/use-case-gcp-etl

ELT pipeline GCP. Tecnologias utilizadas: Postgresql, GCP Storage, Airflow (local), Pyspark (local), BigQuery

airflow big-data bigquery data data-engineering etl gcp pipeline postgresql programming-oriented-object pyspark python spark

Last synced: 03 Jan 2026

https://github.com/amethyst-php/source

The source of information. It can be used to save the origin of whatever information (news, books, etc.. )

amethyst amethyst-package api data laravel source

Last synced: 27 Apr 2026

https://github.com/bagustris/dataits

Web for DataITS17: Summer School on Data Science

data data-science

Last synced: 28 Jun 2025

https://github.com/yourdataarchitect/abyat-scaring-

This Scrapy spider for automates the extraction of product data from the Abyat website using Hidden Backend API, supporting both Arabic and English content.

data database scraper scrapy-crawler

Last synced: 23 Apr 2026

https://github.com/rudxain/xorsum

Get XOR checksum with this command-line tool

binary checksum cli data digest file files hexadecimal rust-crate xor

Last synced: 08 Mar 2026

https://github.com/dina-hosny/calculate-installments-dates-and-amounts-plsql

PLSQL project to Calculate the installments dates and amounts for contracts

data plsql sql toad trigger

Last synced: 06 Mar 2026

https://github.com/ashishsingh789/data_visualization

Data visualization project using Python to analyze categorical and continuous variables. Includes bar charts, histograms, and scatter plots. Libraries used: pandas, matplotlib, and seaborn.

analysis barchart data data-science data-visualization histogram matplotlib pandas-dataframe scatter-plot seaborn

Last synced: 07 Sep 2025

https://github.com/gustavonav/youtubeextractorflask

Aplicação para Extração e tratamento de dados do Youtube.

data full-stack mysql pipelines python web

Last synced: 14 Jun 2025

https://github.com/rllyhz/mini-data-center

This repo is to fulfill my internship assignment at the Office of Communication and Information (Kominfo), Balai Kota, Semarang, Indonesia

chartjs country-information data information-visualization laravel laravel-application

Last synced: 06 Nov 2025

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/jeugregg/deeplearningpicturedogs

Classify dogs pictures by Deep Learning CNN neural networks

classez-des-images cnn-keras data data-science ipynb neural-network vision

Last synced: 24 Jul 2025

https://github.com/erkylima/algorithms

Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! 🚀 #Python #Algorithms #DataStructures

algorithms algorithms-and-data-structures data data-structures

Last synced: 19 Jan 2026

https://github.com/lu-sketch/chocolate-imports-dataset

Chocolate Imports for South Africa

data eda visualization

Last synced: 18 May 2026

https://github.com/scanthe-net/scanthenet-php

PHP API Data Fetcher.

api data php scan scanner threat

Last synced: 25 Jul 2025

https://github.com/gusgitmath/cnn_braintumor_classification

Built a CNN for MRI brain tumor classification (Glioma, Meningioma, No Tumor, Pituitary) with 99.4% accuracy. Used data augmentation, optimized learning rates (Adam), and included EarlyStopping, ReduceLROnPlateau for superior performance, averting overfitting. Boosts early, accurate diagnosis, advancing medical treatment.

classification convolutional-neural-networks data deep-learning machine-learning

Last synced: 25 Jul 2025

https://github.com/sam-moen/data-analyst-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

data dataanalysis matplotlib mssql pandas powerbi python seaborn sql

Last synced: 08 Mar 2026

https://github.com/basemax/okala-database-crawler

A robust, UTF-8 compliant PHP-based crawler designed to extract structured product data from Okala. This tool efficiently scrapes and saves store information, category slugs, and detailed product listings into organized JSON files. Ideal for data analysis, backup, or integration into other systems.

crawler crawler-php curl data json okala okala-com okalacom php php-crawler scraper

Last synced: 01 May 2026

https://github.com/giosil/export-as

A convenience library for exporting data in different formats.

data data-export export exporter java

Last synced: 26 Jul 2025

https://github.com/noraui/noraui-datas-webservices

noraui-datas-webservices is a RESTdataProvider for NoraUi

data noraui rest-api service spring-boot-2 spring-boot-actuator

Last synced: 17 Mar 2025

https://github.com/peternaydenov/data-pool

Data layer for node apps and single page applications

cache data store

Last synced: 29 Apr 2025

https://github.com/denisecase/cintel-04-reactive

Interactive analytics, reactive app built with Shiny for Python

analytics bokeh data flights interactive mtcars penguins python relationships shiny

Last synced: 20 Jun 2025

https://github.com/bho0920/crime-data-analysis-eu

Crime Data Analysis for Self-Defense Tool Market Entry in the EU.

data data-analysis sql sqlite tableau

Last synced: 21 Jun 2025

https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc

Virtual Work Experience in Data Analysis at STC

analysis data data-visualization misk stc

Last synced: 20 Jun 2025

https://github.com/sakan811/gachascope

Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.

data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves

Last synced: 04 May 2026

https://github.com/austinv11/pypeline

A simple data pipeline builder for Python 3+

data leveldb pypeline python python3 stream-processing

Last synced: 20 Aug 2025

https://github.com/hamolicious/console-table

Displaying Tables in the console

console data pypi python table

Last synced: 11 Jul 2025

https://github.com/dhi13man/rca_ace

RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).

analytics data enterprise open-source python python3 rca

Last synced: 10 Sep 2025

https://github.com/miroslavvidovic/distribution-graphs

Creating ASCII graphical histograms in the terminal with https://github.com/philovivero/distribution

ascii data graph histogram python terminal

Last synced: 24 Apr 2026

https://github.com/mradkov/secure-data-exchange

Elliptic Curve Diffie-Hellman secure data exchange via smart contracts on Aeternity blockchain

aeternity data exchange key-exchange smart-contracts sophia

Last synced: 22 Jun 2025

https://github.com/dolanmiu/mclaren-task

A front end assessment task for Mclaren

angular data observable observables rxjs

Last synced: 16 May 2026

https://github.com/uttori/uttori-data-tools

Tools for working with binary data.

binary data stream uttori

Last synced: 17 Feb 2026

https://github.com/aliaksandr-master/unipipeline

simple way to build the declarative and destributed data pipelines with python

celery data pipeline python

Last synced: 11 Jul 2025

https://github.com/thetacom/byteclasses

A Python package to manage and interact with binary data in a simple and structured manner.

binary-data bytes data dataclasses package python python3

Last synced: 11 Jul 2025

https://github.com/jensostertag-archive/charts.js

A JavaScript Plugin to draw Charts to visualize Data and Statistics on Websites

charts data javascript statistics webapplication

Last synced: 22 Jun 2025

https://github.com/fintech-lsi/fintech-credit-risk-prediction

This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.

data fintech machine-learning model prediction risk

Last synced: 12 Oct 2025

https://github.com/lorinczakos/sql-projects

This is a collection of my SQL scripts that I wrote and were approved through my course with GoIT Romania Data Analyst course

bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode

Last synced: 16 May 2026

https://github.com/frer0t/userverse

creating api for data analysis

data data-analytics spring-boot users

Last synced: 12 Apr 2026