An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/The-Tech-Idea/Beep.winform.Sample

Application for Managing your Different DataSources . Still in Alpha.please be patient

application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows

Last synced: 04 Nov 2025

https://github.com/karensaraimoralesmontiel/8-week-sql-challenge

Case Studies Solutions for the 8-Week-SQL-Challenge.

data database sql

Last synced: 02 Jan 2026

https://github.com/dms-codes/www.usu.ac.ididdirektori

Faculty and Docent Data Retrieval Script The faculty_and_docent_data_retrieval.py script is a Python script for retrieving faculty and docent data from a university website using Selenium. It includes functions to extract faculty names and docent profiles, as well as a multithreading approach to fetch data for multiple faculty-docent pairs.

data python scrape

Last synced: 26 May 2026

https://github.com/kenjyco/mongo-helper

Helper funcs and tools for working with MongoDB

aggregation-pipeline data database kenjyco mongo mongodb python

Last synced: 28 Jan 2026

https://github.com/aiwithqasim/p1_explore-weather-trends

In this project, I'll analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. Moreover i will use SQL query to extract data from the given Data base and i have to visualize the insight or Average temperature to find the findings.

data dataanalyst database datavisualization nanodegree udacity

Last synced: 22 May 2026

https://github.com/josecsotomorales/dataform

Repository for testing dataform

cli data data-engineering data-transformation

Last synced: 27 Mar 2025

https://github.com/rickyarians/practical-statistic-car-emission

Practical Statistic Project- Car Emission in Canada - 2022

data data-science dataanalysis r rmarkdown rpubs statistics

Last synced: 22 May 2026

https://github.com/iamyourdre/naive-bayes-classifier-js

Naive Bayes classifier developed with MySQL, ExpressJS, and NodeJS by @iamyourdre.

backend data data-science expressjs javascript mysql naive-bayes naive-bayes-algorithm naive-bayes-classifier nodejs

Last synced: 08 Apr 2026

https://github.com/iyashwantsaini/tweetify_

Twitter Data Collection, Analysis Tool

collection data twitter twitter-sentiment-analysis

Last synced: 08 Mar 2026

https://github.com/mobinx/easymeet-js

EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything

data meeting react realtime screensharing streaming-video webrtc zoom

Last synced: 03 Jan 2026

https://github.com/merrill007/sql-data-warehouse-project

The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.

architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver

Last synced: 22 Mar 2025

https://github.com/ember-nexus/reference-dataset

Ember Nexus API backup containing different standardized scenarios

backup data ember-nexus

Last synced: 25 Jan 2026

https://github.com/questionlp/wwdtm_uniquedates

Script that lists out the unique months and days of months that Wait Wait... Don't Tell Me! shows have aired

data python python3 script wwdtm

Last synced: 17 May 2026

https://github.com/richelbilderbeek/heyahmama

Data about the Flemish/Dutch band K3

band data k3 package r r-lang r-language

Last synced: 22 May 2026

https://github.com/ngupta23/data_prep_helper

A helper package for preparing and combining data from a variety of sources

data data-science dataprep datapreparation dataprocessing helpers python

Last synced: 03 Apr 2025

https://github.com/kirkalyn13/xyz-books-pipeline

XYZ Books Pipeline to check and update incoming ISBNs from newly added books from the CRUD UI, and record new data to a CSV file.

api csv data go http rabbitmq

Last synced: 05 Mar 2025

https://github.com/injamul3798/cpp_stl-discussion

As we know ,STL is mostly used tools is competitive programming.

data list map set structure vector

Last synced: 02 Apr 2025

https://github.com/realbxnnie/accountservice

A Simple DataStoreService wrapper with session backuping and session locking.

data lua luau roblox

Last synced: 29 Jul 2025

https://github.com/talitalobo/statistics-with-python

Repo about statistical concepts and (not always) their python implementation.

data data-science machine-learning statistics

Last synced: 11 Jan 2026

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/emna-chebbi/student-performance

Predictive model for student exam scores based on student performance factors

ai computer-vision data kaggle machine-learning ml mse regression regression-models

Last synced: 15 May 2026

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/moscatellimarco/webscrap-imdb

🎬 Python scraper for IMDB: Extract movie/TV details for πŸ“Š analysis & πŸ—ƒοΈ storage. Easy setup, πŸ”§ customizable, with πŸ–₯️ CLI.

css data datascience html movies python scrapy scrapy-crawler scrapy-spider web web-scraping webdata webscraping

Last synced: 15 May 2026

https://github.com/alex0x4b/akutils

High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).

data dataframe pandas python

Last synced: 08 Mar 2026

https://github.com/shysolocup/fndt

JavaScript package allowing you to see function data like body and arguments from outside of the function

aepl data fndt functions javascript javascript-tools js js-function js-functions lightweight nodejs nodejs-modules package stews

Last synced: 30 Apr 2026

https://github.com/rsc-labs/see-open-data

Show www.dane.gov.pl in user friendly format. Generate flourish data or other data visualizations.

data data-visualization flourish government poland

Last synced: 04 Apr 2025

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/hackolade/yugabytedb-ysql

Hackolade(https://hackolade.com) plugin for the Cloud Native Yugabyte database with YSQL API

data data-modeling entity-relationship-diagram schema-design ysql yugabyte yugabytedb

Last synced: 30 Apr 2025

https://github.com/hivesolutions/repos

Modular repository management system

data python repos storage system

Last synced: 14 May 2026

https://github.com/thesfinox/fit-the-data

Data analysis using Wolfram Mathematica

analysis data data-analysis lab mathematica wolfram wolfram-mathematica

Last synced: 24 Jan 2026

https://github.com/patrikcze/meshtatic_data

Meshtastic Data Transfer - Trying some stupid thing, like transferring files over LORA network.

data meshtastic meshtastic-python

Last synced: 03 Feb 2026

https://github.com/krescruz/pegaso-data

UtilerΓ­as para el analisis de datos del Proveedor de CertificaciΓ³n de Factura Pegaso

cfdi-mexico data pac sat-gob

Last synced: 29 Apr 2026

https://github.com/kuanjiahong/covid19-analysis

A simple project to familiarize myself with data analysis

data data-science data-visualization pandas python

Last synced: 02 Apr 2025

https://github.com/stdlib-js/array-base-fill-by

Fill all elements within a portion of an array according to a callback function.

accessor array data fill generic javascript map node node-js nodejs set stdlib structure transform typed types

Last synced: 14 May 2026

https://github.com/pyrustic/litedao

Intuitive interaction with SQLite database

auto-init dao data database database-access library lightweight pyrustic python sql sqlite

Last synced: 09 May 2026

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling

Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.

classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier

Last synced: 01 Jun 2026

https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling

Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.

data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis

Last synced: 05 Mar 2025

https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset

This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.

arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting

Last synced: 23 Jun 2025

https://github.com/sofyan48/wahoo

Data stream library with kinesis

aws data data-stream event kinesis stream

Last synced: 14 May 2026

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project β€” documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/biril/audio-test-data

Audio data to use for testing

audio data mpeg test

Last synced: 11 Jan 2026

https://github.com/bala-1409/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning

Last synced: 26 Apr 2026

https://github.com/bala-1409/loan-classification-data-science-projects

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization

Last synced: 22 Mar 2025

https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions

Included in Serious SQL Virtual apprenticeship program, this repository contains solutions for all eight different case studies crafted by Danny Ma. For more information please visit: https://8weeksqlchallenge.com/

8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql

Last synced: 07 Apr 2025

https://github.com/afeiship/data-selection

Data structure for radio/checkbox-group.

checkbox data group radio

Last synced: 17 Jun 2025

https://github.com/shubhamsoni98/classification-with-random-forest---2

Fraud detection is a critical task for financial institutions and businesses. This document outlines the end-to-end process of predicting fraudulent activities using a Random Forest model. The process includes data preparation, exploration, model training, and evaluation.

algorithms anaconda data data-science dataflow feature-engineering jupyter-notebook machine-learning model modeltraining prediction python random-forest sql visualization

Last synced: 20 Jan 2026

https://github.com/goutam1511/real-time-covid-19-tracker-for-slack

This automated tracker tracks the spread of Covid-19 in a real time basis by scraping data from Ministry of Health and Family Welfare and notifies the same at Slack

covid-19 data python slack-bot web-scraping

Last synced: 30 Aug 2025

https://github.com/moons-14/datapot

Incorporate and serve all information.

ai aiogram api data infomation news newspaper rss video

Last synced: 04 Jan 2026

https://github.com/khansasafira19/sk-cool-storytelling

Source Code for Data Storytelling with HTML5

data html5 javascript storytelling

Last synced: 13 May 2026

https://github.com/agusk/ilmudata-book-excel-analytics

Hallo Microsoft Excel: Mastering Data Analytics

analytics data data-analytics excel power-query-editor

Last synced: 06 Jan 2026

https://github.com/luminati-io/linkedin-dataset-samples

Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.

data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping

Last synced: 17 Mar 2025

https://github.com/citizenlabsgr/data.world

Work with data sets prior to uploading to data.world

data data-structures

Last synced: 26 Mar 2025

https://github.com/gregoritsch3/project_excel_dataanalysis_carsales

An Excel Data Analysis project based on a vehicle vendor's car sales data from 2014 and 2015 showcasing data cleaning and formatting, DAX, pivot tables and charts, timelines, slicers, an interactive Dashboard, descriptive Statistics and more.

analysis dashboard data excel sales statistics

Last synced: 01 Feb 2026

https://github.com/kwame-mintah/ml-data-copy-to-aws-s3

Automatically copy new data to an AWS S3 bucket for Machine Learning.

aws aws-actions aws-s3 data

Last synced: 14 May 2026

https://github.com/lordzintick/spellcaster-api-1.21.4

A server-side Fabric mod to load JSON spell files from datapacks.

api api-server data fabric fabric-mod fabricmc json magic minecraft-mod server-side small spells

Last synced: 08 May 2026

https://github.com/ayresgneto/use-case-gcp-etl

ELT pipeline GCP. Tecnologias utilizadas: Postgresql, GCP Storage, Airflow (local), Pyspark (local), BigQuery

airflow big-data bigquery data data-engineering etl gcp pipeline postgresql programming-oriented-object pyspark python spark

Last synced: 03 Jan 2026

https://github.com/pooja-manjunatha/nyc_parking_violations_dbt

This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations

data data-analysis data-engineering dbt duckdb python sql

Last synced: 14 May 2026

https://github.com/bagustris/dataits

Web for DataITS17: Summer School on Data Science

data data-science

Last synced: 28 Jun 2025

https://github.com/valyaevgeorgiy/r_basic

Π Π°Π±ΠΎΡ‚Π° с основами срСды R ΠΈ Ρ‚Π΅ΠΌ самым изучСния Π½ΠΎΠ²ΠΎΠ³ΠΎ языка программирования, связанного нСпосрСдствСнно с Π°Π½Π°Π»ΠΈΠ·ΠΎΠΌ Π΄Π°Π½Π½Ρ‹Ρ… ΠΈ построСниСм Π³Ρ€Π°Ρ„ΠΈΠΊΠΎΠ² ΠΈ Π΄ΠΈΠ°Π³Ρ€Π°ΠΌΠΌ.

coding data data-analysis r rstudio

Last synced: 12 Dec 2025

https://github.com/yourdataarchitect/abyat-scaring-

This Scrapy spider for automates the extraction of product data from the Abyat website using Hidden Backend API, supporting both Arabic and English content.

data database scraper scrapy-crawler

Last synced: 23 Apr 2026

https://github.com/rudxain/xorsum

Get XOR checksum with this command-line tool

binary checksum cli data digest file files hexadecimal rust-crate xor

Last synced: 08 Mar 2026

https://github.com/cleanzr/cd

CD dataset for Entity Resolution

data linkage

Last synced: 10 Mar 2026

https://github.com/charlieroth/exoexplo

Exploring NASA Exoplanet Archive Data

data exoplanets julia nasa

Last synced: 03 Apr 2025

https://github.com/push-protocol/push-google-bigquery

The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis

bigquery data push push-notifications web3

Last synced: 26 Mar 2025

https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023

Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.

cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill

Last synced: 16 Aug 2025

https://github.com/RedInfinityPro/ScientificSharp

Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.

componentmodel cryptography data drawing forms generic linq system tasks text

Last synced: 30 Sep 2025

https://github.com/8hrsk/ranger

Package for generating fake userdata to work with.

data factory faker generator npm

Last synced: 30 Apr 2026

https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign

A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla December 2022. Helping dr.Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data anlysis is done using ROOT by Cern. Both C++ based. Job distributors to run the sim are coded in pearl

analysis architecture-design cplusplus data oop oop-principles pearl simulations

Last synced: 05 Sep 2025

https://github.com/ethenkem/PyGraphSurvey

A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc

data

Last synced: 30 Apr 2025

https://github.com/jonathanstowe/databulous

Abstraction for tabular data

data perl6 table tabular

Last synced: 02 Apr 2025

https://github.com/rllyhz/mini-data-center

This repo is to fulfill my internship assignment at the Office of Communication and Information (Kominfo), Balai Kota, Semarang, Indonesia

chartjs country-information data information-visualization laravel laravel-application

Last synced: 06 Nov 2025

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/mattpap/pycon-2017-bokeh

Bokeh tutorial at PyCon.PL 2017

bokeh data tutorial visualization

Last synced: 17 Mar 2025

https://github.com/jeugregg/deeplearningpicturedogs

Classify dogs pictures by Deep Learning CNN neural networks

classez-des-images cnn-keras data data-science ipynb neural-network vision

Last synced: 24 Jul 2025

https://github.com/luminati-io/zoominfo-dataset-samples

A sample dataset of over 1000 ZoomInfo companies, extracted using the Bright Data API, ideal for market growth, lead generation, and market analysis.

b2b business companies data data-extraction database dataset datasets web-scraping zoominfo

Last synced: 17 Mar 2025

https://github.com/par7133/xsltmaster

Dynamically load data from multiple XML/XSLT in webpages

data dynamic load webpages xml xslt

Last synced: 02 Mar 2025

https://github.com/eslamdyab21/apara-data-gui

Custom application for Apara's data wrangling scripts, Technologies used are Qt-designer, PyQt5 for the GUI and Pandas, Numpy for the data work.

csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui

Last synced: 17 May 2026

https://github.com/huspacy/huspacy-resources

Resources for building and evaluating huspacy

data huspacy

Last synced: 21 Mar 2025

https://github.com/jefking/copyblobs

Copies all files in a container to another container, in another storage account.

aci arm azcopy azure blob container copy data file files from instant move one-time simple storage sync template to transfer

Last synced: 27 Mar 2025

https://github.com/octoenergy/tentaclio-gdrive

A python project containing all the dependencies for the gdrive tentaclio schema

data

Last synced: 24 Jun 2025

https://github.com/akesling/csvb

Have CSV? Use CSVB!

analytics csv data database

Last synced: 02 Feb 2026

https://github.com/octoenergy/tentaclio-databricks

Module to give tentaclio support to databricks

data

Last synced: 24 Jun 2025

https://github.com/octoenergy/tentaclio-s3

A python project containing all the dependencies for s3 tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/octoenergy/tentaclio-athena

A python project containing all the dependencies for awsathena+rest tentaclio schema.

data

Last synced: 24 Jun 2025