An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/gabboraron/datacamp_projects

Here you can find my DataCamp Projects

data datacamp datacamp-projects

Last synced: 14 Jun 2026

https://github.com/wciesialka/top-names

A Python module for scraping the list of top first names in the United States.

data python python3

Last synced: 08 Jun 2026

https://github.com/fridex/real-estate

My machine learning in real estate

data machine-learning real-estate

Last synced: 27 Jun 2025

https://github.com/radekbednarik/att

Python wrapper for calling Apitalks API.

api-wrapper apitalks data python3 rest-api wrapper

Last synced: 05 Apr 2025

https://github.com/csmith0651/ormy

A simple python ORM.

data database python

Last synced: 13 May 2026

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/injamul3798/cpp_stl-discussion

As we know ,STL is mostly used tools is competitive programming.

data list map set structure vector

Last synced: 02 Apr 2025

https://github.com/4ment/aiv-rate-heterogeneity

Avian influenza virus data sets

data influenza

Last synced: 24 Jan 2026

https://github.com/thesfinox/sql-simple-backup

Simple script to backup data in a MySQL database and store it in a WebDAV server.

backup bash data mysql script sql webdav

Last synced: 18 Apr 2026

https://github.com/talitalobo/statistics-with-python

Repo about statistical concepts and (not always) their python implementation.

data data-science machine-learning statistics

Last synced: 11 Jan 2026

https://github.com/emna-chebbi/student-performance

Predictive model for student exam scores based on student performance factors

ai computer-vision data kaggle machine-learning ml mse regression regression-models

Last synced: 15 May 2026

https://github.com/purarue/blizzard_gdpr_parser

Parses date-related information from my blizzard GDPR export.

blizzard data gdpr webscraping

Last synced: 06 Apr 2025

https://github.com/purarue/hpi-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 06 Apr 2025

https://github.com/renebentes/2808

Curso 2808 - Fundamentos do Entity Framework

course csharp data ef-core

Last synced: 27 Jun 2025

https://github.com/lakshyakumar266/jee-dpp-manager-app

DPP manager app for JEE preparing Students

data expo javascript management react-native

Last synced: 07 May 2026

https://github.com/mai-space/design-concept-sharing-recipes

🖼️ Concept for a framework based on state of the art technology and libaries for secure data sharing and online collaboration, as well as focus on the ux and ui of said framework

concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories

Last synced: 14 May 2025

https://github.com/jph5396/sumomodel

A data models related to sumo wrestling.

data go sumo

Last synced: 17 Jan 2026

https://github.com/gagolews/clustering-data-v0

Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]

clustering data dataset machine-learning

Last synced: 15 Sep 2025

https://github.com/moscatellimarco/webscrap-imdb

🎬 Python scraper for IMDB: Extract movie/TV details for 📊 analysis & 🗃️ storage. Easy setup, 🔧 customizable, with 🖥️ CLI.

css data datascience html movies python scrapy scrapy-crawler scrapy-spider web web-scraping webdata webscraping

Last synced: 15 May 2026

https://github.com/miss-mhv/data-analysis-for-social-buzz

In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.

data jupyter-notebook python

Last synced: 14 May 2026

https://github.com/canadaluke888/terminaltablebuilder

Build and edit tabular data all from the terminal.

cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables

Last synced: 20 Apr 2026

https://github.com/shysolocup/fndt

JavaScript package allowing you to see function data like body and arguments from outside of the function

aepl data fndt functions javascript javascript-tools js js-function js-functions lightweight nodejs nodejs-modules package stews

Last synced: 30 Apr 2026

https://github.com/parmsam/rweekly.data

R package containing data on Rweekly posts

data package rweekly

Last synced: 21 May 2026

https://github.com/rsc-labs/see-open-data

Show www.dane.gov.pl in user friendly format. Generate flourish data or other data visualizations.

data data-visualization flourish government poland

Last synced: 04 Apr 2025

https://github.com/hackolade/yugabytedb-ysql

Hackolade(https://hackolade.com) plugin for the Cloud Native Yugabyte database with YSQL API

data data-modeling entity-relationship-diagram schema-design ysql yugabyte yugabytedb

Last synced: 30 Apr 2025

https://github.com/rajlabmssm/echodata

echoverse module: Example data.

data echoverse fine-mapping genomics gwas qtl

Last synced: 17 Jan 2026

https://github.com/jitsasmal/customer-purches-behavior-and-shopping-analysis

Create dashboard to analyse the data based to total product sales, terget, revenue, state and season wize analyse to show the current treand the data.

analytics dashboard data etl powerbi

Last synced: 14 Feb 2026

https://github.com/hivesolutions/repos

Modular repository management system

data python repos storage system

Last synced: 14 May 2026

https://github.com/badawy403/egy.list

A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.

city data egypt javascript nodejs npm package

Last synced: 08 Mar 2026

https://github.com/luminati-io/linkedin-dataset-samples

Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.

data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping

Last synced: 17 Mar 2025

https://github.com/indhra/cats-ijcnn-data-2004

CATS IJCNN Data 2004 Competition of Artificial Time Series

2004 artificial cats data ijcnn time-series

Last synced: 22 Mar 2025

https://github.com/ioboi/obloc-data

Scrape guest counter of O'BLOC 🧗‍♀️

data scraping

Last synced: 04 Nov 2025

https://github.com/kuanjiahong/covid19-analysis

A simple project to familiarize myself with data analysis

data data-science data-visualization pandas python

Last synced: 02 Apr 2025

https://github.com/ntnn/dataparse

Parsing, transforming and unmarshalling data.

data data-parser data-parsing data-transformation golang golang-lib

Last synced: 01 Apr 2025

https://github.com/stdlib-js/array-base-fill-by

Fill all elements within a portion of an array according to a callback function.

accessor array data fill generic javascript map node node-js nodejs set stdlib structure transform typed types

Last synced: 14 May 2026

https://github.com/dms-codes/scrape_tripsantai

Trip Santai Tour Data Scraper This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.

beautifulsoup4 data python requests scraper webscraper

Last synced: 21 May 2026

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/bfontaine/datatools

:triangular_ruler: Some scripts I use to work with data

data ruby script

Last synced: 23 Jul 2025

https://github.com/sofyan48/wahoo

Data stream library with kinesis

aws data data-stream event kinesis stream

Last synced: 14 May 2026

https://github.com/cleanzr/cd

CD dataset for Entity Resolution

data linkage

Last synced: 10 Mar 2026

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/omari-kd/recommendation-system-analysis-and-modelling

This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.

data data-science data-science-in-r machine-learning-algorithms recommendation-system

Last synced: 08 Jan 2026

https://github.com/j-hagedorn/locals

:globe_with_meridians: A collection of tidied, neighborhood-level public datasets

address-dataset census-data census-tract data neighborhood social-sciences

Last synced: 03 Feb 2026

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/ims94/ballerina-tsv-querying

An example Ballerina project to query tsv data using Ballerina language integrated queries

ballerina ballerina-lang data olympics query sql

Last synced: 03 Feb 2026

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/biril/audio-test-data

Audio data to use for testing

audio data mpeg test

Last synced: 11 Jan 2026

https://github.com/zanuarts/datamining

Repo Matkul Data Mining

data data-mining

Last synced: 14 Mar 2025

https://github.com/vedantwalia/google-data-analytics-capstone-case-study

This is a repository of my work on data analysis as a part of the Google Data Analytics Capstone

bigquery data data-viz datavisualization-project divvy-bikes google googledataanalytics sql tableau tableau-public

Last synced: 02 Jan 2026

https://github.com/interzoid/typescript-examples

Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.

angular api cloud data database matching nodejs quality typescript

Last synced: 12 Jan 2026

https://github.com/interzoid/php-examples

Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.

api cloud data database php quality

Last synced: 12 Jan 2026

https://github.com/afeiship/data-selection

Data structure for radio/checkbox-group.

checkbox data group radio

Last synced: 17 Jun 2025

https://github.com/cody-scott/arclint

A flexible tool to validate and improve your data in ArcGIS using regex and other methods

arcgis arcgispro data lint regex validation

Last synced: 14 May 2025

https://github.com/goutam1511/real-time-covid-19-tracker-for-slack

This automated tracker tracks the spread of Covid-19 in a real time basis by scraping data from Ministry of Health and Family Welfare and notifies the same at Slack

covid-19 data python slack-bot web-scraping

Last synced: 30 Aug 2025

https://github.com/rickstaa/ai-compute-visualizer

A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.

ai data livepeer streamlit

Last synced: 28 Jun 2025

https://github.com/agusk/ilmudata-book-excel-analytics

Hallo Microsoft Excel: Mastering Data Analytics

analytics data data-analytics excel power-query-editor

Last synced: 06 Jan 2026

https://github.com/citizenlabsgr/data.world

Work with data sets prior to uploading to data.world

data data-structures

Last synced: 26 Mar 2025

https://github.com/analyticslover/salifort-motors-turnover-project

The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.

data data-analysis datamodeling eda machine-learning pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/entitizer/data-js

Entitizer data module

data entitizer storage

Last synced: 25 Jan 2026

https://github.com/The-Tech-Idea/Beep.winform.Sample

Application for Managing your Different DataSources . Still in Alpha.please be patient

application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows

Last synced: 04 Nov 2025

https://github.com/karensaraimoralesmontiel/8-week-sql-challenge

Case Studies Solutions for the 8-Week-SQL-Challenge.

data database sql

Last synced: 02 Jan 2026

https://github.com/kwame-mintah/ml-data-copy-to-aws-s3

Automatically copy new data to an AWS S3 bucket for Machine Learning.

aws aws-actions aws-s3 data

Last synced: 14 May 2026

https://github.com/aiwithqasim/p1_explore-weather-trends

In this project, I'll analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. Moreover i will use SQL query to extract data from the given Data base and i have to visualize the insight or Average temperature to find the findings.

data dataanalyst database datavisualization nanodegree udacity

Last synced: 22 May 2026

https://github.com/rickyarians/practical-statistic-car-emission

Practical Statistic Project- Car Emission in Canada - 2022

data data-science dataanalysis r rmarkdown rpubs statistics

Last synced: 22 May 2026

https://github.com/iamyourdre/naive-bayes-classifier-js

Naive Bayes classifier developed with MySQL, ExpressJS, and NodeJS by @iamyourdre.

backend data data-science expressjs javascript mysql naive-bayes naive-bayes-algorithm naive-bayes-classifier nodejs

Last synced: 08 Apr 2026

https://github.com/iyashwantsaini/tweetify_

Twitter Data Collection, Analysis Tool

collection data twitter twitter-sentiment-analysis

Last synced: 08 Mar 2026

https://github.com/pooja-manjunatha/nyc_parking_violations_dbt

This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations

data data-analysis data-engineering dbt duckdb python sql

Last synced: 14 May 2026

https://github.com/mobinx/easymeet-js

EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything

data meeting react realtime screensharing streaming-video webrtc zoom

Last synced: 03 Jan 2026

https://github.com/merrill007/sql-data-warehouse-project

The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.

architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver

Last synced: 22 Mar 2025

https://github.com/valyaevgeorgiy/r_basic

Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.

coding data data-analysis r rstudio

Last synced: 12 Dec 2025

https://github.com/richelbilderbeek/heyahmama

Data about the Flemish/Dutch band K3

band data k3 package r r-lang r-language

Last synced: 22 May 2026

https://github.com/charlieroth/exoexplo

Exploring NASA Exoplanet Archive Data

data exoplanets julia nasa

Last synced: 03 Apr 2025

https://github.com/kirkalyn13/xyz-books-pipeline

XYZ Books Pipeline to check and update incoming ISBNs from newly added books from the CRUD UI, and record new data to a CSV file.

api csv data go http rabbitmq

Last synced: 05 Mar 2025

https://github.com/push-protocol/push-google-bigquery

The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis

bigquery data push push-notifications web3

Last synced: 26 Mar 2025

https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023

Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.

cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill

Last synced: 16 Aug 2025

https://github.com/RedInfinityPro/ScientificSharp

Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.

componentmodel cryptography data drawing forms generic linq system tasks text

Last synced: 30 Sep 2025

https://github.com/realbxnnie/accountservice

A Simple DataStoreService wrapper with session backuping and session locking.

data lua luau roblox

Last synced: 29 Jul 2025

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/8hrsk/ranger

Package for generating fake userdata to work with.

data factory faker generator npm

Last synced: 30 Apr 2026

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign

A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla December 2022. Helping dr.Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data anlysis is done using ROOT by Cern. Both C++ based. Job distributors to run the sim are coded in pearl

analysis architecture-design cplusplus data oop oop-principles pearl simulations

Last synced: 05 Sep 2025

https://github.com/alex0x4b/akutils

High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).

data dataframe pandas python

Last synced: 08 Mar 2026

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/ethenkem/PyGraphSurvey

A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc

data

Last synced: 30 Apr 2025