An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kirkalyn13/xyz-books-pipeline

XYZ Books Pipeline to check and update incoming ISBNs from newly added books from the CRUD UI, and record new data to a CSV file.

api csv data go http rabbitmq

Last synced: 05 Mar 2025

https://github.com/eyluldursun/data-science-project

This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.

data data-science obesity r rmarkdown

Last synced: 26 Jun 2025

https://github.com/ssiarhei115/cv-dbase-analysis

HeadHunter CVs data base analysis

analysis cv data data-science resume

Last synced: 09 Apr 2025

https://github.com/ethenkem/PyGraphSurvey

A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc

data

Last synced: 30 Apr 2025

https://github.com/rrwen/poster-gisci-osmol

Conference poster and short paper titled "Outlier Detection in OpenStreetMap Data using the RandomForest Algorithm and Variable Contributions" for the GIScience Conference in 2016

2016 algorithm conference contribution data detection forest gis giscience learn machine open openstreetmap osm outlier paper poster random short variable

Last synced: 03 Apr 2025

https://github.com/rrwen/geohoods-to

Geospatial dataset of 1000+ aggregated variables for neighbourhoods in Toronto, ON, CA

csv data dataset geo geojson gis neighborhood neighborhoods neighbourhood neighbourhoods open open-data toronto toronto-open-data

Last synced: 25 Jun 2025

https://github.com/noorkhokhar99/text-to-speech-demo

Text to Speech Demo

data python roboflow

Last synced: 27 Mar 2025

https://github.com/codehard8/web-scrapping

In this repository we have provide a web scrapping project through beautifulSoup and related files

beutifulsoup data houses-for-sale python3 requests-library-python webscraping

Last synced: 01 Jul 2025

https://github.com/jonprice99/regional-election-analysis

An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.

data data-visualization election-analysis election-data pandas python

Last synced: 05 May 2026

https://github.com/huspacy/huspacy-resources

Resources for building and evaluating huspacy

data huspacy

Last synced: 21 Mar 2025

https://github.com/abshek7/big-data

A repository for documenting the learning related to theory and practical notes of big data computing.

big-data data data-engineering mapreduce pyspark

Last synced: 15 Jun 2025

https://github.com/akashlogics/street-data-tracking

Detect, Track and Count number of persons walking across the path(s) making use of YOLO. This Python project tracks people moving across predefined street zones

analysis data excel newdataset object-detection opencv python python3 yolo

Last synced: 19 May 2026

https://github.com/ahmad-mtr/prjkt_exam_schedule_test

I hate scrolling in a list of 300+ courses of my Uni exam schedule, so I'm creating this. this's a test btw :)

data strings-manipulation

Last synced: 11 Apr 2025

https://github.com/buildinamsterdam/contentful-graphql

Contentful GraphQL connection

contentful data graphql

Last synced: 05 Jan 2026

https://github.com/eslamdyab21/apara-data-gui

Custom application for Apara's data wrangling scripts, Technologies used are Qt-designer, PyQt5 for the GUI and Pandas, Numpy for the data work.

csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui

Last synced: 17 May 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/ezeparziale/analisis-data-delitos

:gun: Analsis de delitos de CABA

data data-science

Last synced: 14 Mar 2025

https://github.com/official-imvoiid/multifetch

A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection

aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows

Last synced: 19 May 2026

https://github.com/jonathanstowe/databulous

Abstraction for tabular data

data perl6 table tabular

Last synced: 02 Apr 2025

https://github.com/stdlib-js/array-base-assert-any-has-property

Test whether at least one element in a provided array has a specified property, either own or inherited.

any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate

Last synced: 07 May 2025

https://github.com/jefking/copyblobs

Copies all files in a container to another container, in another storage account.

aci arm azcopy azure blob container copy data file files from instant move one-time simple storage sync template to transfer

Last synced: 27 Mar 2025

https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial

Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI

dagster data data-orchestration kedro luigi mageai prefect

Last synced: 18 Apr 2026

https://github.com/randomgamingdev/randomgamingdev.github.io.data

The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)

blog custom data projects projects-list

Last synced: 02 Jan 2026

https://github.com/hamolicious/console-table

Displaying Tables in the console

console data pypi python table

Last synced: 11 Jul 2025

https://github.com/realbxnnie/accountservice

A Simple DataStoreService wrapper with session backuping and session locking.

data lua luau roblox

Last synced: 29 Jul 2025

https://github.com/mattpap/pycon-2017-bokeh

Bokeh tutorial at PyCon.PL 2017

bokeh data tutorial visualization

Last synced: 17 Mar 2025

https://github.com/cemoktra/data_series

time series handling

data lazy-evaluation time-series

Last synced: 29 Oct 2025

https://github.com/akesling/csvb

Have CSV? Use CSVB!

analytics csv data database

Last synced: 02 Feb 2026

https://github.com/nabilaagha/chest-x-ray-medical-diagnosis-using-deep-learning

This project uses deep learning to classify chest X-ray images for disease detection. It involves data preprocessing, pre-trained CNN models, and the ChestX-ray8 dataset to enhance medical diagnostics with AI.

computer-vision data data-processing deep-learning juypter-notebook medical-image-processing x-ray-images

Last synced: 15 Dec 2025

https://github.com/hivesolutions/crossline

Simple event pipping and storing infra-structure

counter data opencv warehouse

Last synced: 15 May 2026

https://github.com/andygeiss/pipeline-example

This is a basic example of using a pipeline in data science.

data data-pipeline data-science example go golang iris-dataset pipeline protobuf

Last synced: 17 Jul 2025

https://github.com/jorgeatgu/dataset-elecciones-28a

Datasets generados a partir del dataset de elecciones generales de El País

28a data elecciones2019 elections spain

Last synced: 16 May 2026

https://github.com/GAMELEIRA/studies-database

Esse repositório têm como objetivo alocar todo e qualquer script para aprender e praticar gerenciamento de banco de dados SQL e NoSQL. Nesse projeto, serão consolidados os principais fundamentos e princípios, além da prática de exercícios e desenvolvimento de projetos.

data database mongodb mssql mysql nosql sql

Last synced: 03 May 2025

https://github.com/dcmox/moxymapper

Data mapping made easy

data json mapper

Last synced: 15 May 2026

https://github.com/chocolateboy/corrigenda

Corrections, addenda, and deltas for data that's wrong on the Internet

addenda api corrections corrigenda data json json-data

Last synced: 27 Mar 2025

https://github.com/engineeringmadness/gaming-ai-analytics

Using Databricks to analyze game reviews from Steam web store

data databricks llama pyspark semantic-layer

Last synced: 15 May 2026

https://github.com/prernarohra/todo-webapp

Simple Todo App for practice.

axios css data fastapi html json python typescript

Last synced: 06 Apr 2026

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/md-emranhossen/leetcode-practice

This repository stores my solutions to LeetCode problems, organized by problem number and title.

cpp data datastructures-algorithms leetcode-solutions

Last synced: 26 Jun 2025

https://github.com/par7133/xsltmaster

Dynamically load data from multiple XML/XSLT in webpages

data dynamic load webpages xml xslt

Last synced: 02 Mar 2025

https://github.com/nonsignificantp/enfermedades-inmunoprevenibles

Analisis sobre el efecto de las vacunas y la incidencia de casos de enfermedades inmunoprevenibles en la Ciudad de Buenos Aires entre los años 1995 y 2016

a analysis argentina buenosaires data hepatitis science vaccination

Last synced: 18 Jun 2026

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/campiohe/geomask

A very simple lib for creating geometric masks from spatial data using regular grids.

climate data gis weather

Last synced: 30 Dec 2025

https://gitlab.com/sean-c/pdf_rules

Turn PDFs into CSVs by defining rules

Data Cleaning automation data data parsing

Last synced: 14 Apr 2025

https://github.com/eloyhere/semantic-java

Semantic-Java is a modern, maven Java stream processing framework with zero dependencies. It elegantly blends the fluency of Java Streams, the laziness of JavaScript generators, and intelligent index-based control inspired by database indexing — perfect for time-series, event streams, and high-performance data pipelines as a maven pendency.

data functional functional-programming java pipeline stream

Last synced: 07 Apr 2026

https://github.com/ayushman0511/data-warehouse-project1

A comprehensive guide to building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data data-ana data-anal data-cleaning data-enginee data-lakehou datalake datasci dataware datawarehouse datawarehousi etl etl-job etl-pipeline medallion sql sql-quer sql-query sql-server sqlserver

Last synced: 26 Jun 2025

https://github.com/majorcluster/clj-data-adapter

A Clojure library designed to convert data

clojure data lib library

Last synced: 12 Jul 2025

https://github.com/rrwen/twitter2return

Module for extracting Twitter data using option objects

access api data extract geo get location media oauth object option post rest return sample social stream token tweet twitter

Last synced: 03 Apr 2025

https://github.com/theanujsinha01/data-analytics-portal-

Data Analytics Portal Built a web-based data analytics tool using Streamlit, Pandas, and Plotly. Supported CSV and Excel uploads (up to 200MB) for data exploration. Features included statistical summaries, group-by aggregation, and frequency counts. Integrated interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.

analytics data portal

Last synced: 28 Apr 2026

https://github.com/dsietz/daas-workshop

Workshop for building a Data as a Service platform using the DaaS SDK.

archconf daas daas-pattern data dataprivacy nfjs rust rust-lang

Last synced: 20 May 2026

https://github.com/maulanakavaldo/tri-hita-karana

Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.

bali data data-science g20 science

Last synced: 02 Mar 2025

https://github.com/bcodmo/workshop_bios_oceanographic_data

Repository holding lesson on Data Management Basics. See webpage for rendered view: https://bcodmo.github.io/workshop_bios_oceanographic_data/

bco-dmo data datamanagement fair workshop

Last synced: 08 Apr 2026

https://github.com/vijaykumar1303/sales-data-analysis-and-dashboard-development

To analyze sales data to uncover insights into sales performance, trends, and patterns, and to develop an interactive dashboard that provides a comprehensive view of sales metrics and KPIs.

data dataanalysis datacleaning datavisualisation dax-query powerbi powerquery sql sqldataanalysis

Last synced: 11 Feb 2026

https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi

This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.

data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard

Last synced: 18 Feb 2026

https://github.com/theduardomaciel/cc-pe

Conteúdos, scripts em R e datasets utilizados durante a matéria de Probabilidade e Estatística.

data probability r statistics

Last synced: 27 Mar 2025

https://github.com/alex0x4b/akutils

High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).

data dataframe pandas python

Last synced: 08 Mar 2026

https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses

This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection, automated data cleansing.

csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization

Last synced: 13 May 2025

https://github.com/skygenesisenterprise/aether-calendar

Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications calendar capacitorjs data javascript linux macos nextjs typescript windows

Last synced: 12 Apr 2026

https://github.com/wolfchamane/amjs-data-types

Data types for your OOP javascript project

cjs data javascript modules nodejs oop types

Last synced: 20 May 2026

https://github.com/gui-sitton/prepaid

In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 22 May 2026

https://github.com/circlexo/circlexo

Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.

bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor

Last synced: 20 May 2026

https://github.com/shimul-zahan/all-practices-tukitaki

This is repository for all the practice tasks or learning new things. Cause environment are setup and no need to setup a new project or environments.

data data-science datapreprocessing deep-learning machine-learning neural-network practice python visualization

Last synced: 12 Jan 2026

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/ranjeetj06/insighthub

InsightHub is a data analytics project that helps automate the entire process of preparing, analyzing, and reporting on CSV data.

analysis begineer data springboot

Last synced: 17 May 2026

https://github.com/heshamalsaqqaf2/python-projects

Beginner Level Python Projects

data python3

Last synced: 22 Jul 2025

https://github.com/domarps/grad-project-reports

Write-ups of a few key semester-long projects I have worked during my Masters

circuit data deeplearning graph-algorithms matlab question-answering

Last synced: 26 Mar 2025

https://github.com/ellisvalentiner/legislation-embeddings

Embeddings for U.S. Congress legislation

data embeddings machine-learning nlp python

Last synced: 12 Aug 2025

https://github.com/clagiordano/marketplaces-data-export

LIbrary that share the same interface and provide adapters for online marketplaces services

adapter amazon api clagiordano data ebay ebay-api export marketplaces mws mws-api rest soap

Last synced: 22 Mar 2025

https://github.com/luminati-io/linkedin-dataset-samples

Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.

data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping

Last synced: 17 Mar 2025

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/jor-/measurements

Python functions to handle, statistically analyze and plot measurement data.

data measurements python

Last synced: 17 Mar 2025

https://github.com/prcharan592/olympic-insights-historical-data-analytics-in-r

This project analyzes 120 years of Olympic history (1896–2016), uncovering trends and insights from the data

data data-analytics data-science data-visualization kaggle r-programming

Last synced: 03 Apr 2025

https://github.com/amethyst-php/setting

Give the user the ability to configure his own settings

amethyst amethyst-package api data laravel setting

Last synced: 19 May 2026

https://github.com/flowsynx/plugin-sqlite

FlowSynx plugin to enables data access and manipulation on SQLite databases.

data database flowsynx sql sqlite

Last synced: 08 May 2026

https://github.com/rameshaditya/dynamic-hybrid-data-grid

Facilitates faster read-and-write of large ordered collections of data.

algorithms data data-structures storage

Last synced: 30 Jun 2026

https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning

Best Master’s Programs in Machine Learning (ML) for 2021 These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning

ai api data education project school

Last synced: 17 Jun 2025