An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/bho0920/crime-data-analysis-eu

Crime Data Analysis for Self-Defense Tool Market Entry in the EU.

data data-analysis sql sqlite tableau

Last synced: 21 Jun 2025

https://github.com/jcloh98/rental-property-finder

A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.

data headless-browser node scraper scraping web

Last synced: 17 May 2026

https://github.com/codehard8/web-scrapping

This repository provides a web scraping project built with BeautifulSoup, along with related files.

beutifulsoup data houses-for-sale python3 requests-library-python webscraping

Last synced: 01 Jul 2025

https://github.com/akashlogics/street-data-tracking

Detects, tracks, and counts people walking across one or more paths using YOLO. This Python project tracks people moving across predefined street zones.

analysis data excel newdataset object-detection opencv python python3 yolo

Last synced: 19 May 2026

https://github.com/jonprice99/regional-election-analysis

An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.

data data-visualization election-analysis election-data pandas python

Last synced: 05 May 2026

https://github.com/buildinamsterdam/contentful-graphql

Contentful GraphQL connection

contentful data graphql

Last synced: 05 Jan 2026

https://github.com/abshek7/big-data

A repository documenting theory and practical notes on big data computing.

big-data data data-engineering mapreduce pyspark

Last synced: 15 Jun 2025

https://github.com/ahmad-mtr/prjkt_exam_schedule_test

I hate scrolling through a list of 300+ courses in my uni exam schedule, so I'm creating this. This is a test, btw :)

data strings-manipulation

Last synced: 11 Apr 2025

https://github.com/merekat/hb-passiv-income

A calculator that uses historical data for various assets to estimate the passive income a user can expect, depending on their inputs.

assets data datajournalism etf passive-income treasury

Last synced: 19 Jul 2025

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Analysis of how the pandemic affected bicycle use in CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/ezeparziale/analisis-data-delitos

:gun: Analysis of crime in CABA

data data-science

Last synced: 14 Mar 2025

https://github.com/official-imvoiid/multifetch

A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection

aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows

Last synced: 19 May 2026

https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial

Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI

dagster data data-orchestration kedro luigi mageai prefect

Last synced: 18 Apr 2026

https://github.com/randomgamingdev/randomgamingdev.github.io.data

The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)

blog custom data projects projects-list

Last synced: 02 Jan 2026

https://github.com/rllyhz/mini-data-center

This repo is to fulfill my internship assignment at the Office of Communication and Information (Kominfo), Balai Kota, Semarang, Indonesia

chartjs country-information data information-visualization laravel laravel-application

Last synced: 06 Nov 2025

https://github.com/nimomach/amazon-sales-data

A small dataset containing an analysis of Amazon sales data for a few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/hivesolutions/crossline

Simple event piping and storage infrastructure

counter data opencv warehouse

Last synced: 15 May 2026

https://github.com/GAMELEIRA/studies-database

This repository aims to hold any and all scripts for learning and practicing SQL and NoSQL database management. The project consolidates the main fundamentals and principles, along with practice exercises and project development.

data database mongodb mssql mysql nosql sql

Last synced: 03 May 2025

https://github.com/dcmox/moxymapper

Data mapping made easy

data json mapper

Last synced: 15 May 2026

https://github.com/mysociety/sync-ep-to-jkan

Syncs EveryPolitician data to mySociety's data portal.

data everypolitician jkan politicians

Last synced: 27 Jul 2025

https://github.com/md-emranhossen/leetcode-practice

This repository stores my solutions to LeetCode problems, organized by problem number and title.

cpp data datastructures-algorithms leetcode-solutions

Last synced: 26 Jun 2025

https://github.com/nonsignificantp/enfermedades-inmunoprevenibles

Analysis of the effect of vaccines and the incidence of cases of vaccine-preventable diseases in the City of Buenos Aires between 1995 and 2016

a analysis argentina buenosaires data hepatitis science vaccination

Last synced: 18 Jun 2026

https://github.com/engineeringmadness/gaming-ai-analytics

Using Databricks to analyze game reviews from Steam web store

data databricks llama pyspark semantic-layer

Last synced: 15 May 2026

https://github.com/jeugregg/deeplearningpicturedogs

Classify dog pictures with deep learning convolutional neural networks (CNNs)

classez-des-images cnn-keras data data-science ipynb neural-network vision

Last synced: 24 Jul 2025

https://github.com/prernarohra/todo-webapp

Simple Todo App for practice.

axios css data fastapi html json python typescript

Last synced: 06 Apr 2026

https://github.com/ayushman0511/data-warehouse-project1

A comprehensive guide to building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data data-ana data-anal data-cleaning data-enginee data-lakehou datalake datasci dataware datawarehouse datawarehousi etl etl-job etl-pipeline medallion sql sql-quer sql-query sql-server sqlserver

Last synced: 26 Jun 2025

https://github.com/majorcluster/clj-data-adapter

A Clojure library designed to convert data

clojure data lib library

Last synced: 12 Jul 2025

https://github.com/webianks/anotech-android

Android application that deals with various anomalous behaviours occurring in server data.

anomaly-detection data server

Last synced: 13 Apr 2025

https://github.com/dsietz/daas-workshop

Workshop for building a Data as a Service platform using the DaaS SDK.

archconf daas daas-pattern data dataprivacy nfjs rust rust-lang

Last synced: 20 May 2026

https://github.com/rrwen/twitter2return

Module for extracting Twitter data using option objects

access api data extract geo get location media oauth object option post rest return sample social stream token tweet twitter

Last synced: 03 Apr 2025

https://github.com/bcodmo/workshop_bios_oceanographic_data

Repository holding lesson on Data Management Basics. See webpage for rendered view: https://bcodmo.github.io/workshop_bios_oceanographic_data/

bco-dmo data datamanagement fair workshop

Last synced: 08 Apr 2026

https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi

This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.

data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard

Last synced: 18 Feb 2026

https://github.com/noraui/noraui-datas-webservices

noraui-datas-webservices is a RESTdataProvider for NoraUi

data noraui rest-api service spring-boot-2 spring-boot-actuator

Last synced: 17 Mar 2025

https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses

This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection and automated data cleansing.

csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization

Last synced: 13 May 2025

https://github.com/theanujsinha01/data-analytics-portal-

Data Analytics Portal: a web-based data analytics tool built with Streamlit, Pandas, and Plotly. Supports CSV and Excel uploads (up to 200MB) for data exploration. Features include statistical summaries, group-by aggregation, and frequency counts. Integrates interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.

analytics data portal

Last synced: 28 Apr 2026

https://github.com/wolfchamane/amjs-data-types

Data types for your OOP javascript project

cjs data javascript modules nodejs oop types

Last synced: 20 May 2026

https://github.com/circlexo/circlexo

Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.

bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor

Last synced: 20 May 2026

https://github.com/shimul-zahan/all-practices-tukitaki

A repository for all practice tasks and for learning new things. Because the environment is already set up, there is no need to set up a new project or environment.

data data-science datapreprocessing deep-learning machine-learning neural-network practice python visualization

Last synced: 12 Jan 2026

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/heshamalsaqqaf2/python-projects

Beginner Level Python Projects

data python3

Last synced: 22 Jul 2025

https://github.com/peternaydenov/data-pool

Data layer for node apps and single page applications

cache data store

Last synced: 29 Apr 2025

https://github.com/clagiordano/marketplaces-data-export

Library that shares a common interface and provides adapters for online marketplace services

adapter amazon api clagiordano data ebay ebay-api export marketplaces mws mws-api rest soap

Last synced: 22 Mar 2025

https://github.com/cmutel/jester

Import data from the olca-schema JSON-LD format into the HESTIA JSON-LD schema

agriculture data json-ld life-cycle-assessment ontology

Last synced: 26 Jul 2025

https://github.com/tomasfarias/louis

Yet another challenge project

challenge data python

Last synced: 29 Mar 2025

https://github.com/yorkearwaker/data

Data things; representation, transformation, pipelines, governance,

actuality data epistemology information knowledge ontology

Last synced: 07 Apr 2025

https://github.com/yagoluiz/enem-analise-extracao

[PT-BR] Extraction and analysis of performance data for the Centro-Oeste (Central-West) region

analysis data extraction python3 r

Last synced: 17 Apr 2026

https://github.com/tks18/xl-pq-handler

A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation

analytics automation data excel power-query powerbi python xlwings

Last synced: 20 Jan 2026

https://github.com/poissonconsulting/klexdatr

An R package of data from the Kootenay Lake Exploitation Study

cran data fish kootenay-lake rstats

Last synced: 16 Oct 2025

https://github.com/alecxcode/table-parser

Python Table Parser (data extraction)

automation data extraction python robotic-process-automation

Last synced: 04 May 2026

https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper

A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 27 Sep 2025

https://github.com/bdr-pro/streamlint

Ultra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files

data streamlit

Last synced: 14 Apr 2026

https://github.com/machinecyc/lotteryinsight

Uses a crawler to collect Taiwan Lotto data and save it to a local MySQL server.

crawler data docker lottery mysql-database python3 taiwan

Last synced: 09 May 2026

https://github.com/jun-labs/jq

🧷 Let's practice jq.

data jq json json-data parse

Last synced: 27 Sep 2025

https://github.com/vanduc1102/parse-stackoverflow-data

Parse stackoverflow data

data parser stackoverflow

Last synced: 16 Oct 2025

https://github.com/snimmagadda1/luigi-etl-example

🔍 Example of an ETL pipeline using Spotify's Luigi

data luigi luigi-pipeline python spotify

Last synced: 30 Mar 2025

https://github.com/lexiortiz/advanced-data-analytics

Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.

data data-analysis data-engineering google python-3 sql

Last synced: 29 May 2026

https://github.com/mat06mat/matbot

My discord bot code

data discord-bot discord-py py-cord

Last synced: 17 Oct 2025

https://github.com/skygenesisenterprise/aether-meet

Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications data docker javascript meeting nextjs notes typescript voip

Last synced: 01 May 2026

https://github.com/ronknight/user-data-dashboard

📈 A data visualization tool for analyzing user data using an Excel-based data source.

dashboard data excel ga4 screenshot

Last synced: 17 Oct 2025

https://github.com/enoch208/eventmaster

A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.

automation data key keylogging-python replay spy tools

Last synced: 01 Jun 2026

https://github.com/analyst-amitbisht/pizza-sales-report-

It's a guided project for practicing tools such as SSMS and Power BI, as well as skills such as data cleaning, data exploration, data analysis, and data visualization.

analytics data data-visualization powerbi sql-server

Last synced: 18 Oct 2025

https://github.com/pbinkley/mfmcollections

Project to distill data about published collections of microfilms from library lists

data research retro

Last synced: 28 May 2026

https://github.com/meokullu/colorizenumber

ColorizeNumber - Bodrum Papatya visualizes numeric data as colors to create an image.

color colorize colors data data-visualization visualization vizualize-data

Last synced: 01 Jun 2026

https://github.com/psgebeline/harvard-data-science

My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.

data linear-regression machine-learning modeling probability-theory r visualization wrangling

Last synced: 19 Oct 2025

https://github.com/anct-cartographie-nationale/mednum-cli

✨ Command-line interface for transforming data on digital mediation locations, collected in a non-standard format, into the mednum schema and publishing it on data.gouv

anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation

Last synced: 02 Aug 2025

https://github.com/scjoaoantonio/trab_datascience

This project aims to analyze posts from the Bluesky social network. The interactive application was developed using Streamlit and supports data collection and visualization, as well as advanced analyses such as engagement prediction, topic modeling, and sentiment analysis.

bluesky data data-science streamlit

Last synced: 09 May 2026

https://github.com/plurid/delog

Cloud Service for Centralized Logging

cloud data logging

Last synced: 08 Nov 2025

https://github.com/bhojpur/dlm

The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.

data lifecycle-management

Last synced: 19 Feb 2026

https://github.com/erencelik/binance-public-data-node

Nodejs downloader and unzipper script for Binance Public Data

binance data downloader nodejs public script

Last synced: 15 May 2026

https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson

UK "Counties" Extracted from Office for National Statistics data

data geojson maps uk

Last synced: 29 Mar 2025

https://github.com/plurid/datasign

Single Source of Truth Data Contract Specifier

data file-format

Last synced: 08 Nov 2025

https://github.com/nushratjabenaurnima/cse_477_data_mining

A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.

data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3

Last synced: 09 Apr 2026

https://github.com/terracrow/tml

Easy to use data manipulation package using YAML.

data database db node npm tml yml

Last synced: 26 Feb 2025

https://github.com/cemc-oper/nmc-typhoon-db-client

A CLI client for NMC Typhoon Database.

data database-client nmc

Last synced: 01 Jun 2026

https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup

Scraped a popular tech gadgets website using Selenium and BeautifulSoup, then performed data analysis on the scraped data.

beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping

Last synced: 24 Feb 2026

https://github.com/linguini1/edueval

The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.

ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis

Last synced: 01 May 2026

https://github.com/companyakis/financial-data

Financial Data & Python

data finance python

Last synced: 29 Jun 2025

https://github.com/nmelgar/birthday_sports_dataviz

We will analyze how the Matthew Effect has influenced professional sports players.

analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau

Last synced: 08 Jan 2026

https://github.com/mohibmirza-py/email-verifier-script

Streamlit app to verify emails in bulk

ai analysis data streamlit

Last synced: 29 Apr 2026