An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/viniddev/active_finance

Nesse projeto busquei solucionar um problema corriqueiro que é a dificuldade de se manter atualizado sobre as variações do mercado de ações e fundos imobiliários. Usei selenium webdriver para buscar informações e uma API do Telegram para enviar relatórios para o usuário

automation data data-analisis rpa selenium-webdriver telegram-bot

Last synced: 03 May 2026

https://github.com/charityeverett/gobackfetchit

Award Winning WebXR Data Journalism Storytelling Project

3d aframe ar css data html html-css-javascript nodejs visuzalization vr webxr xr

Last synced: 03 May 2026

https://github.com/tn3w/moviedb-json

A JSON library with 981,530 films.

data database db json movie movie-database movies

Last synced: 03 May 2026

https://github.com/ghufranbarcha/company-account-analyzer

This project is a Streamlit application designed to visualize and analyze client data. It includes interactive features for exploring client-specific metrics, generating plots, and viewing distribution charts.

data data-science pandas streamlit visualization

Last synced: 03 May 2026

https://github.com/qrailibs/dataflow

✨ Data processing in Node.js made multithreaded and type-safe.

data dataprocessing multithread node

Last synced: 04 May 2026

https://github.com/fallaciousreasoning/nz-mountains

A list of mountains in NZ, scraped from https://climbnz.org.nz

alpine climbing climbnz data json json-api maps mountaineering scraping

Last synced: 04 May 2026

https://github.com/soham7998/data-analysis-projects

My Data Analysis Projects which are completed by me and gain a hands on Experience from each project. the project showcase different Concepts , Visualization and many things.

data data-analysis data-science machine-learning nlp python soham visualization

Last synced: 04 May 2026

https://github.com/maxwelllzh/gis-tutorial-

Tutorials for Columbia University GIS Club

data python

Last synced: 04 May 2026

https://github.com/rabeal21/tea

Generate random TEA wallet addresses in bulk with this simple utility. Perfect for testing and exploring the TEA blockchain. 🌱💻

bucklescript bucklescript-tea chinese-translation cli data earlgrey educators hacking ios-automation ios-test ocaml peer-evaluations php red-team teachyourselfcs test-framework translation tui

Last synced: 04 May 2026

https://github.com/sjg/my-search-story

My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI

data docker generative-ai google-cloud-platform google-cloud-run nodejs

Last synced: 04 May 2026

https://github.com/jdanielgoh/cobertura-campanias

En una democracia ¿caben todas las voces? Proyecto para visualizar el monitoreo de radio y TV que realiza el INE de las candidaturas presidenciales 2024

d3js data datavisualization vue

Last synced: 09 Jun 2026

https://github.com/a-poor/datatransform.jl

A package for defining (and performing) tabular-data transformations with JSON.

data data-science data-transformation etl feature-engineering json julia julia-package tabular-data

Last synced: 05 May 2026

https://github.com/rrwen/py-examples

Collection of python examples in each branch

beginner data download excel guide introduction links processing python reference spreadsheet url xls

Last synced: 05 May 2026

https://github.com/edjoukou/pizza-sales-report

A data analysis project using SQL with MySQL database

analysis data mysql powerbi visualization

Last synced: 05 May 2026

https://github.com/rodgeraraujo/open-dataverse

OpenDataverse: ETL application to filter and import open data from https://dados.ifpb.edu.br/ save on database, and exported via a Rest API.

data dataset dataverse flask ifpb pandas python

Last synced: 05 May 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/chanchalsoorma/web-scraping

This repo aims to provide a straightforward, easy-to-use scraping code written in Python.

beautifulsoup beautifulsoup4 data python request selenium webscraping

Last synced: 05 May 2026

https://github.com/mito-ds/mitosheet_helper_config

The mitosheet_helper_config package used by enterprises to configure the mitosheet package.

data data-analytics data-science data-visualization jupyter pandas python

Last synced: 05 May 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/shibbbbs/fastapi_project

A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs

data dataanalysis fastapi pandas python

Last synced: 06 May 2026

https://github.com/ksm26/ml-ai-data-science-jobs-in-canada

Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.

ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics

Last synced: 06 May 2026

https://github.com/poode/firebase-modeling

Get firebase/firestore entity model to migrate to mongo or any db later

data database firebase firestore modeling schema

Last synced: 06 May 2026

https://github.com/tadiusfrank2001/pythonprojects

Compilation of Some Fun Introduction to Python Lab Coding Projects introducing the foundamentals of data science, databases, and pythonlibraries

data data-science databases gamedesign python pythonlibrarires sorting-algorithms sqlite string-manipulation

Last synced: 06 May 2026

https://github.com/jbn/vaquero

A Python library for iterative and interactive data wrangling at laptop-scale.

data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework

Last synced: 10 Jun 2026

https://github.com/rrwen/twitter2mongodb-cli

Command line tool for extracting Twitter data to MongoDB databases

api cli cmd command data database get interface line mdb media mongo mongod mongodb post social stream tool tweet twitter

Last synced: 06 May 2026

https://github.com/ekoepplin/dbt-bigquery-core

How to get data to BigQuery (or duckDB) and setup dbt tests for SODA cloud monitoring

bigquery data data-quality dbt dlt duckdb gcp soda

Last synced: 06 May 2026

https://github.com/xljones/bugsnag-exporter

Export Bugsnag project, error, and event data easily from a command line call which automatically handles pagination, and API backoffs

bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh

Last synced: 06 May 2026

https://github.com/iv4n-ga6l/functional-dataprocessing-pipeline

A functional data processing pipeline that accepts an input file, allows specifying both input and output formats, applies specified transformations, and produces a resulting output file.

csv data datapreprocessing excel json pandas parquet pipeline python

Last synced: 06 May 2026

https://github.com/darrendavy12/azure-databricks-setup-guide-with-formula1-csv

Azure Databricks Setup Guide with Formula1 CSV - Azure Databricks, PySpark, Python, Data Lake Storage

apache azure cloud data databricks lake notebooks pyspark python spark storage

Last synced: 06 May 2026

https://github.com/ashleydavis/brisjs-web-scraping-talk

Code to accompany my talk on web scraping for the Brisbane JavaScript meeting in September 2018

cheerio data data-acquisition data-acquisiton electron headless-browsers javascript nightmare nightmarejs nodejs web-scraping

Last synced: 06 May 2026

https://github.com/hackersandslackers/hackers-jupyter-posts

:red_circle: :closed_book: Our repository for Jupyter Notebook to serve as blog posts.

blog data data-engineering gatsbyjs jupyter jupyter-notebook python python3

Last synced: 07 May 2026

https://github.com/lab5e/loadabledata

Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.

async data loadable state typescript ui

Last synced: 07 May 2026

https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis

This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.

analysis data jupyter-notebook matplotlib numpy python seaborn vscode

Last synced: 07 May 2026

https://github.com/chardos/get-git-data

Access git repository data in node.

data git javascript node

Last synced: 07 May 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/safwan2003/randomforest_heart_disease_prediction

A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.

binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit

Last synced: 07 May 2026

https://github.com/danyal-faheem/project-logs-analyzer

This repo contains scripts to analyze project logs and display some charts related to the data

data data-visualization matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer

This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.

cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark

Last synced: 07 May 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/lckylke/vizweb

Web application for data visualization:)

data expressjs nextjs web

Last synced: 08 May 2026

https://github.com/writetome51/public-data-container-interface

Just a TypeScript interface with 1 property: 'data'

container data interface typescript

Last synced: 15 May 2026

https://github.com/chompfoods/sdk-typescript-angular

Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript

Last synced: 09 May 2026

https://github.com/basemax/okala-product-ids

A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.

crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/naitiknayak196/tech-layoffs-cleaning-sql-vs-python

This project cleans and analyzes a tech layoffs dataset using MySQL and Python (Pandas) to compare their efficiency in data processing. It provides business insights into workforce trends, industry stability, and economic impacts to support data-driven decision-making.

data datacleaning dataset jyputer-notebook layoffdata layoffs mysql python sql

Last synced: 09 May 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/mohamedbilal1800/olympic_history_data_analysis

This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.

analysis data eda matplotlib-pyplot pandas python seaborn visulaization

Last synced: 09 May 2026

https://github.com/xiaomingx/10000-public-apis-and-data

Public APIs are interfaces that allow developers to access various services, features, or data from external systems or platforms.

api-ecosystem api-integration data developer-friendly-apis open-api-access public-api-tools third-party-services

Last synced: 30 Jul 2025

https://github.com/machinecyc/lotteryinsight

Use crawler to collect Taiwan Lotto data, and save data into local MySQL server.

crawler data docker lottery mysql-database python3 taiwan

Last synced: 09 May 2026

https://github.com/baranasoftware/curricular-api

The design and implementation of a REST API for student and course data for a Higher Ed institution.

aws data data-pipeline go golang lambda rest rest-api sqlite3 system-design terraform

Last synced: 09 May 2026

https://github.com/scjoaoantonio/trab_datascience

Este projeto tem como objetivo analisar os posts da rede social Bluesky. A aplicação interativa foi desenvolvida utilizando Streamlit e permite a coleta e visualização de dados, além de oferecer análises avançadas como previsão de engajamento, modelagem de tópicos e análise de sentimentos.

bluesky data data-science streamlit

Last synced: 09 May 2026

https://github.com/khushi-sabarad/data_analysis

linkedin learning capstone project

data data-engineering matplotlib pandas python

Last synced: 10 May 2026

https://github.com/beeracs/llama

Run Llama models in your web browser using JavaScript and WebAssembly. Explore light and dark modes easily. 🌐🐱👤

ai data fine-tuning framework gpt langchain large-language-models llama3 llamaindex llm lora machine-learning nlp peft qlora qwen rlhf vllm

Last synced: 10 May 2026

https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis

In this project, I will be implementing Principal Component Analysis (PCA) from scratch on an ecological footprint consummation database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.

data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis

Last synced: 10 May 2026

https://github.com/repirate/asset-recovery-tool

A simple tool for recovering undrained tokens and NFTs from a compromised wallet on the Ethereum network.

bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask-desktop metamask-plugin phrase recovery seed token wallet

Last synced: 10 May 2026

https://github.com/infinitode/crsd

A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.

ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data

Last synced: 10 Jun 2026

https://github.com/brightway-lca/bw_io

IO tools for Brightway LCA framework

bw3 data life-cycle-assessment python

Last synced: 10 Jun 2026

https://github.com/miniql/miniql-json

A MiniQL query resolver that loads data from JSON files.

data json query query-language

Last synced: 11 May 2026

https://github.com/gurpreet0022/airbnb-eda

EDA on Airbnb booking data to uncover valuable insights, trends, and patterns

data data-science dataanalytics insights jupyter-notebook matplotlib numy pandas projects python3 seaborn visualization

Last synced: 11 May 2026

https://github.com/sehaj003/boston-bruins-roster-planning-mysql-nosql

Repository for Data Management project, Boston Bruins Roster Planning using MySQL and NoSQL along with data analysis using Python

data data-management mongodb mysql project-repository python

Last synced: 11 May 2026

https://github.com/alimghmi/bdlc

Bloomberg API integration, handling data requests, processing, and SQL database insertion.

api-client bloomberg data data-processing financial-data oauth2 python sql-database transformation

Last synced: 10 Jun 2026

https://github.com/alpine418/datahandler

Data handler for PHP arrays.

data data-handler php73

Last synced: 10 Jun 2026

https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis

The project gives insights that delve into the Honkai Star Rail's character's stats of all available characters as of the given date.

data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium

Last synced: 10 Jun 2026

https://github.com/mateogiuffra/estrd2024s1

trabajos prácticos realizados en la materia Estructura de Datos de la Universidad Nacional de Quilmes (UNQ)

c cpp data data-structures-and-algorithms eficiency functional-programming haskell unq

Last synced: 12 May 2026

https://github.com/triboot/ultimate-playerprefs

This repository is only created for **Issue-Tacking** and the **Wiki-Documentation** (Wiki Docs are cooming soon) of Ultimate PlayerPrefs. The plugin source code is fully open and available after the purchase on the Unity Asset Store.

anticheat assetstore data datamanagement easy manager persistent-storage playerprefs playerprefsmanager savegame storage tools unity unity3d unity3d-editor

Last synced: 11 Jun 2026

https://github.com/m0nica/datalogues-refresh

:bar_chart: Programming blog focused on data with an emphasis on exploration in Python.

data jekyll python technical-writing

Last synced: 14 May 2026

https://github.com/flexiui-labs/flexi-grid

Flexi Grid is an advanced, lightweight, and customizable Angular 19 data grid component

angular data filter grid search select sort table

Last synced: 14 May 2026

https://github.com/poojaharihar03/wellness-cities-case-study

A case study for dats analysis of city health centers

analytics data r rstudio

Last synced: 11 Jun 2026

https://github.com/lulloooo/article-fromfitto55tofittoeveryone

Analysis leading to an article published in the EcoSprinter 2024 Annual edition about an Analysis of EU "Fit for 55" packages under a different perspective 🔎

analysis data environment european-union

Last synced: 12 Jun 2026

https://github.com/iannil/one-data-studio

one-data-studio integrates a data governance and development platform, a cloud-native MLOps platform, and a large model application development platform. It connects the entire value chain from raw data governance to model training and deployment, and further to the construction of generative AI applications.

data llm model platform

Last synced: 12 Jun 2026

https://github.com/shashwat9kumar/trends_in_a_country_on_twitter

Finding trending topics in each country on twitter and visualizing them in a WordCloud

data data-visualization trends tweepy twitter-api wordcloud

Last synced: 13 Jun 2026

https://github.com/word2vect/beijing-new-house-data-visualization

Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab

data python visualization

Last synced: 13 Jun 2026

https://github.com/neuro-mechatronics-interfaces/ros2_data_agent

Code for a multipurpose file explorer specializing in reading ROS2 topic data from '.bag' or '.db3' files

data python ros2

Last synced: 13 Jun 2026

https://github.com/bastianolea/plebiscitos_chile

Datos de resultados electorales de los plebiscitos constitucionales de 2022 y 2023

chile comunas data elecciones politica social

Last synced: 15 Jun 2026

https://github.com/isharescheme/participant-onboarding-portal

Standardized onboarding portal for data space participants.

data onboarding particpant space

Last synced: 15 Jun 2026

https://github.com/lafkpages/minecraft-crafting-info

Scrapes https://www.minecraftcrafting.info for crafting recipes.

api crafting data minecraft

Last synced: 17 Jun 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/dushansenadheera/web_scraper

web scraper using Python along with BeautifulSoup and Selenium

beautifulsoup data python selenium web-scraping

Last synced: 19 Jun 2026

https://github.com/rylan12/apscores

A quick way to visualize how the AP score distributions have changed from year to year.

advanced-placement analysis ap-exam data scores

Last synced: 19 Jun 2026

https://github.com/bastianolea/mineduc_personal_academico

Datos de Personal Académico, entre los años 2008 y 2024, del sistema de Educación Superior.

chile data educacion meses tiempo

Last synced: 19 Jun 2026

https://github.com/svetlanam/etl-transformation

ETL data cleaning and transformation for specific use case in own Keboola project

cleaning data etl keboola python rest-api transformation

Last synced: 20 Jun 2026

https://github.com/g-schumacher44/analyst_resource_hub

A collection of guidebooks, quickref, and resources for data analysis

analytics bigquery data lookerstudio machine-learning model python sql yaml-configuration

Last synced: 20 Jun 2026

https://github.com/petzi53/repairdata

Open Repair Alliance Datasets 2021

data open-data open-datasets r repair repair-cafe repairs

Last synced: 22 Jun 2026

https://github.com/davorg/towerbridge

When is Tower Bridge lifting?

data hacktoberfest london perl web-scraping

Last synced: 29 Jun 2026