An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/782e616c6d/covid-d.a

Academic project, using Apache Spark for ETL and Data Studio for data analysis.

academic analytics automation cluster covid-19 data database etl python spark sql

Last synced: 10 May 2026

https://github.com/R-Mahesh45/HR---Resume-Text-Classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 13 Oct 2025

https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa

Records data on students who have committed violations, ready to be logged, disciplined, and sanctioned

aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website

Last synced: 02 May 2026

https://github.com/goncaloperes/datavisualization

Here I will share some of my data visualizations using a variety of datasets, technologies and tools.

d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick

Last synced: 04 Feb 2026

https://github.com/aranfononi/h4x0r-news-section-17-project

A SwiftUI-powered app that displays top stories from Hacker News. Users can open articles directly within the app, utilizing SwiftUI’s NavigationLink and custom WebView integration.

app-development data data-binding data-binding-library ios swift swiftui xcode

Last synced: 18 May 2026

https://github.com/iamgmujtaba/github-python-daily-trending

This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file

data data-scraping github-actions tranding tranding-bot

Last synced: 13 Oct 2025

https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-

Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.

analysis analytics data excel sales

Last synced: 14 Feb 2026

https://github.com/doziestar/datavinci

DataVinci enables you to visualize data from various sources, generate insights, analyze data with AI models, and receive real-time updates on anomalies

data golang logs pipeline

Last synced: 23 Jan 2026

https://github.com/dimitryzub/walmart-stores-coffee-analysis

Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn

Last synced: 10 May 2026

https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights

Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.

analytics data excel sas sasprogramming statistical-analysis

Last synced: 24 Feb 2026

https://github.com/frictionlessdata/extensiondp

Extension DP (Data Package Extension Template) is a Git repository template for rapid Data Package extension development

data datapackage exchange extension format

Last synced: 13 Feb 2026

https://github.com/lmuffato/project-mysql-vocabulary-booster-trybe

MySQL Vocabulary Booster project - graded Trybe project for Block 20: SQL functions, joins, and subqueries

back-end crud data database mysql mysqlworkbench query sql trybe-projects

Last synced: 10 May 2026

https://github.com/obsidianplusplus/5e_play_cs-go

Python tool to analyze your 5EPlay CS:GO match data. Fetches, analyzes, filters, and exports.

5eplay analysis api automation csgo data esports excel json match pandas performance player python reporting scraping stats team

Last synced: 13 Feb 2026

https://github.com/stdlib-js/strided-base-dtype-resolve-str

Return the data type string associated with a supported strided array data type value.

array data dtype dtypes enum javascript node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 13 Oct 2025

https://github.com/garcane/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 13 Feb 2026

https://github.com/athul64/powerbi

Financial Reports Dashboard. This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.

data data-an data-visualization dax dax-expression powerbi

Last synced: 23 Feb 2026

https://github.com/yasenstar/powerbi_tutorial

Based on the "PowerBI Tutorial" book, provides step-by-step video demos for learning and mastering the Power BI tool

analytics data microsoft powerbi tutorial visualization

Last synced: 07 Jan 2026

https://github.com/ginga1402/chinook_database

Microsoft SQL Server Management Studio

business-query data sql-server

Last synced: 30 Mar 2025

https://github.com/sakshisrivastava-2601/credit-card-fraud-detection

Credit Card Fraud Detection Project Using Machine Learning. This project focuses on leveraging advanced machine learning techniques to identify fraudulent transactions with high accuracy.

advanced-machine data machine-learning numpy project-repository python pytorch random-forest

Last synced: 16 Apr 2026

https://github.com/lisakey/datacamp-data-analyst-python-sql-projects

Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.

analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali

Last synced: 19 Apr 2026

https://github.com/m0nica/datalogues-outdated

Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll

data pelican pelican-blog pelican-theme

Last synced: 28 Feb 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/seabbs/estzoonotictb

Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden

bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb

Last synced: 28 Feb 2026

https://github.com/capire/xtravels-java

Travel booking app using master data from xflights built with CAP Java

cap cds data federation flights java reuse

Last synced: 23 Jan 2026

https://github.com/rbruinier/mysqlbulkimportbenchmark

Benchmarking some methods to import big data sets into mysql tables

benchmark data database mysql php

Last synced: 02 May 2026

https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis

Complete analysis: data cleaning, processing, and data analysis with an interactive dashboard

analysis data data-visualization datacleaning excel excel-vba

Last synced: 19 Mar 2026

https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis

This project explores the Walmart Sales data to understand top-performing branches and products, sales trends of different products, and customer behaviour. The aim is to study how sales strategies can be improved and optimized. The dataset was obtained from Kaggle.

data database mysql sql walmart

Last synced: 24 Feb 2026

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model for your database from its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/exoticknight/juhe

simple way to analyze complex data in one chain call

aggregation aggregator analysis data statistic typescript

Last synced: 21 May 2026

https://github.com/shogunbanik18/budgetify

End-to-End Budget Analysis enables effective budgeting through detailed analysis and strategic planning

analysis data data-engineering data-exploration databricks databricks-notebooks etl etl-process python3

Last synced: 09 Jun 2026

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to perform data analysis for a Real Estate Investment Trust. The Trust would like to start investing in residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/souvik09-tech/adventure-works-kpi-dashboard

This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.

analysis data kpi powerbi visualization

Last synced: 27 Jan 2026

https://github.com/danielgiljam/orbit-utils

A collection of utility packages for Orbit.js.

data inference orbit orbitjs schema synchronization type typescript validation zod

Last synced: 01 May 2026

https://github.com/ibilalkayy/covid-tracking-app

This repository contains the code for a COVID tracking app that shows COVID-19 data on Google Maps.

covid-19 data google-maps

Last synced: 14 Oct 2025

https://github.com/tushar2704/applied-ai-playground

This repository serves as a comprehensive collection of resources and projects for Applied Artificial Intelligence (AI). Whether you're an AI enthusiast, a data scientist, or a developer looking to explore practical applications of AI, this repository aims to provide you with valuable materials and hands-on projects to deepen your understanding.

artificial-intelligence data data-science machine-learning machine-learning-algorithms

Last synced: 12 Feb 2026

https://github.com/orisai/nette-data-sources

Orisai Data Sources integration for Nette

data decoder encoder file-format files json neon nette orisai parser php yaml

Last synced: 05 Feb 2026

https://github.com/nnavales/desafios-data-engineer

In this project we tackle common challenges in the Data Engineer role using modern technologies.

data data-engineering database dataengineering docker minio scrapping spark

Last synced: 01 Jun 2026

https://github.com/ewertondrigues02/engenharia-de-dados

Various data engineering projects using major tools such as Airflow, Snowflake, dbt, Postgres, Looker Studio, and Power BI

airflow analise-exploratoria analytics aws-ec2 dados data dbt-cloud engenharia-de-dados looker-studio postgres pyspark python3 snowflake spark

Last synced: 16 Apr 2026

https://github.com/ariqf1/learn_data

Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.

data data-engineering mysql

Last synced: 15 Apr 2026

https://github.com/seanowenhayes/recipe-scraper

A simple scraper that uses Puppeteer to scrape recipes and more from the web

crawler crawling data recipes scraping

Last synced: 22 Feb 2026

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com, organized book by book, especially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/insolite/react-data-frame

Table for huge data sets

data react table

Last synced: 14 May 2026

https://github.com/n0nag0n/flee-intercom

For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.

again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year

Last synced: 09 May 2026

https://github.com/skygenesisenterprise/aether-account

Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.

account data javascript nextjs platform service sso-service typescript user-interface

Last synced: 16 Apr 2026

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026

https://gitlab.com/Native-Coder/d3-react-component

This is a dead-simple React component that makes D3 implementation a breeze.

chart component d3 data react vis visualization viz

Last synced: 24 Jan 2026

https://github.com/akv3sic/cryptocurrency-charts

Cryptocurrency API data visualizations 📈 with Matplotlib.

cryptocurrency data data-visualization matplotlib python

Last synced: 16 Oct 2025

https://github.com/chompfoods/stub-jaxrs-resteasy

JAX-RS RESTEasy server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jax-rs-server nutrition raw recipe-api recipes resteasy server server-stub stub stub-server

Last synced: 08 May 2026

https://github.com/favarettorm/bd_universidade

BD_UNIVERSIDADE V01 - Fictional university database for teaching purposes

data database dataset mariadb mariadb-database mariadb-mysql mysql mysql-database scripts sql university

Last synced: 08 May 2026

https://github.com/scottleechua/data

Public datasets under CC-BY-4.0 license.

data public-data

Last synced: 18 Mar 2026

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/bishtrishu/pizza_sales_data_analysis_sql

This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.

cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database

Last synced: 14 Apr 2026

https://github.com/kucingkode/dmerge

Small JavaScript library to help you merge identically formatted data in a string

cithak data data-merge javascript library lightweight lightweight-javascript-library merge open-source

Last synced: 04 May 2026

https://github.com/potreic/etl-fashion-trend-analysis

✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊

airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends

Last synced: 27 Jan 2026

https://github.com/jhpoelen/bats

Self-documenting data publication on bat (Chiroptera) specimens

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/gkapfham/ast2016-paper

Source Code of and Supporting Files for a Paper Published at AST 2016

data latex-document paper research

Last synced: 19 Oct 2025

https://github.com/data-forge-notebook/javascript-cheat-sheet

Cheat sheet that accompanies my book Data Wrangling with JavaScript

cheatsheet data data-wrangling javascript nodejs

Last synced: 15 Apr 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/oefenweb/python-untraceables

Randomizes IDs for a given set of tables making them untraceable across environments

anonymize data database mysql privacy python python2 python3 randomization

Last synced: 03 Feb 2026

https://github.com/kuro337/scalamono

Scala Monorepo Tooling for Kafka, Opensearch, Spark, Redpanda, Hadoop - and Lang Reference.

data database duckdb hadoop kafka redpanda sdala spark

Last synced: 13 Apr 2026

https://github.com/jayantur13/kountry

Node module variant of the Country API

api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn

Last synced: 26 Jan 2026

https://github.com/florianwendelborn/metatypes

Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)

code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript

Last synced: 27 Jan 2026

https://github.com/gematik/poc-isik-patient-merge

The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to notify about merges that have occurred within the ISIK context.

data fhir isik poc

Last synced: 19 Oct 2025

https://github.com/codenoid/alodokter.com-database

An Alodokter.com database, collected by Hofesh Bot (scraper)

alodokter data extraction hofesh

Last synced: 18 Mar 2026

https://github.com/danielbello7/nosql-json-database

Simple and quick database to help development process and speed

data database json json-database models nosql nosql-database nosql-json-database schema

Last synced: 09 May 2026

https://github.com/pharo-ai/data-preprocessing

Project including data pre-processing algorithms. We aim to include scaling, centering, normalization, and binarization methods.

data pharo pharo-smalltalk preprocessing smalltalk

Last synced: 09 Feb 2026

https://github.com/garcane/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 09 Feb 2026

https://github.com/tarantinoarchive/dec

Developer-Easy CMS

cms data easy ejs js json simple

Last synced: 11 Mar 2026

https://github.com/yash22222/sync-intern-s-ml-tasks

SYNC INTERN'S Machine Learning internship offers you the chance to enhance your skills by doing real-life example projects. This internship will increase your knowledge in the field of data and algorithms to understand how a machine learns.

bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns

Last synced: 07 May 2026

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026

https://github.com/alexscigalszky/palabras-aleatorias-data

This package has a set of datasets of random words, animals, colors, jokes, onomatopoeias, and types

aleatorias data palabras random words

Last synced: 04 Oct 2025

https://github.com/prpriesler/covid19-insights-and-analytics

This project delves into the realm of data analytics and programming, focusing on four pivotal datasets related to the COVID-19 pandemic: confirmed global, death global, vaccination & population data, and Twitter data.

covid19 covid19-data data data-science dataanalytics deep-neural-networks machine-learning natural-language-processing

Last synced: 31 Aug 2025

https://github.com/varbrad/mindb

🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.

browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema

Last synced: 13 Apr 2026

https://github.com/noahweasley/node-user-settings

A universal but simple Node library to implement user settings, built to work with Electron.js with little or no configuration

app data electronjs json nodejs persist settings storage sync user

Last synced: 08 Feb 2026