An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/ahmad-ali-rafique/comment-generation-tool

This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.

ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning

Last synced: 07 Oct 2025

https://github.com/tushar2704/insurance-cross-sell

This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.

data datascience h20automl machine-learning mlflow python streamlit-tushar2704

Last synced: 08 Oct 2025

https://github.com/nikoshet/rust-dms-cdc-operator

The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.

aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation

Last synced: 18 Jan 2026

https://github.com/taiwotman/pysparkstreaming

Demonstrate pyspark structured programming using template design pattern

analytics data pyspark streaming wordcount

Last synced: 18 Mar 2026

https://github.com/mewmix/drivehound

magic file signatures + python drive recovery magic

data disk file-signatures harddrive python recovery recovery-tool

Last synced: 08 Oct 2025

https://github.com/pharo-ai/data-imputers

This project contains transformers for missing value imputation

ai data data-science imputer pharo pharo-smalltalk smalltalk

Last synced: 18 Jan 2026

https://github.com/liyakhathshaik/datascout.jl

This is a julia package

data datascout julia

Last synced: 09 Oct 2025

https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer

Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.

analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob

Last synced: 17 Apr 2026

https://github.com/kamal-singh22/ai-driven-emotional-sentiments-analysis

This project leverages machine learning to analyze and classify the emotional sentiment of textual data. The goal is to accurately identify and categorize emotions, aiding applications in customer feedback analysis, social media sentiment analysis, and mental health monitoring.

analysis artificial-intelligence data emotion nlp-machine-learning python sentiment-analysis streamlit text-classification

Last synced: 14 Apr 2026

https://github.com/jpmens/airports-zonedata

$INCLUDE airport locations

airports data dns

Last synced: 19 Jan 2026

https://github.com/alexandregazagnes/rica-analysis

This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.

analysis argiculture business data data-analysis data-analytics food python

Last synced: 29 Apr 2026

https://github.com/definetlynotai/vulnscan_data

Logicytics VulnScan Module's Training Data and old model archive

ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data

Last synced: 11 Oct 2025

https://github.com/strata/data

Tools to help you read data from a range of different data providers.

api data data-integration

Last synced: 27 Jan 2026

https://github.com/davorg/dmp

Data Munging with Perl

book data hacktoberfest munging perl

Last synced: 21 Jan 2026

https://github.com/stdlib-js/ndarray-empty-like

Create an uninitialized ndarray having the same shape and data type as a provided ndarray.

data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 11 Oct 2025

https://github.com/gniquyij/tuqiu

Dark side of OS X

data notes osx safari

Last synced: 12 Oct 2025

https://github.com/jrmedd/emojinal

An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.

data emojis sentiment user-research ux

Last synced: 19 Jan 2026

https://github.com/rohancyberops/r-language

R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.

caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse

Last synced: 12 Oct 2025

https://github.com/genert/metis

Asynchronous data sender library

analytics asynchronous data dependency-free typescript

Last synced: 27 Jan 2026

https://github.com/R-Mahesh45/HR---Resume-Text-Classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 13 Oct 2025

https://github.com/iamgmujtaba/github-python-daily-trending

This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file

data data-scraping github-actions tranding tranding-bot

Last synced: 13 Oct 2025

https://github.com/Lemniscate-world/StratAI

This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.

ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading

Last synced: 13 Oct 2025

https://github.com/stdlib-js/strided-base-dtype-resolve-str

Return the data type string associated with a supported strided array data type value.

array data dtype dtypes enum javascript node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 13 Oct 2025

https://github.com/athul64/powerbi

Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.

data data-an data-visualization dax dax-expression powerbi

Last synced: 23 Feb 2026

https://github.com/lisakey/datacamp-data-analyst-python-sql-projects

Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.

analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali

Last synced: 19 Apr 2026

https://github.com/souvik09-tech/adventure-works-kpi-dashboard

This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.

analysis data kpi powerbi visualization

Last synced: 27 Jan 2026

https://github.com/ibilalkayy/covid-tracking-app

This repository contains the code of a covid tracking app that shows the data of covid-19 on Google Map.

covid-19 data google-maps

Last synced: 14 Oct 2025

https://github.com/open-i18n/data-iso-15924

Git mirror for ISO 15924, Codes for the representation of names of scripts data

data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems

Last synced: 14 Mar 2026

https://github.com/akv3sic/cryptocurrency-charts

Cryptocurrency API data visualizations 📈 with Matplolib.

cryptocurrency data data-visualization matplotlib python

Last synced: 16 Oct 2025

https://github.com/bishtrishu/pizza_sales_data_analysis_sql

This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.

cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database

Last synced: 14 Apr 2026

https://github.com/potreic/etl-fashion-trend-analysis

✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊

airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends

Last synced: 27 Jan 2026

https://github.com/nicolasbizzozzero/datagenerator

Randomly generate various commonly used data

data data-generation data-generator data-science

Last synced: 18 Oct 2025

https://github.com/florianwendelborn/metatypes

Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)

code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript

Last synced: 27 Jan 2026

https://github.com/gematik/poc-isik-patient-merge

The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to inform about happened merges within the ISIK context.

data fhir isik poc

Last synced: 19 Oct 2025

https://github.com/juliaextremes/idfdatacanada.jl

A set of methods to get ECCC IDF data from .txt files

canada climate data julia netcdf

Last synced: 21 Oct 2025

https://github.com/fiddlydigital/fastmap

A simple 2D map that is optimized for speed.

array cimage data map

Last synced: 23 Oct 2025

https://github.com/rodekruis/510-data-catalog

The Project is CKAN based Data Catalog Portal for 510

catalog ckan data opendata

Last synced: 23 Jan 2026

https://github.com/atymri/linqsimulator

LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.

console csharp data data-analysis linq query sql

Last synced: 23 Oct 2025

https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london

Crucial Question 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable segment customers? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

data data-analysis-python data-analytics data-visualization ecommerce

Last synced: 24 Oct 2025

https://github.com/eshaagarwa/hr-analytics-project

Explore our HR Analytics Dashboard, a powerful Power BI project designed for HR managers and leaders. Analyzed essential KPIs such as Employee Count, Attrition Rate, and Job Satisfaction across various demographics.

dashboard data data-visualization dataanylasis ms-excel ms-excel-data-analytics powerbi statistics

Last synced: 23 Jan 2026

https://github.com/farzai/geonames-php

This package provides a simple way to download Geonames data and format it for friendly use.

countries country-codes data geography geonames

Last synced: 24 Oct 2025

https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2

Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.

dashboard data data-analysis dax-measures excel powerbi powerbidashboard

Last synced: 23 Jan 2026

https://github.com/doziestar/datavinci

DataVinci enables you to visualize data from various sources, generate insights, analyze data with AI models, and receive real-time updates on anomalies

data golang logs pipeline

Last synced: 23 Jan 2026

https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights

Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.

analytics data excel sas sasprogramming statistical-analysis

Last synced: 24 Feb 2026

https://github.com/capire/xtravels-java

Travel booking app using master data from xflights built with CAP Java

cap cds data federation flights java reuse

Last synced: 23 Jan 2026

https://github.com/cmda-tt/course-24-25

🎓 tech track · 2024-2025 · curriculum and syllabus 📊

d3 data datavis datavisualization es6 functional javascript programming svelte

Last synced: 28 Jan 2026

https://github.com/imahdimir/githubdata

A very simple Python package to easily download from and manage a GitHub "Data Repository"

data data-repository python-package

Last synced: 23 Jan 2026

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/rnabla/cuda-des

Bruteforcing DES using CUDA

bruteforce cuda data des encryption gpu parallel standard

Last synced: 27 Oct 2025

https://github.com/mihaiconstantin/lavot

A `React` application that allows users to indicate how votes will be redistributed among candidates for the second round of Romanian presidential elections.

data data-visualization elections react sankey typescript

Last synced: 06 Feb 2026

https://github.com/medz/block

A flexible and efficient binary data block handling library for Dart.

binary blob block data streams

Last synced: 24 Feb 2026

https://github.com/patrikmasiar/algorythm-of-the-night

Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻‍💻

algorithms data interview-questions logic logic-programming math mathematics science

Last synced: 27 Oct 2025

https://github.com/maccccd/wsoa3029a_2444372

This website serves an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js , a JavaScript library used to create interactive data visualizations. The visualizations in here were used to provide insights on two types of cybersecurity attacks: Phishing & Ransomware.

d3js data hacking visualization

Last synced: 24 Jan 2026

https://gitlab.com/Native-Coder/d3-react-component

This is a dead-simple React component that makes D3 implementation a breeze.

chart component d3 data react vis visualization viz

Last synced: 24 Jan 2026

https://github.com/zoekelepiri/ota_observatory

A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.

backend data database spring-boot

Last synced: 06 Feb 2026

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/ariqf1/learn_data

Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.

data data-engineering mysql

Last synced: 15 Apr 2026

https://github.com/stdlib-js/ndarray-base-output-policy-str2enum

Return the enumeration constant associated with an output ndarray data type policy string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils

Last synced: 15 Apr 2026

https://github.com/alejo1630/titanic_kaggle

This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.

data data-science jupyter-notebook notebook python titanic-survival-prediction

Last synced: 03 May 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/aniketkkajania/wassupanalyzer

WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.

data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat

Last synced: 25 Feb 2026

https://github.com/cworld1/novel-data

The data repository of novel analysis

analysis data novel

Last synced: 01 Feb 2026

https://github.com/giladbarnea/to

A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.

conversion data data-conversion json json5 parser script terminal toml yaml

Last synced: 08 Feb 2026

https://github.com/noahweasley/node-user-settings

A universal but simple node library to implement user settings, built to work with Electron.js with little or no configurations

app data electronjs json nodejs persist settings storage sync user

Last synced: 08 Feb 2026

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026

https://github.com/pharo-ai/data-preprocessing

Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.

data pharo pharo-smalltalk preprocessing smalltalk

Last synced: 09 Feb 2026

https://github.com/danielbello7/nosql-json-database

Simple and quick database to help development process and speed

data database json json-database models nosql nosql-database nosql-json-database schema

Last synced: 09 May 2026

https://github.com/jhpoelen/bats

self-documenting data publication on Bat (Chiroptera) specimen

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/scottleechua/data

Public datasets under CC-BY-4.0 license.

data public-data

Last synced: 18 Mar 2026

https://github.com/mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema

Last synced: 11 Feb 2026

https://github.com/skygenesisenterprise/aether-account

Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.

account data javascript nextjs platform service sso-service typescript user-interface

Last synced: 16 Apr 2026

https://github.com/tushar2704/applied-ai-playground

This repository serves as a comprehensive collection of resources and projects for Applied Artificial Intelligence (AI). Whether you're an AI enthusiast, a data scientist, or a developer looking to explore practical applications of AI, this repository aims to provide you with valuable materials and hands-on projects to deepen your understanding.

artificial-intelligence data data-science machine-learning machine-learning-algorithms

Last synced: 12 Feb 2026

https://github.com/lmuffato/project-mongodb-dataflights-trybe

Projeto MongoDB Dataflights - Projeto avaliativo da Trybe do Bloco 23: Introdução ao MongoDB

back-end crud data database filter mongo mongodb query trybe-projects

Last synced: 16 Apr 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026