An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/lord3008/instances-of-data-analysis

This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.

data data-analysis

Last synced: 03 Mar 2025

https://github.com/dina-hosny/sequence-trigger-pair-for-all-schema-tables-plsql

A PLSQL script that creates Sequence Trigger Pair for all Schema's Tables

data oracle plsql sequence sequencetrigger sql toad trigger

Last synced: 06 Mar 2026

https://github.com/stdlib-js/array-base-index-of-same-value

Return the index of the first element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 15 May 2026

https://github.com/josemartinezrdev/logisticadb

Logistica Database

data ddl diagrama dml mysql sql

Last synced: 09 Jul 2025

https://github.com/farovictor/mongodbloader

This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.

data go mongodb pipeline

Last synced: 15 May 2026

https://github.com/kylepw/multistack

Example of multiple stacks in one array.

algorithms array data data-structures python stack

Last synced: 17 Mar 2025

https://github.com/samharrison7/datamapper

Making mapping between datasets as simple as possible.

data data-mapper data-mapping data-science data-structures

Last synced: 17 Mar 2025

https://github.com/alexdonh/adonis-cache

Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!

adonis-framework adonisjs cache data dependency redis storing

Last synced: 15 May 2026

https://github.com/mightymetrika/mmirestriktor

Informative Hypothesis Testing Web Applications

data hypothesis infomative power r restriktor statistics testing

Last synced: 17 Mar 2025

https://github.com/antononcube/raku-data-typesystem

Data type system for different data structures.

data data-structures rakulang type-system

Last synced: 09 Jul 2025

https://github.com/sambhav/fb-insights

A tool to analyze your Facebook data dumps and generate insights

analytics data facebook graphs insights language learning machine natural personal processing

Last synced: 17 Mar 2025

https://github.com/alexis-gss/games-data

Games Data is a library of informations about all games, realised under NuxtJs

css3 data games nuxtjs tailwindcss typescript vuejs

Last synced: 13 Mar 2025

https://github.com/ramonmeza/mysteamstats

Visualize your stats from your favorite games on Steam!

data statistics steam steam-api videogame visualization

Last synced: 17 Mar 2025

https://github.com/amethyst-php/email-subscription

Subscribe your email to our mailing-list, we'll promise no spam will be delivered.

amethyst amethyst-package api data email-subscription laravel

Last synced: 17 Mar 2025

https://github.com/metapsy-project/data-depression-psiloctr

Database of psilocybin-assisted therapies for adults with depression versus control conditions.

data

Last synced: 01 Mar 2026

https://github.com/ebrizzzz/data-visualization-project-using-tableau

A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.

analytics data data-analysis data-science data-visualization python regression tableau

Last synced: 16 Jun 2025

https://github.com/ezmiller/boe-election-data

CSV files containing parsed NYC Bureau of Elections data for 2009 and 2013

data elections nyc

Last synced: 18 Oct 2025

https://github.com/nanvenomous/sizable

A generic interface to mongo go driver

data driver generic generics go golang mongodb

Last synced: 15 May 2026

https://github.com/theleopard65/isa-imitation

This repository contains a simple C++ implementation of a Von-Neumann architecture simulator. The program mimics the behavior of a basic computer architecture that uses a single memory space for both instructions and data. Users can load programs, execute them, and view the current state of the memory and registers.

32-bit 64-bit ac architecture c-plus-plus data executable explained implementation ir isa mar mdr memory pc registers simulation von-neumann x64 x86

Last synced: 18 Mar 2025

https://github.com/samridhisainii/airbnb-data-analysis

Data analysis of airbnb dataset

analysis data data-visualization eda models

Last synced: 16 May 2026

https://github.com/takamoso/umami

Cross browser compatibility data.

browser compat compatibility data dataset json

Last synced: 27 Mar 2025

https://github.com/ahabdel/amazon-web-scraper

Amazon Web Scraper to scrape pricing adjustments and provide updates on a day to day basis

data web-scraping

Last synced: 29 Oct 2025

https://github.com/noorkhokhar99/text-to-speech-demo

Text to Speech Demo

data python roboflow

Last synced: 27 Mar 2025

https://github.com/cemoktra/data_series

time series handling

data lazy-evaluation time-series

Last synced: 29 Oct 2025

https://github.com/nabilaagha/chest-x-ray-medical-diagnosis-using-deep-learning

This project uses deep learning to classify chest X-ray images for disease detection. It involves data preprocessing, pre-trained CNN models, and the ChestX-ray8 dataset to enhance medical diagnostics with AI.

computer-vision data data-processing deep-learning juypter-notebook medical-image-processing x-ray-images

Last synced: 15 Dec 2025

https://github.com/jorgeatgu/dataset-elecciones-28a

Datasets generados a partir del dataset de elecciones generales de El País

28a data elecciones2019 elections spain

Last synced: 16 May 2026

https://github.com/chocolateboy/corrigenda

Corrections, addenda, and deltas for data that's wrong on the Internet

addenda api corrections corrigenda data json json-data

Last synced: 27 Mar 2025

https://github.com/campiohe/geomask

A very simple lib for creating geometric masks from spatial data using regular grids.

climate data gis weather

Last synced: 30 Dec 2025

https://gitlab.com/sean-c/pdf_rules

Turn PDFs into CSVs by defining rules

Data Cleaning automation data data parsing

Last synced: 14 Apr 2025

https://github.com/vijaykumar1303/sales-data-analysis-and-dashboard-development

To analyze sales data to uncover insights into sales performance, trends, and patterns, and to develop an interactive dashboard that provides a comprehensive view of sales metrics and KPIs.

data dataanalysis datacleaning datavisualisation dax-query powerbi powerquery sql sqldataanalysis

Last synced: 11 Feb 2026

https://github.com/theduardomaciel/cc-pe

Conteúdos, scripts em R e datasets utilizados durante a matéria de Probabilidade e Estatística.

data probability r statistics

Last synced: 27 Mar 2025

https://github.com/prcharan592/olympic-insights-historical-data-analytics-in-r

This project analyzes 120 years of Olympic history (1896–2016), uncovering trends and insights from the data

data data-analytics data-science data-visualization kaggle r-programming

Last synced: 03 Apr 2025

https://github.com/tuscanicz/doctrine-data-applier

Symfony bundle for Doctrine Migrations of data using doctrine entities

data database doctrine entity migrations symfony symfony-bundle

Last synced: 02 Feb 2026

https://github.com/mx51/data-dictionary-action

GitHub Action for generating and checking freshness of data dictionaries

action analytics data

Last synced: 17 Jan 2026

https://github.com/umstek/sampler

Generate elaborate random data instantly.

data faker javascript json sample

Last synced: 20 Jul 2025

https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights

The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.

chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization

Last synced: 17 Feb 2026

https://github.com/tadiusfrank2001/data_mining_projects_labs_cs145

A collection of data mining course assignments to implement advanced predictive statistical analysis models

algorithms data data-mining data-science deep-learning predictive-modeling python3 wide-learning

Last synced: 16 May 2026

https://github.com/muneeb1030/webscrapper_politifact

This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.

data data-collection dataanalysis python3 scrapy scrapy-spider webscraping

Last synced: 09 Sep 2025

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/debruine/faux.jl

Julia version of faux for data simulation

data julia simulation

Last synced: 28 Mar 2025

https://github.com/danicaalana/breast-cancer-random-forest

This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wisconsin Breast Cancer Diagnostic Dataset from scikit-learn, which is a classic and very easy binary classification dataset.

breast-cancer-classification breast-cancer-wisconsin data eda machine-learning-algorithms python random-forest-classifier

Last synced: 16 May 2026

https://github.com/chrisrobertsjr/chrisrobertsjr

Welcome to my Github Profile!

data data-analysis java r sql statistics

Last synced: 03 May 2026

https://github.com/erictleung/2018-new-coder-survey

:beginner: Code to wrangle data from the 2018 New Coder Survey by freeCodeCamp

data data-cleaning dataset freecodecamp new-coders-survey programmers

Last synced: 03 Apr 2025

https://github.com/hyfi06/unam-careers

A utility package for retrieving career information from UNAM.

career data npm-package unam

Last synced: 16 May 2026

https://github.com/paulveillard/cybersecurity-analytics

An ongoing collection of awesome software, libraries, learning tutorials, documents and books, technical resources and cool stuff about Analytics Engineering in Cybersecurity.

analytics bigdata bigquery cybernetics cybersecurity data data-engineering data-science encryption encryption-decryption seo seo-friendly seo-optimization

Last synced: 28 Mar 2025

https://github.com/naufalbasara/superstores-pipeline

Data Pipeline on Dummy E-commerce with Apache Airflow

airflow data data-engineering data-pipeline data-warehouse postgresql

Last synced: 16 May 2026

https://github.com/rd-uk/rduk-data-sqlite

SQLite Data Provider implementation for rduk-data

data rduk sqlite

Last synced: 16 May 2026

https://github.com/stkisengese/numpy-data-fundamentals

A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.

data data-analysis numpy pre-processing

Last synced: 16 May 2026

https://github.com/praveendecode/data-analysis

Implemented data analysis projects with interactive Streamlit UI for user-friendly data exploration and insights presentation

data data-science dataanalysis exploratory-data-analysis insights python streamlit-dashboard tableau tableau-public

Last synced: 04 Apr 2025

https://github.com/denisecase/buzzline-04-case

Adding live visualizations to streaming data applications

animation data kafka matplotlib python streaming

Last synced: 11 Apr 2025

https://github.com/denisecase/cintel-03-data

Getting started with interactive data analytics in Python

analytics data interactive python shiny

Last synced: 11 Apr 2025

https://github.com/nel-zi/zipco_foods

Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.

airflow apache-spark data dataengineering etl pyspark wsl

Last synced: 03 May 2026

https://github.com/madhuresh2011/kulturehire-internship

☺️Hi folk, During my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql

Last synced: 17 Feb 2026

https://github.com/istinnew/cook-me-up

[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.

bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning

Last synced: 16 May 2026

https://github.com/ournet/quotes-data

Ournet quotes data package

data ournet ournet-quotes quotes

Last synced: 04 Apr 2025

https://github.com/ournet/news-data

Ournet news data package

data news news-data news-storage ournet storage

Last synced: 04 Apr 2025

https://github.com/dimaa1608/azurecontent

AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.

azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization

Last synced: 10 Apr 2025

https://github.com/sap-samples/sap-bdc-explore-hyperscaler-data

The repository contains detailed steps to integrate external hyperscaler data sources to SAP Datasphere in the SAP Business Data Cloud per the Open data ecosystem integration principles .

aws azure business cloud data databricks datasphere gcp hyperscalers sap

Last synced: 16 May 2026

https://github.com/mvuorre/osfdatasette

Harvest, wrangle, and serve preprint data from OSF API with Datasette

data datasette open-science preprints

Last synced: 11 Apr 2025

https://github.com/gsmithun4/expressjs-field-validator

Plugin for validating JSON request, middleware for expressjs

data express-js expressjs json-request middleware nodejs request rest-api validation

Last synced: 06 Mar 2026

https://github.com/anti-duhring/nfl-qb-stats

data of all NFL QB starters until 2021

data json nfl qb stats

Last synced: 05 Apr 2025

https://github.com/nanis/unitedat

Unify data sets which consist of separate files with a common header repeated in each one.

cli data etl utility

Last synced: 12 Apr 2025

https://github.com/1sumer/mass-mail-automation

Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.

data oops-in-python python smtp-server tkinter

Last synced: 20 Aug 2025

https://github.com/sibeux/redesigned-broccoli

Repositori untuk menyimpan data file musik

data data-center nasrulwahabi sibeux

Last synced: 24 Jan 2026

https://github.com/webobite/fact-chatbot

A Fact chatbot is a project in which it read a txt file which consist all facts ahead of time and answer the user with some useful information regarding the same on the basis of facts provided in text file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java

Hash Table is a data structure which stores data in an associative manner. In a hash table, data is stored in an array format, where each data value has its own unique index value. Access of data becomes very fast if we know the index of the desired data. Thus, it becomes a data structure in which insertion and search operations are very fast irrespective of the size of the data. Hash Table uses an array as a storage medium and uses hash technique to generate an index where an element is to be inserted or is to be located from.

arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra

Last synced: 12 Apr 2025

https://github.com/halyusa16/basic-sql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 16 May 2026

https://github.com/ashishsingh789/hr_analysis_dashboard

The HR Analyst Dashboard is an interactive Power BI tool that provides insights into HR metrics sourced from Excel. It focuses on data cleaning, transformation, and visualization, enabling stakeholders to explore key indicators like employee demographics and performance through intuitive charts.

dashboard data dataanalysis datacleaning powerbi-desktop visualization

Last synced: 06 Mar 2026

https://github.com/youmenomi/hydreigon

Are you looking for a Hydreigon to classify data for you? Come and catch it!

classify data hydreigon indexer items management pokemon sortable structure typescript

Last synced: 07 May 2025

https://github.com/os-climate/rmi-utility-transition-hub-ingestion-pipeline

Data ingest for RMI's Utility Transition Hub data (as of March 7, 2022)

data emissions-co2 energy-data os-climate

Last synced: 12 Apr 2025

https://github.com/christopherandrewtopalian/catopalian_javascript_data_navigator

A JavaScript application that allows for easy sorting of data. Easily navigate through any amount of data using button filters.

data javascript sorting

Last synced: 13 Apr 2025

https://github.com/webianks/anotech-android

Android application which deals on various anomalous behaviour that occur on server data.

anomaly-detection data server

Last synced: 13 Apr 2025

https://github.com/mawiegand/automatic-point-label-placement-data

Test instances for the automatic point label placement problem.

data datastructures generator javascript labeling problem ruby

Last synced: 16 May 2026

https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning

Best Master’s Programs in Machine Learning (ML) for 2021 These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning

ai api data education project school

Last synced: 17 Jun 2025

https://github.com/stdlib-js/array-base-assert-any-has-property

Test whether at least one element in a provided array has a specified property, either own or inherited.

any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate

Last synced: 07 May 2025

https://github.com/ramonrsv/f1_data

Provides consolidated access to various sources of Formula 1 information and data, including event schedules, session results, timing and telemetry data, as well as historical information about drivers, constructors, circuits, etc.

data f1 rust

Last synced: 07 Apr 2026

https://github.com/lorinczakos/sql-projects

This is a collection of my SQL scripts that I wrote and were approved through my course with GoIT Romania Data Analyst course

bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode

Last synced: 16 May 2026

https://github.com/the-tech-idea/beep.winform.sample

Application for Managing your Different DataSources . Still in Alpha.please be patient

application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows

Last synced: 08 Jul 2025

https://github.com/dolanmiu/mclaren-task

A front end assessment task for Mclaren

angular data observable observables rxjs

Last synced: 16 May 2026

https://github.com/miroslavvidovic/distribution-graphs

Creating ASCII graphical histograms in the terminal with https://github.com/philovivero/distribution

ascii data graph histogram python terminal

Last synced: 24 Apr 2026