An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kylepw/multistack

Example of multiple stacks in one array.

algorithms array data data-structures python stack

Last synced: 17 Mar 2025

https://github.com/farovictor/mongodbloader

This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.

data go mongodb pipeline

Last synced: 15 May 2026

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data including details such as location, company, credit card usage, browser info, job roles and purchase price. It explores patterns in payment methods, spending behavior and online transactions. Using Pandas, Matplotlib and Seaborn, we clean analyze and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/josemartinezrdev/logisticadb

Logistica Database

data ddl diagrama dml mysql sql

Last synced: 09 Jul 2025

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/stdlib-js/array-base-banded-filled2d-by

Create a filled two-dimensional banded nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 19 May 2026

https://github.com/afeiship/data-pagination

Raw data(items) pagination.

data next page pagination previous total

Last synced: 18 May 2026

https://github.com/redatargaoui/dataconverter

Data conversion functionality to integrate into the software used for autism detection research.

apache-poi data dataconversion excel java

Last synced: 06 Sep 2025

https://github.com/e-kotov/albofr

alboFr: Get French Data on Tiger Mosquito Colonisation

aedes-albopictus data france tiger-mosquito

Last synced: 11 Jun 2026

https://github.com/shubhamsoni98/classification-with-decision-tree

This project predicts iPhone purchases using demographic data (gender, age, salary). A Decision Tree Classifier was used, achieving 88.16% accuracy. Insights from the model can refine marketing strategies, optimize product offerings, and boost sales by targeting key customer segments.

algorithms anaconda classification data data-science descision-tree jupyter-notebook machine-learning prediction python

Last synced: 19 Jan 2026

https://github.com/stdlib-js/array-base-index-of-same-value

Return the index of the first element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 15 May 2026

https://github.com/dina-hosny/sequence-trigger-pair-for-all-schema-tables-plsql

A PLSQL script that creates Sequence Trigger Pair for all Schema's Tables

data oracle plsql sequence sequencetrigger sql toad trigger

Last synced: 06 Mar 2026

https://github.com/lord3008/instances-of-data-analysis

This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.

data data-analysis

Last synced: 03 Mar 2025

https://github.com/francois-lenne/portofolio_flenne_streamlit

portofolio francois lenne using streamlit

data portofolio python slack-api streamlit

Last synced: 15 May 2026

https://github.com/joshuadeguzman/xcraper

Python based stocks exchange data scraper

data pandas python stock-market

Last synced: 18 May 2026

https://github.com/erkylima/algorithms

Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! πŸš€ #Python #Algorithms #DataStructures

algorithms algorithms-and-data-structures data data-structures

Last synced: 19 Jan 2026

https://github.com/phette23/nces-ipeds-archive

download NCES IPEDS data

data datarescue ipeds nces

Last synced: 30 Jun 2026

https://github.com/hadarsharon/grizzlys

User-friendly Python DataFrames πŸ”΅πŸŸ‘ powered by Julia πŸ”΄πŸŸ’πŸŸ£

big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python

Last synced: 18 May 2026

https://github.com/jlee9503/excel-projects

Fitness tracker dashboard, displaying users workout type, calories burned, and steps taken with multiple filters (gender, age, and workout intensity). Implemented using MS Excel.

dashboard data excel

Last synced: 16 Jan 2026

https://github.com/xuender/kstats

Golang statistics library package that supports v1.18+.

algorithms analytics data go golang kstats machine-learning math rounding statistics

Last synced: 20 Jul 2025

https://github.com/thibautre/dataipsum

Configurable data generator (with crumbles inside)

algorithm data random-generation

Last synced: 21 Jul 2025

https://github.com/Axnjr/csv-parser-utils

Homework task for SWE position at Redhat.

csv data dataanalysis datatools pandas python

Last synced: 30 Oct 2025

https://github.com/lambocreeper/spotify-visualiser

Visualise Spotify Data

data spotify visualise

Last synced: 21 Jul 2025

https://github.com/madihanazir/ds-using-c

Basic insights into Data Structures (inspired by Abdul Bari course but in C language)

data self-learning structures-in-c

Last synced: 17 Mar 2025

https://github.com/webobite/fact-chatbot

A Fact chatbot is a project in which it read a txt file which consist all facts ahead of time and answer the user with some useful information regarding the same on the basis of facts provided in text file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/dan149/uselesscontentcreator

Useless Content Creator (UCC) is a fake content generator, text, html and pdf files.

content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3

Last synced: 03 Apr 2025

https://github.com/brunosalerno/osm_data

Ruby objects for dealing with OSM data, and generating XML files

data openstreetmap ruby xml

Last synced: 21 Apr 2026

https://github.com/ashishsingh789/quantium_data-analysis-_virtual-internship

Completed a job simulation focused on Data Analytics and Commercial Insights for the data science team. Developed expertise in data preparation and customer analytics, utilizing transaction datasets to extract valuable insights and deliver data-driven commercial recommendations

data datawrangling matplotlib pandas pandas-dataframe presentation programming python python-library

Last synced: 07 Apr 2026

https://github.com/garcane/layoffs-exploratory-data-analysis

This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.

data dataanalysis eda mysql sql

Last synced: 29 Oct 2025

https://github.com/webdevcave/collections-php

A PHP library for managing collections of data with support for nested keys.

array collection data helper library nested-keys package php utility utility-classes

Last synced: 28 Jun 2026

https://github.com/iota-pico/data

IOTA Pico Framework Data Structures and Helpers

data iota iota-pico-framework javascript typescript

Last synced: 18 May 2026

https://github.com/jigyasag18/data-analysis-using-ms-excel

This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.

analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization

Last synced: 07 Mar 2026

https://github.com/jigyasag18/amazon-power-bi-dashboard

The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.

data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/yusuf4030/the-data-analyst-toolkit

πŸ“Š Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 18 May 2026

https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java

Hash Table is a data structure which stores data in an associative manner. In a hash table, data is stored in an array format, where each data value has its own unique index value. Access of data becomes very fast if we know the index of the desired data. Thus, it becomes a data structure in which insertion and search operations are very fast irrespective of the size of the data. Hash Table uses an array as a storage medium and uses hash technique to generate an index where an element is to be inserted or is to be located from.

arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra

Last synced: 12 Apr 2025

https://github.com/cannt39t/data-mining-spider-vk

ΠŸΠ°ΡƒΠΊ ΠΊΠΎΡ‚ΠΎΡ€Ρ‹ΠΉ ΡΠΎΠ±ΠΈΡ€Π°ΡŽΡ‚ всю ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΡŽ ΠΎ Ρ€Π΅ΠΊΠ»Π°ΠΌΠ½Ρ‹Ρ… постах Π² Π³Ρ€ΡƒΠΏΠΏΠ΅ VK

data data-mining python3 vk vkontakte

Last synced: 05 Apr 2025

https://github.com/pbinkley/tweets-online-classes-covid19

A twarc harvest of tweets related to online classes during the COVID-19 outbreak, starting 2020-03-02

data social

Last synced: 06 Mar 2026

https://github.com/luminovrym/crawler-tools-js

Crawler Tools Js adalah sebuah aplikasi yang digunakan untuk scrapping data pada sebuah web

crawler crawler-js data js web-scraping

Last synced: 08 Sep 2025

https://github.com/siongui/xemaauj9k5qn34x88m4h

No source code. Only serve JSON files of Pāli words

data go json pali

Last synced: 15 May 2026

https://github.com/gui-sitton/games

Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/devprnvk/pycryptochain

A implementation of a blockchain-based cryptocurrency in Python. This project aims to provide a fundamental understanding of blockchain technology and cryptocurrency by building a basic version from scratch. Features include blockchain creation, transaction handling, mining rewards, simulation.

blockchain crypto data decryption encryption hashing processing py python salting storage

Last synced: 09 Mar 2026

https://github.com/xjwllmsx/profitable-app-profiles

Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps

data data-analysis data-cleaning jupyter pandas python

Last synced: 18 May 2026

https://github.com/rellyson/data-engineering-tools

This repository holds examples and documentation about the most used tools in the data engineering ecosystem.

apache-airflow apache-spark data data-engineering jupyter-notebook python tools

Last synced: 17 Jan 2026

https://github.com/bastianolea/servel_elecciones

Resultados electorales desde Servel (2024)

chile comunas data elecciones genero

Last synced: 08 Jul 2025

https://github.com/bastianolea/mineduc_matriculas_superior

Bases de datos de estudiantes matriculados en EducaciΓ³n Superior

chile comunas data educacion social

Last synced: 16 Jun 2026

https://github.com/rid17pawar/friendscircle

Friends Circle is a console based application developed in cpp using Graph Data Structure.

cpp data graph graph-algorithms oop

Last synced: 08 Jun 2026

https://github.com/raghavendranhp/attrition-alchemy

This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.

data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm

Last synced: 18 May 2026

https://github.com/the-universal-linux-society/sysreport

Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.

analysis bash bash-script bash-scripting data report reporting system

Last synced: 15 May 2026

https://github.com/meltymooncakes/blockdata

Minecraft Block data

api data json minecraft minecraft-data

Last synced: 13 Apr 2025

https://github.com/fastpix/flutter-core-data-sdk

A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.

analyt dart data flutter

Last synced: 15 May 2026

https://github.com/pedrozamecki/datatube

Site Open Source para anΓ‘lise de dados de canais do YouTube.

data estatistica statistical-analysis statistics youtube

Last synced: 18 May 2026

https://github.com/inekipelov/swift-codable-advance

A library of extensions for Swift Codable protocols, simplifying the process of encoding and decoding objects.

codable data dictionary json swift

Last synced: 25 Jan 2026

https://github.com/fordinand45/bdp_a_kelompok_3

Project Big Data Python yang diadakan oleh Digitalent Kominfo. Berikut adalah yang ikut serta pada project, yaitu : Dhian Prameswari, Fordinand Pasaribu, dan Muhdad Alfaris Bachmid

data data-analytics data-science linear-regression python3

Last synced: 12 Apr 2026

https://github.com/dscamilo/gestion-clientes-springboot

Proyecto de gestiΓ³n de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyecciΓ³n de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.

data interface java lombok-maven restcontroller spring-boot

Last synced: 15 May 2026

https://github.com/michaelfromyeg/data

Data set dump.

data data-set

Last synced: 16 Jan 2026

https://github.com/estherslabbert/sql

Using SQL working with student data

data python sql sqlite3

Last synced: 06 Apr 2025

https://github.com/caprogs/paris-events-analyzer

A project to analyze events in Paris using open source data provided by the city.

data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation

Last synced: 04 May 2026

https://github.com/webianks/anotech-android

Android application which deals on various anomalous behaviour that occur on server data.

anomaly-detection data server

Last synced: 13 Apr 2025

https://github.com/shrutakeerti/eye-gaze-detection

This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15

ai aiml data eda eeg eeg-signals eye jodhpur mlflow

Last synced: 17 Mar 2025

https://github.com/mksingh431/sql-complete-notes

SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.

data database sql sql-server

Last synced: 21 Apr 2026

https://github.com/clagiordano/weblibs-data-export

Library for generic data export to various formats

clagiordano data export weblibs xlsx

Last synced: 01 Jul 2026

https://github.com/frnt-end/ts-context-items-list

βš›οΈ React Typescript project - Fetch data and display it as a list of 10 items in 10 (pagination) pages. click on each item leads to more details page- using axios, Context and Styled Components.

api axios context context-api data fetch list pagination router router-dom styled-components typescript

Last synced: 19 May 2026

https://github.com/afnanenayet/academic-pinetable

A revamp of the Dartmouth academic timetable. Designed to be intuitive and make searching for classes much easier.

dartmouth data design dev python scraping ui web

Last synced: 11 Jan 2026

https://github.com/tkxwaweru/python_data_manipulation

Manipulating the MASSIVE dataset using python

data dataanalysis excel python

Last synced: 11 Jan 2026

https://github.com/pcpp94/elexon_pipeline_gb_demand

Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.

data electricity elexon gb octopusenergy power powerdata pypsa uk

Last synced: 12 Jul 2025

https://github.com/henryssondaniel/teacup-service-visualization-mysql-java

Connect your Teacup visualization data to a MySQL database

data mysql service teacup visualization

Last synced: 19 May 2026

https://github.com/christopherandrewtopalian/catopalian_javascript_data_navigator

A JavaScript application that allows for easy sorting of data. Easily navigate through any amount of data using button filters.

data javascript sorting

Last synced: 13 Apr 2025

https://github.com/phtrempe/l2a

This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.

applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow

Last synced: 05 May 2026

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/halyusa16/basic-sql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 16 May 2026

https://github.com/echang1802/normandy

Normandy is a python framework for data pipelines, which main objective is standardizing your team code and provide a data treatment methodology flexible to your team needs.

analytics business-intelligence data dataengineering datascience etl pipeline

Last synced: 11 Mar 2026

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice graph showing and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql

This project involves cleaning a dataset containing information about layoffs from companies around the world.

data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql

Last synced: 08 Jun 2026

https://github.com/rllyhz/mini-data-center

This repo is to fulfill my internship assignment at the Office of Communication and Information (Kominfo), Balai Kota, Semarang, Indonesia

chartjs country-information data information-visualization laravel laravel-application

Last synced: 06 Nov 2025

https://github.com/srindot/average_flightdata_collection_fwuav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Sep 2025