An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/rrwen/poster-gisci-osmol

Conference poster and short paper titled "Outlier Detection in OpenStreetMap Data using the RandomForest Algorithm and Variable Contributions" for the GIScience Conference in 2016

2016 algorithm conference contribution data detection forest gis giscience learn machine open openstreetmap osm outlier paper poster random short variable

Last synced: 03 Apr 2025

https://github.com/rrwen/geohoods-to

Geospatial dataset of 1000+ aggregated variables for neighbourhoods in Toronto, ON, CA

csv data dataset geo geojson gis neighborhood neighborhoods neighbourhood neighbourhoods open open-data toronto toronto-open-data

Last synced: 25 Jun 2025

https://github.com/srindot/average_flightdata_collection_fwuav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Sep 2025

https://github.com/dina-hosny/calculate-installments-dates-and-amounts-plsql

PLSQL project to Calculate the installments dates and amounts for contracts

data plsql sql toad trigger

Last synced: 06 Mar 2026

https://github.com/The-Tech-Idea/Beep.winform.Sample

Application for Managing your Different DataSources . Still in Alpha.please be patient

application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows

Last synced: 04 Nov 2025

https://github.com/karensaraimoralesmontiel/8-week-sql-challenge

Case Studies Solutions for the 8-Week-SQL-Challenge.

data database sql

Last synced: 02 Jan 2026

https://github.com/codehard8/web-scrapping

In this repository we have provide a web scrapping project through beautifulSoup and related files

beutifulsoup data houses-for-sale python3 requests-library-python webscraping

Last synced: 01 Jul 2025

https://github.com/scanthe-net/scanthenet-php

PHP API Data Fetcher.

api data php scan scanner threat

Last synced: 25 Jul 2025

https://github.com/jonprice99/regional-election-analysis

An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.

data data-visualization election-analysis election-data pandas python

Last synced: 05 May 2026

https://github.com/aiwithqasim/p1_explore-weather-trends

In this project, I'll analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. Moreover i will use SQL query to extract data from the given Data base and i have to visualize the insight or Average temperature to find the findings.

data dataanalyst database datavisualization nanodegree udacity

Last synced: 22 May 2026

https://github.com/abshek7/big-data

A repository for documenting the learning related to theory and practical notes of big data computing.

big-data data data-engineering mapreduce pyspark

Last synced: 15 Jun 2025

https://github.com/rickyarians/practical-statistic-car-emission

Practical Statistic Project- Car Emission in Canada - 2022

data data-science dataanalysis r rmarkdown rpubs statistics

Last synced: 22 May 2026

https://github.com/ahmad-mtr/prjkt_exam_schedule_test

I hate scrolling in a list of 300+ courses of my Uni exam schedule, so I'm creating this. this's a test btw :)

data strings-manipulation

Last synced: 11 Apr 2025

https://github.com/gusgitmath/cnn_braintumor_classification

Built a CNN for MRI brain tumor classification (Glioma, Meningioma, No Tumor, Pituitary) with 99.4% accuracy. Used data augmentation, optimized learning rates (Adam), and included EarlyStopping, ReduceLROnPlateau for superior performance, averting overfitting. Boosts early, accurate diagnosis, advancing medical treatment.

classification convolutional-neural-networks data deep-learning machine-learning

Last synced: 25 Jul 2025

https://github.com/iamyourdre/naive-bayes-classifier-js

Naive Bayes classifier developed with MySQL, ExpressJS, and NodeJS by @iamyourdre.

backend data data-science expressjs javascript mysql naive-bayes naive-bayes-algorithm naive-bayes-classifier nodejs

Last synced: 08 Apr 2026

https://github.com/iyashwantsaini/tweetify_

Twitter Data Collection, Analysis Tool

collection data twitter twitter-sentiment-analysis

Last synced: 08 Mar 2026

https://github.com/sam-moen/data-analyst-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

data dataanalysis matplotlib mssql pandas powerbi python seaborn sql

Last synced: 08 Mar 2026

https://github.com/mobinx/easymeet-js

EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything

data meeting react realtime screensharing streaming-video webrtc zoom

Last synced: 03 Jan 2026

https://github.com/merrill007/sql-data-warehouse-project

The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.

architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver

Last synced: 22 Mar 2025

https://github.com/hivesolutions/crossline

Simple event pipping and storing infra-structure

counter data opencv warehouse

Last synced: 15 May 2026

https://github.com/GAMELEIRA/studies-database

Esse repositório têm como objetivo alocar todo e qualquer script para aprender e praticar gerenciamento de banco de dados SQL e NoSQL. Nesse projeto, serão consolidados os principais fundamentos e princípios, além da prática de exercícios e desenvolvimento de projetos.

data database mongodb mssql mysql nosql sql

Last synced: 03 May 2025

https://github.com/dcmox/moxymapper

Data mapping made easy

data json mapper

Last synced: 15 May 2026

https://github.com/richelbilderbeek/heyahmama

Data about the Flemish/Dutch band K3

band data k3 package r r-lang r-language

Last synced: 22 May 2026

https://github.com/engineeringmadness/gaming-ai-analytics

Using Databricks to analyze game reviews from Steam web store

data databricks llama pyspark semantic-layer

Last synced: 15 May 2026

https://github.com/luminati-io/linkedin-dataset-samples

Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.

data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping

Last synced: 17 Mar 2025

https://github.com/prernarohra/todo-webapp

Simple Todo App for practice.

axios css data fastapi html json python typescript

Last synced: 06 Apr 2026

https://github.com/rrwen/twitter2return

Module for extracting Twitter data using option objects

access api data extract geo get location media oauth object option post rest return sample social stream token tweet twitter

Last synced: 03 Apr 2025

https://github.com/theanujsinha01/data-analytics-portal-

Data Analytics Portal Built a web-based data analytics tool using Streamlit, Pandas, and Plotly. Supported CSV and Excel uploads (up to 200MB) for data exploration. Features included statistical summaries, group-by aggregation, and frequency counts. Integrated interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.

analytics data portal

Last synced: 28 Apr 2026

https://github.com/rameshaditya/dynamic-hybrid-data-grid

Facilitates faster read-and-write of large ordered collections of data.

algorithms data data-structures storage

Last synced: 30 Jun 2026

https://github.com/kirkalyn13/xyz-books-pipeline

XYZ Books Pipeline to check and update incoming ISBNs from newly added books from the CRUD UI, and record new data to a CSV file.

api csv data go http rabbitmq

Last synced: 05 Mar 2025

https://github.com/shailu2004/azure_big_data_project

This project demonstrates a comprehensive Azure Data Engineering workflow using multiple Azure resources to process and analyze an e-commerce dataset. The dataset consists of 8 files containing details about customers, payments, orders, and other key information

ai azure cloud data data-engineering

Last synced: 08 Jul 2025

https://github.com/ressuman/next-blog-1-project

Next.js with TypeScript: Fetching Data and Setting Up Routes. This project demonstrates my first experience with Next.js using TypeScript. It involves fetching posts from the JSON Placeholder dummy API, setting up pages, and linking routes.

api-rest data html-css-javascript jsx nextjs14 routing typescript

Last synced: 15 May 2026

https://github.com/jun-labs/json-handling

🔍 Json 데이터 핸들링 예제.

data gson jackson json json-object

Last synced: 15 May 2026

https://github.com/xylambda/data-structures-algorithms

This repository provides implementations of popular algorithms and abstract data types using JAVA.

algorithm algorithms array arraylist avl-tree data data-structures graph heap iterative java linked list netbeans queue recursive set stack tree

Last synced: 30 Jun 2026

https://github.com/realbxnnie/accountservice

A Simple DataStoreService wrapper with session backuping and session locking.

data lua luau roblox

Last synced: 29 Jul 2025

https://github.com/kashyap-prabhat/sigma

A Scala library for probability and statistics formulas, including rules for probability calculations.

data formulas library mathematics probability scala statistics

Last synced: 30 Jun 2026

https://github.com/chompfoods/stub-jaxrs-jersey

JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server

Last synced: 02 May 2026

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning

This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.

credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects

Last synced: 02 Sep 2025

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/ntnn/dataparse

Parsing, transforming and unmarshalling data.

data data-parser data-parsing data-transformation golang golang-lib

Last synced: 30 Jun 2026

https://github.com/ressuman/csv-writer-project

CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.

data javascript typescript

Last synced: 15 May 2026

https://github.com/alex0x4b/akutils

High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).

data dataframe pandas python

Last synced: 08 Mar 2026

https://github.com/yusuf4030/the-data-analyst-toolkit

📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 18 May 2026

https://github.com/jigyasag18/amazon-power-bi-dashboard

The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.

data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/jigyasag18/data-analysis-using-ms-excel

This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.

analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization

Last synced: 07 Mar 2026

https://github.com/iota-pico/data

IOTA Pico Framework Data Structures and Helpers

data iota iota-pico-framework javascript typescript

Last synced: 18 May 2026

https://github.com/ashishsingh789/quantium_data-analysis-_virtual-internship

Completed a job simulation focused on Data Analytics and Commercial Insights for the data science team. Developed expertise in data preparation and customer analytics, utilizing transaction datasets to extract valuable insights and deliver data-driven commercial recommendations

data datawrangling matplotlib pandas pandas-dataframe presentation programming python python-library

Last synced: 07 Apr 2026

https://github.com/cannt39t/data-mining-spider-vk

Паук который собирают всю информацию о рекламных постах в группе VK

data data-mining python3 vk vkontakte

Last synced: 05 Apr 2025

https://github.com/visual-layer/vl-documentation

This is the official repo for the Visual Layer documentation.

data deduplication enrichment visualization

Last synced: 04 Jul 2026

https://github.com/lambocreeper/spotify-visualiser

Visualise Spotify Data

data spotify visualise

Last synced: 21 Jul 2025

https://github.com/Axnjr/csv-parser-utils

Homework task for SWE position at Redhat.

csv data dataanalysis datatools pandas python

Last synced: 30 Oct 2025

https://github.com/thibautre/dataipsum

Configurable data generator (with crumbles inside)

algorithm data random-generation

Last synced: 21 Jul 2025

https://github.com/xuender/kstats

Golang statistics library package that supports v1.18+.

algorithms analytics data go golang kstats machine-learning math rounding statistics

Last synced: 20 Jul 2025

https://github.com/jlee9503/excel-projects

Fitness tracker dashboard, displaying users workout type, calories burned, and steps taken with multiple filters (gender, age, and workout intensity). Implemented using MS Excel.

dashboard data excel

Last synced: 16 Jan 2026

https://github.com/gui-sitton/games

Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/xjwllmsx/profitable-app-profiles

Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps

data data-analysis data-cleaning jupyter pandas python

Last synced: 18 May 2026

https://github.com/joshuadeguzman/xcraper

Python based stocks exchange data scraper

data pandas python stock-market

Last synced: 18 May 2026

https://github.com/shubhamsoni98/classification-with-decision-tree

This project predicts iPhone purchases using demographic data (gender, age, salary). A Decision Tree Classifier was used, achieving 88.16% accuracy. Insights from the model can refine marketing strategies, optimize product offerings, and boost sales by targeting key customer segments.

algorithms anaconda classification data data-science descision-tree jupyter-notebook machine-learning prediction python

Last synced: 19 Jan 2026

https://github.com/e-kotov/albofr

alboFr: Get French Data on Tiger Mosquito Colonisation

aedes-albopictus data france tiger-mosquito

Last synced: 11 Jun 2026

https://github.com/rid17pawar/friendscircle

Friends Circle is a console based application developed in cpp using Graph Data Structure.

cpp data graph graph-algorithms oop

Last synced: 08 Jun 2026

https://github.com/redatargaoui/dataconverter

Data conversion functionality to integrate into the software used for autism detection research.

apache-poi data dataconversion excel java

Last synced: 06 Sep 2025

https://github.com/raghavendranhp/attrition-alchemy

This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.

data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm

Last synced: 18 May 2026

https://github.com/afeiship/data-pagination

Raw data(items) pagination.

data next page pagination previous total

Last synced: 18 May 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data including details such as location, company, credit card usage, browser info, job roles and purchase price. It explores patterns in payment methods, spending behavior and online transactions. Using Pandas, Matplotlib and Seaborn, we clean analyze and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/svenruppert/_data_for_demos

Data used for demos

data datasets images ruppert sven

Last synced: 25 Jan 2026

https://github.com/luminati-io/google-search-api

Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.

data google-scraper html python serp-api web-scraping

Last synced: 25 Jun 2025

https://github.com/meltymooncakes/blockdata

Minecraft Block data

api data json minecraft minecraft-data

Last synced: 13 Apr 2025

https://github.com/bakangmonei/is_final_assignment

My intelligent systems assignment

data data-science intelligent-systems python

Last synced: 02 May 2026

https://github.com/pedrozamecki/datatube

Site Open Source para análise de dados de canais do YouTube.

data estatistica statistical-analysis statistics youtube

Last synced: 18 May 2026

https://github.com/inekipelov/swift-codable-advance

A library of extensions for Swift Codable protocols, simplifying the process of encoding and decoding objects.

codable data dictionary json swift

Last synced: 25 Jan 2026

https://github.com/fordinand45/bdp_a_kelompok_3

Project Big Data Python yang diadakan oleh Digitalent Kominfo. Berikut adalah yang ikut serta pada project, yaitu : Dhian Prameswari, Fordinand Pasaribu, dan Muhdad Alfaris Bachmid

data data-analytics data-science linear-regression python3

Last synced: 12 Apr 2026

https://github.com/mrk214/bible-data-es-spa

La Biblia en formato JSON

api bible biblia data god jesus json spanish

Last synced: 05 Apr 2025

https://github.com/naithikjorapur/practive-tanstacktsx

Practice TanStack with React, Vite, and TypeScript to build fast, type-safe apps. Leverage tools like TanStack Query for data management and Vite for a streamlined development experience.

data exercise fetching html-css-javascript json learning-by-doing practice query router tsx

Last synced: 05 Apr 2025

https://github.com/mekramy/ircity

Iran province, county and city data in json format.

data iran-city json mekramy

Last synced: 05 Apr 2025

https://github.com/michaelfromyeg/data

Data set dump.

data data-set

Last synced: 16 Jan 2026

https://github.com/fastbolt/entity-importer

Entity importing library for importing data from files (CSV and Excel currently) or API into doctrine.

data doctrine2 excel excel-import

Last synced: 17 Feb 2026

https://github.com/styd/sd_struct

Searchable Deep Struct

activesupport data gem openstruct rails ruby structure

Last synced: 18 May 2026

https://github.com/estherslabbert/sql

Using SQL working with student data

data python sql sqlite3

Last synced: 06 Apr 2025

https://github.com/caprogs/paris-events-analyzer

A project to analyze events in Paris using open source data provided by the city.

data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation

Last synced: 04 May 2026