An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/elijah-1994/pre-process-e-commerce-dataset

Importing, Cleaning, and Pre-Processing E-Commerce Data for Analysis Using MySQL.

analytics data dataanalytics datacleaning dataprocessing mysql mysql-database sql

Last synced: 11 Mar 2025

https://github.com/vishwas-chakilam/twitter-sentiment-analysis

Twitter Sentiment Analysis is a Python project that analyzes the sentiment of tweets based on a user-defined keyword. It uses Tweepy to fetch tweets from the Twitter API and TextBlob for sentiment analysis. The application features a user-friendly GUI with Tkinter, displaying tweet sentiment as positive, negative, or neutral.

api data data-science dataanalysis python3 textblob-sentiment-analysis tkinter tweepy-api

Last synced: 11 Mar 2025

https://github.com/syed-bakhtawar-fahim/dsa_algorithm_code

Assalamu Alaikum, everyone. This repo covers data structures and algorithms in the C programming language. I hope it helps you learn them in C. I'm also learning data structures and algorithms in Python in a better, easier way, which you can explore as well.

algorithm algorithms-and-data-structures c data data-structures-and-algorithms dsa-algorithm dsa-learning-series dsa-practice

Last synced: 12 Apr 2025

https://github.com/juanandres-montero/dataanalysis

Dedicated to data analysis.

costa-rica data

Last synced: 10 Aug 2025

https://github.com/ferru97/jsketchfabcrawler

jSketchfabCrawler is a Java tool for automatically crawling model information from sketchfab.com

crawler data database java sketchfab sql

Last synced: 03 Jan 2026

https://github.com/alecxcode/table-parser

Python Table Parser (data extraction)

automation data extraction python robotic-process-automation

Last synced: 04 May 2026

https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper

A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 27 Sep 2025

https://github.com/debjyotisaha/tableau-projects-phase-2

Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.

dashboards data data-analysis data-visualisation tableau

Last synced: 26 Aug 2025

https://github.com/mateuszskoczek/generatorcsv

GeneratorCSV is a student and teacher data converter for the Microsoft 365 Admin Center. The project was implemented for Sobolew High School.

admin converter data microsoft365 python school tkinter

Last synced: 26 Aug 2025

https://github.com/kaiepi/ra-annotations

Thread-safe static buffer

data type

Last synced: 13 Jul 2025

https://github.com/karosi12/ng-data-share

Angular communication with input and output properties

angular communication data data-binding input output sharing typescript

Last synced: 16 Jan 2026

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/code-str8/time-series-forecasting

Developing a model that effectively forecasts the unit sales of numerous items across various Favorita stores with precision.

data dataanalysis forcasting machine-learning time-series visualizations

Last synced: 31 Mar 2025

https://github.com/scx567888/scx-data

✨ SCX Data

data java scx

Last synced: 05 Apr 2025

https://github.com/franckalbinet/maris-crawlers

Automated data harvesting of MARIS data sources

automation data marine-radioactivity

Last synced: 25 Aug 2025

https://github.com/luminati-io/google-maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 03 Jan 2026

https://github.com/zoekelepiri/winedataprediction

A machine learning application in wine quality prediction

data descriptive-statistics machine-learning-algorithms

Last synced: 05 Jan 2026

https://github.com/greedchikara/dsajs

Data Structures and Algorithms written in JavaScript

algorithms data structures

Last synced: 09 Apr 2026

https://github.com/jun-labs/jq

🧷 Let's practice jq.

data jq json json-data parse

Last synced: 27 Sep 2025

https://github.com/nagipragalathan/linkedin_backup_datas

This repository contains the backup data from my previous LinkedIn account. Unfortunately, my old LinkedIn account was compromised and subsequently blocked by LinkedIn. As a result, I created a new account, but that too got blocked for reasons unknown to me.

backup blocked data linkedin linkedin-account memory nagipragalathan recovery storage

Last synced: 18 Jan 2026

https://github.com/thesfinox/dup-backup

Simple script to backup data with Duplicity to a personal WebDAV server.

backup bash data duplicity script server webdav

Last synced: 28 Apr 2026

https://github.com/plateformeio/docs

The official documentation of the Plateforme framework

api app asgi async data db docs fastapi plateforme pydantic python restx services sqlalchemy

Last synced: 11 Apr 2026

https://github.com/unkaktus/pktconn

wrapper around io.ReadWriteCloser that implements gopacket's 'device'

connection data gopacket packet

Last synced: 29 May 2026

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I used a combination of SQL, Tableau, and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/idhruvs/angular4-smart-table-demo

Angular4 Smart Table Demo Project

angular4 data tables typescript

Last synced: 21 Apr 2026

https://github.com/laguer/jupyt-nb

Ratios of mathematical and physical constants in cosmology and microphysics

analysis constants cosmology data dimensional julia mathematical micro notebook physical physics python ratios science

Last synced: 13 Apr 2026

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/pietrapaz/bootcamp_dio_ciencia_de_dados

Potência Tech Bootcamp powered by iFood | Data Science - DIO ⚠️

cienciadedados dados data datascience python

Last synced: 09 Apr 2025

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system

Consolidates scattered student performance records into a unified data warehouse in SQL Server, builds interactive Power BI dashboards that visualize academic trends and student performance, and implements predictive analytics.

analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server

Last synced: 29 May 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some basic machine learning techniques (e.g. MinMaxScaler, StandardScaler, TF-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only Python, NumPy, and pandas. This will enable me to hone my data science skills.

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026
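
As an illustration of two of the techniques named in the entry above, here is a minimal sketch of min-max scaling and standardization in plain Python. It is not the repository's code: the function names `min_max_scale` and `standardize` are hypothetical, and the choice of the population standard deviation and the all-zeros result for constant input are assumptions made for this example.

```python
from statistics import fmean, pstdev


def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        # Assumption: a constant column is mapped to all zeros.
        return [0.0 for _ in values]
    return [(v - lo) / span for v in values]


def standardize(values):
    """Shift to zero mean and scale to unit (population) standard deviation."""
    mean = fmean(values)
    std = pstdev(values)
    if std == 0:
        # Assumption: a constant column is mapped to all zeros.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]
```

For example, `min_max_scale([2, 4, 6])` maps the middle value to 0.5, and `standardize([1, 2, 3])` returns values centered on zero.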

https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration

This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.

covid-19 data data-exploration dataset sql

Last synced: 19 Feb 2026

https://github.com/blackroad-os-inc/blackroad-portal

BlackRoad Portal — unified search routing to 30+ BlackRoad services.

blackroad cloudflare-workers data search

Last synced: 04 Apr 2026

https://github.com/thedevreda/jadaerospace

A real-life project showing how to improve aircraft parts sales and help salespeople focus on the most effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025

https://github.com/ahmad-ali-rafique/wine-quality-dataset

Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.

analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model

Last synced: 21 Aug 2025

https://github.com/anct-cartographie-nationale/mednum-cli

✨ Command-line interface for transforming data on digital mediation locations, collected in a non-standard format, into the mednum schema and publishing it on data.gouv

anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation

Last synced: 02 Aug 2025

https://github.com/rikiitokazu/dataprojects

Data analysis practice using SQL and Python

data python sql web-scraping

Last synced: 12 Apr 2026

https://github.com/publici/state-integrity-data

Data from a comprehensive assessment of state government accountability and transparency

data

Last synced: 04 Feb 2026

https://github.com/soenneker/soenneker.constants.data

A set of commonly used constants related to various types of data

constants csharp data dotnet

Last synced: 12 Mar 2026

https://github.com/andypicke/ev_station_explorer

Shiny App to visualize electric-vehicle charging station data

data electric-vehicles r shiny-apps visualization

Last synced: 29 Jul 2025

https://github.com/veronikagregorec/excel-data-analytics

Excel for data analytics from beginner to advanced

cleaning data excel formulas tables xlookup

Last synced: 21 Jan 2026

https://github.com/romtaug/scoring-stoxx

Scoring and portfolio construction for the STOXX, CAC, and DAX via Wikipedia scraping, with results sent by email - yfinance

api data emailing portfolio scoring stoxx wikipedia yfinance

Last synced: 05 Sep 2025

https://github.com/theanujsinha01/mcdonalds-customer-analysis

This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.

case-study data data-visualization dataanalysis

Last synced: 05 Sep 2025

https://github.com/plurid/defocus

Apophatic User Content Resolution [Desearch Concept]

data

Last synced: 08 Nov 2025

https://github.com/jstafford5380/provausio.testing.generators

Generate fake data for testing and/or mocking

data fake-data generator testing

Last synced: 14 Jan 2026

https://github.com/giscience/measures-rest-oshdb-app

A frontend for providing measures for geospatial datasets, using the OSHDB

data dggs geospatial measure openstreetmap rest

Last synced: 20 Apr 2026

https://github.com/plurid/delog

Cloud Service for Centralized Logging

cloud data logging

Last synced: 08 Nov 2025

https://github.com/ayushverma135/dbms-labfile

Created for practical learning, this DBMS lab file offers hands-on exercises covering SQL queries, normalization, indexing, and more. With clear instructions and sample datasets, students gain invaluable experience in database design and management.

data dbms dbms-lab

Last synced: 04 Feb 2026

https://github.com/goto-eof/bitmaptize

Wraps data inside a .bmp and extracts data from .bmp.

bitmap bmp convert data wrap

Last synced: 18 Jan 2026

https://github.com/bhojpur/dlm

The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.

data lifecycle-management

Last synced: 19 Feb 2026

https://github.com/kahlery/my-jupyter-notebook-projects

🐊 A collection of my data science analyses. I actually store most of my data science projects in Google Drive because I use Google Colab.

data jupyter-notebook python

Last synced: 12 Apr 2026

https://github.com/bmcollier/contiguous

Provides COBOL-style contiguous data structures in Python

cobol contiguous data python

Last synced: 14 Jan 2026

https://github.com/rachelresende/projeto-finan-as

This repository relates to a course on data analysis for finance that I took on Udemy in 2025.

analytics data financas finance finance-management

Last synced: 19 Aug 2025

https://github.com/rugwiroparfait/alx_sql

This repo is where I save my queries and learning materials from the ALX Data Science program.

anaconda data data-analysis jupyter-notebook sql

Last synced: 19 Aug 2025

https://github.com/h4fide/politicalcompassbot

This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!

bot data greedy-algorithms politics python python3 sql telegram

Last synced: 19 Aug 2025

https://github.com/veivel/f1-sentiment-analysis

A sentiment analysis project on tweets about Formula 1. To be reworked.

data f1 nlp-library nlp-machine-learning

Last synced: 04 Jul 2025

https://github.com/chaewonkong/kaggle-competitions

Kaggle competitions and lessons

ai data kaggle-competition ml

Last synced: 15 Mar 2025

https://github.com/spajai/etl-sharepoint-data-uploader-pipeline

Custom Python script to pull specific data from a source and upload it to Microsoft SharePoint

data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online

Last synced: 11 Nov 2025

https://github.com/vara-co/tech-certifications

These are the certifications that back up some of my skills.

certificates certifications data data-analytics skills

Last synced: 07 Jan 2026

https://github.com/remidumas/rstats

RStats weblog

data ia r science stats

Last synced: 25 Mar 2025

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation, and documentation using Swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/i-rzr-i/domaincommonextensions

This library provides the most relevant and commonly used extension methods for application development. They help improve code quality and writing speed, and free up dev team time for more complex functionality.

api class data datatype extension helper object parser type util

Last synced: 20 Sep 2025

https://github.com/afnanenayet/ds-a

Some interview prep I've been doing. This repo contains reimplementations of algorithms and data structures in Python 3

algorithms data interview prep python structures

Last synced: 05 Apr 2025

https://github.com/afolabi022/getting-and-cleaning-data-course-project

Tidy Dataset Creation for Human Activity Recognition: this repository contains the code and files for cleaning and transforming the Human Activity Recognition Using Smartphones dataset into a tidy format. The project demonstrates data wrangling skills in R, including merging datasets

data data-science datacleaning r

Last synced: 25 Mar 2025

https://github.com/miraclx/split-merge

Efficient, flexible data stream chunker and merger

chunk data efficient merge middleware nodejs pipeline split stream

Last synced: 07 May 2026

https://github.com/plurid/datasign

Single Source of Truth Data Contract Specifier

data file-format

Last synced: 08 Nov 2025

https://github.com/doughtnerd/pod-old

Read and write Excel data

data data-analysis excel poi-library workbook

Last synced: 21 Jan 2026

https://github.com/vishwas-chakilam/hr-dashboard

This project involves creating an interactive HR Dashboard using Power BI for visualization and MySQL for data cleaning and analysis. It provides insights into employee performance, attrition, salary distribution, and hiring trends.

dashboard data datac datacleaning datavisualization mysql powerbi

Last synced: 23 Mar 2025

https://github.com/nushratjabenaurnima/cse_477_data_mining

A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.

data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3

Last synced: 09 Apr 2026

https://github.com/nsandoya/python_scrp_project

This tool is made specifically for the Dipaso e-commerce website. You can extract data from it, analyze it, and see keyword, brand, and category frequencies, price distributions, and other market trends, all in a set of friendly statistical tables and charts (exported from a Jupyter notebook) :)

beautifulsoup4 data data-analysis jupyter-notebook pandas python3

Last synced: 28 Apr 2026

https://github.com/hakusaro/facts

A fact based knowledge system (FBKS) experiment.

data facts hacktoberfest

Last synced: 03 Jan 2026

https://github.com/miniql/miniql-inline

A MiniQL query resolver for inline data.

data query query-language

Last synced: 27 May 2026

https://github.com/wittyicon29/zeotap-ds-assignment

Internship application assignment

data data-science

Last synced: 19 Aug 2025

https://github.com/ahmad-ali-rafique/linear-regression-modeling

In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models

Last synced: 19 Apr 2026

https://github.com/dms-codes/scrape-tokoalvabet-com

Toko Alvabet Data Scraping and Price Comparator. This Python script is designed to scrape data from Toko Alvabet's website and perform price comparison for the obtained products. It includes features for viewing and analyzing product data, as well as comparing prices with other sellers.

data price python scraping

Last synced: 29 Jul 2025

https://github.com/allanotieno254/powerbi-dax-filter-context

This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.

business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization

Last synced: 08 Jan 2026

https://github.com/edjoukou/human_resources

A data analysis project using MySQL Server database

analysis data mysql powerbi sql visualization

Last synced: 25 Sep 2025

https://github.com/elkingarcia11/mlb-gameday-obp-odds

Small Python script that pulls MLB team on-base percentage (OBP) for the current season, loads today’s schedule, and writes CSV files that list each team’s OBP edge against its opponent for the day. It also labels each side of a game as betting favorite, not favorite, or equal using American moneylines from ESPN’s public game data.

api csv data http https json mlb mlb-stats-api moneyline odds python rest sports urllib

Last synced: 30 May 2026
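
The labelling described in the entry above can be sketched roughly as follows. This is an illustrative assumption, not the repository's code: the function names are hypothetical, the OBP edge is assumed to be a simple difference, and sides are compared by the implied probability of their American moneylines.

```python
def implied_probability(moneyline: int) -> float:
    """Convert an American moneyline to its implied win probability."""
    if moneyline < 0:
        # Negative line: stake |moneyline| to win 100.
        return -moneyline / (-moneyline + 100)
    # Positive line: stake 100 to win `moneyline`.
    return 100 / (moneyline + 100)


def label_side(own_moneyline: int, opp_moneyline: int) -> str:
    """Label one side of a game relative to its opponent."""
    own = implied_probability(own_moneyline)
    opp = implied_probability(opp_moneyline)
    if own > opp:
        return "favorite"
    if own < opp:
        return "not favorite"
    return "equal"


def obp_edge(team_obp: float, opp_obp: float) -> float:
    """Assumed definition: team OBP minus opponent OBP, rounded to 3 places."""
    return round(team_obp - opp_obp, 3)
```

For instance, `label_side(-150, 130)` labels the -150 side as the favorite, since a more negative moneyline implies a higher win probability.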

https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization

A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.

api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode

Last synced: 14 Apr 2026

https://github.com/nel-zi/insighthire_agency

Built a web scraping solution using BeautifulSoup to extract job listings from MyJobMag, cleaned the data, and loaded it into PostgreSQL with SQLAlchemy for better job data management.

data dataloading datatransformation sql webscraping

Last synced: 16 May 2025

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025