An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/stdlib-js/array-base-assert-any-has-property

Test whether at least one element in a provided array has a specified property, either own or inherited.

any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate

Last synced: 07 May 2025
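The check described in the entry above (does at least one element have a given property, own or inherited?) can be sketched in Python as a rough analogue. This is not the stdlib-js API: the function name `any_has_attr` and the example classes are hypothetical, and Python's `hasattr` stands in for JavaScript's own-or-inherited property lookup.

```python
from typing import Any, Iterable


def any_has_attr(items: Iterable[Any], name: str) -> bool:
    """Return True if at least one item exposes the attribute `name`.

    `hasattr` finds attributes set on the instance as well as those
    inherited from its class hierarchy.
    """
    return any(hasattr(item, name) for item in items)


# Hypothetical example classes, used only for illustration.
class Base:
    kind = "base"  # class-level attribute, inherited by subclasses


class Child(Base):
    def __init__(self) -> None:
        self.size = 3  # instance-level ("own") attribute


items = [object(), Child()]
has_own = any_has_attr(items, "size")        # set on the Child instance
has_inherited = any_has_attr(items, "kind")  # inherited from Base
has_missing = any_has_attr(items, "color")   # defined nowhere
```

Using `any` with a generator stops at the first match, so the remaining elements are not inspected.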

https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning

Best Master’s Programs in Machine Learning (ML) for 2021. These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning.

ai api data education project school

Last synced: 17 Jun 2025

https://github.com/mawiegand/automatic-point-label-placement-data

Test instances for the automatic point label placement problem.

data datastructures generator javascript labeling problem ruby

Last synced: 16 May 2026

https://github.com/webianks/anotech-android

Android application that deals with various anomalous behaviours that occur in server data.

anomaly-detection data server

Last synced: 13 Apr 2025

https://github.com/dina-hosny/calculate-installments-dates-and-amounts-plsql

PL/SQL project to calculate installment dates and amounts for contracts.

data plsql sql toad trigger

Last synced: 06 Mar 2026

https://github.com/christopherandrewtopalian/catopalian_javascript_data_navigator

A JavaScript application that allows for easy sorting of data. Easily navigate through any amount of data using button filters.

data javascript sorting

Last synced: 13 Apr 2025

https://github.com/os-climate/rmi-utility-transition-hub-ingestion-pipeline

Data ingest for RMI's Utility Transition Hub data (as of March 7, 2022)

data emissions-co2 energy-data os-climate

Last synced: 12 Apr 2025

https://github.com/frnt-end/ts-context-items-list

⚛️ React TypeScript project: fetch data and display it as a list of 10 items across 10 paginated pages. Clicking an item leads to a details page. Built with axios, Context, and Styled Components.

api axios context context-api data fetch list pagination router router-dom styled-components typescript

Last synced: 19 May 2026

https://github.com/youmenomi/hydreigon

Are you looking for a Hydreigon to classify data for you? Come and catch it!

classify data hydreigon indexer items management pokemon sortable structure typescript

Last synced: 07 May 2025

https://github.com/ashishsingh789/hr_analysis_dashboard

The HR Analyst Dashboard is an interactive Power BI tool that provides insights into HR metrics sourced from Excel. It focuses on data cleaning, transformation, and visualization, enabling stakeholders to explore key indicators like employee demographics and performance through intuitive charts.

dashboard data dataanalysis datacleaning powerbi-desktop visualization

Last synced: 06 Mar 2026

https://github.com/henryssondaniel/teacup-service-visualization-mysql-java

Connect your Teacup visualization data to a MySQL database

data mysql service teacup visualization

Last synced: 19 May 2026

https://github.com/halyusa16/basic-sql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 16 May 2026

https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java

A hash table is a data structure that stores data in an associative manner. Data is held in an array, where each value is stored at an index derived from its key. Access is very fast when the index of the desired item is known, so insertion and search are fast on average regardless of the size of the data. A hash table uses an array as its storage medium and a hash function to generate the index at which an element is inserted or looked up.

arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra

Last synced: 12 Apr 2025
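The hash table described in the entry above can be sketched as follows. This is a minimal illustration of an array of linked-list buckets (separate chaining), not the repository's Java code: the class names `Node` and `ChainedHashTable` and the tiny capacity are assumptions chosen to force a collision.

```python
from typing import Any, Optional


class Node:
    """One key/value pair in a bucket's singly linked list."""

    def __init__(self, key: Any, value: Any, next_node: "Optional[Node]" = None):
        self.key = key
        self.value = value
        self.next = next_node


class ChainedHashTable:
    """Hash table backed by a fixed-size array of linked-list buckets."""

    def __init__(self, capacity: int = 8) -> None:
        self.buckets: list = [None] * capacity

    def _index(self, key: Any) -> int:
        # The hash function maps a key to a slot in the backing array.
        return hash(key) % len(self.buckets)

    def put(self, key: Any, value: Any) -> None:
        i = self._index(key)
        node = self.buckets[i]
        while node is not None:
            if node.key == key:
                node.value = value  # key already present: overwrite
                return
            node = node.next
        # Key not found: prepend a new node to this bucket's list.
        self.buckets[i] = Node(key, value, self.buckets[i])

    def get(self, key: Any, default: Any = None) -> Any:
        node = self.buckets[self._index(key)]
        while node is not None:
            if node.key == key:
                return node.value
            node = node.next
        return default


# Capacity 2 makes keys 1 and 3 share a bucket, exercising the chain.
table = ChainedHashTable(capacity=2)
for k, v in [(1, "one"), (2, "two"), (3, "three")]:
    table.put(k, v)
table.put(2, "TWO")  # overwrites the earlier value for key 2
```

Lookups stay fast as long as the chains stay short; a fuller implementation would typically grow the array once the table fills up.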

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/webobite/fact-chatbot

A fact chatbot that reads a text file containing all the facts ahead of time and answers the user with useful information based on the facts provided in that file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/sibeux/redesigned-broccoli

Repository for storing music file data.

data data-center nasrulwahabi sibeux

Last synced: 24 Jan 2026

https://github.com/giosil/export-as

A convenience library for exporting data in different formats.

data data-export export exporter java

Last synced: 26 Jul 2025

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice plotting graphs and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/1sumer/mass-mail-automation

Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.

data oops-in-python python smtp-server tkinter

Last synced: 20 Aug 2025

https://github.com/nanis/unitedat

Unify data sets which consist of separate files with a common header repeated in each one.

cli data etl utility

Last synced: 12 Apr 2025
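The idea in the entry above, merging files that each repeat the same header, can be sketched as follows. This is not the unitedat implementation: it assumes the parts are CSV text, and the function name `unify` and the sample data are hypothetical.

```python
import csv
import io
from typing import Iterable, List, Optional


def unify(parts: Iterable[str]) -> str:
    """Concatenate CSV texts that share a header, keeping the header once."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    header: Optional[List[str]] = None
    for text in parts:
        rows = list(csv.reader(io.StringIO(text)))
        if not rows:
            continue  # skip empty parts
        if header is None:
            header = rows[0]
            writer.writerow(header)  # emit the shared header a single time
        elif rows[0] != header:
            raise ValueError("part does not start with the common header")
        writer.writerows(rows[1:])  # data rows only
    return out.getvalue()


# Hypothetical sample parts, each repeating the same header line.
part_a = "id,name\n1,ada\n2,bob\n"
part_b = "id,name\n3,cy\n"
merged = unify([part_a, part_b])
```

Rejecting a part whose first row differs from the header guards against silently mixing files with different layouts.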

https://github.com/anti-duhring/nfl-qb-stats

Data on all NFL QB starters until 2021.

data json nfl qb stats

Last synced: 05 Apr 2025

https://github.com/gsmithun4/expressjs-field-validator

Plugin for validating JSON requests; middleware for Express.js.

data express-js expressjs json-request middleware nodejs request rest-api validation

Last synced: 06 Mar 2026

https://github.com/nika2811/new-york-city-taxi-fare-prediction

In this project, we use a New York City taxi dataset to predict the fare of the next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips. It contains 8 features along with GPS coordinates of pickup and dropoff.

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/hidayathamir/get-telegram-group-data

With this project you can export data from your Telegram group to a CSV file.

bahasa-indonesia data python3 scrape telegram telethon

Last synced: 13 Sep 2025

https://github.com/mvuorre/osfdatasette

Harvest, wrangle, and serve preprint data from OSF API with Datasette

data datasette open-science preprints

Last synced: 11 Apr 2025

https://github.com/sap-samples/sap-bdc-explore-hyperscaler-data

The repository contains detailed steps to integrate external hyperscaler data sources into SAP Datasphere in the SAP Business Data Cloud, per the open data ecosystem integration principles.

aws azure business cloud data databricks datasphere gcp hyperscalers sap

Last synced: 16 May 2026

https://github.com/dimaa1608/azurecontent

AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.

azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization

Last synced: 10 Apr 2025

https://github.com/opengeoshub/vdownload

A Powerful Geospatial Data Downloader

data geospatial opendata

Last synced: 19 May 2026

https://github.com/ournet/news-data

Ournet news data package

data news news-data news-storage ournet storage

Last synced: 04 Apr 2025

https://github.com/prasad-chavan1/bank_data_analysis_r

Bank data analysis in R language

data data-analysis data-science r

Last synced: 24 Feb 2025

https://github.com/furkantosun1607/cse201-data-structure

This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.

array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure

Last synced: 26 Jun 2025

https://github.com/ournet/quotes-data

Ournet quotes data package

data ournet ournet-quotes quotes

Last synced: 04 Apr 2025

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/istinnew/cook-me-up

[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.

bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning

Last synced: 16 May 2026

https://github.com/sweta-kaundilya/911-calls-capstone-project

For this capstone project we will be analyzing some 911 call data from Kaggle.

data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/madhuresh2011/kulturehire-internship

☺️ Hi folks, during my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql

Last synced: 17 Feb 2026

https://github.com/jormaechea/aws-firehose-producer

Easily produce data for your AWS Firehose Data Stream

aws data firehose producer stream

Last synced: 19 May 2026

https://github.com/haykam821/circle-tracking

A tool for generating Markdown tracking of the Circle of Trust experiment.

circle data markdown reddit subreddit tracker trust

Last synced: 19 May 2026

https://github.com/nel-zi/zipco_foods

Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.

airflow apache-spark data dataengineering etl pyspark wsl

Last synced: 03 May 2026

https://github.com/ashishsingh789/data_visualization

Data visualization project using Python to analyze categorical and continuous variables. Includes bar charts, histograms, and scatter plots. Libraries used: pandas, matplotlib, and seaborn.

analysis barchart data data-science data-visualization histogram matplotlib pandas-dataframe scatter-plot seaborn

Last synced: 07 Sep 2025

https://github.com/germanpaul12/automating-hacker-news-and-weather-mails

Project for my Raspberry Pi that emails me when it rains and keeps me informed of hot tech news.

beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests

Last synced: 05 May 2026

https://github.com/denisecase/cintel-03-data

Getting started with interactive data analytics in Python

analytics data interactive python shiny

Last synced: 11 Apr 2025

https://github.com/denisecase/buzzline-04-case

Adding live visualizations to streaming data applications

animation data kafka matplotlib python streaming

Last synced: 11 Apr 2025

https://github.com/praveendecode/data-analysis

Implemented data analysis projects with interactive Streamlit UI for user-friendly data exploration and insights presentation

data data-science dataanalysis exploratory-data-analysis insights python streamlit-dashboard tableau tableau-public

Last synced: 04 Apr 2025

https://github.com/stkisengese/numpy-data-fundamentals

A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.

data data-analysis numpy pre-processing

Last synced: 16 May 2026

https://github.com/rd-uk/rduk-data-sqlite

SQLite Data Provider implementation for rduk-data

data rduk sqlite

Last synced: 16 May 2026

https://github.com/naufalbasara/superstores-pipeline

Data Pipeline on Dummy E-commerce with Apache Airflow

airflow data data-engineering data-pipeline data-warehouse postgresql

Last synced: 16 May 2026

https://github.com/yourdataarchitect/french-realestate-data-pipeline

This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.

airlfow data datanalysis datapipeline market-intelligence real-estate

Last synced: 31 Dec 2025

https://github.com/coderooz/hr-dashboard

The goal of this project is to create a Power BI dashboard to showcase the attrition data within the company.

data data-analytics power-bi

Last synced: 07 Jan 2026

https://github.com/paulveillard/cybersecurity-analytics

An ongoing collection of awesome software, libraries, learning tutorials, documents and books, technical resources and cool stuff about Analytics Engineering in Cybersecurity.

analytics bigdata bigquery cybernetics cybersecurity data data-engineering data-science encryption encryption-decryption seo seo-friendly seo-optimization

Last synced: 28 Mar 2025

https://github.com/hyfi06/unam-careers

A utility package for retrieving career information from UNAM.

career data npm-package unam

Last synced: 16 May 2026

https://github.com/pyrustic/jayson

Intuitive interaction with JSON files [DEPRECATED, check the project Shared]

data json pyrustic python

Last synced: 17 May 2026

https://github.com/fliplet/fliplet-widget-data-source-query

Data Source Query Provider

data provider widget

Last synced: 11 Apr 2025

https://github.com/boettiger-lab/taxadb-cache

Cache for taxadb files

data

Last synced: 19 May 2026

https://github.com/erictleung/2018-new-coder-survey

🔰 Code to wrangle data from the 2018 New Coder Survey by freeCodeCamp

data data-cleaning dataset freecodecamp new-coders-survey programmers

Last synced: 03 Apr 2025

https://github.com/encoreshao/data-science

Data analysis examples using Jupyter Notebook and Python.

data dataanalysis encore jupyter-notebook

Last synced: 29 Mar 2025

https://github.com/chrisrobertsjr/chrisrobertsjr

Welcome to my Github Profile!

data data-analysis java r sql statistics

Last synced: 03 May 2026

https://github.com/pulgamecanica/d3examples

https://www.oreilly.com/library/view/d3-for-the/9781492046783/

d3 d3-visualization d3js d3v4 data javascript

Last synced: 19 May 2026

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/danicaalana/breast-cancer-random-forest

This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. It uses the Wisconsin Breast Cancer Diagnostic dataset from scikit-learn, a classic and easy binary classification dataset.

breast-cancer-classification breast-cancer-wisconsin data eda machine-learning-algorithms python random-forest-classifier

Last synced: 16 May 2026

https://github.com/debruine/faux.jl

Julia version of faux for data simulation

data julia simulation

Last synced: 28 Mar 2025

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/shahules786/titanic-analysis

Various analyses of the Titanic disaster (data from Kaggle).

analyze data titanic-kaggle

Last synced: 26 Jun 2025

https://github.com/jigyasag18/financial-risk-analysis-project

The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Using data from a SQL database, the dashboard tracks key metrics

data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis

Last synced: 06 Mar 2026

https://github.com/henryssondaniel/teacup-java-report-file

Report Teacup data to a file

data file logs reports teacup

Last synced: 22 Jul 2025

https://github.com/gustavonav/youtubeextractorflask

Application for extracting and processing YouTube data.

data full-stack mysql pipelines python web

Last synced: 14 Jun 2025

https://github.com/muneeb1030/webscrapper_politifact

This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.

data data-collection dataanalysis python3 scrapy scrapy-spider webscraping

Last synced: 09 Sep 2025

https://github.com/amarlearning/exploring-the-evolution-of-linux

Data analysis of the development of the Linux operating system, based on exploring its Git repository history.

cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux

Last synced: 12 May 2026

https://github.com/tadiusfrank2001/data_mining_projects_labs_cs145

A collection of data mining course assignments to implement advanced predictive statistical analysis models

algorithms data data-mining data-science deep-learning predictive-modeling python3 wide-learning

Last synced: 16 May 2026

https://github.com/eyluldursun/data-science-project

This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.

data data-science obesity r rmarkdown

Last synced: 26 Jun 2025

https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights

The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.

chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization

Last synced: 17 Feb 2026

https://github.com/umstek/sampler

Generate elaborate random data instantly.

data faker javascript json sample

Last synced: 20 Jul 2025

https://github.com/mx51/data-dictionary-action

GitHub Action for generating and checking freshness of data dictionaries

action analytics data

Last synced: 17 Jan 2026

https://github.com/tuscanicz/doctrine-data-applier

Symfony bundle for Doctrine Migrations of data using doctrine entities

data database doctrine entity migrations symfony symfony-bundle

Last synced: 02 Feb 2026

https://github.com/akashlogics/street-data-tracking

Detect, track, and count the number of people walking across one or more paths using YOLO. This Python project tracks people moving across predefined street zones.

analysis data excel newdataset object-detection opencv python python3 yolo

Last synced: 19 May 2026