An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/agungbudiwirawan/socioeconomic_analysis

The objective of this project is to analyze the socio-economic in Chicago.

chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server

Last synced: 04 Jul 2025

https://github.com/thecoderpinar/earthquake-explorer

🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.

data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data

Last synced: 23 Aug 2025

https://github.com/tsffarias/ibge-nomes-brasil

Análise de dados dos nomes no Brasil utilizando a base de dados disponibilizada pelo IBGE (2010)

data-analysis ibge python

Last synced: 05 Jul 2025

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 26 Apr 2025

https://github.com/0mppula/element-compare

A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.

compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand

Last synced: 19 Jan 2026

https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python

Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.

data-analysis data-analysis-python data-analyst data-science excel python spreadsheet

Last synced: 27 Apr 2025

https://github.com/3bdalrhmans3d/dataqualityproject

An interactive web application for data quality analysis, machine learning, and conversational AI, built with Streamlit.

data-analysis data-visualization ml numpy ollama pandas python seaborn streamlit

Last synced: 23 Feb 2026

https://github.com/tirendazacademy/data-sets

Data sets for Tirendaz Akademi Youtube

data-analysis dataset

Last synced: 12 Sep 2025

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 17 Jan 2026

https://github.com/datalayer/desktop

Ξ 🖥️ Datalayer Destkop.

ai data data-analysis data-science datalayer desktop electron

Last synced: 25 Oct 2025

https://github.com/garrettj403/albertaenergysources

Get grid data from Alberta Electric System Operator (AESO)

alberta canada data-analysis energy-data

Last synced: 10 Oct 2025

https://github.com/mykhode/python-sic-mini-project

SAMSUNG SIC Finish Project Course - Python

data-analysis python-analysis samsung-sic

Last synced: 10 Oct 2025

https://github.com/narius2030/sakila-datawarehouse-ssis

Implement a simple data warehouse to store Saklia data - Create data pipelines for extract, transform and load data from source to warehouse - Retrieve data in warehouse to explore and do several analysis

data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis

Last synced: 10 Oct 2025

https://github.com/roaldarbol/anibehavr

🪲 An R package for Analysis of Animal Behaviour

animal-behavior behavioural-states data-analysis r

Last synced: 11 Oct 2025

https://github.com/enzon3/tws-dataset_gen

Program that slowly generates a dataset of news titles and the sentiment of the titles.

data-analysis data-science dataset

Last synced: 14 Apr 2026

https://github.com/sebastian-gregoricchio/snakeatac

Snakemake pipeline for analysis and normalization of ATAC-seq data starting from fastq.gz files.

atac-seq atac-seq-pipeline chromatin-accessibiity data-analysis epigenomics pipeline snakemake

Last synced: 31 May 2026

https://github.com/wtbates99/stock-indicators

A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.

backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series

Last synced: 15 Oct 2025

https://github.com/nasdin/drone-strike-visualization

Connecting to Drone Strike API and performing an analysis on frequency of drone strikes, trend and patterns

analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization

Last synced: 16 Oct 2025

https://github.com/ihabbendidi/diamond-analysis

Exploratory statistical analysis of a Diamond dataset

data-analysis data-visualization exploratory-data-analysis machine-learning r

Last synced: 17 Oct 2025

https://github.com/linuxto5re/gateiodatafilteration

Gate.io API analysis: spot, margin, futures; filters high volume, identifies common cryptos.

arbitrage-bot centralized-exchanges cryptocurrency data-analysis gate gateio-api python rest trading-strategies

Last synced: 23 Oct 2025

https://github.com/tushar2704/instagram-user-analytics

This project revolves around the exploration and analysis of user engagement patterns on the popular social media platform, Instagram. By delving into user data and interaction metrics, this project aims to provide valuable insights into user behavior, content performance, and trends.

artificial-intelligence data-analysis data-science instagram project tushar2704

Last synced: 23 Jan 2026

https://github.com/nhsdigital/sde_example_analysis

Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.

data-analysis data-science databricks-notebooks machine-learning mlflow

Last synced: 25 Oct 2025

https://github.com/silveirinhajuan/rotinapy

RotinaPy: Simplify your daily life and maximize productivity with an integrated app for task management, study tracking, flashcards, and more. Built with Streamlit and Python.

data-analysis flashcards llm-integration llm-ui machine-learning ollama productivity python streamlit study study-project study-tracker task-management task-manager

Last synced: 13 Feb 2026

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 30 Jul 2025

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning oneness-consciousness predictive-modeling python3 scikit-learn

Last synced: 12 Apr 2025

https://github.com/gustavohnsv/teamwork_mqa

Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.

data-analysis group-project r team-repo

Last synced: 15 Aug 2025

https://github.com/draym/covid19tracker

Coronavirus COVID-19 dashboard to track global cases

covid-19 covid19-tracker dashboard data-analysis

Last synced: 07 Jan 2026

https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate

The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques

big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql

Last synced: 27 Jan 2026

https://github.com/akshat0427/spotify_history

code to find out some insights in spotify streaming data (work in progress)

data-analysis data-visualization

Last synced: 04 Feb 2026

https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql

This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.

data-analysis python retail sql sql-server sqlalchemy

Last synced: 07 Feb 2026

https://github.com/narius2030/hive-datawarehouse-analysis

Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems

apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics

Last synced: 01 Apr 2025

https://github.com/jimut123/ultimate_date_finder

To find the best place for dating in your country

data-analysis data-science date-cluster dating geo location maps software

Last synced: 16 Jan 2026

https://github.com/iamfoysal/data-analysis

This repository contains various examples and exercises to help learn data science using Python.

data-analysis data-science database jupyter-notebook python3

Last synced: 10 Feb 2026

https://github.com/mljar/enrichment

Data enrichment with AI for pandas DataFrame

data-analysis data-enrichment data-science openai pandas

Last synced: 01 Jul 2025

https://github.com/maciekmalachowski/crypto-charts-site

📊Application that returns financial data for selected cryptocurrency.

binance-api data-analysis jupyter-notebook matplotlib mplfinance numpy pandas python python-binance

Last synced: 12 Apr 2026

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 13 Jul 2025

https://github.com/martincastroalvarez/django-data-analytics

Data Analytics, PnL, LTV & retention analysis with Django

analytics beautifulsoup4 d3 d3js data-analysis django ltv rest-api visualization

Last synced: 06 May 2026

https://github.com/alieymsxxn/sql_project_data_job_analysis

This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.

data-analysis postgresql sql sqlite

Last synced: 16 Apr 2025

https://github.com/haloapping/malas-ngetik-clf

Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.

data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/aditiiprasad/whatsstat

A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.

chat-analyzer data-analysis data-visualization nlp streamlit text-processing whatsapp

Last synced: 02 Sep 2025

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 10 May 2026

https://github.com/walidalsafadi/titanic-disaster

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction

Last synced: 16 Mar 2025

https://github.com/kenvilar/data-analysis-using-python

Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 04 Oct 2025

https://github.com/elhaban3ro/thewildtool

TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖

ai audio audio-processing data-analysis data-science dataset deeplearning python

Last synced: 03 Sep 2025

https://github.com/thennen/py-ivtools

A package for reproducible measurement and analysis of current-voltage characteristics of electronic devices.

current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements

Last synced: 23 Jan 2026

https://github.com/yusufcinarci/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19-data-visualization data-analysis data-science data-visualization

Last synced: 22 Jul 2025

https://github.com/aad99bxp/whatsapp-chat-analyzer

A project intended for Business Owners / Managers to analyze Whatsapp chats between their customer care executives and their customers.

data-analysis heroku-deployment python3

Last synced: 15 Mar 2025

https://github.com/shlokashah/student-depression-and-suicide-rate-prediction

https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/

data-analysis data-visualization machine-learning student suicide-rate-prediction

Last synced: 19 Nov 2025

https://github.com/cbg-ethz/scdna-pipe

Python data analysis pipeline for single cell copy number event history reconstruction

bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow

Last synced: 05 Jan 2026

https://github.com/mathieu2301/pbsc-tracker

Expérience de tracking des vélos en libre service fonctionnants avec PBSC

ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker

Last synced: 23 Jun 2026

https://github.com/quantumudit/alteryx-weekly-challenges

This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community

alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl

Last synced: 27 Jan 2026

https://github.com/pizofreude/data-career-navigator

An interactive dashboard providing deep insights into career opportunities for data-related roles, utilizing a comprehensive dataset sourced from LinkedIn. Features include analysis of experience levels, salaries, key skills, job locations, and industry trends, aiding job seekers and professionals in exploring and identifying optimal career paths.

codeinplace data-analysis data-visualization standford-university

Last synced: 13 Mar 2026

https://github.com/viper373/baidutieba

爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)

baidutieba-crawler bert data-analysis deep-learning python spider

Last synced: 30 Mar 2025

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 16 May 2025

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 06 Feb 2026

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 17 May 2026

https://github.com/rvalla/chessevolution

Some code to analyze my chess games using the Lichess API.

chess data-analysis lichess lichess-api python

Last synced: 23 Oct 2025

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/kevinschoon/qviz

QViz Interactive Plotting

data-analysis data-visualization go gonum qframe yaegi

Last synced: 01 Jun 2026

https://github.com/codebypinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 12 Oct 2025

https://github.com/cosmoduende/r-arduino

Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two

arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport

Last synced: 24 Apr 2026

https://github.com/briatte/asr

Applied Stats with R and RStudio (first-year social-science tutorials)

course data-analysis data-science data-visualization r statistics

Last synced: 14 Apr 2026

https://github.com/timzatko/fifa-19-dataset-machine-learning

Player's value prediction and game position classification on FIFA 19 dataset.

data-analysis fifa19 machine-learning scikit-learn

Last synced: 04 May 2026

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 11 Oct 2025

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 29 Jun 2025

https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data

Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material

data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder

Last synced: 28 Jan 2026

https://github.com/fenghaojiang/ethereum-etl

ETL(Extract, Transform, Load) data from Ethereum like EVM Block chain

data-analysis ethereum etl

Last synced: 14 Jan 2026

https://github.com/markmelnic/carsen-desktop

A python dashboard app for scraping and tracking cars for sale on websites such as mobile.de

automation dashboard data-analysis interface scraping scraping-websites tkinter-python

Last synced: 08 Oct 2025

https://github.com/markmelnic/scalg

List scoring algorithm. Analyse data using a range based procentual proximity algorithm.

algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm

Last synced: 08 Oct 2025

https://github.com/emso-c/stream-analyser

A tool that analyses YouTube live streams.

cli data-analysis guessing highlights python youtube-video

Last synced: 18 Jan 2026

https://github.com/jpquast/icp-ms-data-explorer

A shiny app for the exploration of ICP-MS data.

data-analysis icp-ms r shiny shiny-apps

Last synced: 17 Jan 2026

https://github.com/johnsesana/eda-liquor-sales

Exploratory Data Analysis on Public Datasets

data-analysis data-visualization sql tableau-dashboards

Last synced: 09 Mar 2026