An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/andreip/twitter-authorities

Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

data-analysis mongodb python twitter-topics

Last synced: 19 Apr 2026

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 26 Apr 2025

https://github.com/implicitlayer/alphaanalysis

Full library for analyzing financial data using ML and classical approaches

data-analysis data-science deep-learning finance machine-learning risk-analysis time-series-analysis trading transformers

Last synced: 12 Mar 2026

https://github.com/aadya940/numpyai

A Natural Language Interface to the Numpy Library using LLMs.

ai data-analysis data-science library llm machine-learning numpy python

Last synced: 12 Apr 2025

https://github.com/petulla/readroper

Read single and multi-card ASCII polling datasets in R

ascii data-analysis polling-data r

Last synced: 31 Mar 2025

https://github.com/carlosvinimsouza/dataanalysiswithpython

Learn Data Analysis using Libs Python (Numpy, Pandas, Matplotlib and Seaborn)

data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn

Last synced: 11 Apr 2026

https://github.com/silveirinhajuan/rotinapy

RotinaPy: Simplify your daily life and maximize productivity with an integrated app for task management, study tracking, flashcards, and more. Built with Streamlit and Python.

data-analysis flashcards llm-integration llm-ui machine-learning ollama productivity python streamlit study study-project study-tracker task-management task-manager

Last synced: 13 Feb 2026

https://github.com/0mppula/element-compare

A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.

compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand

Last synced: 19 Jan 2026

https://github.com/roaldarbol/anibehavr

🪲 An R package for Analysis of Animal Behaviour

animal-behavior behavioural-states data-analysis r

Last synced: 11 Oct 2025

https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python

Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.

data-analysis data-analysis-python data-analyst data-science excel python spreadsheet

Last synced: 27 Apr 2025

https://github.com/datalayer/desktop

Ξ 🖥️ Datalayer Destkop.

ai data data-analysis data-science datalayer desktop electron

Last synced: 25 Oct 2025

https://github.com/arnavk-09/world-population

🌏 Taipy demo to explore world population data...

data-analysis python taipy world-population

Last synced: 31 Mar 2025

https://github.com/narius2030/sakila-datawarehouse-ssis

Implement a simple data warehouse to store Saklia data - Create data pipelines for extract, transform and load data from source to warehouse - Retrieve data in warehouse to explore and do several analysis

data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis

Last synced: 10 Oct 2025

https://github.com/mykhode/python-sic-mini-project

SAMSUNG SIC Finish Project Course - Python

data-analysis python-analysis samsung-sic

Last synced: 10 Oct 2025

https://github.com/3bdalrhmans3d/dataqualityproject

An interactive web application for data quality analysis, machine learning, and conversational AI, built with Streamlit.

data-analysis data-visualization ml numpy ollama pandas python seaborn streamlit

Last synced: 23 Feb 2026

https://github.com/liweitianux/chandra-acis-analysis

Chandra ACIS analysis tools and documents

acis chandra data-analysis

Last synced: 19 Apr 2025

https://github.com/prathamesh693/04_sales-and-demand-forecasting-for-retail-chains

This project aims to forecast weekly sales for retail stores using historical sales and economic data. By applying advanced time series forecasting models, we enable better inventory management, demand planning, and revenue optimization for retail chains. The project includes both traditional statistical models and deep learning techniques.

arima-forecasting data-analysis data-visualization exploratory-data-analysis jupyter-notebook prophet-facebook python spyder-python-ide streamlit-webapp

Last synced: 01 Jul 2025

https://github.com/garrettj403/albertaenergysources

Get grid data from Alberta Electric System Operator (AESO)

alberta canada data-analysis energy-data

Last synced: 10 Oct 2025

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 17 Jan 2026

https://github.com/alyssonmach/9-data-science-apps-with-python

[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.

data-analysis data-science data-visualization machine-learning streamlit-webapp

Last synced: 12 Feb 2026

https://github.com/ktmud/github-life

A data explorer for GitHub projects' life cycles

data-analysis github scraper time-series

Last synced: 16 May 2026

https://github.com/splch/qbs

An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.

classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling

Last synced: 01 Apr 2025

https://github.com/erictleung/microbiome-analysis-resources

:microscope: Resources and notes on studying the human microbiome

bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources

Last synced: 18 Jan 2026

https://github.com/autuanliu/rlang

科学统计实验与研究(数据方面)

data-analysis r

Last synced: 15 Mar 2025

https://github.com/singhkunall/-india-census-2011-population-demographics-dashboard

Interactive Excel dashboard visualizing India's 2011 Census Population Demographics using charts, pivot tables, and slicers.

data-analysis data-visualization excel-dashboard india-census population-data

Last synced: 04 Feb 2026

https://github.com/cbg-ethz/scdna-pipe

Python data analysis pipeline for single cell copy number event history reconstruction

bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow

Last synced: 05 Jan 2026

https://github.com/luizassimoes/fitness-report

Create a personalized Fitness Wrapped report using your Apple Health data with this Streamlit application. Generate comprehensive, detailed summaries of your annual fitness activities, providing valuable insights into your year-long progress and achievements.

data-analysis data-visualization python streamlit

Last synced: 12 Feb 2026

https://github.com/johnsesana/eda-liquor-sales

Exploratory Data Analysis on Public Datasets

data-analysis data-visualization sql tableau-dashboards

Last synced: 09 Mar 2026

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 03 Jan 2026

https://github.com/martial2023/bank-performance-analysis

Analyse de données bancaires du Berka Dataset (1993-1998) pour calculer et visualiser des KPI clés

dashboard data-analysis data-visualization nextjs pandas plotly-express pymongo python recharts-js sqlalchemy

Last synced: 26 Aug 2025

https://github.com/ziaeemehr/itng_nest

Nest Simulator quick guides and examples, adding new model using NESTML

computational-neuroscience data-analysis nest-simulator neuroscience

Last synced: 30 Aug 2025

https://github.com/ronylpatil/whatsapplib

WhatsApp Group Chat Analysis Python Package.

data-analysis open-source pypi-package python-library python-package

Last synced: 02 Jan 2026

https://github.com/visionkernel/centerspoke

Centerspoke is a data management and analysis tool that allows easy access to cloud databases. Say goodbye to using excel for data management. This open-source CLI tool allows for the rapid processing and analysis of all your data, and makes it easy to upload your excel files into your cloud databases.

cli cloud-database data-aggregation data-analysis data-analysis-python data-management data-science python python3

Last synced: 24 May 2026

https://github.com/quantumudit/analyzing-goodreads-famous-quotes

This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 20 May 2026

https://github.com/viper373/baidutieba

爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)

baidutieba-crawler bert data-analysis deep-learning python spider

Last synced: 30 Mar 2025

https://github.com/cosmoduende/r-arduino

Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two

arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport

Last synced: 24 Apr 2026

https://github.com/timzatko/fifa-19-dataset-machine-learning

Player's value prediction and game position classification on FIFA 19 dataset.

data-analysis fifa19 machine-learning scikit-learn

Last synced: 04 May 2026

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 29 Jun 2025

https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data

Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material

data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder

Last synced: 28 Jan 2026

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/fernandezfran/exma

A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data

computational-physics data-analysis molecular-dynamics oop python science

Last synced: 16 Jan 2026

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 30 Mar 2025

https://github.com/walidbosso/r_data_mining

Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees

association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio

Last synced: 23 Mar 2025

https://github.com/gxelab/tutorials

Tutorials of frequently used software packages and libraries in the lab

bioinformatics data-analysis evolution genetics genomics julia python3 r-language statistics visualization

Last synced: 18 Jan 2026

https://github.com/jshinm/web-scrapper

Web Scrapper used to extract NeuroData github repo stats

data-analysis web-scraping

Last synced: 04 Apr 2025

https://github.com/a-r-j/npview

CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files

cli csv data-analysis inspector npy numpy python tsv

Last synced: 18 Jan 2026

https://github.com/kylekirkby/cardatasnatch

CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.

beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering

Last synced: 15 Apr 2025

https://github.com/rmnldwg/liver-smart

Data and analysis pipeline for a study on the potential advantages of daily adaptive liver SBRT performed at the University Hospital Zurich.

data-analysis fractionation jupyter-notebook liver-cancer metastasis radiation-oncology stereotactic

Last synced: 27 May 2026

https://github.com/jimut123/ultimate_date_finder

To find the best place for dating in your country

data-analysis data-science date-cluster dating geo location maps software

Last synced: 16 Jan 2026

https://github.com/alieymsxxn/sql_project_data_job_analysis

This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.

data-analysis postgresql sql sqlite

Last synced: 16 Apr 2025

https://github.com/aditiiprasad/whatsstat

A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.

chat-analyzer data-analysis data-visualization nlp streamlit text-processing whatsapp

Last synced: 02 Sep 2025

https://github.com/yusufcinarci/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19-data-visualization data-analysis data-science data-visualization

Last synced: 22 Jul 2025

https://github.com/aad99bxp/whatsapp-chat-analyzer

A project intended for Business Owners / Managers to analyze Whatsapp chats between their customer care executives and their customers.

data-analysis heroku-deployment python3

Last synced: 15 Mar 2025

https://github.com/shlokashah/student-depression-and-suicide-rate-prediction

https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/

data-analysis data-visualization machine-learning student suicide-rate-prediction

Last synced: 19 Nov 2025

https://github.com/virajbhutada/music-recommendation-system

This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.

data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience

Last synced: 24 Jan 2026

https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql

This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in competitive landscape.

campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics

Last synced: 27 Feb 2025

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 16 May 2025

https://github.com/c0deta1ker/MatBaseX

MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.

cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps

Last synced: 23 Jul 2025

https://github.com/mljar/enrichment

Data enrichment with AI for pandas DataFrame

data-analysis data-enrichment data-science openai pandas

Last synced: 01 Jul 2025

https://github.com/olow304/goboard

Python Data Analysis Dashboard using Public Dataset, Django

dashboard dashboard-templates data-analysis data-science django jupyter-notebook machine-learning python sklearn

Last synced: 11 Apr 2026

https://github.com/mathieu2301/pbsc-tracker

Expérience de tracking des vélos en libre service fonctionnants avec PBSC

ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker

Last synced: 23 Jun 2026

https://github.com/mainakverse/ml-algorithms-starter

List of machine learning algorithms that are needed to start with ML projects and lay a foundation into data science

data-analysis data-science jupyter-notebooks machine-learning-algorithms practice

Last synced: 19 Apr 2025

https://github.com/fabienarcellier/qjoin

qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality

composable data-analysis developer-tools functools python

Last synced: 01 Mar 2026

https://github.com/yahia3200/become-an-independent-data-scientist

My final project for the Applied Plotting, Charting & Data Representation in Python Course

data-analysis data-science data-visualization matplotlib

Last synced: 16 Mar 2025

https://github.com/elhaban3ro/thewildtool

TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖

ai audio audio-processing data-analysis data-science dataset deeplearning python

Last synced: 03 Sep 2025

https://github.com/kenvilar/data-analysis-using-python

Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 04 Oct 2025

https://github.com/walidalsafadi/titanic-disaster

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction

Last synced: 16 Mar 2025

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 10 May 2026

https://github.com/haloapping/malas-ngetik-clf

Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.

data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/martincastroalvarez/django-data-analytics

Data Analytics, PnL, LTV & retention analysis with Django

analytics beautifulsoup4 d3 d3js data-analysis django ltv rest-api visualization

Last synced: 06 May 2026

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 13 Jul 2025

https://github.com/maciekmalachowski/crypto-charts-site

📊Application that returns financial data for selected cryptocurrency.

binance-api data-analysis jupyter-notebook matplotlib mplfinance numpy pandas python python-binance

Last synced: 12 Apr 2026

https://github.com/iamfoysal/data-analysis

This repository contains various examples and exercises to help learn data science using Python.

data-analysis data-science database jupyter-notebook python3

Last synced: 10 Feb 2026

https://github.com/narius2030/hive-datawarehouse-analysis

Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems

apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics

Last synced: 01 Apr 2025

https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql

This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.

data-analysis python retail sql sql-server sqlalchemy

Last synced: 07 Feb 2026

https://github.com/akshat0427/spotify_history

code to find out some insights in spotify streaming data (work in progress)

data-analysis data-visualization

Last synced: 04 Feb 2026

https://github.com/draym/covid19tracker

Coronavirus COVID-19 dashboard to track global cases

covid-19 covid19-tracker dashboard data-analysis

Last synced: 07 Jan 2026

https://github.com/gustavohnsv/teamwork_mqa

Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.

data-analysis group-project r team-repo

Last synced: 15 Aug 2025