An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/syamkakarla98/datascience_head_start

This repository focuses on building a path into data science.

data-analysis data-science data-visualization machine-learning machinelearning-python python3

Last synced: 03 May 2025

https://github.com/akbaritabar/dask-duckdb-dbeaver

Parallelised, out-of-memory data analysis using Dask in Python, and DuckDB and DBeaver in SQL, with publicly accessible ORCID 2019 XML files as the example.

data-analysis data-science pandas parallel-computing python

Last synced: 08 Aug 2025

https://github.com/rasrea/dataviz

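A visualization project based on Java and Spring Boot, intended to help (novice) developers master the development model in which front end and back end are separated. In the project, users can upload data files, preprocess the data, and display it through the front end's chart module. The project covers Spring Boot API design, data exchange between front end and back end, and data processing and visualization, and it uses ECharts charts to display the data.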

data-analysis data-visualization spring-boot

Last synced: 05 Aug 2025

https://github.com/thecoderpinar/samsung_stock_analysis_forecasting_and_volatility_analysis

A comprehensive analysis and forecasting project for Samsung stock data, utilizing historical data to build predictive models and analyze volatility.

data-analysis deep-learning financial-analysis forecasting machine-learning python stock-analysis volatility-forecasting

Last synced: 31 Jul 2025

https://github.com/fluhus/biostuff

Computational biology packages for Go.

bioinformatics biology computational-biology data-analysis go golang

Last synced: 12 Jan 2026

https://github.com/thecoderpinar/big-tech-financial-insights

🚀 A comprehensive project analyzing Big Tech stock prices using time series analysis, volatility modeling, and macroeconomic indicators. Featuring interactive dashboards and automated reporting! 📈💼

data-analysis data-science finance machine-learning macroeconomics stock-analysis time-series-analysis volatility-modeling

Last synced: 03 Apr 2025

https://github.com/louis-heraut/makaho

🥤 MAKAHO (for MAnn-Kendall Analysis of Hydrological Observations) is an interactive cartographic visualization system that calculates trends in data from hydrometric stations whose flows are little influenced by human activity

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae leaflet r shiny shiny-apps stationarity-test statistics

Last synced: 08 Jul 2025

https://github.com/aditeyabaral/lok-sabha-election-twitter-analysis

Twitter feeds were analysed during the Lok Sabha Elections 2019 to gauge the overall popularity of each party and predict the winner based solely on the tweets made by the population. This was made as a part of our Data Science course (UE18CS203) at PES University.

data-analysis data-science data-visualization elections loksabha nlp prediction probabilistic-graphical-models probability python python3 sentiment-analysis sentiment-classification sentiment-polarity sentiment-scores social-media socialmediaanalytics statistical-analysis statistical-models twitter

Last synced: 16 Apr 2025

https://github.com/kaos599/vit-gpa-calculator

VIT-GPA-Calculator is a Python application that extracts course grade data from a PDF file to calculate your current CGPA and allows you to simulate grade improvements. It leverages Camelot for PDF table extraction and Pandas for data manipulation, making it a handy tool for students and educators alike.

calculator camelot cgpa cgpa-calculator cgpa-simulator data-analysis education pandas pdf python- vellore-institute-of-technology vit-bhopal vit-university

Last synced: 18 Nov 2025

https://github.com/psyteachr/quant-fun-v2

Fundamentals of Quantitative Analysis

data-analysis data-visualization datawrangling r statistics

Last synced: 11 Oct 2025

https://github.com/dr-montasir/mnjs

MATH NODE JS (MNJS): A tiny math library for node.js & JavaScript on browser

data-analysis data-science javascript js jsdelivr library math nextjs npm react svelte sveltekit ts typescript yarn

Last synced: 26 Apr 2025

https://github.com/adrienc21/vulpes

Vulpes: Test many classification and regression models and clustering algorithms to see which one is most suitable for your dataset

automl data-analysis data-science machine-learning models package python scikit-learn statistics

Last synced: 25 Oct 2025

https://github.com/ritvik19/vizard

Intuitive, Interactive, Easy and Quick Visualizations for Data Science Projects

data-analysis data-science data-visualization

Last synced: 10 Apr 2025

https://github.com/mattcieslak/meap

Software for cardiovascular data analysis and visualization

data-analysis electrophysiology python statistics

Last synced: 18 Sep 2025

https://github.com/vishnu-t-r/sql_solved_questions

In this repository you will find SQL queries at multiple difficulty levels that can be applied to large datasets for various business requirements.

business-solutions complex-sql data-analysis sql-data-analysis sql-queries

Last synced: 18 Nov 2025

https://github.com/antoineaugusti/google-search-gdpr

Transform your Google Search GDPR export into CSV

data-analysis gdpr google-search python self-data

Last synced: 28 Oct 2025

https://github.com/srohit0/ml-misc

Miscellaneous Machine Learning and Data Analysis Projects

colaboratory data-analysis data-science data-visualization google-colab machine-learning-algorithms

Last synced: 15 Apr 2025

https://github.com/graphcore-research/kg-topology-toolbox

A Python toolbox to compute topological metrics and statistics for Knowledge Graphs

data-analysis graph-topology knowledge-graph metrics-gathering

Last synced: 17 Jan 2026

https://github.com/jwngr/notreda.me

Historical Notre Dame Fighting Irish football schedules, stats, and analyses

college-football data-analysis data-visualization football notre-dame sports stats

Last synced: 09 Apr 2025

https://github.com/bluebrain/data-validation-framework

Simple framework to create data validation workflows.

data data-analysis python validation validation-tool

Last synced: 14 May 2025

https://github.com/shlokashah/ipl-data-analysis

Data Analysis and Visualizations done on IPL dataset

data-analysis data-visualization pandas powerbi

Last synced: 01 Sep 2025

https://github.com/qzcool/sac

Data Analysis and Visualization for Securities Association of China (SAC).

china data-analysis finance python

Last synced: 11 Jul 2025

https://github.com/moataz-elmesmary/data-analysis-using-excel-and-power-bi-sessions

Excel & Power BI Sessions from basics to advanced for Data Analysis

data-analysis dax excel powerbi powerpivot powerquery

Last synced: 02 Jan 2026

https://github.com/deshima-dev/decode

⚡ DESHIMA code for data analysis

astronomy data-analysis deshima python spectroscopy submillimeter

Last synced: 18 Jun 2025

https://github.com/jgolafshan/wallstreetsocial

Analyzes Reddit posts and comments, using an NLP model to recognize stock symbols and options positions

data-analysis flexible machine-learning mentioned-stocks named-entity-recognition natural-language-processing nlp reddit social-media-analysis sql wallstreetbets

Last synced: 22 Jul 2025

https://github.com/alexcj10/analyzing-amazon-sales-data

This repository is dedicated to analyzing Amazon sales data to identify trends and insights that can help improve sales strategies and performance.

amazon beautifulsoup data-analysis data-science data-visualization ecommerce machine-learning matlpotlib numpy pandas python sales skilearn

Last synced: 10 Jul 2025

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and data visualization of countries' GDP, life expectancy comparison across continents, GDP per capita relative growth, population relative growth comparison, etc., using Pandas and Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 12 Apr 2025

https://github.com/csfelix/datascience-exercises

🐍 Just some DataScience exercises, nothing more... 🐍 (🔑 KeyWords: python, data science, data analysis, pandas 🔑)

data-analysis data-science datascience pandas python python3

Last synced: 05 Jul 2025

https://github.com/misaghmomenib/social-engagement-analysis

In this repository, I made a general analysis of a CSV file of social network interactions, which you are free to use.

data-analysis data-visualization git open-source pandas-python python

Last synced: 21 Nov 2025

https://github.com/ayaanhossain/sharedb

An on-disk pythonic embedded key-value store for compressed data storage and distributed data analysis

data-analysis data-storage distributed embedded-database key-value lmb msgpack multiprocessing parallel python python-3 python-library python-script python2 python3 store

Last synced: 13 Apr 2025

https://github.com/thecoderpinar/hms-brainactivity-analysiss

Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.

critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing

Last synced: 30 Apr 2025

https://github.com/the-data-dilemma/parquettohuggingface

ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.

audio-dataset audio-processing automatic-speech-recognition data-analysis data-science dataset healthcare-application huggingface huggingface-datasets pandas parquet parquet-generator python3 speech-data speech-recognition speech-to-text speech-translation

Last synced: 21 Aug 2025

https://github.com/avannaldas/quickview

QuickView is a Python package for a quick glance at a dataset: just pass it a pandas DataFrame. It gives a useful summary and plots with just one line of code. Every summary it outputs is also available through member variables. It is built using matplotlib.

data-analysis data-visualization dataframe machine-learning matplotlib pandas pandas-dataframe plots python3

Last synced: 01 Jul 2025

https://github.com/data-engineering-community/data-engineering-meetup-in-a-box

A collection of guides, resources, and support for DE meetup organizers.

data data-analysis data-engineering data-mining data-structures database meetups

Last synced: 02 Aug 2025

https://github.com/madebyarthouse/momentum-coalition-compass

Application for analysing and visualizing data from a political questionnaire

austria data-analysis politics venn-diagram

Last synced: 27 Jul 2025

https://github.com/shreeparab1890/fifa-wc-2022-qatar-data-analysis-eda

This is a Jupyter Notebook (IPython Notebook) with exploratory data analysis (EDA) on FIFA WC Qatar 2022 match data.

data-analysis data-analysis-python data-science data-visualization eda fifa matplotlib-pyplot numpy pandas plotly-express python-3

Last synced: 06 Sep 2025

https://github.com/banitalebi/data-visualization-dashboard

The Data Visualization Dashboard serves as an excellent starting point for projects focused on data analysis and representation.

data-analysis data-visualization interactive-charts nextjs shadcn-ui starter talwindcss typescript

Last synced: 10 Apr 2025

https://github.com/dmytrovoytko/data-engineering-amazon-reviews

Data Engineering project for ZoomCamp`24: JSONL -> PostgreSQL/BigQuery + Metabase + Mage.AI

bash-script bigquery codespaces data-analysis data-visualization etl metabase pipeline python-script

Last synced: 31 Jul 2025

https://github.com/amirhosseinhonardoust/community-price-tracker

A community-driven app for tracking and comparing the prices of everyday goods across cities. Built with Python, SQLite, Pandas, and Streamlit, it lets users log, visualize, and analyze item price trends, inflation, and cost-of-living differences with clear charts and full local data privacy.

community-project cost-of-living dashboard data-analysis data-visualization economic-analysis inflation-tracking local-data open-data pandas price-tracker public-insight python sqlite streamlit

Last synced: 08 Nov 2025

https://github.com/kingabzpro/annual-recycled-energy-saved-in-singapore

Learn how much energy Singapore saves per year by recycling plastics, paper, glass, and ferrous and non-ferrous metals

cleaning-data data-analysis data-science deepnote energy environment

Last synced: 19 Jun 2025

https://github.com/abubakkar32/machine-learning-practice

This GitHub repository is a valuable resource for machine learning and Python enthusiasts. It includes a wide range of projects and tools, covering topics like Data Visualization, Data Analysis, ML, DL, Automation, NLP, Web Scraping, and more. Contributors are welcome to join and learn together in this supportive community. Happy coding!

automation big-data chatbot data-analysis data-visualization datastractures machine-learning mathplotlib nlp-machine-learning numpy pandas plotly pypdf2-lib pytest python3 seaborn selenium-webdriver skit-learn webscraping

Last synced: 23 Mar 2025

https://github.com/simeononsecurity/track-helium-mobile-wifi

A collection of scripts and tools that track the availability of Helium Mobile Wi-Fi networks in the wild using the Wigle dataset and the Helium API. Updates every 24 hours.

automation bigdata carrieroffload data data-analysis dataset dataset-generation helium openroaming passpoint python

Last synced: 23 Apr 2025

https://github.com/leonism/sample-superstore

A Python approach to analyzing the legendary Sample Superstore dataset with Pandas

data-analysis datamining datascience dataset eda jupyter-notebook machine-learning python

Last synced: 09 Oct 2025

https://github.com/sondosaabed/advanced-data-wrangling

An advanced course covering the three phases of data wrangling: gathering, assessing, and cleaning data.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 09 Apr 2025

https://github.com/mrgeislinger/udacitydand_proj_wrangleandanalyzedata

Wrangling and analyzing data project for Udacity's Data Analyst Nanodegree. Wrangles WeRateDogs™ (@dog_rates) Twitter data from local, online, and Twitter API sources.

data-analysis data-analyst data-science datascience jupyter-notebook python3 twitter udacity-data-analyst-nanodegree udacity-nanodegree

Last synced: 09 Oct 2025

https://github.com/Haritha1005/data_analyst_coursework_and_practice

This repository showcases my data analytics and data science skills through projects, fostering collaboration and community engagement

data-analysis data-visualization etl excel matplotlib numpy-library pandas powerbi-report python3 r scipy sql tableau

Last synced: 16 Oct 2025

https://github.com/edaaydinea/estimating-the-probability-of-confirmed-covid-19-cases-taking-into-the-intensive-care-unit-icu-

This repository includes the slides and code for Estimating the Probability of Confirmed COVID-19 Cases Taking into the Intensive Care Unit (ICU).

covid-19 data-analysis data-science data-visualization machine-learning

Last synced: 11 Apr 2025

https://github.com/kennethleungty/pymysql-demo

PyMySQL - Connecting Python and SQL for Data Science

data-analysis data-science mysql pandas python sql

Last synced: 12 Jul 2025

https://github.com/misaghmomenib/nba-games-analysis

A data analysis project that lets you monitor the number of games, wins, and losses of different NBA teams and run various analyses on them.

data-analysis data-analyst data-visualization git open-source python

Last synced: 04 Apr 2025

https://github.com/nafisalawalidris/bitcoin-price-analysis-api

Explore this comprehensive API for Bitcoin price analysis, featuring historical data and real-time updates from major exchanges. Built with FastAPI, Python, and PostgreSQL, this open-source project is optimised for efficiency and scalability, catering to developers, traders, and analysts. Contribute and collaborate to enhance functionality.

api-development bitcoin bitcoin-api cryptocurrency data-analysis fastapi financial-data machine-learning open-source postgresql python restful-api

Last synced: 08 Sep 2025

https://github.com/bobleesj/cif-bond-analyzer

An interactive Python script that computes the minimum atomic bonding distances from sites, generating histograms and pair counts.

crystal-structure data-analysis high-throughput materials-infomatics solid-state

Last synced: 16 Jan 2026

https://github.com/djoshea/trial-data

Interfaces and utilities for analysis of neurophysiology and behavioral data

data-analysis electrophysiology neuroscience

Last synced: 23 Jan 2026

https://github.com/iamantimpal/iamantimpal

👋 Hi, I'm Antim Pal, the Founder of Optimism Educator. An online platform dedicated to empowering students with skills in Computer Science, Web Design, Graphic

data-analysis data-science data-visualization database database-design database-management datascience graphical-user-interface graphics grapic-design reading-list readme readme-badges readme-generator readme-md readme-profile readme-stats readme-template

Last synced: 10 Apr 2025

https://github.com/tzerk/gammaspec

A collection of functions to analyse gamma spectra

data-analysis gamma-ray-spectrometry r

Last synced: 15 Apr 2025

https://github.com/dbeaudoinfortin/napsdataanalysis

Canadian National Air Pollution Surveillance Program (NAPS) data downloader, importer, extractor, analysis, and visualization toolbox.

air-pollution air-quality air-quality-data aqhi aqhi-canada canada cesi data-analysis eccc geoscience heatmap naps pollution

Last synced: 16 Mar 2025

https://github.com/rameerez/dashboards

🍱 Ruby gem to create customizable bento-style admin dashboards in your Rails app

admin admin-dashboard admin-panel bento business business-analytics business-intelligence dashboard data-analysis gem rails ruby

Last synced: 13 Apr 2025

https://github.com/misaghmomenib/data-analysis-projects

A Repository Featuring a Collection of Data Analysis Projects, Showcasing Various Techniques and Tools for Extracting Insights From Data. Explore, Learn, and Utilize These Projects to Enhance Your Data Analysis Skills and Workflows.

data-analysis data-analysis-python data-visualization jupyter-notebook open-source python

Last synced: 13 Apr 2025

https://github.com/iamyajat/whatsapp-chat-analyzer-api

An API to analyse WhatsApp chats and generate insights

data-analysis data-science fastapi python whatsapp

Last synced: 17 Oct 2025

https://github.com/pseudomanifold/enchiridion-tda

An enchiridion for instructing mortals in the hidden arts of topological data analysis

data-analysis graph-kernels persistent-homology topology

Last synced: 10 Apr 2025

https://github.com/neelshah18/scopus-analysis-for-indian-researcher

Contains a data analysis of Indian researchers in Scopus journals from 2000 to 2016. The dataset is also included.

artificial-intelligence data-analysis deep-learning india industry jupyter-notebook machine-learning scopus-analysis universities world

Last synced: 06 Jul 2025

https://github.com/anotherkamila/stalky

Self-hosted app to keep track of anything you want. Don't let others stalk you, stalk yourself!

csv data-analysis data-storage geeks privacy-by-design self-hosted timeseries tracking

Last synced: 12 Apr 2025

https://github.com/cusyio/datenanalyse-in-python

Course on automated preparation, summarization, and charting of tabular data with Python.

data-analysis pandas-python pandas-tutorial python

Last synced: 24 Apr 2025

https://github.com/stappit/blog

I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.

bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics

Last synced: 14 Mar 2025

https://github.com/bydevmar/master_masd_fpo

This GitHub repository gathers all the courses, practical work (TP), tutorials (TD), projects, and exercises from my master's program in applied mathematics for data science. Browse it for a complete view of my academic path, offering a detailed perspective on my learning in this field.

acp afc algebra big-data-analytics dashboards data-analysis datascience economics english graph-theory latex linear-algebra non-linear-algebra probability prog python scientific-research software-package statistics

Last synced: 05 May 2025

https://github.com/ankan24/machine-learning-data-analysis

This repository contains a collection of Jupyter Notebooks that demonstrate various machine learning and data analysis techniques. The project does not provide a detailed description or specific use cases, but the notebooks cover a range of topics related to machine learning and data analysis.

data-analysis jupiter-notebook machine-learning

Last synced: 19 Oct 2025

https://github.com/sanvishal/Exoplanet-Explore

An interactive data visualization of exoplanets

animation d3js data-analysis data-science exoplanet python space visualization

Last synced: 14 Apr 2025

https://github.com/bpkaur/word-frequency-in-moby-dick

Finds the most frequent words in the novel Moby Dick using Python.

beautifulsoup data-analysis data-science moby-dick nltk notebook-jupyter python3

Last synced: 09 Nov 2025

https://github.com/coalio/Assistant

A data science library providing flexible dataframes for Lua 5.1+

data-analysis data-science data-structures dataframe lua

Last synced: 11 Apr 2025

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 28 Apr 2025

https://github.com/flexmonster/pivot-jupyter-notebook

Jupyter Notebook pivot table example with Flexmonster

data-analysis data-science interactive jupyter-notebook pivot-tables python

Last synced: 16 Jun 2025

https://github.com/thecoderpinar/gen-expression

Gene expression analysis is a fundamental component of genomics research, providing valuable insights into how genes are regulated and their impact on various biological processes. This project delves into the realm of gene expression data, aiming to uncover hidden patterns and relationships within complex datasets. 🚀

bioinformatics biotechnology data-analysis data-science data-visualization genomics kaggle machine-learning pca python

Last synced: 30 Apr 2025

https://github.com/nicodupont/mooc

All my finished MOOCs, mainly on the subject of data science

data-analysis data-science data-visualization datacamp jupyter-notebook machine-learning mooc pandas python sas sql

Last synced: 28 Apr 2025