Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/valentinfrlch/deephealth

Visualize correlations between health factors.

data-analysis health machine-learning prediction predictive-analytics

Last synced: 18 Feb 2025

https://github.com/llnl/hdtopology

High-dimensional topological data analysis library for NDDAV

analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization

Last synced: 11 Nov 2024

https://github.com/emaasit/pydata-book

Learning data analysis with python

data-analysis jupyter pandas python

Last synced: 21 Nov 2024

https://github.com/anil951/early-detection-of-mental-health

This project develops a predictive model to identify early signs of mental health issues in adolescents using social media activity, school performance, health records, and an AI chatbot. It analyzes emotional tone, academic changes, and health data, offering personalized recommendations and resources for mental wellness.

data-analysis deep-learning early-detection lstm mental-health sentiment-analysis social-media

Last synced: 18 Feb 2025

https://github.com/simranjeet97/ipl-dataanalysis

Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.

artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python

Last synced: 14 Jan 2025

https://github.com/leandronasx/agro-data

Projeto final da formação de analista de dados e dashboard da SoulCode Academy.

bigquery data-analysis gcp looker pandas powerbi python

Last synced: 16 Feb 2025

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 18 Feb 2025

https://github.com/saadarazzaq/excel-merger

Merge multiple Excel and CSV files into a single dataset with the Excel Merger Streamlit app. 📊🔄🚀

data-analysis excel pandas python streamlit-webapp

Last synced: 23 Jan 2025

https://github.com/aravind-selvam/covid_dashboard

With Covid death and vaccine data. I have created a dashboard.

covid-19 data-analysis data-science data-visualization tableau tableau-public visualization

Last synced: 14 Jan 2025

https://github.com/cbg-ethz/scdna-pipe

Python data analysis pipeline for single cell copy number event history reconstruction

bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow

Last synced: 28 Jan 2025

https://github.com/pitmonticone/covid-italy

References for COVID-19 situation in Italy.

coronavirus covid-19 covid-19-italy data data-analysis documentation testing

Last synced: 22 Jan 2025

https://github.com/fbecerra/fbecerra.github.io

Source code for my website www.fernandobecerra.com

data-analysis data-science data-visualization dataviz interactive-visualizations

Last synced: 27 Oct 2024

https://github.com/praju-1/pandas

The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.

data-analysis pandas python

Last synced: 08 Feb 2025

https://github.com/i4ds/ecallisto_ng

Ecallisto NG is a Python package tailored for interacting with Ecallisto data.

data-analysis data-visualization e-callisto ecallisto-international-network numpy pandas python spectrometer

Last synced: 09 Nov 2024

https://github.com/bradleyboehmke/uc-bana-6043

Additional resources for the UC BANA 6043 Statistical Computing course

data-analysis data-science data-visualization python

Last synced: 14 Oct 2024

https://github.com/0mppula/element-compare

A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.

compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand

Last synced: 11 Nov 2024

https://github.com/shengyuan-lu/mgmt-172

Material for MGMT 172 Data and Programming for Analytics. (Course)

csv data-analysis data-visualization jupyter-notebook pandas sql sqllite yfinance

Last synced: 09 Feb 2025

https://github.com/CAIDA/submarine-cable-impact-analysis-public

This repository contains tools implemented for the PAM 2020 paper "Unintended consequences: Effects of submarine cable deployment on Internet routing" to collect and analyze data depicting the impact of the South-Atlantic Cable System (SACS) launch on Internet routing. This codebase can be extended to other use-cases of cable launches, failures, etc.

africa-americas africa-south-america bgp-data-analysis caida-ark-measurement-platform data-analysis historical-traceroutes impact internet-routing ripe-atlas-measurement-platform sacs-cable sail-cable submarine-cables

Last synced: 06 Nov 2024

https://github.com/michaelcurrin/water-crisis-scraper

Scrape and explore data related to Cape Town's water crisis (Python3 application)

cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping

Last synced: 28 Oct 2024

https://github.com/chaganti-reddy/evmarket-india

Electric Vehicle Market Segmentation Analysis in India

data-analysis data-science machine-learning market-segmentation pandas python

Last synced: 20 Jan 2025

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 16 Jan 2025

https://github.com/jethronap/asylumdataku_website

Mini website for reporting analysis of Asylum Data @ DIKU

data-analysis docsify nlp

Last synced: 06 Jan 2025

https://github.com/mindful-ai-assistants/movierevenueanalysis

🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.

correlation-analysis data-analysis data-science heatmap jupyter-notebook open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest

Last synced: 22 Jan 2025

https://github.com/idaraabasiudoh/vehicle-co2emission_model

Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained on sklearn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.

data-analysis jupyter-notebook machine-learning python3 scikit-learn

Last synced: 14 Jan 2025

https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django

A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data

analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas

Last synced: 10 Feb 2025

https://github.com/1994nikunj/nlp-toolkit-desktop-app

The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.

data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator

Last synced: 25 Nov 2024

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 17 Feb 2025

https://github.com/iantomasinicola/portfoliodataanalyst

Progetto di Data analysis con Python, Microsoft Sql Server e Excel

data-analysis excel python sql

Last synced: 02 Jan 2025

https://github.com/analyticalnahid/numpy-tutorial

A complete Notebook on Numpy for Data Science

data-analysis numpy numpy-tutorial

Last synced: 02 Feb 2025

https://github.com/dcs-training/r-qgisintegratingspatialanalysis

This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both R studio and QGIS. Go to the readme file

data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis

Last synced: 10 Nov 2024

https://github.com/fabienarcellier/qjoin

qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality

composable data-analysis developer-tools functools python

Last synced: 28 Nov 2024

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 17 Feb 2025

https://github.com/simoneas02/data-science

🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 01 Feb 2025

https://github.com/walidbosso/r_data_mining

Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees

association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio

Last synced: 28 Jan 2025

https://github.com/phollemans/cwutils

CoastWatch Utilities software for working with satellite data files from NOAA CoastWatch and elsewhere

cdat coastwatch-utilities data-analysis data-visualization install4j java noaa-coastwatch remote-sensing satellite-imagery

Last synced: 12 Nov 2024

https://github.com/techytushar/india-odi-analysis

Analysis of ODI cricket matches of Indian Team

cricket data-analysis data-science pandas plotting python3

Last synced: 07 Jan 2025

https://github.com/muzammil-13/mimlrepo

Data Analysis using Python Machine Learning Libraries

data-analysis data-science machine-learning numpy pandas python python-library

Last synced: 16 Jan 2025

https://github.com/arjo129/image-sorter

Sort through folders of videos and images. Root out blurred and overexposed images.

computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps

Last synced: 06 Jan 2025

https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring

A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.

data-analysis patient-monitoring

Last synced: 10 Feb 2025

https://github.com/hongbo-wei/global-status-of-cc-security-certification

Data visualization of CC Security Certification using VUE, Django, and MySQL.

big-date common-criteria data-analysis data-visualisation data-visualization

Last synced: 14 Jan 2025

https://github.com/w-edward/youtube-keyword-popularity-analyzer

An effort to discover the top trending keywords on Youtube.

data-analysis node-js numpy python webscraping youtube-api

Last synced: 16 Jan 2025

https://github.com/alieymsxxn/sql_project_data_job_analysis

This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.

data-analysis postgresql sql sqlite

Last synced: 28 Nov 2024

https://github.com/viper373/baidutieba

爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)

baidutieba-crawler bert data-analysis deep-learning python spider

Last synced: 05 Feb 2025

https://github.com/freekatz/english-reading

考研英语(10-19)数据集及相关数据分析

data-analysis dataset

Last synced: 02 Feb 2025

https://github.com/zrkhadija/data-analysis-for-financial-time-series

In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.

autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization

Last synced: 09 Feb 2025

https://github.com/supertetelman/kaggle-public

A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.

cnn data-analysis deep-learning machine-learning matlab python

Last synced: 28 Jan 2025

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 05 Feb 2025

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 02 Jan 2025

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 22 Jan 2025

https://github.com/yogeshnile/nifty50-index-time-series-analysis

In this repo i did analysis of Nifty50 five year data from 01-04-2015 to 31-03-2020. Data Downloaded from nse official website.

data-analysis matplotlib nifty numpy pandas plotly python3 time-series-analysis

Last synced: 10 Jan 2025

https://github.com/mindlessmuse666/client-data-analysing-tool

Проект производственной практики: Инструмент для анализа данных, построенный с использованием Python (бэкэнд, фронтэнд PyQt6), Pandas, Matplotlib и SQLite. Это приложение позволяет пользователям загружать данные в формате CSV, фильтровать их, визуализировать ключевые показатели с помощью графиков и создавать отчеты.

data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project

Last synced: 23 Dec 2024

https://github.com/zelosleone/finncorr

A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.

aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization

Last synced: 06 Feb 2025

https://github.com/tirendazacademy/data-sets

Data sets for Tirendaz Akademi Youtube

data-analysis dataset

Last synced: 01 Jan 2025

https://github.com/c0deta1ker/matbase

MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.

cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps

Last synced: 30 Nov 2024

https://github.com/andr3w03/bike-sharing-dashboard

Bike Sharing Data Analysis Streamlit Dashboard

dashboard data-analysis data-visualization python streamlit

Last synced: 29 Jan 2025

https://github.com/virajbhutada/tableau-data-vizzes

Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.

business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights

Last synced: 10 Jan 2025

https://github.com/shlokashah/student-depression-and-suicide-rate-prediction

https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/

data-analysis data-visualization machine-learning student suicide-rate-prediction

Last synced: 19 Feb 2025

https://github.com/antononcube/wl-outlieridentifiers-paclet

Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.

data-analysis hampel outlier-detection outliers

Last synced: 08 Feb 2025

https://github.com/andreaschandra/who-suicides-statistics

Exploratory Data Analysis for Suicides using Python

data-analysis data-science eda python

Last synced: 12 Feb 2025

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 23 Jan 2025

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 22 Nov 2024

https://github.com/juliusmarkwei/concrete-data

Data analysis, machine learning, model evaluation and optimization on the Concret_ Dataset

data-analysis data-science data-visualization ensemble-learning machine-learning modeling

Last synced: 01 Jan 2025

https://github.com/kevinyang372/san-francisco-crime-data-analysis

An ARIMA prediction model for forecasting potential crimes based on users' time and location

data-analysis machine-learning

Last synced: 30 Jan 2025

https://github.com/quantumudit/analyzing-gamerevolution-games

This project focuses on scraping data related to video games from the GameRevolution website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 17 Feb 2025

https://github.com/verbasik/yandex.practicum.datascience

Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.

data-analysis data-science machine-learning yandex-praktikum

Last synced: 10 Jan 2025

https://github.com/pratishtha-abrol/astronomy-dataanalysis

A key technique in Data Driven Astronomy

astronomy astropy crossmatch data-analysis

Last synced: 05 Feb 2025

https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes

In this repo, I have the course contents of Data Modelling Techniques training, which will be given to Innova Technology by the cooperation of Academy Peak Information Technologies Training and Consultancy between 25 - 26 January 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization

Last synced: 29 Jan 2025

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 05 Feb 2025

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 23 Jan 2025

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 30 Jan 2025

https://github.com/milind220/hk-air-quality-analysis

My final project for a statistics and data analysis course. Whew that was a lot of graphs!

data-analysis jupyter-notebook numpy pandas python python3 scipy seaborn statistics

Last synced: 03 Jan 2025

https://github.com/timzatko/fifa-19-dataset-machine-learning

Player's value prediction and game position classification on FIFA 19 dataset.

data-analysis fifa19 machine-learning scikit-learn

Last synced: 03 Jan 2025

https://github.com/deep-diver/enron-data-analysis

Data Analysis and Machine Learning on Enron Data

data-analysis enron-data exploratory-data-analysis machine-learning

Last synced: 05 Feb 2025

https://github.com/seyedhosseinzadeh/ws_tm

Weather web scraping and Time series model to predict temperature, humidity and barometer

data-analysis deep-learning lstm-model machine-learning prediction prediction-model weather web-scraping

Last synced: 10 Jan 2025

https://github.com/shivamswarnkar/tesla-stock-prediction

Making prediction of close prices of Tesla Stocks using different regression methods.

data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction

Last synced: 26 Jan 2025

https://github.com/cego669/datathonengopevi

Equipe: Embrapeiros. Solução proposta para o Datathon do VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Obs: FOMOS CAMPEÕES!!!!!!!!

data-analysis data-science datathon python r streamlit xgboost-classifier

Last synced: 08 Dec 2024