An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with correlation-analysis

A curated list of projects in awesome lists tagged with correlation-analysis.

https://github.com/ddz16/preformer

This repository contains the PyTorch code for the ICASSP 2023 paper "Preformer: Predictive Transformer with Multi-Scale Segment-wise Correlations for Long-Term Time Series Forecasting".

artificial-intelligence correlation-analysis deep-learning deep-neural-networks predictive-modeling time-series time-series-analysis time-series-forecasting time-series-prediction transformer

Last synced: 17 Jul 2025

https://github.com/chgl16/data-mining-algorithm

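📊 Common data mining algorithms: the Apriori algorithm for association analysis, the decision tree algorithm for data classification, and the K-means algorithm for data clustering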

apriori-algorithm correlation-analysis data-classification data-mining-algorithms k-means-clustering

Last synced: 12 May 2025

https://github.com/open-risk/correlationmatrix

correlationMatrix is a Python-powered library for the statistical analysis and visualization of correlations

correlation-analysis correlation-matrices data-analysis data-science statistics

Last synced: 04 Jul 2025

https://github.com/jware-solutions/ggca

Blazing fast Gene/GEM Correlation Analysis for Rust and Python

bioinformatic correlation correlation-analysis gene-expression gene-expression-modulation python rust

Last synced: 31 Mar 2025

https://github.com/mittelmark/snha

St. Nicolas House Algorithm implementation in R - predicting correlation networks using association chains

correlation-analysis network network-analysis network-reconstruction r-package

Last synced: 06 Mar 2026

https://github.com/dagiteferi/financial-news-sentiment-stock-market-correlation-analysis

Analyze financial news sentiment and its correlation with stock market movements. Use NLP, sentiment analysis, and financial analytics to uncover insights for enhanced financial forecasting and innovative investment strategies.

correlation-analysis financial-analysis financial-forecasting sentiment-analysis

Last synced: 12 May 2025

https://github.com/jacksonwalters/scotus-v-public

Capstone project for The Data Incubator ('18). Plots SCOTUS vs. public opinion polarity over time given keywords.

correlation-analysis deep-learning machine-learning opinion-polls opinion-summarization sentiment-polarity supreme-court supreme-court-cases

Last synced: 20 Mar 2025

https://github.com/devanshi-bavaria/predictive-modeling-for-stock-market-trends

📈 Comprehensive stock price analysis, including preprocessing, clustering, correlation, and predictive modeling, to enhance investment insights and accuracy. 💡

clustering-analysis correlation-analysis eda ml permutation-test

Last synced: 30 Apr 2025

https://github.com/pavankethavath/microsoft-classifying-cybersecurity-incidents-with-ml

A machine learning pipeline for classifying cybersecurity incidents as True Positive (TP), Benign Positive (BP), or False Positive (FP) using the Microsoft GUIDE dataset. Features advanced preprocessing, XGBoost optimization, SMOTE, SHAP analysis, and deployment-ready models. Tools: Python, scikit-learn, XGBoost, LightGBM, SHAP, and imbalanced-learn.

classificationreport correlation-analysis dataanalysis decision-tree-classifier exploratory-data-analysis feature-engineering feature-selection gradientboosting hyperparameter-tuning joblib lgbmclassifier logistic-regression machine-learning modelselection pandas randomforestclassifier randomsearchcv shap smote xgboost-classifier

Last synced: 23 Apr 2025

https://github.com/twguest/tiepy

Solutions to the Transport-of-Intensity Equation (speckle tracking, Paganin algorithm, etc.)

correlation-analysis phase-retrieval wavefront-sensing x-ray-diffraction

Last synced: 02 Apr 2026

https://github.com/zelosleone/finncorr

A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.

aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization

Last synced: 09 Feb 2026

https://github.com/zrkhadija/data-analysis-for-financial-time-series

In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.

autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization

Last synced: 09 Feb 2026

https://github.com/sbouchard01/corrhod

A code that populates dark matter halos and computes correlation functions

abacus correlation-analysis cosmology densitysplit n-body

Last synced: 31 Oct 2025

https://github.com/mindful-ai-assistants/movierevenueanalysis

🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.

correlation-analysis data-analysis data-science heatmap jupyter-notebook oneness-consciousness open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest

Last synced: 14 Apr 2025

https://github.com/firaskahlaoui/pca-insights

PCA Insights is a data analysis project aimed at applying Principal Component Analysis (PCA) to high-dimensional datasets for dimensionality reduction, visualization, and exploration.

correlation-analysis datavisualization dimensionalityreduction pca python standarization

Last synced: 30 Jul 2025

https://github.com/alicankaya192/world-happiness-report-2025

Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).

correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report

Last synced: 21 Jun 2026

https://github.com/richard-sti/gpc

Generalised Partial Correlation.

correlation-analysis statistics

Last synced: 30 Apr 2025

https://github.com/sushant1827/lstm_for_household_power_consumption

This project explores the application of Long Short-Term Memory (LSTM) networks in predicting household power consumption. Using data collected at one-minute intervals, we demonstrate how LSTM can be leveraged for accurate forecasting.

correlation-analysis data-visualization dropout evaluation-metrics feature-engineering lstm lstm-model resampling reshaping-datasets time-series-analysis time-series-forecasting

Last synced: 19 May 2026

https://github.com/tedoaba/kaim-w1

Predictive Analytics through Sentiment and Correlation Analysis

correlation-analysis financial-analysis financial-forecasting kaim sentiment-analysis

Last synced: 21 May 2026

https://github.com/hafidaso/comprehensive-market-analysis-of-airbnb-listings-trends-pricing-and-host-insights

This project conducts a thorough market analysis of Airbnb listings, encompassing various aspects such as market trends, property types, pricing, occupancy rates, host distribution, and regulatory impacts.

correlation-analysis data-cleaning geospatial-analysis pandas python

Last synced: 05 May 2026

https://github.com/sushantdhumak/lstm_for_household_power_consumption

This project explores the application of Long Short-Term Memory (LSTM) networks in predicting household power consumption. Using data collected at one-minute intervals, we demonstrate how LSTM can be leveraged for accurate forecasting.

correlation-analysis data-visualization dropout evaluation-metrics feature-engineering lstm lstm-model resampling reshaping-datasets time-series-analysis time-series-forecasting

Last synced: 26 Mar 2025

https://github.com/mahnoorsheikh16/estimating-aggregate-import-demand-function

To test whether the economic theory relating imports, real GDP, and relative prices holds in real-life scenarios, this study analyses the extent of association between the variables through regression analysis. The paper can be used to assess the difference in impact of global events on GDP and import volume.

autocorrelation breakpoint-detection chow-test correlation-analysis cross-sectional-data economics-models f-test imf-data log-log-graphs policy-analysis ramsay regression-analysis spss-statistics time-series-analysis wald-test

Last synced: 19 Mar 2026

https://github.com/sohhamseal/help-me-with-my-data

A website to help users view, verify, and modify data for preprocessing and apply various classical ML algorithms

correlation-analysis data-transformation data-visualization descriptive-analysis exploratory-data-analysis k-means-clustering regression-analysis

Last synced: 19 Mar 2026

https://github.com/ayushi-gajendra/spotify-audio-dna-descriptive-analysis

Strategic data analysis using descriptive statistics to identify popularity patterns in Spotify tracks. Features data-driven insights for playlist curation.

central-tendency correlation-analysis data-analytics-project descriptive-statistics excel google-sheets market-analysis skewness spotify variability-analysis visualization

Last synced: 05 Jun 2026

https://github.com/ifigeneiatsiflidou/applied-statistics-project

Project for an Applied Statistics course, involving exploratory data analysis and predictive modeling of movie revenue using engineered features and multiple linear regression.

correlation-analysis data-analysis linear-regression python scikit-learn visualization

Last synced: 29 Apr 2026

https://github.com/megaemce/correlations.world

Correlations between various world data

chartjs correlation-analysis iq statistics world world-data

Last synced: 18 Aug 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/dheyhasan/echo-trends

EchoTrends is a data visualization app that analyzes your Spotify playlists and reveals insightful patterns—such as track duration, popularity, and statistical correlations—using interactive charts and statistical tests. Built with React (frontend) and FastAPI (backend), it offers both functional analysis and a demo landing

correlation-analysis data-visualization fastapi javascript music-analysis python react recharts spotify-api tailwindcss

Last synced: 11 Apr 2026

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/edanur-y/cramer-s-v-and-eta-correlation-ratio-examples

Testing correlation between two categorical variables and testing correlation between a non-dichotomous categorical variable and a quantitative variable. ⭕SPSS

correlation-analysis spss

Last synced: 02 Jan 2026

https://github.com/dhchenx/correlation-kit

A toolkit for estimating the correlation between variables

binary-variable correlation-analysis kendalltau multi-category pearson spearman

Last synced: 22 Jul 2025

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-1

Multiple-Linear-Regression-1. Consider only the below columns and prepare a prediction model for predicting Price of Toyota Corolla.

cooks-distance correlation-analysis influence-plot leverage-score multi-linear-regression ols ols-regression-model p-value pairplot python r-square-values regression-plot scatter-plot statsmodels

Last synced: 19 May 2026

https://github.com/vbhvsingh0/deforestation_rainfall_correlation

The aim of this project is to check whether rainfall is correlated with deforestation in the US state of Pennsylvania.

correlation-analysis data-science matplotlib-pyplot numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/razamehar/statistical-analysis-on-the-boston-housing-data

R-based statistical analysis of Boston Housing Data. Explored feature scales, computed descriptive stats, visualized data, and identified outliers (e.g., higher crime rates in specific areas). Examined variable relationships, calculated correlation coefficients, and presented findings via cross-classifications.

boston-housing-dataset correlation-analysis cross-classification descriptive-statistics frequency-distribution outliers-detection r-programming statistical-analysis

Last synced: 02 Feb 2026

https://github.com/luliatuccu/education_great_equalizer

This project examines the relationship between education and income among Black and White Americans using American Community Survey data. It highlights racial disparities, with White Americans showing stronger correlations between higher education and income. Spatial analyses reveal significant income gaps and systemic barriers for Black Americans

american-community-survey college-degree-impact correlation-analysis education geographic-mapping higher-education-outcomes income-gap-analysis income-inequality racial-disparities racial-equity spatial-visualization systemic-barriers temporal-trends

Last synced: 07 Jan 2026

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project's features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 12 Apr 2026

https://github.com/pngo1997/adult-income-analysis

Explores the UCI Adult dataset, analyzing demographic and work-related factors to understand income distribution.

correlation-analysis eda normalization python visualization

Last synced: 14 May 2026

https://github.com/stefagnone/ames_housing_feature_engineering

Feature engineering project on the Ames Housing dataset, focusing on creating impactful new features to improve housing price predictions.

ames-housing-dataset correlation-analysis data-science feature-engineering real-estate-analytics

Last synced: 05 Apr 2025

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and development spend in the past few years; Administration -- spend on administration in the past few years; Marketing Spend -- spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate

This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.

correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif

Last synced: 17 May 2026

https://github.com/victory-ik/supportive-leadership-and-employee-satisfaction-and-performance

Using data from the European Working Conditions Survey (EWCS) 2015, this research explores how supportive leadership impacts employee well-being, with work engagement serving as a mediating factor.

composite-measure correlation-analysis cronbach-alpha data-filtering data-transformation data-visualization exploratory-data-analysis mediation-analysis r regression-analysis

Last synced: 13 Jul 2025

https://github.com/ndomah1/learning-probability-and-statistics

This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.

correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics

Last synced: 18 Jan 2026

https://github.com/onome-joseph/recommender-system-for-games

This project is designed to suggest games to users based on the games they recently played. By leveraging a correlation-based algorithm, the system identifies and recommends games that align with the user's preferences.

correlation-analysis data-insights data-science recommender-system

Last synced: 12 Oct 2025

https://github.com/wikixen/breast-cancer-id-model

An ML model a team & I made in R that predicts whether or not a person has breast cancer.

correlation-analysis machine-learning r

Last synced: 15 Apr 2026

https://github.com/apsinghanalytics/business_eda_python

Exploratory Data Analysis of a Business Using Python and its Libraries: NumPy, Pandas, Seaborn, and Matplotlib

correlation-analysis exploratory-data-analysis matplotlib seaborn visualization

Last synced: 13 May 2026

https://github.com/mjteran/correlation_hypotheses_eda

This project uses correlation analysis to explore relationships between health and lifestyle factors. It involves exploratory data analysis (EDA), hypothesis testing, and various correlation tests (Pearson, Point-Biserial, Phi Coefficient, Kendall’s Tau) to identify significant correlations.

correlation-analysis exploratory-data-analysis hypothesis-testing matplotlib python seaborn

Last synced: 15 May 2026

https://github.com/keerthanapalanikumar/health-analysis

This project analyzes a health monitoring dataset to explore relationships between various health metrics, including age, gender, heart rate, blood pressure, respiratory rate, body temperature, activity level, oxygen saturation, sleep quality, stress level, and timestamps.

correlation-analysis data-cleaning

Last synced: 27 Mar 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/t4vexx/ia-classification-analysis

This GitHub repository hosts my AI evaluation work, featuring a Kaggle dataset analysis, experiments with three ML algorithms (including hyperparameter tuning), and a detailed exploration of wine quality data through outlier detection, correlation, and normalization techniques.

ai artificial-intelligence bert-embeddings bert-model confusion-matrix correlation-analysis embedding fake-news-detection logistic-regression python3 random-forest-classifier svm-classifier

Last synced: 01 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains a detailed EDA of the Home Credit Group dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices.

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/ngangawairimu/data-processing-with-pandas-music-trends-on-spotify.

This project analyzes Spotify’s top tracks dataset, exploring trends in musical features, artist popularity, genre distribution, and correlations between various song attributes.

correlation-analysis exploratory-data-analysis pandas statistical-analysis

Last synced: 05 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/chrisduvillard/portfoliocorrelationsimulator

A synthetic asset portfolio simulator to explore the benefits of uncorrelated asset returns.

asset-allocation correlation-analysis portfolio-simulation python streamlit

Last synced: 03 Jan 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026

https://github.com/makoczoro/credit-default-risk-analysis-eda

This repository contains a detailed EDA of the Home Credit Group dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices.

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 20 May 2026

https://github.com/apsinghanalytics/wikiviewscryptopricetrendanalysis

Crypto Sentiment Analysis via Wikipedia Page View Trends and Bitcoin Price and Volume Trends

correlation-analysis crypto data-visualization exploratory-data-analysis seaborn time-series

Last synced: 10 Oct 2025