An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with statsmodels

A curated list of projects in awesome lists tagged with statsmodels .

https://github.com/BayesWitnesses/m2cgen

Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

c csharp dartlang go haskell java javascript lightgbm lightning machine-learning php python r ruby rust scikit-learn statistical-learning statsmodels xgboost

Last synced: 14 Mar 2025

https://github.com/bayeswitnesses/m2cgen

Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

c csharp dartlang go haskell java javascript lightgbm lightning machine-learning php python r ruby rust scikit-learn statistical-learning statsmodels xgboost

Last synced: 14 May 2025

https://github.com/mars-project/mars

Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

dask dataframe joblib lightgbm machine-learning numpy pandas python pytorch ray scikit-learn statsmodels tensor tensorflow xgboost

Last synced: 25 Apr 2025

https://github.com/dipanjans/practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

classification clustering computer-vision convolutional-neural-networks deep-learning jupyter jupyter-notebook keras machine-learning natural-language-processing nltk notebook pandas prophet python scikit-learn spacy statsmodels tensorflow time-series-analysis

Last synced: 14 May 2025

https://github.com/dipanjanS/practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

classification clustering computer-vision convolutional-neural-networks deep-learning jupyter jupyter-notebook keras machine-learning natural-language-processing nltk notebook pandas prophet python scikit-learn spacy statsmodels tensorflow time-series-analysis

Last synced: 25 Mar 2025

https://github.com/softwareag/nyoka

Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard).

lightgbm machine-learning nyoka pmml pmml-exporter python python-library scikit-learn statsmodels xgboost

Last synced: 04 Apr 2025

https://github.com/heidelbergcement/hcrystalball

A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosystem.

cross-validation data-science fbprophet model-selection pmdarima sarimax sklearn sklearn-api sklearn-compatible sklearn-library sktime statsmodels tbats time-series time-series-forecasting transformer wrapper

Last synced: 05 Apr 2025

https://github.com/eigenfoo/tests-as-linear

Python port of "Common statistical tests are linear models" by Jonas Kristoffer Lindeløv.

linear-models python scipy statistical-tests statsmodels

Last synced: 11 Apr 2025

https://github.com/sayakpaul/a-b-testing-with-machine-learning

Implemented an A/B Testing solution with the help of machine learning

python sklearn statsmodels xgboost

Last synced: 04 Oct 2025

https://github.com/majorlift/volatility-modeling-python-datasci

Undergraduate thesis, Seoul National University Dept. of Economics — "Modeling Volatility and Risk Spillover Between the Financial Markets of US and China Using GARCH Value-at-Risk Forecasting and Granger Causality."

arima-forecasting data-science data-vizualization financial-engineering garch-model granger-causality jupyter-notebook numpy pandas pyplot python3 regression-models research-paper risk-modelling scipy-stats seaborn statsmodels time-series-analysis value-at-risk volatility-modeling

Last synced: 25 Apr 2025

https://github.com/esvs2202/concrete-compressive-strength-prediction

The aim of this project is to develop a solution using Data science and machine learning to predict the compressive strength of a concrete with respect to the its age and the quantity of ingredients used.

anaconda data-visualization flask gunicorn-web-server heroku-deployment html5 joblib jupyter-notebook machine-learning-algorithms matplotlib-pyplot numpy pandas pycharm-ide python3 randomizedsearchcv scikit-learn seaborn statsmodels xgboost-regression

Last synced: 12 Oct 2025

https://github.com/msikorski93/forecasting-inflation-rates-of-poland

Recently inflation is a popular topic in Poland and is highest since 2001. Experts presume inflation in Poland should continue to rise, and by the end of 2021 it will be close to 8%. This notebook aims to develop a forecasting model for time series using Python.

autoregressive-models cnn inflation-forecasting lstm sarima statsmodels tensorflow time-series-forecasting

Last synced: 27 Apr 2025

https://github.com/bhattbhavesh91/linear-regression-assumptions

A small repository explaining how you can validate your linear regression model based on assumptions

assumptions linear-regression linear-regression-assumptions linear-regression-python ols-statsmodels python statsmodels video

Last synced: 17 Apr 2025

https://github.com/stats-tests/statstests

Statstests: a Python package that provides a complement of process and statistical tests for statsmodels statistical models.

count-models data-analysis hyphotesis-tests python regression-models statistics statsmodels

Last synced: 17 Mar 2026

https://github.com/zoharbabin/edgar_analytics

EDGAR Analytics – Python Library for Extracting, Analyzing, and Forecasting SEC EDGAR Filings. Streamline your financial analysis with comprehensive metrics, growth rates, and automated reporting capabilities.

accounting arima data-analysis-python edgar finance financial-analysis financial-metrics forecasting investment sec sec-filings statsmodels stock-market xbrl

Last synced: 17 Jan 2026

https://github.com/mdh266/crimetime

Python web application for exploring and forecasting crime rates in NYC

data-science docker flask-application forecasting-crime-rates geospatial-analysis pandas python statsmodels time-series-analysis

Last synced: 30 Jul 2025

https://github.com/open-risk-academy/academy-course-dat31048

Supporting material for the Open Risk Academy course "Exploratory Data Analysis using Pandas, Seaborn and Statsmodels"

elearning exploratory-data-analysis pandas python seaborn statsmodels

Last synced: 05 May 2025

https://github.com/surajiyer/python-data-utils

🚀 Utility classes and functions for common data science libraries

clustering etc matplotlib multiview-clustering nlp pandas sklearn spark statsmodels utilities

Last synced: 12 Apr 2025

https://github.com/farahibrar/programming-in-python

Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.

beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow

Last synced: 16 Jul 2025

https://github.com/themagicalmammal/stock-analyser

Simple to use interfaces for basic technical analysis of stocks.

matplotlib numpy pandas seaborn statsmodels

Last synced: 29 Oct 2025

https://github.com/ascender1729/salarypredictionlinearreg

SalaryPredictionLinearReg is a Python-based project utilizing linear regression to predict salaries from years of experience. It covers data loading, model training, detailed statistical analysis, and visualization of results.

data-science linear-regression machine-learning python salary-prediction scikit-learn seaborn statsmodels

Last synced: 07 May 2025

https://github.com/nfultz/intro_dueto

~PyDataLA 2020~ talk

calculus keras slides statsmodels

Last synced: 06 Jun 2026

https://github.com/mgckaled/ignite-devia-supervised_algorithms

Repositório que reuni os módulos 7 ao 13 da Formação Desenvolvimento IA 2023-2024, desenvolvido pela Rocketseat Education.

gradio joblib pandas python scikit-learn statsmodels uvicorn

Last synced: 12 Apr 2026

https://github.com/vineet416/chronic-kidney-disease-prediction

This repository contain code of Chronic Kidney Disease Detection Prediction Project. The goal of this project is predict the chronic kidney disease using parameters like Diabetes Mellitus, Blood Urea, Sugar, Hypertension etc.. I used multiple machine learning algorithms with hyperparameter tuning which is having highest accuracy score of 97.5

data-visualization data-wrangling exploratory-data-analysis feature-engineering feature-selection hyperparameter-tuning machine-learning matplotlib numpy pandas plotly pre-processing python seaborn sklearn-library statsmodels

Last synced: 14 Apr 2026

https://github.com/atharvapathak/sales_forecasting_project

Forecasted product sales using time series models such as Holt-Winters, SARIMA and causal methods, e.g. Regression. Evaluated performance of models using forecasting metrics such as, MAE, RMSE, MAPE and concluded that Linear Regression model produced the best MAPE in comparison to other models

datamining demand-forecasting feature-engineering machine-learning machinelearning python regression-trees retail sales sales-forecasting seaborn sklearn statsmodels time-series-analysis time-series-decomposition

Last synced: 12 Apr 2026

https://github.com/anushkapatil0727/airbnb-driven-market-saturation-and-forecasting-visualization-dashboard

Analyzed Airbnb's impact on NYC rentals using Python, SQL, and Tableau. Built an ETL pipeline, performed forecasting with scikit-learn, and created interactive dashboards to visualize pricing trends and listing saturation.

analystics business-analytics etl-pipeline kpi linear-regression matplotlib-python pandas-python postgresql problem-solving seaborn-python sklearn statsmodels tableau-dashboards visualization

Last synced: 09 May 2026

https://github.com/psyplot/psy-reg

Psyplot plugin for visualizing and calculating regression plots

data-science matplotlib netcdf psyplot regression-analysis statsmodels visualization xarray

Last synced: 15 Mar 2026

https://github.com/rakibhhridoy/easywaydiveinto-datascience

Data Science is not as easy as it seems at first. The most problem faced by new learner are lack of resource knowledge as well as confusion in using the various resources. I hope this repository will benefit confusion learner.

algorithms algorithms-implemented bayesian-statistics data-science deep-learning deep-neural-networks linear-algebra machine-learning matplotlib multivariate-calculus numpy optimization pandas python scikit-learn scipy seaborn statistics statsmodels tensorflow

Last synced: 06 Apr 2026

https://github.com/alumik/time-series-decomposition

A practical example of time series decomposition

seasonal-trend-loess statsmodels time-series-analysis

Last synced: 16 Mar 2025

https://github.com/bessarodrigo/linear-regression-salaries

Análise dos fatores que influenciam os salários dos colaboradores de uma empresa, utilizando técnicas de regressão linear múltipla.

matplotlib pandas python regression regression-models seaborn statistics statsmodels

Last synced: 07 May 2026

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/rakibhhridoy/exploratorydataanalysis-python

Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest

Last synced: 30 Apr 2026

https://github.com/bessarodrigo/regressao_linear_salarios

Análise dos fatores que influenciam os salários dos colaboradores de uma empresa, utilizando técnicas de regressão linear múltipla.

matplotlib pandas python regression regression-models seaborn statistics statsmodels

Last synced: 02 Apr 2025

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/antoogal/datainsightexample

Example project showing skills with analytical tools

jupyter pandas-library powerbi python scipy sql sqlite3 statsmodels

Last synced: 11 Mar 2025

https://github.com/jabulente/tukeys-honest-significant-difference

This project explores Tukey’s Honest Significant Difference test as a robust statistical method for comparing group means after conducting ANOVA. In real-world data analysis, we often need to determine not just whether groups are different, but which specific groups differ

ai anova-test exploratory-data-analysis ml post-hoc-analysis python scipy statsmodels tukey-hsd

Last synced: 08 May 2026

https://github.com/kostadinlambov/time-series-forecasting

This project evaluates the predictive performance of a CNN-LSTM Hybrid deep learning model for Bitcoin price movement prediction.

keras-tensorflow matplotlib-pyplot mlflow numpy optuna pandas python scikit-learn seaborn statsmodels ta-lib tensorflow

Last synced: 07 Apr 2026

https://github.com/guslovesmath/o3_aqi_emission_ml

Analyzing O3 Air Quality Index trends (2000-2023) in the U.S., this project identifies regions with rising pollution. Utilizing exploratory data analysis and time-series modeling, it offers actionable insights for informed policy decisions on urgent O3 pollution issues.

forecasting machine-learning statsmodels time-series train-test-split

Last synced: 08 Oct 2025

https://github.com/yeisson8a/tallerdataanalysis

Taller de Data Analysis utilizando Python y las librerías Pandas, NumPy, Seaborn, Matplotlib para manipulación de datos de un archivo, así como la librería Statsmodels para regresiones lineales.

analisis-de-datos linear-regression matplotlib numpy pandas python3 seaborn statsmodels

Last synced: 14 Apr 2026

https://github.com/tekewin/gold_prices_sarimax

Predicting gold prices with SARIMAX (FRED data)

python statsmodels time-series visualization

Last synced: 19 Apr 2026

https://github.com/e-panourgia/international_sports_events_window_dressing_and_repression

International Sports Events and Repression in Autocracies. Statistical Analysis with Python

analysis sklean statsmodels

Last synced: 25 Oct 2025

https://github.com/karencofre/marketing-segmentacion-en-powerbi

Proyecto prueba de hipótesis en powerbi y python

bigquery google-colab powerbi python sql statsmodels

Last synced: 31 Jan 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/mehassanhmood/tiktok_project

TikTokClaimDetective: Claim or Opinion using ML

machine-learning python scipy sklearn statistics statsmodels

Last synced: 16 Apr 2026

https://github.com/jturner-uofl/pymmeans

Estimated marginal means, contrasts, and small-sample mixed-model inference for Python — a native port of R's emmeans and pbkrtest.

biostatistics bootstrap causal-inference contrasts data-science dunnett emmeans g-computation kenward-roger least-squares-means lsmeans marginal-means mixed-models pbkrtest post-hoc-tests python satterthwaite statistics statsmodels tukey

Last synced: 03 Jun 2026

https://github.com/ledsouza/curso_de_estatistica_parte_4

Projeto de estatística para estudo de correlação e regressão

data-science data-visualization numpy pandas seaborn statsmodels vitrinedev

Last synced: 17 Apr 2026

https://github.com/dataiku/dss-plugin-generalized-linear-models

Dataiku DSS plugin to train Generalized Linear Models

generalized-linear-models glm machine-learning statsmodels

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/genndy007/data-analysis

Laboratory works and learning of Data Science (MS PowerBI, Jupyter Notebook, scipy, sklearn, statsmodels, pandas, matplotlib etc)

jupyter-notebook matplotlib pandas powerbi scipy sklearn statsmodels

Last synced: 18 Apr 2026

https://github.com/dynamicanupam/bike_sharing_demand_study_using_linear_regression

Build a regression model to understand the factors on which the demand for bike sharing systems vary on and help a company optimise its revenue.

exploratory-data-analysis linear-regression matplotlib multiple-linear-regression pandas python seaborn sklean statsmodels

Last synced: 18 Apr 2026

https://github.com/gabrielmazzotta/time-series-analysis-and-forecasting_case-study

This project focuses on Time Series Analysis techniques, uncovering patterns and leveraging forecasting models to predict future sales trends.

arima-model exponential-smoothing holt-winters-forecasting sarima statsmodels time-series-analysis time-series-decomposition

Last synced: 19 Apr 2026

https://github.com/shortthirdman/distributedlagmodels

Distributed Lag Models in Dynamic Model Time Series

disributed-log-models numpy pandas python statsmodels time-series

Last synced: 19 Apr 2026

https://github.com/rodriguesl1/analise-ibovespa-fiap

Modelo de previsão do índice IBOVESPA utilizando técnicas de séries temporais. O projeto inclui análise exploratória, decomposição sazonal, testes de estacionariedade e modelagem com Prophet, AutoARIMA e outros modelos estatísticos para apoiar decisões de investimento.

autoarima b3 brasil data-analysis economia finance forecasting ibovespa pandas prophet python statsmodels time-series

Last synced: 19 Apr 2026

https://github.com/thaisgarcia/statistical-analysis

O Statsmodels é uma biblioteca em Python dedicada à estimação e teste de modelos estatísticos. Ele fornece ferramentas para realizar análises estatísticas detalhadas, como regressão linear, modelos de séries temporais, análise de variância e testes estatísticos.

pandas statsmodels

Last synced: 20 Apr 2026

https://github.com/offthetab/statistics_homework

Практические работы по статистике МИРЭА 3 курс.

numpy pandas python statsmodels

Last synced: 05 Apr 2026

https://github.com/arush18/customer-segmentation

Machine learning-based customer segmentation using classification models, data preprocessing, and exploratory analysis.

marketing sklearn statsmodels unsupervised-machine-learning

Last synced: 27 Apr 2026

https://github.com/nyx1311/timelstm

TimeLSTM: An interactive Streamlit app for multi-step time series forecasting using LSTM networks, featuring data preprocessing, visualization, GPU-accelerated model training, and automated result export.

deep-learning deep-neural-networks lstm-neural-networks numpy pandas ploty python3 scikit-learn-python statsmodels streamlit torch tqdm

Last synced: 27 Apr 2026

https://github.com/fikri-rouzan/burnaway-capstone-data-science

Dashboard analitik interaktif untuk memetakan faktor fisik dan pola kerja pemicu burnout pada software developer.

jupyter-notebook matplotlib pandas pillow plotly python scikit-learn seaborn statsmodels streamlit

Last synced: 08 Jun 2026

https://github.com/willie-conway/boston-housing-market-analysis

This repository contains an 📈analysis of the 🏘️Boston Housing Data focusing on the factors influencing housing prices. Using statistical tests and data visualizations, we explore key relationships, including the impact of proximity to the Charles River 🌊, the age of homes 🏡, the NOX concentrations 🧪, and employment center distances 🚗.

analysis anova boston-housing-dataset boston-housing-price-prediction matplotlib nox pandas pearson-correlation python regression-analysis scipy seaborn statsmodels t-test

Last synced: 30 Apr 2026