Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/worst001/note_machine_learning
整理了机器学习相关资料与手册,包括数学基础、机器学习模型实现示例、神经网络。
ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook
Last synced: 08 May 2026
https://github.com/ezzz-lui/rsm-evaluationproject
Este repositorio es donde esta documentado nuestro proyecto para RSM por parte de actividad final para el bootcamp Data Analyst
Last synced: 13 Feb 2026
https://github.com/kirrrto/amazon-market-research-dashboard
Amazon product research dashboard for spreadsheet imports, supplier product pages, specification matrices, gap analysis, requirement drafts and supplier follow-up exports.
amazon-product-research data-analysis ecommerce market-research pandas product-development python specification-analysis streamlit
Last synced: 17 Jun 2026
https://github.com/quantumudit/thereyougo-store-analysis
This project focuses on scraping all the products and their related info from the "There You Go" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 21 Apr 2026
https://github.com/abhi18av/innovation-competition
Submission for a programming challenge
clojure clojurescript data-analysis
Last synced: 13 Jun 2026
https://github.com/lisa-ho/breadit
Respository for scraping and analysing data from the Reddit/Sourdough community to explore lockdown baking trends.
data-analysis data-viz nltk python reddit-api sentiment-analysis web-scraping
Last synced: 01 May 2026
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/rakumar99/jp-morgan-chase-virtual-internship
This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel
conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup
Last synced: 02 Mar 2026
https://github.com/reinmagine/eliminating-no-sensor
Contains my project that analyzes air quality sensor data to determine if the NO (Nitric Oxide) sensor in N. Mai, Los Angeles, CA can be removed without affecting data accuracy.
air-quality-sensor colab-notebook cost-optimization data-analysis data-optimization matplotlib-python nitric-oxide pyspark-python python sql
Last synced: 14 Jun 2026
https://github.com/wrighang/shipping-data-analysis
Independent Project: Transit time trends analysis following a major shipping process change.
data-analysis matplotlib numpy pandas python
Last synced: 18 Apr 2026
https://github.com/soufianboukir/ecom-analytics-platform
End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.
data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries
Last synced: 14 Jun 2026
https://github.com/sunnybibyan/marketing_campaign_analysis_power_bi_dashboard
Campaign Performance Analysis This project analyzes the performance of Spring, Summer, and Fall marketing campaigns, revealing key insights and actionable recommendations.
data-analysis data-visualization dax marketing-campaign powerbi
Last synced: 19 Mar 2026
https://github.com/dogan-the-analyst/model_car_warehouse_analysis
This is a SQL project.
Last synced: 15 Jun 2026
https://github.com/madeiradata/microsoft-data-analysts-club
Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts
data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report
Last synced: 19 Mar 2026
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 15 Jun 2026
https://github.com/ensinho/data-analysis
My repository for data analysis studys in Python.
csv data-analysis graphs python python-documentation
Last synced: 15 Jun 2026
https://github.com/com-480-data-visualization/project-2023-the-vizards
Lausanne Transportation : a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).
buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl
Last synced: 01 May 2026
https://github.com/chetanmalviya513/Firm-Financial-Transaction-Analysis
📊 Financial Analysis & Forecasting Processed large-scale financial data using Python for trend analysis and insights. Developed interactive Tableau dashboards to improve forecasting accuracy and reduce costs by 25%.
data-analysis financial-data forecasting insights msexcel pandas python reporting tableau-dashboards
Last synced: 15 Jun 2026
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 16 Jun 2026
https://github.com/kaushik0911/jubilant-guide
A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.
data-analysis data-visualization geospatial-analysis python streamlit
Last synced: 16 Jun 2026
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 16 Jun 2026
https://github.com/mindgamesnl/yanderestats
https://mindgamesnl.github.io/YandereStats/
data-analysis reporting-pipeline yandere yandere-sim
Last synced: 18 Jun 2026
https://github.com/film2549/data-analysis-of-a-simulated-marketing-business-case-using-python-sql-and-power-bi
Data Analysis of a Simulated Marketing Business Case Using Python, SQL and Power BI
chulalongkorn computer-engineering computer-science data-analysis data-visualization database marketing nltk-library pandas powerbi pyodbc python simulation sql sqlserver
Last synced: 01 May 2026
https://github.com/archie-cm/credit_risk_model_vix_id-x_partners
The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments
credit-risk data-analysis data-visualization machine-learning scorecard
Last synced: 01 May 2026
https://github.com/kalebers/economic_analysis_data_science
Data Analysis Python project using economy data base to predict percentage of good and bad payers
data-analysis data-science machine-learning pandas python scipy sklearn-library
Last synced: 18 Apr 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 25 Feb 2026
https://github.com/duoan/machine-learning-notebook
A notebook repository for tracking learning machine learning notebook.
data-analysis decision-tree ensemble-model gbdt machine-learning numpy pandas xgboost
Last synced: 18 Jun 2026
https://github.com/jmssnr/shuffle-kit
shuffle-kit: model and analyze playing card shuffles in Python
data-analysis playing-cards python shuffle statistics
Last synced: 19 Jun 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/lebrancconvas/how-much-love-in-thai-song
How much Love song among the Thai Songs?
data-analysis side-project web-scraping
Last synced: 19 Jun 2026
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 22 Apr 2026
https://github.com/kirkalyn13/open-signal-report-generator
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 19 Jun 2026
https://github.com/nafisalawalidris/data-analysis-with-python
This repo features Jupyter Notebook labs for learning data analysis with Python. Explore data acquisition, wrangling, visualization, modeling, and evaluation. Enhance your skills in Python data analysis.
data-acquisition data-analysis data-science data-wrangling exploratory-data-analysis feature-engineering machine-learning model-development model-evaluation-and-refinement pandas
Last synced: 02 May 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-api
The WhatsApp Chat Analyzer API is a public api specifically designed for frontend enthusiasts who are interested in building a WhatsApp Chat Data Visualizer project. Built on FastAPI, this API offers a seamless and efficient method to process chat data and returns the processed result data in JSON format.
api data-analysis data-science fastapi publicapi python
Last synced: 20 Jun 2026
https://github.com/denisecase/nlp-03-text-exploration
Exploratory analysis of text corpora using tokenization, frequency, co-occurrence, and bigrams to reveal structure in text.
bigrams co-occurence corpus-analysis data-analysis nlp python text-analysis text-exploration tokenization
Last synced: 02 Jun 2026
https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes
Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.
analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3
Last synced: 02 May 2026
https://github.com/markmusic27/data-statistics-calculator
💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.
data-analysis standard-deviation statistics statistics-calculator
Last synced: 20 Jun 2026
https://github.com/alicankaya192/world-happiness-report-2025
Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).
correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report
Last synced: 21 Jun 2026
https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries
🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈
ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics
Last synced: 21 Jun 2026
https://github.com/datadotworld/dw-jupyter-contents
Jupyter ContentsManager implementation for data.world
data data-analysis data-science dwstruct-t50-public-projects jupyter jupyter-notebook jupyterlab reference-implementation
Last synced: 22 Jun 2026
https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice
A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.
climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather
Last synced: 02 May 2026
https://github.com/paweljakubas/j-data-analysis
Data analysis using J
data-analysis j machine-learning statistics
Last synced: 23 Jun 2026
https://github.com/ryanfranklin237/data-cleansing
A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells
data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python
Last synced: 23 Jun 2026
https://github.com/ac12644/fractz-ai-data-analyst
Analyze data and gain insights instantly with FRACTZ's AI Data Analyst. Flexible, fast analytics tailored to your needs.
ai data-analysis data-visualization
Last synced: 01 Feb 2026
https://github.com/rogernet/desafio-profissional-produto-data-driven
Ajudar a formar Analistas de Produto, PMs e Gestores de Negócio capazes de tomar decisões estratégicas baseadas em dados.
data-analysis data-science data-visualization product
Last synced: 23 Jun 2026
https://github.com/abhik1711/material-classification-and-energy-band-prediction---excavate-25
A Two-Stage Machine Learning Pipeline: A Binary Classifier to identify insulators with high accuracy and a Stacking Regressor to predict precise band gap values for insulators by leveraging advanced feature engineering techniques and ensemble learning methods
data-analysis machine-learning python
Last synced: 23 Jun 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/heiderjeffer/evaluating-rule-offsetting-schemes-for-sustainable-policy-growth-in-modern-democracies
Python Java. Research Proposal RP
artificial-intelligence data-analysis data-collection data-merging python qualitative-data-analysis quantitative-analysis statistical-analysis
Last synced: 09 Jun 2026
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/melogabriel/nubank-expenses-analysis
This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.
data-analysis data-visualization googlesheets lookerstudio pandas python
Last synced: 02 May 2026
https://github.com/xuri/excelize-cs
Excelize is a C# port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
agent ai chart csharp data-analysis data-science data-visualization excel excelize formula microsft office ooxml parser spreadsheet xlsm xlsx
Last synced: 03 Mar 2026
https://github.com/mayankyadav23/air-bnb-data-analysis
Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.
advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt
Last synced: 19 Mar 2026
https://github.com/mauriceling/sipy
Python-Based Statistical Graphical User Interface for Python
data-analysis julia julia-language jupyter jupyter-kernels pandas pandas-python python python3 r r-packages r-project r-stats scikit-learn scipy scipy-stats statistical-analysis statistical-tests statistics
Last synced: 15 Apr 2026
https://github.com/infinitode/duplipy
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.
ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting
Last synced: 28 Jun 2026
https://github.com/ganeshkumartk/ncov-2019
[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019
corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus
Last synced: 05 Jun 2026
https://github.com/samruddhi3012/customer-behavior-analysis
Hello there! This repo contains python project based on E-Commerce Customer Behavior analysis.
customer-segmentation customerbehavior data-analysis ecommerce python
Last synced: 02 May 2026
https://github.com/fdtomasi/regain-applications
Containers for notebooks and data where REGAIN has been used.
algorithms data-analysis latent-variable-models machine-learning minimization network-inference regain sklearn time-series
Last synced: 16 Apr 2026
https://github.com/gaurav-van/house_price_predictor_streamlit_web_app
Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project
data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit
Last synced: 02 May 2026
https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel
Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.
business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database
Last synced: 05 Jun 2026
https://github.com/amruthadevops/suspicious_web_threat_interactions
To detect and analyze patterns in web interactions for identifying suspicious or potentially harmful activities
cyber-security data-analysis data-science data-visualization jupyter-notebook machine-learning powerbi python
Last synced: 29 Jun 2026
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 29 Jun 2026
https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration
Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.
data-analysis data-visualization dataframe mysql pandas plotly python streamlit
Last synced: 02 May 2026
https://github.com/aliciagilmatute/analisis-multinivel-bayesiano
Este estudio explora el análisis multinivel desde un enfoque bayesiano para evaluar la variabilidad del rendimiento en matemáticas entre 10 centros educativos
bayesian-statistics cmdstanr data-analysis hierarchical-models multilevel-models rstats rstudio stan
Last synced: 29 Jun 2026
https://github.com/jethronap/jstat-gui
Web-based GUI application for data analysis
data-analysis data-visualization java jstat mongodb
Last synced: 08 May 2026
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 19 Mar 2026
https://github.com/nakshjainsonigara/vba-canteenmanagementsystem
The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.
canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word
Last synced: 30 Jan 2026
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/jofaval/pima-indian-diabetes
Data Analysis and Classification of Pima Indian Women's Diabetes in 1988
data-analysis data-science deep-learning google-colab kaggle logistic-regression machine-learning pima-diabetes-data python scikit-learn xgboost
Last synced: 16 Apr 2026
https://github.com/datavil/framex
A light-weight, dataset obtaining library for fast prototyping, tutorial creation, and experimenting.
data-analysis data-fetching data-science dataframe datasets visualization
Last synced: 06 Jun 2026
https://github.com/uchida16104/healthanalysis
It abstracts the health status of each device from its operational time calculated from RescueTime and analyzes the data.
data-analysis portfolio portfolio-website security security-tool
Last synced: 02 Feb 2026
https://github.com/nfaltir/youtube-channel-analysis
Youtube API channel Analysis using pandas
data-analysis data-science data-visualization google webscraping youtube youtube-api
Last synced: 02 May 2026
https://github.com/chandansoren/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 06 Jun 2026
https://github.com/kinshuk-code-1729/data-visualisation-using-python
This Repository consists of several python snippets for creating Two-Dimensional (2D) Graphics
data-analysis data-science data-visualization matplotlib visualization
Last synced: 02 Jun 2026
https://github.com/mr-chang95/loan_data_visualization
Data Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook loans python udacity-data-analyst-nanodegree
Last synced: 24 Apr 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/savinrazvan/degrees
A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.
actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon
Last synced: 24 Apr 2026
https://github.com/thevinh-ha-1710/rstudio-statistics
This project deeply studies 2 datasets using applied statistics techniques.
applied-statistics data-analysis data-science data-visualization rmarkdown rstudio
Last synced: 31 Jan 2026
https://github.com/athityakumar/btp
btech btp daru data-analysis networkx nlp project python ruby
Last synced: 24 Apr 2026
https://github.com/leosimoes/uerj-tcc-analisador-dados
Trabalho de conclusão de curso (TCC) em Engenharia de Computação. Aplicativo Web para preparação e análise de dados, criação de gráficos e modelos de regressão linear e logistica.
computer-engineer data-analysis data-science data-visualization linear-logistic linear-regression python streamlit
Last synced: 24 Apr 2026
https://github.com/gauthamnairvm/trex-app
Text Refinement EXplorer - An EDA tool for text based data.
data-analysis data-visualization groq-api large-language-models llama3 natural-language-processing text2sql
Last synced: 03 May 2026
https://github.com/keganedwards/housing-prices-exploration
Using machine learning algorithms to explore housing prices
data-analysis data-science python school-project
Last synced: 24 Apr 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/cs-joy/pandasv2.0.3
learn data analysis with pandas
data-analysis pandas pandas-learning
Last synced: 03 May 2026
https://github.com/flyingfathead/neurograph-framework
A versatile tool for visualizing entropy loss in TensorFlow-based neural network training, providing insightful scatter plots with annotations.
data-analysis data-analysis-python data-visualization entropy graph graphs neural-network neural-networks neural-networks-visualization nn python python3 tensorflow tensorflow2 training visualization visualization-tools
Last synced: 24 Apr 2026
https://github.com/mirseo/pandas_learning
pandas_learning
data-analysis data-analysis-python data-science data-visualization numpy numpy-example pandas pd python python-3 python3
Last synced: 03 May 2026
https://github.com/serhatderya/medical_examination_research
This repository contains a research about medical examinations (such as body measurements, results from various blood tests, and lifestyle choices).
catplot data-analysis data-analytics data-cleaning data-preparation data-preprocessing data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations heatmap jupyter-notebook medical preprocessing python research seaborn
Last synced: 24 Apr 2026
https://github.com/vansh-py04/data-extraction-and-text-analysis
The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables that are explained
data-analysis data-extraction data-science nlp nlp-machine-learning python textanalysis webscraping
Last synced: 24 Apr 2026
https://github.com/anandanraju/youtube-data-api-model
The YouTube Analytics API enables you to generate custom reports containing YouTube Analytics data. The API supports reports for channels and for content owners. Report fields are characterized as either dimensions or metrics
analytics data-analysis data-science metrics model python telemetry youtube youtube-api
Last synced: 03 May 2026
https://github.com/steno-aarhus/legliv
Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study
cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank
Last synced: 03 Mar 2026
https://github.com/anastasius21/imdb-movie-analysis
Analysis of IMDb's Top 1000 Movies dataset using Pandas, Matplotlib, and Seaborn. It provides visualizations and insights into various aspects of movies, such as ratings, genres, directors, and release years.
data-analysis data-exploration data-science data-visualization imdb imdb-dataset jupyter-notebook python
Last synced: 25 Apr 2026
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 03 May 2026