https://github.com/aastopher/mma_outcome

Simple exploratory analysis of UFC Fights and Vegas fight odds from 1993 to 2021
https://github.com/aastopher/mma_outcome

data-analysis data-visualization

Last synced: 12 months ago
JSON representation

Simple exploratory analysis of UFC Fights and Vegas fight odds from 1993 to 2021

Host: GitHub
URL: https://github.com/aastopher/mma_outcome
Owner: aastopher
Created: 2021-08-30T21:23:05.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2022-07-18T06:51:23.000Z (over 3 years ago)
Last Synced: 2023-04-05T01:48:13.671Z (almost 3 years ago)
Topics: data-analysis, data-visualization
Language: Python
Homepage:
Size: 42.9 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# MMA Simple Analysis

## Datasets

2 datasets are used, both of which are pulled from Kaggle.com. These datasets provide characteristics about UFC fighters (height, reach, etc.) and betting odds data for individual fights.
* [MMA Fighter Dataset](https://www.kaggle.com/rajeevw/ufcdata)
* [MMA Odds Dataset](https://www.kaggle.com/mdabbert/ufc-fights-2010-2020-with-betting-odds)

## Analysis Outline

For this analysis, we will look into (1) to what degree fighter attributes (height and reach) contribute to match outcome, (2) to what degree do fight these attributes affect different groups (weightclass and gender), and finally (3) to what extent do Vegas odds follow fighter reach?

## Analysis and Conclusion

Basic exploration of the data sets reveal that longer reach and taller height contribute to a slightly higher win percentage. Furthermore, there exists a noticeable difference in the odds distribution between red fighter and blue fighter. Through a scatter plot, we can see that odds favor the red fighter. This can be explained by how the corners are chosen. the colors are seeded as follow Red fighter is the champion or the veteran fighter; blue fighter is the contender or underdog. Lastly, by looking into the mean odds by fighter reach, we can interpret the relationship as follows:
* For the fighter with a reach advantage (i.e. a longer reach), as fighter reach increases, the odds increasingly favor the fighter with a reach advantage
* For the fighter with a reach disadvantage (i.e. a shorter reach), as fighter reach increases, the odds increasingly disfavor the fighter with a reach disadvantage

## Dependencies

Running the project will require the following packages:
* numpy
* pandas
* matplotlib

## Running the Project

4 optional flags are available:
* `-v` or `--verbose` Adds verbose logging for fined-grained program logging
* `-o` or `--output` Exports datasets, individual and combined, to CSV
* `-p` or `--prefix` Adds a prefix to all non-essential exported data with simple string
* `-d` or `--dark` Plots output with a dark-mode theme

1 positional arguments required: `command`. Command accepts 1 of 3 options:
* `explore` Plots and outputs data for each individual dataset and analyses
* `analyze` Plots and outputs data for the combined dataset and analyses
* `deep` Plots and outputs data for both individual and combined datasets and analyses

Running the program

`python main.py `

Running unit tests with included test runner

`python3 unit_tests/test_main.py`

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/aastopher/mma_outcome

Awesome Lists containing this project

README