https://github.com/malolm/football-player-detection-with-yolov8

Football player detection YOLOv8 fine-tuning
https://github.com/malolm/football-player-detection-with-yolov8

cuda jupyterlab python3 yolov8-detection

Last synced: 3 months ago
JSON representation

Football player detection YOLOv8 fine-tuning

Host: GitHub
URL: https://github.com/malolm/football-player-detection-with-yolov8
Owner: MaloLM
License: apache-2.0
Created: 2024-08-10T16:22:12.000Z (11 months ago)
Default Branch: main
Last Pushed: 2024-08-14T22:12:37.000Z (11 months ago)
Last Synced: 2025-03-23T17:47:34.191Z (4 months ago)
Topics: cuda, jupyterlab, python3, yolov8-detection
Language: Jupyter Notebook
Homepage:
Size: 42.9 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Football Player Detection with YOLOv8

This project demonstrates the use of Ultralytics YOLOv8 for a specific task: detecting football players in images. The project includes the entire process, from dataset preparation to model testing, including training, validation and exportation.

![demo](./demo_gif.gif)

## Dataset

The dataset used for this project is the [HuggingFace football dataset](https://huggingface.co/datasets/keremberke/football-object-detection). This dataset contains images of football matches, with annotations indicating the location and size of football players and the ball in each image.

## Base model

The pre-trained YOLO model used in this project is YOLOv8m for object detection. However, the project is flexible and allows for the use of any other YOLO model size, as long as it is also for object detection.

The table below summarizes the key characteristics of the YOLOv8m model:

| Model | Size (pixels) | mAP val 50-95 | Speed (CPU ONNX ms) | Speed (A100 TensorRT ms) | Params (M) | FLOPs (B) |
|-------------|---------------|---------------|---------------------|--------------------------|------------|-----------|
| YOLOv8m | 640 | 50.2 | 234.7 | 1.83 | 25.9 | 78.9 |

- Size (pixels): The input image size for the model is 640 pixels.
- mAP val 50-95: The mean average precision (mAP) score of the model on the validation dataset, calculated using intersection over union (IoU) thresholds of 0.5 to 0.95, is 50.2.
- Speed (CPU ONNX ms): The average inference time of the model on a CPU using ONNX runtime is 234.7 milliseconds.
- Speed (A100 TensorRT ms): The average inference time of the model on an NVIDIA A100 GPU using TensorRT is 1.83 milliseconds.
- Params (M): The number of trainable parameters in the model is 25.9 million.
- FLOPs (B): The number of floating point operations required to perform a forward pass through the model is 78.9 billion.

## Environment Recommendations

To ensure the project runs smoothly, it is recommended to use a fresh Python virtual environment (venv) with Python 3.9. The project has been tested on Windows 11 with WSL2, and it is likely to work on Linux distributions as well.

It is recommended to use a base Python venv instead of a conda venv, as the Ultralytics ecosystem works better through pip than conda. This can help to avoid package version issues.

The project uses JupyterLab with notebooks, but it is not recommended to use Ultralytics technologies through a notebook, as many display features such as camera streaming and live inference are not displayable from inside a notebook.

If you prefer to use YOLO through a conda install, you can use the following command to install it from the conda-forge channel:

```
conda install -c conda-forge ultralytics
```

Key Learnings and Future Directions

- **Data Requirements**: The model requires a large amount of data from various perspectives to perform well. Classes with few samples may be poorly detected or even misclassified.
- **Ultralytics YOLOv8 Framework**: I have gained experience in using the Ultralytics YOLOv8 framework, which will be beneficial for future use cases such as image segmentation, pose estimation, object tracking, and image classification, as well as heatmaps.
- **Model Optimization**: The project has motivated me to focus on optimizing CNN model performance to cover most CNN use cases, such as embedded computer vision.
- **Dataset Limitations**: The dataset used in this project is relatively small, which led to the "ball" class having very few annotations. This resulted in sport balls being detected only in the context in which they were annotated. For example, balls are hardly detected inside the cage or under shadows... Data is the model.

## Improvements

- **Expand the Classes**: Adding more classes to the model, such as referee, goal cage, and goal keeper, could improve make the model even more usefull.
- **Increase the Dataset Size**: Adding more annotated data to the dataset could help to improve the model's accuracy and robustness.
- **Use Online Football Videos as Test Set**: Instead of using a separate test dataset, merging the test data into the training dataset and using online football videos as the test set could provide a more realistic evaluation of the model's performance in real-world scenarios. This could also increase the training data size, which is currently limited.

## References

- Ultralytics YOLOv8 Documentation:
- Ultralytics Python Usage Documentation:
- Ultralytics Training Mode Documentation:
- YOLOv8 Tutorial by Ultralytics:
- YOLOv8 Training Tutorial by Ultralytics:
- Ultralytics YOLOv9 Documentation:
- COCO Dataset Guide:
- StackOverflow: YOLOv8 Handling Different Image Sizes:
- Understanding YOLO BBox Format:
- Ultralytics Issue: Adding Custom Classes to YOLOv8:
- Adding Custom Classes to YOLOv8 Tutorial:
- Setting Up Jupyter ML with GPU Support:
- Tips for Best Training Results with YOLOv5:

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/malolm/football-player-detection-with-yolov8

Awesome Lists containing this project

README