https://github.com/weiaicunzai/pytorch-yolo
Pytorch Implementation of yolov1
https://github.com/weiaicunzai/pytorch-yolo
pytorch yolov1
Last synced: 2 months ago
JSON representation
Pytorch Implementation of yolov1
- Host: GitHub
- URL: https://github.com/weiaicunzai/pytorch-yolo
- Owner: weiaicunzai
- Created: 2017-02-19T02:34:11.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-08-05T13:01:54.000Z (almost 7 years ago)
- Last Synced: 2025-02-04T19:39:06.830Z (4 months ago)
- Topics: pytorch, yolov1
- Language: Python
- Homepage:
- Size: 95.7 KB
- Stars: 1
- Watchers: 4
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Under Construction........
# pytorch implementation of yolov1
*here is the [paper](https://arxiv.org/abs/1506.02640v5)*
*write this code for learning purpose, learned from [this](https://github.com/xiongzihua/pytorch-YOLO-v1) repository*
*network architecture will be like [this](https://github.com/pjreddie/darknet/blob/master/cfg/yolov1.cfg)*
## Abstruct
Pytorch implementation of yolov1
## implementation details
- Image preprocessing:
I don't use mean substracting and std deviation as the preprocessing tricks,
because not all the case we can know the mean and std of a dataset, for example,
a live camera video flow.Yolo source code and the paper both suggests that no
mean subtracting and std deviation applied:The source code in [yolov2](https://github.com/pjreddie/darknet/blob/d3828827e70b293a3045a1eb80bfb4026095b87b/examples/yolo.c):
```c
load_args args = {0};
args.w = net->w;
args.h = net->h;
args.paths = paths;
args.n = imgs;
args.m = plist->size;
args.classes = classes;
args.jitter = jitter;
args.num_boxes = side;
args.d = &buffer;
args.type = REGION_DATA;args.angle = net->angle;
args.exposure = net->exposure;
args.saturation = net->saturation;
args.hue = net->hue;
```In [yolov2](https://arxiv.org/abs/1612.08242v1) paper:
```
During training we use standard data augmentation tricks including random
crops, rotations, and hue, saturation, and exposure shifts.
```- Image Channel
I'm using OpenCV as my image reading library, so the RGB order is BGR,
If you want to use my code to predict image, or videos, please change
your channel order to BGR- Compatibility
I use Python3.5 to write the code, I've already tried my best to maintain
the compatibility as possible, e.g. the result int / int is float in Python3.5
I tried my best to avoid using / sign, but I Can't make any promises about
the compatibility. Please use Python3.5!