https://github.com/iitzco/faced

🚀 😏 Near Real Time CPU Face detection using deep learning
https://github.com/iitzco/faced

computer-vision convolutional-neural-networks deep-learning face-detection fully-convolutional-networks python python-library tensorflow

Last synced: 3 months ago
JSON representation

🚀 😏 Near Real Time CPU Face detection using deep learning

Host: GitHub
URL: https://github.com/iitzco/faced
Owner: iitzco
License: mit
Created: 2018-08-23T15:33:53.000Z (almost 7 years ago)
Default Branch: master
Last Pushed: 2019-12-22T16:35:52.000Z (over 5 years ago)
Last Synced: 2025-03-29T12:08:46.052Z (3 months ago)
Topics: computer-vision, convolutional-neural-networks, deep-learning, face-detection, fully-convolutional-networks, python, python-library, tensorflow
Language: Python
Homepage:
Size: 85.5 MB
Stars: 551
Watchers: 39
Forks: 145
Open Issues: 29
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # *faced*

🚀 😏 CPU (Near) Real Time face detection



  



## How to install

```bash

$ pip install git+https://github.com/iitzco/faced.git

```

> Soon to be available on `PyPI`.

## How to use

### As library

```python

import cv2

from faced import FaceDetector

from faced.utils import annotate_image

face_detector = FaceDetector()

img = cv2.imread(img_path)

rgb_img = cv2.cvtColor(img.copy(), cv2.COLOR_BGR2RGB)

# Receives RGB numpy image (HxWxC) and

# returns (x_center, y_center, width, height, prob) tuples. 

bboxes = face_detector.predict(rgb_img, thresh)

# Use this utils function to annotate the image.

ann_img = annotate_image(img, bboxes)

# Show the image

cv2.imshow('image',ann_img)

cv2.waitKey(0)

cv2.destroyAllWindows()

```

### As command-line program

```bash

# Detection on image saving the output

$ faced --input imgs/demo.png --save

```

or

```bash

# Live webcam detection

$ faced --input webcam

```

or

```bash

# Detection on video with low decision threshold

$ faced --input imgs/demo.mp4 --threshold 0.5

```

See `faced --help` for more information.

## Examples



  

  





  

  





  

  



## Performance

CPU (i5 2015 MBP)          |  GPU (Nvidia TitanXP)

:-------------------------:|:-------------------------:

~5 FPS  | > 70 FPS

## Comparison with Haar Cascades

Haar Cascades are one of the most used face detections models. Here's a comparison with OpenCV's implementation showing *faced* robustness.

*faced*             |  Haar Cascade

:-------------------------:|:-------------------------:

![](examples/demo_yolo.gif)  |  ![](examples/demo_haar.gif)

![](examples/foo-faced.png)  |  ![](examples/foo-haar.png)

![](examples/gino-faced.png)  |  ![](examples/gino-haar.png)

## About *faced*

*faced* is an ensemble of 2 deep neural networks (implemented using **tensorflow**) designed to run at Real Time speed in CPUs.

#### Stage 1:

A custom fully convolutional neural network (FCNN) implementation based on [YOLO](https://pjreddie.com/darknet/yolo/). Takes a 288x288 RGB image and outputs a 9x9 grid where each cell can predict bounding boxes and probability of one face.



  



#### Stage 2:

A custom standard CNN (Convolutions + Fully Connected layers) is used to take a face-containing rectangle and predict the face bounding box. This is a fine-tunning step. (outputs of Stage 1 model is not so accurate by itself, this is a *corrector* step that takes the each bouding box predicted from the previous step to improve bounding box quality.)



  



### Why not just perform transfer learning on trained YOLO (or MobileNet+SSD) ?

Those models were designed to support multiclass detection (~80 classes). Because of this, these networks have to be powerfull enough to capture many different low and high level features that allow them to understand the patterns of very different classes. Powerful in this context means large amount of learnable parameters and hence big networks. These big networks cannot achieve real time performance on CPUs. [1]

This is an overkill for the simple task of just detecting faces. This work is a proof of concept that lighter networks can be designed to perform simpler tasks that do not require relatively large number of features.

[1] Those models cannot perform Real Time on CPU (YOLO at least). Even tiny-yolo version cannot achieve 1 fps on CPU (tested on 2015 MacBook Pro with 2.6 GHz Intel Core i5).

### How was it trained?

Training was done with [WIDER FACE](http://mmlab.ie.cuhk.edu.hk/projects/WIDERFace/) dataset on Nvidia Titan XP GPU.

> If you are interested in the training process and/or data preprocessing, just raise an `issue` and we'll discuss it there.

### How to run on GPU?

Just install `tensorflow-gpu` instead of `tensorflow`.

### Status

🚧 Work in progress 🚧

Models will be improved and uploaded.

**This is not a Production ready system. Use it at your own risk.**

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/iitzco/faced

Awesome Lists containing this project

README