# 3D Face Landmark in UnityCG HLSL
### NOTE: This was built and tested with Unity 2019.4.29f1 using the built-in render pipeline; there may be shader compatibility issues with other versions.
## Table of Contents
- [Overview](#overview)
- [Problems](#problems)
- [Setup](#setup)
- [Implementation Details](#implementation-details)
- [Facemesh Shader](#facemesh-shader-outputs)
- [Iris Shader](#iris-shader-outputs)
- [Blendshape Values](#blendshape-values-calculations)
- [Avatar Setup](#avatar-setup)
- [Python, C++ Code](#python-c-code)
- [Facemesh Architecture](#facemesh-architecture)
- [Iris Architecture](#iris-architecture)
- [Resources](#resources)

## Overview
Face landmark detection using two MediaPipe models, Facemesh and Iris, implemented inside Unity with over 120 Custom Render Textures.

Designed for mobile platforms, the two networks are well suited for use in VR. Both networks are capped at 60 FPS in this implementation but can be modified to run faster.
***Figure 1. Facemesh Outputs***
The original Facemesh model has three main outputs: a face flag, 468 3D face points, and 133 2D contour points (edge points of the face, eyes, and mouth). My implementation of Facemesh discards the 133 2D contour outputs to save computation time since I don't use them.

To retrieve the head rotation, I run SVD (singular value decomposition) on three points output by Facemesh to get the rotation matrix. The eye positions are also fed into the Iris tracking model.
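For reference, here is a minimal NumPy sketch of the Kabsch-style rotation fit that the Procrustes Analysis shader approximates with its 3x3 SVD; the function name and point layout are illustrative, not the shader's actual code.

```python
import numpy as np

def rotation_from_points(src, dst):
    """src, dst: (3, 3) arrays, one 3D point per row; returns R with R @ src_i ~ dst_i."""
    src_c = src - src.mean(axis=0)               # center both point sets
    dst_c = dst - dst.mean(axis=0)
    u, _, vt = np.linalg.svd(src_c.T @ dst_c)    # SVD of the cross-covariance matrix
    d = np.sign(np.linalg.det(vt.T @ u.T))       # detect a reflection
    return vt.T @ np.diag([1.0, 1.0, d]) @ u.T   # force a proper rotation, det = +1
```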
***Figure 2. Iris Outputs***
The Iris model has the same two outputs as the original: 71 3D brow and contour points, and 5 3D iris points.

Right-eye inputs must be flipped horizontally before inference and the outputs flipped back afterwards.

At the final step, a shader takes the Facemesh and Iris model outputs, calculates blendshape values from key points, and smooths the results.
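As a rough illustration of the smoothing step, the sketch below averages the current frame with a few previous ones, mirroring how the CRTs keep rows of frame history; the actual filter lives in the shaders and may differ.

```python
from collections import deque
import numpy as np

class FrameSmoother:
    """Moving average over the current frame and up to 7 previous frames."""
    def __init__(self, history=8):
        self.frames = deque(maxlen=history)      # ring buffer of recent frames

    def smooth(self, values):
        self.frames.append(np.asarray(values, dtype=np.float32))
        return np.mean(np.stack(self.frames), axis=0)
```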
## Problems
- No face detection, meaning the user must keep their face centered in the input camera's view.
- SVD (singular value decomposition) is not accurate; there aren't any SVD libraries written for HLSL. The only one I found was UnitySVDComputeShader, and it only solves 3x3 matrices, so I can only use 3 points from Facemesh to solve for the rotation.
- Slow implementation; it only runs in real time because the networks were designed for mobile devices.
- Iris tracking also requires the eyes to be in the very center. Without a good SVD method, it's harder to undo the rotations and translations of the eyes accurately.

## Setup
1. Download the latest package in [Release](https://github.com/SCRN-VRC/3D-Face-Landmark-in-UnityCG-HLSL/releases)
2. Import the .unitypackage into a Unity project.
3. Open the prebuilt scene or drop in a prefab.
4. Enter Play Mode to run the network.

## Implementation Details
### Facemesh Shader Outputs
Location: `.../FaceLandmarkDetection/CRTs/FaceMesh`
* **CRT: L20**
* Format: R Float
* Size: 39 x 36
* 13 XYZ points per row, 36 rows, 468 points total
* These are the unmodified outputs of the 3D Facemesh points (see the packing sketch at the end of this section)
* **CRT: L20_RR**
* Format: ARGB Float
* Size: 13 x 36
* XYZ stored in RGB, 13 per row, 36 rows, 468 total
* RR stands for reversed rotation, this CRT attempts to undo the head rotations for more accurate measurements during blend shape calculations
* **CRT: L24**
* Format: R Float
* Size: 1 x 1
* Face flag output, 1 is face, 0 is not a face
* **CRT: Procrustes Analysis**
* Format: ARGB Float
* Size: 8 x 8
* This shader calculates the rotations of the head using SVD
* Row 0 stores the variables, rows 1-7 store the previous frames for smoothing calculations

Name | Location (X, Y)
-----|---------
Rotation Matrix Top | 0, 0
Rotation Matrix Middle | 1, 0
Rotation Matrix Bottom | 2, 0
Scale and Frobenius norm | 3, 0
Target Centroid | 4, 0
Source Centroid | 5, 0
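
As a concrete reading of the L20 packing described above, here is a hypothetical NumPy sketch; `texels` and `read_point` are illustrative stand-ins for sampling the CRT.

```python
import numpy as np

def read_point(texels, i):
    """texels: (36, 39) float array standing in for the L20 CRT; i in [0, 468)."""
    row, col = divmod(i, 13)         # 13 points per row, 36 rows
    x0 = col * 3                     # each point spans 3 consecutive R-floats
    return texels[row, x0:x0 + 3]    # (x, y, z)
```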
### Iris Shader Outputs
Location: `.../FaceLandmarkDetection/CRTs/Iris`

*One iris network is used for both eyes. To keep track of which output is left or right, each layer of the network adds 10000000.0 to output location (0, 0) if it's the right eye.*

*Right-eye inputs are flipped horizontally.*
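Below is a hypothetical NumPy sketch of this left/right bookkeeping, with `RIGHT_EYE_MARK` standing in for the 10000000.0 sentinel; the real logic runs inside the CRT layers.

```python
import numpy as np

RIGHT_EYE_MARK = 10000000.0   # sentinel added at (0, 0) on right-eye passes

def prepare_eye(crop, is_right):
    # Mirror right-eye crops so one network serves both eyes.
    return crop[:, ::-1] if is_right else crop

def tag_output(layer_out, is_right):
    if is_right:
        layer_out[0, 0] += RIGHT_EYE_MARK   # mark this output as right-eye
    return layer_out

def is_right_eye(layer_out):
    # Landmark values are tiny next to the sentinel, so a threshold suffices.
    return layer_out[0, 0] >= RIGHT_EYE_MARK * 0.5
```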
* **CRT: iris_L57**
* Format: R Float
* Size: 3 x 71
* The unmodified outputs of the 3D brow and contour points
* Each row contains the XYZ position, 71 points total
* **CRT: iris_L82**
* Format: R Float
* Size: 3 x 5
* Unmodified outputs of the 5 iris tracking points
* Each row contains XYZ position, row 0 is iris center, the other 4 are the surrounding points
* **CRT: LR_Brows**
* Format: ARGB Float
* Size: 8 x 18
* Stores the outputs based on whether it's the right or left eye. XYZ position stored in RGB
* Bottom 4 x 18 contains brows/contour points for the **right eye**
* Top 4 x 18 contains brows/contour points for the **left eye**
* **CRT: LR_Eyes**
* Format: ARGB Float
* Size: 2 x 5
* XYZ position stored in RGB
* Column 0 contains iris points for **right eye**
* Column 1 contains iris points for **left eye**

### Blendshape Values Calculations
Location: `.../FaceLandmarkDetection/CRTs/`

* **CRT: BlendValues**
* Format: ARGB Float
* Size: 8 x 8
* Calculates the blendshape values from the distances between key points, e.g. the mouth open/close value comes from the distance between the top and bottom lip points (sketched after the table below)
* Outputs are unclamped values scaled between 0.0 and 1.0
* Row 0 stores the variables, rows 1-7 store the previous frames for smoothing calculations

Name | Location (X, Y)
-----|---------
Rotation Matrix Top | 0, 0
Rotation Matrix Middle | 1, 0
Rotation Matrix Bottom | 2, 0
Mouth open, shrink, smile | 3, 0
Eyes blink left, right | 4, 0
Brow left up/down, right up/down | 5, 0
Iris left XY, right XY position | 6, 0

The rotation matrix is copied over from the Procrustes Analysis CRT.
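
To illustrate the distance-based calculation, here is a hedged NumPy sketch of a mouth-open value; the landmark indices and the rest/full gap constants are made-up placeholders, not the shader's actual values.

```python
import numpy as np

def mouth_open(points, top_lip=13, bottom_lip=14, rest_gap=0.02, full_gap=0.12):
    """points: (468, 3) rotation-corrected landmarks; indices and scales are placeholders."""
    gap = np.linalg.norm(points[top_lip] - points[bottom_lip])
    # Rescale to roughly 0..1 without clamping, matching the unclamped outputs.
    return (gap - rest_gap) / (full_gap - rest_gap)
```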
## Avatar Setup
**This is more of a tech demo, not made for actual use. Setting this up correctly is a tedious process for an impractical effect. If you wish to continue, you'll need to know how to edit shaders.**
1. Use at least two meshes; the face must be a completely separate mesh from the rest of the body. The face can have no more than 16384 vertices because the blendshapes are baked onto 128 x 128 textures (128 × 128 = 16384 texels, one per vertex).
2. Start creating the blendshapes in Blender:
1. blink right
2. blink left
3. right brow inner down
4. right brow outer down
5. left brow inner down
6. left brow outer down
7. right eye looking in
8. right eye looking out
9. right eye looking up
10. right eye looking down
11. left eye looking in
12. left eye looking out
13. left eye looking up
14. left eye looking down
15. mouth wide open
16. mouth shrink
17. mouth smile
18. mouth frown
3. Save as FBX, import into Unity.
4. Make sure the object with the skinned mesh renderer on it has no rotation or scale.
5. Open up the Bake Blendshapes editor window located under Tools -> SCRN -> Bake Blendshapes in your menu.
6. Hit "Bake" when you're done filling out the blendshapes. The texture should save into `Assets/FaceLandmarkDetection/` as `xxxx_blendBaked.asset`
7. If you want to use the example shaders in the Demo folder, put the FaceTrack material on the face mesh and HeadRotation on everything else.
8. Set the `xxxx_blendBaked.asset` as the Baked Blendshape Texture in the FaceTrack material.
9. Create a mask for HeadRotation; it uses UV0, where 0.0 is no rotation and 1.0 is full rotation. For example, to make the neck gradually follow the rotation, put a gradient on the neck UVs.
10. The example shaders most likely won't fit your needs; you can move the effect into your own shader by copying everything I have between these tags in both shaders into an equivalent **vertex shader**:
```HLSL
// ------------------- FACE TRACKING START ------------------- //...
// ------------------- FACE TRACKING END ------------------- //
```
11. Drop the FaceLandmarkDetection.prefab from the `FaceLandmarkDetection/Demoes/Face Tracking/Prefabs` folder somewhere on your avatar so the CRTs get referenced.
### Python, C++ Code
The purpose of the Python and C++ code is to help me debug intermediate layers of the networks. It runs ***very slowly*** because it has to output every single network layer.
#### Requirements
- C++
- OpenCV 4.0 or above
- Python
- Facemesh
- Python 3.8.x
- Tensorflow 2.3.0
- Iris
- Python 3.8.x
- ONNX 1.9.0

I suggest using a virtual environment and a package manager.
### Facemesh Architecture
[Facemesh model card](https://mediapipe.page.link/facemesh-mc)
### Iris Architecture
[Iris landmark model card](https://mediapipe.page.link/iris-mc)
## Resources
- [MediaPipe Facemesh](https://tfhub.dev/mediapipe/tfjs-model/facemesh/1/default/1)
- [Keijiro's MediaPipe Iris Implementation](https://github.com/keijiro/IrisBarracuda)
- [Procrustes Analysis](https://stackoverflow.com/questions/18925181/procrustes-analysis-with-numpy)
- [UnitySVDComputeShader](https://github.com/vanish87/UnitySVDComputeShader/blob/master/UnitySVDComputeShader/Assets/Math.cginc)
- [CHARLIZE THERON_HEAD](https://sketchfab.com/3d-models/charlize-theron-head-ca8926f4284045c6a2e75a6f7642c851)
- [Bricks Game](https://www.shadertoy.com/view/MddGzf)

Thanks to [Merlin](https://twitter.com/MerlinVR_), [Raliv](https://twitter.com/RalivDev) and [orels1](https://twitter.com/orels1_) for the help.
If you have questions or comments, you can reach me on Discord: SCRN#8008 or Twitter: https://twitter.com/SCRNinVR