Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/Tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform
https://github.com/Tencent/ncnn

android arm-neon artificial-intelligence caffe darknet deep-learning high-preformance inference ios keras mlir mxnet ncnn neural-network onnx pytorch riscv simd tensorflow vulkan

Last synced: about 2 months ago
JSON representation

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Lists

README

        

![ncnn](https://raw.githubusercontent.com/Tencent/ncnn/master/images/256-ncnn.png)

# ncnn

[![License](https://img.shields.io/badge/license-BSD_3_Clause-blue.svg?style=for-the-badge)](LICENSE.txt)
[![Download Total Count](https://img.shields.io/github/downloads/Tencent/ncnn/total.svg?style=for-the-badge)](https://github.com/Tencent/ncnn/releases)
[![codecov](https://img.shields.io/codecov/c/github/Tencent/ncnn/master?style=for-the-badge)](https://codecov.io/gh/Tencent/ncnn)

ncnn is a high-performance neural network inference computing framework optimized for mobile platforms.
ncnn is deeply considerate about deployment and uses on mobile phones from the beginning of design.
ncnn does not have third-party dependencies.
It is cross-platform and runs faster than all known open-source frameworks on mobile phone cpu.
Developers can easily deploy deep learning algorithm models to the mobile platform by using efficient ncnn implementation, creating intelligent APPs, and bringing artificial intelligence to your fingertips.
ncnn is currently being used in many Tencent applications, such as QQ, Qzone, WeChat, Pitu, and so on.

ncnn 是一个为手机端极致优化的高性能神经网络前向计算框架。
ncnn 从设计之初深刻考虑手机端的部署和使用。
无第三方依赖,跨平台,手机端 cpu 的速度快于目前所有已知的开源框架。
基于 ncnn,开发者能够将深度学习算法轻松移植到手机端高效执行,
开发出人工智能 APP,将 AI 带到你的指尖。
ncnn 目前已在腾讯多款应用中使用,如:QQ,Qzone,微信,天天 P 图等。

---

技术交流 QQ 群

637093648 (超多大佬)

答案:卷卷卷卷卷(已满)

Telegram Group

Discord Channel

Pocky QQ 群(MLIR YES!)

677104663 (超多大佬)

答案:multi-level intermediate representation

他们都不知道 pnnx 有多好用群

818998520 (新群!)

---

## Download & Build status

https://github.com/Tencent/ncnn/releases/latest

**[how to build ncnn library](https://github.com/Tencent/ncnn/wiki/how-to-build) on Linux / Windows / macOS / Raspberry Pi3, Pi4 / POWER / Android / NVIDIA Jetson / iOS / WebAssembly / AllWinner D1 / Loongson 2K1000**

Source

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-full-source.zip)

- [Build for Android](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-android)
- [Build for Termux on Android](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-termux-on-android)

Android

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-android-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-android.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Aandroid)

Android shared

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-android-vulkan-shared.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-android-shared.zip)


- [Build for iOS on macOS with xcode](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-ios-on-macos-with-xcode)

iOS

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ios-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ios.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Aios)

iOS-Simulator

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ios-simulator-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ios-simulator.zip)


- [Build for macOS](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-macos)

macOS

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-macos-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-macos.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Amacos)

Mac-Catalyst

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-mac-catalyst-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-mac-catalyst.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Amac-catalyst)

watchOS

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-watchos.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Awatchos)

watchOS-Simulator

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-watchos-simulator.zip)

tvOS

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-tvos-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-tvos.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Atvos)

tvOS-Simulator

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-tvos-simulator-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-tvos-simulator.zip)

visionOS

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-visionos.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Avisionos)

visionOS-Simulator

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-visionos-simulator.zip)

Apple xcframework

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-apple-vulkan.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-apple.zip)

- [Build for Linux / NVIDIA Jetson / Raspberry Pi3, Pi4 / POWER](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-linux)

Ubuntu 20.04

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ubuntu-2004.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ubuntu-2004-shared.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-x64-gpu-gcc)

Ubuntu 22.04

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ubuntu-2204.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-ubuntu-2204-shared.zip)

windows

- [Build for Windows x64 using VS2017](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-windows-x64-using-visual-studio-community-2017)

VS2015

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2015.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2015-shared.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Awindows)

VS2017

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2017.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2017-shared.zip)

VS2019

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2019.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2019-shared.zip)

VS2022

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2022.zip)
[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-windows-vs2022-shared.zip)

- [Build for WebAssembly](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-webassembly)

WebAssembly

[](https://github.com/Tencent/ncnn/releases/latest/download/ncnn-20240410-webassembly.zip)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Aweb-assembly)

- [Build for ARM Cortex-A family with cross-compiling](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-arm-cortex-a-family-with-cross-compiling)
- [Build for Hisilicon platform with cross-compiling](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-hisilicon-platform-with-cross-compiling)
- [Build for AllWinner D1](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-allwinner-d1)
- [Build for Loongson 2K1000](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-loongson-2k1000)
- [Build for QNX](https://github.com/Tencent/ncnn/wiki/how-to-build#build-for-qnx)

Linux (arm)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-arm-cpu-gcc)

Linux (aarch64)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-aarch64-cpu-gcc)

Linux (mips)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-mips-cpu-gcc)

Linux (mips64)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-mips64-cpu-gcc)

Linux (ppc64)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-ppc64-cpu-gcc)

Linux (riscv64)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-riscv64-cpu-gcc)

Linux (loongarch64)

[](https://github.com/Tencent/ncnn/actions?query=workflow%3Alinux-loongarch64-cpu-gcc)

---

## Support most commonly used CNN network

## 支持大部分常用的 CNN 网络

- Classical CNN:
[VGG](https://github.com/BVLC/caffe/wiki/Model-Zoo#models-used-by-the-vgg-team-in-ilsvrc-2014)
[AlexNet](https://github.com/BVLC/caffe/tree/9b891540183ddc834a02b2bd81b31afae71b2153/models/bvlc_alexnet)
[GoogleNet](https://github.com/BVLC/caffe/tree/9b891540183ddc834a02b2bd81b31afae71b2153/models/bvlc_googlenet)
Inception
...
- Practical CNN:
[ResNet](https://github.com/tornadomeet/ResNet)
[DenseNet](https://github.com/liuzhuang13/DenseNet)
[SENet](https://github.com/hujie-frank/SENet)
[FPN](https://github.com/unsky/FPN)
...
- Light-weight CNN:
[SqueezeNet](https://github.com/forresti/SqueezeNet)
[MobileNetV1](https://github.com/tensorflow/models/blob/master/research/slim/nets/mobilenet_v1.md)
[MobileNetV2/V3](https://github.com/tensorflow/models/blob/master/research/slim/nets/mobilenet/README.md)
[ShuffleNetV1](https://github.com/farmingyard/ShuffleNet)
[ShuffleNetV2](https://github.com/opconty/keras-shufflenetV2)
[MNasNet](https://github.com/tensorflow/models/tree/master/research/slim/nets/nasnet)
...
- Face Detection:
[MTCNN](https://github.com/ipazc/mtcnn)
[RetinaFace](https://github.com/biubug6/Pytorch_Retinaface)
[scrfd](https://github.com/nihui/ncnn-android-scrfd)
...
- Detection:
[VGG-SSD](https://github.com/lzx1413/CAFFE_SSD)
[MobileNet-SSD](https://github.com/chuanqi305/MobileNet-SSD)
[SqueezeNet-SSD](https://github.com/chuanqi305/SqueezeNet-SSD)
[MobileNetV2-SSDLite](https://github.com/chuanqi305/MobileNetv2-SSDLite)
[MobileNetV3-SSDLite](https://github.com/XiaoyuHuang96/MobilenetV3SSDLite-tfkeras)
...
- Detection:
[Faster-RCNN](https://github.com/rbgirshick/py-faster-rcnn)
[R-FCN](https://github.com/daijifeng001/R-FCN)
...
- Detection:
[YOLOv2](https://github.com/longcw/yolo2-pytorch)
[YOLOv3](https://github.com/ultralytics/yolov3)
[MobileNet-YOLOv3](https://github.com/eric612/MobileNet-YOLO)
[YOLOv4](https://github.com/Tianxiaomo/pytorch-YOLOv4)
[YOLOv5](https://github.com/ultralytics/yolov5)
[YOLOv7](https://github.com/WongKinYiu/yolov7)
[YOLOX](https://github.com/Megvii-BaseDetection/YOLOX)
...
- Detection:
[NanoDet](https://github.com/RangiLyu/nanodet)
- Segmentation:
[FCN](https://github.com/unsky/FPN)
[PSPNet](https://github.com/hszhao/PSPNet)
[UNet](https://github.com/zhixuhao/unet)
[YOLACT](https://github.com/dbolya/yolact)
...
- Pose Estimation:
[SimplePose](https://github.com/dog-qiuqiu/Ultralight-SimplePose)
...

---

## HowTo

**[use ncnn with alexnet](https://github.com/Tencent/ncnn/wiki/use-ncnn-with-alexnet) with detailed steps, recommended for beginners :)**

**[ncnn 组件使用指北 alexnet](https://github.com/Tencent/ncnn/wiki/use-ncnn-with-alexnet.zh) 附带详细步骤,新人强烈推荐 :)**

**[use netron for ncnn model visualization](https://netron.app)**

**[out-of-the-box web model conversion](https://convertmodel.com/#outputFormat=ncnn)**

[ncnn low-level operation api](https://github.com/Tencent/ncnn/wiki/low-level-operation-api)

[ncnn param and model file spec](https://github.com/Tencent/ncnn/wiki/param-and-model-file-structure)

[ncnn operation param weight table](https://github.com/Tencent/ncnn/wiki/operation-param-weight-table)

[how to implement custom layer step by step](https://github.com/Tencent/ncnn/wiki/how-to-implement-custom-layer-step-by-step)

---

## FAQ

**[ncnn throw error](https://github.com/Tencent/ncnn/wiki/FAQ-ncnn-throw-error)**

**[ncnn produce wrong result](https://github.com/Tencent/ncnn/wiki/FAQ-ncnn-produce-wrong-result)**

**[ncnn vulkan](https://github.com/Tencent/ncnn/wiki/FAQ-ncnn-vulkan)**

---

## Features

- Supports convolutional neural networks, supports multiple input and multi-branch structure, can calculate part of the branch
- No third-party library dependencies, does not rely on BLAS / NNPACK or any other computing framework
- Pure C++ implementation, cross-platform, supports Android, iOS and so on
- ARM NEON assembly level of careful optimization, calculation speed is extremely high
- Sophisticated memory management and data structure design, very low memory footprint
- Supports multi-core parallel computing acceleration, ARM big.LITTLE CPU scheduling optimization
- Supports GPU acceleration via the next-generation low-overhead Vulkan API
- Extensible model design, supports 8bit quantization and half-precision floating point storage, can import caffe/pytorch/mxnet/onnx/darknet/keras/tensorflow(mlir) models
- Support direct memory zero copy reference load network model
- Can be registered with custom layer implementation and extended
- Well, it is strong, not afraid of being stuffed with 卷 QvQ

## 功能概述

- 支持卷积神经网络,支持多输入和多分支结构,可计算部分分支
- 无任何第三方库依赖,不依赖 BLAS/NNPACK 等计算框架
- 纯 C++ 实现,跨平台,支持 Android / iOS 等
- ARM Neon 汇编级良心优化,计算速度极快
- 精细的内存管理和数据结构设计,内存占用极低
- 支持多核并行计算加速,ARM big.LITTLE CPU 调度优化
- 支持基于全新低消耗的 Vulkan API GPU 加速
- 可扩展的模型设计,支持 8bit [量化](tools/quantize) 和半精度浮点存储,可导入 caffe/pytorch/mxnet/onnx/darknet/keras/tensorflow(mlir) 模型
- 支持直接内存零拷贝引用加载网络模型
- 可注册自定义层实现并扩展
- 恩,很强就是了,不怕被塞卷 QvQ

---

## supported platform matrix

- ✅ = known work and runs fast with good optimization
- ✔️ = known work, but speed may not be fast enough
- ❔ = shall work, not confirmed
- / = not applied

| | Windows | Linux | Android | macOS | iOS |
| ---------- | ------- | ----- | ------- | ----- | --- |
| intel-cpu | ✔️ | ✔️ | ❔ | ✔️ | / |
| intel-gpu | ✔️ | ✔️ | ❔ | ❔ | / |
| amd-cpu | ✔️ | ✔️ | ❔ | ✔️ | / |
| amd-gpu | ✔️ | ✔️ | ❔ | ❔ | / |
| nvidia-gpu | ✔️ | ✔️ | ❔ | ❔ | / |
| qcom-cpu | ❔ | ✔️ | ✅ | / | / |
| qcom-gpu | ❔ | ✔️ | ✔️ | / | / |
| arm-cpu | ❔ | ❔ | ✅ | / | / |
| arm-gpu | ❔ | ❔ | ✔️ | / | / |
| apple-cpu | / | / | / | ✔️ | ✅ |
| apple-gpu | / | / | / | ✔️ | ✔️ |
| ibm-cpu | / | ✔️ | / | / | / |

---

## Project examples

-
-
-
-
-
-
- 🤩
-


-

- Call ncnn from Fortran

- Use ncnn for real-time speech
recognition (i.e., speech-to-text); also support embedded devices and provide
mobile Apps (e.g., Android App)

---

## License

[BSD 3 Clause](LICENSE.txt)