An open API service indexing awesome lists of open source software.

https://github.com/awesomelistsio/awesome-computer-vision

A curated list of awesome libraries, frameworks, tools, datasets, and research papers in computer vision, covering topics such as object detection, image segmentation, 3D vision, and more.
https://github.com/awesomelistsio/awesome-computer-vision

List: awesome-computer-vision

awesome awesome-list awesome-lists computer-vision

Last synced: 4 months ago
JSON representation

A curated list of awesome libraries, frameworks, tools, datasets, and research papers in computer vision, covering topics such as object detection, image segmentation, 3D vision, and more.

Awesome Lists containing this project

README

        

# Awesome Computer Vision [![Awesome Lists](https://srv-cdn.himpfen.io/badges/awesome-lists/awesomelists-flat.svg)](https://github.com/awesomelistsio/awesome)

[![Buy Me A Coffee](https://srv-cdn.himpfen.io/badges/buymeacoffee/buymeacoffee-flat.svg)](https://tinyurl.com/2h9aktmd)   [![Ko-Fi](https://srv-cdn.himpfen.io/badges/kofi/kofi-flat.svg)](https://tinyurl.com/d4xnrptz)   [![PayPal](https://srv-cdn.himpfen.io/badges/paypal/paypal-flat.svg)](https://tinyurl.com/mr22naua)   [![Stripe](https://srv-cdn.himpfen.io/badges/stripe/stripe-flat.svg)](https://tinyurl.com/e8ymxdw3)

> A curated list of awesome libraries, frameworks, tools, datasets, and research papers in computer vision, covering topics such as object detection, image segmentation, 3D vision, and more.

## Contents

- [Libraries and Frameworks](#libraries-and-frameworks)
- [Tools and Applications](#tools-and-applications)
- [Object Detection](#object-detection)
- [Image Segmentation](#image-segmentation)
- [3D Computer Vision](#3d-computer-vision)
- [Face Recognition](#face-recognition)
- [Datasets](#datasets)
- [Research Papers](#research-papers)
- [Learning Resources](#learning-resources)
- [Books](#books)
- [Community](#community)
- [Contribute](#contribute)
- [License](#license)

## Libraries and Frameworks

- [OpenCV](https://opencv.org/) - An open-source library providing computer vision and machine learning algorithms for image and video analysis.
- [Detectron2](https://github.com/facebookresearch/detectron2) - A high-performance object detection library developed by Facebook AI Research.
- [Dlib](http://dlib.net/) - A modern C++ toolkit with machine learning algorithms and tools for computer vision.
- [Scikit-Image](https://scikit-image.org/) - A Python library for image processing, built on top of SciPy.
- [Mediapipe](https://mediapipe.dev/) - A cross-platform framework by Google for building multimodal ML solutions, including face detection and pose estimation.
- [DeepLabV3](https://github.com/tensorflow/models/tree/master/research/deeplab) - A deep learning model for semantic image segmentation.
- [YOLO (You Only Look Once)](https://github.com/AlexeyAB/darknet) - A real-time object detection system.
- [MMDetection](https://github.com/open-mmlab/mmdetection) - An open-source object detection toolbox based on PyTorch.

## Tools and Applications

- [LabelImg](https://github.com/tzutalin/labelImg) - An open-source image annotation tool for labeling datasets.
- [OpenPose](https://github.com/CMU-Perceptual-Computing-Lab/openpose) - A real-time multi-person detection library for human pose estimation.
- [VGG Image Annotator (VIA)](https://www.robots.ox.ac.uk/~vgg/software/via/) - A lightweight tool for manual image annotation.
- [DeepFaceLab](https://github.com/iperov/DeepFaceLab) - A tool for creating deepfakes using face swapping.
- [ImageAI](https://github.com/OlafenwaMoses/ImageAI) - A Python library built to empower developers to build applications using deep learning for computer vision.

## Object Detection

- [Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (2015)](https://arxiv.org/abs/1506.01497) - The paper introducing Faster R-CNN for object detection.
- [SSD: Single Shot MultiBox Detector (2016)](https://arxiv.org/abs/1512.02325) - A method for real-time object detection.
- [YOLO: You Only Look Once - Unified, Real-Time Object Detection (2016)](https://arxiv.org/abs/1506.02640) - A fast and accurate object detection algorithm.
- [EfficientDet: Scalable and Efficient Object Detection (2020)](https://arxiv.org/abs/1911.09070) - A model architecture focused on balancing accuracy and efficiency.

## Image Segmentation

- [U-Net: Convolutional Networks for Biomedical Image Segmentation (2015)](https://arxiv.org/abs/1505.04597) - A convolutional neural network designed for biomedical image segmentation.
- [DeepLabV3+: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation (2018)](https://arxiv.org/abs/1802.02611) - An advanced model for semantic segmentation.
- [Mask R-CNN (2017)](https://arxiv.org/abs/1703.06870) - An extension of Faster R-CNN for instance segmentation.
- [PSPNet: Pyramid Scene Parsing Network (2017)](https://arxiv.org/abs/1612.01105) - A semantic segmentation model using a pyramid pooling module.

## 3D Computer Vision

- [PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation (2017)](https://arxiv.org/abs/1612.00593) - A deep learning model for 3D point cloud processing.
- [NeRF: Neural Radiance Fields for View Synthesis (2020)](https://arxiv.org/abs/2003.08934) - A model for representing 3D scenes using neural networks.
- [Open3D](http://www.open3d.org/) - An open-source library for 3D data processing and visualization.
- [Colmap](https://colmap.github.io/) - A general-purpose Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipeline.

## Face Recognition

- [FaceNet: A Unified Embedding for Face Recognition and Clustering (2015)](https://arxiv.org/abs/1503.03832) - A model for face recognition using deep learning.
- [DeepFace: Closing the Gap to Human-Level Performance in Face Verification (2014)](https://www.cs.toronto.edu/~ranzato/publications/taigman_cvpr14.pdf) - A method for facial recognition developed by Facebook.
- [OpenFace](https://cmusatyalab.github.io/openface/) - An open-source deep learning model for face recognition.
- [DeepFaceLab](https://github.com/iperov/DeepFaceLab) - The leading software for creating deepfakes.

## Datasets

- [ImageNet](https://www.image-net.org/) - A large-scale image dataset used for image classification and object detection.
- [COCO (Common Objects in Context)](https://cocodataset.org/) - A dataset for object detection, segmentation, and captioning tasks.
- [PASCAL VOC](http://host.robots.ox.ac.uk/pascal/VOC/) - A dataset for visual object category recognition and detection.
- [CelebA](https://mmlab.ie.cuhk.edu.hk/projects/CelebA.html) - A large-scale face attributes dataset.
- [KITTI](http://www.cvlibs.net/datasets/kitti/) - A dataset for autonomous driving research.

## Research Papers

- [Visual Attention Mechanisms (2015)](https://arxiv.org/abs/1409.0473) - Introduction of visual attention mechanisms in neural networks.
- [Self-Supervised Learning for Visual Representation (2020)](https://arxiv.org/abs/2006.10029) - A study on self-supervised learning techniques for computer vision tasks.
- [BigGAN: Generative Adversarial Networks for Large-Scale Image Synthesis (2018)](https://arxiv.org/abs/1809.11096) - A generative model for high-quality image synthesis.

## Learning Resources

- [Stanford CS231n: Convolutional Neural Networks for Visual Recognition](http://cs231n.stanford.edu/) - A popular course on computer vision and convolutional networks.
- [Deep Learning for Computer Vision](https://www.coursera.org/specializations/deep-learning) - Part of Andrew Ng’s deep learning specialization on Coursera.
- [PyImageSearch](https://www.pyimagesearch.com/) - A blog and resource hub for computer vision tutorials.
- [Kaggle: Computer Vision Datasets](https://www.kaggle.com/datasets?tags=13212-computer-vision) - A collection of computer vision datasets on Kaggle.

## Books

- *Deep Learning for Computer Vision* by Rajalingappaa Shanmugamani - A guide to deep learning techniques in computer vision.
- *Computer Vision: Algorithms and Applications* by Richard Szeliski - A comprehensive book on computer vision algorithms.
- *Learning OpenCV* by Gary Bradski and Adrian Kaehler - A practical guide to using the OpenCV library.

## Community

- [Reddit: r/ComputerVision](https://www.reddit.com/r/computervision/) - A subreddit for computer vision discussions.
- [PyImageSearch Community](https://www.pyimagesearch.com/community/) - A forum for discussing computer vision and image processing.
- [CVPR Conference](https://cvpr2024.thecvf.com/) - The IEEE Conference on Computer Vision and Pattern Recognition.
- [Kaggle Computer Vision Forum](https://www.kaggle.com/tags/computer-vision/discussion) - A community for computer vision discussions on Kaggle.

## Contribute

Contributions are welcome!

## License

[![CC0](https://mirrors.creativecommons.org/presskit/buttons/88x31/svg/by-sa.svg)](http://creativecommons.org/licenses/by-sa/4.0/)