Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/amoshnin/ml-augmented.reality.cutpaster
- Host: GitHub
- URL: https://github.com/amoshnin/ml-augmented.reality.cutpaster
- Owner: amoshnin
- License: mit
- Created: 2022-01-17T17:56:52.000Z (almost 3 years ago)
- Default Branch: master
- Last Pushed: 2022-01-17T17:57:59.000Z (almost 3 years ago)
- Last Synced: 2024-05-21T05:56:37.301Z (6 months ago)
- Language: TypeScript
- Size: 91.8 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# AR Cut & Paste
An AR+ML prototype that lets you cut elements from your surroundings and paste them into image editing software.
Only Photoshop is supported currently; other outputs may be supported in the future.
Demo and more info: [Thread](https://twitter.com/cyrildiagne/status/1256916982764646402)
⚠️ This is a research prototype, not a consumer or Photoshop user tool.
**Update 2020.05.11:** If you're looking for an easy-to-use app based on this research, head over to https://clipdrop.co
## Modules
This prototype runs as 3 independent modules:
- **The mobile app**
  - Check out the [/app](/app) folder for instructions on how to deploy the app to your mobile device.
- **The local server**
  - The interface between the mobile app and Photoshop.
  - It finds the position pointed at on screen by the camera using [screenpoint](https://github.com/cyrildiagne/screenpoint).
  - Check out the [/server](/server) folder for instructions on configuring the local server.
- **The object detection / background removal service**
  - For now, salient object detection and background removal are delegated to an external service.
  - It would be a lot simpler to use something like [DeepLab](https://github.com/shaqian/tflite-react-native) directly within the mobile app, but that hasn't been implemented in this repo yet.
## Usage
### 1 - Configure Photoshop
- Go to "Preferences > Plug-ins", enable "Remote Connection" and set a friendly password that you'll need later.
- Make sure that your PS document settings match those in `server/src/ps.py`, otherwise only an empty layer will be pasted.
- Also make sure that your document has some sort of background. If the background is blank, SIFT will probably not have enough features to find a correct match.
### 2 - Set up the external salient object detection service
#### Option 1: Set up your own model service (requires a CUDA GPU)
- As mentioned above, for the time being you must deploy the BASNet model (Qin et al., CVPR 2019) as an external HTTP service using this [BASNet-HTTP wrapper](https://github.com/cyrildiagne/basnet-http).
- You will need the deployed service URL to configure the local server.
- Make sure to configure a different port if you're running BASNet on the same computer as the local server.
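If both services run on one machine, a quick check like the one below can help pick a non-conflicting port before starting BASNet. This is a minimal sketch using only the Python standard library; the helper names `is_port_free` and `first_free_port` and the port numbers are hypothetical and not part of this repo or of the BASNet-HTTP wrapper.

```python
from __future__ import annotations

import socket


def is_port_free(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if a TCP socket can currently be bound to host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use": something else holds the port.
            return False
    return True


def first_free_port(start: int, taken: set[int], host: str = "127.0.0.1") -> int:
    """Return the first port >= start that is not in `taken` and can be bound."""
    for port in range(start, 65536):
        if port not in taken and is_port_free(port, host):
            return port
    raise RuntimeError("no free port found")


if __name__ == "__main__":
    # Hypothetical values: this README does not state either service's port.
    local_server_port = 8080
    basnet_port = first_free_port(8081, taken={local_server_port})
    print(f"local server: {local_server_port}, BASNet service: {basnet_port}")
```

The check is best-effort: another process could still grab the port between the check and the moment the service starts, so treat the result as a suggestion rather than a reservation.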
#### Option 2: Use a community-provided endpoint
A public endpoint has been provided by members of the community. This is useful if you don't have your own CUDA GPU or do not want to go through the process of running the service on your own.
Use this endpoint by launching the local server with `--basnet_service_ip http://u2net-predictor.tenant-compass.global.coreweave.com`
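Note that, despite its name, the `--basnet_service_ip` flag is given a full URL including the `http://` scheme. The sketch below illustrates how such a flag could be parsed and validated with `argparse`; only the flag name and the endpoint URL come from this README, while `service_url` and `build_parser` are hypothetical helpers and not the repo's actual server code.

```python
import argparse
from urllib.parse import urlparse


def service_url(value: str) -> str:
    """Accept only http(s) URLs with a host; strip any trailing slash."""
    parsed = urlparse(value)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise argparse.ArgumentTypeError(
            f"expected a URL such as http://host[:port], got {value!r}"
        )
    return value.rstrip("/")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the single flag discussed above (illustrative only)."""
    parser = argparse.ArgumentParser(description="Local server (sketch)")
    parser.add_argument(
        "--basnet_service_ip",
        type=service_url,
        required=True,
        help="URL of the salient object detection service",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["--basnet_service_ip",
         "http://u2net-predictor.tenant-compass.global.coreweave.com"]
    )
    print(args.basnet_service_ip)
```

Validating the value up front means a bare hostname or IP without a scheme is rejected when the server starts, rather than failing later on the first request.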
### 3 - Configure and run the local server
- Follow the instructions in [/server](/server) to set up and run the local server.
### 4 - Configure and run the mobile app
- Follow the instructions in [/app](/app) to set up and deploy the mobile app.
## Thanks and Acknowledgements
- [BASNet code](https://github.com/NathanUA/BASNet) for '[*BASNet: Boundary-Aware Salient Object Detection*](http://openaccess.thecvf.com/content_CVPR_2019/html/Qin_BASNet_Boundary-Aware_Salient_Object_Detection_CVPR_2019_paper.html)' by [Xuebin Qin](https://webdocs.cs.ualberta.ca/~xuebin/), [Zichen Zhang](https://webdocs.cs.ualberta.ca/~zichen2/), [Chenyang Huang](https://chenyangh.com/), [Chao Gao](https://cgao3.github.io/), [Masood Dehghan](https://sites.google.com/view/masoodd) and [Martin Jagersand](https://webdocs.cs.ualberta.ca/~jag/)
- RunwayML for the [Photoshop paste code](https://github.com/runwayml/RunwayML-for-Photoshop/blob/master/host/index.jsx)
- [CoreWeave](https://www.coreweave.com) for hosting the public U^2Net model endpoint on Tesla V100s