Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/chenyirui/GIM
This repository is the official repository of the GIM.
https://github.com/chenyirui/GIM
Last synced: about 1 month ago
JSON representation
This repository is the official repository of the GIM.
- Host: GitHub
- URL: https://github.com/chenyirui/GIM
- Owner: chenyirui
- Created: 2024-06-02T08:09:32.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-06-02T08:30:11.000Z (7 months ago)
- Last Synced: 2024-06-24T09:51:23.689Z (7 months ago)
- Size: 4.75 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- Awesome-Segment-Anything - [code
README
# GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
This repository is the official repository of the GIM.
## Abstract
The extraordinary ability of generative models emerges as a new trend in image editing and generating realistic images, posing a serious threat to the trustworthiness of multimedia data and driving the research of image manipulation detection and location(IMDL).
However, the lack of a large-scale data foundation makes IMDL task unattainable. In this paper, a local manipulation pipeline is designed, incorporating the powerful SAM, ChatGPT and generative models. Upon this basis, We propose the GIM dataset, which has the following advantages: 1) Large scale, including over one million pairs of AI-manipulated images and real images. 2) Rich Image Content, encompassing a broad range of image classes 3) Diverse Generative Manipulation, manipulated images with state-of-the-art generators and various manipulation tasks. The aforementioned advantages allow for a more comprehensive evaluation of IMDL methods, extending their applicability to diverse images. We introduce two benchmark settings to evaluate the generalization capability and comprehensive performance of baseline methods. In addition, we propose a novel IMDL framework, termed GIMFormer, which consists of a ShadowTracer, Frequency-Spatial Block (FSB), and a Multi-window Anomalous Modelling (MWAM) Module. Extensive experiments on the GIM demonstrate that GIMFormer surpasses previous state-of-the-art works significantly on two different benchmarks.![Visualization](fig/Visualization.png)