【打印本页】      【下载PDF全文】   查看/发表评论  下载PDF阅读器  关闭
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览 39次   下载 34 本文二维码信息
码上扫一扫!
一种基于ViT的局部图像拷贝检测算法
朱辰,陈于勋,陈禹坤,王总辉
分享到: 微信 更多
(浙江大学国家卓越工程师学院 杭州 中国 310015;浙江大学计算机科学与技术学院 杭州 中国 310027)
摘要:
近年来,随着图像编辑技术的不断发展,数字图像的知识产权保护形势日益严峻。为有效遏制图像内容的非法窃取、复制与篡改行为,切实保障用户的合法权益,图像拷贝检测作为内容审核体系的关键组成部分,其重要性不言而喻。其中,一种常见的规避检测手段是攻击者将源图像的部分内容巧妙地植入目标图像中,形成“画中画”式的局部拷贝。针对这一挑战,本文提出了一种基于ViT的局部图像拷贝检测算法。该算法旨在利用ViT强大的局部特征提取能力,在原始全局特征的基础上提取出图像的局部特征,揭示图像间潜在的依赖关系,并通过重新排列相关图像的顺序,提高具有更高相似度的图像在检测结果中的排序,从而精准实现对图像局部信息的拷贝检测。本文与基于自监督描述符的图像拷贝检测算法(A Self-Supervised Descriptorfor Image Copy Detection)相比,在DISC2021数据集上取得显著提升,局部拷贝图像识别准确率提高10%,μAP (Micro AveragePrecision)指标提高10%~15%。此外,本文还利用热力图进行了直观的定性分析,证实了ViT能够敏锐地捕捉到图像中存在局部拷贝的区域,进一步验证了算法的有效性。本文提出的基于ViT的局部图像拷贝检测重排序算法能对使用画中画图像增强方式的拷贝图像进行有效检测,并在DISC2021数据集上取得了优异成绩,拓展了数字图像取证领域研究的新思路。
关键词:  图像拷贝检测  ViT  图像增强  重排序  热力图
DOI:10.19363/J.cnki.cn10-1380/tn.2025.11.07
投稿时间:2024-01-15修订日期:2024-04-19
基金项目:本课题得到国家重点研发计划项目(No.2020AAA0140000)和浙江省教育厅一般科研项目(No.Y202351231)资助。
A ViT-based Algorithm for Localized Image Copy Detection
ZHU Chen,CHEN Yuxun,CHEN Yukun,WANG Zonghui
Polytechnic Institute, Zhejiang University, Hangzhou 310015, China;College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Abstract:
In recent years, with the continuous development of image editing technology, the situation of intellectual property protection of digital images has become increasingly severe. In order to effectively curb the illegal stealing, copying and tampering of image content, and effectively protect the legitimate rights and interests of users, image copy detection, as a key component of the content audit system, is of great importance. One of the common means to avoid detection is that the attacker skillfully implants part of the source image into the target image, forming a "picture-in-picture" type of partial copy. To address this challenge, this paper proposes A ViT-based Algorithm for Localized Image Copy Detection. The algorithm aims to utilize the powerful local feature extraction capability of ViT to extract the local features of an image on the basis of the original global features, to reveal the potential dependencies between the images, and to improve the ordering of images with higher similarity in the detection results by rearranging the order of related images, so as to accurately realize the copy detection of local information of an image. Compared with A Self-Supervised Descriptor for Image Copy Detection, this paper achieves a significant improvement on the DISC2021 dataset, with 10% increase in localized copy image recognition accuracy, and 10% - 15% increase in μAP (Micro Average Precision) metrics. In addition, this paper also carries out an intuitive qualitative analysis using heat maps, which confirms that ViT is able to acutely capture the region of the image where local copies exist, further validating the effectiveness of the algorithm. The ViT-based localized image copy detection reordering algorithm proposed in this paper is able to effectively detect copy images using picture-in-picture image enhancement, and achieves excellent results on the DISC2021 dataset, which expands new ideas for research in the field of digital image forensics.
Key words:  image copy detection  ViT  image enhancement  reordering  heat map