Dr Sean McKeown S.McKeown@napier.ac.uk
Lecturer
Beyond Hamming Distance: Exploring Spatial Encoding in Perceptual Hashes
Mckeown, Sean
Authors
Abstract
Forensic analysts are often tasked with analysing large volumes of data in modern investigations, and frequently make use of hashing technologies to identify previously encountered images. Perceptual hashes, which seek to model the semantic (visual) content of images, are typically compared by way of Normalised Hamming Distance, counting the ratio of bits which differ between two hashes. However, this global measure of difference may overlook structural information, such as the position and relative clustering of these differences. This paper investigates the relationship between localised/positional changes in an image and the extent to which this information is encoded in various perceptual hashes. Our findings indicate that the relative position of bits in the hash does encode useful information. Consequently, we prototype and evaluate three alternative perceptual hashing distance metrics: Nor-malised Convolution Distance, Hatched Matrix Distance, and 2-D Ngram Cosine Distance. Results demonstrate that there is room for improvement over Hamming Distance. In particular, the worst-case image mirroring transform for DCT-based hashes can be completely mitigated without needing to change the mechanism for generating the hash. Indeed, perceived hash weaknesses may actually be deficits in the distance metric being used, and large-scale providers could potentially benefit from modifying their approach.
Citation
Mckeown, S. (2025, April). Beyond Hamming Distance: Exploring Spatial Encoding in Perceptual Hashes. Presented at DFRWS EU 2025, Brno, Czech Republic
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | DFRWS EU 2025 |
Start Date | Apr 1, 2025 |
End Date | Apr 4, 2025 |
Acceptance Date | Nov 27, 2024 |
Deposit Date | Dec 17, 2024 |
Journal | Forensic Science International: Digital Investigation |
Electronic ISSN | 2666-2817 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Keywords | Perceptual Hashing, Semantic Approximate Matching, Distance Metrics, Hamming Distance, Image Forensics, Content Matching |
This file is under embargo due to copyright reasons.
Contact repository@napier.ac.uk to request a copy for personal use.
Related Outputs
PHASER: Perceptual Hashing Algorithms Evaluation and Results -an Open Source Forensic Framework
(2024)
Presentation / Conference Contribution
You might also like
Fingerprinting JPEGs With Optimised Huffman Tables
(2018)
Journal Article
A forensic analysis of streaming platforms on Android OS
(2022)
Journal Article
InfoScout: An interactive, entity centric, person search tool.
(2016)
Presentation / Conference Contribution
Fast Filtering of Known PNG Files Using Early File Features
(2017)
Presentation / Conference Contribution
Microtargeting or Microphishing? Phishing Unveiled
(2020)
Presentation / Conference Contribution
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search