Dr Sean McKeown S.McKeown@napier.ac.uk
Lecturer
Forensic analysts are often tasked with analysing large volumes of data in modern investigations, and frequently make use of hashing technologies to identify previously encountered images. Perceptual hashes, which seek to model the semantic (visual) content of images, are typically compared by way of Normalised Hamming Distance, counting the ratio of bits which differ between two hashes. However, this global measure of difference may overlook structural information, such as the position and relative clustering of these differences. This paper investigates the relationship between localised/positional changes in an image and the extent to which this information is encoded in various perceptual hashes. Our findings indicate that the relative position of bits in the hash does encode useful information. Consequently, we prototype and evaluate three alternative perceptual hashing distance metrics: Nor-malised Convolution Distance, Hatched Matrix Distance, and 2-D Ngram Cosine Distance. Results demonstrate that there is room for improvement over Hamming Distance. In particular, the worst-case image mirroring transform for DCT-based hashes can be completely mitigated without needing to change the mechanism for generating the hash. Indeed, perceived hash weaknesses may actually be deficits in the distance metric being used, and large-scale providers could potentially benefit from modifying their approach.
McKeown, S. (2025, April). Beyond Hamming Distance: Exploring Spatial Encoding in Perceptual Hashes. Presented at DFRWS EU 2025, Brno, Czech Republic
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | DFRWS EU 2025 |
Start Date | Apr 1, 2025 |
End Date | Apr 4, 2025 |
Acceptance Date | Nov 27, 2024 |
Online Publication Date | Mar 24, 2025 |
Publication Date | 2025-03 |
Deposit Date | Dec 17, 2024 |
Publicly Available Date | Mar 25, 2026 |
Journal | Forensic Science International: Digital Investigation |
Electronic ISSN | 2666-2817 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 52 |
Issue | Suppl |
Article Number | 301878 |
DOI | https://doi.org/10.1016/j.fsidi.2025.301878 |
Keywords | Perceptual Hashing, Semantic Approximate Matching, Distance Metrics, Hamming Distance, Image Forensics, Content Matching |
Public URL | http://researchrepository.napier.ac.uk/Output/4011064 |
Beyond Hamming Distance: Exploring Spatial Encoding in Perceptual Hashes
(2.2 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc-nd/4.0/
Fingerprinting JPEGs With Optimised Huffman Tables
(2018)
Journal Article
A forensic analysis of streaming platforms on Android OS
(2022)
Journal Article
InfoScout: An interactive, entity centric, person search tool.
(2016)
Presentation / Conference Contribution
Fast Filtering of Known PNG Files Using Early File Features
(2017)
Presentation / Conference Contribution
Microtargeting or Microphishing? Phishing Unveiled
(2020)
Presentation / Conference Contribution
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search