Skip to main content

Research Repository

Advanced Search

Exploring the Need For an Updated Mixed File Research Data Set

Davies, Simon R.; Macfarlane, Richard; Buchanan, William J.



Mixed file data sets are used in a variety of research areas, including Digital Forensics, Malware analysis and Ransomware detection. Researchers recently seem to either have to create their own custom data sets or well-known data sets are used, but additional file types have been added. In the majority of research, these data sets are never published. This paper outlines ongoing research around currently available mixed file data sets to determine if a need exists for a new modern cybersecurity mixed file data set, and how this could be created and maintained. Part of the investigation would include identifying common files types currently needed for research, and an outline methodology into data set creation is also presented. It was found when reviewing ransomware detection research literature that almost no proposal provided enough detail on how the test data set was created, or sufficient description of its actual content, to allow it to be recreated by other researchers interested in reconstructing their environment and validating the research results.

Presentation Conference Type Conference Paper (Published)
Conference Name 2021 International Conference on Engineering and Emerging Technologies (ICEET)
Start Date Oct 27, 2021
End Date Oct 28, 2021
Online Publication Date Jan 5, 2022
Publication Date 2022
Deposit Date Feb 23, 2022
Publisher Institute of Electrical and Electronics Engineers
Pages 426-430
Series ISSN 2409-2983
Book Title 2021 International Conference on Engineering and Emerging Technologies (ICEET)
Public URL
Publisher URL