Many corpora of intelligence interest are so large that it is impractical to read them entirely. Analysts need tools that will focus attention on significant structures and particular documents. Here we exploit singular value decomposition and word2vec as tools for this purpose, and compare them with one another in a real-world application — a malware forum from the dark web.
Authors:
Nasser Alsadhan, David Skillicorn, Richard Frank
Published:
International Symposium on Foundations of Open Source Intelligence and Security Informatics (FOSINT-SI)
July 2017
https://dl-acm-org.proxy.lib.sfu.ca/doi/abs/10.1145/3110025.3116205