What is the Concept Extraction method used for?
◦ Identifying document types using keywords that may exist in the document contents
◦ Filtering out known system files on a system
◦ Identifying temporary files generated by applications that may contain documentary information
◦ Identifying files in different formats that contain exactly the same data (such as a PDF and its matching MS Word document)