: Supported File Types for Scanning Assets
Focus
Focus

Supported File Types for Scanning Assets

Table of Contents

Supported File Types for Scanning Assets

Learn about the file type categories that
Data Security
supports for asset scanning.
Data Security
supports the file type categories listed below to scan assets.
Data Security
extracts metadata and contextual content for more than 100 file formats. For the supported categories listed below,
Data Security
supports all Apache Tika file types.
The following table lists the supported file categories. The example file types don't represent a complete list—rather, commonly used formats: this list changes as the underlying technology supports new file types. Use the examples to gauge the scope of the file categories.
File Type Support for Scanning
File Type Category
Scanning Support
Example File Types
Hyper Text Markup Language
htm, xhtml, ETC.
XML and derived formats
ooxml, ETC.
Microsoft Office, OLE-based and XML-based
doc, docx, ETC.
Source code
Java, C, C++, ETC.
Mail
MS Outlook msg, MS Outlook pst, RFC 822, mbox, ETC.
Executable programs and libraries
Windows EXE, Linux/BSD, ETC.
Open Document Format
odf, ETC.
iWorks
numbers, pages, keynote, ETC.
Portable Document Format
pdf, ETC.
Electronic Publication Format
ePub, ETC.
Rich Text Format
rtf, ETC.
Compression and packaging format
tar, rar, zip, 7zip, ETC.
Text
txt, ETC.
Feed and Syndication
RSS, Atom, IPTC ANPA, ETC.
Help formats
chm, ETC.
Java class files and archives
jar, ETC.
Font formats
TrueType, Adobe Font, ETC.

Recommended For You