About Custom Document Types

Table of Contents

Custom Document Types for Enterprise DLP

About Custom Document Types

Learn more about how Enterprise Data Loss Prevention (E-DLP) uses custom documents you upload to prevent exfiltration of sensitive data.

On May 7, 2025, Palo Alto Networks is introducing new Evidence Storage and Syslog Forwarding service IP addresses to improve performance and expand availability for these services globally.

You must allow these new service IP addresses on your network to avoid disruptions for these services. Review the Enterprise DLP Release Notes for more information.

Where Can I Use This?	What Do I Need?
NGFW (Managed by Panorama or Strata Cloud Manager) Prisma Access (Managed by Panorama or Strata Cloud Manager) Prisma Browser	Enterprise Data Loss Prevention (E-DLP) license Review the Supported Platforms for details on the required license for each enforcement point. Or any of the following licenses that include the Enterprise DLP license Prisma Access CASB license Next-Generation CASB for Prisma Access and NGFW (CASB-X) license Data Security license

Where Can I Use This?

What Do I Need?

NGFW (Managed by Panorama or Strata Cloud Manager)
Prisma Access (Managed by Panorama or Strata Cloud Manager)
Prisma Browser

Enterprise Data Loss Prevention (E-DLP) license
Review the Supported Platforms for details on the required license for each enforcement point.

Or any of the following licenses that include the Enterprise DLP license

Prisma Access CASB license
Next-Generation CASB for Prisma Access and NGFW (CASB-X) license
Data Security license

Enterprise Data Loss Prevention (E-DLP) supports upload and detection of custom documents containing intellectual property for which you want to prevent exfiltration. You can upload a custom document type to Enterprise DLP, or use a predefined document type, to classify and detect standardized documents and prevent exfiltration of sensitive data. You use the uploaded custom document types in data profiles as match criteria. Additionally, you can use custom document types along with predefined Machine Learning-based data patterns to apply additional ML-based detection algorithms complimented by confidential or sensitive data specific to your organization.

All custom documents you upload are isolated to the Enterprise DLP tenant you upload them to, and are not shared across different Enterprise DLP tenants. This means that if you want to use the same custom document in multiple data profiles in a multi-tenant environment, you must upload the custom document to each of the Enterprise DLP tenants.

Enterprise DLP supports file and non-file inspection for both predefined and custom document types.

Enterprise DLP uses Indexed Document Matching and Trainable Classifiers to fingerprint and index uploaded custom documents to scan for and detect documents that completely or partially match what you have already uploaded.

Indexed Document Matching (IDM)—Used to fingerprint documents and create a document type for documents commonly used by your organization. Uploading multiple documents allows you to create a custom document repository that you can use in a data profile.
Trainable Classifiers—Supervised machine learning model that analyzes document types for classifications. As you upload more custom documents as types, Enterprise DLP is able to continuously train the ML model to accurately detect sensitive data matches to inspect for and prevent exfiltration (Positive Training Documents) and those to ignore (Negative Training Set). The upload of set of custom documents using Trainable Classifiers is referred to as a custom document model.

Using IDM and Trainable Classifiers for detection of sensitive data is powerful enables Enterprise DLP to continuously improve its detection capabilities by indexing unstructured text in your documents.

IDM Examples
- Examples of different types of custom documents where IDM can be successfully applied are:
  - Standardized forms or documents specific to your business or organization
  - Patent documents
  - Specific business agreements
  - Specific intellectual property documents
- Examples of different types of custom documents where IDM is less successful because they are too generic or not specific to your organization
  - Generic whitepapers
  - Generic datasheets
  - Image or graphic-heavy documents with little text.
  - PDF files that contain non-extractable text.
    
    Some PDFs render text correctly on screen but use font encoding that prevents text extraction. To check whether a PDF is affected, open the file and copy a section of text. If the pasted result is garbled or contains unreadable characters, Enterprise DLP can't extract the text reliably and produces lower matching scores.
Trainable Classifier Examples
- Examples of different types of custom Positive Training Documents:
  - Proprietary product source code
  - Proprietary product formulas
  - Prerelease earnings, sales estimates, or accounting documents
  - Confidential marketing plans
  - Patient medical records
  - Customer purchasing documents and patterns
  - Confidential legal documents, and Merger & Acquisition documents
  - Proprietary manufacturing methods
- Examples of different types of custom Negative Training Documents:
  - Proprietary code from open source projects
  - Non-proprietary product information
  - Details of published annual accounts
  - Published marketing collateral and advertising copy
  - Healthcare documents
  - Publicly available consumer data
  - Publicly available materials and press releases
  - Industry standards and research

For example, your organization both buys and sells software. You want to only detect instances of sensitive customer data contained in invoices for software that you sell. In this case, you can upload a copy of your organization's invoice as a custom document types for fingerprinting.

However, custom document types will be less effective if you wanted to detect receipts for software your organization purchases. This is because there is too much variance in format between the various software vendors your organization purchases from. Greater document variance results in less accurate detection of matched traffic.

Custom Document Types for Enterprise DLP

Upload a Custom Document Type

Enterprise DLP Docs

About Custom Document Types

Enterprise DLP Docs

About Custom Document Types

Activation & Onboarding

Next-Generation Firewalls

SASE

Cloud-Delivered Security Services

Endpoints

Visibility & Monitoring

Best Practices

Experts Corner