Predefined Data Patterns on SaaS Security API

Learn about how SaaS Security API categorizes predefined data patterns.
Use one of the following topics:
We are in the process of replacing SaaS Security DLP (Classic) with SaaS Security DLP. During this process, use the topic that matches your tenant. If you purchased SaaS Security with Enterprise DLP Add‑on, opted in for a trial of SaaS Security with Enterprise DLP Add‑on, or have a new tenant with SaaS Security DLP, use SaaS Security API Data Profiles; otherwise, use SaaS Security API Data Patterns—SaaS Security DLP (Classic).

SaaS Security API Data Patterns—SaaS Security DLP (Classic)

SaaS Security API provides predefined data patterns that enable you to discover sensitive content and how that content is being shared or accessed in your managed cloud applications. The service automatically scans your cloud applications when you Add Cloud Apps to SaaS Security API using predefined data patterns, classifies all documents using machine learning, and checks hash on all Microsoft Office documents, PDF, and portable executable files against WildFire rules without requiring you to create any policies.
As the service displays incidents that match the predefined data patterns, you can explore and filter the results to determine if the content that the service reported poses a risk to your organization. Then, you can do any of the following to prevent future violations:
SaaS Security API categorizes predefined data patterns as follows:
Content Category
Scans for
Intellectual Property
Scans files for RSA and AWS secret keys and confidential documents that are at risk of being stored or shared in a way that could result in a loss of intellectual property.
You can specify
File Extensions to Exclude
. Excluding files that are unlikely to have intellectual property information that is public and not at risk of being exposed or shared in non-compliant ways helps minimize false positives.
Personally Identifiable Information (PII)
Scans for PII data, such as U.S., Canadian, and international social security numbers. It also scans for Tax IDs from the U.S., Australia, Canada, Germany, and the UK for both the Unique Tax Payer ID, (UTR) and National Insurance Number (NINO) formats.
For each type of PII that SaaS Security API scans for, you can specify the minimum number of occurrences required to trigger a match. As the number of violations for a specific asset exceeds the specified threshold, the severity of the risk increases.
Financial Information
Scans for financial data including credit card numbers, credit card magnetic stripe data, international bank account numbers, financial accounting, bank statements, personal finance, invoices, and other financial documents. By default, SaaS Security API performs strict checking on credit card numbers to reduce false positives.
Healthcare Information
Scans healthcare documents for exposure to sensitive or confidential information, related to Clinical Laboratory Improvement Amendments (CLIA) number, Drug Enforcement Administration (DEA) number, and other healthcare documents.
SaaS Security API uses machine learning algorithms to classify information and to detect sensitive information.
Legal Information
Scans legal documents for exposure to sensitive or confidential information related to bankruptcy filings, lawsuits, business agreements, mergers and acquisition information, patents, and other legal documents.
SaaS Security API uses machine learning algorithms to classify information and to detect sensitive information.
Malware
Scans files using WildFire Analysis to detect and protect against malicious portable executables (PEs), Microsoft Office Files, Adobe Portable Document Format (PDF) files, and known threats based on file hash.
A hash is a unique fingerprint of a file. It is string of letters and digits that is generated as a result of running a file through a cryptographic hash function.
By default, SaaS Security API automatically submits portable executable files to the WildFire service for analysis (
Windows executables
).

SaaS Security API Data Profiles

SaaS Security API provides predefined data profiles, which include predefined data patterns, that enable you to discover sensitive content and how that content is being shared or accessed in your managed cloud applications. The service automatically scans your cloud applications when you Add Cloud Apps to SaaS Security API using predefined data patterns, classifies all documents using machine learning, and checks hash on all Microsoft Office documents, PDF, and portable executable files against WildFire rules without requiring you to create any policies.
The predefined data patterns and data profiles that come with DLP (Data Loss Prevention) work automatically: you don't enable, configure, or create data policies to use them, unless you want to open incidents. After your end users upload files that include social security numbers or credit card numbers, for example, and SaaS Security API scans theses assets, SaaS Security API evaluates, identifies, then exposes those assets. These tools are built into SaaS Security API—they’re automatically provisioned and protect your data.
  • SaaS Security with Enterprise DLP Add–on—
  • SaaS Security DLP—
SaaS Security with Enterprise DLP Add‑on provides you exclusive access to predefined data patterns and data profiles. SaaS Security web interface displays all predefined data patterns and data profiles irrespective of your having SaaS Security with Enterprise DLP Add–on, and uses a lock icon to highlight data patterns and data profiles that require the license.
As the service displays incidents that match the predefined data patterns, you can explore and filter the results to determine if the content that the service reported poses a risk to your organization. Then, you can do any of the following to prevent future violations:
SaaS Security API categorizes predefined data patterns as follows:
Predefined Data Profile
SaaS Security with Enterprise DLP Add‑on Required?
Scans For
Bulk CCN
Yes
Credit card numbers or Voyager credit card numbers more than or equal to 100.
CCPA (California Consumer Privacy Act)
Yes
Bank - American Bankers Association Routing Number, Bank - International Bank Account Number, Driver License - US, Address - US, Tax Id - US - TIN, Credit Card Number, Magnetic Stripe Information, Passport - US, Address - US, National Id - US Social Security Number - SSN.
Corporate Financial Docs
Yes
Financial accounting and generic financial information.
Financial Information
No
Bank statements, bank routing number, credit card numbers (strict checking), bankruptcy filing, international bank account number, invoices, magnetic stripe information, and Committee on Uniform Securities identification procedure number.
GDPR (General Data Protection Regulation)
Yes
GDPR- Driver's License, Tax ID, National ID, and Passport.
GLBA (Gramm-Leach-Bliley Act)
Yes
Credit card number, Voyager credit card, magnetic stripe information, Tax Id - US - TIN, and National Id - US Social Security Number - SSN.
Healthcare
No
Clinical Laboratory Improvement Amendments (CLIA) number, Drug Enforcement Administration (DEA) number, and other healthcare documents.
Intellectual Property
Yes
Source code, AWS secret key, access key, company confidential.
Legal
Yes
Legal- Legal documents including lawsuits, M&A, standard business agreements, patents, and bankruptcy filings.
Malware
No
All Microsoft Office documents, PDF, and portable executable files, and known threats against WildFire. The verdict is based on a hash, which is a unique fingerprint of a file.
PII (Personally-Identifiable Information)
Yes
Tax ID, National ID, Passport, and Driver’s License.
Profanity
Yes
Censored, blasphemous, personal, homophobic, racial, and sexual content.
Sensitive content
Yes
National ID, Bank information, AWS Secret key or access key, company confidential, CCN.
U.K. PIOCP
No
Tax ID or National ID.
Self Harm
Yes
Self Harm - Suicidal.

Recommended For You