ELI5: What is DLP?

Picture a guard standing at every exit of a building, checking bags to make sure nobody takes anything valuable out. DLP (Data Loss Prevention) works like that guard for your computer network. It watches emails, file transfers, and other ways data moves around, and if someone tries to send secret information outside the company — whether on purpose or by accident — it steps in and stops it.

Overview

Data Loss Prevention (DLP) is a set of tools and policies designed to detect and prevent the unauthorized transfer of sensitive data outside the organization. DLP systems inspect data at rest, in motion, and in use to identify sensitive content based on patterns, keywords, classification labels, and policies. When a policy violation is detected, DLP can alert, block, encrypt, or quarantine the data.

Key Concepts

  • DLP deployment types:
    • Network DLP — monitors data in transit on the network; inspects email, web, file transfers
    • Endpoint DLP — installed on workstations and servers; monitors copy/paste, USB transfers, printing, screen captures
    • Cloud DLP — monitors data in cloud applications and storage; often integrated with CASB
  • Detection methods:
    • Pattern matching / regex — detects credit card numbers, SSNs, and other structured data formats
    • Keyword matching — flags content containing specific words or phrases
    • Document fingerprinting — creates a hash of sensitive documents and detects copies or derivatives
    • Classification-based — enforces policies based on data classification labels and metadata
    • Machine learning — identifies sensitive content based on trained models
  • Policy actions: alert, block, encrypt, quarantine, log, notify manager
  • Common use cases:
    • Preventing email of unencrypted PII or PHI
    • Blocking USB transfers of classified documents
    • Detecting credit card numbers in cloud storage
    • Preventing source code from being uploaded to unauthorized repositories
  • False positives — DLP can generate many false positives; tuning is essential for operational effectiveness

Exam Tips

Remember

DLP monitors data in all three states: at rest, in transit, in use. Network DLP catches email and web exfiltration. Endpoint DLP catches USB and print. Cloud DLP integrates with CASB. Tuning is critical to reduce false positives.

Connections

Practice Questions

Scenario

See case-dlp for a practical DevOps scenario applying these concepts.