v0.9.4-beta · Apache 2.0 · Written in Rust

DLP that never sees
your data.

The scanning engine runs entirely in your environment. Document content never crosses our boundary — only metadata findings reach the cloud. Open-source, single binary, 10-minute setup.

10 min
to first scan
0 bytes
of content leaves your env
21×
faster than Tika
Apache 2.0
open source
nanodlp architecture — data plane in your environment, only metadata to cloud

Connecting to the apps your team already uses

Google Drive
Microsoft 365
Slack
Dropbox
GitHub
AWS S3
MySQL
Salesforce
Confluence
Jira
Notion
Zendesk
HubSpot
Okta
Box
GitLab
LinkedIn
SharePoint
Google Workspace
MS Teams
Snowflake
Zoom
Gmail
OneDrive
Airtable
Figma
Linear
Discord
Intercom
Postman
View all connectors and roadmap →
Why nanodlp

Legacy DLP was built before SaaS existed.

Traditional DLP tools require you to route all your data through a vendor's cloud. nanodlp flips the model — the scanner lives in your environment, and only cryptographic metadata ever leaves.

Zero trust boundary

Document content never crosses our network boundary. We receive only hashed metadata — file IDs, owner, pattern type, severity, and a SHA-256 of the matched value.

21× faster scanning

Written in Rust with a custom document parser. No JVM, no garbage collection pauses. Scans tens of thousands of files in minutes on a single CPU core.

10-minute setup

Single binary. No Kubernetes, no agents to manage, no professional services engagement. Connect your first SaaS app and see findings in under 10 minutes.

Fully open source

Apache 2.0 licensed. Read every line of code that touches your data. No black-box ML models, no opaque vendor decisions. Audit it, fork it, extend it.

Auditable detectors

Every detector is a named regex with optional proximity keywords and a mathematical validator. No ML black boxes. Override or extend any built-in pattern via a TOML overlay file.

Compliance-ready

HIPAA, PCI-DSS, GDPR, SOC 2, and HITRUST evidence packages generated automatically. Quarterly compliance reports with zero manual effort.

Privacy architecture

Where your bytes actually go.

Most DLP vendors say "we protect your data" while routing it through their cloud. nanodlp makes an architectural guarantee: document content is discarded before the network call.

nanodlp architecture diagram
OAuth tokens stay in your secret store. We never see them.
Document content is read, scanned, and discarded locally.
No document content, no snippets, no previews cross the boundary.
Only: file ID, owner, pattern type, severity, match count, value hash.
Detection engine

Built-in detectors for every major data type.

Regex + proximity checks + mathematical validators (Luhn, MOD-97, Verhoeff). Not ML — fast, auditable, and predictable. Extend with TOML overlays, no recompilation needed.

☁️
Cloud Secrets

AWS, GCP, Azure keys and tokens. Validated against known entropy thresholds.

🔌
SaaS API Keys

GitHub, Slack, Stripe, Twilio, SendGrid and dozens more. Format-validated.

🤖
AI Provider Keys

OpenAI, Anthropic, HuggingFace, Groq, Mistral, Cohere and more.

🔑
Private Keys

RSA, EC, OpenSSH, PGP, PKCS8, DSA, PuTTY private key formats.

🇺🇸
PII (US)

SSN, EIN, ITIN, phone, passport, driver's license, email address.

🌍
PII (International)

UK NIN, India Aadhaar/PAN, Brazil CPF/CNPJ, France INSEE, Germany, Canada SIN.

💳
Financial

Credit cards (BIN + Luhn), IBAN (MOD-97), bank routing numbers, SWIFT/BIC.

Crypto

Bitcoin addresses, Ethereum addresses, WIF private keys.

# custom-patterns.toml — extend without recompiling
[[detector]]
name = "internal_employee_id"
pattern = "EMP-[0-9]{6}"
severity = "medium"
keywords = ["employee", "staff", "hr"]
Get started

Up and running in 10 minutes.

Single binary. No dependencies. No Kubernetes. No professional services.

# 1. Install
curl -sSf https://get.nanodlp.io | sh

# 2. Authenticate your first connector
nanodlp auth google-drive

# 3. Run a scan
nanodlp scan --connector google-drive

# Output:
✓  Scanned 14,832 files across all connectors in 4m 12s
✓  Found 47 critical findings, 203 medium, 891 low
✓  Dashboard at http://localhost:7777
docker run --rm \
  -v ~/.nanodlp:/config \
  -v /var/run/docker.sock:/var/run/docker.sock \
  ghcr.io/nanodlp/nanodlp:latest \
  scan --connector google-drive
helm repo add nanodlp https://charts.nanodlp.io
helm install nanodlp nanodlp/nanodlp \
  --set connector.googleDrive.enabled=true \
  --set secrets.existingSecret=nanodlp-creds
# nanodlp.toml
[connectors.google_drive]
enabled = true
credential_file = "/run/secrets/gdrive_token"

[scan]
schedule = "0 2 * * *"  # daily at 2am
severity_threshold = "medium"

[output]
endpoint = "https://app.nanodlp.io/ingest"
api_key_env = "NANODLP_API_KEY"
Compliance

Evidence-ready for every audit.

Continuous scanning means your compliance evidence is always current. Generate auditor-ready reports in one command.

HIPAA
PCI-DSS
GDPR
SOC 2
HITRUST i1
ISO 27001
CCPA
FedRAMP (roadmap)
nanodlp compliance report — generated 2026-04-30T02:00:00Z
HIPAA §164.312(a)(2)(iv) — Encryption / Decryption
  Status:  PASS
  Finding: 0 unencrypted PHI fields in transit (all connectors use TLS 1.3)
  Evidence: scan_run_id=sr_20260430_0200, connector=google-drive, files_scanned=14832

PCI-DSS Req 3.3 — Sensitive Authentication Data
  Status:  FAIL — ACTION REQUIRED
  Finding: 3 files contain full PAN data (unmasked)
  Files:   drive://Finance/Q1-2026/card-export.xlsx (owner: alice@example.com)
  Remediation: nanodlp remediate --finding-id f_8821a3
Comparison

Honest comparison with alternatives.

We win on privacy architecture, speed, and cost. We lose on breadth of integrations and enterprise support. Here's the full picture.

Capability nanodlp Nightfall AI Microsoft Purview Symantec DLP
Data plane in customer envPartial
Open source
Setup time10 minHoursDays–weeksWeeks
Scanning speed21× fasterModerateSlowSlow
No content to vendor cloudPartial
Auditable detectorsML black boxPartialPartial
Free tier✓ OSS
Endpoint DLP✗ (roadmap)
Email DLP✗ (roadmap)
Honest tradeoffs

What we don't do (yet).

We'd rather tell you upfront than have you discover it in a POC.

No endpoint DLP

We scan SaaS apps, not laptops. If you need USB blocking or browser extension DLP, look at Nightfall or Purview for now. On our roadmap for v2.0.

No email scanning

Gmail and Outlook scanning is on the roadmap. Today we cover file storage and collaboration apps, not email bodies or attachments in transit.

No real-time blocking

nanodlp is a scanner and alerter, not an inline proxy. We detect and alert; we don't block uploads in real time. That requires a different architecture.

No ML classification

We use regex + validators, not LLMs. This means we're faster and auditable, but we won't catch "this document looks like an NDA" without explicit patterns.

Pricing

The data plane is free. Always.

The open-source binary, all connectors, all built-in detectors — free forever. Pay only for the hosted control plane when you need multi-org dashboards and compliance reporting.

OSS
Free, forever

For engineers who want to run DLP in their own environment.

  • ✓ Single binary (Apache 2.0)
  • ✓ All connectors
  • ✓ All built-in detectors
  • ✓ TOML pattern overlay
  • ✓ Self-hosted local dashboard
  • ✓ Community support
Install →
Enterprise
Talk to us

For organisations with 1,000+ employees, air-gap, or FedRAMP requirements.

  • ✓ Everything in Team
  • ✓ SSO / SAML
  • ✓ Dedicated customer success
  • ✓ Custom SLAs
  • ✓ HITRUST r2 path
  • ✓ Air-gapped deployment
  • ✓ FedRAMP path (roadmap)
Book a demo →

Start scanning in 10 minutes.

No credit card. No vendor call. No data leaves your environment.