More Visibility. Less Work.
Building data classification from scratch is slow, expensive, and distracts from your core product. Our SDK gives you powerful, accurate intelligence in a lightweight package that you can integrate in days, not quarters.

The Smarter Way to Build
Embed powerful data discovery into your product with minimal effort.
Deep Visibility
Automatically flags sensitive data out-of-the-box with no need for policy configuration. Includes hundreds of regulatory and industry-specific identifiers.
Multiple Data Types
Extracts and analyzes text from various file types and identifies sensitive content within images using built-in OCR.
Effortless Compliance
Streamlines compliance with evolving privacy laws such as USA PII, CCPA, HIPAA, GDPR, and more.
High Performance
Processes large files quickly (in seconds) for fast insights with low latency as data volume grows.
Minimal False Positives
Machine learning engines are continuously trained and refined for reliable analysis across many data types.
Exact Data Matching
Uses advanced fingerprinting to create unique identifiers, handling large datasets (2B+ records) with precision.
Easy SDK Integration
Designed for developers, by developers.
Built for Speed and Scale
Don't let data classification bottle-neck your application performance.Cross-Platform Support
Seamlessly integrate with Windows, macOS, and Linux environments. Supports C/C++, C#(.NET), GO, and Java.
Flexible API
Provides straightforward C APIs for data scanning, with support for both full-buffer and streaming modes.
Lightweight Footprint
Designed to run efficiently within your existing architecture without bloating your application.
Who Embeds Our SDK?
Add data classification to any product without building it yourself.
Know what sensitive data you're backing up. Help customers prioritize recovery and meet compliance requirements.
Add DLP capabilities to your SIEM, SOAR, or endpoint protection without building classification from scratch.
Automatically identify PII and privileged content during litigation holds and document review.
Offer data discovery as a service to your clients with white-label reporting and multi-tenant support.
Scan files on upload to prevent sensitive data from being stored in unauthorized locations.
Detect sensitive content in messages and attachments before they leave your customers' environment.
Frequently Asked Questions
What platforms does the SDK support?
Windows, macOS, and Linux with native bindings for C/C++, C#(.NET), Go, and Java. It integrates seamlessly into your existing architecture regardless of your tech stack.
How is the SDK licensed?
We offer flexible OEM licensing based on your deployment model—per-seat, per-device, or unlimited. Contact us to discuss pricing that fits your product and business model.
Does the SDK require an internet connection?
No. The SDK runs entirely offline with no cloud dependencies. All classification happens locally—critical for air-gapped environments and strict data residency requirements.
How many classifiers are included?
150+ pre-built classifiers covering PII, PHI, PCI, HIPAA, GDPR, CCPA, and more. All classifiers run simultaneously with no configuration required.
Can I add custom classifiers?
Yes. Define custom patterns or use Exact Data Matching (EDM) to fingerprint your customers' specific sensitive data like employee IDs or account numbers.
What file types can the SDK process?
1,000+ file types including Office documents, PDFs, archives, emails, and images. Built-in OCR extracts text from scanned documents and images at no additional cost.
Looking for an end-user product instead? See Risk Finder → — our ready-to-deploy sensitive data scanner with the same classification engine.
Ready to integrate?
Accelerate your roadmap and deliver value to your customers faster.