More Visibility. Less Work.

Building data classification from scratch is slow, expensive, and distracts from your core product. Our SDK gives you powerful, accurate intelligence in a lightweight package that you can integrate in days, not quarters.

Cross-Platform Text Extraction Included OCR Included 150+ Classifiers Runs Offline

System diagram of the Inspect Data SDK

The Smarter Way to Build

Embed powerful data discovery into your product with minimal effort.

Deep Visibility

Automatically flags sensitive data out-of-the-box with no need for policy configuration. Includes hundreds of regulatory and industry-specific identifiers.

Multiple Data Types

Extracts and analyzes text from various file types and identifies sensitive content within images using built-in OCR.

Effortless Compliance

Streamlines compliance with evolving privacy laws such as USA PII, CCPA, HIPAA, GDPR, and more.

High Performance

Processes large files quickly (in seconds) for fast insights with low latency as data volume grows.

Minimal False Positives

Machine learning engines are continuously trained and refined for reliable analysis across many data types.

Exact Data Matching

Uses advanced fingerprinting to create unique identifiers, handling large datasets (2B+ records) with precision.

Easy SDK Integration

Designed for developers, by developers.

Built for Speed and Scale

Don't let data classification bottle-neck your application performance.

Cross-Platform Support

Seamlessly integrate with Windows, macOS, and Linux environments. Supports C/C++, C#(.NET), GO, and Java.

Flexible API

Provides straightforward C APIs for data scanning, with support for both full-buffer and streaming modes.

Lightweight Footprint

Designed to run efficiently within your existing architecture without bloating your application.

Who Embeds Our SDK?

Add data classification to any product without building it yourself.

Backup & Recovery

Know what sensitive data you're backing up. Help customers prioritize recovery and meet compliance requirements.

Security Vendors

Add DLP capabilities to your SIEM, SOAR, or endpoint protection without building classification from scratch.

eDiscovery & Legal

Automatically identify PII and privileged content during litigation holds and document review.

MSPs & MSSPs

Offer data discovery as a service to your clients with white-label reporting and multi-tenant support.

Cloud Storage

Scan files on upload to prevent sensitive data from being stored in unauthorized locations.

Email & Collaboration

Detect sensitive content in messages and attachments before they leave your customers' environment.

Frequently Asked Questions

What platforms does the SDK support?

Windows, macOS, and Linux with native bindings for C/C++, C#(.NET), Go, and Java. It integrates seamlessly into your existing architecture regardless of your tech stack.

How is the SDK licensed?

We offer flexible OEM licensing based on your deployment model—per-seat, per-device, or unlimited. Contact us to discuss pricing that fits your product and business model.

Does the SDK require an internet connection?

No. The SDK runs entirely offline with no cloud dependencies. All classification happens locally—critical for air-gapped environments and strict data residency requirements.

How many classifiers are included?

150+ pre-built classifiers covering PII, PHI, PCI, HIPAA, GDPR, CCPA, and more. All classifiers run simultaneously with no configuration required.

Can I add custom classifiers?

Yes. Define custom patterns or use Exact Data Matching (EDM) to fingerprint your customers' specific sensitive data like employee IDs or account numbers.

What file types can the SDK process?

1,000+ file types including Office documents, PDFs, archives, emails, and images. Built-in OCR extracts text from scanned documents and images at no additional cost.

Looking for an end-user product instead? See Risk Finder → — our ready-to-deploy sensitive data scanner with the same classification engine.

Ready to integrate?

Accelerate your roadmap and deliver value to your customers faster.