Question 1

How is the DeepInspect platform structured?

Accepted Answer

DeepInspect is split across two planes. The control plane handles policy authoring, versioning, and configuration. The enforcement plane runs inline with application traffic and applies the active policy to every AI request. The two planes are separated so a policy change is explicit, reviewable, and version-controlled before it takes effect in production.

Question 2

What deployment modes does DeepInspect support?

Accepted Answer

Three modes. Self-hosted deployments run inside the customer's VPC or on-premises environment and keep request payloads inside the customer network boundary. Cloud-hosted deployments run in DeepInspect's managed environment for customers that prefer a SaaS operational model. Hybrid deployments place the gateway as a network proxy in front of egress traffic, for environments where retrofitting application-level integration is impractical.

Question 3

Can DeepInspect run in a fully air-gapped environment?

Accepted Answer

Yes. Air-gapped deployment is the strict form of the self-hosted mode. The control plane, the enforcement gateway, and the forensic store run inside an isolated network with no route to the public internet. Because the gateway is model-agnostic, the upstream model can be a local inference runtime such as vLLM, an internal model server, or Ollama running inside the same network, and natural-language policies are evaluated by a customer-hosted LLM or SLM. Policy evaluation, payload transformation, and record commitment complete with no outbound call.
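One way to sanity-check the "no outbound call" property before go-live is to confirm that every configured endpoint points at an internal address. The sketch below is illustrative only: the config keys, addresses, and function names are assumptions, not DeepInspect settings. It treats hostnames as external, because resolving them needs a DNS lookup that this sketch leaves out.

```python
import ipaddress
from urllib.parse import urlsplit


def is_internal_endpoint(url: str) -> bool:
    """Return True only when the URL's host is a private or loopback IP literal."""
    host = urlsplit(url).hostname
    if host is None:
        return False
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        # Hostnames would need a DNS lookup; this sketch conservatively
        # reports them as not internal.
        return False
    return addr.is_private or addr.is_loopback


def external_endpoints(config: dict) -> list:
    """List the config keys whose endpoint is not provably internal."""
    return sorted(name for name, url in config.items()
                  if not is_internal_endpoint(url))


# Hypothetical air-gapped configuration (all names and addresses are made up).
config = {
    "upstream_model": "http://10.0.4.12:8000/v1",
    "policy_evaluator": "http://127.0.0.1:11434",
    "forensic_store": "http://192.168.20.7:9000",
}
print(external_endpoints(config))
```

An empty list means every endpoint in this config is a private or loopback address. A real check would also cover DNS names and firewall rules.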

Question 4

How does the gateway scale?

Accepted Answer

Gateway instances are stateless and interchangeable. A fleet of gateway pods behind a load balancer scales horizontally with request volume, with each pod holding a read-only copy of the active policy version in local memory. Policy updates propagate through the control plane as atomic version switches, so a given request evaluates against exactly one policy version end to end.
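The atomic version switch can be pictured as replacing a single reference to an immutable policy snapshot. The following is a minimal sketch of that idea, not DeepInspect's implementation: `PolicyVersion`, `PolicyHolder`, and the `blocked_terms` rule are hypothetical names chosen for illustration.

```python
import threading
from dataclasses import dataclass


@dataclass(frozen=True)
class PolicyVersion:
    """Immutable snapshot of one policy version (hypothetical shape)."""
    version: int
    blocked_terms: tuple


class PolicyHolder:
    """Holds the active policy; a switch replaces the whole snapshot at once."""

    def __init__(self, initial: PolicyVersion) -> None:
        self._active = initial
        self._lock = threading.Lock()  # serializes writers only

    def current(self) -> PolicyVersion:
        # Readers take one reference and never see a half-updated policy.
        return self._active

    def switch(self, new: PolicyVersion) -> None:
        with self._lock:
            self._active = new


def handle_request(holder: PolicyHolder, prompt: str) -> dict:
    # Pin one snapshot up front and use it for the entire request.
    policy = holder.current()
    allowed = not any(term in prompt.lower() for term in policy.blocked_terms)
    return {"allowed": allowed, "policy_version": policy.version}


holder = PolicyHolder(PolicyVersion(version=1, blocked_terms=("ssn",)))
print(handle_request(holder, "My SSN is on file"))
holder.switch(PolicyVersion(version=2, blocked_terms=()))
print(handle_request(holder, "My SSN is on file"))
```

Because each request pins its snapshot before evaluating, a switch that lands mid-request affects only later requests.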

Question 5

What happens during a control-plane outage?

Accepted Answer

A control-plane outage freezes policy versions at the last-known-good state on every gateway pod, so enforcement continues throughout the incident. Replay and rollback workflows resume when the control plane recovers, and every record committed during the incident remains independently verifiable through its per-record signature.
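The last-known-good behavior and per-record verification can be sketched as follows. This is an illustration under stated assumptions: the class and function names are hypothetical, and HMAC-SHA256 stands in for whatever signature scheme the product actually uses.

```python
import hashlib
import hmac
import json
from typing import Callable


class ControlPlaneUnavailable(Exception):
    """Raised by the (hypothetical) fetcher when the control plane is unreachable."""


class GatewayPolicyCache:
    """Keeps enforcing the last-known-good policy version during an outage."""

    def __init__(self, initial_version: int) -> None:
        self.active_version = initial_version
        self.frozen = False

    def refresh(self, fetch_latest: Callable[[], int]) -> int:
        try:
            self.active_version = fetch_latest()
            self.frozen = False
        except ControlPlaneUnavailable:
            # Do not clear or downgrade the policy; stay on the current version.
            self.frozen = True
        return self.active_version


def sign_record(record: dict, key: bytes) -> str:
    """Sign a canonical JSON encoding of the record (HMAC as a stand-in)."""
    payload = json.dumps(record, sort_keys=True, separators=(",", ":")).encode()
    return hmac.new(key, payload, hashlib.sha256).hexdigest()


def verify_record(record: dict, signature: str, key: bytes) -> bool:
    """Verify one record on its own, with no control-plane lookup."""
    return hmac.compare_digest(sign_record(record, key), signature)
```

Verification needs only the record, its signature, and the key, which is why records written during an outage can still be checked afterwards.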

Question 6

How does the gateway authenticate callers?

Accepted Answer

End-user authentication stays with the calling application and its identity provider (OIDC, SAML, or the enterprise's own SSO). The application attaches the resulting identity context to each call. The gateway first verifies a DeepInspect-issued access token to confirm that the caller is authorized to reach it, then passes the identity context into every policy evaluation.
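The two-step order (verify the access token, then read the identity context) might look like the sketch below. The token format here is an assumption made for illustration: a base64url JSON body plus an HMAC-SHA256 tag, with hypothetical `sub` and `exp` claims. The real DeepInspect token format is not described in this answer.

```python
import base64
import hashlib
import hmac
import json


def _b64(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).decode()


def issue_token(claims: dict, key: bytes) -> str:
    """Build a 'body.tag' token (illustrative format only)."""
    body = _b64(json.dumps(claims, sort_keys=True).encode())
    tag = _b64(hmac.new(key, body.encode(), hashlib.sha256).digest())
    return f"{body}.{tag}"


def verify_token(token: str, key: bytes, now: float) -> dict:
    """Check the tag and expiry; raise PermissionError on any failure."""
    body, sep, tag = token.partition(".")
    if not sep:
        raise PermissionError("malformed token")
    expected = _b64(hmac.new(key, body.encode(), hashlib.sha256).digest())
    if not hmac.compare_digest(expected.encode(), tag.encode()):
        raise PermissionError("bad signature")
    claims = json.loads(base64.urlsafe_b64decode(body))
    if claims["exp"] <= now:
        raise PermissionError("token expired")
    return claims


def admit(token: str, identity_context: dict, key: bytes, now: float) -> dict:
    # Step 1: is this caller authorized to reach the gateway at all?
    claims = verify_token(token, key, now)
    # Step 2: only then hand the application-supplied identity to policy.
    return {"caller": claims["sub"], "identity": identity_context}
```

The identity context is never read when token verification fails, so an unauthorized caller cannot influence policy evaluation by supplying a forged identity.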

Question 7

How does the gateway handle multi-provider routing and failover?

Accepted Answer

The same gateway that evaluates policy also selects the upstream model. Per-route configuration declares a primary provider, eligible failover providers, and tier-based routing rules. Selection runs only after policy admits the request, so cost optimization can never send a request to a destination that policy does not permit for its data class. Health checks track latency, error rate, and rate-limit headroom, and the gateway shifts traffic when a provider degrades.
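The ordering described above (policy filter first, health second, route preference last) can be sketched like this. All names and the health thresholds are assumptions for illustration; the actual configuration schema and thresholds are not given here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Provider:
    name: str
    allowed_data_classes: frozenset  # data classes policy permits for this destination


@dataclass
class Health:
    p95_latency_ms: float
    error_rate: float           # fraction of failed calls, 0.0 to 1.0
    rate_limit_headroom: float  # fraction of quota remaining, 0.0 to 1.0

    def ok(self) -> bool:
        # Thresholds are illustrative placeholders, not product defaults.
        return (self.p95_latency_ms < 2000
                and self.error_rate < 0.05
                and self.rate_limit_headroom > 0.10)


def select_provider(route: list, data_class: str, health: dict) -> Optional[Provider]:
    """Pick the first permitted, healthy provider; route order is primary, then failovers."""
    # Step 1: policy decides which destinations are eligible at all.
    permitted = [p for p in route if data_class in p.allowed_data_classes]
    # Step 2: drop degraded providers.
    healthy = [p for p in permitted if health[p.name].ok()]
    # Step 3: prefer the primary, then failovers in declared order.
    return healthy[0] if healthy else None


route = [
    Provider("primary", frozenset({"public"})),
    Provider("backup", frozenset({"public", "pii"})),
]
health = {"primary": Health(300, 0.01, 0.8), "backup": Health(400, 0.01, 0.8)}
print(select_provider(route, "pii", health).name)
```

Because the policy filter runs before any ranking, a provider that is not permitted for the data class is never a candidate, whatever its cost or health.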

