Should we use HTTP or STDIO MCP for production?

HTTP for any production deployment with regulatory exposure. STDIO is fine for developer tools and short-lived local processes. HTTP transport gives you the audit trail and the gateway-layer enforcement that examinations and incident response require.

Do MCP servers need their own auth layer if we have a gateway?

Yes, as defense in depth. The gateway is the primary enforcement point, but the MCP server's own auth catches the case where traffic reaches the server through a path the gateway is not in front of. The two layers should be configured consistently.

How do we propagate user identity through an agent to an MCP server?

OAuth 2.1 token exchange (RFC 8693) is the standard pattern. The agent client exchanges its own token for a token that represents the agent acting on behalf of the user, then presents the exchanged token to the MCP server. The MCP server validates the propagation chain.

Does the gateway work with custom MCP transports?

For any MCP transport that runs over HTTP or HTTPS, yes. For STDIO or other non-network transports, the gateway is not in the traffic path and cannot enforce or audit. The architectural choice of transport determines the enforcement reach.

How does MCP authentication interact with the EU AI Act?

MCP traffic between an agent and a tool server inside a high-risk AI system is in scope for Article 12 logging. The audit record from the gateway covers the Article 19 field-level requirements (timestamp, principal identity, data classification, decision outcome). The MCP server-side audit is complementary but not a substitute.

MCP Server Authentication for Enterprise Deployments: Identity, Authorization, and the Boundary Question

Model Context Protocol (MCP) servers have moved from developer-tool integrations to production agent backends inside enterprises. The protocol started as the way an agent client could connect to a structured tool surface (file system, database, API), and has expanded into the standard interface for any tool an enterprise agent needs to invoke. The authentication and authorization model for MCP traffic at enterprise scale differs from REST-API authentication in two specific ways: the principal acting on the MCP server is typically an agent on behalf of a human or upstream agent, and the tool invocations have to carry the propagation chain so that the MCP server can authorize against the actual end-principal rather than the agent's standing service identity.

I want to walk through the MCP authentication model, the transport-boundary distinction that determines what a gateway can enforce, the failure modes that have surfaced in early production deployments, and the gateway-layer controls that produce an audit trail an enterprise security team can actually use.

What MCP authentication looks like

The MCP specification describes the protocol for client-server interactions including tool discovery, tool invocation, and structured responses. The authentication layer is defined at the transport level rather than embedded in the protocol itself. HTTP-transport MCP uses standard HTTP authentication patterns (typically OAuth 2.1 with JWT bearer tokens, or mTLS for service-to-service). STDIO-transport MCP runs inside a local process boundary and inherits the OS-level process identity.

The base case for a hosted MCP server is HTTP transport with OAuth 2.1. The agent client authenticates to the MCP server using a token. The token represents either the agent's standing service identity or, in the more architecturally clean case, the agent acting on behalf of a specific upstream principal.

The OAuth client-credentials grant produces tokens that represent the agent's standing identity. The token-exchange grant (RFC 8693) produces tokens that represent the agent acting for a specified human user. The choice between them has direct consequences for what the MCP server can authorize against and what the audit record can reconstruct.

The agent-on-behalf-of identity problem

A production agent typically acts on behalf of a human user, an upstream agent, or a system-initiated workflow. The agent's own credentials authorize the agent against the MCP server. The question of what scope of action the agent is allowed depends on the upstream principal it is acting for.

If the agent's token to the MCP server is a client-credentials token, the MCP server sees the agent's standing identity and authorizes against the union of permissions the agent itself holds. Any user the agent acts for inherits the agent's effective permissions. This is the failure pattern OWASP Top 10 for Agentic Applications calls excessive agency.

If the agent's token is a token-exchange token representing the agent-on-behalf-of-user, the MCP server can authorize against the intersection of what the agent is allowed to do and what the upstream user is allowed to do. The effective permission set is the intersection, not the union. Different users acting through the same agent get different effective scopes.

For enterprise deployments, the intersection model is the one regulators and security architecture reviews expect. The implementation effort to get there is nontrivial because it requires the agent client, the upstream identity provider, and the MCP server to coordinate on token exchange and on how the propagated identity is verified.

The HTTP vs STDIO boundary

The transport-boundary question matters because it determines what a gateway can enforce and what it cannot.

HTTP-transport MCP traffic flows over standard network protocols and can be intercepted, classified, and policy-evaluated by a gateway in front of the MCP server. The gateway can enforce identity-propagation requirements, per-tool authorization, response classification, and per-decision audit. This is the case that fits DeepInspect's enforcement boundary cleanly.

STDIO-transport MCP runs inside a local process boundary. The agent process spawns the MCP server process and communicates over stdin and stdout. There is no network hop to intercept. A gateway in the HTTP traffic path does not see STDIO MCP at all. Enforcement for STDIO MCP has to happen at the host process layer (host-based controls, OS-level audit) or inside the agent client itself. This is outside the HTTP-AI-traffic boundary DeepInspect operates on.

Enterprise architecture decisions about MCP transport should be made deliberately. STDIO is fine for developer tools and short-lived local processes. For production agent backends with regulatory exposure, HTTP transport with OAuth 2.1 and gateway-layer policy enforcement is the pattern that produces an audit trail.

Failure modes from early production deployments

The MCP deployment patterns visible in the threat-research and incident-disclosure record over the past six months have surfaced specific failure modes. Authentication bypass in widely deployed MCP server implementations, including the LiteLLM CVE family in June 2026, has shown that the MCP server's own auth layer is itself a high-value attack surface. Implementations that defer authentication to a downstream identity provider sometimes fail-open on identity-provider unreachability. Implementations that hold long-lived provider API keys in server memory expose those keys to any code-execution issue on the host.

Tool-permission escalation has been observed when an MCP server exposes a tool catalog that the agent client can enumerate without authentication scope checks. Agent clients that discover tools they were not specifically authorized to call can attempt to invoke them, and some server implementations have authorized the invocation on the basis of the agent's general session rather than per-tool scope.

Identity-spoofing through inadequate token validation has been observed when MCP servers accept JWTs without checking the issuer, the audience, the signature chain, or the token-exchange provenance. An attacker who can forge or alter a JWT can act as any principal the server recognizes.

Gateway-layer controls

A policy gateway at the AI request boundary, applied to HTTP-transport MCP traffic, adds three controls.

First, identity-binding enforcement. The gateway requires every MCP request to carry a verifiable identity assertion for the calling principal. For agent-on-behalf-of patterns, the gateway requires the propagation chain. Requests missing identity context are blocked at the boundary before reaching the MCP server.

Second, per-tool authorization. The gateway maintains the policy that says which identities can invoke which tools on which MCP servers. The MCP server's own authorization is a defense-in-depth layer behind the gateway's enforcement. The gateway's policy version is pinned in the audit record.

Third, per-decision audit. Every MCP request and response produces a per-decision record with the principal identity, the tool invoked, the arguments at the parameter level, the response classification, the policy decision, and the request ID. The record commits before the response returns to the agent client.

What sits outside the gateway boundary for MCP

The HTTP-AI-traffic boundary applies to MCP the same way it applies to direct LLM calls. STDIO-transport MCP, local-only MCP servers, and any MCP traffic that does not flow over network HTTP are outside the gateway's enforcement scope. The architectural decision to use HTTP-transport MCP for production deployments is what brings the traffic inside the boundary.

Local credential theft, host-level RCE on the MCP server host, and downstream provider-API abuse with credentials stolen from the MCP server are also outside the gateway boundary. Those concerns require host-level security controls (EDR, hardening, secret management) rather than gateway-layer policy.

DeepInspect

This is exactly what DeepInspect does for HTTP-transport MCP traffic. DeepInspect sits inline between the agent client and the MCP server, enforces identity-bound policy on every tool invocation, and writes a per-decision audit record outside the calling application. The architecture is stateless and identity-aware: each request is evaluated against the propagated principal identity, the agent identity, and the per-tool policy in effect, with the decision committed before the response returns.

The audit record carries the fields an enterprise security review and a regulatory inquiry both ask for: principal identity, agent identity, acts-on-behalf-of chain, tool invoked, arguments at the parameter level (with PII handling), policy ID applied, decision outcome, request and response timestamps, and the signature for tamper-evidence. The same record format covers direct LLM traffic and MCP traffic, which means the SOC has a single query path across both.

For platform teams that have been deploying MCP servers behind nothing but the server's own auth layer and are now reading the OWASP AISVS chapter-8 verification requirements against the production architecture, the gateway is the layer that fills the gap. Book a demo today.