B2B SaaS with AI Features: How Enterprise Security Reviews Now Block the Deal
B2B SaaS vendors that added AI features in the last twelve months are now meeting an enterprise security review process that did not exist when the product was scoped. Buyers ask about identity context at the model API call, per-decision audit records, prompt-level data classification, and the deployment regime under the EU AI Act. Sales cycles stall on questions the engineering team did not anticipate. This piece walks through what enterprise security reviews now ask of SaaS-with-AI vendors, where most product architectures are exposed, and the inspection layer that closes the gap before procurement does.

The B2B SaaS vendors that shipped AI features in 2025 are running into a security review process in 2026 that did not exist when the features were scoped. Enterprise buyers, prompted by their internal AI governance committees, are asking specific questions about how the SaaS handles prompts, where identity context is attached, what per-decision audit records the customer can retrieve, and how the vendor maps to EU AI Act Articles 12, 19, and 26. The questionnaire that used to ask about SOC 2 and data residency now has a twenty-line AI section. Sales cycles that closed in 45 days last year are stalling at 90 days while procurement waits on architecture answers.
I want to walk through what enterprise security reviews now ask of SaaS-with-AI vendors, where the typical AI feature architecture falls short, and the inspection layer that closes the gap.
What enterprise security reviews now ask
The AI section of a 2026 enterprise security review covers five areas that did not appear in 2024 questionnaires.
The first is identity attribution at the model API call. Reviewers ask whether the vendor can produce, for any specific AI interaction inside the SaaS, the identity of the natural person whose action triggered the model call. A vendor that proxies all customer prompts through a single OpenAI API key bound to the vendor's organization fails this question. The credential identifies the vendor, not the customer's user.
The second is prompt-level data classification. The reviewer wants to know whether the vendor inspects the prompt content for the customer's regulated data types before it reaches the model. A customer in healthcare wants to know whether PHI in a prompt is detected and redacted. A customer in finance wants to know what happens when a user pastes a customer's MNPI-relevant identifier into an AI feature. Vendors who pass the prompt through unmodified fail this question.
The third is per-decision audit records. Reviewers want a sample audit record. They want fields. They want to know whether the customer can pull every AI decision involving their tenant data from the vendor on demand, in a structured format, with cryptographic integrity. Vendors who can produce only aggregate usage logs fail this question.
The fourth is EU AI Act mapping. Reviewers ask whether the SaaS's AI feature, when used by a customer in a high-risk category (Annex III), triggers the customer's Article 26 deployer obligations and how the vendor supports those obligations as the provider. Vendors who have not done the Article 6 self-assessment fail this question.
The fifth is shared responsibility documentation. Reviewers want a written description of what the vendor is responsible for and what the customer is responsible for at each step of the AI feature's data flow. Vendors who cannot produce this document on request fail this question.
Where SaaS AI feature architectures are exposed
The typical AI feature architecture in a B2B SaaS product was designed for product velocity, not for an enterprise security review. Three structural exposures recur in the questionnaires I see.
Shared service credentials hide the natural person
When a SaaS product calls OpenAI or Anthropic on behalf of a customer's user, the call uses an API key issued to the vendor. The provider's logs show the vendor as the caller. The vendor's own logs may or may not record which customer user initiated the call. Few products attach the customer's user identity, the customer's tenant identifier, and the customer's role information to the model API call as structured metadata. When the customer's compliance team asks "produce all model calls initiated by user jdoe@acmecorp.com between March 15 and March 22," the vendor either cannot produce the records or produces them from an application log that the reviewer rejects as self-attestation.
Prompt content is opaque to the customer
The customer never sees the prompt that the SaaS sends to the model on its behalf. The customer cannot run its own DLP against that prompt. The customer's data classification rules are not applied. If a sensitive data type ends up in the prompt because of a user action inside the SaaS, the customer has no way to know and no way to prevent it. Reviewers know this and ask what the vendor does to detect regulated data types at the prompt boundary before transmission to the model provider.
Audit records are application logs, not policy decision records
Most SaaS products that show "AI activity" in a customer-facing audit log are surfacing application events: "user requested AI summary." The record lacks the policy that was in effect, the data classification that applied, the model and version used, the input data fingerprint, and the integrity signature that lets the customer rely on the record in a regulatory dispute. Application logs fail the Article 12 traceability test under regulator questioning.
Mandate vs Compliance for SaaS with AI features
The enterprise customer's compliance team operates under EU AI Act Article 26 (deployer), Article 12 (record-keeping), and any sector regulation that applies to the customer's industry. The SaaS vendor operates under Articles 8 through 17 (provider) for the AI components the vendor builds, and under Article 25 if the vendor is downstream of an OpenAI or Anthropic model.
The letter of these articles can be satisfied with documentation. Surviving an enterprise security review requires more.
The reviewer's test is whether the vendor's architecture produces records that a regulator would accept. The reviewer is sitting in a chair the regulator will eventually sit in. The questions are a dress rehearsal for the regulatory inquiry the customer expects after August 2, 2026.
The records test
The reviewer asks the vendor to produce, for a sample user action, the full record. Identity of the natural person. Tenant identifier. Role and authorization context. Model and version called. Prompt text or prompt fingerprint. Data classification applied. Policy version evaluated. Decision outcome. Response handling. Timestamp. Integrity signature. The vendors that can produce this in a single response field win the review. The vendors that say "we have application logs" lose it.
The architecture test
The reviewer asks how the records are produced. Specifically, whether the application that performed the AI action also wrote the audit record, or whether a separate enforcement layer captured the record independently. Self-attestation by the application that performed the action does not satisfy a regulator's traceability test. Reviewers know this and ask the architecture question directly.
The exit test
The reviewer asks how the customer leaves the vendor and takes its AI audit records with it. The DORA and EU AI Act regimes both require deployers to retain records that may sit inside vendor systems. The customer needs a way to export those records on a schedule and on exit. The vendor that has not designed an export path loses on the exit test.
DeepInspect
This is the gap DeepInspect closes for SaaS vendors selling AI features into enterprise accounts. DeepInspect sits inline between the SaaS application and any LLM API. For every customer user action that calls a model, DeepInspect attaches the customer's user identity, the customer's tenant identifier, and the customer's role context. It evaluates a per-tenant policy that classifies prompt content, redacts or blocks regulated data, and records the decision with cryptographic integrity.
The vendor gets a single architecture that satisfies the customer's Article 12 traceability requirement, the customer's Article 26 deployer obligations, the customer's data classification rules, and the customer's audit export demand. The audit record is produced by the proxy, not by the application, which closes the self-attestation gap that breaks enterprise reviews.
If your sales cycle is stalling on the AI section of enterprise security questionnaires and you have customers asking for Article 12-grade audit records, let's talk.
Frequently asked questions
- Why are AI features now a procurement blocker for B2B SaaS?
Enterprise buyers updated their AI governance policies in the second half of 2025 in anticipation of the August 2, 2026 EU AI Act enforcement date. The policies require that any vendor handling regulated data through AI features satisfy specific architectural requirements: identity attribution at the model API call, per-decision audit records, prompt-level data classification, and a documented shared responsibility model. SaaS vendors that built AI features without these architectural properties cannot answer the questionnaire and the procurement team blocks the renewal or new deal. This is not a hypothetical procurement risk. It is happening in renewal cycles right now.
- Does Article 26 of the EU AI Act apply to my SaaS customer?
Article 26 applies to deployers of high-risk AI systems. If your customer uses your SaaS in a function that falls under Annex III (creditworthiness, employment screening, education access, insurance pricing, biometric identification, critical infrastructure, law enforcement, migration, or several other categories), the customer is a deployer under Article 26. Your customer inherits the deployer obligations. Your SaaS architecture determines whether your customer can satisfy them. Most B2B SaaS vendors discover Article 26 applies indirectly, because their customers happen to use the product in a high-risk function the vendor did not anticipate.
- What is the difference between application audit logs and per-decision audit records?
Application audit logs record application events from the application's perspective: a user clicked something, a feature was used, a result was returned. Per-decision audit records record the policy decision that gated the AI action: which identity was permitted, which data classification applied, which policy version evaluated, which outcome was produced. The first is generated by the application that took the action; the second is generated by an enforcement layer independent of the application. Regulators draw a line between the two because the system under audit cannot be the system generating the audit record. Self-attestation fails the traceability test under Article 12.
- What does an enterprise security review want to see in an AI shared responsibility document?
The customer's compliance team wants a written description, signed by the vendor, that maps the data flow of each AI feature and identifies which steps the vendor controls and which steps the customer controls. Where is the prompt assembled. Who attaches identity context. Who runs data classification. Which policy is evaluated and by whom. Where is the audit record committed. Who retains the record and for how long. Who is responsible for incident notification when an AI-related event occurs. The customer uses this document to satisfy its own internal governance and to discharge the deployer obligations under Article 26.
- How do SaaS vendors typically support a customer's audit export?
The minimum the customer needs is a programmatic interface that lets the customer export, in a structured format with integrity signatures, every per-decision audit record involving the customer's tenant over a chosen time window. The vendor should support exports on a scheduled cadence, on the customer's demand, and on contract exit. Audit records that live only inside the vendor's product surface and cannot be exported in bulk fail the customer's record retention test under DORA Article 12 of the AI Act and most sector regulations.