A recent contributed article in The AI Innovator by James Rice, Vice President and AI Data Evangelist at Protegrity, examines why enterprise AI security must focus on the data and knowledge flowing through AI systems, rather than treating the model as the primary control point.
The article argues that AI workflows operate differently from traditional applications. They retrieve context, synthesize information across repositories, generate outputs, and trigger actions across systems. As a result, sensitive data can move through logs, embeddings, vector stores, knowledge graphs, prompts, and generated outputs in ways that traditional perimeter-based security controls were not designed to govern.
Why AI workflows challenge traditional security models
The article explains that AI pipelines can bypass familiar trust boundaries because data is continuously copied, transformed, recombined, and passed between systems. A single AI workflow may draw from customer records, internal documents, financial reports, and knowledge bases, creating new combinations of sensitive information that static controls may not detect.
For enterprises, that means the workflow itself can become the attack surface. Security teams need visibility into how data moves, how context is assembled, and how outputs are generated across AI systems.
RAG and agentic AI expand the discovery surface
James Rice also highlights the risks created by retrieval-augmented generation and agentic AI workflows. These systems can discover and synthesize information across connected repositories, including stale folders, legacy archives, misconfigured systems, and knowledge sources that teams may not realize are still accessible.
As vector stores and knowledge graphs consolidate enterprise context, they become powerful sources of business intelligence. Without proper governance, they can also become high-value exposure points for sensitive data and enterprise knowledge.
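The governance gap described above can be made concrete with a minimal sketch: a toy in-memory vector store whose retrieval step filters documents by sensitivity label before anything reaches the model's context window. All names and labels here (`DOCS`, `retrieve`, the clearance levels) are illustrative assumptions, not any real product's API.

```python
# Minimal sketch: retrieval-time governance for a vector store.
# Sensitivity is checked before ranking, so out-of-scope documents
# never enter the assembled context.
import math

DOCS = [
    {"text": "Q3 revenue summary", "embedding": [0.9, 0.1], "sensitivity": "confidential"},
    {"text": "Public product FAQ", "embedding": [0.8, 0.2], "sensitivity": "public"},
]

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_emb, caller_clearance, k=5):
    """Return the top-k documents the caller is cleared to see."""
    allowed = {"public"} if caller_clearance == "public" else {"public", "confidential"}
    candidates = [d for d in DOCS if d["sensitivity"] in allowed]
    candidates.sort(key=lambda d: cosine(query_emb, d["embedding"]), reverse=True)
    return [d["text"] for d in candidates[:k]]
```

The design point is that the filter runs inside the retrieval path itself, so a stale folder or misconfigured source that happens to be indexed is still screened at the moment context is assembled.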
Why context-aware data protection matters
The article positions context engineering as a new control plane for AI security. Because AI systems consume meaning, not just rows, tables, files, or objects, protection needs to account for what the data is, how sensitive it is, who or what is using it, and what task it supports.

Data-centric protections such as tokenization, anonymization, masking, and structured semantic controls can help sensitive data move through AI workflows with reduced exposure. This allows organizations to preserve utility for training, retrieval, inference, and personalized outputs while enforcing controls based on purpose and context.
AI security as a data architecture decision
The article’s core message is that AI security is an architectural issue. As enterprises move from copilots to autonomous agents, governance needs to happen at runtime across retrieval, tool calls, data access, and actions.
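Runtime governance of the kind described above can be sketched as a policy gate that every agent tool call passes through: the check considers the declared purpose of the workflow before the action executes. The purposes, tool names, and policy table below are hypothetical, chosen only to illustrate the pattern.

```python
# Sketch: a runtime policy gate for agentic tool calls. Each call is
# checked against a purpose-based allowlist before it runs, so
# enforcement happens at the moment of action, not only at design time.
ALLOWED = {
    "customer_support": {"read_ticket", "send_reply"},
    "analytics": {"read_ticket"},
}

class PolicyViolation(Exception):
    """Raised when a tool call is not permitted for the stated purpose."""

def governed_call(purpose: str, tool: str, fn, *args, **kwargs):
    """Execute a tool call only if policy permits it for this purpose."""
    if tool not in ALLOWED.get(purpose, set()):
        raise PolicyViolation(f"{tool!r} not allowed for purpose {purpose!r}")
    return fn(*args, **kwargs)
```

In this shape the policy table, not the agent's prompt, decides what an autonomous workflow may do, which is the architectural point the article makes about moving control into the data and action layer.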
Organizations that embed security into the data layer can support broader AI adoption while maintaining control over where data goes, how it is used, and why AI systems take specific actions. In this view, the data layer becomes the durable control point for AI security, even as models, agents, infrastructure, and tooling continue to change.
Note: This summary is based on the external article in The AI Innovator, “The Weak Link in AI Security Isn’t the Model. It’s the Data,” and is provided for convenience. Please refer to the original publication for full context and source reporting.