All blogs

Rethinking the AI Data Path with Secure Data Platforms

David A. Chapa David A. Chapa
AI Strategist, Hitachi Vantara

June 1, 2026


For years, enterprise infrastructure treated security, storage, networking, and compute as mostly separate operational layers. Disruptive technologies tend to blur those lines. We saw this with backup and recovery, virtualization, and now with AI. With each shift, organizations realized pretty quickly that the specialists driving those disruptive technologies – like AI – often developed a better understanding of the full operational stack than the siloed infrastructure teams themselves because they had to understand how the entire workload behaved end-to-end.

As AI systems evolve into larger reasoning models and agentic workflows operating across distributed data sources, the infrastructure challenges are no longer just about raw compute power. Security, data locality, orchestration, context memory coordination, and the movement of data between systems are all becoming part of the same operational strategy, and all critical to delivering scalable, secure and efficient AI. That is part of what makes NVIDIA Vera BlueField-4 STX security announcement interesting.

Underneath all the product and platform messaging is a fairly important architectural acknowledgment that the bottleneck in AI workflows is the data movement itself. NVIDIA says this directly in its messaging, describing the shift from “human interaction to machine reasoning and data movement.”

For years, most AI conversations sounded a lot like traditional HPC discussions centered almost exclusively around GPUs, FLOPS, and scaling compute. The primary differences became model parameter counts and how quickly centralized AI servers could load training data. Meanwhile, some of us have been arguing for a while that the longer-term challenge would eventually become memory architecture, orchestration efficiency, data locality, and the cost of moving data through distributed AI systems.  Let’s face it, moving data has always been one of the most expensive line items in any IT strategy. Sometimes that cost showed up as latency. Sometimes it showed up as larger and faster storage platforms, higher-speed networking, or the infrastructure required to move massive amounts of data around the environment efficiently.

But unlike the challenges we faced in the late 90s and early 2000s, the issue is no longer simply getting data off storage fast enough, but how efficiently context, embeddings, inference pipelines, memory state, and orchestration layers move between GPUs, DPUs, CPUs, memory tiers, networks, and distributed systems without introducing unnecessary latency or inefficiency into the workflow itself.

That is part of why locality is becoming such an important architectural consideration. Moving the data and the services operating on that data closer to the execution environment creates efficiencies across the overall system.

What NVIDIA is describing with Vera BlueField-4 STX is a tighter integration between compute, networking, storage, memory coordination, and security operating much closer to the AI execution pipeline itself. You may wonder why security is such a large component of this announcement because, quite frankly, security should already be table stakes. The reason for the emphasis is that when AI data is compromised, the impact can propagate through distributed systems very quickly. At scale, even small amounts of poisoned data can multiply across inference pipelines, agents, workflows, and data sources much faster than with traditional enterprise applications.

That is why all of these categories taken together are so important. Security, locality, orchestration, governance, and runtime visibility are no longer independent operational concerns. Even NVIDIA’s security messaging reflects that shift, with discussions around inline enforcement directly within the AI data path instead of relying entirely on traditional perimeter models.

That direction makes sense because AI workloads behave very differently from traditional enterprise applications and even classic HPC environments. These systems are increasingly dynamic, distributed, and orchestration-heavy, with growing context windows, increasing agent-to-agent communication, and continuous interaction between inference pipelines, APIs, vector databases, tools, external data sources, and other models in real time. As those interactions scale, the operational cost of moving data around the environment starts becoming a much larger part of the overall performance equation.

One of the more interesting parts of the Vera BlueField-4 STX direction is that NVIDIA is not just talking about storage throughput or adding another infrastructure component around the GPU. The architecture is focused on reducing friction. In system design and architecture, we look at where friction exists and what we can do to minimize or abstract that friction from the operational workflow. The friction NVIDIA Vera BlueField-4 STX is addressing is directly inside the AI execution pipeline itself by bringing data services, orchestration, networking, security enforcement, and memory coordination much closer to the compute environment where reasoning and inference are actually occurring. If you understand how networking routers work, think of this as reducing hops.

The security side of the announcement is equally interesting because it reflects how AI is changing traditional assumptions around enterprise security architecture. NVIDIA describes storage as becoming “a real-time system that governs how agents access, trust, and act on data.”

In many ways, this starts resembling some of the same operational concerns we see in real-time systems where trusted data, governance, access controls, and decision integrity become critical because the system itself is actively participating in operational decision-making. If the underlying system is compromised, manipulated, or operating on bad data, it can directly impact the decisions being made.

The same concern increasingly applies to enterprise AI operating at scale. That is a very different operational model than the perimeter-based approaches most enterprise environments were originally designed around.

Traditional enterprise applications were relatively predictable and deterministic. Agentic AI systems are not either, they are far from it. Autonomous systems continuously interact with distributed data sources, APIs, tools, models, inference pipelines, and other agents with very little human intervention occurring between transactions. As those interactions scale, runtime visibility, governance, inline inspection, and policy enforcement all become much more important operationally. With NVIDIA Vera BlueField-4 STX and NVIDIA DOCA, the infrastructure itself becomes part of the security model instead of simply being the environment AI applications run on top of.

The broader architectural convergence in the NVIDIA Vera BlueField-4 STX announcement may ultimately be the biggest takeaway. Enterprise AI infrastructure is no longer just a collection of hardware components deployed into an environment and managed independently by siloed infrastructure teams. It is becoming a coordinated systems solution where compute, memory, networking, orchestration, locality, governance, and security all operate together as part of the same execution environment. That is very different from traditional IT, and I think the industry is only now starting to understand how significant that shift really is. Which is why Hitachi Vantara is focused on helping our customers build Responsible Enterprise AI environments with NVIDIA.

Accelerate your transformation with Hitachi IQ, Hitachi Vantara’s AI-solution suite for modern workloads.


David A. Chapa

David A. Chapa

David A. Chapa serves as Chief AI Strategist at Hitachi Vantara. He works at the intersection of AI systems, enterprise infrastructure, and long-horizon risk, focusing on how early architectural decisions shape financial exposure, operational resilience, and strategic flexibility over time. His perspective emphasizes memory-centric systems design, data locality, and sovereign AI environments that help organizations transition from pilot-stage experimentation into durable production capability.