The Filesystem Is the Database: Why Agents Need a New Storage Primitive

Subramanya N

Something interesting is happening in the agentic infrastructure space, and it is not what most people expected. For the past two years, the dominant paradigm for giving agents access to knowledge has been Retrieval-Augmented Generation: embed your documents, store them in a vector database, and let the model query them at inference time. RAG worked. It was good enough. But “good enough” has a shelf life, and in 2026, that shelf life is expiring.

A new pattern is emerging across the industry, and it is converging from multiple directions at once. Mintlify replaced its entire RAG pipeline with a virtual filesystem and saw session creation drop from 46 seconds to 100 milliseconds [1]. Turso built AgentFS, a SQLite-backed filesystem that gives every agent its own copy-on-write sandbox [2]. Box, the enterprise content giant, announced that it is repositioning its entire platform as a virtual filesystem layer for AI agents [3]. And ByteDance open-sourced OpenViking, a context database that organizes all agent memory, resources, and skills as a hierarchical filesystem [4].

These are not niche experiments. They are signals of a fundamental shift. The filesystem is becoming the universal interface for agent cognition, and the database is quietly becoming its substrate.

Why RAG Hit a Wall

RAG was the right answer for 2023. You had a pile of documents, a model with a limited context window, and you needed a way to surface relevant chunks at query time. Vector embeddings and similarity search solved that problem elegantly.

But agents are not chatbots. An agent does not ask one question and leave. It explores. It reads a file, discovers a reference, follows it, reads another file, runs a command, writes an output. This is not a retrieval problem. It is a navigation problem.

RAG pipelines struggle with this for three reasons. First, they are stateless by design. Every query is independent; there is no concept of “I was just looking at this directory, now show me the adjacent file.” Second, they flatten structure. A documentation site with a clear hierarchy of sections, pages, and code examples gets shredded into anonymous 512-token chunks that lose their organizational context. Third, they are expensive at scale. Embedding computation, vector index maintenance, and re-ranking all add latency and cost that compound as the corpus grows.

The filesystem solves all three. It is inherently stateful (the agent has a working directory). It preserves structure (directories, subdirectories, files). And it is fast because the operations are simple: ls, cat, grep, find. These are not novel abstractions. They are the most battle-tested interface in computing.

The Convergence: Four Approaches, One Pattern

What makes this moment significant is that the filesystem pattern is emerging independently across very different contexts.

Mintlify’s ChromaFs is perhaps the most instructive example. Mintlify powers documentation assistants for thousands of companies. Their original architecture was textbook RAG: chunk the docs, embed them, retrieve at query time. When they replaced it with ChromaFs, a virtual filesystem that intercepts UNIX commands and translates them into Chroma database queries, the results were dramatic. Session creation went from 46 seconds to 100 milliseconds, a 460x improvement. Marginal cost per conversation dropped from $0.0137 to effectively zero [1]. The key insight: the agent already knows how to navigate a filesystem. Teaching it to use cat /auth/oauth.mdx is trivial compared to teaching it to formulate the right vector query.

Turso’s AgentFS attacks a different problem: agent isolation and auditability. Every agent gets its own SQLite-backed filesystem with copy-on-write semantics. The host filesystem is a read-only base layer; the agent writes to a SQLite delta layer. Every file operation, tool call, and state change is recorded. The entire agent runtime, files, state, history, fits in a single portable SQLite file [2]. This is not just a filesystem. It is an auditable, reproducible execution environment.

Box’s enterprise VFS is the most strategically significant. Box CEO Aaron Levie has been explicit: agents need a filesystem to do knowledge work in the enterprise [3]. But Box is not pitching a literal filesystem. They are pitching a “dynamic data delivery contract” that can be backed by object storage, relational databases, or their own content platform. The filesystem is the interface; the backing store is whatever makes sense for the data. What makes Box’s play interesting is the governance layer: permissions, audit trails, and compliance boundaries that carry over automatically from the content platform to the agent.

ByteDance’s OpenViking takes the pattern furthest. It organizes all agent context, memories, resources, skills, knowledge, under a viking:// protocol using standard filesystem semantics. Agents navigate with ls and find. But the clever part is the tiered access model: every piece of context is processed into three layers. L0 is a one-sentence summary for quick retrieval. L1 is an overview with core information for planning. L2 is the full content for deep reading [4]. The agent starts with L0, drills into L1 when it needs more, and only loads L2 when it is doing detailed work. On the LoCoMo benchmark, this reduced token consumption from 24.6 million to 4.2 million while increasing task completion rates to 52% [4].

Filesystem as Interface, Database as Substrate

The pattern that connects all four is what I would call the VFS duality: the filesystem wins as the interface, and the database wins as the substrate. This is not an either-or choice. It is a layered architecture.

Why the filesystem wins as the interface is straightforward. LLMs are trained on the internet, and the internet is built by developers who think in terms of files, directories, paths, and command-line tools. Models are unusually competent with these primitives because they have seen billions of examples of developers navigating codebases, reading files, and running shell commands. When you give an agent a filesystem, you are meeting it where its training data lives.

Why the database wins as the substrate is equally clear. The moment agent memory needs to be shared, audited, queried by multiple agents, or made reliable under concurrency, you need database guarantees. ACID transactions, access control, semantic search, version history: these are hard problems that databases have spent decades solving. Reimplementing them on top of a literal filesystem is a path to pain.

The VFS pattern gives you both. The agent sees files and directories. The system sees tables, indexes, and access control lists. ChromaFs stores everything in Chroma but exposes it as files. AgentFS stores everything in SQLite but exposes it as a POSIX filesystem. OpenViking uses its own storage engine but exposes it as viking:// paths. Box uses its enterprise content platform but exposes it as a navigable tree.

But Can a VFS Actually Beat the Native Filesystem?

The natural objection to all of this is: why not just use the real filesystem? POSIX is right there. Every operating system ships with it. Why add an abstraction layer?

I wanted to answer this question empirically, so I built markdownfs, a from-scratch virtual filesystem in Rust designed specifically for agent workloads [6]. It supports the full set of UNIX-like commands (ls, cat, grep, find, chmod, chown), Git-style versioning with content-addressable storage, multi-user permissioning, and exposes three access methods: a CLI/REPL, an HTTP/REST API, and an MCP server that agents like Claude and Cursor can connect to directly.

The architecture is simple: an in-memory inode table backed by a content-addressable blob store using SHA-256 hashing, with tokio::RwLock for safe concurrent access. Files are deduplicated automatically. Version control uses the same commit/revert model as Git, but at the filesystem level. Persistence is handled through atomic bincode snapshots.

When I benchmarked markdownfs against the native filesystem across the standard agent operations (file creation, reads, writes, directory listing, grep, find, move, copy, deletion), markdownfs averaged roughly 130x faster across the board. The reasons are structural, not incidental. In-memory operations eliminate disk I/O entirely. Content-addressable storage means duplicate files are stored once. Zero-copy reads mean the agent gets data without serialization overhead. And because the entire filesystem state lives in a single process, there are no system call boundaries to cross.

The comparison is particularly stark for the operations agents perform most frequently:

Operation	Why VFS Wins
Repeated reads (agent re-reading context)	In-memory, zero-copy. No disk seeks, no page cache misses.
grep across files (agent searching for patterns)	All content is in-memory. No directory traversal, no file handle management.
Rapid file creation (agent producing work artifacts)	No filesystem journaling, no inode allocation on disk, no fsync.
Directory listing (agent exploring structure)	BTreeMap lookup vs. readdir syscalls.

But performance is not the real argument. The real argument is what the native filesystem cannot do. A POSIX filesystem has no concept of semantic search. It has no built-in versioning (you need Git for that). It has no tiered access model (you get the whole file or nothing). It has no content deduplication. It has no audit trail of agent operations. And critically, it has no MCP interface, which means agents cannot access it through the standard protocol that the ecosystem is converging on.

The VFS is not just faster. It is a richer primitive. It gives you the familiar interface of ls and cat while adding the capabilities that agents actually need: versioning, permissions, search, deduplication, and protocol-native access via MCP or HTTP.

What This Means for RAG

To be clear, RAG is not dead. Vector search remains valuable for fuzzy, semantic queries where the agent genuinely does not know what it is looking for. But the honest assessment is that RAG has been over-applied. Many of the use cases where teams deployed RAG pipelines, documentation retrieval, codebase navigation, enterprise knowledge management, are better served by a filesystem interface.

The evidence is striking. Mintlify’s 460x speedup came from replacing RAG with a filesystem, not augmenting it [1]. Research from Letta shows that agents using simple filesystem operations achieve 74% accuracy on memory benchmarks, competitive with specialized retrieval tools. And agentic keyword search approaches can achieve over 90% of RAG performance without vector databases at all [5].

The future is likely hybrid. RAG for open-ended semantic search. Filesystem for structured navigation and task execution. But the center of gravity is shifting toward the filesystem, and the strategic implications are significant.

The Strategic Imperative

If you are building agentic infrastructure, you need a VFS strategy. Here is why.

For SaaS companies: the lesson from Box is that the filesystem is becoming the integration surface for agents. If your platform’s content is not navigable as a filesystem, agents will bypass you. The SaaS companies that expose their data through filesystem-like interfaces will become part of the agentic workflow. Those that do not will become invisible to agents, which means invisible to users.

For infrastructure vendors: the database is not going away. It is moving underneath the filesystem. This is actually good news for database companies. Turso understood this and built AgentFS on top of SQLite. Every agent that spins up creates a new database. The more agents the world runs, the more databases the world needs. But the database needs to disappear behind a filesystem abstraction.

For enterprises: the governance story is what matters. Box’s pitch is not really about filesystems. It is about the fact that their permission model, audit trail, and compliance infrastructure automatically extends to agents when content is accessed through the VFS layer [3]. This is the answer to the question every CISO is asking: “How do we let agents access our content without creating a security nightmare?”

The Unifying Layer

The agentic infrastructure stack has been evolving in clear phases: tools (MCP), skills, and context graphs. The virtual filesystem fits into this arc as the delivery mechanism for all three. MCP tools are invoked through the filesystem. Skills are stored as files. Context graphs are navigated as directory trees. The filesystem does not replace these layers. It unifies them behind a single, familiar interface.

This is the real insight. The filesystem is not a new idea. It is the oldest abstraction in computing. But that is exactly why it works for agents. In a world where we are inventing new paradigms every quarter, the most powerful move might be reaching back to the most proven interface we have and putting a modern database behind it.

The companies that understand this, Mintlify, Turso, Box, ByteDance, are not building something new. They are recognizing something old and giving it a new job.

References:

[1] Mintlify. (2026, April 2). How we built a virtual filesystem for our Assistant. Mintlify Blog.

[2] Turso. (2026). The Missing Abstraction for AI Agents: The Agent Filesystem. Turso Blog.

[3] Blocks and Files. (2026, March 9). Box pitches ‘virtual filesystem’ layer for AI agents. Blocks and Files.

[4] Volcengine. (2026). OpenViking: An open-source context database for AI Agents. GitHub.

[5] Signals. (2026, February). Keyword Search is All You Need: Achieving RAG-Level Performance Without Vector Databases Using Agentic Tool Use. Signals.

[6] Subramanya N. (2026). markdownfs: A high-performance, concurrent markdown database built in Rust. GitHub.