The case of AI tool registries – O'Reilly

As enterprise AI agent adoption scales, the absence of centralized, organization-level tool infrastructure is creating compounding costs. When adoption is made to optimize deployment speed, enterprises expose themselves to a combination of risks: duplicate engineering effort, security risks, and operational ambiguity.

Each enterprise needs its own shared tool registry, reflecting its specific regulatory environment, security posture, and operational traditions. To be clear, this is not an argument for any public package manager, such as npm, pypi, or maven. The infrastructure that every enterprise needs is internal; Its scope is limited to its own teams, its own data, its own policies, its own domain. Trying to expand the scope beyond the boundaries of individual organizations would result in premature standardization in a fast-moving, emerging space.

A shared enterprise tool registry is not an optimization or good use. This is the basic infrastructure as agent deployment goes beyond initial experiments. Its case rests on two pillars: reducing coordination costs and enabling risk management, both for the humans building with the agents and for the agents themselves.

AI agents rely on tools that retrieve data, write records, trigger workflows, and call external APIs. according to McKinseyIn most large organizations these tools are created by separate teams in an ad hoc manner: undocumented, ungoverned, and invisible to the rest of the organization. This pattern is familiar to most engineering leaders, and its fragmentation compounds with the deployment of every new agent. Teams rebuild what already exists elsewhere, security reviews miss devices that were never registered, and when something breaks, no one has a complete picture of what’s going on or why.

Coordination failure at infrastructure scale

The software industry solved a similar problem with package managers decades ago. Centralized registries gave teams a way to find, trust, and control shared code. The lesson was clear: preventing duplication and incompatibility is an infrastructure problem, not a discipline problem.

The agent era presents a similar problem in a new domain. When? Kong launches its enterprise MCP registry In February 2026, they clearly pointed out the problems of manual MCP configuration, hardcoded and managed tool isolation across teams, fragmented integration, and limited organization visibility.

Fragmented device development is not the result of poor engineering practice. Rather, it is the predictable result of asking teams to solve an infrastructure problem at the application level.

visibility problem

“Gravity’s”State of AI Agent Security 2026“The survey reveals what happens when agent tooling is invisible to those responsible for securing it. The survey found that only 14.4% of teams with agents beyond the planning stage have full security approval, and 88% of organizations had an agent-related security incident this year. Bad practices like shared API keys are endemic, with only 22% of organizations treating agents as independent identities. This governance gap turns agents from productivity boosters to high-velocity liabilities. which are capable of performing unauthorized actions or leaking sensitive data before a human can intervene.

The story is clear: adoption is outpacing governance, and the race to speed is requiring old lessons to be re-taught. Most deployed agents (and the MCP servers powering them!) are operating without any security sign-off. This is not primarily a resource failure, and it is not something that the registry alone can solve. Security teams can’t review what they can’t find, and without a registry, searches are manual, incomplete, and out of date. A registry does not make a device inherently secure; Rather, it makes security work possible by ensuring that tools do not exist as temporary, ad-hoc shims, but as invented artifacts that can be audited and policy attached to.

It’s worth revisiting public package managers here. These registries are not able to eliminate many security problems, issues such as typos, malicious packages, and dependency confusion, clearly showing that centralization alone is not a security solution. But they also show the opposite: the registry is a prerequisite for security. Many community responses to violations in these ecosystems provide centralization of power. Centralization does not guarantee security, but decentralization loses the means to coordinate it.

Governance requires a shared context

The default state in most agent deployments is permissive: devices are available unless explicitly blocked. AgilityFeat’s Analysis of Enterprise AI Guardrails This recognizes the structural risk this creates, as an architecture not built on default-by-default increases risk and creates maintenance costs.

Permission-by-default, replicated across dozens of independent agent deployments, creates an attack surface that grows with adoption. Reversing this requires a coordination point, a shared, organization-wide reference. The registry itself is not a governance layer, but it makes governance possible. When every device an agent can use is registered with ownership, version, and review status, the governance layer has something concrete to enforce. Without that context, the policy has to be re-implemented by each consumer team, and consistency becomes impossible.

Fronteg’s framework for AI agent governance outlines what that policy layer looks like operationally: Agent actions are mapped to clear, granular guardrails that define operational boundaries for any agent attempt or execution. These handrails remain outside the registry, but they are dependent on it. A guardrail that references a device the security team has never heard of cannot be written in the first place.

What’s needed for a production-grade tool catalog

A mature enterprise tool registry has two main functions, search and versioning, and serves as the basis for two others: authentication metadata and access control. Think of it as an internal developer portal (IDP) built for the agent era, solving the same coordination problem that IDPs solved for service teams, but one layer up.

Discovery allows any team building an agent to discover existing tools before writing new tools. By having proprietary metadata, version history, and usage metrics centralized, duplication is reduced not through mandates but through reduced friction. A well-designed catalog goes far beyond a flat list: devices should be grouped hierarchically by functional domain so that both humans and agents can quickly find the relevant capabilities.

Versioning bridges a gap that neither search nor access controls: when the agent’s behavior changes, why did it change? A tool registry that tracks versions gives enterprises the visibility to answer that question. Was it the model? A tool prompt update? An underlying API change? Without proper versioning, finding the answer turns from a simple difference comparison to a time-consuming, manual investigation.

Certification status (things like security approval, API contract validation, PII handling checks) is metadata that is exposed to the registry, not a limitation that the registry itself imposes. The actual review work happens through the security organization’s existing tooling. The registry’s contribution is making the results of that review visible at the time a team is deciding whether to adopt a device, ensuring that the review actually informs the decision it was intended to inform.

Access control works the same way. A policy layer enforces authorization across the scope of agent identity, team, environment, and type of action, reading from the registry to determine which devices exist and who owns them. Centralization of the registry lets access controls be implemented consistently, rather than forcing each team to come up with something special.

None of this is achievable when each team maintains its own separate tooling stack. Platform teams already understand why IDPs exist. The value of the paradigm is no different in the agent context.

compound cost of inaction

The costs of inaction are direct, and not just operational and security-related. Without a searchable, well-organized list of tools, teams are constantly reinventing the wheel, because it is easier to create a tool than to find one that already exists. Duplication means redundancy and technical debt. A registry converts that unnecessary expense into real functionality by making devices searchable and reusable.

For platform engineering teams, the trajectory is clear. The adoption of agents is increasing, along with it is increasing tool duplication, and small-scale shims will not be able to keep up as the number of agents and tools increases. The safety risks documented in the gravity survey would be broader, not narrower, without structural intervention.

Organizations that now build centralized device infrastructure will be able to quickly onboard new agents, continuously control them, and audit them if something goes wrong. Those who procrastinate will rediscover the hard way what platform teams learned a decade ago: coordination problems don’t solve themselves at the application level. They compound there.

The case of AI tool registries – O’Reilly

Coordination failure at infrastructure scale

visibility problem

Governance requires a shared context

What’s needed for a production-grade tool catalog

compound cost of inaction

Download: AI Malaise and Baby Making Technology

A coding implementation to recover hidden malware IOCs with flare-floss beyond classic strings analysis

Related Articles

Leave a Comment Cancel Reply