Goli Bhargav

Reference architecture

Multi-tenant LLM platform

A production multi-tenant LLM platform is built from ten components across three layers: edge concerns, the platform core, and cross-cutting capabilities. The diagram below is interactive: click any component to read the design rationale, the trade-off most worth surfacing, and the choice I’d make in production.

[Interactive diagram. Layers: Edge, Core, Cross-cutting. Components: API Gateway, Identity & AuthZ, Model Gateway, Prompt Store, Tool / MCP Layer, Vector Store, Eval Framework, Governance / Policy, Observability, Control Plane.]