{"id":90991,"date":"2026-05-17T16:34:57","date_gmt":"2026-05-17T16:34:57","guid":{"rendered":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/"},"modified":"2026-05-17T16:34:57","modified_gmt":"2026-05-17T16:34:57","slug":"meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production","status":"publish","type":"post","link":"https:\/\/youzum.net\/ja\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/","title":{"rendered":"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production"},"content":{"rendered":"<p>Running AI agents in a local script is straightforward. Running them reliably in production across teams, across restarts, with isolated environments per context is a different problem entirely. BerriAI, the company behind the LiteLLM AI Gateway, is now open-sourcing a purpose-built answer to that problem: the <strong><a href=\"https:\/\/github.com\/BerriAI\/litellm-agent-platform\" target=\"_blank\" rel=\"noreferrer noopener\">LiteLLM Agent Platform<\/a><\/strong>. The platform is described as a simple, self-hosted infrastructure platform for running multiple agents in production. <\/p>\n<h2 class=\"wp-block-heading\"><strong>What Problem Does it Solve?<\/strong><\/h2>\n<p>It helps to understand what happens when you try to scale agents beyond a single process. Agents are stateful: they carry session history, tool call results, and intermediate reasoning across turns. If the container running your agent crashes, restarts, or gets replaced during a deployment, that session state is gone unless something is explicitly managing it. At the same time, different teams often need different runtime environments, different tools, different secrets, different access scopes which means you cannot throw all agents into one shared container.<\/p>\n<p>The platform manages two things: per-team and per-context sandboxes, and session continuity across pod restarts and upgrades. These two capabilities are the core infrastructure primitives the platform provides.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Architecture and Technical Stack<\/strong><\/h2>\n<p>The platform is a standalone Next.js dashboard for LiteLLM v2 managed agents, covering sessions chat, agent CRUD, and live status. The codebase is primarily TypeScript (92.8%), with Shell scripts for provisioning, a Dockerfile for containerization, and CSS for the dashboard UI.<\/p>\n<p>The architecture separates concerns cleanly. A <strong>web process<\/strong> runs on port 3000 and serves the Next.js dashboard. A <strong>worker process<\/strong> handles async agent tasks. <strong>Postgres<\/strong> is used as the persistent backing store, and a schema migration runs as an init container on startup \u2014 so the database is always in the correct state before the application boots.<\/p>\n<p>For the sandbox layer \u2014 the isolated runtime environment where agents actually execute \u2014 sandboxes run on Kubernetes via the <code>kubernetes-sigs\/agent-sandbox<\/code> CRD. Local development uses kind. If you are not already familiar with it: <code>kind<\/code> (Kubernetes in Docker) lets you spin up a full Kubernetes cluster locally using Docker containers as nodes, without needing a cloud provider. The <code>agent-sandbox<\/code> CRD (Custom Resource Definition) is a Kubernetes extension from <code>kubernetes-sigs<\/code> that the platform installs to manage the lifecycle of individual sandbox environments.<\/p>\n<p>The platform also includes a harness system under <code>harnesses\/opencode<\/code>, which contains the configuration for running coding agents \u2014 such as Claude Code or OpenAI Codex \u2014 inside isolated sandboxes with a vault proxy for credential management. BerriAI team also maintains a separate <code>litellm-agent-runtime<\/code> repository, described as a coding-agent runtime that runs inside per-session VMs provisioned by a LiteLLM proxy, generic by design, with customization happening via harness configuration or a hydrate payload.<\/p>\n<p>One practical detail worth noting is how environment variables are handled across sandbox containers. Anything in <code>.env<\/code> prefixed with <code>CONTAINER_ENV_<\/code> is injected into every sandbox container with the prefix stripped \u2014 for example, <code>CONTAINER_ENV_GITHUB_TOKEN=ghp_...<\/code> means the container sees <code>GITHUB_TOKEN=ghp_...<\/code> This gives teams a clean way to pass secrets into sandboxed agent sessions without modifying container images.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"964\" height=\"720\" data-attachment-id=\"79915\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/16\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1.gif\" data-orig-size=\"964,720\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1.gif\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1.gif\" alt=\"\" class=\"wp-image-79915\" \/><figcaption class=\"wp-element-caption\">https:\/\/github.com\/BerriAI\/litellm-agent-platform<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Getting Started<\/strong><\/h2>\n<p>The prerequisites for local development are Docker Desktop, <code>kind<\/code>, <code>kubectl<\/code>, <code>helm<\/code>, and a LiteLLM gateway. No cloud credentials are required to get started locally. The quickstart is two commands:<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\"no-line-numbers\"><code class=\"no-wrap language-php\">bin\/kind-up.sh\ndocker compose up<\/code><\/pre>\n<\/div>\n<\/div>\n<p><code>bin\/kind-up.sh<\/code> is idempotent \u2014 it provisions a kind cluster named <code>agent-sbx<\/code>, installs the agent-sandbox controller, and loads the harness image. <code>docker compose up<\/code> boots Postgres, runs the schema migration, and starts the web process on port 3000 along with the worker.<\/p>\n<p>For production deployment, the recommended path is AWS EKS for the sandbox cluster and Render for the web and worker processes. <code>bin\/eks-up.sh<\/code> provisions the EKS cluster, and a Render Blueprint provides a one-click deployment option. <\/p>\n<h2 class=\"wp-block-heading\"><strong>Relationship to the LiteLLM Gateway<\/strong><\/h2>\n<p>The Agent Platform is a layer on top of the existing LiteLLM ecosystem, not a replacement for it. LiteLLM\u2019s core is a Python SDK and Proxy Server \u2014 an AI Gateway \u2014 that calls 100+ LLM APIs in OpenAI format, with cost tracking, guardrails, load balancing, and logging, supporting providers including Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, SageMaker, HuggingFace, vLLM, and NVIDIA NIM. The Agent Platform consumes a running LiteLLM gateway as a dependency and builds agent orchestration and session management infrastructure on top of it. Model routing, cost tracking, and rate limiting remain in the gateway layer. Sandbox isolation, session continuity, and the management dashboard are handled by the Agent Platform.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Marktechpost\u2019s Visual Explainer<\/strong><\/h2>\n<div>\n<p>  <!-- Header --><\/p>\n<div class=\"llmap-header\">\n<div class=\"llmap-logo\">\n      <svg viewbox=\"0 0 24 24\" fill=\"none\" stroke=\"#000\">\n        <path d=\"M12 2L2 7l10 5 10-5-10-5z\"><\/path><path d=\"M2 17l10 5 10-5\"><\/path><path d=\"M2 12l10 5 10-5\"><\/path>\n      <\/svg>\n    <\/div>\n<div>\n<div class=\"llmap-title\">LiteLLM Agent Platform<\/div>\n<div class=\"llmap-subtitle\">Self-Hosted Agent Infrastructure Guide<\/div>\n<\/div>\n<div class=\"llmap-badge\">Alpha<\/div>\n<\/div>\n<p>  <!-- Progress --><\/p>\n<div class=\"llmap-progress-bar\">\n<div class=\"llmap-progress-fill\"><\/div>\n<\/div>\n<p>  <!-- Tabs --><\/p>\n<div class=\"llmap-tabs\">\n<div class=\"llmap-tab active\" data-slide=\"0\">Overview<\/div>\n<div class=\"llmap-tab\" data-slide=\"1\">Concepts<\/div>\n<div class=\"llmap-tab\" data-slide=\"2\">Architecture<\/div>\n<div class=\"llmap-tab\" data-slide=\"3\">Prerequisites<\/div>\n<div class=\"llmap-tab\" data-slide=\"4\">Quickstart<\/div>\n<div class=\"llmap-tab\" data-slide=\"5\">Production<\/div>\n<\/div>\n<p>  <!-- Slides --><\/p>\n<div class=\"llmap-slides\">\n<p>    <!-- Slide 0: Overview --><\/p>\n<div class=\"llmap-slide active\" data-index=\"0\">\n<div class=\"llmap-slide-num\">01 \/ 06<\/div>\n<div class=\"llmap-slide-title\">What is LiteLLM Agent Platform?<\/div>\n<div class=\"llmap-slide-desc\">BerriAI open-sourced this platform on May 8, 2026. It is a self-hosted infrastructure layer for running multiple AI agents in production, built on top of the LiteLLM AI Gateway.<\/div>\n<div class=\"llmap-cards\">\n<div class=\"llmap-card\">\n<div class=\"llmap-card-icon\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/17.0.2\/72x72\/1f9f1.png\" alt=\"\ud83e\uddf1\" class=\"wp-smiley\" \/><\/div>\n<div class=\"llmap-card-title\">Self-Hosted<\/div>\n<div class=\"llmap-card-text\">Runs entirely on your own infrastructure. No data leaves your environment. Suited for regulated industries and teams with data residency requirements.<\/div>\n<\/div>\n<div class=\"llmap-card\">\n<div class=\"llmap-card-icon\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/17.0.2\/72x72\/1f916.png\" alt=\"\ud83e\udd16\" class=\"wp-smiley\" \/><\/div>\n<div class=\"llmap-card-title\">Multi-Agent<\/div>\n<div class=\"llmap-card-text\">Designed to run multiple agents in parallel, with full isolation between teams and contexts using per-session sandboxes.<\/div>\n<\/div>\n<div class=\"llmap-card\">\n<div class=\"llmap-card-icon\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/17.0.2\/72x72\/1f501.png\" alt=\"\ud83d\udd01\" class=\"wp-smiley\" \/><\/div>\n<div class=\"llmap-card-title\">Session Continuity<\/div>\n<div class=\"llmap-card-text\">Agent sessions persist across pod restarts and upgrades, so stateful work is not lost when containers are replaced.<\/div>\n<\/div>\n<div class=\"llmap-card\">\n<div class=\"llmap-card-icon\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/17.0.2\/72x72\/26a1.png\" alt=\"\u26a1\" class=\"wp-smiley\" \/><\/div>\n<div class=\"llmap-card-title\">Open Source (MIT)<\/div>\n<div class=\"llmap-card-text\">Fully open source under the MIT license. Repo: github.com\/BerriAI\/litellm-agent-platform. File issues and contribute directly.<\/div>\n<\/div>\n<\/div>\n<div class=\"llmap-note\">\n<div class=\"llmap-note-label\">Prerequisite Knowledge<\/div>\n<div class=\"llmap-note-text\">This guide assumes familiarity with Docker, basic command-line usage, and a general understanding of what an AI agent is (a model that calls tools and runs multi-step tasks). Kubernetes experience helps but is not required to follow along.<\/div>\n<\/div>\n<\/div>\n<p>    <!-- Slide 1: Concepts --><\/p>\n<div class=\"llmap-slide\" data-index=\"1\">\n<div class=\"llmap-slide-num\">02 \/ 06<\/div>\n<div class=\"llmap-slide-title\">Key Concepts to Know First<\/div>\n<div class=\"llmap-slide-desc\">Before running the platform, understand these four building blocks. They appear throughout the setup and configuration.<\/div>\n<ul class=\"llmap-steps\">\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">A<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">LiteLLM Gateway<\/div>\n<div class=\"llmap-step-text\">The underlying AI Gateway that the Agent Platform depends on. It routes requests to 100+ LLM providers (OpenAI, Anthropic, Bedrock, VertexAI, etc.) using a unified OpenAI-format API. The Agent Platform does not include the gateway, you must have one running separately and point the platform at it.<\/div>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">B<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Sandbox<\/div>\n<div class=\"llmap-step-text\">An isolated container environment where a single agent session executes. Each sandbox is independent, meaning one agent cannot access the filesystem, secrets, or state of another. Sandboxes are provisioned and torn down per session using the <span class=\"llmap-inline-code\">kubernetes-sigs\/agent-sandbox<\/span> CRD (Custom Resource Definition).<\/div>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">C<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Harness<\/div>\n<div class=\"llmap-step-text\">A configuration layer that defines how a specific type of coding agent (such as Claude Code or OpenAI Codex) runs inside a sandbox. The platform ships with an <span class=\"llmap-inline-code\">opencode<\/span> harness under <span class=\"llmap-inline-code\">harnesses\/opencode\/<\/span>. The harness image is loaded into the kind cluster during setup.<\/div>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">D<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">CRD (Custom Resource Definition)<\/div>\n<div class=\"llmap-step-text\">A Kubernetes extension that lets you define new resource types. The platform uses the <span class=\"llmap-inline-code\">kubernetes-sigs\/agent-sandbox<\/span> CRD to teach your Kubernetes cluster how to manage agent sandboxes as first-class resources, the same way it manages pods or deployments.<\/div>\n<\/div>\n<\/li>\n<\/ul><\/div>\n<p>    <!-- Slide 2: Architecture --><\/p>\n<div class=\"llmap-slide\" data-index=\"2\">\n<div class=\"llmap-slide-num\">03 \/ 06<\/div>\n<div class=\"llmap-slide-title\">How the Platform Is Structured<\/div>\n<div class=\"llmap-slide-desc\">The platform has four main components. Understanding how they connect helps when debugging or deploying to production.<\/div>\n<table class=\"llmap-table\">\n<thead>\n<tr>\n<th>Component<\/th>\n<th>What It Does<\/th>\n<th>Tech<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><span class=\"llmap-inline-code\">web (:3000)<\/span><\/td>\n<td>Next.js dashboard. Provides the UI for sessions chat, agent CRUD operations, and live status monitoring.<\/td>\n<td>Next.js, TypeScript<\/td>\n<\/tr>\n<tr>\n<td><span class=\"llmap-inline-code\">worker<\/span><\/td>\n<td>Background process that handles async agent tasks, decoupled from the web server.<\/td>\n<td>TypeScript<\/td>\n<\/tr>\n<tr>\n<td><span class=\"llmap-inline-code\">postgres<\/span><\/td>\n<td>Persistent backing store for session state, agent configs, and metadata. Schema migration runs automatically as an init container on startup.<\/td>\n<td>PostgreSQL<\/td>\n<\/tr>\n<tr>\n<td><span class=\"llmap-inline-code\">sandbox cluster<\/span><\/td>\n<td>Kubernetes cluster where individual agent sandboxes run, managed via the agent-sandbox CRD controller. Locally: kind. In production: AWS EKS.<\/td>\n<td>Kubernetes (kind \/ EKS)<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"llmap-note\">\n<div class=\"llmap-note-label\">Separation of Concerns<\/div>\n<div class=\"llmap-note-text\">The LiteLLM gateway handles model routing, cost tracking, rate limiting, and guardrails. The Agent Platform handles sandbox lifecycle, session management, and the management dashboard. They run as separate services and the Agent Platform consumes the gateway as a dependency.<\/div>\n<\/div>\n<\/div>\n<p>    <!-- Slide 3: Prerequisites --><\/p>\n<div class=\"llmap-slide\" data-index=\"3\">\n<div class=\"llmap-slide-num\">04 \/ 06<\/div>\n<div class=\"llmap-slide-title\">Prerequisites Before You Start<\/div>\n<div class=\"llmap-slide-desc\">Install and verify these tools before running any setup commands. The quickstart will not work without all five.<\/div>\n<ul class=\"llmap-steps\">\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">1<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Docker Desktop<\/div>\n<div class=\"llmap-step-text\">Required to build and run containers, and to power kind (which runs Kubernetes nodes as Docker containers). Download from docker.com\/products\/docker-desktop. Verify with:<\/div>\n<pre><code>docker --version<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">2<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">kind (Kubernetes in Docker)<\/div>\n<div class=\"llmap-step-text\">Used to provision a local Kubernetes cluster for running sandboxes. Install via Homebrew on macOS (<span class=\"llmap-inline-code\">brew install kind<\/span>) or from kind.sigs.k8s.io. Verify with:<\/div>\n<pre><code>kind --version<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">3<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">kubectl<\/div>\n<div class=\"llmap-step-text\">The Kubernetes command-line tool. Used by the setup scripts to interact with the kind cluster. Install from kubernetes.io\/docs\/tasks\/tools. Verify with:<\/div>\n<pre><code>kubectl version --client<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">4<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">helm<\/div>\n<div class=\"llmap-step-text\">The Kubernetes package manager. Used to install the agent-sandbox controller into the kind cluster. Install from helm.sh\/docs\/intro\/install. Verify with:<\/div>\n<pre><code>helm version<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">5<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">A Running LiteLLM Gateway<\/div>\n<div class=\"llmap-step-text\">The Agent Platform requires a LiteLLM gateway URL to route model calls. If you do not have one running, start with the official LiteLLM quickstart at docs.litellm.ai. You will point the Agent Platform at this URL during configuration.<\/div>\n<\/div>\n<\/li>\n<\/ul><\/div>\n<p>    <!-- Slide 4: Quickstart --><\/p>\n<div class=\"llmap-slide\" data-index=\"4\">\n<div class=\"llmap-slide-num\">05 \/ 06<\/div>\n<div class=\"llmap-slide-title\">Local Quickstart<\/div>\n<div class=\"llmap-slide-desc\">Clone the repo and run two commands to get the full platform running locally. No cloud credentials needed for local development.<\/div>\n<ul class=\"llmap-steps\">\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">1<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Clone the repository<\/div>\n<div class=\"llmap-step-text\">Pull the repo from GitHub:<\/div>\n<pre><code>git clone https:\/\/github.com\/BerriAI\/litellm-agent-platform\ncd litellm-agent-platform<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">2<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Configure your .env file<\/div>\n<div class=\"llmap-step-text\">Copy the example env file and fill in your LiteLLM gateway URL and any secrets:<\/div>\n<pre><code>cp .env.example .env\n# Edit .env and set your LITELLM_GATEWAY_URL and other required values<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">3<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Provision the local kind cluster<\/div>\n<div class=\"llmap-step-text\">This script is idempotent, meaning safe to run multiple times. It provisions a kind cluster named <span class=\"llmap-inline-code\">agent-sbx<\/span>, installs the agent-sandbox controller via helm, and loads the harness image:<\/div>\n<pre><code>bin\/kind-up.sh<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">4<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Start all services<\/div>\n<div class=\"llmap-step-text\">Boots Postgres, runs the schema migration as an init container, and starts the web server on port 3000 and the worker process:<\/div>\n<pre><code>docker compose up<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">5<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Open the dashboard<\/div>\n<div class=\"llmap-step-text\">Navigate to <span class=\"llmap-inline-code\">http:\/\/localhost:3000<\/span> in your browser. You should see the LiteLLM Agent Platform dashboard with options to create agents, open sessions, and monitor live status.<\/div>\n<\/div>\n<\/li>\n<\/ul>\n<div class=\"llmap-note\">\n<div class=\"llmap-note-label\">Passing Secrets into Sandboxes<\/div>\n<div class=\"llmap-note-text\">Any variable in <span class=\"llmap-inline-code\">.env<\/span> prefixed with <span class=\"llmap-inline-code\">CONTAINER_ENV_<\/span> is automatically injected into every sandbox container with the prefix stripped. Example: <span class=\"llmap-inline-code\">CONTAINER_ENV_GITHUB_TOKEN=ghp_\u2026<\/span> means the sandbox sees <span class=\"llmap-inline-code\">GITHUB_TOKEN=ghp_\u2026<\/span> This is the correct way to pass credentials into agent sessions.<\/div>\n<\/div>\n<\/div>\n<p>    <!-- Slide 5: Production --><\/p>\n<div class=\"llmap-slide\" data-index=\"5\">\n<div class=\"llmap-slide-num\">06 \/ 06<\/div>\n<div class=\"llmap-slide-title\">Production Deployment<\/div>\n<div class=\"llmap-slide-desc\">The recommended production setup separates the sandbox cluster (AWS EKS) from the web and worker processes (Render). The repo ships scripts and a Blueprint for both.<\/div>\n<ul class=\"llmap-steps\">\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">1<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Provision the EKS sandbox cluster<\/div>\n<div class=\"llmap-step-text\">The <span class=\"llmap-inline-code\">bin\/eks-up.sh<\/span> script provisions an AWS EKS cluster configured to run agent sandboxes. This replaces kind as the sandbox backend. Requires AWS credentials in your environment:<\/div>\n<pre><code>bin\/eks-up.sh<\/code><\/pre>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">2<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Deploy web and worker to Render<\/div>\n<div class=\"llmap-step-text\">The repo includes a Render Blueprint under <span class=\"llmap-inline-code\">deploy\/render\/<\/span> that deploys the web and worker services to Render with one click. See <span class=\"llmap-inline-code\">deploy\/render\/README.md<\/span> for the Blueprint URL and required environment variables.<\/div>\n<\/div>\n<\/li>\n<p>        <span class=\"llmap-step-line\"><\/span><\/p>\n<li class=\"llmap-step\">\n<div class=\"llmap-step-num\">3<\/div>\n<div class=\"llmap-step-content\">\n<div class=\"llmap-step-label\">Use the Developer API directly (optional)<\/div>\n<div class=\"llmap-step-text\">You can interact with the platform programmatically via its REST API using curl or any HTTP client. The full API reference covering how to create an agent, open a session, send a message, and read the reply is at <span class=\"llmap-inline-code\">src\/server\/DEVELOPER.md<\/span> in the repo.<\/div>\n<pre><code># Example: create an agent session via curl\ncurl -X POST http:\/\/localhost:3000\/api\/sessions \n  -H \"Content-Type: application\/json\" \n  -d '{\"agent_id\": \"your-agent-id\"}'<\/code><\/pre>\n<\/div>\n<\/li>\n<\/ul>\n<div class=\"llmap-note\">\n<div class=\"llmap-note-label\">Architecture Summary for Production<\/div>\n<div class=\"llmap-note-text\">AWS EKS runs the sandbox cluster where agent sessions execute in isolation. Render hosts the Next.js web dashboard and the async worker. Postgres (managed or self-hosted) persists session state. The LiteLLM gateway runs separately and handles all model API routing. These four components communicate over the network and can be scaled independently.<\/div>\n<\/div>\n<div>\n        Platform is currently in alpha public preview. File issues at github.com\/BerriAI\/litellm-agent-platform. Architecture details at <span class=\"llmap-inline-code\">docs\/k8s-backend.md<\/span> in the repo.\n      <\/div>\n<\/div>\n<\/div>\n<p><!-- end slides --><\/p>\n<p>  <!-- Nav footer --><\/p>\n<div class=\"llmap-nav\">\n    <button class=\"llmap-nav-btn\" disabled>\u2190 Prev<\/button><br \/>\n    <span class=\"llmap-nav-counter\">1 \/ 6<\/span><br \/>\n    <button class=\"llmap-nav-btn primary\">Next \u2192<\/button>\n  <\/div>\n<p>  <!-- Marktechpost tagline --><\/p>\n<div class=\"llmap-mtp\">\n<div class=\"llmap-mtp-dot\"><\/div>\n<div class=\"llmap-mtp-text\">Published by <a href=\"https:\/\/www.marktechpost.com\/\" target=\"_blank\">Marktechpost<\/a> \u00a0|\u00a0 AI\/ML News and Research for Developers and Engineers<\/div>\n<div class=\"llmap-mtp-dot\"><\/div>\n<\/div>\n<\/div>\n<p><!-- end #llmap-guide --><\/p>\n<h2 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h2>\n<ul class=\"wp-block-list\">\n<li>BerriAI open-sourced LiteLLM Agent Platform, a self-hosted infrastructure layer for running multiple AI agents in production with per-team sandbox isolation and session continuity across pod restarts.<\/li>\n<li>Sandboxes run on Kubernetes via the <code>kubernetes-sigs\/agent-sandbox<\/code> CRD \u2014 locally with kind, in production with AWS EKS \u2014 no cloud credentials needed to get started.<\/li>\n<li>The platform sits on top of the existing LiteLLM Gateway, which handles model routing, cost tracking, and rate limiting across 100+ LLM providers in OpenAI format.<\/li>\n<li>The quickstart is two commands: <code>bin\/kind-up.sh<\/code> provisions the kind cluster and installs the sandbox controller; <code>docker compose up<\/code> boots Postgres, web (:3000), and worker.<\/li>\n<li>Released under MIT license and currently in alpha public preview<\/li>\n<\/ul>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out\u00a0the\u00a0<strong><a href=\"https:\/\/github.com\/BerriAI\/litellm-agent-platform\" target=\"_blank\" rel=\"noreferrer noopener\">GitHub Repo<\/a>.\u00a0<\/strong>Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">150k+ ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>. Wait! are you on telegram?\u00a0<strong><a href=\"https:\/\/t.me\/machinelearningresearchnews\" target=\"_blank\" rel=\"noreferrer noopener\">now you can join us on telegram as well.<\/a><\/strong><\/p>\n<p>Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.?\u00a0<strong><a href=\"https:\/\/forms.gle\/MTNLpmJtsFA3VRVd9\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Connect with us<\/mark><\/a><\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/05\/16\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\">Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Running AI agents in a local script is straightforward. Running them reliably in production across teams, across restarts, with isolated environments per context is a different problem entirely. BerriAI, the company behind the LiteLLM AI Gateway, is now open-sourcing a purpose-built answer to that problem: the LiteLLM Agent Platform. The platform is described as a simple, self-hosted infrastructure platform for running multiple agents in production. What Problem Does it Solve? It helps to understand what happens when you try to scale agents beyond a single process. Agents are stateful: they carry session history, tool call results, and intermediate reasoning across turns. If the container running your agent crashes, restarts, or gets replaced during a deployment, that session state is gone unless something is explicitly managing it. At the same time, different teams often need different runtime environments, different tools, different secrets, different access scopes which means you cannot throw all agents into one shared container. The platform manages two things: per-team and per-context sandboxes, and session continuity across pod restarts and upgrades. These two capabilities are the core infrastructure primitives the platform provides. Architecture and Technical Stack The platform is a standalone Next.js dashboard for LiteLLM v2 managed agents, covering sessions chat, agent CRUD, and live status. The codebase is primarily TypeScript (92.8%), with Shell scripts for provisioning, a Dockerfile for containerization, and CSS for the dashboard UI. The architecture separates concerns cleanly. A web process runs on port 3000 and serves the Next.js dashboard. A worker process handles async agent tasks. Postgres is used as the persistent backing store, and a schema migration runs as an init container on startup \u2014 so the database is always in the correct state before the application boots. For the sandbox layer \u2014 the isolated runtime environment where agents actually execute \u2014 sandboxes run on Kubernetes via the kubernetes-sigs\/agent-sandbox CRD. Local development uses kind. If you are not already familiar with it: kind (Kubernetes in Docker) lets you spin up a full Kubernetes cluster locally using Docker containers as nodes, without needing a cloud provider. The agent-sandbox CRD (Custom Resource Definition) is a Kubernetes extension from kubernetes-sigs that the platform installs to manage the lifecycle of individual sandbox environments. The platform also includes a harness system under harnesses\/opencode, which contains the configuration for running coding agents \u2014 such as Claude Code or OpenAI Codex \u2014 inside isolated sandboxes with a vault proxy for credential management. BerriAI team also maintains a separate litellm-agent-runtime repository, described as a coding-agent runtime that runs inside per-session VMs provisioned by a LiteLLM proxy, generic by design, with customization happening via harness configuration or a hydrate payload. One practical detail worth noting is how environment variables are handled across sandbox containers. Anything in .env prefixed with CONTAINER_ENV_ is injected into every sandbox container with the prefix stripped \u2014 for example, CONTAINER_ENV_GITHUB_TOKEN=ghp_&#8230; means the container sees GITHUB_TOKEN=ghp_&#8230; This gives teams a clean way to pass secrets into sandboxed agent sessions without modifying container images. https:\/\/github.com\/BerriAI\/litellm-agent-platform Getting Started The prerequisites for local development are Docker Desktop, kind, kubectl, helm, and a LiteLLM gateway. No cloud credentials are required to get started locally. The quickstart is two commands: Copy CodeCopiedUse a different Browser bin\/kind-up.sh docker compose up bin\/kind-up.sh is idempotent \u2014 it provisions a kind cluster named agent-sbx, installs the agent-sandbox controller, and loads the harness image. docker compose up boots Postgres, runs the schema migration, and starts the web process on port 3000 along with the worker. For production deployment, the recommended path is AWS EKS for the sandbox cluster and Render for the web and worker processes. bin\/eks-up.sh provisions the EKS cluster, and a Render Blueprint provides a one-click deployment option. Relationship to the LiteLLM Gateway The Agent Platform is a layer on top of the existing LiteLLM ecosystem, not a replacement for it. LiteLLM\u2019s core is a Python SDK and Proxy Server \u2014 an AI Gateway \u2014 that calls 100+ LLM APIs in OpenAI format, with cost tracking, guardrails, load balancing, and logging, supporting providers including Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, SageMaker, HuggingFace, vLLM, and NVIDIA NIM. The Agent Platform consumes a running LiteLLM gateway as a dependency and builds agent orchestration and session management infrastructure on top of it. Model routing, cost tracking, and rate limiting remain in the gateway layer. Sandbox isolation, session continuity, and the management dashboard are handled by the Agent Platform. Marktechpost\u2019s Visual Explainer LiteLLM Agent Platform Self-Hosted Agent Infrastructure Guide Alpha Overview Concepts Architecture Prerequisites Quickstart Production 01 \/ 06 What is LiteLLM Agent Platform? BerriAI open-sourced this platform on May 8, 2026. It is a self-hosted infrastructure layer for running multiple AI agents in production, built on top of the LiteLLM AI Gateway. Self-Hosted Runs entirely on your own infrastructure. No data leaves your environment. Suited for regulated industries and teams with data residency requirements. Multi-Agent Designed to run multiple agents in parallel, with full isolation between teams and contexts using per-session sandboxes. Session Continuity Agent sessions persist across pod restarts and upgrades, so stateful work is not lost when containers are replaced. Open Source (MIT) Fully open source under the MIT license. Repo: github.com\/BerriAI\/litellm-agent-platform. File issues and contribute directly. Prerequisite Knowledge This guide assumes familiarity with Docker, basic command-line usage, and a general understanding of what an AI agent is (a model that calls tools and runs multi-step tasks). Kubernetes experience helps but is not required to follow along. 02 \/ 06 Key Concepts to Know First Before running the platform, understand these four building blocks. They appear throughout the setup and configuration. A LiteLLM Gateway The underlying AI Gateway that the Agent Platform depends on. It routes requests to 100+ LLM providers (OpenAI, Anthropic, Bedrock, VertexAI, etc.) using a unified OpenAI-format API. The Agent Platform does not include the gateway, you must have one running separately and point the platform at it. B Sandbox An isolated container environment where a single agent session executes. Each sandbox is independent, meaning one agent cannot access the filesystem, secrets, or state<\/p>","protected":false},"author":2,"featured_media":90992,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"pmpro_default_level":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"_pvb_checkbox_block_on_post":false,"footnotes":""},"categories":[52,5,7,1],"tags":[],"class_list":["post-90991","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-club","category-committee","category-news","category-uncategorized","pmpro-has-access"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production - YouZum<\/title>\n<meta name=\"description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/youzum.net\/ja\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\" \/>\n<meta property=\"og:locale\" content=\"ja_JP\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production - YouZum\" \/>\n<meta property=\"og:description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta property=\"og:url\" content=\"https:\/\/youzum.net\/ja\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\" \/>\n<meta property=\"og:site_name\" content=\"YouZum\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/DroneAssociationTH\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-05-17T16:34:57+00:00\" \/>\n<meta name=\"author\" content=\"admin NU\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u57f7\u7b46\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin NU\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593\" \/>\n\t<meta name=\"twitter:data2\" content=\"11\u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\"},\"author\":{\"name\":\"admin NU\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\"},\"headline\":\"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production\",\"datePublished\":\"2026-05-17T16:34:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\"},\"wordCount\":2040,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif\",\"articleSection\":[\"AI\",\"Committee\",\"News\",\"Uncategorized\"],\"inLanguage\":\"ja\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\",\"url\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\",\"name\":\"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production - YouZum\",\"isPartOf\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif\",\"datePublished\":\"2026-05-17T16:34:57+00:00\",\"description\":\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\",\"breadcrumb\":{\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#breadcrumb\"},\"inLanguage\":\"ja\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif\",\"width\":964,\"height\":720},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/youzum.net\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/yousum.gpucore.co\/#website\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"name\":\"YouSum\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/yousum.gpucore.co\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ja\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\",\"name\":\"Drone Association Thailand\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"width\":300,\"height\":300,\"caption\":\"Drone Association Thailand\"},\"image\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/DroneAssociationTH\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\",\"name\":\"admin NU\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"caption\":\"admin NU\"},\"url\":\"https:\/\/youzum.net\/ja\/members\/adminnu\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production - YouZum","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/youzum.net\/ja\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/","og_locale":"ja_JP","og_type":"article","og_title":"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production - YouZum","og_description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","og_url":"https:\/\/youzum.net\/ja\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/","og_site_name":"YouZum","article_publisher":"https:\/\/www.facebook.com\/DroneAssociationTH\/","article_published_time":"2026-05-17T16:34:57+00:00","author":"admin NU","twitter_card":"summary_large_image","twitter_misc":{"\u57f7\u7b46\u8005":"admin NU","\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593":"11\u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#article","isPartOf":{"@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/"},"author":{"name":"admin NU","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c"},"headline":"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production","datePublished":"2026-05-17T16:34:57+00:00","mainEntityOfPage":{"@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/"},"wordCount":2040,"commentCount":0,"publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"image":{"@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif","articleSection":["AI","Committee","News","Uncategorized"],"inLanguage":"ja","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/","url":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/","name":"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production - YouZum","isPartOf":{"@id":"https:\/\/yousum.gpucore.co\/#website"},"primaryImageOfPage":{"@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage"},"image":{"@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif","datePublished":"2026-05-17T16:34:57+00:00","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","breadcrumb":{"@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#breadcrumb"},"inLanguage":"ja","potentialAction":[{"@type":"ReadAction","target":["https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/"]}]},{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#primaryimage","url":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif","width":964,"height":720},{"@type":"BreadcrumbList","@id":"https:\/\/youzum.net\/meet-litellm-agent-platform-a-kubernetes-based-self-hosted-infrastructure-layer-for-isolated-agent-sandboxes-and-persistent-session-management-in-production\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/youzum.net\/"},{"@type":"ListItem","position":2,"name":"Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production"}]},{"@type":"WebSite","@id":"https:\/\/yousum.gpucore.co\/#website","url":"https:\/\/yousum.gpucore.co\/","name":"YouSum","description":"","publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/yousum.gpucore.co\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ja"},{"@type":"Organization","@id":"https:\/\/yousum.gpucore.co\/#organization","name":"Drone Association Thailand","url":"https:\/\/yousum.gpucore.co\/","logo":{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","width":300,"height":300,"caption":"Drone Association Thailand"},"image":{"@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/DroneAssociationTH\/"]},{"@type":"Person","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c","name":"admin NU","image":{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","caption":"admin NU"},"url":"https:\/\/youzum.net\/ja\/members\/adminnu\/"}]}},"rttpg_featured_image_url":{"full":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif",964,720,false],"landscape":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif",964,720,false],"portraits":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif",964,720,false],"thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH-150x150.gif",150,150,true],"medium":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH-300x224.gif",300,224,true],"large":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif",964,720,false],"1536x1536":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif",964,720,false],"2048x2048":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH.gif",964,720,false],"trp-custom-language-flag":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH-16x12.gif",16,12,true],"woocommerce_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH-300x300.gif",300,300,true],"woocommerce_single":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH-600x448.gif",600,448,true],"woocommerce_gallery_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/05\/593413924-a68b21fa-20b6-4ddc-a011-1bc9e6a3b296-1-KtzVuH-100x100.gif",100,100,true]},"rttpg_author":{"display_name":"admin NU","author_link":"https:\/\/youzum.net\/ja\/members\/adminnu\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/youzum.net\/ja\/category\/ai-club\/\" rel=\"category tag\">AI<\/a> <a href=\"https:\/\/youzum.net\/ja\/category\/committee\/\" rel=\"category tag\">Committee<\/a> <a href=\"https:\/\/youzum.net\/ja\/category\/news\/\" rel=\"category tag\">News<\/a> <a href=\"https:\/\/youzum.net\/ja\/category\/uncategorized\/\" rel=\"category tag\">Uncategorized<\/a>","rttpg_excerpt":"Running AI agents in a local script is straightforward. Running them reliably in production across teams, across restarts, with isolated environments per context is a different problem entirely. BerriAI, the company behind the LiteLLM AI Gateway, is now open-sourcing a purpose-built answer to that problem: the LiteLLM Agent Platform. The platform is described as a&hellip;","_links":{"self":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/posts\/90991","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/comments?post=90991"}],"version-history":[{"count":0,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/posts\/90991\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/media\/90992"}],"wp:attachment":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/media?parent=90991"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/categories?post=90991"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/tags?post=90991"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}