Hosting & Deployment

Hosting platforms and edge networks for deploying AI agent applications

ToolTypePricingOSSServerlessContainersEdge ComputeAi ToolingMcp HostingVerified
Vercelcloud
Free (Hobby)$20/user/mo ProCustom Enterprise
2026-04-28
Cloudflarecloud
Free (Workers: 100k requests/day)$5/mo Workers Paid$25/mo ProCustom Enterprise
2026-04-28
Netlifycloud
Free (100GB bandwidth)$19/user/mo ProCustom Enterprise
2026-04-28
Railwaycloud
Free trial ($5 credit)$5/mo Hobby$20/seat/mo ProCustom Enterprise
2026-04-28
Fly.iocloud
Free (3 shared VMs)Pay-as-you-go ($0.0015/hr per shared CPU)$29/mo LaunchCustom Enterprise
2026-04-28

Supported Not supported Unverified

What do these features mean?
  • Serverless Serverless function execution with auto-scaling
  • Containers Run persistent containers or Docker images
  • Edge Compute Execute code at the edge, close to users globally
  • Ai Tooling Built-in AI/LLM inference or AI-specific SDK
  • Mcp Hosting First-party support for deploying MCP servers

Missing a tool in this category? Use the add-tool skill to generate the file, then open a PR.

npx skills run add-tool
Open a PR →

Hosting & Deployment

Where you deploy your AI agent application affects latency, cost, runtime constraints, and what kinds of workloads you can run. The platforms in this category range from serverless-first (Vercel, Netlify) to container-based (Railway, Fly.io) to edge-native (Cloudflare).

The key architectural decision is whether your agent workload fits a serverless model or needs persistent compute. Serverless is great for request-response patterns — a user sends a message, the agent processes it, returns a response. But agents that run long tasks, maintain WebSocket connections, or do background processing often hit serverless timeout limits.

What to consider:

  • Runtime model — serverless functions (Vercel, Netlify, Cloudflare Workers) have execution time limits, typically 10-60 seconds on free tiers. Container platforms (Railway, Fly.io) run persistent processes with no timeout. If your agent needs to run multi-step tasks that take minutes, serverless won't work without breaking the task into smaller chunks.
  • AI-specific tooling — Vercel ships the Vercel AI SDK with built-in streaming, tool calling, and structured output. Cloudflare has Workers AI for on-edge inference. The others are runtime-agnostic — they'll host your agent but don't provide AI-specific primitives.
  • Edge vs. origin — Cloudflare Workers and Vercel Edge Functions run code at the edge (close to users). Railway and Fly.io run containers in specific regions (Fly can distribute globally). Edge is better for latency-sensitive interactions; origin is simpler to reason about.
  • MCP server hosting — Cloudflare has first-party support for deploying MCP servers on Workers with built-in OAuth. Other platforms can host MCP servers but require manual setup.
  • Cost model — serverless platforms charge per invocation (cheap at low volume, unpredictable at scale). Container platforms charge per compute-hour (predictable, but you pay even when idle). AWS/GCP are notably absent from this list — they're infrastructure, not deployment platforms in the same sense.

For most AI agent apps, start with whichever platform matches your framework. Next.js on Vercel is the path of least resistance. Python agents that need persistent processes fit better on Railway or Fly.io. Cloudflare is the play if you need global edge compute or MCP server hosting.