AI Gateway feature
Private Models
Early access
Run open-source LLMs as on-demand, serverless instances and route to them through the same Edgee gateway API you already use for public providers.
We’re building serverless provisioning and placement. Tell us your model and deployment constraints.
Capabilities
- On-demand, serverless OSS model instances
- Routing policies that can include private deployments
- Environment separation (dev/staging/prod) as the control plane matures
- Model/version pinning and regional placement controls (see the sketch after this list)
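To make these controls concrete, here is a minimal sketch of what a routing policy could look like, expressed as a TypeScript object. The shape and every field name (RoutingPolicy, targets, isPrivate, and so on) are illustrative assumptions, not Edgee's actual configuration schema.

```ts
// Hypothetical routing policy shape. Every field name here is an
// illustrative assumption, not Edgee's actual configuration schema.
interface RoutingTarget {
  model: string;       // stable gateway model name
  version?: string;    // model/version pinning
  region?: string;     // regional placement constraint
  isPrivate?: boolean; // true for a private, serverless deployment
}

interface RoutingPolicy {
  environment: "dev" | "staging" | "prod"; // environment separation
  targets: RoutingTarget[];                // evaluated in order
}

const policy: RoutingPolicy = {
  environment: "prod",
  targets: [
    // Private OSS deployment, pinned and placed in a specific region.
    { model: "my-private-llama", version: "3.1-70b", region: "eu-west-1", isPrivate: true },
    // Public provider as a secondary target.
    { model: "gpt-4o-mini" },
  ],
};
```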
How it works
- You choose a model and any constraints, such as region or performance targets.
- Edgee provisions the instance and exposes it behind a stable gateway model name.
- Your app calls Edgee as usual; routing can target the private model when appropriate (see the sketch after this list).
- Observability and cost signals remain centralized at the gateway.
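A minimal sketch of the calling side, assuming an OpenAI-compatible chat completions surface: the base URL, path, header names, and the my-private-llama model name are placeholders, not Edgee's documented API.

```ts
// Minimal sketch of a gateway call, assuming an OpenAI-compatible
// chat completions surface. The base URL, path, and model names are
// placeholders, not Edgee's documented API.
async function complete(prompt: string, model: string): Promise<string> {
  const res = await fetch("https://gateway.edgee.example/v1/chat/completions", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.EDGEE_API_KEY}`,
    },
    body: JSON.stringify({
      model, // stable gateway model name, e.g. a private deployment
      messages: [{ role: "user", content: prompt }],
    }),
  });
  if (!res.ok) throw new Error(`Gateway error: ${res.status}`);
  const data = await res.json();
  return data.choices[0].message.content;
}
```

Because the model name is the only routing input the app supplies, swapping a private deployment for a public provider does not require application code changes.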
Common use cases
- Workloads that require stricter control over the data path and deployment region
- Latency-sensitive inference close to user populations
- Hybrid deployments: a private baseline with a public fallback (sketched after this list)
- Cost-sensitive tasks where OSS models are viable
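Fallback would normally be expressed in the gateway's routing policy rather than in application code, but the private-baseline-plus-public-fallback pattern can be sketched client-side by reusing the complete() helper above; the model names remain illustrative.

```ts
// Client-side sketch of the private-baseline + public-fallback pattern,
// reusing the complete() helper above. In practice this would more
// likely live in a gateway routing policy; model names are illustrative.
async function completeWithFallback(prompt: string): Promise<string> {
  try {
    return await complete(prompt, "my-private-llama"); // private baseline
  } catch {
    return await complete(prompt, "gpt-4o-mini");      // public fallback
  }
}
```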
Sovereignty and control
Deploy closer to your users or inside specific regions to meet latency and compliance needs.
Unified integration
Your app keeps one API while you mix public models and your private deployments.
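Continuing the sketch above, mixing providers comes down to a different model string passed to the same helper; both names are illustrative.

```ts
// One API surface: only the model string differs (illustrative names).
const fromPublic = await complete("Summarize this ticket.", "gpt-4o-mini");
const fromPrivate = await complete("Summarize this ticket.", "my-private-llama");
```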
Operational simplicity
Provisioning and scaling are handled as a service rather than becoming another platform for you to maintain.
FAQ
Answers reflect current direction and may evolve as the platform ships.