Inference without friction on dex28
One contracts-ready control plane for routing, vectors, perception, and governed APIs — from first prompt to production scale.
Custom APIs
Build and expose APIs for any domain
Enterprise RBAC, fast playground iteration, and production scaling on the same surface — design once, promote cleanly, and operate with quotas you can explain to finance.
Governed access
Roles, keys, and audit trails aligned to how your org already works.
Playground to prod
Ship the same routes from spike tests to signed-off endpoints.
Elastic throughput
Plan capacity around real traffic, not vendor surprises.
Control plane
AI infrastructure, unified — not a patchwork of vendors
dex28 is the control plane for your AI infrastructure: routing, vectors, perception, and guarantees, so teams ship solutions instead of stitching vendors together.
Routing
Model and path rules with observability on every hop.
Vectors
Embeddings and retrieval that stay coherent under load.
Perception
Vision and speech preprocessing for multimodal products.
Guarantees
SLAs, quotas, and policy you can reference in contracts.
Figure: control plane stack (routing, vectors, perception, guarantees).
Product shapes
Endpoints teams actually ship
Design chatbots, analytics, and vision apps with permissions intact; proxy LLMs cost-effectively without rebuilding plumbing each sprint.
Example: multimodal grading APIs for edtech — rubric-backed scoring, OCR, and a dedicated API route on one platform. Explore the Education demo.
Flow: clients to gateway to custom API routes to models.
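The flow above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not dex28's actual API: the route table, the `resolve_route` and `handle` helpers, and the model names are all hypothetical, and the model call is a local stub rather than a real provider request.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Route:
    """A custom API route: a path prefix mapped to a model (hypothetical shape)."""
    prefix: str
    model: str


# Hypothetical route table; paths and model names are placeholders.
ROUTES: List[Route] = [
    Route(prefix="/v1/chat", model="chat-model"),
    Route(prefix="/v1/math-grade", model="grading-model"),
]


def resolve_route(path: str, routes: List[Route] = ROUTES) -> Optional[Route]:
    """Return the route with the longest prefix matching the path, if any."""
    matches = [
        r for r in routes
        if path == r.prefix or path.startswith(r.prefix + "/")
    ]
    return max(matches, key=lambda r: len(r.prefix), default=None)


def stub_model(model: str, payload: str) -> str:
    """Stand-in for a proxied model call; a real gateway would call a provider."""
    return f"[{model}] {payload}"


def handle(
    path: str,
    payload: str,
    call_model: Callable[[str, str], str] = stub_model,
) -> Dict[str, object]:
    """Gateway step: resolve the route, then forward the payload to its model."""
    route = resolve_route(path)
    if route is None:
        return {"status": 404, "body": "no route"}
    return {"status": 200, "body": call_model(route.model, payload)}
```

Keeping route resolution separate from the model call means a different provider can be swapped in through `call_model` without changing the paths clients use.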
Surface tour
One hub, many workloads
Select a focus — previews mirror how teams ship on dex28.
Productize AI across industries
Same platform for chatbots, search, vision, and compliance-heavy workloads — from prototype to governed production.
Dashboard
Live token economics and RBAC-scoped keys: monitor usage for every custom API from one hub.
Playground
No-code prompt routing for chat and vision: test before you deploy, then promote to production.
Documentation
SDK quickstarts for unified integrations — one surface for every AI API you expose.
Embeddings
Similarity search and sharding for RAG, chat, and analytics — throughput you can plan around.
Perception
OCR, ASR, and diagram understanding — preprocess once for any vision or speech downstream.
Education
Math grading as a showcase: rubric design, OCR, and a custom /v1/math-grade API on one platform.
Architecture
Gateway → custom logic → proxied LLMs → durable storage — RBAC and queues at enterprise scale.
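As a rough sketch of how RBAC-scoped keys, a queue, and storage might fit together in the architecture line above: everything here is an assumption for illustration. The `ApiKey` and `Gateway` classes are hypothetical, the in-memory list stands in for durable storage, and the string transform stands in for custom logic plus a proxied LLM call.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Dict, FrozenSet, List, Tuple


@dataclass(frozen=True)
class ApiKey:
    """An RBAC-scoped key: the set of routes it may call (hypothetical shape)."""
    key_id: str
    scopes: FrozenSet[str]


@dataclass
class Gateway:
    """Toy gateway: authorize, enqueue, then process and record results."""
    queue: Deque[Tuple[str, str, str]] = field(default_factory=deque)
    # In-memory stand-in for durable storage.
    storage: List[Dict[str, str]] = field(default_factory=list)

    def submit(self, key: ApiKey, route: str, payload: str) -> bool:
        """Admit the request only if the key's scopes include the route."""
        if route not in key.scopes:
            return False
        self.queue.append((key.key_id, route, payload))
        return True

    def drain(self) -> int:
        """Process queued requests in FIFO order and record each result."""
        processed = 0
        while self.queue:
            key_id, route, payload = self.queue.popleft()
            # Placeholder for custom logic followed by a proxied LLM call.
            result = payload.strip().upper()
            self.storage.append({"key": key_id, "route": route, "result": result})
            processed += 1
        return processed
```

Checking scopes before enqueueing keeps unauthorized requests out of the queue entirely, so the stored records double as a simple audit trail of what each key actually ran.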
Client value
Accelerate AI productization — one contracts-ready surface for builders and operators.
dex28 turns fragmented inference stacks into something you can scope, price, and defend in front of legal and leadership — same control plane from pilot to governed production.
Partners & ecosystem
Built to compose with the stack you already run
Model providers, cloud platforms, and integrators meet dex28 at a stable API boundary — swap vendors without rewriting your product surface.