Skip to content

▪ The platform

Run, stream, verify.One platform.

An OpenAI-compatible inference gateway over a heterogeneous, verifiable fleet. Every output ships with a signed receipt. Every receipt resolves to lineage. Every lineage anchors to hardware that anyone can re-run.

▪ Capability stack

Eight surfaces. One verified runtime.

capability 01

OpenAI-compatible inference gateway

A drop-in /v1/chat/completions endpoint over the verified compute network. Same SDKs, same JSON, same streaming protocol — but every request is routed across heterogeneous Mac, Windows, and Linux operator nodes.

  • OpenAI SDK works unchanged
  • Streaming SSE + non-streaming
  • Per-key spend & rate limits
POST /v1/chat/completions
curl https://api.ryvion.ai/v1/chat/completions \
  -H "Authorization: Bearer ryv_…" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ryv-llama-3-8b",
    "messages": [{"role":"user","content":"Hi"}],
    "stream": true
  }'
HTTP 200 · text/event-streamopenai-compat

capability 02

Ghost-stream UX

Provisional tokens render ghosted while the network speculates. On commit they snap solid. On rollback they wipe and the replacement enters from the left. No other inference network exposes this.

  • Provisional → commit transition (180ms spring)
  • Rollback wipe with replacement
  • Built on multi-token speculative decoding
stream · ryv-llama-3-8bLIVE
The audit trail resolved cleanly across five nodes; receipts are signed and the lineage matches.Each provisional token will commit unless

31.4

tok/s

2.1×

spec-up

142ms

ttft

capability 03

Receipts on every output

Every completion produces an Ed25519-signed receipt over output hash, model id, model revision, node id, timestamp, and (if audited) FuzzyTrace digest. Content-addressed CIDs you can verify offline.

  • RYV1 envelope · Ed25519
  • TranscriptDigest CID per output
  • Public key bundle for offline verify
RYV1 · receiptPASS
{
  "model": "ryv-llama-3-8b",
  "node":  "n_de_rtx7900_01",
  "tokens": 412,
  "output_hash":
    "sha256:9c1e4d7f42a8b3c1…",
  "transcript_digest_cid":
    "ryv1:sha256:e3b0c4429…",
  "signature":
    "ed25519:5c8f7a13b9d2e4f6…",
  "timestamp": "2026-05-12T18:42:09Z"
}

capability 04

Cross-node verification

Opt-in dual execution on Mac arm64 and Windows amd64 nodes. FuzzyTrace numerical comparison classifies every audited output as PASS, AMBIGUOUS, or FAIL — calibrated tolerance, not a hash match.

  • Dual-execute on heterogeneous hardware
  • FuzzyTrace numerical audit
  • PASS / AMBIGUOUS / FAIL verdict

FuzzyTrace audit · 2-of-2

M4 Pro
macarm64 · de · n_a1b2…
PASS
RTX 4070Ti
winamd64 · ca · n_c3d4…
PASS

0.0034

∆ logit

99.7%

overlap

PASS

verdict

capability 05

Heterogeneous fleet routing

Capability-typed router targets Mac M-series, NVIDIA RTX 30/40-series, and AMD RX 7000 nodes. Jurisdiction-aware. Reputation-gated. You see exactly which hardware ran your job in the receipt.

  • Mac arm64 · Windows amd64 · Linux amd64
  • Country-routed when policy demands it
  • Reputation tiers gate workload class

Online nodes · 6

+2 last 5m
M4 Pro
macus
RTX 4070 Ti
winca
RX 7900 XTX
linuxde
M3 Max
macuk
RTX 3090
linuxfr
RTX 4090
winca

capability 06

Knowledge & RAG

Upload documents, embed once with bge-m3, retrieve at inference time. Retrieved chunk hashes ride into the receipt so audits can reproduce the exact context window.

  • Verified embedding lane
  • Chunk hashes in the receipt
  • OpenAI Responses-API style retrieval

Knowledge · ks_org_eu_policy

policy.pdf142bge-m3
handbook.md38bge-m3
regs-2026.txt91bge-m3
retrieved 3 chunks · receipt embeds chunk hashes

capability 07

Agent hosting

Deploy long-running agents that share the same gateway, the same receipts, the same firewall. A managed runtime per agent — invocations are receipted just like raw inference.

  • Per-agent memory + tools
  • Receipts on every invocation
  • Optional PII firewall
agent_support_triageRUNNING
modelryv-llama-3-8b
memorykb_org_support
firewallstrict (PII redact)
invocations · 24h1,402
receipts · 24h1,402
runtime cost$0.21/hr

capability 08

Spend control & billing

Per-key budgets, alert thresholds, and a real-time spend ledger. Stripe-backed. No surprise bills — the ceiling stops requests, not your card.

  • Hard caps + alert tiers
  • Stripe Connect payouts to operators
  • Receipt-anchored audit trail

Spend · this month

$1,284.07

limit $5,000.00 · 25%

inference$842.10
embeddings$112.30
agent hosting$329.67
alerts50%, 75%, 100%

▪ What you don't get

Positioningmatters.

Inference networks blur into each other. Here's what Ryvion deliberately is not.

×

Not a GPU marketplace

We don't auction your job to whichever bidder shows up. The router picks based on capability, jurisdiction, and reputation — buyers don't negotiate with operators.

×

Not an OpenAI router

We don't proxy to OpenAI, Anthropic, or any closed provider. Models run on the verified Ryvion fleet. Same API surface, different substrate.

×

Not raw GPU rental

You don't get SSH. You don't manage CUDA. We give you a verified inference primitive — the platform handles fleet, drivers, and runtime.

▪ Workspace

Your dashboard, the moment you sign in.

Live receipts, live spend, live fleet. The buyer surface is built for the same audience as the receipts — people who want every cell to be a real value.

app.ryvion.ai/buyer/overview
▪ illustrative

▪ overview

Today

12 May 2026 · 18:42 UTC

requests

14,392

+8%

receipts

14,392

100%

p50 ttft

142ms

-12%

spend

$84.21

+4%

Recent receipts

streaming
PASSryv1:sha256:9c1e4d7f…a8b3ryv-llama-3-8b2s ago
PASSryv1:sha256:42e1c803…77f2ryv-llama-3-8b4s ago
PASSryv1:sha256:b81f0904…01c5bge-m36s ago
PASSryv1:sha256:1fa2b3c4…d5e6phi-49s ago
PASSryv1:sha256:7e80af11…3320ryv-llama-3-8b11s ago
NETWORK 6/6 ONLINETPS 32.4P50 142MS▪ ALL VERIFIED

▪ Start

Create a workspace, send your first verified request.