Product Demo Docs Blog About Contact Sign in Sign up
Blog · · Philippe Laporte

Cheaper Compute, Borrowed Trust

The economics of moving AI inference off the hyperscalers are obvious. The trust implications are not. As more inference shifts onto distributed and non-hyperscaler infrastructure, we are quietly re-introducing a problem the cloud era let us forget: how do you actually know the computation ran the way you were told?

For a decade, running an AI workload mostly meant renting a slice of one of a few hyperscalers. That is changing. Independent GPU networks, distributed compute platforms, and specialty inference providers are pulling real workloads off the majors, and for good reasons: lower cost, more sovereignty over where data and computation live, better resilience, and access to capacity the incumbents cannot always supply on demand.

This is a healthy correction. Concentrating the world's compute in a handful of providers was never going to be the permanent shape of the market. But every shift in infrastructure carries a second-order change that is easy to miss while you are busy celebrating the first-order win. Here, the second-order change is about trust.

What changes when the hardware isn't yours

When you run a job on hardware you do not own and cannot inspect, you are extending trust that three things are true: that the model you specified is the model that actually ran, that it ran on the exact input you provided, and that the output you were handed is the real result of that computation. On a hyperscaler, you extend that trust to a brand and a contract. The provider is a known entity with a reputation and a legal relationship, and for most purposes that is enough.

With a hyperscaler, you trust a brand and a contract. On a distributed pool, who exactly are you trusting, and what proof do you hold?

On a distributed pool of third-party machines, the question gets sharper. Who, specifically, ran your job? What stops a node from returning a cheaper approximation, a stale cached answer, or simply the wrong result? In most designs, nothing does, beyond the same blind trust, now extended to parties you cannot name. The computation is treated as correct because the system assumes it is correct.

Most workloads don't care. A growing set can't.

For a great many workloads, that assumption is fine. If you are generating embeddings for an internal search index or summarizing documents for your own use, a rare wrong answer is a tolerable cost, and the savings are worth it. Not every computation needs a paper trail.

But there is a growing class of buyers for whom blind trust is not an option, because they owe an answer for correctness to someone else. A financial firm answering to a regulator. A vendor whose enterprise customer demands proof of how an output was produced. A healthcare payer whose automated decisions are audited. A company whose model output might be cited in a dispute or a filing. For all of them, "our compute provider said it ran correctly" is an assertion, not evidence. And as automated decisions draw more scrutiny across finance, insurance, healthcare, and hiring, the number of people who need evidence rather than reassurance is growing, not shrinking.

"Our vendor said it computed correctly" is an assertion, not evidence.

Proof, not promises

The way out is not to trust harder. It is to stop needing trust for the part that can be proven. Cryptographic verification lets you require proof that a specific computation happened as claimed: a receipt that binds a particular model, a particular input, and a particular output together, in a form anyone holding it can check independently, without re-running the whole job and without taking the provider's word for anything.

That turns correctness from a promise into an artifact. Instead of "trust us," you have a receipt you can hand to your auditor, your regulator, your customer, or your own records, and they can verify it themselves. The trust you used to place in the provider is replaced, for the part that matters, by mathematics.

Verification is not the opposite of distributed compute

It would be easy to read this as an argument against distributed compute. It is the opposite. Verification is the thing that makes distributed compute usable for the work that currently stays on the hyperscalers out of caution. Compute supplies the capacity. Verification supplies the trust. They are complementary layers, and the second is what unlocks the regulated and high-stakes markets the first has been working to reach.

The networks and platforms moving compute off the majors have solved the genuinely hard problems of cost, distribution, and scale. The missing piece, for the buyers who need it, is a way to prove the work was done right. Supply that, and a whole category of workloads that could not previously leave the safety of a brand-name cloud suddenly can.

This is the layer we are building at Cyberian: an independent verification service that issues a cryptographic receipt for each inference job, so the result carries proof it ran as claimed, wherever it ran.


If you are moving inference onto compute you do not fully control, and you owe someone an answer for its correctness, the time to think about proof is before it is asked of you, not after.

PL
Philippe Laporte
Founder and CEO of Cyberian Systems, building verified AI inference infrastructure for regulated industries.

Try the live demo · Follow on LinkedIn · RSS