Compute Appreciates While You Sleep

Mosiah.org · article artifact

The old cloud argument says companies should rent compute because servers depreciate.

That argument was right for ordinary servers. It is wrong, or at least incomplete, for AI compute in the open-model era.

The missing fact is simple:

Compute is capital whose productivity is externally upgraded by the ecosystem's software progress.

A GPU is booked like a depreciating asset. Finance departments see a three-to-five-year schedule: buy the machine, write it down, replace it later. But the same H100 can run a better institution every quarter because the models, inference stacks, kernels, post-training recipes, quantization methods, memory systems, routing layers, and agent architectures around it keep improving.

The metal ages. The productive capacity can rise.

That is the load-bearing economic fact under the automatic-capital thesis.

Labor depreciates the moment you stop paying for it. Capital appreciates while you sleep. And for the first time in history, the appreciation rate of owned machinery can be set by the pace of global AI research rather than only by the owner's own maintenance.

What peaked was deference

The model layer did not peak in capability. The models are still getting better.

What peaked was model-layer deference.

For a few years, the industry behaved as if the frontier labs' eventual dominance was inevitable. Software companies positioned themselves as complements. Enterprises treated the model provider as the natural center. Cloud partners wrote enormous checks. Customers paid token bills as if the return would sort itself out. The labs wanted to become the thing everyone worked inside.

That era is visibly over.

Amazon reporting a jailbreak on its own portfolio company, Microsoft testing a fine-tuned DeepSeek swap, ServiceNow recasting itself as the control tower for agents, corporate buyers discovering the AI blind spot in their budgets, David Sacks reframing safety as property, and open-weight defenders arguing for private fine-tuning and owned inference are not isolated events. They are the coordinated posture of everyone downstream.

The downstream coalition has discovered the same thing:

Do not become a complement to the model lab. Own the layer where work is governed, remembered, routed, audited, and improved.

The defense is no longer a mood. It is product strategy, procurement strategy, political strategy, and training recipe.

The token meter does not rest

The attached case study, Fix This Code, shows why the deference cracked.

The public story around Fable was a security panic: a model could find and repair vulnerabilities, and the same capability could be used to exploit them. That story is true as far as it goes. Fixing a vulnerability and proving it exploitable are not cleanly separable skills. The defender's proof of concept and the attacker's proof of concept can be the same document.

But the political economy underneath the panic matters more.

AI agents are not priced like ordinary software. A chatbot answering a question burns a small number of tokens. An agent working for hours can burn tokens by the millions. The meter never rests.

That creates a budget shock before it creates a productivity revolution. Uber and ServiceNow can push employees to use coding agents, burn through annual budgets early, and still struggle to draw a clean line from machine-written output to customer-valued results. The software may write more code, find more flaws, and produce more patches. But more code is not automatically more product, and more security work is often pure cost: necessary, defensive, and invisible to the customer.

The security case is the ugliest version. Once AI can surface vulnerabilities by the thousand, the world has to fix them. The same capability that creates the defensive to-do list also creates the offensive risk. The guaranteed winner is the vendor selling the cure to the disease its own technology helps spread.

That is why buyers resist model-layer capture. They are not merely haggling over price. They are resisting a future in which the lab owns the instrument, the meter, the learning loop, and the emergency.

The cloud analogy breaks

The strongest argument against owned compute is the history of cloud computing.

Companies used to own servers. Then cloud won because servers were depreciating capital with relatively flat productivity. Renting gave firms elasticity, managed operations, security specialization, and faster access to improved infrastructure. Most companies were right not to run their own data centers.

AI compute changes the analogy.

In the open-model era, owned compute can receive functional upgrades from outside the owner's walls. A local or private cluster that ran yesterday's model can run today's better open model, tomorrow's distilled model, next month's better inference runtime, and the next wave of agent scaffolding. The owner did not invent those improvements. The ecosystem delivered them.

This is ecosystem-level recursive self-improvement without a single owner.

People recognize recursive self-improvement when they imagine it inside a lab: a model helps build a better model, which helps build a still better model. But a quieter version is already running in the open. Distillation, Chinese open releases, better post-training recipes, Tinker-style training infrastructure, faster kernels, quantization, routing, memory, and agent frameworks all raise the productive output of already-owned machines.

That is the disanalogy with cloud servers. Ordinary servers depreciated because the useful software stack did not make the same box dramatically more valuable every quarter. AI compute can depreciate physically while appreciating functionally.

The rent-vs-own calculation changes when the rented thing gets better for the owner if the owner already has it.

It breaks CFO accounting

Corporate accounting is built to see the GPU as depreciating equipment.

The finance department sees purchase price, depreciation schedule, utilization risk, maintenance burden, and obsolescence. It compares that against a cloud bill or a model subscription. On paper, renting often looks safer.

But if owned compute is functionally appreciating, the accounting frame is biased.

A GPU booked as a depreciating asset may be an appreciating productive instrument when open software progress raises the quality of what it can do. The organization that treats the machine only as hardware misses the free external upgrade stream. It underprices ownership and overprices rented access.

This turns the automatic-capital thesis from a preference argument into an arbitrage claim, but the arbitrage has to be stated precisely.

Generic functional appreciation flows to all compute. Hyperscalers' GPUs get better too. If a better open model makes every H100 more productive, then rented H100s also improve, and competitive cloud pricing should pass some of that improvement through. Tokens-per-second appreciation alone does not decide rent versus own.

The arbitrage lives in the non-transferable part of appreciation.

The rented machine can give you cheaper or better generic inference. It cannot give you ownership of the compounding institution unless the memory, source graph, private context, evaluation records, corrections, workflows, and agent loops remain yours. The same hardware becomes more valuable because it is entangled with the owner's artifacts and purposes. That entanglement is not a commodity GPU-hour. It is institutional capital.

There is also a sovereignty premium. Fable being dark for eighteen days made rental revocability observable. If the tool, model, policy switch, and learning loop live somewhere else, the owner's machine can become someone else's pause button. Owned compute is not only a bet on cheaper inference. It is a bet that the compounding part of the system should not be interruptible by a supplier, partner, regulator, or platform politics.

Whoever prices that non-transferable appreciation first buys underpriced machinery.

The appreciation is conditional

"Compute appreciates" is not a universal law.

There are three forces at once.

First, functional appreciation: software, models, kernels, quantization, routing, memory, post-training, and agent systems make existing hardware more useful.

Second, supply expansion: chip production, data centers, inference ASICs, cloud buildout, new hardware generations, and software efficiency all increase effective compute supply. The CFO is not wrong to notice that an H100 can run better models while a B200 or successor may run them much cheaper. Functional appreciation and hardware-generation displacement are both true at the same time.

Third, reflexive capex cycles: shortage narratives trigger overinvestment, delayed supply arrives late, and the market can move from scarcity to glut faster than balance sheets expect.

So the precise claim is not that compute always appreciates. The precise claim is:

Compute appreciates functionally when software and model progress increase its productive output faster than hardware obsolescence and supply expansion erode its opportunity cost.

That condition matters.

There is also a political dependency. Functional appreciation flows to compute owners only while open models track the frontier. If open weights stall, get regulated away, or fall permanently behind closed labs, then the appreciation re-concentrates inside the labs. The same hardware still exists, but the better models only run in someone else's building.

That means the owned-capital equilibrium depends on the open ecosystem persisting.

This is not only a technical fact. It is a political fact.

Open weights, Chinese releases, private fine-tuning, model substitution, and property-oriented AI policy are not side issues. They are the preconditions for compute ownership to beat model rental. Sacks's coalition is not incidental to the thesis. It is part of the market structure that makes the thesis possible.

If the open ecosystem persists, compute owners receive global AI progress as an external upgrade. If the open ecosystem closes, compute owners are pushed back into dependency.

The Bridgewater lesson

The quietest evidence is not a speech. It is a recipe.

Bridgewater publishing a training procedure around Tinker matters because it turns the defensive posture into operational knowledge. The response to model-layer dominance is not merely "use open models" in the abstract. It is: build the internal capacity to tune, evaluate, route, govern, and improve models against your own work.

That is the difference between renting intelligence and owning a learning loop.

A company that can fine-tune, evaluate, and deploy its own models on its own compute is not merely saving on API calls. It is converting global model progress into private institutional capital. It can take public capabilities and attach them to proprietary context, procedures, risk tolerances, compliance boundaries, and taste.

The model becomes swappable. The learning loop stays.

That is why the downstream defense is so coordinated. Microsoft wants the model to be an unscrewed part. ServiceNow wants to govern the agent layer. Amazon wants leverage over the lab it funded. Enterprises want the work surface, memory, and controls to stay theirs. Security professionals want defenders to keep the tools attackers will have anyway. The open ecosystem wants the frontier commoditized enough that no lab owns the future by default.

Different motives. One posture.

AGI as compute looking for work

The compute-glut version of the thesis sounds strange until the allocation unit changes.

If AI demand is measured by human prompting, then there may be too much compute. Most people do not know what to do with twenty-four hours a day of chatbot time.

But if AI demand is measured by irreducible processes deserving sustained attention, the world is compute-starved. Every case, patient, classroom, codebase, market, neighborhood, city budget, supply chain, lab notebook, and publication can absorb a standing machine of attention.

The bottleneck is not absolute usefulness. It is task allocation. Institutions do not yet know how to attach compute to all the processes that deserve it.

That is why a compute glut would not mean too much intelligence. It would mean compute supply exceeding current human and institutional task-allocation bandwidth.

AGI, in this economic sense, is when compute supply exceeds prompt supply.

The compute glut is the physicalization of AGI: compute must find work for itself.

Automatic capital is the machinery that lets it find the right work.

The bet

The model labs want the world to rent artificial labor. They want to own the best worker and charge everyone else for access.

The downstream coalition increasingly wants the opposite: open weights, private fine-tuning, owned inference, swappable models, governed agents, and internal learning loops.

This is why the compute economics matter. If compute is merely depreciating hardware, the cloud analogy wins and renting dominates. If compute is functionally appreciating capital, ownership becomes an arbitrage.

The synthesis is simple:

Labor depreciates the moment you stop paying for it. Capital appreciates while you sleep. Owned AI compute is the first machinery whose appreciation rate can be set by the pace of global AI research.

That is the deepest version of why buyers should want to own.

Not because every firm should become a model lab. Not because every household should administer Linux servers. Not because clouds disappear.

Because the durable surplus is not in a rented answer. It is in the compounding system that runs your work, remembers your corrections, absorbs global progress, and keeps improving while you sleep.