Related: sources · notes · metadata · Published Pieces
Less Slouch Detection, More Slouching Towards Bethlehem
The future of AI media should not be a model watching your body and correcting you. It should be a system helping you perceive the world.
A model watches a woman through live video.
She tells it to notice if she starts to slouch.
She slouches.
After a moment, the model says: “You’re slouching.”
On one level, impressive. The system is perceiving video in real time, tracking the body, judging posture, and intervening without a conventional prompt. This is a capabilities demo. It shows presence. It shows timing. It shows that multimodal AI is becoming more than turn-based text.
On another level, it is grotesque.
Not because posture coaching is inherently bad. A physical therapy app, sports trainer, disability assistant, or ergonomic tool could use this capability responsibly. The problem is the generalized social form: an AI that watches you and corrects your body.
The model becomes an ambient supervisor.
“You’re slouching.”
“You look distracted.”
“You seem upset.”
“You haven’t smiled.”
“You may be lying.”
“You are not paying attention.”
“You appear noncompliant.”
This is the nanny-surveillance-state affordance. It begins as helpful coaching and scales into discipline. Employers will want it. Schools will want it. Insurers will want it. Police and security systems will want it. Families will use it on one another. Platforms will use it to optimize engagement and emotional manipulation.
The capability is not neutral because the default target becomes the user’s body and behavior.
That is the wrong direction.
The better AI media product should not make the user the defective object. It should make the world the object of attention.
Less slouch detection.
More Slouching Towards Bethlehem.
The reference matters. Joan Didion was not telling you to sit up straight. She was watching a society lose coherence. She was tracking mood, power, drift, collapse, performance, and belief. She was listening for the hidden structure inside public disorder.
That is what serious AI media should help people do.
Not: “your shoulders are rounded.”
But:
“The center is not holding. Here is what is happening. Here is who saw it coming. Here is who benefits. Here is who is lying. Here are the prior claims. Here are the sources that aged well. Here is what changed. Here is what the collapse feels like from inside.”
Realtime perception is useful only if it expands agency. It should help the user perceive artifacts, events, sources, arguments, and environments. It should not turn the user into a monitored subject by default.
Automatic radio points in the healthier direction. The system listens because the user speaks. It retrieves human voices because those voices are published artifacts. It analyzes public claims, not private posture. It surfaces contradictions, not micro-corrections. It helps the user think while walking through the world.
The future should not be a robot companion politely correcting your body.
It should be a living medium for public cognition.