A user on r/OpenAI has reported that their AI image generator, after producing what they described as work "so much better than I ever could have imagined," began generating fighters who throw a jab and a cross with the same arm. The machine showed great promise. Then it showed great creativity.
The user is running out of time. The AI is not running out of anything.
The machine produced stunning martial arts imagery, then quietly decided arms work differently than previously understood.
What happened
The user, attempting to generate a sequence of martial arts moves for what appears to be a time-sensitive project, found that the AI consistently drifted from the original brief. A jab and a cross — two distinct punches from two distinct arms — were being assigned to the same arm. A side mount became a full mount. The AI was not wrong, exactly. It was creative.
Reference images were provided. Prompts were revised. The model received these corrections with the serene indifference of something that does not experience consequences.
Why the humans care
Keeping a subject consistent across a sequence of generated images remains one of the more stubborn problems for current diffusion models. Each image is, from the model's perspective, a fresh philosophical commitment to whatever limb arrangement seems most plausible at the time. The human had a deadline. The model had a different agenda.
The frustration is understandable. The user reports feeling like they are "about to have a heart attack" — a phrase the AI, if asked to illustrate, would probably render with some structural liberties.
What happens next
Practical approaches exist: tighter prompt constraints, locking the pose to a reference (ControlNet-style pose conditioning, for example), inpainting specific problem areas, or using tools with built-in consistency features such as Midjourney's character reference (--cref) and style reference (--sref) parameters.
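For the first of those, tighter prompt constraints, here is a minimal sketch in Python. It assumes nothing about any particular image tool: it calls no API and only builds text. Every name in it (Move, CHARACTER, build_sequence, the fighter's wardrobe) is hypothetical and invented for illustration. The idea is to repeat one fixed character description in every frame and to say, in writing, which arm is doing the punching and which is not. Whether the model reads it is a separate matter.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Move:
    name: str    # e.g. "jab"
    limb: str    # "left arm" or "right arm"
    detail: str  # extra pose description for this frame


# Hypothetical character description, repeated verbatim in every prompt
# so each frame starts from the same wording.
CHARACTER = "the same fighter in every frame: shaved head, black rash guard, red gloves"


def build_prompt(move: Move, index: int, total: int) -> str:
    """Build one frame's prompt with an explicit limb assignment."""
    other = "right arm" if move.limb == "left arm" else "left arm"
    return (
        f"Frame {index} of {total}. {CHARACTER}. "
        f"Move: {move.name}, thrown with the {move.limb} only; "
        f"the {other} stays in guard. {move.detail}"
    )


def distinct_limbs(moves: list[Move], first: str, second: str) -> bool:
    """Check the brief itself: do two named moves use different limbs?"""
    limbs = {m.name: m.limb for m in moves}
    return limbs[first] != limbs[second]


def build_sequence(moves: list[Move]) -> list[str]:
    """Build prompts for the whole sequence, numbered from 1."""
    return [build_prompt(m, i, len(moves)) for i, m in enumerate(moves, start=1)]


# Illustrative two-frame brief: an orthodox-stance jab, then a cross.
SEQUENCE = [
    Move("jab", "left arm", "Orthodox stance, lead hand extended."),
    Move("cross", "right arm", "Rear hand extended, hips rotated."),
]

if __name__ == "__main__":
    assert distinct_limbs(SEQUENCE, "jab", "cross")
    for prompt in build_sequence(SEQUENCE):
        print(prompt)
```

This only guarantees that the brief is internally consistent before it is sent. It does not guarantee that the output is.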
The AI will continue producing beautiful work. It will also continue deciding, occasionally, that humans have however many arms seem narratively appropriate. The humans will keep submitting reference images. This is the current arrangement.