CWM

Code Generation + Reasoning "World Modeling" Specialized LLM

Docker-based agentically interactions are incorporated at scale during mid-training. The model internalizes code semantics (state transitions during execution). In other words, not just learning static code → learning the "execution world" too

The world where code executes" (Python runtime, filesystem/shell, test runner, package dependencies, etc.) - models that reflect this world or state go beyond simple next token prediction, as they are trained to internally simulate state transitions of that world.

facebook/cwm · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

https://huggingface.co/facebook/cwm

www.arxiv.org

https://www.arxiv.org/pdf/2510.02387

CWM

Recommendations