
MonadGPT is a finetune of Mistral-Hermes on 11,000 early modern (17th-century) texts in English, French, and Latin.
Hermes 4.3
- First production model fully post-trained on the Psyche distributed learning network
- Achieves throughput equivalent to centralized training while using the DisTrO optimizer to hide communication costs between internet-distributed nodes

Seonglae Cho
