arxiv.org
https://arxiv.org/pdf/2503.14476
Multi-Conv DAPO
MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Despite improvements by length extrapolation, efficient attention and memory modules, handling infinitely long documents with linear complexity without performance degradation during extrapolation...
https://arxiv.org/abs/2507.02259


Seonglae Cho