Product Key Memory (PKM)
Splits each query into two halves and defines the key set of a large memory as the Cartesian product of two small sub-key sets, so n sub-keys per half yield n² full keys. Search (compute) cost stays low because Top-K runs only on each half-query against its own sub-key set, and then once more over the k × k combined candidates, where a candidate's score is the sum of its two half scores
arxiv.org
https://arxiv.org/pdf/1907.05242
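
The lookup described above can be sketched in NumPy. This is a minimal illustration and not the paper's implementation: the function name `product_key_topk`, the single-query and single-head setup, and the index layout `i * n2 + j` are assumptions made for the example, and details from the paper such as the query network, multiple heads, and the softmax-weighted sum over value vectors are left out.

```python
import numpy as np

def product_key_topk(query, subkeys1, subkeys2, k):
    """Return (indices, scores) of the k best product keys for one query.

    subkeys1 and subkeys2 have shape (n1, d/2) and (n2, d/2). The full key
    set is their Cartesian product (n1 * n2 keys of dimension d), which is
    never materialized here. Index layout i * n2 + j is an assumption.
    """
    half = query.shape[0] // 2
    q1, q2 = query[:half], query[half:]

    # Score each half-query against its own sub-key set only.
    s1 = subkeys1 @ q1
    s2 = subkeys2 @ q2

    # Top-k per half (argpartition returns the k best in arbitrary order).
    i1 = np.argpartition(-s1, k - 1)[:k]
    i2 = np.argpartition(-s2, k - 1)[:k]

    # k * k candidates: a product key's score is the sum of its half scores.
    cand = s1[i1][:, None] + s2[i2][None, :]

    # Final top-k over the candidates, sorted by descending score.
    flat = np.argsort(-cand.ravel())[:k]
    r, c = np.unravel_index(flat, cand.shape)
    indices = i1[r] * subkeys2.shape[0] + i2[c]
    return indices, cand[r, c]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    c1 = rng.standard_normal((16, 4))
    c2 = rng.standard_normal((16, 4))
    q = rng.standard_normal(8)
    print(product_key_topk(q, c1, c2, k=4))
```

The sketch scores n1 + n2 sub-keys plus k × k candidates instead of all n1 × n2 full keys, and it returns the same top-k as an exhaustive search over the concatenated keys because any top-k product key must be built from a top-k sub-key in each half.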

Seonglae Cho