Not only limiting features per token, but also limiting tokens per feature to prevent excessive use of specific features for efficient resource allocation
Mutual Choice SAE
Improving performance by allocating more resources to tokens that are difficult to reconstruct
Feature Choice SAE, Mutual Choice SAEs