Generative Spoken Language Model based on continuous word-sized...
In NLP, text language models based on words or subwords are known to outperform their character-based counterparts. Yet, in the speech community, the standard input of spoken LMs are 20ms or...
https://arxiv.org/abs/2310.05224


Seonglae Cho