Deep Learning

Cloze Distillation Improves Psychometric Predictive Power

Training models on next word prediction (NWP) has led to significant developments in general-purpose language models (LMs). However, this objective lacks cognitive motivation under theories of language processing that show human prediction to be …