A research team from Georgia Tech, Microsoft Research and Microsoft Azure AI studies collections of “lottery tickets” in extremely over-parametrized models, revealing how the generalization performance of winning tickets varies and showing the existence of “super tickets.”

The paper "Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization" is on arXiv.

