Life after BERT: What do Other Muppets Understand about Language?

Authors

Vladislav Lialin, Kevin Zhao, Namrata Shivagunde, Anna Rumshisky
Project Resources

Name               Type             Source
ArXiv Paper        Paper            arXiv (cs.CL)
GitHub Repository  Code Repository  GitHub
Abstract

Existing analyses of pre-trained transformers usually focus on only one or two model families at a time, overlooking the variability of architectures and pre-training objectives. In our work, we use the oLMpics benchmark and psycholinguistic probing datasets to evaluate a diverse set of 29 models, including T5, BART, and ALBERT. Additionally, we adapt the oLMpics zero-shot setup for autoregressive models and evaluate GPT networks of different sizes. Our findings show that none of these models can resolve compositional questions in a zero-shot fashion, suggesting that this skill is not acquired through existing pre-training objectives. Furthermore, we find that global model decisions, such as architecture, directionality, dataset size, and pre-training objective, are not predictive of a model's linguistic capabilities.
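To make the zero-shot setup concrete, below is a minimal sketch of how an oLMpics-style multiple-choice probe can be run with an autoregressive model. It assumes a candidate-scoring adaptation: substitute each answer candidate into the masked slot and compare sentence log-likelihoods under GPT-2. This is one common approach, not necessarily the exact procedure used in the paper, and the question wording and the helpers `sentence_logprob` and `zero_shot_answer` are illustrative rather than taken from oLMpics or the authors' code.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def sentence_logprob(text: str) -> float:
    """Total log-probability GPT-2 assigns to `text`."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits
    # Token at position i is predicted from logits at position i - 1.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    targets = ids[0, 1:]
    return log_probs.gather(1, targets.unsqueeze(1)).sum().item()

def zero_shot_answer(question: str, candidates: list[str]) -> str:
    """Fill each candidate into the [MASK] slot; pick the likeliest sentence.

    Illustrative adaptation of a masked-LM multiple-choice probe to a
    left-to-right model; not necessarily the paper's exact method.
    """
    scores = {c: sentence_logprob(question.replace("[MASK]", c))
              for c in candidates}
    return max(scores, key=scores.get)

# Age-comparison probe in the style of oLMpics (wording is illustrative):
question = "A 41 year old person is [MASK] than a 24 year old person."
print(zero_shot_answer(question, ["younger", "older"]))  # expected: "older"
```

A masked model such as BERT can score all candidates for the slot in a single forward pass over its vocabulary; for left-to-right models, comparing full-sequence likelihoods is a natural substitute. Note that raw log-likelihood sums favor shorter candidates, so length-normalizing the scores is a common mitigation when candidates differ in token count.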
