Effective Use of Transformer Networks for Entity Tracking

arXiv category: cs.CL

Authors

Aditya Gupta, Greg Durrett
Project Resources

Name | Type | Source
ArXiv Paper | Paper | arXiv
Semantic Scholar | Paper | Semantic Scholar
GitHub Repository | Code Repository | GitHub
Abstract

Tracking entities in procedural language requires understanding the transformations arising from actions on entities as well as those entities' interactions. While self-attention-based pre-trained language encoders like GPT and BERT have been successfully applied across a range of natural language understanding tasks, their ability to handle the nuances of procedural texts is still untested. In this paper, we explore the use of pre-trained transformer networks for entity tracking tasks in procedural text. First, we test standard lightweight approaches for prediction with pre-trained transformers, and find that these approaches underperform even simple baselines. We show that much stronger results can be attained by restructuring the input to guide the transformer model to focus on a particular entity. Second, we assess the degree to which transformer networks capture the process dynamics, investigating such factors as merged entities and oblique entity references. On two different tasks, ingredient detection in recipes and QA over scientific processes, we achieve state-of-the-art results, but our models still largely attend to shallow context clues and do not form complex representations of intermediate entity or process state.
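
The core technique named in the abstract, restructuring the input so the pre-trained transformer focuses on one target entity, can be sketched roughly as follows. This is a minimal illustration, not the authors' exact formulation: it assumes the HuggingFace transformers library, uses a BERT sentence-pair encoding with the target entity as the second segment, and a two-label classification head (e.g., ingredient present vs. absent); the model name, recipe text, and entity are illustrative only.

```python
# Sketch: entity-conditioned input for entity tracking in procedural text.
# Not the paper's exact setup; a plausible reconstruction of the idea of
# restructuring the input around one target entity.
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # e.g. ingredient present / absent
)

process_text = "Preheat the oven. Mix the flour and butter, then add the milk."
entity = "flour"

# Encode as "[CLS] process text [SEP] entity [SEP]": placing the target
# entity in the second segment guides the encoder's attention toward
# mentions of that entity in the context.
inputs = tokenizer(process_text, entity, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Note: without task-specific fine-tuning, the classification head is
# untrained, so this prediction is only a demonstration of the interface.
predicted = logits.argmax(dim=-1).item()
print(f"Predicted state label for '{entity}': {predicted}")
```

In this framing, one forward pass is made per (context, entity) pair, so tracking all entities in a recipe means re-encoding the text once for each candidate entity; the trade-off is higher compute for a much stronger, entity-focused representation than a single shared encoding provides.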
