
Understanding and Evaluating Commonsense Reasoning in Transformer-based Architectures. Paper Presented at IEEE ICCCNT '21


manavgakhar/ComVE


Understanding and Evaluating Commonsense Reasoning in Transformer-based Architectures

Humans are naturally adept at telling sensical statements apart from nonsensical ones and at explaining why a statement does not make sense; NLP methods, however, can struggle with such a task. This paper evaluates several transformer-based architectures on the Commonsense Validation and Explanation (ComVE) tasks, explores methods to improve their performance, and interprets how fine-tuned language models perform these tasks by examining the attention distributions of sentences at inference time. The tasks entail identifying the nonsensical statement in a given pair of similar statements (validation), followed by selecting the correct reason for its nonsensical nature (explanation). Accuracies of 83.90% and 88.42% are achieved on the respective tasks, using the RoBERTa-large language model fine-tuned on the datasets with the ULMFiT training approach.
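The ULMFiT training approach referred to above combines discriminative per-layer learning rates (lower layers are updated more gently than the task head) with a slanted triangular learning-rate schedule: a short linear warm-up followed by a long linear decay. Below is a minimal sketch of those two schedules; the constants (2.6 layer decay, `cut_frac=0.1`, `ratio=32`) are the defaults from the original ULMFiT paper, not values confirmed by this repository:

```python
# Sketch of ULMFiT-style schedules (assumed defaults, for illustration only).
import math

def discriminative_lrs(base_lr, n_layers, decay=2.6):
    """Per-layer learning rates: the top layer gets base_lr, and each
    successively lower layer is divided by `decay`.
    Returned list is ordered from the lowest layer up to the top layer."""
    return [base_lr / (decay ** (n_layers - 1 - i)) for i in range(n_layers)]

def slanted_triangular_lr(t, total_steps, lr_max=0.01, cut_frac=0.1, ratio=32):
    """Slanted triangular LR: linear warm-up for the first `cut_frac` of
    training, then linear decay; `ratio` bounds how far the LR falls
    below lr_max at the start and end."""
    cut = math.floor(total_steps * cut_frac)
    if t < cut:                      # warm-up phase
        p = t / cut
    else:                            # decay phase
        p = 1 - (t - cut) / (cut * (1 / cut_frac - 1))
    return lr_max * (1 + p * (ratio - 1)) / ratio
```

With a framework such as PyTorch, the per-layer rates would typically be wired in as separate optimizer parameter groups, one group per encoder layer.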

Inter-task transfer learning framework


Explaining results with attention distribution


Results


