The Generative Nature of Commonsense Knowledge: Insights from Machine Learning Pubblico
Ellison, Jacquelyn (Fall 2020)
Abstract
The study of commonsense has received little attention for lack of accounts of how it might be
represented. Recent advances in machine learning are resulting in rich knowledge bases that
(unwittingly) might offer some insight into this elusive phenomenon. This paper assesses one precomputed
model, RoBERTa, for suitability as a working model of human commonsense knowledge by
testing it against variation in human agreement. We examine the contribution of statistical and
structural properties of language to this performance, including frequency and cooccurrence-based
representations, as well as part of speech and syntactic structure. We conclude that RoBERTa is a
suitable model for language prediction: the model’s predictions closely reflected human agreement and
cannot be explained by simple linguistic features. In investigating the range of possible responses to a
particular context, we find that these responses illustrate the impact of categorical organization on
precise context sensitivity and conclude that this demonstrates the hallmarks of commonsense
knowledge. After exploring the contribution of the static component of RoBERTa’s knowledge, the main
finding of this paper is that the knowledge base that directly facilitates both human agreement and the
model’s measure of fit is by its very nature generative, and only truly exists in representation as it is
applied. This paper discusses the role of implicit learning and predictive processing as potential
frameworks within which to substantiate this meta-theoretic observation.
Table of Contents
TABLE OF CONTENTS
ABSTRACT 2
INTRODUCTION 4
THE CONTRIBUTIONS OF LINGUISTIC STRUCTURE AND DISTRIBUTION 9
METHODS 10
PARTICIPANTS 10
MATERIALS 10
PROCEDURE 12
RESULTS 13
DISCUSSION 15
CONTEXT-SENSITIVITY AND THE STRUCTURE OF THE KNOWLEDGE BASE 15
METHODS 17
PARTICIPANTS 17
MATERIALS 17
PROCEDURE 17
RESULTS 17
DISCUSSION 19
UNDERSTANDING ROBERTA’S KNOWLEDGE BASE 19
METHODS 23
PARTICIPANTS 23
MATERIALS 23
PROCEDURE 23
RESULTS 24
CLOZE TASK 24
SIMILARITY ANALYSES 26
DISCUSSION 27
GENERAL DISCUSSION 27
CONCLUSION 29
REFERENCES 30
About this Master's Thesis
| School | |
|---|---|
| Department | |
| Subfield / Discipline | |
| Degree | |
| Submission | |
| Language | 
 | 
| Research Field | |
| Parola chiave | |
| Committee Chair / Thesis Advisor | 
Primary PDF
| Thumbnail | Title | Date Uploaded | Actions | 
|---|---|---|---|
|  | The Generative Nature of Commonsense Knowledge: Insights from Machine Learning () | 2020-10-27 09:42:18 -0400 |  | 
Supplemental Files
| Thumbnail | Title | Date Uploaded | Actions | 
|---|