Automatic Personality Prediction with Attention-based Neural Networks Público

Hang Jiang (Spring 2018)

Permanent URL: https://etd.library.emory.edu/concern/etds/rv042t11v?locale=pt-BR
Published

Abstract

Previous works related to automatic personality prediction focus on using traditional classification models with linguistic features, but neural networks with pre-trained word embeddings, which have achieved huge success in text classification, have never been introduced for the task. This research aims to present a novel approach to automatic personality prediction using convolutional neural networks (CNN) and long short-term memory (LSTM) networks with attention mechanism. Our models are experimented on both monologue corpus, Essays dataset, and new multiparty dialogue corpus, called Friends dataset. We first create the corpus, Friends dataset, by annotating personalities from the popular Big Five theory on the multiparty dialogues from the TV show, Friends, through crowdsourcing and make a comprehensive analysis of the annotation. Our annotated corpus comprises 4 seasons with an average inter-annotator agreement below 0.1. We also propose novel attention-based CNN and LSTM models to overcome the limitations of the basic CNN and LSTM by encoding long-term contextual information and providing a global view of the document. Our analysis shows word embeddings and attention mechanism can effectively improve the performance of our model on the essays dataset by ignoring noise in the corpus. Besides, our results show the challenges for human beings to agree on the task if only text is provided from dialogues. This explains the reason why all the models cannot perform well on the Friends dataset.

Table of Contents

Chapter 1: Introduction ...1

Chapter 2: Background ...4

Chapter 3: Corpus ...11

Chapter 4: Approaches ...16

Chapter 5: Experiment ...20

Chapter 6: Conclusion ...24

Bibliography ...26

About this Honors Thesis

Rights statement
  • Permission granted by the author to include this thesis or dissertation in this repository. All rights reserved by the author. Please contact the author for information regarding the reproduction and use of this thesis or dissertation.
School
Department
Degree
Submission
Language
  • English
Research Field
Palavra-chave
Committee Chair / Thesis Advisor
Committee Members
Última modificação

Primary PDF

Supplemental Files