ClusT: Interactive Visualization Tool for Deep Constrained Clustering on Tweets Pubblico
Wu, Yunjie (Spring 2023)
Abstract
Healthcare indices are used to evaluate the overall accessibility of public health resources for a given region and have been found to be especially useful for predicting post-treatment outcomes for a variety of diseases. Heart failure (HF) patients’ outcome after receiving treatment, for instance, has been discovered to be strongly correlated with the patient’s residing neighborhood, a variable yet to be incorporated into the process of health indices’ generation. Based on preliminary research that shows census-level Twitter data can be utilized to capture neighborhood impact, we introduce a visual analytic system that enables the iterative refinement of this new data. Specifically, the system enables machine learning experts, healthcare professionals, and policymakers to (1) preprocess tweets retrieved by specified keywords, e.g., eliminating URLs, stemming words, etc., (2) extract keywords and their corresponding embeddings, and (3) customize, inspect, and refine a topic model through interactive clustering. Ultimately, data produced by this system, along with other data sources, allows for the refinement of a more environment-reflective healthcare index.
Table of Contents
Introduction Background and Related Work System Design User Study Discussion Conclusion
About this Honors Thesis
School | |
---|---|
Department | |
Degree | |
Submission | |
Language |
|
Research Field | |
Parola chiave | |
Committee Chair / Thesis Advisor | |
Committee Members |
Primary PDF
Thumbnail | Title | Date Uploaded | Actions |
---|---|---|---|
|
ClusT: Interactive Visualization Tool for Deep Constrained Clustering on Tweets () | 2023-04-06 14:11:19 -0400 |
|
Supplemental Files
Thumbnail | Title | Date Uploaded | Actions |
---|