Sparse regression of textual analysis
dc.contributor | Chen, Yuhui | |
dc.contributor | Davis, Cali M. | |
dc.contributor | Kwon, Hyun-Kyoung | |
dc.contributor | Zhu, Wei | |
dc.contributor.advisor | Ames, Brendan | |
dc.contributor.author | Carter, Phylisicia N. | |
dc.contributor.other | University of Alabama Tuscaloosa | |
dc.date.accessioned | 2018-12-14T18:12:31Z | |
dc.date.available | 2018-12-14T18:12:31Z | |
dc.date.issued | 2018 | |
dc.description | Electronic Thesis or Dissertation | en_US |
dc.description.abstract | We consider sparse regression techniques as tools for classification of sentiment within Twitter posts. Analysis of Twitter usage suffers from several unique challenges. For example, the 140-character limit severely limits the amount of information contained in each post; this causes most tweets to contain an extremely small subset of the dictionary, presenting challenges for learning schemes based on dictionary usage. To remedy this undersampling issue, we propose usage of penalized regression. Here, we employ logistic regularization to avoid any degeneracy caused by the sparse usage of the dictionary in each tweet, while simultaneously learning which terms are most associated with each sentiment. Accelerated sparse discriminant analysis is also used to combat the issues of degeneracy and overfitting of the training data while providing dimension reduction. As illustrative examples, we employ sparse logistic regression to classify tweets based on the users’ perception of a connection between vaccination and autism, and we examine the Twitter users' sentiment of the use of autonomous cars. | en_US |
dc.format.extent | 107 p. | |
dc.format.medium | electronic | |
dc.format.mimetype | application/pdf | |
dc.identifier.other | u0015_0000001_0003144 | |
dc.identifier.other | Carter_alatus_0004D_13541 | |
dc.identifier.uri | http://ir.ua.edu/handle/123456789/5276 | |
dc.language | English | |
dc.language.iso | en_US | |
dc.publisher | University of Alabama Libraries | |
dc.relation.hasversion | born digital | |
dc.relation.ispartof | The University of Alabama Electronic Theses and Dissertations | |
dc.relation.ispartof | The University of Alabama Libraries Digital Collections | |
dc.rights | All rights reserved by the author unless otherwise indicated. | en_US |
dc.subject | Applied mathematics | |
dc.title | Sparse regression of textual analysis | en_US |
dc.type | thesis | |
dc.type | text | |
etdms.degree.department | University of Alabama. Department of Mathematics | |
etdms.degree.discipline | Mathematics | |
etdms.degree.grantor | The University of Alabama | |
etdms.degree.level | doctoral | |
etdms.degree.name | Ph.D. |
Files
Original bundle
1 - 1 of 1