Structural information based term weighting in text retrieval for feature location

dc.contributorEtzkorn, Letha
dc.contributorGray, Jeff
dc.contributorSmith, Randy K.
dc.contributor.advisorKraft, Nicholas A.
dc.contributor.authorBassett, Richard Blake
dc.contributor.otherUniversity of Alabama Tuscaloosa
dc.date.accessioned2017-03-01T16:47:22Z
dc.date.available2017-03-01T16:47:22Z
dc.date.issued2013
dc.descriptionElectronic Thesis or Dissertationen_US
dc.description.abstractFeature location is a program comprehension activity in which a developer identifies source code entities that implement a feature of interest. Recent feature location techniques apply text retrieval techniques to corpora built from text embedded in source code. These techniques are highly configurable, but many of the available parameters remain unexplored in the software engineering context. For example, while the natural language processing community has developed several term weighting schemes meant to highlight the importance of certain terms in a particular document, the software engineering community has thus far not developed new term weighting schemes for use with source code. Thus, we propose a new term weighting scheme that is based on the structural information in source code. We then report the results of an empirical study in which we evaluated the performance effects of the proposed term weighting scheme on a latent Dirichlet allocation (LDA) based feature location technique (FLT). In all, we studied over 400 bugs and features from five open source Java systems. Our key finding is that the accuracy of the LDA-based FLT improves when a structural term weighting scheme is used rather than a uniform term weighting scheme.en_US
dc.format.extent44 p.
dc.format.mediumelectronic
dc.format.mimetypeapplication/pdf
dc.identifier.otheru0015_0000001_0001237
dc.identifier.otherBassett_alatus_0004M_11505
dc.identifier.urihttps://ir.ua.edu/handle/123456789/1708
dc.languageEnglish
dc.language.isoen_US
dc.publisherUniversity of Alabama Libraries
dc.relation.hasversionborn digital
dc.relation.ispartofThe University of Alabama Electronic Theses and Dissertations
dc.relation.ispartofThe University of Alabama Libraries Digital Collections
dc.rightsAll rights reserved by the author unless otherwise indicated.en_US
dc.subjectComputer science
dc.titleStructural information based term weighting in text retrieval for feature locationen_US
dc.typethesis
dc.typetext
etdms.degree.departmentUniversity of Alabama. Department of Computer Science
etdms.degree.disciplineComputer Science
etdms.degree.grantorThe University of Alabama
etdms.degree.levelmaster's
etdms.degree.nameM.S.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
file_1.pdf
Size:
271.55 KB
Format:
Adobe Portable Document Format