End-User Feature Labeling: Supervised and Semi-supervised Approaches Based on Locally-Weighted Logistic Regression Public Deposited

http://ir.library.oregonstate.edu/concern/defaults/mk61rj140

This is an author's peer-reviewed final manuscript, as accepted by the publisher. The published article is copyrighted by Elsevier and can be found at:  http://www.journals.elsevier.com/artificial-intelligence/.

Descriptions

Attribute NameValues
Creator
Abstract or Summary
  • When intelligent interfaces, such as intelligent desktop assistants, email classifiers, and recommender systems, customize themselves to a particular end user, such customizations can decrease productivity and increase frustration due to inaccurate predictions—especially in early stages when training data is limited. The end user can improve the learning algorithm by tediously labeling a substantial amount of additional training data, but this takes time and is too ad hoc to target a particular area of inaccuracy. To solve this problem, we propose new supervised and semi-supervised learning algorithms based on locally weighted logistic regression for feature labeling by end users, enabling them to point out which features are important for a class, rather than provide new training instances. We first evaluate our algorithms against other feature labeling algorithms under idealized conditions using feature labels generated by an oracle. In addition, another of our contributions is an evaluation of feature labeling algorithms under real world conditions using feature labels harvested from actual end users in our user study. Our user study is the first statistical user study for feature labeling involving a large number of end users (43 participants), all of whom have no background in machine learning. Our supervised and semi-supervised algorithms were among the best performers when compared to other feature labeling algorithms in the idealized setting and they are also robust to poor quality feature labels provided by ordinary end users in our study. We also perform an analysis to investigate the relative gains of incorporating the different sources of knowledge available in the labeled training set, the feature labels and the unlabeled data. Together, our results strongly suggest that feature labeling by end users is both viable and effective for allowing end users to improve the learning algorithm behind their customized applications.
Resource Type
DOI
Date Available
Date Issued
Citation
  • Das, S., Moore, T., Wong, W. K., Stumpf, S., Oberst, I., McIntosh, K., & Burnetta, M. (2013). End-user feature labeling: Supervised and semi-supervised approaches based on locally-weighted logistic regression. Artificial Intelligence, 204, 56-74. doi:10.1016/j.artint.2013.08.003
Series
Keyword
Rights Statement
Funding Statement (additional comments about funding)
Publisher
Peer Reviewed
Language
Replaces
Additional Information
  • description.provenance : Approved for entry into archive by Deanne Bruner(deanne.bruner@oregonstate.edu) on 2014-02-13T22:08:29Z (GMT) No. of bitstreams: 1 DasShubhomoyEECSEndUserFeature.pdf: 1181665 bytes, checksum: 999ae7e42dac5f9d661e1c44b4214b40 (MD5)
  • description.provenance : Made available in DSpace on 2014-02-13T22:08:29Z (GMT). No. of bitstreams: 1 DasShubhomoyEECSEndUserFeature.pdf: 1181665 bytes, checksum: 999ae7e42dac5f9d661e1c44b4214b40 (MD5) Previous issue date: 2013-11
  • description.provenance : Submitted by Deanne Bruner (deanne.bruner@oregonstate.edu) on 2014-02-13T22:07:47Z No. of bitstreams: 1 DasShubhomoyEECSEndUserFeature.pdf: 1181665 bytes, checksum: 999ae7e42dac5f9d661e1c44b4214b40 (MD5)

Relationships

In Administrative Set:
Last modified: 07/18/2017

Downloadable Content

Download PDF
Citations:

EndNote | Zotero | Mendeley

Items