Confidence-based feature acquisition to minimize training and test costs

TitleConfidence-based feature acquisition to minimize training and test costs
Publication TypeJournal Articles
Year of Publication2010
AuthorsdesJardins M, MacGlashan J, Wagstaff KL
JournalProceedings of the SIAM Conference on Data Mining
Volume76
Issue373
Pagination514 - 524
Date Published2010///
Abstract

We present Confidence-based Feature Acquisition (CFA), a novel supervised learning method for acquiring missing feature values when there is missing data at both training and test time. Previous work has considered the cases of missing data at training time (e.g., Active Feature Acquisition, AFA 8), or at test time (e.g., Cost-Sensitive Naive Bayes, CSNB 2), but not both. At training time, CFA constructs a cascaded ensemble of classifiers, starting with the zero-cost features and adding a single feature for each successive model. For each model, CFA selects a subset of training instances for which the added feature should be acquired. At test time, the set of models is applied sequentially (as a cascade), stopping when a user-supplied confidence threshold is met. We compare CFA to AFA, CSNB, and several other baselines, and find that CFAs accuracy is at least as high as the other methods, while incurring significantly lower feature acquisition costs.

DOI10.2307/2287057