TY  - CONF
T1  - Improving text classification for oral history archives with temporal domain knowledge
T2  - Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Y1  - 2007
A1  - Olsson, J. Scott
A1  - Oard, Douglas
KW  - automatic topic classification
KW  - classifying with domain knowledge
KW  - spoken document classification
AB  - This paper describes two new techniques for increasing the accuracy of topic label assignment to conversational speech from oral history interviews using supervised machine learning in conjunction with automatic speech recognition. The first, time-shifted classification, leverages local sequence information from the order in which the story is told. The second, temporal label weighting, takes the complementary perspective by using the position within an interview to bias label assignment probabilities. These methods, when used in combination, yield between 6% and 15% relative improvements in classification accuracy using a clipped R-precision measure that models the utility of label sets as segment summaries in interactive speech retrieval applications.
JA  - Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
T3  - SIGIR '07
PB  - ACM
CY  - New York, NY, USA
SN  - 978-1-59593-597-7
UR  - http://doi.acm.org/10.1145/1277741.1277848
M3  - 10.1145/1277741.1277848
ER  -