The acl anthology reference corpus: A reference dataset for bibliographic research in computational linguistics

TitleThe acl anthology reference corpus: A reference dataset for bibliographic research in computational linguistics
Publication TypeJournal Articles
Year of Publication2008
AuthorsBird S, Dale R, Dorr BJ, Gibson B, Joseph MT, Kan MY, Lee D, Powley B, Radev DR, Tan YF
JournalProc. of the 6th International Conference on Language Resources and Evaluation Conference (LREC’08)
Pagination1755 - 1759
Date Published2008///
Abstract

The ACL Anthology is a digital archive of conference and journal papers in natural language processing and computational linguistics.Its primary purpose is to serve as a reference repository of research results, but we believe that it can also be an object of study and
a platform for research in its own right. We describe an enriched and standardized reference corpus derived from the ACL Anthology
that can be used for research in scholarly document processing. This corpus, which we call the ACL Anthology Reference Corpus
(ACL ARC), brings together the recent activities of a number of research groups around the world. Our goal is to make the corpus
widely available, and to encourage other researchers to use it as a standard testbed for experiments in both bibliographic and bibliometric
research.