LBSC 878 - Information Storage and Retrieval
Spring 2005
Core Reading List
This list identifies core readings that each student is expected to
review before the assigned session. Readings are listed in the
recommended reading order. All readings will be available in the Paul
Wasserman library, and some are also available on the net in a variety
of formats. Viewers are available for postscript and PDF
files if you don't have them installed. Files ending in ".gz", ".Z",
or ".zip" will need to be uncompressed before those viewers
will handle them correctly, though. Optional
readings have also been identified for some weeks.
Typically four papers are assigned to all students in order to provide
us with a common ground for our discussions in class. In weeks 3-7, a
student will lead a discussion of one paper each week during the first
30 minutes of class. That paper, which is drawn from from the
previous week's reading assignment, is indicated with a "*" in
the reading list. Students are expected to read a total of at least 6
papers each week (which you should space out to one a day - don't try
to read all 6 on Sunday night!). In addition to the core readings
listed here, students are expected to complete additional readings in
their chosen survey topic and are encouraged
to also choose a paper from the optional
reading list each week.
Core readings for Week 1 (relevance):
- Blair, David C. 1990. Language and Representation in
Information Retrieval, Elsevier Science, Amsterdam. Read
Chapter 1, pages 1-10 only.
- Croft, W. Bruce. 1995. "What do People Want from Information
Retrieval?," D-Lib Magazine, November. Available at http://www.dlib.org/
- Mizzaro, S. 1997. "Relevance: The whole hi(story)," Journal of
the American Society for Information Science, 48(9): 810-832.
- * Wang, Peiling and Dagobert Soergel, 1998. "A Cognitive Model of
Document Use during a Research Project," Journal of the
American Society for Information Science, vol. 49, no. 2.
Core readings for Week 2 (knowledge representation):
- Berners-Lee, Tim, James Hendler, and Ora Lassila, "The
Semantic Web," Scientific American, May 2001.
- Torsun, I. S. 1995. Foundations
of Intelligent Knowledge-Based Systems, Academic Press,
London. Read chapter 5 only.
- Fikes, Richard and Tom Kehler. 1985. "The Role of Frame-Based
Representation in Reasoning," Communications of the ACM,
vol. 28, no. 9, pp. 904-920. Available
on campus at http://www.acm.org/dl
- * Srini Narayanan and and Sanda Harabagiu. 2004.
"Question Answering Based on Semantic Structures," Proceedings
of the 20th International Conference on Computational Linguistics
(COLING). Available at http://www.hlt.utdallas.edu/papers/837.pdf
Core readings for Week 3 (indexing):
- Harman, Donna, Ed Fox, Ricardo Baeza-Yates and W. Lee. 1992.
"Inverted Files," in Frakes, William
B. and Ricardo Baeza-Yates, Information Retrieval: Data
Structures and Algorithms. Prentice Hall, New York. Chapter 3.
- Resnik, Philip. 1999. "Semantic Similarity in a Taxonomy: An
Information-Based Measure and its Application to Problems of
Ambiguity in Natural Language," Journal of Artificial
Intelligence Research, volume 11, pp. 95-130.
- Lin, Dekang. 2001. "LaTaT: Language and Text Analysis Tools,"
in Proceedings of the First Human Language Technology
Conference (HLT)., San Diego. Available at http://www.cs.ualberta.ca/~lindek/papers.htm
- * Singhal, Amit and Fernando Pereira. 1999. "Document Expansion for
Speech Retrieval," in Proceedings of the 22nd Annual Interantional
ACM-SIGIR Conference on Research and Development in Information
Retrieval, Berkeley. Available on campus through the ACM
Digital Library.
Core readings for Week 4 (query formulation):
- Taylor, Robert S. 1962. "The Process of Asking Questions,"
American Documentation, vol. 13, no. 4, pp. 391-396.
- Belkin, N.J. (1980) Anomalous state of knowledge for information
retrieval. Canadian Journal of Information Science, 5, 133-143.
- Ruthven, Ian and Mounia Lalmas, "A Survey on the Use of
Relevance Feedback for Information Access Systems," Knowledge
Engineering Review. Available at
http://www.cis.strath.ac.uk/~ir/papers/
- * Baldonado, Michelle, Q. Wang, and Terry Winograd. 1997.
"SenseMaker: an Information-Exploration Interface Supporting
the Contextual Evolution of a User's Interests." in
Proceedings of CHI '97. Available at http://www-diglib.stanford.edu/cgi-bin/WP/get/SIDL-WP-1996-0048
Core readings for Week 5 (matching):
- Harman, Donna. 1992. "Ranking Algorithms," in Frakes, William
B. and Ricardo Baeza-Yates, in Information Retrieval: Data
Structures and Algorithms. Prentice Hall, New York. Chapter 14.
- Robertson, Stephen and Karen Sparck-Jones. 1997. "Simple, Proven
Approaches to Text Retrieval," Cambridge University Technical Report
TR356. Available at
http://www.ftp.cl.cam.ac.uk/ftp/papers/reports/TR356-ksj-approaches-to-text-retrieval.html.
- Miller, David, Tim Leek and Richard Schwartz, 1999. "A Hidden Markov
Model Information Retrieval System," in Proceedings of the
22nd Annual
International ACM SIGIR Conference on Research and Development in
Information Retrieval, Berkeley, August, 1999, pp. 214-221.
Available
on campus at http://www.acm.org/dl
- * Rodgers, Ian. 2002. "The Google Pagerank Algorithm and How It
Works," IPR Computing White Paper. Available at
http://www.iprcom.com/papers/pagerank/.
Core readings for Week 6 (browsing):
- Hearst, Marti A. 1999. "User Interfaces and Visualization," in
Ricardo Baeza-Yates and Berthier Ribeiro-Neto, ed., Modern
Information Retrieval, Chapter 10. Addison-Wesley, New York.
Available at http://www.sims.berkeley.edu/~hearst/irbook/chapters/chap10.html
- Xia Lin, Dagobert Soergel, and Gary Marchionini. 1991 "A
self-organizing semantic map for information retrieval," in
Proceedings of the 14th International ACM/SIGIR Conference
on Research and Development in Information Retrieval,
pp. 262-269. Available on campus at http://www.acm.org/dl
- * Campbell, I. and Van Rijsbergen, C.J. (1996) The ostensive model of
developing information needs. Proceedings of the 3rd
International Conference on Conceptions of Library and
Information Science (pp. 251-268) (Available online at:
http://www.dcs.gla.ac.uk/~iain/papers/colis2.thumbs.pdf
- * Kelly, Diane and Nicholas J. Belkin. 2004. "Display Time as
Implicit Feedback: Understanding Task Effects," Proceedings of
the 27th ACM SIGIR Conference on Research and Development in
Information Retrieval, Sheffield, pp. 377-384.
Core readings for Week 7 (evaluation):
- Kando, Noriko. 2004. "Overview of the Fourth NTCIR Workshop,"
in Working Notes of NTCIR-4, Tokyo, 8 pages. Available
at http://research.nii.ac.jp/ntcir-ws4/NTCIR4-WN/.
- Voorhees, Ellen M. 2000. "Variations in Relevance Judgments and
the Measurement of Retrieval Effectiveness," Information
Processing and Management, 36(5)697-716. Available
on campus through Research Port.
- Fiscus, Jonathan and Barbara Wheatley. 2004. "Overview of the
TDT 2004 Evaluation
and Results," in TDT2004 Workshop Presentations and System
Description Ppaers. Powerpoint slides only (no paper),
Available at http://www.nist.gov/speech/tests/tdt/tdt2004/papers/NIST-TDT2004.ppt.
- * Turpin, Andrew H. and William Hersh. 2001. "Why Batch and
User Evaluations Do Not Give the
Same Results" in Proceedings of the 24th Annual International
ACM SIGIR Conference on Research and Development in Information
Retrieval, pp. 225-231. Available on campus at http://www.acm.org/dl.
Core readings for Week 8 (surveys):
- John S. Garofolo, Cedric G.P. Auzanne, and Ellen M. Voorhees,
"The TREC Spoken Document Retrieval Track: A Success Story," Proceedings of
TREC-8.
Core readings for Week 9 (surveys):
- Christel, M.G., Winkler, D.B., & Taylor,
C.R. (1997). Multimedia Abstractions for a Digital Video Library.
Proceedings of the 2nd ACM International Conference on
Digital Libraries
(available on campus at http://www.acm.org/dl).
- TBA (Fleming)
Core readings for Week 10 (surveys):
- Cunningham, S. J., Reeves, N., & Britland, M. (2003). "An
ethnographic
study of music information seeking: Implications for the design of a
music digital library." Proceedings of the 2003 Joint Conference on
Digital Libraries (JCDL '03) (pp. 5-17). Washington DC: IEEE Computer
Society. (available from the ACM Digital Library)
- L. Hirschman and R. Gaizauskas, 2001, "Natural Language Question
Answering: The view from Here" Natural Language
Engineering, 7(4)275-300.
- Ford, G., & Gelderblom, H. (2003). "The effects of culture on
performance achieved through the use of human computer
interaction." in >i>Proceedings of the 2003 South African Institute of
Computer Scientists and Information Technologists (SAICSIT) on
Enablement Through Technology (pp. 218-230) (available from the ACM
Digital Library).
Core readings for Week 11 (academic life):
Core readings for Week 12 (research papers):
- TBA (Student A)
- TBA (Student B)
Core readings for Week 13 (research papers):
- TBA (Student C)
- TBA (Student D)
Core readings for Week 14 (research papers):
- TBA (Student E)
- Hirsh, S. G. (1999). Children's
relevance criteria and information seeking on electronic
resources. Journal of the American Society for Information Science,
50(14), 1265-1283. (available through Research Port)