As the amount of available data increases, the problem of information discovery, often referred to as finding the needle in the haystack problem, becomes more pressing. The most successful search applications today are the general purpose Web search engines and the well-structured database querying (e.g., SQL). Directly applying these two search models to specific domains is ineffective since they ignore the domain semantics - the meaning of object associations - and the needs of the domain users - a biologist wants to see different results from a physician for the same query on PubMed.
We present challenges and techniques to achieve effective information discovery on vertical domains by modeling the domain semantics and its users, and exploiting the knowledge of domain experts. Our focal domains are the products marketplace, biological data, clinical data, and bibliographic data. This project is funded by the NSF.
Vagelis Hristidis is an assistant professor at the School of Computing and Information Sciences at the Florida International University in Miami. He received his B.S. in Electrical and Computer Engineering from the National Technical University of Athens and his M.S. and Ph.D. in Computer Science from the University of California, San Diego in 2004. His main research work addresses the problem of bridging the gap between Databases and Information Retrieval.
This talk is part of the CLIP Colloquium Series. For the complete schedule, please visit http://www.umiacs.umd.edu/research/CLIP/colloq/.