| Research by Topic, and Data | ||
| (Click here to access invited talks, as well as, lists of conference, journal, and workshop publications.) | ||
|
||
| Emotion Analysis | ||
| Key words: Plutchik emotions, emotion lexicon, mechanical turk, crowd sourcing, semantic orientation lexicon, colors of emotions. |
||
| Lexicon: Please email saif.mohammad@nrc-cnrc.gc.ca to obtain a copy of the NRC word-emotion association lexicon. The lexicon has human annotations of emotion associations for more than 24,200 word senses (about 14,200 word types). The annotations include whether the target is positive or negative, and whether the target has associations with eight basic emotions (joy, sadness, anger, fear, surprise, anticipation, trust, disgust). The lexicon is described in the papers below. | ||
| Papers: | ||
| Crowdsourcing a Word-Emotion Association Lexicon, Saif Mohammad and Peter Turney, To Appear in Computational Intelligence, Wiley Blackwell Publishing Ltd. | ||
| Paper (pdf) BibTeX | ||
| Tracking Sentiment in Mail: How Genders Differ on Emotional Axes, Saif Mohammad and Tony Yang, In Proceedings of the ACL 2011 Workshop on ACL 2011 Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA), June 2011, Portland, OR. | ||
| Paper (pdf) BibTeX Presentation | ||
| Download: | 1. Collections of love letters, hate mail, and suicide notes. | |
| 2. A mapping of directory names in the Enron email corpus to email ids and to gender. | ||
| From Once Upon a Time to Happily Ever After: Tracking Emotions in Novels and Fairy Tales, Saif Mohammad, In Proceedings of the ACL 2011 Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), June 2011, Portland, OR. | ||
| Paper (pdf) BibTeX Presentation | ||
| Associations of Words with Emotion, Polarity, and Colour: Crowdsoursing a Lexicon, Saif Mohammad and Peter Turney, Technical Report, National Research Council Canada, Ottawa, Canada. | ||
| Paper (pdf) BibTeX | ||
| "Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon", Saif Mohammad and Peter Turney, In Proceedings of the NAACL-HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, June 2010, LA, California. | ||
| Abstract Paper (pdf) Presentation | ||
| Invited Talk: From Once Upon a Time to Happily Ever After: Tracking Emotions in Books and Mail. | ||
|
||
| Automatically Generating a Semantic Orientation Lexicon | ||
| Key words: semantic orientation lexicon, polarity, subjectivity, marking theory, affix patterns, word senses, thesaurus. | ||
| Download: Access the Macquarie Semantic Orientation Lexicon (MSOL) here. It is described in the EMNLP-09 paper listed below. The paper describes a few different MSOL variants; the one available here for download is MSOL(ASL and GI). | ||
| "Generating High-Coverage Semantic Orientation Lexicons From Overtly Marked Words and a Thesaurus", Saif Mohammad, Bonnie Dorr, and Cody Dunne, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2009), August 2009, Singapore. | ||
| Abstract Paper (pdf) Presentation | ||
| Computing Word-Colour Associations | ||
| Key words: words, colour, meaning, crowdsourcing, Mechanical Turk, Berlin and Kay. | ||
| Lexicon: Please email saif.mohammad@nrc-cnrc.gc.ca to obtain a copy of the NRC word-colour association lexicon. The lexicon has human annotations of colours associated with more than 24,200 word senses (about 14,200 word types). The lexicon is described in the papers below. | ||
| Colourful Language: Measuring Word-Colour Associations, Saif Mohammad, In Proceedings of the ACL 2011 Workshop on Cognitive Modeling and Computational Linguistics (CMCL), June 2011, Portland, OR. | ||
| Paper (pdf) BibTeX Presentation | ||
| Even the Abstract have Colour: Consensus in WordColour Associations, Saif Mohammad, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, June 2011, Portland, OR. | ||
| Paper (pdf) BibTeX Poster | ||
| Computing Semantic Distance | ||
| Key words: semantic distance (semantic relatedness and semantic similarity), distributional similarity, cross-lingual semantic distance, word senses, thesaurus. | ||
| "Measuring Semantic Distance using Distributional Profiles of Concepts", Saif Mohammad and Graeme Hirst. Submitted. | ||
| Abstract Paper (pdf) | ||
| "Estimating semantic distance using soft semantic constraints in knowledge-source–corpus hybrid models", Yuval Marton, Saif Mohammad, and Philip Resnik, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2009), August 2009, Singapore. | ||
| Abstract Paper (pdf) Presentation | ||
| "Measuring Semantic Distance using Distributional Profiles of Concepts", Saif Mohammad, Ph.D. thesis, University of Toronto, January 2008, Toronto, Canada. | ||
| Abstract Paper (pdf) Presentation | ||
| "Cross-lingual distributional profiles of concepts for measuring semantic distance", Saif Mohammad, Iryna Gurevych, Graeme Hirst, and Torsten Zesch, In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP/CoNLL-2007), June 2007, Prague, Czech Republic. | ||
| Abstract Paper (ps) Paper (pdf) Presentation | ||
"Distributional Measures as Proxies
for Semantic Distance: A Survey", Saif Mohammad
and Graeme Hirst. |
||
Paper
- Dec 2007 version (pdf) (Note: This is an updated version of the
Jan 2006 paper below.) |
||
"Distributional Measures as Proxies
for Semantic Relatedness", Saif Mohammad and Graeme
Hirst. |
||
| "Distributional measures of concept-distance: A task-oriented evaluation", Saif Mohammad and Graeme Hirst, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2006), July 2006, Sydney, Australia. | ||
| Abstract Paper (ps) Paper (pdf) Presentation | ||
| Computing Lexical Contrast | ||
| Key words: antonyms, opposites, contrast, distributional similarity, affix patterns, word senses, thesaurus. | ||
| Download: | 1. List of about 3.5 million antonym pairs identified from contrasting adjacent thesaurus categories. | |
| 2. List of about 3.2 million antonym pairs identified using affix patterns and the thesaurus structure. | ||
| 3. Total set of 6.3 million antonym pairs obtained by combining 1 and 2, and removing duplicates. | ||
| 4. Set of 1269 closest-to-opposite questions created for WordNet opposites: adjectives, adverbs, nouns, verbs | ||
| 5. Set of 162 closest-to-opposite questions from GRE preparatory website 1: development set. | ||
| 6. Set of 950 closest-to-opposite questions from GRE preparatory website 2: test set. | ||
| 7. Questionnaires for determining information about kinds of opposites: adjectives, adverbs, nouns, verbs | ||
| 8. Responses to crowdsourced questionnaires: adjectives, adverbs, nouns, verbs | ||
| 9. Set of 209 adjacent categories in the Macquarie Thesaurus that were manually determined to be contrasting. | ||
| 10. Set of 1358 WordNet opposites used to test the co-occurrence and the distributional hypotheses. | ||
| 11. Set of 1358 WordNet synonyms used to test the co-occurrence and the distributional hypotheses. | ||
| 12. Set of 1358 WordNet random word pairs used to test the co-occurrence and the distributional hypotheses. | ||
| 13. Set of 15 affix rules that tend to generate opposites. | ||
| "Computing Word-Pair Antonymy", Saif Mohammad, Bonnie Dorr , and Graeme Hirst, In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-2008), October 2008, Waikiki, Hawaii. | ||
| Abstract Paper (pdf) Presentation | ||
| "Towards Antonymy-Aware Natural Language Applications", Saif Mohammad, Bonnie Dorr, and Graeme Hirst. Proceedings of the Symposium on Semantic Knowledge Discovery, Organization and Use (SKDOU-2008), November 2008, New York, NY. | ||
| Paper (pdf) Poster | ||
| Word Sense Disambiguation and Word Sense Dominance | ||
| "Distributional profiles of concepts
for Unsupervised Word Sense Disambigution", Saif Mohammad,
Graeme Hirst, and Philip
Resnik, In Proceedings of the Fourth International Workshop
on the Evaluation of Systems for the Semantic Analysis of Text (SemEval-07),
June 2007, Prague, Czech Republic. |
||
| "Determining Word Sense Dominance Using a Thesaurus", Saif Mohammad and Graeme Hirst, In Proceedings of the 11th conference of the European chapter of the Association for Computational Linguistics (EACL-2006), April 2006, Trento, Italy. | ||
| Abstract Paper (ps) Paper (pdf) Presentation | ||
"Combining Lexical and Syntactic Features
for Supervised Word Sense Disambiguation", Saif Mohammad
and Ted Pedersen, In Proceedings of the Conference on Computational
Natural Language Learning (CoNLL-2004), May, 2004, Boston, MA. |
||
| "Complementarity of Lexical and Simple
Syntactic Features: The SyntaLex Approach to Senseval-3", Saif
Mohammad and Ted Pedersen, In Proceedings of the Third International
Workshop on the Evaluation of Systems for the Semantic Analysis of Text
(SensEval-3), July 2004, Barcelona, Spain. |
||
| "Combining Lexical and Syntactic Features for Supervised Word Sense Disambiguation", Saif Mohammad, Master's thesis, University of Minnesota, August 2003, Minnesota. | ||
| Abstract Paper (ps) Paper (pdf) Presentation | ||
"Guaranteed Pre-Tagging for the Brill Tagger", Saif Mohammad and Ted Pedersen, In Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2003), February 2003, Mexico City. |
||
Abstract Paper (ps) Paper (pdf) Presentation |
||
| Text Summarization | ||
| "Generating Surveys of Scientific Paradigms", Saif Mohammad, Bonnie Dorr, Melissa Egan, Ahmed Hassan, Pradeep Muthukrishan, Vahed Qazvinian, Dragomir Radev, and David Zajic, In Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT-2009), May 2009, Boulder, Colorado. | ||
| Abstract Paper (pdf) Presentation | ||
| "Multiple alternative sentence compressions and word-pair
antonymy for automatic text summarization and recognizing textual entailment.",
Saif Mohammad, Bonnie Dorr, Melissa Egan, Jimmy Lin, and
David Zajic. Proceedings of the Text Analysis Conference (TAC-2008),
November 2008, Gaithersburg, MD. |
||
| Abstract Paper (pdf) Poster | ||
| Multi-Document Coreference Resolution | ||
| "Cross-Document Coreference Resolution: A Key Technology for Learning by Reading", James Mayfield, Bonnie Dorr, Jason Eisner, Tim Finin, Saif Mohammad, Douglas Oard, Ralph Weischedel, David Yarowsky, and others. March 2009. Proceedings of the AAAI Spring Symposium on Learning by Reading and Learning to Read (AAAI-09), Menlo Park, CA. | ||
| Abstract Paper (pdf) Presentation | ||
| "Multiple alternative sentence compressions and word-pair
antonymy for automatic text summarization and recognizing textual entailment.",
Saif Mohammad, Bonnie Dorr, Melissa Egan, Jimmy Lin, and
David Zajic. Proceedings of the Text Analysis Conference (TAC-2008),
November 2008, Gaithersburg, MD. |
||
| Abstract Paper (pdf) Poster | ||
Last Updated:
June 2011 |
||