Using lexical patterns in the Google Web 1T corpus to deduce semantic relations between nouns

Nulty, Paul and Costello, Fintan J. (2009) Using lexical patterns in the Google Web 1T corpus to deduce semantic relations between nouns. In: DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, 2009-06-01, Boulder, CO, United States.

This paper investigates methods for using lexical patterns in a corpus to deduce the semantic relation that holds between the two nouns in a noun-noun compound phrase such as "flu virus" or "morning exercise". Much of the previous work in this area has used automated queries to commercial web search engines. In our experiments we use the Google Web 1T corpus. This corpus contains every 2-, 3-, 4- and 5-gram occurring more than 40 times in Google's index of the web, and has the advantage of being available to researchers directly rather than through a web interface. This paper evaluates the performance of the Web 1T corpus on this task against similar systems in the literature, and also investigates which kinds of lexical patterns are most informative when trying to identify the semantic relation between two nouns.
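
As a rough illustration of what a "lexical pattern" between two nouns might look like in n-gram data, the following minimal sketch counts the word sequences that join a noun pair. It is not the authors' implementation: the function name `joining_patterns`, the in-memory `(tokens, count)` record layout, the restriction to n-grams that begin with one noun and end with the other, and all counts in the toy data are assumptions made purely for illustration.

```python
from collections import Counter


def joining_patterns(ngrams, noun1, noun2):
    """Count the word sequences that join two nouns in n-gram records.

    ngrams: iterable of (tokens, count) pairs, where tokens is a tuple of
    lowercase words and count is the n-gram's corpus frequency.
    Only n-grams that start with one noun and end with the other are used
    (an illustrative simplification, not a claim about the paper's method).
    """
    patterns = Counter()
    pair = {noun1, noun2}
    for tokens, count in ngrams:
        if len(tokens) < 3:
            continue  # no joining words between the nouns
        first, last = tokens[0], tokens[-1]
        if first != last and {first, last} == pair:
            middle = " ".join(tokens[1:-1])
            # Record the direction so "N1 ... N2" and "N2 ... N1" stay distinct.
            template = "N1 {} N2" if first == noun1 else "N2 {} N1"
            patterns[template.format(middle)] += count
    return patterns


# Toy records with invented counts; real data would come from the corpus files.
toy_ngrams = [
    (("virus", "that", "causes", "flu"), 120),
    (("flu", "is", "a", "virus"), 85),
    (("virus", "causes", "flu"), 60),
    (("flu", "virus"), 900),
    (("exercise", "in", "the", "morning"), 300),
]

result = joining_patterns(toy_ngrams, "flu", "virus")
for pattern, freq in result.most_common():
    print(pattern, freq)
```

Pattern counts of this kind could then serve as features for a classifier that assigns a semantic relation to the compound; which classifier and which patterns work best is the question the paper itself addresses.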

Full text not available from this repository.
