Baldwin, Timothy, Bannard, Colin, Tanaka, Takaaki and Widdows, Dominic
(2003)
An empirical model of multiword expression decomposability
In: Proceedings of the ACL 2003 workshop on Multiword expressions analysis, acquisition and treatment -, 2003-7-12 - 2003-7-12, Sapporo, Japan.
Abstract
This paper presents a construction-inspecific model of multiword expression decomposability based on latent semantic analysis. We use latent semantic analysis to determine the similarity between a multiword expression and its constituent words, and claim that higher similarities indicate greater decomposability. We test the model over English noun-noun compounds and verb-particles, and evaluate its correlation with similarities and hyponymy values in WordNet. Based on mean hyponymy over partitions of data ranked on similarity, we furnish evidence for the calculated similarities being correlated with the semantic relational content of WordNet.
| Item Type: | Conference Item (Unspecified) |
|---|---|
| Uncontrolled Keywords: | 46 Information and Computing Sciences, 47 Language, Communication and Culture, 4704 Linguistics |
| Depositing User: | Symplectic Admin |
| Date Deposited: | 21 Jun 2016 10:12 |
| Last Modified: | 23 May 2026 00:06 |
| DOI: | 10.3115/1119282.1119294 |
| Open Access URL: | http://anthology.aclweb.org/W/W03/W03-1812.pdf |
| Related Websites: | |
| URI: | https://livrepository.liverpool.ac.uk/id/eprint/3001735 |
| Disclaimer: | The University of Liverpool is not responsible for content contained on other websites from links within repository metadata. Please contact us if you notice anything that appears incorrect or inappropriate. |
Altmetric
Altmetric