Abstract : The aim of this paper is to develop (i) a general framework for the analysis of verb-noun (VN) collocations in English and Romanian, and (ii) a system for the extraction of VN-collocations from large tagged and annotated corpora. We identify VN-collocations in two steps: (i) by calculation of the frequent lexical co-occurrences of each VN-pair, and (ii) the identification of the most typical lexico-grammatical constructions in which each VN-pair is involved in.
Amalia Todirascu, Christopher Gledhill, Dan Stefânescu. Extracting Collocations in Contexts. LNAI 5603, Springer-Verlag, 2009, Responding to Information Society Challenges: New Advances in Hum an Language Technologies. ⟨hal-01220046⟩