Show simple item record

dc.contributor.authorPekar, Viktor
dc.date.accessioned2011-03-31T18:02:32Z
dc.date.available2011-03-31T18:02:32Z
dc.date.issued2001
dc.identifier.issn0894-4539
dc.identifier.urihttp://hdl.handle.net/10150/126619
dc.descriptionPublished as Coyote Papers: Working Papers in Linguistics, Language in Cognitive Scienceen_US
dc.description.abstractThe paper presents a preliminary evaluation of a corpus-based representation of individual words and a method to generalize over these representations. The vector space is represented in a way that gives weight to the fact that words co-occur rather than to the frequency of their co-occurrence. This format is hypothesized to allow for reducing the vector space, minimizing negative effects of data sparseness and enhancing ability of the model to generalize words to novel contexts. The model is assessed by comparing computer-calculated probabilities of different verb-argument combinations with human subjects' judgements about appropriateness of these combinations. The results indicate that there is a correlation between the probabilities calculated by the model and the subjects' evaluations.
dc.language.isoen_USen_US
dc.publisherUniversity of Arizona Linguistics Circle (Tucson, Arizona)en_US
dc.relation.urlhttps://coyotepapers.sbs.arizona.edu/en_US
dc.rightsCopyright © is held by the author(s).en_US
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en_US
dc.titleModeling semantic coherence from corpus data: the fact and the frequency of a co-occurrenceen_US
dc.typetexten_US
dc.typeArticleen_US
dc.contributor.departmentBashkir State Universityen_US
dc.identifier.journalCoyote Papersen_US
dc.description.collectioninformationThe Coyote Papers are made available by the Arizona Linguistics Circle at the University of Arizona and the University of Arizona Libraries. Contact coyotepapers@email.arizona.edu with questions about these materials.en_US
dc.source.journaltitleCoyote Papers
refterms.dateFOA2018-06-12T10:51:13Z
html.description.abstractThe paper presents a preliminary evaluation of a corpus-based representation of individual words and a method to generalize over these representations. The vector space is represented in a way that gives weight to the fact that words co-occur rather than to the frequency of their co-occurrence. This format is hypothesized to allow for reducing the vector space, minimizing negative effects of data sparseness and enhancing ability of the model to generalize words to novel contexts. The model is assessed by comparing computer-calculated probabilities of different verb-argument combinations with human subjects' judgements about appropriateness of these combinations. The results indicate that there is a correlation between the probabilities calculated by the model and the subjects' evaluations.


Files in this item

Thumbnail
Name:
azu_alc_cp_n12_1_8_w.pdf
Size:
146.7Kb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record