Differences
This shows you the differences between two versions of the page.
data:start [2008/01/20 19:05] marco |
data:start [2010/11/01 14:07] |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Data sets for the evaluation of word space models ====== | ||
- | |||
- | This page contains a developing list of tasks, sub-tasks and corresponding (sub-)data-sets. | ||
- | |||
- | Other tasks or sub-tasks might be added in the near future. | ||
- | |||
- | |||
- | ===== Ordered by task categories ===== | ||
- | |||
- | |||
- | ==== Task 1: Free Association ==== | ||
- | |||
- | It is tempting to make a connection between the **statistical association** patterns of words -- first-order (// | ||
- | |||
- | |||
- | * [[Correlation with Free Association Norms]] | ||
- | |||
- | |||
- | |||
- | ==== Task 2: Categorization ==== | ||
- | |||
- | Categorization tasks play a prominent role in cognitive research on concepts. In this type of tasks, subjects | ||
- | are typically asked to assign experimental items - objects, images, words - | ||
- | to a given category or to group together items belonging to the same category. | ||
- | Since categorization presupposes an understanding of the relationship between the items in a category, it is regarded as a key source of evidence on the organization and structure of the human conceptual system. | ||
- | |||
- | In the present task, computational models will be tested on their ability to properly group | ||
- | words into semantic categories. The task is organized into three sub-tasks, focussing on different areas | ||
- | of the lexicon and/or semantic dimensions: | ||
- | |||
- | * [[Concrete Noun Categorization]] | ||
- | * [[Abstract/ | ||
- | * [[Verb Categorization]] | ||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | ==== Task 3: Property Generation ==== | ||
- | |||
- | The ability to describe a concept in terms of its salient properties is an important feature of human conceptual cognition. In this task, we compare human-generated //norms// collected by psychologists to the properties generated by computational models. | ||
- | |||
- | * [[Comparison with Speaker-Generated Features]] | ||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | ===== Source corpus ===== | ||
- | |||
- | You can train your word space on your favorite corpus. However, we also invite you, if this is suitable, to experiment with the [[http:// | ||