Data sets for the evaluation of word space models

This page contains a developing list of tasks, sub-tasks and corresponding (sub-)data-sets.

Other tasks or sub-tasks might be added in the near future.

Ordered by task categories

Task 1: Categorization

Categorization tasks play a prominent role in cognitive research on concepts. In this type of tasks, subjects are typically asked to assign experimental items - objects, images, words - to a given category or to group together items belonging to the same category. Since categorization presupposes an understanding of the relationship between the items in a category, it is regarded as a key source of evidence on the organization and structure of the human conceptual system.

In the present task, computational models will be tested on their ability to properly group words into semantic categories. The task is organized into three sub-tasks, focussing on different areas of the lexicon and/or semantic dimensions:

Task 2: Free Association

This task will be described in the near future.

Correlation with Free Association Norms

Task 3: Property Generation

The ability to describe a concept in terms of its salient properties is an important feature of human conceptual cognition. In this task, we compare human-generated norms collected by psychologists to the properties generated by computational models.

Comparison with Speaker-Generated Features

You are here: start » data

Table of Contents

Data sets for the evaluation of word space models

Ordered by task categories

Task 1: Categorization

Task 2: Free Association

Task 3: Property Generation

Navigation

Search

Toolbox