Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
data:verb_categorization [2008/01/20 15:18] alexlenci |
— (current) | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | ====== Task 1.c - Verb Categorization ====== | ||
| - | |||
| - | ==== Introduction ==== | ||
| - | |||
| - | The goal of the sub-task is to group verbs into semantic categories. | ||
| - | |||
| - | The {{verb.categorization.dataset.tar.gz |data set}} consists of 45 verbs, taken form the feature norms described D. P. Vinson & G. Vigliocco (2007), “Semantic Feature Production Norms for a Large Set of Objects and Events”, //Behavior Research Methods//. The verbs in the data set are classified into 9 semantic classes, closely following the classification proposed in Levin (1993) and partially adapted from those in Vinson & Vigliocco (2007). | ||
| - | |||
| - | |||
| - | ==== Task Operationalization ==== | ||
| - | |||
| - | We operationalize concrete nouns categorization as a clustering task. Since the data set is organized hierarchically, | ||
| - | we will run three clustering experiments, | ||
| - | |||
| - | * **6-way clustering** - models will be tested on their ability to categorize the nouns into the most fine-grained classes of the dataset: //bird// (" | ||
| - | * **3-way clustering** - models will be tested on their ability to categorize the nouns into 3 classes: //animal// (superordinate of //bird// and // | ||
| - | |||
| - | * **2-way clustering** - models will be tested on their ability to categorize the nouns into the two top classes: //natural// (superordinate of //animal// and // | ||
| - | |||
| - | To abstract away from differences stemming from any specific clustering method, you are asked to run your experiments with the //k-means// algorithm available in [[http:// | ||
| - | |||
| - | |||
| - | ==== Task Evaluation ==== | ||
| - | |||
| - | Evaluation will be carried in two stages: | ||
| - | |||
| - | 1. **quantitative evaluation** - results will be evaluated with respect to the two measures for cluster quality available in CLUTO: //purity// and //entropy// (cf. Zhao, Y. and G. Karypis (2002), " | ||
| - | |||
| - | 2. **qualitative evaluation** - participants will be asked to perform a fine-grained error analysis, focussing on critical nouns, hard classes, etc. Details about this type of evaluation will be provided later on. | ||
| - | |||
| - | Back to [[Start]] | ||
| - | |||