Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
data:verb_categorization [2008/02/24 21:25] alexlenci |
— (current) | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Task 1.c - Verb Categorization ====== | ||
- | |||
- | ==== Introduction ==== | ||
- | |||
- | The goal of the sub-task is to group verbs into semantic categories. | ||
- | |||
- | The {{verb.categorization.dataset.txt.gz |data set}} consists of 45 verbs, belonging to 9 semantic classes. The classification scheme is inspired by P. Vinson & G. Vigliocco (2007), “Semantic Feature Production Norms for a Large Set of Objects and Events”, //Behavior Research Methods//, which in turn closely follows the classification proposed in Levin (1993). | ||
- | |||
- | |||
- | |||
- | ==== Task Operationalization ==== | ||
- | |||
- | We operationalize verb categorization as a clustering task. Since the data set is organized hierarchically, | ||
- | we will run two clustering experiments, | ||
- | |||
- | * **9-way clustering** - models will be tested on their ability to categorize the verbs into the most fine-grained classes of the dataset: // | ||
- | * **5-way clustering** - models will be tested on their ability to categorize the verbs into 5 classes: // | ||
- | |||
- | To abstract away from differences stemming from any specific clustering method, you are asked to run your experiments with the //Repeated Bisections// | ||
- | |||
- | |||
- | ==== Task Evaluation ==== | ||
- | |||
- | Evaluation will be carried out in two stages: | ||
- | |||
- | 1. **coarse-grained evaluation** - results will be evaluated with respect to the standard measures for cluster quality available in CLUTO (again, see the page on the [[concrete_noun_categorization|concrete noun categorization task]]). | ||
- | |||
- | 2. **fine-grained evaluation** - verb semantic classification is notoriously hard. Any a priori classification scheme runs the risk of being defied by the highly polysemous and multidimensional character of verbs. In this second stage, evaluation will therefore focus on specific verbs selected as "hard cases", | ||
- | |||
- | Back to the [[Start# | ||
- | |||
- | Back to [[Start]] | ||
- | |||