Distributional Semantics – A Practical Introduction (ESSLLI 2016 & 2018)
Schedule & handouts
Day 1: Introduction
Presentation slides (PDF, 1.1 MB) – handout (PDF, 0.9 MB) – R code
motivation and brief history of distributional semantics
common DSM architectures & prototypical applications
first practical exercises with the wordspace
package
Day 2: The parameters of a DSM
Presentation slides (PDF, 1.3 MB) – handout (PDF, 1.0 MB) – R code – practice: input formats – exercise (DSM parameters)
taxonomy of DSM parameters: context representation, feature scaling, normalization and standardization, distance/similarity measures, dimensionality reduction
overview of common parameter settings & best-practice recommendations
practical exercises: building DSMs and exploring their parameters
Day 3: Applications and evaluation
Presentation slides (PDF, 2.0 MB) – handout (PDF, 1.8 MB) – R code – exercise (evaluation)
attributional and relational similarity, clustering and semantic categorization, multiple-choice tasks
insights from recent parameter evaluation studies
practical exercises: implementation and evaluation of selected tasks
Day 4: Elements of matrix algebra
Day 5: Making sense of DSMs
mathematical properties of and relations between different types of DSM
singular value decomposition (SVD) as a latent class model
comparison with neural vector embeddings