

Distributional Semantic Models (ESSLLI 2009)


Online access (Web interfaces)

Off-the-shelf packages for DSM

Downloads

Under construction

Data sets

  • Verb + object noun co-occurrences (tokens) extracted from the British National Corpus: bnc_vobj_filtered.txt.gz (15 MB)
  • A 5-million-word corpus of Harry Potter fan fiction in lemma_pos format (pre-cleaned): potter_tokens.txt.gz (8.9 MB)
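
As a rough illustration of working with the lemma_pos corpus, the following Python sketch reads a gzipped token file and counts (lemma, POS) pairs. The exact file layout is not documented on this page, so the sketch assumes whitespace-separated tokens, each joining lemma and POS tag with an underscore, in UTF-8 encoding. The function names split_lemma_pos and count_lemma_pos are hypothetical helpers, not part of any course material.

```python
import gzip
from collections import Counter


def split_lemma_pos(token):
    """Split a 'lemma_pos' token at its LAST underscore.

    Assumption: the POS tag follows the final underscore, so lemmas
    that themselves contain underscores stay intact.
    Tokens without an underscore get an empty POS tag.
    """
    lemma, sep, pos = token.rpartition("_")
    if not sep:
        return token, ""
    return lemma, pos


def count_lemma_pos(path, max_lines=None):
    """Count (lemma, pos) pairs in a gzipped text file.

    Assumptions: tokens are whitespace-separated and the file is UTF-8;
    undecodable bytes are replaced rather than raising an error.
    max_lines optionally limits how many lines are read.
    """
    counts = Counter()
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as fh:
        for line_no, line in enumerate(fh):
            if max_lines is not None and line_no >= max_lines:
                break
            for token in line.split():
                counts[split_lemma_pos(token)] += 1
    return counts
```

For example, count_lemma_pos("potter_tokens.txt.gz", max_lines=1000).most_common(10) would list the ten most frequent pairs in the first 1000 lines, provided the file has been downloaded to the working directory and matches the assumed layout.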