Both sides previous revision
Previous revision
Next revision
|
Previous revision
Next revision
Both sides next revision
|
software:start [2010/10/31 12:18] eapontep |
software:start [2011/02/02 09:25] schtepf [Off-the-shelf packages for DSM] |
====== DSM Software and Data Sets ====== | ====== DSM Software and Data Sets ====== |
| |
{{:icon_warn.png?32 |Under Construction}} | {{:under_construction.png?48 |Under Construction}} |
| |
| \\ |
**This page is under construction.** | **This page is under construction.** |
\\ | \\ |
\\ | \\ |
| |
| ===== Useful corpora ===== |
| |
| * The Westbury Lab at Alberta has a [[http://www.psych.ualberta.ca/~westburylab/downloads/westburylab.wikicorp.download.html|preprocessed (cleaned) Wikipedia Corpus]] from an April 2010 dump. The WaCky initiative offers [[http://wacky.sslmit.unibo.it/doku.php?id=corpora|WaCkypedia, a dependency-parsed Wikipedia Corpus]] from a 2009 dump. Both corpora only cover the //English Wikipedia//. |
| |
===== Off-the-shelf packages for DSM ===== | ===== Off-the-shelf packages for DSM ===== |
| |
* [[http://infomap-nlp.sourceforge.net/|Infomap NLP]][[rewInfoMap|Review]] | * [[GenSim]]: incremental SVD & LSA in python, easily deployable to clusters. |
| * [[http://infomap-nlp.sourceforge.net/|Infomap NLP]] |
| * [[rewInfoMap|Review]] |
* [[http://www.psych.ualberta.ca/~westburylab/downloads/HiDEx.download.html|HiDEx]], the High-Dimensional Explorer | * [[http://www.psych.ualberta.ca/~westburylab/downloads/HiDEx.download.html|HiDEx]], the High-Dimensional Explorer |
* [[http://code.google.com/p/semanticvectors|Semantic Vectors]] | * [[hiDex|Review]] |
| * [[http://code.google.com/p/semanticvectors|SemanticVectors]] |
| * [[rewSemVector|Review]] |
* [[http://senseclusters.sourceforge.net/|SenseClusters]] | * [[http://senseclusters.sourceforge.net/|SenseClusters]] |
| * [[rewSenseClusters|Review]] |
* [[http://code.google.com/p/airhead-research/|S-Space Package]] (work in progress) | * [[http://code.google.com/p/airhead-research/|S-Space Package]] (work in progress) |
| *[[rewSSpacePackage|Review]] |
* [[http://code.google.com/p/wordspaces/|Wordspaces]] (interactive exploration) | * [[http://code.google.com/p/wordspaces/|Wordspaces]] (interactive exploration) |
* [[http://divisi.media.mit.edu/|Divisi]] (semantic networks, tensors & SVD in Python) | * [[rewWordSpaces|Review]] |
| * [[http://csc.media.mit.edu/docs/divisi2|Divisi]] (semantic networks, tensors & SVD in Python) |
| * [[rewDivisi2|Review]] |
| * [[miscellaneous|Miscellaneous]] |
| * [[http://scgroup20.ceid.upatras.gr:8000/tmg/|Text to Matrix Generator (TMG)]] (Matlab toolbox for text mining) |