Differences
This shows you the differences between two versions of the page.
software:rewsspacepackage [2010/12/07 12:33] eapontep [Testing] |
software:rewsspacepackage [2010/12/07 12:53] (current) eapontep |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ==== STATE ==== | ||
- | It is working on my computer. I will write a script, run it on the whole American National Corpus, and test the results, memory usage, and the type of vector produced. I talked with the admins, and the package should be running on the server in the next few days. | ||
- | |||
- | |||
==== General ==== | ==== General ==== | ||
Line 29: | Line 25: | ||
* A new directory should have been created. Go to the directory and use the command< | * A new directory should have been created. Go to the directory and use the command< | ||
* If you want to use the .jar directly, you may also want to use the command | * If you want to use the .jar directly, you may also want to use the command | ||
- | |||
- | ==== Technical Issues ==== | ||
- | |||
==== Testing ==== | ==== Testing ==== | ||
Line 76: | Line 69: | ||
- | {{: | + | {{: |
+ | |||
+ | Visual inspection suggests that problems regarding the density of the vector space are solved by using MATLAB as the default algorithm. | ||
+ | |||
+ | {{: | ||
+ | |||
+ | Finally, I compared the scalability of Random Indexing and LSA (using SVDLIBC with 100 dimensions): | ||
+ | |||
+ | {{: | ||
+ | It is clear that LSA can hardly handle large corpora. Although the results are different in the case of Random Indexing, they suggest a similar conclusion. | ||
+ | I wrote a simple script that automatically documents the results of every experiment. It can be found under the key name " |