Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
software:rewsspacepackage [2010/12/07 12:33] eapontep [Testing] |
software:rewsspacepackage [2010/12/07 12:53] (current) eapontep |
||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | ==== STATE ==== | ||
| - | It is working in my computer. I will write a script and run it on the whole American National Corpus and test results, memory and the type of vector produced! I talked with the Adms. and in the next few days the package should be running on the server. | ||
| - | |||
| - | |||
| ==== General ==== | ==== General ==== | ||
| Line 29: | Line 25: | ||
| * A new directory should have been created. Go to the directory and use the command< | * A new directory should have been created. Go to the directory and use the command< | ||
| * If you want to make direct use of the .jar, you would also like to use the command | * If you want to make direct use of the .jar, you would also like to use the command | ||
| - | |||
| - | ==== Technical Issues ==== | ||
| - | |||
| ==== Testing ==== | ==== Testing ==== | ||
| Line 76: | Line 69: | ||
| - | {{: | + | {{: |
| + | |||
| + | Visual inspection suggest that problems regarding the density of the vector space are solved by using MATLAB as the defauld algorithm. | ||
| + | |||
| + | {{: | ||
| + | |||
| + | Finally, I compared the scalability of Random Indexing and LSA (using SVDLIBC with 100 dimension): | ||
| + | |||
| + | {{: | ||
| + | It is clear that LSA can hardly handle large corpora. Although the results are different in the case of Random Indexing, they suggest a similar conclusion. | ||
| + | I wrote a simple script that automatically document the results of every experiment. It can be found under the key name " | ||