Differences
This shows you the differences between two versions of the page.
software:rewsemvector [2010/10/31 12:46] eapontep created |
software:rewsemvector [2010/11/01 14:07] |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | === General === | ||
- | === Installation Instruction === | ||
- | |||
- | ** Please read point 2.6 Before starting with these instructions. | ||
- | |||
- | ** Before installing SemanticVectors you WILL need Lucene (a library for java to search text). | ||
- | |||
- | - Install Lucene: | ||
- | - You will need JDK and ANT: | ||
- | * JDK: Develpment environment for java | ||
- | * Ant: A build-tool for java, similar to make. | ||
- | - You can download both from a repository: | ||
- | |||
- | < | ||
- | sudo apt-get install openjdk-6-jdk | ||
- | sudo apt-get install ant | ||
- | sudo apt-get install ant-doc | ||
- | </ | ||
- | - If every thing worked out, you could test the following command on your terminal: | ||
- | < | ||
- | < | ||
- | |||
- | - [[http:// | ||
- | |||
- | - Untar the gz file in your desired location. | ||
- | |||
- | 1.4. Go to the directory | ||
- | |||
- | 1.5. If every thing is allright (if ant is working properly), then run ant in the current directory; it would directly associate build.xml. | ||
- | |||
- | 1.6. You will need to set the appropiate CLASSPATH. Classpath specify the location of Java libraries. To check up if the variable is declared in your system: | ||
- | |||
- | echo $CLASSPATH | ||
- | |||
- | 1.7. Althought there are (supposedly) several methods to set CLASSPATH the only one which worked for me was the following: You will have to edit the file bashrc | ||
- | |||
- | |||
- | in the end of the document add: | ||
- | |||
- | export {location of lucene}/ | ||
- | |||
- | in my case: | ||
- | |||
- | export CLASSPATH="/ | ||
- | |||
- | Reboot your computer. Now lucene should be working | ||
- | |||
- | 1.8. In order to check if everything is allrigh you should do the following. Download an example corpus from: " | ||
- | |||
- | java org.apache.lucene.demo.IndexFiles {comple location of lucene} | ||
- | |||
- | In my case: | ||
- | |||
- | java org.apache.lucene.demo.IndexFiles / | ||
- | |||
- | If everything runs without a problem you should be done. | ||
- | |||
- | 2. Install SemanticVector: | ||
- | |||
- | 2.1. Dowload SemanticVector from: http:// | ||
- | |||
- | 2.2. Move this file to the desire location | ||
- | |||
- | 2.3. Again open bashrc and to CLASSPATH the desire file. In my case: | ||
- | |||
- | export CLASSPATH="/ | ||
- | |||
- | 2.4. Reboot your Computer. | ||
- | |||
- | 2.5. By now every thing should be working. Go to the directory where you have runned lucene. Run the following command | ||
- | |||
- | java pitt.search.semanticvectors.BuildIndex / | ||
- | |||
- | 2.6. Now you are ready to use SemanticVectors. However in this point I reallized that there is a (probably) very serious bug. Since lucene is constantly actualized and SV depends on Lucene, there are several compatibility issues between both. In particular, a class form Lucene has been deprecated in the last version. I cheched in the official community and there are not answers to this issue, althought others have reported the same problem. Maybe a I get some feedback regarding this problem. Otherwise an older version of the software would be necessary. |