Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
|
software:rewsemvector [2010/10/31 12:46] eapontep created |
software:rewsemvector [2010/11/01 14:07] (current) |
||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | === General === | + | ==== General |
| - | === Installation Instruction === | + | **SemanticVectors: |
| - | ** Please read point 2.6 Before starting with these instructions. | ||
| - | ** | + | |
| + | | ||
| + | * Uses a Random Projection Algorithm | ||
| + | * Doesn' | ||
| + | * [[http:// | ||
| + | * [[http:// | ||
| + | * [[http:// | ||
| + | * You NEED [[http:// | ||
| - | - Install Lucene: | + | ==== Installation Instruction ==== |
| - | - You will need JDK and ANT: | + | |
| - | * JDK: Develpment environment for java | + | |
| - | * Ant: A build-tool for java, similar to make. | + | |
| - | - You can download both from a repository: | + | |
| - | <code> | + | **Please read the very last point before starting this tutorial.** |
| - | sudo apt-get install openjdk-6-jdk | + | |
| + | This is an installation tutorial for dolts like me, that means, I wrote it after fighting for hours with my lack of intelligence. I hope it helps you to install SemanticVectors quickly. You might need several things before installing SV, in particular a java developer kit, a building tool (ant) and Lucene, a library to search text. Everything is documented in this tutorial. | ||
| + | |||
| + | == Install Lucene: == | ||
| + | - You will need JDK and ANT: | ||
| + | - [[https:// | ||
| + | - [[http:// | ||
| + | - You can download both from a repository: | ||
| sudo apt-get install ant | sudo apt-get install ant | ||
| sudo apt-get install ant-doc | sudo apt-get install ant-doc | ||
| - | </code> | + | </file> |
| - | - | + | - |
| - | <code> ant -version </code> | + | |
| - | < | + | |
| - | + | ||
| - [[http:// | - [[http:// | ||
| + | - Untar the gz file in your desired location. | ||
| + | - Go to the target directory. | ||
| + | - If every thing is allright (if ant is working properly), then run ant in the current directory; it would automatically detect build.xml. <file bash>ant </ | ||
| + | - You will need to set the appropiate CLASSPATH. CLASSPATH specify the location of Java libraries. To check up if the variable is declared in your system:< | ||
| + | - Althought there are (supposedly) several methods to set CLASSPATH the only one which worked for me was the following: You will have to edit //bashrc// <file bash> | ||
| + | - At the end of the document add: <file bash> | ||
| + | - Reboot your computer. Now Lucene should be working. | ||
| + | - In order to check if everything is all right you should do the following. | ||
| + | - [[http:// | ||
| + | - Untar the corpus in a desired location. Normally the directory will we: bible_chapters. Go to that directory and run the following command: <file bash> | ||
| + | - In my case:< | ||
| + | - If everything runs ok, you should be done. An index directory will be created in bible_chapters. You should be able to perform some simple test using the demo library included in lucene. | ||
| + | == Install SemanticVector== | ||
| + | * This is the binary installation method. You could actually built SemanticVector by downloading the appropriate file and using again ant. I don't recommend it. | ||
| + | - [[http:// | ||
| + | - Move this file to the desire location | ||
| + | - Again open bashrc and add to CLASSPATH the desire file. In my case it looks like: <file bash> | ||
| + | - Reboot your Computer. | ||
| + | - By now every thing should be working. Go to the directory where you ran lucene. Run the following command: <file bash> | ||
| + | - Now you are ready to use SemanticVectors. In this point I realized that there is a (probably) very serious bug. Since lucene is constantly actualized and SV depends on Lucene, there are several compatibility issues between both. In particular, a class form Lucene has been deprecated in the last version. I checked in the official community and there are not answers to this issue, although others have reported the same problem. Maybe a I get some feedback regarding this problem. Otherwise an older version of the software will be necessary. | ||
| + | --- // | ||
| - | - Untar the gz file in your desired location. | ||
| - | |||
| - | 1.4. Go to the directory | ||
| - | |||
| - | 1.5. If every thing is allright (if ant is working properly), then run ant in the current directory; it would directly associate build.xml. | ||
| - | |||
| - | 1.6. You will need to set the appropiate CLASSPATH. Classpath specify the location of Java libraries. To check up if the variable is declared in your system: | ||
| - | |||
| - | echo $CLASSPATH | ||
| - | |||
| - | 1.7. Althought there are (supposedly) several methods to set CLASSPATH the only one which worked for me was the following: You will have to edit the file bashrc | ||
| - | |||
| - | |||
| - | in the end of the document add: | ||
| - | |||
| - | export {location of lucene}/ | ||
| - | |||
| - | in my case: | ||
| - | |||
| - | export CLASSPATH="/ | ||
| - | |||
| - | Reboot your computer. Now lucene should be working | ||
| - | |||
| - | 1.8. In order to check if everything is allrigh you should do the following. Download an example corpus from: " | ||
| - | |||
| - | java org.apache.lucene.demo.IndexFiles {comple location of lucene} | ||
| - | |||
| - | In my case: | ||
| - | |||
| - | java org.apache.lucene.demo.IndexFiles / | ||
| - | |||
| - | If everything runs without a problem you should be done. | ||
| - | |||
| - | 2. Install SemanticVector: | ||
| - | |||
| - | 2.1. Dowload SemanticVector from: http:// | ||
| - | |||
| - | 2.2. Move this file to the desire location | ||
| - | |||
| - | 2.3. Again open bashrc and to CLASSPATH the desire file. In my case: | ||
| - | |||
| - | export CLASSPATH="/ | ||
| - | |||
| - | 2.4. Reboot your Computer. | ||
| - | |||
| - | 2.5. By now every thing should be working. Go to the directory where you have runned lucene. Run the following command | ||
| - | |||
| - | java pitt.search.semanticvectors.BuildIndex / | ||
| - | |||
| - | 2.6. Now you are ready to use SemanticVectors. However in this point I reallized that there is a (probably) very serious bug. Since lucene is constantly actualized and SV depends on Lucene, there are several compatibility issues between both. In particular, a class form Lucene has been deprecated in the last version. I cheched in the official community and there are not answers to this issue, althought others have reported the same problem. Maybe a I get some feedback regarding this problem. Otherwise an older version of the software would be necessary. | ||