Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
software:rewsemvector [2010/10/31 12:46] eapontep created |
software:rewsemvector [2010/11/01 14:07] (current) |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | === General === | + | ==== General |
- | === Installation Instruction === | + | **SemanticVectors: |
- | ** Please read point 2.6 Before starting with these instructions. | ||
- | ** | + | |
+ | | ||
+ | * Uses a Random Projection Algorithm | ||
+ | * Doesn' | ||
+ | * [[http:// | ||
+ | * [[http:// | ||
+ | * [[http:// | ||
+ | * You NEED [[http:// | ||
- | - Install Lucene: | + | ==== Installation Instruction ==== |
- | - You will need JDK and ANT: | + | |
- | * JDK: Develpment environment for java | + | |
- | * Ant: A build-tool for java, similar to make. | + | |
- | - You can download both from a repository: | + | |
- | <code> | + | **Please read the very last point before starting this tutorial.** |
- | sudo apt-get install openjdk-6-jdk | + | |
+ | This is an installation tutorial for dolts like me, that means, I wrote it after fighting for hours with my lack of intelligence. I hope it helps you to install SemanticVectors quickly. You might need several things before installing SV, in particular a java developer kit, a building tool (ant) and Lucene, a library to search text. Everything is documented in this tutorial. | ||
+ | |||
+ | == Install Lucene: == | ||
+ | - You will need JDK and ANT: | ||
+ | - [[https:// | ||
+ | - [[http:// | ||
+ | - You can download both from a repository: | ||
sudo apt-get install ant | sudo apt-get install ant | ||
sudo apt-get install ant-doc | sudo apt-get install ant-doc | ||
- | </code> | + | </file> |
- | - | + | - |
- | <code> ant -version </code> | + | |
- | < | + | |
- | + | ||
- [[http:// | - [[http:// | ||
+ | - Untar the gz file in your desired location. | ||
+ | - Go to the target directory. | ||
+ | - If every thing is allright (if ant is working properly), then run ant in the current directory; it would automatically detect build.xml. <file bash>ant </ | ||
+ | - You will need to set the appropiate CLASSPATH. CLASSPATH specify the location of Java libraries. To check up if the variable is declared in your system:< | ||
+ | - Althought there are (supposedly) several methods to set CLASSPATH the only one which worked for me was the following: You will have to edit //bashrc// <file bash> | ||
+ | - At the end of the document add: <file bash> | ||
+ | - Reboot your computer. Now Lucene should be working. | ||
+ | - In order to check if everything is all right you should do the following. | ||
+ | - [[http:// | ||
+ | - Untar the corpus in a desired location. Normally the directory will we: bible_chapters. Go to that directory and run the following command: <file bash> | ||
+ | - In my case:< | ||
+ | - If everything runs ok, you should be done. An index directory will be created in bible_chapters. You should be able to perform some simple test using the demo library included in lucene. | ||
+ | == Install SemanticVector== | ||
+ | * This is the binary installation method. You could actually built SemanticVector by downloading the appropriate file and using again ant. I don't recommend it. | ||
+ | - [[http:// | ||
+ | - Move this file to the desire location | ||
+ | - Again open bashrc and add to CLASSPATH the desire file. In my case it looks like: <file bash> | ||
+ | - Reboot your Computer. | ||
+ | - By now every thing should be working. Go to the directory where you ran lucene. Run the following command: <file bash> | ||
+ | - Now you are ready to use SemanticVectors. In this point I realized that there is a (probably) very serious bug. Since lucene is constantly actualized and SV depends on Lucene, there are several compatibility issues between both. In particular, a class form Lucene has been deprecated in the last version. I checked in the official community and there are not answers to this issue, although others have reported the same problem. Maybe a I get some feedback regarding this problem. Otherwise an older version of the software will be necessary. | ||
+ | --- // | ||
- | - Untar the gz file in your desired location. | ||
- | |||
- | 1.4. Go to the directory | ||
- | |||
- | 1.5. If every thing is allright (if ant is working properly), then run ant in the current directory; it would directly associate build.xml. | ||
- | |||
- | 1.6. You will need to set the appropiate CLASSPATH. Classpath specify the location of Java libraries. To check up if the variable is declared in your system: | ||
- | |||
- | echo $CLASSPATH | ||
- | |||
- | 1.7. Althought there are (supposedly) several methods to set CLASSPATH the only one which worked for me was the following: You will have to edit the file bashrc | ||
- | |||
- | |||
- | in the end of the document add: | ||
- | |||
- | export {location of lucene}/ | ||
- | |||
- | in my case: | ||
- | |||
- | export CLASSPATH="/ | ||
- | |||
- | Reboot your computer. Now lucene should be working | ||
- | |||
- | 1.8. In order to check if everything is allrigh you should do the following. Download an example corpus from: " | ||
- | |||
- | java org.apache.lucene.demo.IndexFiles {comple location of lucene} | ||
- | |||
- | In my case: | ||
- | |||
- | java org.apache.lucene.demo.IndexFiles / | ||
- | |||
- | If everything runs without a problem you should be done. | ||
- | |||
- | 2. Install SemanticVector: | ||
- | |||
- | 2.1. Dowload SemanticVector from: http:// | ||
- | |||
- | 2.2. Move this file to the desire location | ||
- | |||
- | 2.3. Again open bashrc and to CLASSPATH the desire file. In my case: | ||
- | |||
- | export CLASSPATH="/ | ||
- | |||
- | 2.4. Reboot your Computer. | ||
- | |||
- | 2.5. By now every thing should be working. Go to the directory where you have runned lucene. Run the following command | ||
- | |||
- | java pitt.search.semanticvectors.BuildIndex / | ||
- | |||
- | 2.6. Now you are ready to use SemanticVectors. However in this point I reallized that there is a (probably) very serious bug. Since lucene is constantly actualized and SV depends on Lucene, there are several compatibility issues between both. In particular, a class form Lucene has been deprecated in the last version. I cheched in the official community and there are not answers to this issue, althought others have reported the same problem. Maybe a I get some feedback regarding this problem. Otherwise an older version of the software would be necessary. |