General

Installation Instructions

Please read point 2.6 before starting with these instructions. Before installing SemanticVectors you will need Lucene (a Java library for text search).

  1. Install Lucene:
    1. You will need the JDK and Ant:
      • JDK: Development environment for Java
      • Ant: A build tool for Java, similar to make.
    2. You can install both from the package repository:
sudo apt-get install openjdk-6-jdk
sudo apt-get install ant 
sudo apt-get install ant-doc
    3. If everything worked, you can test the installation by running the following command in your terminal; it should print the Ant version:
 ant -version 
 Apache Ant version 1.7.1 compiled on September 8 2010 
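
As a quick sanity check before moving on, the sketch below reports whether the tools used in these steps are available on the PATH. The function name check_tool is hypothetical and not part of any package; it relies only on the standard `command -v` shell builtin.

```shell
# Report whether a command is available on the PATH.
# check_tool is a hypothetical helper name used only for this example.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found"
    else
        echo "$1: missing"
    fi
}

# The tools needed for the steps in this guide.
for tool in java javac ant; do
    check_tool "$tool"
done
```

If any tool is reported as missing, repeat the corresponding apt-get step before continuing.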
	
-  [[http://ftp.fernuni-hagen.de/ftp-dir/pub/mirrors/www.apache.org//lucene/java/lucene-3.0.2.tar.gz|Download Lucene]]

- Untar the .tar.gz file in your desired location.

		1.4. Go to the extracted directory.
		1.5. If everything is all right (that is, if Ant is working properly), run ant in that directory; it picks up build.xml automatically.
		1.6. You will need to set the appropriate CLASSPATH. The CLASSPATH variable specifies the location of Java libraries. To check whether the variable is set on your system:
				echo $CLASSPATH
		1.7. Although there are (supposedly) several methods to set CLASSPATH, the only one that worked for me was the following: edit the file ~/.bashrc.
				
                          At the end of the file add:
				
				export CLASSPATH="{location of lucene}/lucene-3.0.2/lucene-demos-3.0.2.jar:{location of lucene}/lucene-3.0.2/lucene-core-3.0.2.jar"
		    In my case:
				export CLASSPATH="/home/eduardo/programas/lucene/lucene-3.0.2/lucene-demos-3.0.2.jar:/home/eduardo/programas/lucene/lucene-3.0.2/lucene-core-3.0.2.jar"
		    Reboot your computer. Lucene should now be working.
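
To avoid typing the Lucene location twice, the export line can be generated from a single variable. This is only a sketch: LUCENE_HOME and lucene_classpath are assumed names introduced for illustration, and the default path is the one from my own setup. The snippet prints the line and does not modify any file.

```shell
# Build the CLASSPATH value for the two Lucene jars used in this guide.
# lucene_classpath is a hypothetical helper; $1 is the lucene-3.0.2 directory.
lucene_classpath() {
    echo "$1/lucene-demos-3.0.2.jar:$1/lucene-core-3.0.2.jar"
}

# LUCENE_HOME is an assumed variable name; adjust the default to your own path.
LUCENE_HOME="${LUCENE_HOME:-$HOME/programas/lucene/lucene-3.0.2}"

# Print the line to append to ~/.bashrc (nothing is written by this sketch).
echo "export CLASSPATH=\"$(lucene_classpath "$LUCENE_HOME")\""
```

After the line has been added to ~/.bashrc, opening a new terminal or running `. ~/.bashrc` in the current one should also load the variable, so a full reboot may not be strictly necessary.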
		
		1.8. To check that everything is all right, do the following. Download an example corpus from: "http://code.google.com/p/semanticvectors/downloads/detail?name=kjbible-corpus.tar.gz&can=2&q=". Untar the corpus in a desired location. Normally the directory will be: bible_chapters. Go to that directory and run the following command:
			 	java org.apache.lucene.demo.IndexFiles {complete location of lucene}

                         In my case:
			 	java org.apache.lucene.demo.IndexFiles /home/eduardo/lucene-3.0.2
		   If everything runs without a problem, the Lucene installation is done.
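
If the java command above fails because a class cannot be found, one likely cause is a CLASSPATH entry that points to a file that does not exist. The following sketch walks through the colon-separated entries and reports each one. The function name check_classpath is hypothetical; it only tests whether each path exists and does not inspect the jar contents.

```shell
# Check that every entry of a colon-separated class path exists on disk.
# check_classpath is a hypothetical helper name; $1 is the class path string.
# The body runs in a subshell so the IFS and globbing changes do not leak out.
check_classpath() (
    set -f      # keep entries such as dir/* from being glob-expanded
    IFS=:       # split the argument on colons
    status=0
    for entry in $1; do
        [ -n "$entry" ] || continue
        if [ -e "$entry" ]; then
            echo "ok $entry"
        else
            echo "missing $entry"
            status=1
        fi
    done
    exit "$status"
)

# Check the current CLASSPATH (treated as empty if the variable is unset).
check_classpath "${CLASSPATH:-}" || echo "Some CLASSPATH entries do not exist."
```

An entry reported as missing usually means a typo in the path written to ~/.bashrc.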
2. Install SemanticVectors:
		2.1. Download SemanticVectors from: http://semanticvectors.googlecode.com/files/semanticvectors-1.8.jar
		2.2. Move this file to the desired location.
		2.3. Open ~/.bashrc again and add the jar file to CLASSPATH. In my case:
				export CLASSPATH="/home/eduardo/programas/lucene/lucene-3.0.2/lucene-demos-3.0.2.jar:/home/eduardo/programas/lucene/lucene-3.0.2/lucene-core-3.0.2.jar:/home/eduardo/programas/SemanticVectors/semanticvectors-1.8.jar"
		2.4. Reboot your computer.
		2.5. By now everything should be working. Go to the directory where you ran Lucene. Run the following command:
				java pitt.search.semanticvectors.BuildIndex /home/eduardo/programas/SemanticVectors/bible_chapters/index/
		2.6. Now you are ready to use SemanticVectors. However, at this point I realized that there is a (probably) very serious bug. Since Lucene is constantly updated and SemanticVectors (SV) depends on Lucene, there are several compatibility issues between the two. In particular, a class from Lucene has been deprecated in the latest version. I checked in the official community and there are no answers to this issue, although others have reported the same problem. Maybe I will get some feedback regarding this problem. Otherwise, an older version of the software would be necessary.
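
When reporting or investigating such a compatibility problem, it helps to state exactly which versions are in use. The sketch below lists the jars on the CLASSPATH together with the version taken from each file name. The helper jar_version is hypothetical and assumes names of the form name-version.jar, as in the files used above; it does not read any metadata from inside the jar.

```shell
# Extract the version number from a jar file name such as lucene-core-3.0.2.jar.
# jar_version is a hypothetical helper; it assumes the name ends in -<version>.jar
# and prints the bare name unchanged when no version suffix is present.
jar_version() {
    basename "$1" .jar | sed 's/^.*-\([0-9][0-9.]*\)$/\1/'
}

# Print file name and version for every jar currently on the CLASSPATH.
# A subshell keeps the IFS and globbing changes local.
(
    set -f
    IFS=:
    for entry in ${CLASSPATH:-}; do
        case "$entry" in
            *.jar) echo "$(basename "$entry") -> $(jar_version "$entry")" ;;
        esac
    done
)
```

With the CLASSPATH from step 2.3 this should list the two Lucene 3.0.2 jars and the SemanticVectors 1.8 jar, which is the combination that showed the problem described above.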