Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision Next revision Both sides next revision | ||
software:rewinfomap [2010/10/31 12:28] eapontep created |
software:rewinfomap [2010/12/05 21:56] eapontep [Testing] |
||
---|---|---|---|
Line 1: | Line 1: | ||
This page is under construction! | This page is under construction! | ||
+ | |||
+ | |||
+ | ==== General ==== | ||
* **Infomap NLP Software: Not in development any more. The authors recommend to use SemanticVectors instead!!!** | * **Infomap NLP Software: Not in development any more. The authors recommend to use SemanticVectors instead!!!** | ||
Line 11: | Line 14: | ||
--- // | --- // | ||
+ | |||
+ | ==== Installation ==== | ||
+ | |||
+ | * Before installing Infomap you would have to install gdbm libraries in your computer. This could be quite challenging. In the following I document the installation process I followed. | ||
+ | - As a first step, you should download the last version of gdbm. | ||
+ | - Untar the .gz file and go into the created directory. | ||
+ | - Try: <file bash> | ||
+ | - The last overwrote all the libtool-related files in the directory. Now you can run <file bash> | ||
+ | - You might also have problems with the ANSI c headers. To solve this problem< | ||
+ | |||
+ | |||
+ | ==== Testing ===== | ||
+ | |||
+ | The first step in order to build a model is to choose a directory where the models will be created. This is done by setting an environment variable <file bash> | ||
+ | export INFOMAP_WORKING_DIR</ | ||
+ | |||
+ | Afterwards run build the model. Informap accepts two formats: a single file where documents are divided by xml markers or as set of files, where every file contains exactly one document. I decided to use this second option. As input, there should be a file specifying the name of file containing a document.< | ||
+ | |||
+ | Remember to add infomap to your PATH variable. | ||
+ | |||
+ | In corpora directory, you will find a simple py script for building a corpora from a file where every line is a document. Afterwards I used the following command:< | ||
+ | directory.txt is a file contaning the name of every file contaning a document. | ||
+ | {{: | ||
+ | {{: |