According to the Corpus Work Bench, to encode a corpus i need to use the cwb-encode perl script
"encode the corpus, i.e. convert the verticalized text to CWB binary format with the cwb-encode tool. Note that the command below has to be entered on a single line." http://cogsci.uni-osnabrueck.de/~korpora/ws/CWBdoc/CWB_Encoding_Tutorial/node3.html
$ cwb-encode -d /corpora/data/example -f example.vrt -R /usr/local/share/cwb/registry/example -P pos -S s
when i tried it, it says the file is missing but i'm sure the file is in $HOME/corpora/data/example, the error was
$ cwb-encode -d /corpora/data/example -f example.vrt -R /usr/local/share/cwb/registry/example -P pos -S s
example.vrt: No such file or directory
Can't open input file example.vrt!
can anyone figure out why?
How about giving full path to example.vrt
:
cwb-encode -d /corpora/data/example -f /corpora/data/example/example.vrt -R /usr/local/share/cwb/registry/example -P pos -S s
来源:https://stackoverflow.com/questions/5589684/cwb-encoding-corpus