This example goes over how to set up CoreNLP from the latest official release. This example will take you through downloading the package, and running a simple command-line invocation of CoreNLP.
Prerequisites:
java -version
should complete successfully with a line like: java version "1.8.0_92".Steps:
Download the CoreNLP zip file at: http://stanfordnlp.github.io/CoreNLP/index.html#download:
wget http://nlp.stanford.edu/software/stanford-corenlp-full-2015-12-09.zip
Unzip the release:
unzip stanford-corenlp-full-2015-12-09.zip
Enter the newly unzipped directory:
cd stanford-corenlp-full-2015-12-09
Set up your classpath. If you're using an IDE, you should set the classpath in your IDE.
for file in `find . -name "*.jar"`; do export CLASSPATH="$CLASSPATH:`realpath $file`"; done
If you'll be using CoreNLP frequently, this is a useful line to have in your ~/.bashrc
(or equivalent) file, replacing the directory /path/to/corenlp/
with the appropriate path to where you unzipped CoreNLP:
for file in `find /path/to/corenlp/ -name "*.jar"`; do export CLASSPATH="$CLASSPATH:`realpath $file`"; done
Try it out! For example, the following will make a simple text file to annotate, and run CoreNLP over this file. The output will be saved to input.txt.out
as a JSON file. Note that CoreNLP requires quite a bit of memory. You should give it at least 2GB (-mx2g
) in most cases.
echo "the quick brown fox jumped over the lazy dog" > input.txt
java -mx2g edu.stanford.nlp.pipeline.StanfordCoreNLP -outputFormat json -file input.txt