Import PorterStemmer
and initialize
from nltk.stem import PorterStemmer
from nltk.tokenize import word_tokenize
ps = PorterStemmer()
Stem a list of words
example_words = ["python","pythoner","pythoning","pythoned","pythonly"]
for w in example_words:
print(ps.stem(w))
Result:
python
python
python
python
pythonli
Stem a sentence after tokenizing it.
new_text = "It is important to by very pythonly while you are pythoning with python. All pythoners have pythoned poorly at least once."
word_tokens = word_tokenize(new_text)
for w in word_tokens:
print(ps.stem(w)) # Passing word tokens into stem method of Porter Stemmer
Result:
It
is
import
to
by
veri
pythonli
while
you
are
python
with
python
.
all
python
have
python
poorli
at
least
onc
.