Skip to content
Advertisement

Tag: nlp

Lemmatization with apache lucene

I’m developing a text analysis project using apache lucene. I need to lemmatize some text (transform the words to their canonical forms). I’ve already written the code that makes stemming. Using it, I am able to convert the following sentence The stem is the part of the word that never changes even when morphologically inflected; a lemma is the base

Advertisement