Lemmatization:
The purpose of lemmatization is to enable a query with one word form to match documents that contain a
different form of the word.
In English, lemmatization can occur for:
paradigms, depending on the grammatical features for the word forms.
Lemmatization allows a user to search for a term like car and get both documents that
contain the word car and documents that contain the word cars.
Lemmatization, stemming and wildcard search:
Lemmatization differs from stemming or wildcard search by being more precise. Different word forms are
mapped to each other by using a language specific dictionary, not by applying simple suffix chopping rules
(stemming) or partial string matches (wildcard search).
The purpose of lemmatization is to enable a query with one word form to match documents that contain a
different form of the word.
In English, lemmatization can occur for:
- singular or plural forms for nouns.
- positive, comparative, or superlative forms for adjectives.
- tense and person for verbs.
paradigms, depending on the grammatical features for the word forms.
Lemmatization allows a user to search for a term like car and get both documents that
contain the word car and documents that contain the word cars.
Lemmatization, stemming and wildcard search:
Lemmatization differs from stemming or wildcard search by being more precise. Different word forms are
mapped to each other by using a language specific dictionary, not by applying simple suffix chopping rules
(stemming) or partial string matches (wildcard search).
No comments:
Post a Comment