amara/lemmatizer

A non-part-of-speech lemmatizer tool

v1.0.0 2022-12-14 10:14 UTC

This package is not auto-updated.

Last update: 2024-05-02 14:50:08 UTC


README

Build

A PHP library for getting a lemma from a given word, and getting a list of words that map to a lemma. This does NOT support part of speech specification.

Source data

This lemma dictionary is largely based on WordNet, a project maintained at Princeton University.

Demo

See the lemmatizer in action at https://lemmatizer.markfullmer.com/

Usage in an application

The included index.php file contains an interactive demo.

Make your code aware of the Lemmatizer class via your favorite method (e.g., use writecrow\Lemmatizer\Lemmatizer;)

Then pass a word into one of the two methods

print Lemmatizer::getLemma('leaves');
// Will print 'leaf'

print Lemmatizer::getWordsFromLemma('leaf');
// Will print 'leaves,leafing,leafed,leafs'

Testing

Unit Tests can be run (after composer install) by executing vendor/bin/phpunit