writecrow/lemmatizer

A non-part-of-speech lemmatizer tool

dev-master 2019-11-08 03:50 UTC

This package is auto-updated.

Last update: 2024-05-08 14:06:15 UTC


README

Circle CI

A PHP library for getting a lemma from a given word, and getting a list of words that map to a lemma. This does NOT support part of speech specification.

Source data

This lemma dictionary is largely based on WordNet, a project maintained at Princeton University.

Demo

See the lemmatizer in action at https://lemmatizer.markfullmer.com/

Usage in an application

The included index.php file contains an interactive demo.

Make your code aware of the Lemmatizer class via your favorite method (e.g., use writecrow\Lemmatizer\Lemmatizer;)

Then pass a word into one of the two methods

print Lemmatizer::getLemma('leaves');
// Will print 'leaf'

print Lemmatizer::getWordsFromLemma('leaf');
// Will print 'leaves,leafing,leafed,leafs'

Testing

Unit Tests can be run (after composer install) by executing vendor/bin/phpunit