iannsp/ngram

A n-gram toolset.

1.0.0 2017-04-07 04:28 UTC

This package is not auto-updated.

Last update: 2024-11-01 04:40:25 UTC


README

Build Status

This is a naive ngram implementation with some tools together.

You can select build your ngram over letters or words, using:

  • Frequency
    • Letter
    • Word

And you can process

  • Text input.
  • Black list
  • Parse that HTML. Please, be kind with that naive regex.
  • Text sanitization.