foxorm / gibberish-detector
Determines if text contains gibberish.
Installs: 38 178
Dependents: 0
Suggesters: 1
Security: 0
Stars: 3
Watchers: 1
Forks: 1
Open Issues: 0
This package is not auto-updated.
Last update: 2020-01-19 15:06:08 UTC
README
Determines if text contains gibberish.
Example 1: You want phrases like "Hello, world" and never "dsfknifdkoindwsif32839hdd"
Example 2: You want usernames like "jamesjohn" and never "WWWWWWWWWWWWWWWWWWW"
How does it work?
Markov Chains. In this case, they describe which letters should or should not to be next to each other. By showing examples of good text and bad text, it teaches the software how to guess what letter combinations look like gibberish and which letter combinations look reasonably likely to be good text.
How do I use it?
- Train the Markov Chains
$trainingSuccessful = Gibberish::train(
$big_text_file,
$good_text_file,
$bad_text_file,
$matrix_file
);
- Test new text against the Markov Chains
$oddsItsGoodText = Gibberish::test($inputText, $matrix_file, true);
Who made it?
Originally written in Python by Rob Renaud. rrenaud@gmail.com https://github.com/rrenaud/Gibberish-Detector
Translated into PHP by Oliver Lillie. buggedcom@gmail.com https://github.com/buggedcom/Gibberish-Detector-PHP
Spruced up by Richard512
Implementation in composer by Jo Surikat jo@surikat.pro https://github.com/redcatphp/gibberish-detector