zz98500/php-dfa-sensitive

Sensitive-word filtering based on the deterministic finite automaton (DFA) algorithm.

dev-master 2017-12-21 06:34 UTC

This package is not auto-updated.

Last update: 2024-04-22 05:01:55 UTC


README

A PHP implementation of sensitive-word filtering based on the deterministic finite automaton (DFA) algorithm

Installation & usage

Download and install Composer:

curl -sS https://getcomposer.org/installer | php

To check that Composer is working, run the PHAR through php:

php composer.phar

Install the package

composer require zz98500/php-dfa-sensitive
  • Note: if installing with Composer fails with
    Could not find package lustre/php-dfa-sensitive at any version for your minimum-stability (stable). Check the package spelling or your minimum-stability
    add "minimum-stability": "dev" to your composer.json.

If you need to include it manually

require './vendor/autoload.php';

use DfaFilter\SensitiveHelper;

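Define the interference characters

// Array of interference characters (separators that may be inserted between the characters of a sensitive word)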
$interferenceData = array(
    '#',
    ',',
    '~',
    '-',
    '/',
    // ...
);

Load the sensitive-word list

// Indexed array of sensitive words
$wordData = array(
    '察象蚂',
    '拆迁灭',
    '车牌隐',
    '成人电',
    '成人卡通',
    // ...
);

Check whether the content contains sensitive words

$islegal = SensitiveHelper::init()->setInterference($interferenceData)->setTree($wordData)->islegal($content);
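
For readers unfamiliar with the approach, the following is a minimal, self-contained sketch of how a DFA-style check can work: the word list is compiled into a character trie, and the text is scanned while skipping interference characters inside a candidate match. The function names buildTree and containsSensitiveWord, and the node layout, are assumptions made for illustration; they are not the package's actual internals.

```php
<?php
// Illustrative sketch only. buildTree() and containsSensitiveWord() are
// hypothetical helpers, not part of DfaFilter\SensitiveHelper.

/** Split a UTF-8 string into an array of characters. */
function splitChars(string $text): array
{
    return preg_split('//u', $text, -1, PREG_SPLIT_NO_EMPTY) ?: [];
}

/** Build a trie: each node has 'next' (child nodes by character) and 'end' (word ends here). */
function buildTree(array $words): array
{
    $tree = ['next' => [], 'end' => false];
    foreach ($words as $word) {
        $chars = splitChars($word);
        if ($chars === []) {
            continue; // ignore empty entries
        }
        $node = &$tree;
        foreach ($chars as $char) {
            if (!isset($node['next'][$char])) {
                $node['next'][$char] = ['next' => [], 'end' => false];
            }
            $node = &$node['next'][$char];
        }
        $node['end'] = true;
        unset($node); // drop the reference before the next word
    }
    return $tree;
}

/** Return true if any word in the trie occurs in $content, ignoring interference characters inside a match. */
function containsSensitiveWord(array $tree, array $interference, string $content): bool
{
    $chars = splitChars($content);
    $skip = array_flip($interference);
    $count = count($chars);
    for ($start = 0; $start < $count; $start++) {
        $node = $tree;
        for ($i = $start; $i < $count; $i++) {
            $char = $chars[$i];
            if ($i > $start && isset($skip[$char])) {
                continue; // skip separators once a match has begun
            }
            if (!isset($node['next'][$char])) {
                break; // no transition: this start position fails
            }
            $node = $node['next'][$char];
            if ($node['end']) {
                return true;
            }
        }
    }
    return false;
}

$tree = buildTree(['bad', 'evil']);
var_dump(containsSensitiveWord($tree, ['-', '#'], 'this is b-a#d text')); // bool(true)
var_dump(containsSensitiveWord($tree, ['-', '#'], 'a good day'));         // bool(false)
```

In this sketch the trie is built once and can be reused for many inputs; the scan restarts from each character position, so the worst case is proportional to the text length times the longest word.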

Filter sensitive words

// Example: replace each sensitive word with ***
$filterContent = SensitiveHelper::init()->setInterference($interferenceData)->setTree($wordData)->replace($content, '***');

Get the sensitive words found in a text

// Get all sensitive words in the content
$sensitiveWordGroup = SensitiveHelper::init()->setInterference($interferenceData)->setTree($wordData)->getBadWord($content);
// Get only one sensitive word
$sensitiveWordGroup = SensitiveHelper::init()->setInterference($interferenceData)->setTree($wordData)->getBadWord($content, 1);
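
To make the replace and lookup operations above more concrete, here is a small self-contained sketch of one way they could be implemented on top of a character trie. It records the longest match at each position, optionally stops after a given number of matches, and then rebuilds the text with a replacement string. All function names (buildTree, findMatches, replaceMatches) and the limit semantics are assumptions for illustration; the package's real behavior may differ.

```php
<?php
// Illustrative sketch only. These helpers are hypothetical and are not the
// package's API; they show one way to locate and replace matched words.

function splitChars(string $text): array
{
    return preg_split('//u', $text, -1, PREG_SPLIT_NO_EMPTY) ?: [];
}

function buildTree(array $words): array
{
    $tree = ['next' => [], 'end' => false];
    foreach ($words as $word) {
        $chars = splitChars($word);
        if ($chars === []) {
            continue;
        }
        $node = &$tree;
        foreach ($chars as $char) {
            if (!isset($node['next'][$char])) {
                $node['next'][$char] = ['next' => [], 'end' => false];
            }
            $node = &$node['next'][$char];
        }
        $node['end'] = true;
        unset($node);
    }
    return $tree;
}

/** Return [start, length] pairs (in characters); $limit = 0 means no limit. */
function findMatches(array $tree, array $interference, array $chars, int $limit = 0): array
{
    $skip = array_flip($interference);
    $matches = [];
    $count = count($chars);
    for ($start = 0; $start < $count; $start++) {
        $node = $tree;
        $matchedLength = 0;
        for ($i = $start; $i < $count; $i++) {
            $char = $chars[$i];
            if ($i > $start && isset($skip[$char])) {
                continue; // separators inside a match are skipped
            }
            if (!isset($node['next'][$char])) {
                break;
            }
            $node = $node['next'][$char];
            if ($node['end']) {
                $matchedLength = $i - $start + 1; // keep the longest match
            }
        }
        if ($matchedLength > 0) {
            $matches[] = [$start, $matchedLength];
            if ($limit > 0 && count($matches) >= $limit) {
                break;
            }
            $start += $matchedLength - 1; // continue after the match
        }
    }
    return $matches;
}

/** Rebuild the text, substituting $replacement for every matched span. */
function replaceMatches(array $chars, array $matches, string $replacement): string
{
    $out = '';
    $pos = 0;
    foreach ($matches as [$start, $length]) {
        $out .= implode('', array_slice($chars, $pos, $start - $pos)) . $replacement;
        $pos = $start + $length;
    }
    return $out . implode('', array_slice($chars, $pos));
}

$chars = splitChars('a b-ad and evil plan');
$matches = findMatches(buildTree(['bad', 'evil']), ['-'], $chars);
echo replaceMatches($chars, $matches, '***'), PHP_EOL; // a *** and *** plan
```

Working on an array of characters rather than byte offsets keeps multi-byte UTF-8 text intact, which matters for Chinese word lists like the one above.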

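Known platforms currently using this package: 广电云, where it supports the online chat feature on live program pages.

Suggestions for improvement are welcome. Thanks! O(∩_∩)O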