Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/FireLustre/php-dfa-sensitive
:see_no_evil: 实现过滤敏感词汇:underage:,基于确定有穷自动机(DFA)算法,支持composer安装扩展
https://github.com/FireLustre/php-dfa-sensitive
composer-package dfa php sensitive-word
Last synced: about 2 months ago
JSON representation
:see_no_evil: 实现过滤敏感词汇:underage:,基于确定有穷自动机(DFA)算法,支持composer安装扩展
- Host: GitHub
- URL: https://github.com/FireLustre/php-dfa-sensitive
- Owner: FireLustre
- Created: 2017-06-28T09:32:20.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2022-08-23T15:04:52.000Z (over 2 years ago)
- Last Synced: 2024-09-08T12:03:46.986Z (4 months ago)
- Topics: composer-package, dfa, php, sensitive-word
- Language: PHP
- Homepage: https://packagist.org/packages/lustre/php-dfa-sensitive
- Size: 57.6 KB
- Stars: 750
- Watchers: 16
- Forks: 141
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# php-DFA-filterWord
php实现基于确定有穷自动机算法的铭感词过滤 https://packagist.org/packages/lustre/php-dfa-sensitive## 安装&使用流程
### Download and install Composer:
curl -sS https://getcomposer.org/installer | php
> 要检查 Composer 是否正常工作,只需要通过 php 来执行 PHAR
php composer.phar### 安装扩展
composer require lustre/php-dfa-sensitive
* 注意:如果你在使用composer安装时,出现
Could not find package lustre/php-dfa-sensitive at any version for your minimum-stability (stable). Check the package spelling or your minimum-stability 请在你的composer.json中加入"minimum-stability": "dev"
#### 如果你需要手动引入require './vendor/autoload.php';
use DfaFilter\SensitiveHelper;### 构建敏感词库树
场景一: 可以拿到不同(用户)词库数组// 获取感词库索引数组
$wordData = array(
'察象蚂',
'拆迁灭',
'车牌隐',
'成人电',
'成人卡通',
......
);
// get one helper
$handle = SensitiveHelper::init()->setTree($wordData);场景二: 全站使用一套敏感词库
// 获取感词库文件路径
$wordFilePath = 'tests/data/words.txt';
// get one helper
$handle = SensitiveHelper::init()->setTreeByFile($wordFilePath);
### 检测是否含有敏感词$islegal = $handle->islegal($content);
### 敏感词过滤
// 敏感词替换为*为例(会替换为相同字符长度的*)
$filterContent = $handle->replace($content, '*', true);
// 或敏感词替换为***为例
$filterContent = $handle->replace($content, '***');
### 标记敏感词
$markedContent = $handle->mark($content, '', '');
### 获取文字中的敏感词// 获取内容中所有的敏感词
$sensitiveWordGroup = $handle->getBadWord($content);
// 仅且获取一个敏感词
$sensitiveWordGroup = $handle->getBadWord($content, 1);### 如何使用单元测试进行测试
#### 安装PHPUnit
```bash
$ wget https://phar.phpunit.de/phpunit.phar$ chmod +x phpunit.phar
$ mv phpunit.phar /usr/local/bin/phpunit
```
#### 使用composer自动加载php命名空间```bash
$ composer update
```
### 运行单元测试
```bash
$ phpunit tests/BaseTest.php
```*如果大家有更好的建议,请大家多多指正,O(∩_∩)O谢谢*
*你们的star是我的动力*