mehrdad-dadkhah / php-persian-natural-language-processor
Simple php and python wrapper on hazm persian text processor
Installs: 489
Dependents: 0
Suggesters: 0
Security: 0
Stars: 3
Watchers: 2
Forks: 2
Open Issues: 0
Language:Python
Requires
- php: >=7.2
- symfony/process: >=3.0
README
simple php and python wrapper on hazm persian text processor.
System requirements
install hazm:
if have not python:
sudo apt install python
then:
sudo apt install python-pip
and then:
pip install hazm
Installation
composer require mehrdad-dadkhah/php-persian-natural-language-processor
Usage
PHP:
use MehrdadDadkhah\Language\PersianLanguageProcessor; $parser = new PersianLanguageProcessor(); $parser->allNLP('سلام. این یک متن تست است. موفق باشید');
Python:
python /path/to/pr/processor.py allNLP json.dumps('سلام. این یک متن تست است. موفق باشید')
and the result:
array:7 [▼
"chunksGroup" => array:2 [▼
"main" => "[سلام NP] . [این یک متن تست NP] [است VP] . [موفق ADJP] [باشید VP]"
"normalized" => "[سلام NP] . [این یک متن تست NP] [است VP] . [موفق ADJP] [باشید VP]"
]
"postTags" => array:2 [▼
"main" => array:10 [▶]
"normalized" => array:10 [▼
0 => array:2 [▶]
1 => array:2 [▶]
2 => array:2 [▶]
3 => array:2 [▶]
4 => array:2 [▼
0 => "متن"
1 => "N"
]
5 => array:2 [▶]
6 => array:2 [▶]
7 => array:2 [▶]
8 => array:2 [▶]
9 => array:2 [▶]
]
]
"stem" => array:2 [▼
"main" => array:4 [▶]
"normalized" => array:4 [▼
"ADV" => []
"N" => array:2 [▶]
"Ne" => []
"V" => array:3 [▶]
]
]
"wordTokenize" => array:2 [▼
"main" => array:10 [▶]
"normalized" => array:10 [▼
0 => "سلام"
1 => "."
2 => "این"
3 => "یک"
4 => "متن"
5 => "تست"
6 => "است"
7 => "."
8 => "موفق"
9 => "باشید"
]
]
"lemmatized" => array:2 [▼
"main" => array:4 [▼
"ADV" => []
"N" => array:2 [▼
0 => "سلام"
1 => "متن"
]
"Ne" => []
"V" => array:3 [▼
0 => "تست"
1 => "است"
2 => "بود#باش"
]
]
"normalized" => array:4 [▼
"ADV" => []
"N" => array:2 [▼
0 => "سلام"
1 => "متن"
]
"Ne" => []
"V" => array:3 [▼
0 => "تست"
1 => "است"
2 => "بود#باش"
]
]
]
"normalized" => "سلام. این یک متن تست است. موفق باشید"
"sentTokenize" => array:2 [▼
"main" => array:3 [▶]
"normalized" => array:3 [▼
0 => "سلام."
1 => "این یک متن تست است."
2 => "موفق باشید"
]
]
]
functions
- normilizeText(string $text)
- sentTokenizeText(string $text)
- wordTokenizeText(string $text)
- postTagText(string $text)
- chunksText(string $text)
- getChunksGroup(string $text)
- stemText(string $text)
- lemmatizeText(string $text)
- allNLP(string $text)
allNLP function call all other functions and return all results.
Acknowledgments
Uses:
License
php-persian-natural-language-processor is licensed under the GPLv3 License.