muhfirdaus19/bahasata

Text processing for Bahasa Indonesia in PHP

dev-main 2021-07-25 10:18 UTC

Last update: 2025-06-25 18:46:25 UTC


Text processing for Bahasa Indonesia in PHP

About

Bahasata is an example text-processing library for Bahasa Indonesia, written in PHP.
For now it provides only a stemmer and a tokenizer.

Installation

Bahasata can be installed via Composer.

composer require muhfirdaus19/bahasata:dev-main

Usage

Text tokenization

Splits text into words or sentences.

use Bahasata\Bahasata;

// include autoloader
require './vendor/autoload.php';

$bahasata = new Bahasata();
$write = $bahasata->write('tetap bersama, jaga kesehatan!');

$result = $write->get();
// tetap bersama, jaga kesehatan!

$result = $write->wordsTokenizer()->get();
// ['tetap', 'bersama', 'jaga', 'kesehatan']

$result = $write->sentencesTokenizer()->get();
// ['tetap bersama', 'jaga kesehatan']

print_r($result);
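
To make the two tokenization modes concrete, here is a minimal sketch of how word and sentence splitting could be done with PHP's built-in regex functions. It is not Bahasata's implementation: the function names sketchWordsTokenizer and sketchSentencesTokenizer are hypothetical, and the choice to treat commas as sentence boundaries is an assumption made only to match the sample output above.

```php
<?php
// Hypothetical helpers for illustration; not part of Bahasata's API.

/** Return the runs of letters and digits in $text, dropping punctuation. */
function sketchWordsTokenizer(string $text): array
{
    // \p{L} matches any letter, \p{N} any digit; the u flag enables UTF-8 mode.
    preg_match_all('/[\p{L}\p{N}]+/u', $text, $matches);
    return $matches[0];
}

/** Split $text on punctuation and return the trimmed, non-empty pieces. */
function sketchSentencesTokenizer(string $text): array
{
    // Assumption: , . ! ? ; all end a segment, as in the README sample output.
    $parts = preg_split('/[,.!?;]+/u', $text, -1, PREG_SPLIT_NO_EMPTY);
    $parts = array_map('trim', $parts);
    // Drop pieces that were only whitespace, then reindex from 0.
    return array_values(array_filter($parts, 'strlen'));
}

print_r(sketchWordsTokenizer('tetap bersama, jaga kesehatan!'));
print_r(sketchSentencesTokenizer('tetap bersama, jaga kesehatan!'));
```

This sketch splits hyphenated reduplications such as kupu-kupu into two tokens; a real tokenizer for Indonesian would probably want to keep them together.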

Stemmer

Finds the root word (stem) of a single word or of each word in a sentence. Example: memakan -> makan

use Bahasata\Bahasata;

// include autoloader
require './vendor/autoload.php';

$bahasata = new Bahasata();
$result = $bahasata->stem('merekomendasikan');
// rekomendasi

$write = $bahasata->write('saya rekomendasikan untuk memakan sayur');
$result = $write->wordsTokenizer()->stem()->get();
// ['saya', 'rekomendasi', 'untuk', 'makan', 'sayur']

print_r($result);
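
The README does not say which stemming algorithm Bahasata uses. As a rough illustration of the idea only, the sketch below strips a few common affixes and accepts a candidate when it appears in a root-word list. The function sketchStem, the affix lists, and the tiny $rootWords dictionary are all hypothetical and far smaller than what a real Indonesian stemmer needs.

```php
<?php
// Hypothetical sketch for illustration; not Bahasata's actual implementation.

/** Return the root of $word if some affix-stripped form is in $rootWords. */
function sketchStem(string $word, array $rootWords): string
{
    $word = strtolower($word);
    if (in_array($word, $rootWords, true)) {
        return $word; // already a root word
    }

    // The empty string means "strip nothing" for that position.
    $prefixes = ['', 'me', 'di', 'ber', 'ter'];
    $suffixes = ['', 'kan', 'an', 'i'];

    foreach ($suffixes as $suffix) {
        if ($suffix !== '' && substr($word, -strlen($suffix)) !== $suffix) {
            continue; // word does not end with this suffix
        }
        $base = $suffix === '' ? $word : substr($word, 0, -strlen($suffix));

        foreach ($prefixes as $prefix) {
            if ($prefix !== '' && strncmp($base, $prefix, strlen($prefix)) !== 0) {
                continue; // base does not start with this prefix
            }
            $candidate = substr($base, strlen($prefix));
            if (in_array($candidate, $rootWords, true)) {
                return $candidate;
            }
        }
    }

    return $word; // no known root found; leave the word unchanged
}

// Toy dictionary, assumed for this example only.
$rootWords = ['saya', 'rekomendasi', 'untuk', 'makan', 'sayur'];
$tokens = ['saya', 'rekomendasikan', 'untuk', 'memakan', 'sayur'];

$stems = array_map(function ($token) use ($rootWords) {
    return sketchStem($token, $rootWords);
}, $tokens);

print_r($stems);
```

The sketch ignores the sound changes that the meN- prefix causes (for example menulis from tulis), which is one reason practical Indonesian stemmers rely on a large dictionary and more detailed rules.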

Copyright and License

The muhfirdaus19/bahasata library is copyright © Muhammad Firdaus and licensed for use under the terms of the MIT License (MIT). Please see LICENSE for more information.