mehrab-wj/tiktoken-php

a clone of python tiktoken but for PHP! fast BPE tokeniser for use with OpenAI's models.

v1.0.0 2023-04-19 18:36 UTC

This package is auto-updated.

Last update: 2024-05-11 19:24:32 UTC


README

PHP Text Tokenizer for GPT models

About

A PHP toolkit to tokenize text like GPT family of models process it.

Forked from semji/gpt3-tokenizer-php to bug fixes and improvement.

Requirements

Usage

First install the package using composer:

composer require mehrab-wj/tiktoken-php
use TikToken\Encoder;
$prompt = "Ai is cool";
$encoder = new Encoder();

$tokens = $encoder->encode($prompt); // [32, 72, 318, 3608]

// Get tokens count:
echo count($tokens); // 4