curiolabs / name-parser
PHP library for parsing a string containing a full name into its parts
Requires
- php: >=7.1
Requires (Dev)
- php-coveralls/php-coveralls: ^2.1
- php-mock/php-mock-phpunit: ^2.1
- phpunit/phpunit: ^7.0
- dev-master
- v1.3.2
- v1.3.1
- v1.3.0
- v1.2.6
- v1.2.5
- v1.2.4
- v1.2.3
- v1.2.2
- v1.2.1
- v1.2.0
- v1.1.0
- v1.0.3
- v1.0.2
- v1.0.1
- v1.0.0
- 0.1.1
- 0.1
- dev-feature/lastname-remapping
- dev-feature/update-readme
- dev-feature/allow-salutation-at-end
- dev-feature/explicit-lastname-prefix-getter
- dev-feature/allow-nicknames-at-end
- dev-feature/languages
This package is auto-updated.
Last update: 2024-10-19 20:54:50 UTC
README
Purpose
This is a universal, language-independent name parser.
Its purpose is to split a single string containing a full name, possibly including salutation, initials, suffixes etc., into meaningful parts like firstname, lastname, initials, and so on.
It is mostly tailored towards english names but works pretty well with non-english names as long as they use latin spelling.
E.g. Mr Anthony R Von Fange III is parsed to
- salutation: Mr.
- firstname: Anthony
- initials: R
- lastname: von Fange
- suffix: III
Features
Supported patterns
This parser is able to handle name patterns with and without comma:
... [firstname] ... [lastname] ...
... [lastname] ..., ... [firstname] ...
... [lastname] ..., ... [firstname] ..., [suffix]
Supported parts
- salutations (e.g. Mr, Mrs, Dr, etc.)
- first name
- middle names
- initials (single letters, possibly followed by a dot)
- nicknames (parts within parenthesis, brackets etc.)
- last names (also supports prefixes like von, de etc.)
- suffixes (Jr, Senior, 3rd, PhD, etc.)
Other features
- multi-language support for salutations, suffixes and lastname prefixes
- customizable nickname delimiters
- customizable normalisation of all output strings (original values remain accessible)
- customizable whitespace
Examples
More than 80 different successfully parsed name patterns can be found in the parser unit test.
Setup
composer require CurioLabs/name-parser
Usage
Basic usage
<?php $parser = new CurioLabs\NameParser\Parser(); $name = $parser->parse($input); echo $name->getSalutation(); echo $name->getFirstname(); echo $name->getLastname(); echo $name->getMiddlename(); echo $name->getNickname(); echo $name->getInitials(); echo $name->getSuffix(); print_r($name->getAll()); // all parts as an associative array echo $name; // re-prints the full normalised name
An empty string is returned for missing parts.
Special part retrieval features
Explicit last name parts
You can retrieve last name prefixes and pure last names separately with
echo $name->getLastnamePrefix(); echo $name->getLastname(true); // true enables strict mode for pure lastnames, only
Nick names with normalized wrapping
By default, getNickname()
returns the pure string of nick names. However, you can
pass true
to have the same normalised parenthesis wrapping applied as in echo $name
:
echo $name->getNickname(); // The Giant echo $name->getNickname(true); // (The Giant)
Setting Languages
$parser = new CurioLabs\NameParser\Parser([ new CurioLabs\NameParser\Language\English(), //default new CurioLabs\NameParser\Language\German(), ])
Setting nickname delimiters
$parser = new CurioLabs\NameParser\Parser(); $parser->setNicknameDelimiters(['(' => ')']);
Setting whitespace characters
$parser = new CurioLabs\NameParser\Parser(); $parser->setWhitespace("\t _.");
Limiting the position of salutations
$parser = new CurioLabs\NameParser\Parser(); $parser->setMaxSalutationIndex(2);
This will require salutations to appear within the first two words of the given input string. This defaults to half the amount of words in the input string, meaning that effectively the salutation may occur within the first half of the name parts.
License
THE ICONIC Name Parser library for PHP is released under the MIT License.