leinonen/php-dataloader

Port of Facebook's dataloader to PHP

2.1.0 2021-09-23 20:21 UTC

README

Port of Facebook's DataLoader to PHP. Async superpowers from ReactPHP.

DataLoader is a generic utility to be used as part of your application's data fetching layer to provide a simplified and consistent API over various remote data sources such as databases or web services via batching and caching.

Installation

Require this package, with Composer, in the root directory of your project.

composer require leinonen/php-dataloader

Usage

To create a loader you must provide a batching function, the global event loop from ReactPHP and an internal memoization cache. To better understand what the ReactPHP event loop is and how it is used, refer to its documentation.

use leinonen\DataLoader\CacheMap;
use leinonen\DataLoader\DataLoader;
use React\EventLoop\Factory;

$eventLoop = Factory::create();

$bandLoader = new DataLoader(
    function ($keys) {
        // Batch load bands with given keys.
    },
    $eventLoop,
    new CacheMap()
);

Then load individual values from the loader. DataLoader will coalesce all individual loads which occur within a single tick of the event loop and then call your batch function with all requested keys.

$bandLoader->load(1)->then(function ($band) {
    echo "Band #{$band->getId()} loaded";
});

$bandLoader->load(2)->then(function ($band) {
    echo "Band #{$band->getId()} loaded";
});

$eventLoop->run(); // The batch function will be called with keys [1, 2] at this point

Calling the load function returns a React\Promise\Promise. To better understand how to use promises within PHP, refer to the ReactPHP docs.

Batch Function

The batch loading function accepts an array of keys, and must return a Promise which resolves to an Array of values. There are a few other constraints:

  • The Array of values must be the same length as the Array of keys.
  • Each index in the Array of values must correspond to the same index in the Array of keys, i.e. the order of the batch loaded results must be the same as the order of the given keys.

For example, suppose your batch function was provided the Array of keys [2, 9, 6, 1], and the batch loaded results were:

[
    ['id' => 1, 'name' => 'Mojo Waves'],
    ['id' => 2, 'name' => 'Pleasure Hazard'],
    ['id' => 9, 'name' => 'Leka'],
]

The loaded results are in a different order than we requested, which is common with relational databases. The result for key 6 is also missing, which we can interpret as no value existing for that key.

To satisfy the constraints of the batch function we need to modify the results to be the same length as the Array of keys and re-order them to ensure each index aligns with the original keys:

[
    ['id' => 2, 'name' => 'Pleasure Hazard'],
    ['id' => 9, 'name' => 'Leka'],
    null,
    ['id' => 1, 'name' => 'Mojo Waves'],
]
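
The re-ordering described above can be sketched in plain PHP. This is a minimal illustration, not part of the DataLoader API: the helper name alignResultsToKeys() and the assumption that every row is an array with an 'id' field are hypothetical and would need adapting to your own data.

```php
<?php

/**
 * Re-orders batch results so that index i of the output corresponds to $keys[i].
 * Keys without a matching row are filled with null.
 *
 * Hypothetical helper for illustration; not provided by the library.
 */
function alignResultsToKeys(array $keys, array $rows, string $idField = 'id'): array
{
    // Index the loaded rows by their id so each lookup is a single array access.
    $rowsById = [];
    foreach ($rows as $row) {
        $rowsById[$row[$idField]] = $row;
    }

    // Walk the keys in their original order, substituting null for misses.
    return array_map(
        function ($key) use ($rowsById) {
            return $rowsById[$key] ?? null;
        },
        $keys
    );
}

$keys = [2, 9, 6, 1];
$rows = [
    ['id' => 1, 'name' => 'Mojo Waves'],
    ['id' => 2, 'name' => 'Pleasure Hazard'],
    ['id' => 9, 'name' => 'Leka'],
];

$aligned = alignResultsToKeys($keys, $rows);
```

Inside a batch function, the aligned array would then be wrapped in a resolved promise before being returned. Indexing the rows by id first avoids scanning the whole result set once per key.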

Caching

DataLoader provides a memoization cache for all loads which occur in a single request to your application. After load() is called once with a given key, the resulting value is cached to eliminate redundant loads.

In addition to relieving load on your data storage, caching results per-request also creates fewer objects which may relieve memory pressure on your application:

$promise1 = $bandLoader->load(1);
$promise2 = $bandLoader->load(1); // Same key, so the cached promise is returned

var_dump($promise1 === $promise2); // true

DataLoader caching does not replace Redis, Memcache, or any other shared application-level cache. DataLoader is first and foremost a data loading mechanism, and its cache only serves the purpose of not repeatedly loading the same data in the context of a single request to your application. To do this it utilizes the CacheMap given as a constructor argument.

This package provides a simple CacheMap implementation (leinonen\DataLoader\CacheMap) to be used with DataLoader. You can also supply your own CacheMap with a different caching algorithm by implementing leinonen\DataLoader\CacheMapInterface.
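
As a rough idea of what a different caching algorithm might look like, the following standalone sketch keeps at most a fixed number of entries and evicts the oldest one first. The class name BoundedCacheSketch and its method names (get, set, delete, clear) are assumptions made for illustration; it does not implement CacheMapInterface, so check that interface for the methods a real implementation must provide.

```php
<?php

/**
 * Hypothetical bounded cache that evicts the oldest entry once full (FIFO).
 * Method names are assumptions for illustration only, not the library's interface.
 */
final class BoundedCacheSketch
{
    /** @var array Entries keyed by cache key, kept in insertion order. */
    private array $entries = [];

    private int $maxEntries;

    public function __construct(int $maxEntries)
    {
        // Always allow at least one entry so eviction has something to remove.
        $this->maxEntries = max(1, $maxEntries);
    }

    public function get($key)
    {
        return $this->entries[$key] ?? null;
    }

    public function set($key, $value): void
    {
        // Evict the oldest entry when a new key would exceed the limit.
        $isNewKey = !array_key_exists($key, $this->entries);
        if ($isNewKey && count($this->entries) >= $this->maxEntries) {
            unset($this->entries[array_key_first($this->entries)]);
        }

        $this->entries[$key] = $value;
    }

    public function delete($key): void
    {
        unset($this->entries[$key]);
    }

    public function clear(): void
    {
        $this->entries = [];
    }
}

$cache = new BoundedCacheSketch(2);
$cache->set('a', 1);
$cache->set('b', 2);
$cache->set('c', 3); // 'a' is the oldest entry and is evicted here
```

A bounded cache like this trades some repeated loads for a fixed memory ceiling, which may matter in long-running processes where a per-request cache is never discarded.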

Usage with common ORMs

Eloquent (Laravel)

$userByIdLoader = new DataLoader(function ($ids) {
    $users = User::findMany($ids);

    // Make sure that the users are in the same order as the ids given to the loader.
    // Ids without a matching user map to null.
    $orderedUsers = collect($ids)->map(function ($id) use ($users) {
        return $users->first(function ($user) use ($id) {
            return $user->id === $id;
        });
    });

    // The batch function must resolve to a plain array, so unwrap the Collection.
    return \React\Promise\resolve($orderedUsers->all());
}, $eventLoopFromIoCContainer, $cacheMapFromIoCContainer);

ActiveRecord (Yii2)

$usersByIdLoader = new DataLoader(function ($ids) {
    $users = User::find()->where(['id' => $ids])->all();

    $orderedUsers = \array_map(function ($id) use ($users) {
        foreach ($users as $user) {
            if ($user->id === $id) {
                return $user;
            }
        }

        return null;
    }, $ids);

    return \React\Promise\resolve($orderedUsers);
}, $eventLoopFromDiContainer, $cacheMapImplementationFromDiContainer);

API

load($key)

Loads a key, returning a Promise for the value represented by that key.

  • @param mixed $key A key value to load.

loadMany($keys)

Loads multiple keys, promising an array of values.

This is equivalent to the more verbose:

$promises = \React\Promise\all([
  $myLoader->load('a'),
  $myLoader->load('b')
]);

  • @param array $keys An array of key values to load.

clear($key)

Clears the value at $key from the cache, if it exists.

  • @param mixed $key A key value to clear.

clearAll()

Clears the entire cache.

prime($key, $value)

Primes the cache with the provided key and value. If the key already exists, no change is made. (To forcefully prime the cache, clear the key first with $loader->clear($key)->prime($key, $value).)