10up / wp-scrubber
A WordPress plugin that scrubs sensitive data from the database.
Installs: 13 715
Dependents: 0
Suggesters: 0
Security: 0
Stars: 18
Watchers: 17
Forks: 4
Open Issues: 4
Type:wordpress-plugin
Requires
- php: >=7.3
Requires (Dev)
- 10up/phpcs-composer: dev-master
This package is auto-updated.
Last update: 2024-12-10 09:30:59 UTC
README
This WordPress plugin provides a command-line interface for scrubbing sensitive user and comment data from a WordPress installation.
Installation
- Clone or download the plugin files into your WordPress plugins directory.
- Note: If on WordPress VIP, the plugin should be added to the
/client-mu-plugins
directory.
- Note: If on WordPress VIP, the plugin should be added to the
- Activate the plugin through the WordPress admin interface or via WP-CLI.
- Set the
WP_ENVIRONMENT_TYPE
tolocal
orstaging
.
Usage
The plugin provides a WP-CLI command called wp scrub all
that will scrub all user and comment data from the WordPress database. This command can only be run on non-production environments, unless overridden with wp_scrubber_allow_on_production
. You can also run wp scrub users
or wp scrub comments
to only scrub users or comments.
To use the command, open up your terminal and navigate to your WordPress installation. Then run the following commands:
wp scrub all
wp cache flush
When creating an export for local development, best practice is to export a scrubbed database from a lower environment:
- Copy production into a lower environment (staging, develop, preprod, etc...)
- Scrub the data on the lower environment with the
wp scrub all
command. - Export the scrubbed database.
Note: On WordPress VIP, scrubbing commands will occur automatically when copying from production to a lower environment.
Scrubbed Data
Users
- All exisiting passwords are replaced with a random password.
- Email, user_login and display_name are replaced with dummy values.
To only scrub users run the following command:
wp scrub users
Comments
- Comment and Comment Meta tables are completely emptied. To only scrub comments run the following command:
wp scrub comments
On a multisite, to scrub comments across all the sites you can run the following commands:
wp site list --field=url | xargs -n1 -I % wp --url=% scrub comments
wp site list --field=url | xargs -n1 -I % wp --url=% cache flush
wp cache flush --network
CLI Arguments
--allowed-domains
- Comma separated list of email domains. Any WordPress user with this email domain will be ignored by the scrubbing scripts.10up.com
andget10up.com
are ignored by default.- ex:
wp scrub all --allowed-domains=example.com,example.net
- ex:
--allowed-emails
- Comma separated list of email addresses. Any WordPress user with this email will be ignored by the scrubbing scripts.- ex:
wp scrub all --allowed-emails=user1@example.com,user2@example.com
- ex:
--ignore-size-limit
- Ignore the database size limit.- ex: `wp scrub all --ignore-size-limit
Database Size Limit
WP Scrubber includes a database size limit of 2GB. This limit exists as a failsafe to prevent the scrubbing action from taking effect on large sites, unless the developer chosen to ignore the warning.
The core plugin is very quick and scales based on the number of users which need to be scrubbed. However, because there are additional hooks and filter which projects are encouraged to extend, cvustom queries which loop through post or postmeta can slow down the scrub operation considerably.
You can adjust the default size limit with wp_scrubber_db_size_limit
.
Extensibility
WP Scrubber includes several filters and actions which allows developers to hook into the scrubbing process and add their own rules.
How can I add my own scrubbing rules?
Additional scrubbing commands can be added before or after the default commands with wp_scrubber_before_scrub
and wp_scrubber_after_scrub
.
How do I allow this on production environments?
WP Scrubber uses wp_get_environment_type()
to ensure it is not accidentally run on a production environment. By default, unless WP_ENVIRONMENT_TYPE
is defined, WordPress will assume all environments are production environments.
It is encouraged that WP_ENVIRONMENT_TYPE
be updated to local
and staging
, however if you do indeed need to run on production you can add a filter to wp_scrubber_allow_on_production
, changing the value to true
.
How can I change which users are scrubbed?
In addition to CLI arguments, three filters are available to developers
wp_scrubber_allowed_email_domains
- Allows for ignoring users based on their email domain. By default,10up.com
andget10up.com
are ignored.wp_scrubber_allowed_emails
- Allows for ignoring specific users based on their full email address. Helpful if you want to save certain users from an organization, but not all of them.wp_scrubber_should_scrub_user
- If you conditions are more complex, you can use this filter to check each user individually.
PII
WP Scrubber scrubs PII based on where WordPress core stores data (users, comments, standard user/comment meta keys). It does not have knowledge of PII stored by third party plugins. Even when using this tool you should audit your database for third party PII.
JSON Configuration
WP Scrubber also includes the option to configure scrubbing rules using a JSON configuration file. This allows for more detailed and flexible scrubbing rules for post types, taxonomies, options, user data, custom tables, and truncating tables.
To use the JSON configuration, create a wp-scrubber.json
file in the root of your WordPress installation. The plugin will automatically detect and use this file for scrubbing rules.
JSON Configuration Structure
Post Types
Define which post types and their associated fields and meta fields to scrub.
{ "post_types": [ { "name": "post", "fields": [ { "name": "post_title", "action": "faker", "faker_type": "sentence" } // More fields... ], "meta_fields": [ { "key": "meta_key_name", "action": "replace", "value": "new value" } // More meta_fields... ] }, // More post_types... ] }
name
: The post type (e.g., post, page).fields
: Fields to scrub within the post type.name
: Field name.action
: Scrubbing action (faker
,replace
,remove
).faker_type
: Type of fake data from Faker (e.g.,sentence
).value
: Replacement value forreplace
action.
meta_fields
: Post meta fields to scrub.key
: Meta key.action
,faker_type
,value
: As described above.
Taxonomies
Define taxonomies and their terms and meta fields to scrub.
{ "taxonomies": [ { "name": "category", "fields": [ { "name": "name", "action": "faker", "faker_type": "sentence" } // More fields... ], "meta_fields": [ { "key": "meta_key_name", "action": "replace", "value": "new value" } // More meta_fields... ] }, // More taxonomies... ] }
name
: Taxonomy name.fields
: Fields to scrub within the terms.name
,action
,faker_type
,value
: As described above.
meta_fields
: Term meta fields to scrub.key
,action
,faker_type
,value
: As described above.
Options
Define WordPress options to scrub.
{ "options": [ { "name": "admin_email", "action": "faker", "faker_type": "email" } // More options... ] }
name
: Option name.action
,faker_type
,value
: As described above.
User Data
Define user data fields to scrub.
{ "user_data": [ "fields": [ { "name": "user_email", "action": "faker", "faker_type": "email" } // More user_data... ], "meta_fields": [ { "key": "meta_key_name", "action": "replace", "value": "new value" } // More meta_fields... ] ] }
fields
: Fields to scrub within the user.name
,action
,faker_type
,value
: As described above.
meta_fields
: User meta fields to scrub.key
,action
,faker_type
,value
: As described above.
Custom Tables
Define custom tables and columns to scrub.
{ "custom_tables": [ { "name": "custom_table_name", "primary_key": "id", "columns": [ { "name": "column_name", "action": "faker", "faker_type": "name" } // More columns... ] }, // More custom_tables... ] }
name
: Custom table name.primary_key
: Primary key column name.columns
: Columns within the custom table to scrub.name
,action
,faker_type
,value
: As described above.
Truncate Tables
List tables to be entirely truncated.
{ "truncate_tables": [ "table_to_truncate" // More tables to truncate... ] }
truncate_tables
: List of table names to truncate.
Scrubbing Actions
action
: Defines the scrubbing action (faker
,replace
,remove
).faker_type
: Specifies the type of fake data (e.g.,name
,email
) when using thefaker
action.value
: For thereplace
action, the specific value to replace the original data.
Full Example
See a full example JSON config at /config-example
.
Definition of Beta
10up considers this tool production ready for 10up projects. Publicly we define it as beta because we are wary of people relying on this tool solely when third party software can store PII in unknown locations.
Support Level
Beta: This project is quite new and we're not sure what our ongoing support level for this will be. Bug reports, feature requests, questions, and pull requests are welcome. If you like this project please let us know, but be cautious using this in a Production environment!