ashleyhindle / private-dump
CLI script to dump your MySQL database, and sanitise the output of sensitive data for development/staging use
Installs: 6
Dependents: 0
Suggesters: 0
Security: 0
Stars: 6
Watchers: 3
Forks: 1
Open Issues: 6
Type:project
Requires
- php: ^7.3|^8.0|^8.1|^8.2
- ext-json: *
- dflydev/dot-access-data: ^1.0
- fakerphp/faker: ^1.14.0
- ifsnop/mysqldump-php: ^2.9
- nategood/commando: ^0.3.0
- pelmered/fake-car: ^1.6
Requires (Dev)
- phpunit/phpunit: 9.5.4
- dev-master
- v0.2.2
- v0.2.1
- v0.2
- v0.1.1
- v0.1
- v0.0.7
- v0.0.6
- v0.0.5
- v0.0.4
- v0.0.3
- v0.0.2
- v0.0.1
- dev-feat/value-persistency
- dev-fixup/restore-private-dump
- dev-analysis-vQg7Pl
- dev-update-phar
- dev-pass-params-to-faker-commands
- dev-add-vehicle-options
- dev-add-gravitar-support
- dev-unit-test-action
- dev-php8-support-fakerphp
This package is auto-updated.
Last update: 2024-12-09 11:57:44 UTC
README
Private Dump is a CLI tool which can create an anonymised dump of your MySQL database, usually for development or staging use.
It accomplishes this by reading a JSON configuration file which maps out which table columns should be modified and how.
Private Dump requires PHP >= 7.3
Table of Contents
Installation
Install with Composer
composer require ashleyhindle/private-dump
Install with curl
curl -Lo private-dump https://github.com/cazana/private-dump/releases/download/v0.1.1/private-dump chmod a+x private-dump
Usage
When private-dump is ran with a valid configuration it will output the dump to stdout, allowing you to redirect it to a file, or pipe it to another program (for compression, transfer, encryption, etc..).
First, create a configuration file manually or from an example config, then:
Composer:
vendor/bin/private-dump.phar -c private-dump.json > /backups/mysql-backup-with-sensitive-data-overwritten.sql
Curl:
private-dump -c private-dump.json > /backups/mysql-backup-with-sensitive-data-overwritten.sql
You can also override the MySQL username, password and hostname from the command line:
Composer:
vendor/bin/private-dump.phar -c private-dump.json -u bigben -p bingbong -h rds-213121231-13gb.amazon.com > /backups/mysql-backup-with-sensitive-data-overwritten.sql
Curl:
private-dump -c private-dump.json -u bigben -p bingbong -h rds-213121231-13gb.amazon.com > /backups/mysql-backup-with-sensitive-data-overwritten.sql
Note: It's best not to pass the password on the command line as it can be seen in the process list, and will exist in the user's history.
Configuration File
The configuration file is a JSON dictionary containing:
- database connection details
- list of databases to dump
- list of tables with replacements or transformers
- Seeder - one column can be used as a row-seeder to get repeatability in consecutive dumps
- keepers - some rows can be kept as is, if a specified column matches a regular expression
- columns with replacements or transformers
- options to restrict output
where
- Added to the query when retrieving data to dump:... WHERE xxx...
limit
- Added to the query when retrieving data to dump:... LIMIT xxx...
An example configuration, and configurations for popular applications, exist in the configs directory.
Annotated Example Configuration - private-dump.json
{
"connection": { -- Database connection details
"username": "bigben",
"password": "bingbong",
"hostname": "192.168.56.81"
},
"databases": { -- Databases to dump - databases not present will not be dumped
"databaseName": { -- All tables in this database are dumped
"tableName": { -- Referenced tables allow replacements
"email_address": "@email", -- Column and its replacement
"full_name": "@fullName",
"is_active": 0, -- Column with hardcoded value
"$options": { -- Special options array for limit/where
"where": "last_login > NOW() - INTERVAL 1 WEEK",
"limit": 25
}
}
},
"databaseTwo": {}, -- Dump entire database and tables, with no replacements
"databaseThree": {
"users": {
"$options": { -- Only options to limit data, no replacements
"where": "is_active=1"
}
}
}
"databaseThree": {
"users": {
"@seed": "id", -- Seed each row using user id, to get repeatability
"@keepif": { -- Keep all rows with emails using domain @ourdomain.com intact
"column": "email",
"regex": "^.*@ourdomain.com$"
}
"first_name": "@user(current_user).firstName",
"last_name": "@user(current_user).lastName",
"email": "@user(current_user).email", - email will be based on the first and last names
}
}
}
}
Key Value Tables
Private Dump supports replacing values in a key-value store, by using an array in the configuration file to link the value
column with the key
column as below:
{ "connection": {...}, "databases": { "wordpress": { "wp_options": { "option_value": { "$link": "option_name", "$transformers": { "admin_email": "@email", "mailserver_pass": "@password", "autoload": "yes" } } } } } }
This is a bit more complicated than the standard replacements, but offers a lot of flexibility for anonymising all types of data.
Replacements
The vast majority of these are made possible by the amazing Faker library. Most formatters listed in Faker's documentation are supported in Private Dump's configuration file
All replacements below should be prefixed with an @
as in the example configuration files.
If you need to use a hardcoded value (active=0, completed=1) you can do this by omitting the @
: "active": 0
in the configuration file.
You can pass variables to commands as such @numberBetween|100,1000
Text
original
- The original value, useful to use with modifiersstring
- Random length string up to 255 charactersrealText
- Quotes from booksloremSentence
- 1 sentence of LoremloremParagraph
- 3 sentences of LoremloremParagraphs
- 3 paragraphs of Lorem
Dates
iso8601
- 2019-01-20iso8601Recent
- ISO 8601 date in the last 3 months
Internet
email
- bigben@example.comurl
- https://www.parliament.uk/bigbenipv4
ipv6
userAgent
domainName
- bigben.netslug
- big-ben-bing-bong
Random
randomDigit
- singular digitrandomNumber
- up to 8 digitsrandomLetter
randomString
- Random length string up to 255 characters
User
firstName
lastName
title
- Ms. Mr. Dr.fullName
- Brian MayfullAddress
- One line: Building number, street, city, state/county, postcode/zipbuildingNumber
- 368streetName
- BroadwaystreetAddress
- 368 Broadwaycity
- Londonpostcode
- SW1A 0AAcountry
- Englandstate
- Texascounty
- Londonlatitude
- 51.5008longitude
--.1246
phoneNumber
email
- bigben@example.comusername
- BigBenpassword
url
- https://www.parliament.uk/bigbenipv4
- IPv4 Addressipv6
- IPv6 Address
Payment
creditCardType
- MastercardcreditCardNumber
- 4444 1111 2222 3333creditCardExpirationDate
- 04/22creditCardExpirationDateString
- '04/13'iban
- BI6B3N8497112740YZ575DJ28BP4swiftBicNumber
- BIGBEN22263
Company
company
- Company-NamejobTitle
- Croupier
Miscellaneous
boolean
md5
sha1
sha256
countryCode
- UKcurrencyCode
- EUR
Barcodes
barcodeEan13
barcodeEan8
barcodeIsbn13
barcodeIsbn10
Transformers
uppercase
lowercase
These notes are mainly for my own development use, feel free to ignore.
Dev Steps
- Install Box
- Modify PHP configuration to set
phar.readonly = Off
box build
chmod a+x bin/private-dump.phar
Release Process
- Build the PHAR:
box build
- Rename the PHAR:
mv bin/private-dump.phar ./private-dump
- Update the version in
README.md
's installation instructions based on the next version fromgit tag --list
- Tag the next release:
git tag -a vx.x.x -m "Release x.x.x"
- Push:
git push origin --tags
- Edit release on GitHub attaching the newly created
bin/private-dump
file