Binary Log Formats
Contents
Supported Binary Log Formats
There are three supported formats for binary log events:
- Statement-Based Logging
- Row-Based Logging
- Mixed Logging
Regardless of the format, binary log events are always stored in a binary format, rather than in plain text. MariaDB includes the mysqlbinlog
utility that can be used to output binary log events in a human-readable format.
You may want to set the binary log format in the following cases:
- If you execute single statements that update many rows, then statement-based logging will be more efficient than row-based logging for the slave to download.
- If you execute many statements that don't affect any rows, then row-based logging will be more efficient than statement-based logging for the slave to download.
- If you execute statements that take a long time to complete, but they ultimately only insert, update, or delete a few rows in the table, then row-based logging will be more efficient than statement-based logging for the slave to apply.
The default, since MariaDB 10.2.3, is mixed logging which is replication safe and uses up less storage space than row logging.
The storage engine API also allows storage engines to set or limit the logging format, which helps reduce errors with replicating between masters and slaves with different storage engines.
Statement-Based Logging
MariaDB until 10.2.3
In MariaDB 10.2.3 and before, statement-based logging is the default. In future version mixed logging is the default.
When statement-based logging is enabled, statements are logged to the binary log exactly as they were executed. Temporary tables created on the master will also be created on the slave. This mode is only recommended in the case where one has a need to keep the binary log as small as possible and the master, slave has identical data (including using the same storage engines for all tables) and all functions you are using are deterministic (repeatable with the same arguments). Statements and tables using timestamps or auto_increment are safe to use with statement-based logging.
This mode can be enabled by setting the binlog_format
system variable to STATEMENT
.
In certain cases when it would be impossible to execute the statement on the slave, the server will switch to row-based logging for the statement. Some cases of this are:
- When replication has been changed from Row based to Statement based and a statement uses data from a temporary table created during Row based mode. In this case the temporary tables are not stored on the slave so row logging is the only alternative.
- ALTER TABLE of a table using a storage engine that stores data remotely, like the S3 storage engine, to another storage engine.
- One is using SEQUENCE's in the statement or the CREATE TABLE definition.
In certain cases, a statement may not be deterministic, and therefore not safe for replication. If MariaDB determines that an unsafe statement has been executed, then it will issue a warning. For example:
[Warning] Unsafe statement written to the binary log using statement format since BINLOG_FORMAT = STATEMENT. The statement is unsafe because it uses a LIMIT clause. This is unsafe because the set of rows included cannot be predicted.
See Unsafe Statements for Statement-based Replication for more information.
If you need to execute non-deterministic statements, then it is safer to use mixed logging or row-based.
Mixed Logging
MariaDB starting with 10.2.4
In MariaDB 10.2.4 and later, mixed logging is the default.
When mixed logging is enabled, the server uses a combination of statement-based logging and row-based logging. Statement-based logging is used by default, but when the server determines a statement may not be safe for statement-based logging, it will use row-based logging instead. See Unsafe Statements for Statement-based Replication: Unsafe Statements for a list of unsafe statements.
During one transaction, some statements may be logged with row logging while others are logged with statement-based logging.
This mode can be enabled by setting the binlog_format
system variable to MIXED
.
Row-Based Logging
When row-based logging is enabled, DML statements are not logged to the binary log. Instead, each insert, update, or delete performed by the statement for each row is logged to the binary log separately. DDL statements are still logged to the binary log.
Row-based logging uses more storage than the other log formats but is the safest to use. In practice mixed logging should be as safe.
If one wants to be able to see the original query that was logged, one can enable annotated rows events, that is shown with mysqlbinlog, with --binlog-annotate-row-events. This option is on by default.
This mode can be enabled by setting the binlog_format
system variable to ROW
.
Compression of the binary log
Compression of the binary log can be used with any of the binary log format, but the best results comes from using mixed or row logging. You can enable compression by using the --log_bin_compress startup option.
Configuring the Binary Log Format
The format for binary log events can be configured by setting the binlog_format
system variable. If you have the SUPER
privilege, then you can change it dynamically with SET GLOBAL
. For example:
SET GLOBAL binlog_format='ROW';
You can also change it dynamically for just a specific session with SET SESSION
. For example:
SET SESSION binlog_format='ROW';
It can also be set in a server option group in an option file prior to starting up the server. For example:
[mariadb] ... binlog_format=ROW
Be careful when changing the binary log format when using replication. When you change the binary log format on a server, it only changes the format for that server. Changing the binary log format on a master has no effect on the slave's binary log format. This can cause replication to give inconsistent results or to fail.
Be careful changing the binary log format dynamically when the server is a slave and parallel replication is enabled. If you change the global value dynamically, then that does not also affect the session values of any currently running threads. This can cause problems with parallel replication, because the worker threads will remain running even after STOP SLAVE
is executed. This can be worked around by resetting the slave_parallel_threads
system variable. For example:
STOP SLAVE; SET GLOBAL slave_parallel_threads=0; SET GLOBAL binlog_format='ROW'; SET GLOBAL slave_parallel_threads=4; START SLAVE
Effect of the Binary Log Format on Slaves
In MariaDB 10.0.22 and later, a slave will apply any events it gets from the master, regardless of the binary log format. The binlog_format
system variable only applies to normal (not replicated) updates.
If you are running MySQL or an older MariaDB than 10.0.22, you should be aware of that if you are running the slave in binlog_format=STATEMENT
mode, the slave will stop if the master is used with binlog_format
set to anything else than STATEMENT
.
The binary log format is upward compatible. This means replication should always work if slave is of the same or new version of MariaDB than the master.
The mysql Database
Statements that affect the mysql
database can be logged in a different way to that expected.
If the mysql database is edited directly, logging is performed as expected according to the binlog_format. Statements that directly edit the mysql database include INSERT, UPDATE, DELETE, REPLACE, DO, LOAD DATA INFILE, SELECT, and TRUNCATE TABLE.
If the mysql
database is edited indirectly, statement logging is used regardless of binlog_format setting. Statements editing the mysql
database indirectly include GRANT, REVOKE, SET PASSWORD, RENAME USER, ALTER, DROP and CREATE (except for the situation described below).
CREATE TABLE ... SELECT can use a combination of logging formats. The CREATE TABLE portion of the statement is logged using statement-based logging, while the SELECT portion is logged according to the value of binlog_format
.