1 of 100

Storage Engines

Understand MariaDB Server's storage engines. Explore the features and use cases of InnoDB, Aria, MyISAM, and other engines to choose the best option for your specific data needs.

Storage Engines Overview

An introduction to MariaDB's pluggable storage engine architecture, highlighting key engines like InnoDB, MyISAM, and Aria for different workloads.

Overview

MariaDB features pluggable storage engines to allow per-table workload optimization.

A storage engine is a type of plugin for MariaDB:

Different storage engines may be optimized for different workloads, such as transactional workloads, analytical workloads, or high throughput workloads.
Different storage engines may be designed for different use cases, such as federated table access, table sharding, and table archiving in the cloud.
Different tables on the same server may use different storage engines.

Engine

Target

Optimization

Availability

Examples

Identify the Default Storage Engine

Identify the server's global default storage engine by using to query the system variable:

Identify the session's default storage engine by using :

Set the Default Storage Engine

Global default storage engine:

Session default storage engine supersedes global default during this session:

Configure the Default Storage Engine

Identify Available Storage Engines

Choose Storage Engine for a New Table

Storage engine is specified at time of table creation using a ENGINE = parameter.

Resources

Engines for System Tables

Standard MariaDB storage engines are used for System Table storage:

FAQ

Can I use more than one storage engine on a server?

Yes, different tables can use different storage engines on the same server.
To create a table with a specific storage engine, specify the ENGINE table option to the statement.

Can I use more than one storage engine in a single query?

Yes, a single query can reference tables that use multiple storage engines.
In some cases, special configuration may be required. For example, ColumnStore requires cross engine joins to be configured.

What storage engine should I use for transactional or OLTP workloads?

is the recommended storage engine for transactional or OLTP workloads.

What storage engine should I use for analytical or OLAP workloads?

is the recommended storage engine for analytical or OLAP workloads.

What storage engine should I use if my application performs both transactional and analytical queries?

An application that performs both transactional and analytical queries is known as .

HTAP can be implemented with MariaDB by using for transactional queries and for analytical queries.

Reference

MariaDB Server Reference

.
, which shows available storage engines.
, which shows storage engine by table.

ARIA

Learn about the Aria storage engine in MariaDB Server. Understand its features, advantages, and use cases, particularly for crash-safe operations and transactional workloads.

Aria Group Commit

Learn about Aria's group commit functionality, which improves performance by batching commit operations to the transaction log.

The Aria storage engine includes a feature to group commits to speed up concurrent threads doing many inserts into the same or different Aria tables.

By default, group commit for Aria is turned off. It is controlled by the aria_group_commit and aria_group_commit_interval system variables.

Information on setting server variables can be found on the Server System Variables page.

Terminology

A commit is flush of logs followed by a sync.
sent to disk means written to disk but not sync()ed,
flushed mean sent to disk and synced().
LSN means log serial number. It's refers to the position in the transaction log.

Non Group commit logic (aria_group_commit="none")

The thread which first started the commit is performing the actual flush of logs. Other threads set the new goal (LSN) of the next pass (if it is maximum) and wait for the pass end or just wait for the pass end.

The effect of this is that a flush (write of logs + sync) will save all data for all threads/transactions that have been waiting since the last flush.

If hard group commit is enabled (aria_group_commit="hard")

If hard commit and aria_group_commit_interval=0

The first thread sends all changed buffers to disk. This is repeated as long as there are new LSNs added. The process can not loop forever because we have a limited number of threads and they will wait for the data to be synced.

Pseudo code:

If hard commit and aria_group_commit_interval > 0

If less than rate microseconds has passed since the last sync, then after buffers have been sent to disk, wait until rate microseconds has passed since last sync, do sync and return. This ensures that if we call sync infrequently we don't do any waits.

If soft group commit is enabled (aria_group_commit="soft")

Note that soft group commit should only be used if you can afford to lose a few rows if your machine shuts down hard (as in the case of a power failure).

Works like in non group commit' but the thread doesn't do any real sync(). If aria_group_commit_interval is not zero, the sync() calls are performed by a service thread with the given rate when needed (new LSN appears). If aria_group_commit_interval is zero, there are no sync() calls.

Code

The code for this can be found in storage/maria/ma_loghandler.c::translog_flush().

_{This page is licensed: CC BY-SA / Gnu FDL}

Aria Status Variables

A list of status variables specific to the Aria engine, providing metrics on page cache usage, transaction log syncs, and other internal operations.

This page documents status variables related to the Aria storage engine. See Server Status Variables for a complete list of status variables that can be viewed with SHOW STATUS.

`Aria_pagecache_blocks_not_flushed`

Description: The number of dirty blocks in the Aria page cache. The global value can be flushed by .
Scope: Global
Data Type: numeric

`Aria_pagecache_blocks_unused`

Description: Free blocks in the Aria page cache. The global value can be flushed by .
Scope: Global
Data Type: numeric

`Aria_pagecache_blocks_used`

Description: Blocks used in the Aria page cache. The global value can be flushed by .
Scope: Global
Data Type: numeric

`Aria_pagecache_read_requests`

Description: The number of requests to read something from the Aria page cache.
Scope: Global
Data Type: numeric

`Aria_pagecache_reads`

Description: The number of Aria page cache read requests that caused a block to be read from the disk.
Scope: Global
Data Type: numeric

`Aria_pagecache_write_requests`

Description: The number of requests to write a block to the Aria page cache.
Scope: Global
Data Type: numeric

`Aria_pagecache_writes`

Description: The number of blocks written to disk from the Aria page cache.
Scope: Global
Data Type: numeric

`Aria_transaction_log_syncs`

Description: The number of Aria log fsyncs.
Scope: Global
Data Type: numeric

_{This page is licensed: CC BY-SA / Gnu FDL}

Aria Storage Formats

Understand the different row formats supported by Aria, particularly the default PAGE format which enables crash safety and better concurrency.

The Aria storage engine supports three different table storage formats.

These are FIXED, DYNAMIC and PAGE, and they can be set with the ROW FORMAT option in the CREATE TABLE statement. PAGE is the default format, while FIXED and DYNAMIC are essentially the same as the MyISAM formats.

The SHOW TABLE STATUS statement can be used to see the storage format used by a table.

Fixed-length

Fixed-length (or static) tables contain records of a fixed-length. Each column is the same length for all records, regardless of the actual contents. It is the default format if a table has no BLOB, TEXT, VARCHAR or VARBINARY fields, and no ROW FORMAT is provided. You can also specify a fixed table with ROW_FORMAT=FIXED in the table definition.

Tables containing BLOB or TEXT fields cannot be FIXED, as by design these are both dynamic fields.

Fixed-length tables have a number of characteristics:

fast, since MariaDB will always know where a record begins
easy to cache
take up more space than dynamic tables, as the maximum amount of storage space are allocated to each record.
reconstructing after a crash is uncomplicated due to the fixed positions

Dynamic

Dynamic tables contain records of a variable length. It is the default format if a table has any BLOB, TEXT, VARCHAR or VARBINARY fields, and no ROW FORMAT is provided. You can also specify a DYNAMIC table with ROW_FORMAT=DYNAMIC in the table definition.

Dynamic tables have a number of characteristics

Each row contains a header indicating the length of the row.
Rows tend to become fragmented easily. UPDATING a record to be longer will likely ensure it is stored in different places on the disk.
All string columns with a length of four or more are dynamic.
They require much less space than fixed-length tables.

Transactional

See for the impact of the TRANSACTIONAL option on the row format.

_{This page is licensed: CC BY-SA / Gnu FDL}

Aria Two-step Deadlock Detection

Explains Aria's deadlock detection mechanism, which uses a two-step process with configurable search depths and timeouts to resolve conflicts.

Description

The Aria storage engine can automatically detect and deal with deadlocks (see the Wikipedia deadlocks article).

This feature is controlled by four configuration variables, two that control the search depth and two that control the timeout.

deadlock_search_depth_long

How it Works

If Aria is ever unable to obtain a lock, we might have a deadlock. There are two primary ways for detecting if a deadlock has actually occurred. First is to search a wait-for graph (see the ) and the second is to just wait and let the deadlock exhibit itself. Aria Two-step Deadlock Detection does a combination of both.

First, if the lock request cannot be granted immediately, we do a short search of the wait-for graph with a small search depth as configured by the deadlock_search_depth_short variable. We have a depth limit because the graph can (theoretically) be arbitrarily big and we don't want to recursively search the graph arbitrarily deep. This initial, short search is very fast and most deadlocks are detected right away. If no deadlock cycles are found with the short search the system waits for the amount of time configured in deadlock_timeout_short to see if the lock conflicts are removed and the lock can be granted. Assuming this did not happen and the lock request still waits, the system then moves on to step two, which is a repeat of the process but this time searching deeper using the deadlock_search_depth_long. If no deadlock has been detected, it waits deadlock_timeout_long and times out.

When a deadlock is detected the system uses a weighting algorithm to determine which thread in the deadlock should be killed and then kills it.

_{This page is licensed: CC BY-SA / Gnu FDL}

The Aria Name

A brief history of the naming of the Aria storage engine, explaining its origins as "Maria" and the reasons for the eventual name change.

The storage engine used to be called Maria. This page gives the history and background of how and why this name was changed to Aria.

Backstory

When starting what became the MariaDB project, Monty and the initial developers only planned to work on a next generation storage engine replacement. This storage engine would be crash safe and eventually support transactions. Monty named the storage engine, and the project, after his daughter, Maria.

Work began in earnest on the Maria storage engine but the plans quickly expanded and morphed and soon the developers were not just working on a storage engine, but on a complete branch of the MySQL database. Since the project was already called Maria, it made sense to call the whole database server MariaDB.

CONNECT

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Note: You can download a PDF version of the CONNECT documentation (1.7.0003):

Connect Version

Introduced

Maturity

Introduction to the CONNECT Engine

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

CONNECT is not just a new “YASE” (Yet another Storage Engine) that provides another way to store data with additional features. It brings a new dimension to MariaDB, already one of the best products to deal with traditional database transactional applications, further into the world of business intelligence and data analysis, including NoSQL facilities. Indeed, BI is the set of techniques and tools for the transformation of raw data into meaningful and useful information. And where is this data?

"It's amazing in an age where relational databases reign supreme when it comes to managing data that so much information still exists outside RDBMS engines in the form of flat files and other such constructs. In most enterprises, data is passed back and forth between disparate systems in a fashion and speed that would rival the busiest expressways in the world, with much of this data existing in common, delimited files. Target systems intercept these source files and then typically proceed to load them via ETL (extract, transform, load) processes into databases that then utilize the information for business intelligence, transactional functions, or other standard operations. ETL tasks and data movement jobs can consume quite a bit of time and resources, especially if large volumes of data are present that require loading into a database. This being the case, many DBAs welcome alternative means of accessing and managing data that exists in file format."

Using CONNECT

The CONNECT storage engine has been deprecated.

Using CONNECT - Condition Pushdown

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

The ODBC, JDBC, MYSQL, TBL and WMI table types use engine condition pushdown in order to restrict the number of rows returned by the RDBS source or the WMI component.

The CONDITION_PUSHDOWN argument used in old versions of CONNECT is no longer needed because CONNECT uses condition pushdown unconditionally.

_{This page is licensed: GPLv2}

Using CONNECT - Exporting Data From MariaDB

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Exporting data from MariaDB is obviously possible with CONNECT in particular for all formats not supported by the statement. Let us consider the query:

Supposing you want to get the result of this query into a file handlers.htm in XML/HTML format, allowing displaying it on an Internet browser, this is how you can do it:

Just create the CONNECT table that are used to make the file:

Here the column definition is not given and will come from the Select statement following the Create. The CONNECT options are the same we have seen previously. This will do both actions, creating the matching handlers CONNECT table and 'filling' it with the query result.

Using CONNECT - General Information

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

The main characteristic of CONNECT is to enable accessing data scattered on a machine as if it was a centralized database. This, and the fact that locking is not used by connect (data files are open and closed for each query) makes CONNECT very useful for importing or exporting data into or from a MariaDB database and also for all types of Business Intelligence applications. However, it is not suited for transactional applications.

For instance, the index type used by CONNECT is closer to bitmap indexing than to B-trees. It is very fast for retrieving result but not when updating is done. In fact, even if only one indexed value is modified in a big table, the index is entirely remade (yet this being four to five times faster than for a b-tree index). But normally in Business Intelligence applications, files are not modified so often.

If you are using CONNECT to analyze files that can be modified by an external process, the indexes are of course not modified by it and become outdated. Use the OPTIMIZE TABLE command to update them before using the tables based on them.

This means also that CONNECT is not designed to be used by centralized servers, which are mostly used for transactions and often must run a long time without human intervening.

Performance

Performances vary a great deal depending on the table type. For instance, ODBC tables are only retrieved as fast as the other DBMS can do. If you have a lot of queries to execute, the best way to optimize your work can be sometime to translate the data from one type to another. Fortunately this is very simple with CONNECT. Fixed formats like FIX, BIN or VEC tables can be created from slower ones by commands such as:

FIX and BIN are often the better choice because the I/O functions are done on blocks of BLOCK_SIZE rows. VEC tables can be very efficient for tables having many columns only a few being used in each query. Furthermore, for tables of reasonable size, the MAPPED option can very often speed up many queries.

Create Table statement

Be aware of the two broad kinds of CONNECT tables:

Drop Table statement

For outward tables, the statement just removes the table definition but does not erase the table data. However, dropping an inward tables also erase the table data as well.

Alter Table statement

Be careful using the statement. Currently the data compatibility is not tested and the modified definition can become incompatible with the data. In particular, Alter modifies the table definition only but does not modify the table data. Consequently, the table type should not be modified this way, except to correct an incorrect definition. Also adding, dropping or modifying columns may be wrong because the default offset values (when not explicitly given by the FLAG option) may be wrong when recompiled with missing columns.

Safe use of ALTER is for indexing, as we have seen earlier, and to change options such as MAPPED or HUGE those do not impact the data format but just the way the data file is accessed. Modifying the BLOCK_SIZE option is all right with FIX, BIN, DBF, split VEC tables; however it is unsafe for VEC tables that are not split (only one data file) because at their creation the estimate size has been made a multiple of the block size. This can cause errors if this estimate is not a multiple of the new value of the block size.

In all cases, it is safer to drop and re-create the table (outward tables) or to make another one from the table that must be modified.

Update and Delete for File Tables

CONNECT can execute these commands using two different algorithms:

It can do it in place, directly modifying rows (update) or moving rows (delete) within the table file. This is a fast way to do it in particular when indexing is used.
It can do it using a temporary file to make the changes. This is required when updating variable record length tables and is more secure in all cases.

The choice between these algorithms depends on the session variable .

_{This page is licensed: GPLv2}

Using CONNECT - Importing File Data Into MariaDB Tables

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Directly using external (file) data has many advantages, such as to work on “fresh” data produced for instance by cash registers, telephone switches, or scientific apparatus. However, you may want in some case to import external data into your MariaDB database. This is extremely simple and flexible using the CONNECT handler. For instance, let us suppose you want to import the data of the xsample.xml XML file previously given in example into a table called biblio belonging to the connect database. All you have to do is to create it by:

This last statement creates the table and inserts the original XML data, translated to tabular format by the xsampall2 CONNECT table, into the MariaDB biblio table. Note that further transformation on the data could have been achieved by using a more elaborate Select statement in the Create statement, for instance using filters, alias or applying functions to the data. However, because the Create Table process copies table data, later modifications of the

USING CONNECT - Offline Documentation

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Note: You can download a (1.7.0003).

_{This page is licensed: CC BY-SA / Gnu FDL}

CONNECT Table Types

The CONNECT storage engine has been deprecated.

CONNECT - NoSQL Table Types

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

They are based on files that do not match the relational format but often represent hierarchical data. CONNECT can handle , , , and some HTML files.

The way it is done is different from what MySQL or PostgreSQL does. In addition to including in a table some column values of a specific data format (JSON, XML) to be handled by specific functions, CONNECT can directly use JSON, XML or INI files that are produced by other applications, and this is the table definition that describes where and how the contained information must be retrieved.

This is also different from what MariaDB does with dynamic columns, which is close to what MySQL and PostgreSQL do with the JSON column type.

Note: The LEVEL option used with these tables should, from Connect 1.07.0002, be specified as DEPTH. Also, what was specified with the FIELD_FORMAT column option should now also be specified using JPATH or XPATH.

CONNECT Table Types - OEM: Implemented in an External LIB

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Although CONNECT provides a rich set of table types, specific applications may need to access data organized in a way that is not handled by its existing foreign data wrappers (FDW). To handle these cases, CONNECT features an interface that enables developers to implement in C++ the required table wrapper and use it as if it were part of the standard CONNECT table type list. CONNECT can use these additional handlers providing the corresponding external module (dll or shared lib) be available.

To create such a table on an existing handler, use a Create Table statement as shown below.

The option module gives the name of the DLL or shared library implementing the OEM wrapper for the table type. This library must be located in the plugin directory like all other plugins or UDF’s.

This library must export a function GetMYTYPE

CONNECT VEC Table Type

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Warning: Avoid using this table type in production applications. This file format is specific to CONNECT and may not be supported in future versions.

Tables of type VEC are binary files that in some cases can provide good performance on read-intensive query workloads. CONNECT organizes their data on disk as columns of values from the same attribute, as opposed to storing it as rows of tabular records. This organization means that when a query needs to access only a few columns of a particular table, only those columns need to be read from disk. Conversely, in a row-oriented table, all values in a table are typically read from disk, wasting I/O bandwidth.

CONNECT provides two integral VEC formats, in which each column's data is adjacent.

CONNECT Security

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

The use of the CONNECT engine requires the FILE privilege for "outward" tables. This should not be an important restriction. The use of CONNECT "outward" tables on a remote server seems of limited interest without knowing the files existing on it and must be protected anyway. On the other hand, using it on the local client machine is not an issue because it is always possible to create locally a user with the FILE privilege.

_{This page is licensed: GPLv2}

Current Status of the CONNECT Handler

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

The CONNECT handler is a GA (stable) release. It was written starting both from an aborted project written for MySQL in 2004 and from the “DBCONNECT” program. It was tested on all the examples described in this document, and is distributed with a set of 53 test cases. Here is a not limited list of future developments:

Adding more table types.

CSV

The CSV storage engine stores data in text files using comma-separated values format, allowing easy data exchange with other applications.

FederatedX

FederatedX is a storage engine that allows access to tables on remote MariaDB or MySQL servers as if they were local tables.

This storage engine has been deprecated.

About FederatedX Differences Between FederatedX and Federated

Differences Between FederatedX and Federated

This page outlines the key enhancements in FederatedX over the original Federated engine, including support for transactions and a refactored codebase. This storage engine has been deprecated.

This storage engine has been deprecated.

The main differences are:

New features in FederatedX

Transactions (beta feature)
Supports partitions (alpha feature)
New class structure which allows developers to write connection classes for other RDBMSs without having to modify base classes for FederatedX

Different behavior

FederatedX is statically compiled into MariaDB by default.
When you create a table with FederatedX, the connection are tested. The CREATE will fail if MariaDB can't connect to the remote host or if the remote table doesn't exist.

_{This page is licensed: CC BY-SA / Gnu FDL}

InnoDB

Discover InnoDB, the default storage engine for MariaDB Server. Learn about its transaction-safe capabilities, foreign key support, and high performance for demanding workloads.

InnoDB Doublewrite Buffer

The doublewrite buffer is a storage area where InnoDB writes pages before writing them to the data file, preventing data corruption from partial page writes.

The InnoDB doublewrite buffer was implemented to recover from half-written pages. This can happen when there's a power failure while InnoDB is writing a page to disk. On reading that page, InnoDB can discover the corruption from the mismatch of the page checksum. However, in order to recover, an intact copy of the page would be needed.

The double write buffer provides such a copy.

Whenever InnoDB flushes a page to disk, it is first written to the double write buffer. Only when the buffer is safely flushed to disk will InnoDB write the page to the final destination. When recovering, InnoDB scans the double write buffer and for each valid page in the buffer checks if the page in the data file is valid too.

Doublewrite Buffer Settings

To turn off the doublewrite buffer, set the system variable to 0. This is safe on filesystems that write pages atomically - that is, a page write fully succeeds or fails. But with other filesystems, it is not recommended for production systems. An alternative option is atomic writes. See for more details.

_{This page is licensed: CC BY-SA / Gnu FDL}

InnoDB Architecture for MariaDB Enterprise Server

Understand InnoDB's architecture for MariaDB Enterprise Server. This section details its components and their interactions, focusing on performance, scalability, and reliability for enterprise workloa

MariaDB Enterprise Server InnoDB Background Thread Pool

This page details the dedicated thread pool in MariaDB Enterprise Server that manages InnoDB background tasks, improving scalability and performance.

Overview

Starting with MariaDB Enterprise Server 10.5 and MariaDB Community Server 10.5, InnoDB uses the InnoDB Background Thread Pool to perform internal operations in the background. In previous versions, the internal operations were performed by dedicated threads. By using the InnoDB Background Thread Pool instead of many dedicated threads, InnoDB can reduce context switching and use system resources more effectively.

The InnoDB Background Thread Pool performs internal operations in multiple categories: tasks, timers, and asynchronous I/O.

Tasks are used to perform internal operations that are triggered by some event. In ES 10.5 and later and CS 10.5 and later, the following threads have been replaced by

InnoDB Online DDL

Perform online DDL operations with InnoDB in MariaDB Server. Learn how to alter tables without blocking read/write access, ensuring high availability for your applications.

Using CONNECT - Partitioning and Sharding

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

CONNECT supports the MySQL/MariaDB partition specification. It is done similar to the way MyISAM or InnoDB do by using the PARTITION engine that must be enabled for this to work. This type of partitioning is sometimes referred as “horizontal partitioning”.

Partitioning enables you to distribute portions of individual tables across a file system according to rules which you can set largely as needed. In effect, different portions of a table are stored as separate tables in different locations. The user-selected rule by which the division of data is accomplished is known as a partitioning function, which in MariaDB can be the modulus, simple matching against a set of ranges or value lists, an internal hashing function, or a linear hashing function.

CONNECT takes this notion a step further, by providing two types of partitioning:

File partitioning. Each partition is stored in a separate file like in multiple tables.
Table partitioning. Each partition is stored in a separate table like in TBL tables.

Partition engine issues

Using partitions sometimes requires creating the tables in an unnatural way to avoid some error due to several partition engine bugs:

Engine specific column and index options are not recognized and cause a syntax error when the table is created. The workaround is to create the table in two steps, a CREATE TABLE statement followed by an ALTER TABLE statement.
The connection string, when specified for the table, is lost by the partition engine. The workaround is to specify the connection string in the option_list.
. In case of list columns partitioning it sometimes causes a false “impossible where” clause to be raised. This makes a wrong void result returned when it should not be void. There is no workaround but this bug should be hopefully fixed.

The following examples are using the above workaround syntax to address these issues.

File Partitioning

File partitioning applies to file-based CONNECT table types. As with multiple tables, physical data is stored in several files instead of just one. The differences to multiple tables are:

Data is distributed amongst the different files following the partition rule.
Unlike multiple tables, partitioned tables are not read only.
Unlike multiple tables, partitioned tables can be indexable.
The file names are generated from the partition names.

The table file names are generated differently depending on whether the table is an inward or outward table. For inward tables, for which the file name is not specified, the partition file names are:

For instance for the table:

CONNECT will generate in the current data directory the files:

This is similar to what the partition engine does for other engines - CONNECT partitioned inward tables behave like other engines partition tables do. Just the data format is different.

Note: If sub-partitioning is used, inward table files and index files are named:

Outward Tables

The real problems occur with outward tables, in particular when they are created from already existing files. The first issue is to make the partition table use the correct existing file names. The second one, only for already existing not void tables, is to be sure the partitioning function match the distribution of the data already existing in the files.

The first issue is addressed by the way data file names are constructed. For instance let us suppose we want to make a table from the fixed formatted files:

This can be done by creating a table such as:

The rule is that for each partition the matching file name is internally generated by replacing in the given FILE _ NAME option value the “%s” part by the partition name.

If the table was initially void, further inserts will populate it according to the partition function. However, if the files did exist and contained data, this is your responsibility to determine what partition function actually matches the data distribution in them. This means in particular that partitioning by key or by hash cannot be used (except in exceptional cases) because you have almost no control over what the used algorithm does.

In the example above, there is no problem if the table is initially void, but if it is not, serious problems can be met if the initial distribution does not match the table distribution. Supposing a row in which “id” as the value 12 was initially contained in the part1.txt file, it are seen when selecting the whole table but if you ask:

The result will have 0 rows. This is because according to the partition function query pruning will only look inside the second partition and will miss the row that is in the wrong partition.

One way to check for wrong distribution if for instance to compare the results from queries such as:

And

If they match, the distribution can be correct although this does not prove it. However, if they do not match, the distribution is surely wrong.

Partitioning on a Special Column

There are some cases where the files of a multiple table do not contain columns that can be used for range or list partitioning. For instance, let’s suppose we have a multiple table based on the following files:

Each of them containing the same kind of data:

A multiple table can be created on them, for instance by:

The issue is that if we want to create a partitioned table on these files, there are no columns to use for defining a partition function. Each city file can have the same kind of column values and there is no way to distinguish them.

However, there is a solution. It is to add to the table a special column that are used by the partition function. For instance, the new table creation can be done by:

Note 1: we had to do it in two steps because of the column CONNECT options.

Note 2: the special column PARTID returns the name of the partition in which the row is located.

Note 3: here we could have used the FNAME special column instead because the file name is specified as being the partition name.

This may seem rather stupid because it means for instance that a row are in partition boston if it belongs to the partition boston! However, it works because the partition engine doesn’t know about special columns and behaves as if the city column was a real column.

What happens if we populate it by?

The value given for the city column (explicitly or by default) are used by the partition engine to decide in which partition to insert the rows. It are ignored by CONNECT (a special column cannot be given a value) but later will return the matching value. For instance:

This query returns:

city

first_name

job

Everything works as if the city column was a real column contained in the table data files.

Partitioning of zipped tables

Two cases are currently supported: If a table is based on several zipped files, portioning is done the standard way as above. This is the file_name option specifying the name of the zip files that shall contain the ‘%s’ part used to generate the file names. If a table is based on only one zip file containing several entries, this is indicated by placing the ‘%s’ part in the entry option value. Note: If a table is based on several zipped files each containing several entries, only the first case is possible. Using sub-partitioning to make partitions on each entries is not supported yet.

Table Partitioning

With table partitioning, each partition is physically represented by a sub-table. Compared to standard partitioning, this brings the following features:

The partitions can be tables driven by different engines. This relieves the current existing limitation of the partition engine.
The partitions can be tables driven by engines not currently supporting partitioning.
Partition tables can be located on remote servers, enabling table sharding.
Like for TBL tables, the columns of the partition table do not necessarily match the columns of the sub-tables.

The way it is done is to create the partition table with a table type referring to other tables, , or . Let us see how this is done on a simple example. Supposing we have created the following tables:

We can for instance create a partition table using these tables as physical partitions by:

Here the name of each partition sub-table are made by replacing the ‘%s’ part of the tabname option value by the partition name. Now if we do:

The rows are distributed in the different sub-tables according to the partition function. This can be seen by executing the query:

This query replies:

partition_name

table_rows

Query pruning is of course automatic, for instance:

This query replies:

select_type

table

partitions

type

possible_keys

key

key_len

ref

rows

Extra

When executing this select query, only sub-table xt3 are used.

Indexing with Table Partitioning

Using the table type seems natural. However, in this current version, the issue is that PROXY (and ) tables are not indexable. This is why, if you want the table to be indexed, you must use the table type. The CREATE TABLE statement are almost the same:

The column id is declared as a key, and the table type is now MYSQL. This makes Sub-tables accessed by calling a MariaDB server as MYSQL tables do. Note that this modifies only the way CONNECT sub-tables are accessed.

However, indexing just make the partitioned table use “remote indexing” the way FEDERATED tables do. This means that when sending the query to retrieve the table data, a where clause are added to the query. For instance, let’s suppose you ask:

The query sent to the server are:

On a query like this one, it does not change much because the where clause could have been added anyway by the cond_push function, but it does make a difference in case of joins. The main thing to understand is that real indexing is done by the called table and therefore that it should be indexed.

This also means that the xt1, xt2, and xt3 table indexes should be made separately because creating the t2 table as indexed does not make the indexes on the sub-tables.

Sharding with Table Partitioning

Using table partitioning can have one more advantage. Because the sub-tables can address a table located on another server, it is possible to shard a table on separate servers and hardware machines. This may be required to access as one table data already located on several remote machines, such as servers of a company branches. Or it can be just used to split a huge table for performance reason. For instance, supposing we have created the following tables:

Creating the partition table accessing all these are almost like what we did with the t4 table:

The only difference is the tabname option now referring to the rt1, rt2, and rt3 tables. However, even if it works, this is not the best way to do it. This is because accessing a table via the MySQL API is done twice per table. Once by CONNECT to access the FEDERATED table on the local server, then a second time by FEDERATED engine to access the remote table.

The CONNECT MYSQL table type being used anyway, you’d rather use it to directly access the remote tables. Indeed, the partition names can also be used to modify the connection URL’s. For instance, in the case shown above, the partition table can be created as:

Several things can be noted here:

As we have seen before, the partition engine currently loses the connection string. This is why it was specified as “connect” in the option list.
For each partition sub-tables, the “%s” part of the connection string has been replaced by the partition name.
It is not needed anymore to define the rt1, rt2, and rt3 tables (even it does not harm) and the FEDERATED engine is no more used to access the remote tables.

This is a simple case where the connection string is almost the same for all the sub-tables. But what if the sub-tables are accessed by very different connection strings? For instance:

There are two solutions. The first one is to use the parts of the connection string to differentiate as partition names:

The second one, allowing avoiding too complicated partition names, is to create federated servers to access the remote tables (if they do not already exist, else just use them). For instance the first one could be:

Similarly, “server_two” and “server_three” would be created and the final partition table would be created as:

It would be even simpler if all remote tables had the same name on the remote databases, for instance if they all were named xt1, the connection string could be set as “server_%s/xt1” and the partition names would be just “one”, “two”, and “three”.

Sharding on a Special Column

The technique we have seen above with file partitioning is also available with table partitioning. Companies willing to use as one table data sharded on the company branch servers can, as we have seen, add to the table create definition a special column. For instance:

This example assumes that federated servers had been created named “server_main”, “server_east” and “server_west” and that all remote tables are named “sales”. Note also that in this example, the column id is no more a key.

Current Partition Limitations

Because the partition engine was written before some other engines were added to MariaDB, the way it works is sometime incompatible with these engines, in particular with CONNECT.

Update statement

With the sample tables above, you can do update statements such as:

It works perfectly and is accepted by CONNECT. However, let us consider the statement:

This statement is not accepted by CONNECT. The reason is that the column id being part of the partition function, changing its value may require the modified row to be moved to another partition. The way it is done by the partition engine is to delete the old row and to re-insert the new modified one. However, this is done in a way that is not currently compatible with CONNECT (remember that CONNECT supports UPDATE in a specific way, in particular for the table type MYSQL) This limitation could be temporary. Meanwhile the workaround is to manually do what is done above,

Deleting the row to modify and inserting the modified row:

Alter Table statement

For all CONNECT outward tables, the ALTER TABLE statement does not make any change in the table data. This is why ALTER TABLE should not be used; in particular to modify the partition definition, except of course to correct a wrong definition. Note that using ALTER TABLE to create a partition table in two steps because column options would be lost is valid as it applies to a table that is not yet partitioned.

As we have seen, it is also safe to use it to create or drop indexes. Otherwise, a simple rule of thumb is to avoid altering a table definition and better drop and re-create a table whose definition must be modified. Just remember that for outward CONNECT tables, dropping a table does not erase the data and that creating it does not modify existing data.

Rowid special column

Each partition being handled separately as one table, the ROWID special column returns the rank of the row in its partition, not in the whole table. This means that for partition tables ROWID and ROWNUM are equivalent.

_{This page is licensed: CC BY-SA / Gnu FDL}

CONNECT JDBC Table Type: Accessing Tables from Another DBMS

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

The JDBC table type should be distributed with all recent versions of MariaDB. However, if the automatic compilation of it is possible after the java JDK was installed, the complete distribution of it is not fully implemented in older versions. The distributed JdbcInterface.jar file contains the JdbcInterface wrapper only. New versions distribute a JavaWrappers.jar that contains all currently existing wrappers.

This will require that:

The Java SDK is installed on your system.
The java wrapper class files are available on your system.
And of course, some JDBC drivers exist to be used with the matching DBMS.

Point 2 was made automatic in the newest versions of MariaDB.

Compiling From Source Distribution

Even when the Java JDK has been installed, CMake sometimes cannot find the location where it stands. For instance on Linux the Oracle Java JDK package might be installed in a path not known by the CMake lookup functions causing error message such as:

When this happen, provide a Java prefix as a hint on where the package was loaded. For instance on Ubuntu I was obliged to enter:

After that, the compilation of the CONNECT JDBC type was completed successfully.

Compiling the Java source files

They are the source of the java wrapper classes used to access JDBC drivers. In the source distribution, they are located in the CONNECT source directory.

The default wrapper, JdbcInterface, is the only one distributed with binary distribution. It uses the standard way to get a connection to the drivers via the DriverManager.getConnection method. Other wrappers, only available with source distribution, enable connection to a Data Source, eventually implementing pooling. However, they must be compiled and installed manually.

The available wrappers are:

Wrapper

Description

The wrapper used by default is specified by the session variable and is initially set to wrappers/JdbcInterface. The wrapper to use for a table can also be specified in the option list as a wrapper option of the “create table” statements.

Note: Conforming java naming usage, class names are preceded by the java package name with a slash separator. However, this is not mandatory for CONNECT which adds the package name if it is missing.

The JdbcInterface wrapper is always usable when Java is present on your machine. Binary distributions have this wrapper already compiled as a JdbcInterface.jar file installed in the plugin directory whose path is automatically included in the class path of the JVM. Recent versions also add a JavaWrappers.jar that contains all these wrappers. Therefore there is no need to worry about its path.

Compiling the ApacheInterface wrapper requires that the Apache common-DBCP2 package be installed. Other wrappers are to be used only with the matching JDBC drivers that must be available when compiling them.

Installing the jar file in the plugin directory is the best place because it is part of the class path. Depending on what is installed on your system, the source files can be reduced accordingly. To compile only the JdbcInterface.java file the CMAKE_JAVA_INCLUDE_PATH is not required. Here the paths are the ones existing on my Windows 7 machine and should be localized.

Setting the Required Information

Before any operation with a JDBC driver can be made, CONNECT must initialize the environment that will make working with Java possible. This will consist of:

Loading dynamically the JVM library module.
Creating the Java Virtual Machine.
Establishing contact with the java wrapper class.
Connecting to the used JDBC driver.

Indeed, the JVM library module is not statically linked to the CONNECT plugin. This is to make it possible to use a CONNECT plugin that has been compiled with the JDBC table type on a machine where the Java SDK is not installed. Otherwise, users not interested in the JDBC table type would be obliged to install the Java SDK on their machine to be able to load the CONNECT storage engine.

JVM Library Location

If the JVM library (jvm.dll on Windows, libjvm.so on Linux) was not placed in the standard library load path, CONNECT cannot find it and must be told where to search for it. This happens in particular on Linux when the Oracle Javapackage was installed in a private location.

If the JAVA_HOME variable was exported as explained above, CONNECT can sometimes find it using this information. Otherwise, its search path can be added to the LD_LIBRARY_PATH environment variable. But all this is complicated because making environment variables permanent on Linux is painful (many different methods must be used depending on the Linux version and the used shell).

This is why CONNECT introduced a new global variable connect_jvm_path to store this information. It can be set when starting the server as a command line option or even afterwards before the first use of the JDBC table type:

The client library is smaller and faster for connection. The server library is more optimized and can be used in case of heavy load usage.

Note that this may not be required on Windows because the path to the JVM library can sometimes be found in the registry.

Once this library is loaded, CONNECT can create the required Java Virtual Machine.

Java Class Path

This is the list of paths Java searches when loading classes. With CONNECT, the classes to load are the java wrapper classes used to communicate with the drivers , and the used JDBC driver classes that are grouped inside jar files. If the ApacheInterface wrapper must be used, the class path must also include all three jars used by the Apache package.

Caution: This class path is passed as a parameter to the Java Virtual Machine (JVM) when creating it and cannot be modified as it is a read only property. In addition, because MariaDB is a multi-threading application, this JVM cannot be destroyed and are used throughout the entire life of the MariaDB server. Therefore, be sure it is correctly set before you use the JDBC table type for the first time. Otherwise, there are practically no alternative than to shut down the server and restart it.

The path to the wrapper classes must point to the directory containing the wrappers sub-directory. If a JdbcInterface.jar file was made, its path is the directory where it is located followed by the jar file name. It is unclear where because this will depend on the installation process. If you start from a source distribution, it can be in the storage/connect directory where the CONNECT source files are or where you moved them or compiled the JdbcInterface.jar file.

For binary distributions, there is nothing to do because the jar file has been installed in the mysql share directory whose path is always automatically included in the class path available to the JVM.

Remaining are the paths of all the installed JDBC drivers that you intend to use. Remember that their path must include the jar file itself. Some applications use an environment variable CLASSPATH to contain them. Paths are separated by ‘:’ on Linux and by ‘;’ on Windows.

If the CLASSPATH variable actually exists and if it is available inside MariaDB, so far so good. You can check this using an UDF function provided by CONNECT that returns environment variable values:

Most of the time, this will return null or some required files are missing. This is why CONNECT introduced a global variable to store this information. The paths specified in this variable are added and have precedence to the ones, if any, of the CLASSPATH environment variable. As for the jvm path, this variable connect_class_path should be specified when starting the server but can also be set before using the JDBC table type for the first time.

The current directory (sql/data) is also placed by CONNECT at the beginning of the class path.

As an example, here is how I start MariaDB when doing tests on Linux:

CONNECT JDBC Tables

These tables are given the type JDBC. For instance, supposing you want to access the boys table located on and external local or remote database management system providing a JDBC connector:

To access this table via JDBC you can create a table such as:

The CONNECTION option is the URL used to establish the connection with the remote server. Its syntax depends on the external DBMS and in this example is the one used to connect as root to a MySQL or MariaDB local database using the MySQL JDBC connector.

As for ODBC, the columns definition can be omitted and are retrieved by the discovery process. The restrictions concerning column definitions are the same as for ODBC.

Note: The dbname indicated in the URL corresponds for many DBMS to the catalog information. For MySQL and MariaDB it is the schema (often called database) of the connection.

Using a Federated Server

Alternatively, a JDBC table can specify its connection options via a Federated server. For instance, supposing you have a table accessing an external Postgresql table defined as:

You can create a Federated server:

Now the JDBC table can be created by:

or by:

In any case, the location of the remote table can be changed in the Federated server without having to alter all the tables using this server.

JDBC needs a URL to establish a connection. CONNECT was able to construct that URL from the information contained in such Federated server definition when the URL syntax is similar to the one of MySQL, MariaDB or Postgresql. However, other DBMSs such as Oracle use a different URL syntax. In this case, simply replace the HOST information by the required URL in the Federated server definition. For instance:

Now you can create an Oracle table with something like this:

Note: Oracle, as Postgresql, does not seem to understand the DATABASE setting as the table schema that must be specified in the Create Table statement.

Connecting to a JDBC driver

When the connection to the driver is established by the JdbcInterface wrapper class, it uses the options that are provided when creating the CONNECT JDBC tables. Inside the default Java wrapper, the driver’s main class is loaded by the DriverManager.getConnection function that takes three arguments:

URL

User

Password

The URL varies depending on the connected DBMS. Refer to the documentation of the specific JDBC driver for a description of the syntax to use. User and password can also be specified in the option list.

Beware that the database name in the URL can be interpreted differently depending on the DBMS. For MySQL this is the schema in which the tables are found. However, for Postgresql, this is the catalog and the schema must be specified using the CONNECT dbname option.

For instance a table accessing a Postgresql table via JDBC can be created with a create statement such as:

Note: In previous versions of JDBC, to obtain a connection, java first had to initialize the JDBC driver by calling the method Class.forName. In this case, see the documentation of your DBMS driver to obtain the name of the class that implements the interface java.sql.Driver. This name can be specified as an option DRIVER to be put in the option list. However, most modern JDBC drivers since version 4 are self-loading and do not require this option to be specified.

The wrapper class also creates some required items and, in particular, a statement class. Some characteristics of this statement will depend on the options specified when creating the table:

Scrollable

Block_size

Fetch Size

The fetch size determines the number of rows that are internally retrieved by the driver on each interaction with the DBMS. Its default value depends on the JDBC driver. It is equal to 10 for some drivers but not for the MySQL or MariaDB connectors.

The MySQL/MariaDB connectors retrieve all the rows returned by one query and keep them in a memory cache. This is generally fine in most cases, but not when retrieving a large result set that can make the query fail with a memory exhausted exception.

To avoid this, when accessing a big table and expecting large result sets, you should specify the BLOCK_SIZE option to 1 (the only acceptable value). However a problem remains:

Suppose you execute a query such as:

Not knowing the limit clause, CONNECT sends to the remote DBMS the query:

In this query big can be a huge table having million rows. Having correctly specified the block size as 1 when creating the table, the wrapper just reads the 10 first rows and stops. However, when closing the statement, these MySQL/MariaDB drivers must still retrieve all the rows returned by the query. This is why, the wrapper class when closing the statement also cancels the query to stop that extra reading.

The bad news is that if it works all right for some previous versions of the MySQL driver, it does not work for new versions as well as for the MariaDB driver that apparently ignores the cancel command. The good news is that you can use an old MySQL driver to access MariaDB databases. It is also possible that this bug are fixed in future versions of the drivers.

Connection to a Data Source

This is the java preferred way to establish a connection because a data source can keep a pool of connections that can be re-used when necessary. This makes establishing connections much faster once it was done for the first time.

CONNECT provide additional wrappers whose files are located in the CONNECT source directory. The wrapper to use can be specified in the global variable connect_java_wrapper, which defaults to “JdbcInterface”.

It can also be specified for a table in the option list by setting the option wrapper to its name. For instance:

They can be used instead of the standard JdbcInterface and are using created data sources.

The Apache one uses data sources implemented by the Apache-commons-dbcp2 package and can be used with all drivers including those not implementing data sources. However, the Apache package must be installed and its three required jar files accessible via the class path.

commons-dbcp2-2.1.1.jar
commons-pool2-2.4.2.jar
commons-logging-1.2.jar

Note: the versions numbers can be different on your installation.

The other ones use data sources provided by the matching JDBC driver. There are currently four wrappers to be used with mysql-6.0.2, mariadb, oracle and postgresql.

Unlike the class path, the used wrapper can be changed even after the JVM machine was created.

Random Access to JDBC Tables

The same methods described for ODBC tables can be used with JDBC tables.

Note that in the case of the MySQL or MariaDB connectors, because they internally read the whole result set in memory, using the MEMORY option would be a waste of memory. It is much better to specify the use of a scrollable cursor when needed.

Other Operations with JDBC Tables

Except for the way the connection string is specified and the table type set to JDBC, all operations with ODBC tables are done for JDBC tables the same way. Refer to the ODBC chapter to know about:

Accessing specified views (SRCDEF)
Data modifying operations.
Sending commands to a data source.
JDBC catalog information.

Note: Some JDBC drivers fail when the global time_zone variable is ambiguous, which sometimes happens when it is set to SYSTEM. If so, reset it to a not ambiguous value, for instance:

JDBC Specific Restrictions

Connecting via data sources created externally (for instance using Tomcat) is not supported yet.

Other restrictions are the same as for the ODBC table type.

Handling the UUID Data Type

PostgreSQL has a native UUID data type, internally stored as BIN(16). This is neither an SQL nor a MariaDB data type. The best we can do is to handle it by its character representation.

UUID are translated to CHAR(36) when column definitions are set using discovery. Locally a PostgreSQL UUID column are handled like a CHAR or VARCHAR column. Example:

Using the PostgreSQL table testuuid in the text database:

Its column definitions can be queried by:

This query returns:

Table

Column

Type

Name

Size

Note: PostgreSQL, when a column size is undefined, returns 2147483647, which is not acceptable for MariaDB. CONNECT change it to the value of the connect_conv_size session variable. Also, for TEXT columns the data type returned is 12 (SQL_VARCHAR) instead of -1 the SQL_TEXT value.

Accessing this table via JDBC by:

it are created by discovery as:

Note: 8192 being here the connect_conv_size value.

Let's populate it:

Result:

msg

Here the id column values come from the DEFAULT of the PostgreSQL column that was specified as uuid_generate_v4().

It can be set from MariaDB. For instance:

Result:

msg

The first insert specifies a valid UUID character representation. The second one set it to NULL. The third one (a void string) generates a Java random UUID. UPDATE commands obey the same specification.

These commands both work:

However, this one fails:

Returning:

1296: Got error 174 'ExecuteQuery: org.postgresql.util.PSQLException: ERROR: operator does not exist: uuid ~ unknown hint: no operator corresponds to the data name and to the argument types.

because CONNECT cond_push feature added the WHERE clause to the query sent to PostgreSQL:

and the LIKE operator does not apply to UUID in PostgreSQL.

To handle this, a new session variable was added to CONNECT: connect_cond_push. It permits to specify if cond_push is enabled or not for CONNECT and defaults to 1 (enabled). In this case, you can execute:

Doing so, the where clause are executed by MariaDB only and the query will not fail anymore.

Executing the JDBC tests

Four tests exist but they are disabled because requiring some work to localized them according to the operating system and available java package and JDBC drivers and DBMS.

Two of them, jdbc.test and jdbc_new.test, are accessing MariaDB via JDBC drivers that are contained in a fat jar file that is part of the test. They should be executable without anything to do on Windows; simply adding the option –enable-disabled when running the tests.

However, on Linux these tests can fail to locate the JVM library. Before executing them, you should export the JAVA_HOME environment variable set to the prefix of the java installation or export the LD_LIBRARY_PATH containing the path to the JVM lib.

Fixing Problem With mariadb-dump

In some case or some platform, when CONNECT is set up for use with JDBC table types, this causes with the option --all-databases to fail.

This was reported by Robert Dyas who found the cause - see the discussion at .

_{This page is licensed: CC BY-SA / Gnu FDL}

CONNECT Data Types

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Many data types make no or little sense when applied to plain files. This why CONNECT supports only a restricted set of data types. However, ODBC, JDBC or MYSQL source tables may contain data types not supported by CONNECT. In this case, CONNECT makes an automatic conversion to a similar supported type when it is possible.

The data types currently supported by CONNECT are:

Type name

Description

Used for

TYPE_STRING

This type corresponds to what is generally known as or by database users, or as strings by programmers. Columns containing characters have a maximum length but the character string is of fixed or variable length depending on the file format.

The DATA_CHARSET option must be used to specify the character set used in the data source or file. Note that, unlike usually with MariaDB, when a multi-byte character set is used, the column size represents the number of bytes the column value can contain, not the number of characters.

TYPE_INT

The type contains signed integer numeric 4-byte values (the int of the C language) ranging from –2,147,483,648 to 2,147,483,647 for signed type and 0 to 4,294,967,295 for unsigned type.

TYPE_SHORT

The SHORT data type contains signed values (the short integer of the C language) ranging from –32,768 to 32,767 for signed type and 0 to 65,535 for unsigned type.

TYPE_TINY

The TINY data type contains values (the char of the C language) ranging from –128 to 127 for signed type and 0 to255 for unsigned type. For some table types, TYPE_TINY is used to represent Boolean values (0 is false, anything else is true).

TYPE_BIGINT

The data type contains signed integer 8-byte values (the long long of the C language) ranging from -9,223,372,036,854,775,808 to9,223,372,036,854,775,807 for signed type and from 0 to18,446,744,073,709,551,615 for unsigned type.

Inside tables, the coding of all integer values depends on the table type. In tables represented by text files, the number is written in characters, while in tables represented by binary files (BIN or VEC) the number is directly stored in the binary representation corresponding to the platform.

The length (or precision) specification corresponds to the length of the table field in which the value is stored for text files only. It is used to set the output field length for all table types.

TYPE_DOUBLE

The DOUBLE data type corresponds to the C language type, a floating-point double precision value coded with 8 bytes. Like for integers, the internal coding in tables depends on the table type, characters for text files, and platform binary representation for binary files.

The length specification corresponds to the length of the table field in which the value is stored for text files only. The scale (was_precision_) is the number of decimal digits written into text files. For binary table types (BIN and VEC) this does not apply. The length and_scale_ specifications are used to set the output field length and number of decimals for all types of tables.

TYPE_DECIM

The DECIMAL data type corresponds to what MariaDB or ODBC data sources call NUMBER, NUMERIC, or : a numeric value with a maximum number of digits (the precision) some of them eventually being decimal digits (the scale). The internal coding in CONNECT is a character representation of the number. For instance:

This defines a column colname as a number having a precision of 14 and a scale of 6. Supposing it is populated by:

The internal representation of it are the character string-2658.740000. The way it is stored in a file table depends on the table type. The length field specification corresponds to the length of the table field in which the value is stored and is calculated by CONNECT from the_precision_ and the scale values. This length is precision plus 1 if_scale_ is not 0 (for the decimal point) plus 1 if this column is not unsigned (for the eventual minus sign). In fix formatted tables the number is right justified in the field of width length, for variable formatted tables, such as CSV, the field is the representing character string.

Because this type is mainly used by CONNECT to handle numeric or decimal fields of ODBC, JDBC and MySQL table types, CONNECT does not provide decimal calculations or comparison by itself. This is why decimal columns of CONNECT tables cannot be indexed.

DATE Data type

Internally, date/time values are stored by CONNECT as a signed 4-byte integer. The value 0 corresponds to 01 January 1970 12:00:00 am coordinated universal time (). All other date/time values are represented by the number of seconds elapsed since or before midnight (00:00:00), 1 January 1970, to that date/time value. Date/time values before midnight 1 January 1970 are represented by a negative number of seconds.

CONNECT handles dates from 13 December 1901, 20:45:52 to18 January 2038, 19:14:07.

Although date and time information can be represented in both CHAR and INTEGER data types, the DATE data type has special associated properties. For each DATE value, CONNECT can store all or only some of the following information: century, year, month, day, hour, minute, and second.

Date Format in Text Tables

Internally, date/time values are handled as a signed 4-byte integer. But in text tables (type DOS, FIX, CSV, FMT, and DBF) dates are most of the time stored as a formatted character string (although they also can be stored as a numeric string representing their internal value). Because there are infinite ways to format a date, the format to use for decoding dates, as well as the field length in the file, must be associated to date columns (except when they are stored as the internal numeric value).

Note that this associated format is used only to describe the way the temporal value is stored internally. This format is used both for output to decode the date in a SELECT statement as well as for input to encode the date in INSERT or UPDATE statements. However, what is kept in this value depends on the data type used in the column definition (all the MariaDB temporal values can be specified). When creating a table, the format is associated to a date column using the DATE_FORMAT option in the column definition, for instance:

The SELECT query returns:

Name

Bday

Btime

The values of the INSERT statement must be specified using the standard MariaDB syntax and these values are displayed as MariaDB temporal values. Sure enough, the column formats apply only to the way these values are represented inside the CSV files. Here, the inserted record are:

Note: The field_length option exists because the MariaDB syntax does not allow specifying the field length between parentheses for temporal column types. If not specified, the field length is calculated from the date format (sometimes as a max value) or made equal to the default length value if there is no date format. In the above example it could have been removed as the calculated values are the ones specified. However, if the table type would have been DOS or FIX, these values could be adjusted to fit the actual field length within the file.

A CONNECT format string consists of a series of elements that represent a particular piece of information and define its format. The elements are recognized in the order they appear in the format string. Date and time format elements are replaced by the actual date and time as they appear in the source string. They are defined by the following groups of characters:

Element

Description

Usage Notes

To match the source string, you can add body text to the format string, enclosing it in single quotes or double quotes if it would be ambiguous. Punctuation marks do not need to be quoted.
The hour information is regarded as 12-hour format if a “t” or “tt” element follows the “hh” element in the format or as 24-hour format otherwise.
The "MM", "DD", "hh", "mm", "ss" elements can be specified with one or two letters (e.g. "MM" or "M") making no difference on input, but placing a leading zero to one-digit values on output [] for two-letter elements.
If the format contains elements DDD or DDDD, the day of week name is skipped on input and ignored to calculate the internal date value. On output, the correct day of week name is generated and displayed.

Handling dates that are out of the range of supported CONNECT dates

If you want to make a table containing, for instance, historical dates not being convertible into CONNECT dates, make your column CHAR or VARCHAR and store the dates in the MariaDB format. All date functions applied to these strings will convert them to MariaDB dates and will work as if they were real dates. Of course they must be inserted and are displayed using the MariaDB format.

NULL handling

CONNECT handles for data sources able to produce nulls. Currently this concerns mainly the , , MONGO, , , and table types. For INI, , MONGO or XML types, null values are returned when the key is missing in the section (INI) or when the corresponding node does not exist in a row (XML, JSON, MONGO).

For other file tables, the issue is to define what a null value is. In a numeric column, 0 can sometimes be a valid value but, in some other cases, it can make no sense. The same for character columns; is a blank field a valid value or not?

A special case is DATE columns with a DATE _FORMAT specified. Any value not matching the format can be regarded as NULL.

CONNECT leaves the decision to you. When declaring a column in the statement, if it is declared NOT NULL, blank or zero values are considered as valid values. Otherwise they are considered as NULL values. In all cases, nulls are replaced on insert or update by pseudo null values, a zero-length character string for text types or a zero value for numeric types. Once converted to pseudo null values, they are recognized as NULL only for columns declared as nullable.

For instance:

The select query replies:

Sure enough, the value 0 entered on the first row is regarded as NULL for a nullable column. However, if we execute the query:

This will return no line because a NULL is not equal to 0 in an SQL where clause.

Now let us see what happens with not null columns:

The insert statement will produce a warning saying:

Level

Code

Message

It is replaced by a pseudo null 0 on the fourth row. Let us see the result:

The first query returns no rows, 0 are valid values and not NULL. The second query replies:

It shows that the NULL inserted value was replaced by a valid 0 value.

Unsigned numeric types

They are supported by CONNECT since version 1.01.0010 for fixed numeric types (TINY, SHORT, INTEGER, and BITINT).

Data type conversion

CONNECT is able to convert data from one type to another in most cases. These conversions are done without warning even when this leads to truncation or loss of precision. This is true, in particular, for tables of type ODBC, JDBC, MYSQL and PROXY (via MySQL) because the source table may contain some data types not supported by CONNECT. They are converted when possible to CONNECT types.

When converted, MariaDB types are converted as:

MariaDB Types

CONNECT Type

Remark

For , the length of the column is the length of the longest value of the enumeration. For the length is enough to contain all the set values concatenated with comma separator.

In the case of columns, the handling depends on the values given to the and system variables.

Note: is currently not converted by default until a TYPE_BIN type is added to CONNECT. However, the FORCE option (from Connect 1.06.006) can be specified for blob columns containing text and the SKIP option also applies to ODBC BLOB columns.

ODBC SQL types are converted as:

SQL Types

Connect Type

Remark

JDBC SQL types are converted as:

JDBC Types

Connect Type

Remark

Note: The SKIP option also applies to ODBC and JDBC tables.

Here input and output are used to specify respectively decoding the date to get its numeric value from the data file and encoding a date to write it in the table file. Input is performed within queries; output is performed in or queries.

_{This page is licensed: GPLv2}

InnoDB Online DDL Operations with the INPLACE Alter Algorithm

Learn about operations supported by the INPLACE algorithm, which rebuilds the table but allows concurrent DML, offering a balance between performance and availability.

Supported Operations by Inheritance

When the ALGORITHM clause is set to INPLACE, the supported operations are a superset of the operations that are supported when the ALGORITHM clause is set to NOCOPY. Similarly, when the ALGORITHM clause is set to NOCOPY, the supported operations are a superset of the operations that are supported when the ALGORITHM clause is set to INSTANT.

Therefore, when the clause is set to INPLACE, some operations are supported by inheritance. See the following additional pages for more information about these supported operations:

Column Operations

`ALTER TABLE ... ADD COLUMN`

InnoDB supports adding columns to a table with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

With the exception of adding an column, this operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... DROP COLUMN`

InnoDB supports dropping columns from a table with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... MODIFY COLUMN`

This applies to for tables.

Reordering Columns

InnoDB supports reordering columns within a table with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

Changing the Data Type of a Column

InnoDB does not support modifying a column's data type with set to INPLACE in most cases. There are some exceptions:

In and later, InnoDB supports increasing the length of VARCHAR columns with set to INPLACE, unless it would require changing the number of bytes requires to represent the column's length. A VARCHAR column that is between 0 and 255 bytes in size requires 1 byte to represent its length, while a VARCHAR column that is 256 bytes or longer requires 2 bytes to represent its length. This means that the length of a column cannot be increased with set to INPLACE if the original length was less than 256 bytes, and the new length is 256 bytes or more.
In and later, InnoDB supports increasing the length of VARCHAR

See for more information.

For example, this fails:

But this succeeds in and later, because the original length of the column is less than 256 bytes, and the new length is still less than 256 bytes:

But this fails in and later, because the original length of the column is less than 256 bytes, and the new length is greater than 256 bytes:

Changing a Column to NULL

InnoDB supports modifying a column to allow values with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

Changing a Column to NOT NULL

InnoDB supports modifying a column to not allow values with set to INPLACE. It is required for to be enabled in . The operation will fail if the column contains any NULL values. Changes that would interfere with referential integrity are also not permitted.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

Adding a New `ENUM` Option

InnoDB supports adding a new option to a column with set to INPLACE. In order to add a new option with set to INPLACE, the following requirements must be met:

It must be added to the end of the list.
The storage requirements must not change.

This operation only changes the table's metadata, so the table does not have to be rebuilt..

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

Adding a New `SET` Option

InnoDB supports adding a new option to a column with set to INPLACE. In order to add a new option with set to INPLACE, the following requirements must be met:

It must be added to the end of the list.
The storage requirements must not change.

This operation only changes the table's metadata, so the table does not have to be rebuilt..

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

Removing System Versioning from a Column

In and later, InnoDB supports removing from a column with set to INPLACE. In order for this to work, the system variable must be set to KEEP. See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

`ALTER TABLE ... ALTER COLUMN`

This applies to for tables.

Setting a Column's Default Value

InnoDB supports modifying a column's value with set to INPLACE.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted. For example:

Removing a Column's Default Value

InnoDB supports removing a column's value with set to INPLACE.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

`ALTER TABLE ... CHANGE COLUMN`

InnoDB supports renaming a column with set to INPLACE, unless the column's data type or attributes changed in addition to the name.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

This applies to for tables.

Index Operations

`ALTER TABLE ... ADD PRIMARY KEY`

InnoDB supports adding a primary key to a table with set to INPLACE.

If the new primary key column is not defined as , then it is highly recommended for to be enabled in . Otherwise, NULL values are silently converted to the default value for the given data type, which is probably not the desired behavior in this scenario.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

And this fails:

This applies to for tables.

`ALTER TABLE ... DROP PRIMARY KEY`

InnoDB does not support dropping a primary key with set to INPLACE in most cases.

If you try to do so, then you will see an error. InnoDB only supports this operation with set to COPY. Concurrent DML is not permitted.

However, there is an exception. If you are dropping a primary key, and adding a new one at the same time, then that operation can be performed with set to INPLACE. This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this fails:

But this succeeds:

This applies to for tables.

`ALTER TABLE ... ADD INDEX` and `CREATE INDEX`

This applies to and for tables.

Adding a Plain Index

InnoDB supports adding a plain index to a table with set to INPLACE. The table is not rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

And this succeeds:

Adding a Fulltext Index

InnoDB supports adding a index to a table with set to INPLACE. The table is not rebuilt in some cases.

However, there are some limitations, such as:

Adding a index to a table that does not have a user-defined FTS_DOC_ID column will require the table to be rebuilt once. When the table is rebuilt, the system adds a hidden FTS_DOC_ID column. From that point forward, adding additional indexes to the same table will not require the table to be rebuilt when is set to INPLACE.
Only one index may be added at a time when is set to INPLACE.

This operation supports a read-only locking strategy. This strategy can be explicitly chosen by setting the clause to SHARED. When this strategy is used, read-only concurrent DML is permitted.

For example, this succeeds, but requires the table to be rebuilt, so that the hidden FTS_DOC_ID column can be added:

And this succeeds in the same way as above:

And this succeeds, and the second command does not require the table to be rebuilt:

But this second command fails, because only one index can be added at a time:

And this third command fails, because a table cannot be rebuilt when it has more than one index:

Adding a Spatial Index

InnoDB supports adding a index to a table with set to INPLACE.

However, there are some limitations, such as:

If a table has a index, then it cannot be rebuilt by any operations when the clause is set to NONE.

This operation supports a read-only locking strategy. This strategy can be explicitly chosen by setting the clause to SHARED. When this strategy is used, read-only concurrent DML is permitted.

For example, this succeeds:

And this succeeds in the same way as above:

`ALTER TABLE ... DROP INDEX` and `DROP INDEX`

InnoDB supports dropping indexes from a table with set to INPLACE.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

And this succeeds:

This applies to and for tables.

`ALTER TABLE ... ADD FOREIGN KEY`

InnoDB supports adding foreign key constraints to a table with set to INPLACE. In order to add a new foreign key constraint to a table with set to INPLACE, the system variable needs to be set to OFF. If it is set to ON, then ALGORITHM=COPY is required.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this fails:

But this succeeds:

This applies to for tables.

`ALTER TABLE ... DROP FOREIGN KEY`

InnoDB supports dropping foreign key constraints from a table with set to INPLACE.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

Table Operations

`ALTER TABLE ... AUTO_INCREMENT=...`

InnoDB supports changing a table's value with set to INPLACE. This operation should finish instantly. The table is not rebuilt.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... ROW_FORMAT=...`

InnoDB supports changing a table's with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... KEY_BLOCK_SIZE=...`

InnoDB supports changing a table's with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... PAGE_COMPRESSED=...` and `ALTER TABLE ... PAGE_COMPRESSION_LEVEL=...`

In and later, InnoDB supports setting a table's value to 1 with set to INPLACE. InnoDB also supports changing a table's value from 1 to 0 with set to INPLACE.

In these versions, InnoDB also supports changing a table's value with set to INPLACE.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

See for more information.

For example, this succeeds:

And this succeeds:

This applies to and for tables.

`ALTER TABLE ... DROP SYSTEM VERSIONING`

InnoDB supports dropping from a table with set to INPLACE.

This operation supports the read-only locking strategy. This strategy can be explicitly chosen by setting the clause to SHARED. When this strategy is used, read-only concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... DROP CONSTRAINT`

In and later, InnoDB supports dropping a constraint from a table with set to INPLACE. See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... FORCE`

InnoDB supports forcing a table rebuild with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... ENGINE=InnoDB`

InnoDB supports forcing a table rebuild with set to INPLACE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`OPTIMIZE TABLE ...`

InnoDB supports optimizing a table with set to INPLACE.

If the system variable is set to OFF, and if the system variable is also set to OFF, then OPTIMIZE TABLE are equivalent to ALTER TABLE … FORCE.

The table is rebuilt, which means that all of the data is reorganized substantially, and the indexes are rebuilt. As a result, the operation is quite expensive.

If either of the previously mentioned system variables is set to ON, then OPTIMIZE TABLE will optimize some data without rebuilding the table. However, the file size will not be reduced.

For example, this succeeds:

And this succeeds, but the table is not rebuilt:

This applies to for tables.

`ALTER TABLE ... RENAME TO` and `RENAME TABLE ...`

InnoDB supports renaming a table with set to INPLACE.

This operation only changes the table's metadata, so the table does not have to be rebuilt.

This operation supports the exclusive locking strategy. This strategy can be explicitly chosen by setting the clause to EXCLUSIVE. When this strategy is used, concurrent DML is not permitted.

For example, this succeeds:

And this succeeds:

This applies to and for tables.

Limitations

If a table has more than one index, then it cannot be rebuilt by any operations when is set to INPLACE.
If a table has a index, then it cannot be rebuilt by any operations when the clause is set to NONE.

If a table has a index, then it cannot be rebuilt by any operations when the clause is set to NONE.

do not currently support online DDL for all of the same operations that are supported for "real" columns.

See for more information on the limitations.

_{This page is licensed: CC BY-SA / Gnu FDL}

InnoDB Online DDL Operations with the INSTANT Alter Algorithm

Discover the INSTANT algorithm, which modifies table metadata without rebuilding the table, enabling extremely fast schema changes like adding columns.

Column Operations

`ALTER TABLE ... ADD COLUMN`

In and later, InnoDB supports adding columns to a table with set to INSTANT if the new column is the last column in the table. See for more information. If the table has a hidden FTS_DOC_ID column is present, then this is not supported.

In and later, InnoDB supports adding columns to a table with set to INSTANT, regardless of where in the column list the new column is added.

When this operation is performed with set to INSTANT, the tablespace file will have a non-canonical storage format. See for more information.

For example, this succeeds:

And this succeeds in and later:

This applies to for tables.

See for more information.

`ALTER TABLE ... DROP COLUMN`

In and later, InnoDB supports dropping columns from a table with set to INSTANT. See for more information.

When this operation is performed with set to INSTANT, the tablespace file will have a non-canonical storage format. See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... MODIFY COLUMN`

This applies to for tables.

Reordering Columns

In and later, InnoDB supports reordering columns within a table with set to INSTANT. See for more information.

When this operation is performed with set to INSTANT, the tablespace file will have a non-canonical storage format. See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

Changing the Data Type of a Column

InnoDB does not support modifying a column's data type with set to INSTANT in most cases. There are some exceptions:

InnoDB supports increasing the length of VARCHAR columns with set to INSTANT, unless it would require changing the number of bytes requires to represent the column's length. A VARCHAR column that is between 0 and 255 bytes in size requires 1 byte to represent its length, while a VARCHAR column that is 256 bytes or longer requires 2 bytes to represent its length. This means that the length of a column cannot be increased with set to INSTANT if the original length was less than 256 bytes, and the new length is 256 bytes or more.
In and later, InnoDB supports increasing the length of VARCHAR

The supported operations in this category support the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this fails:

But this succeeds because the original length of the column is less than 256 bytes, and the new length is still less than 256 bytes:

But this fails because the original length of the column is between 128 bytes and 255 bytes, and the new length is greater than 256 bytes:

But this succeeds in and later because the table has ROW_FORMAT=REDUNDANT:

And this succeeds in and later because the table has ROW_FORMAT=DYNAMIC and the column's original length is 127 bytes or less:

And this succeeds in and later because the table has ROW_FORMAT=COMPRESSED and the column's original length is 127 bytes or less:

But this fails even in and later because the table has ROW_FORMAT=DYNAMIC and the column's original length is between 128 bytes and 255 bytes:

Changing a Column to NULL

In and later, InnoDB supports modifying a column to allow values with set to INSTANT if the table option is set to . See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

Changing a Column to NOT NULL

InnoDB does not support modifying a column to not allow values with set to INSTANT.

For example:

Adding a New `ENUM` Option

InnoDB supports adding a new option to a column with set to INSTANT. In order to add a new option with set to INSTANT, the following requirements must be met:

It must be added to the end of the list.
The storage requirements must not change.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

Adding a New `SET` Option

InnoDB supports adding a new option to a column with set to INSTANT. In order to add a new option with set to INSTANT, the following requirements must be met:

It must be added to the end of the list.
The storage requirements must not change.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

Removing System Versioning from a Column

In and later, InnoDB supports removing from a column with set to INSTANT. In order for this to work, the system variable must be set to KEEP. See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

`ALTER TABLE ... ALTER COLUMN`

This applies to for tables.

Setting a Column's Default Value

InnoDB supports modifying a column's value with set to INSTANT.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

Removing a Column's Default Value

InnoDB supports removing a column's value with set to INSTANT.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

`ALTER TABLE ... CHANGE COLUMN`

InnoDB supports renaming a column with set to INSTANT, unless the column's data type or attributes changed in addition to the name.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example, this succeeds:

But this fails:

This applies to for tables.

Index Operations

`ALTER TABLE ... ADD PRIMARY KEY`

InnoDB does not support adding a primary key to a table with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... DROP PRIMARY KEY`

InnoDB does not support dropping a primary key with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... ADD INDEX` and `CREATE INDEX`

This applies to and for tables.

Adding a Plain Index

InnoDB does not support adding a plain index to a table with set to INSTANT.

For example, this fails:

And this fails:

Adding a Fulltext Index

InnoDB does not support adding a index to a table with set to INSTANT.

For example, this fails:

And this fails:

Adding a Spatial Index

InnoDB does not support adding a index to a table with set to INSTANT.

For example, this fails:

And this fails:

`ALTER TABLE ... ADD FOREIGN KEY`

InnoDB does not support adding foreign key constraints to a table with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... DROP FOREIGN KEY`

InnoDB supports dropping foreign key constraints from a table with set to INSTANT.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

Table Operations

`ALTER TABLE ... AUTO_INCREMENT=...`

InnoDB supports changing a table's value with set to INSTANT.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... ROW_FORMAT=...`

InnoDB does not support changing a table's with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... KEY_BLOCK_SIZE=...`

InnoDB does not support changing a table's with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... PAGE_COMPRESSED=1` and `ALTER TABLE ... PAGE_COMPRESSION_LEVEL=...`

In and later, InnoDB supports setting a table's value to 1 with set to INSTANT. InnoDB does not support changing a table's value from 1 to 0 with set to INSTANT.

In these versions, InnoDB also supports changing a table's value with set to INSTANT.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

See for more information.

For example, this succeeds:

And this succeeds:

But this fails:

This applies to and for tables.

`ALTER TABLE ... DROP SYSTEM VERSIONING`

InnoDB does not support dropping from a table with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... DROP CONSTRAINT`

In and later, InnoDB supports dropping a constraint from a table with set to INSTANT. See for more information.

This operation supports the non-locking strategy. This strategy can be explicitly chosen by setting the clause to NONE. When this strategy is used, all concurrent DML is permitted.

For example:

This applies to for tables.

`ALTER TABLE ... FORCE`

InnoDB does not support forcing a table rebuild with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... ENGINE=InnoDB`

InnoDB does not support forcing a table rebuild with set to INSTANT.

For example:

This applies to for tables.

`OPTIMIZE TABLE ...`

InnoDB does not support optimizing a table with set to INSTANT.

For example:

This applies to for tables.

`ALTER TABLE ... RENAME TO` and `RENAME TABLE ...`

InnoDB supports renaming a table with set to INSTANT.

This operation supports the exclusive locking strategy. This strategy can be explicitly chosen by setting the clause to EXCLUSIVE. When this strategy is used, concurrent DML is not permitted.

For example, this succeeds:

And this succeeds:

This applies to and for tables.

Limitations

do not currently support online DDL for all of the same operations that are supported for "real" columns.

See for more information on the limitations.

Non-canonical Storage Format Caused by Some Operations

Some operations cause a table's tablespace file to use a non-canonical storage format when the INSTANT algorithm is used. The affected operations include:

These operations require the following non-canonical changes to the storage format:

A hidden metadata record at the start of the clustered index is used to store each column's value. This makes it possible to add new columns that have default values without rebuilding the table.
A in the hidden metadata record is used to store column mappings. This makes it possible to drop or reorder columns without rebuilding the table. This also makes it possible to add columns to any position or drop columns from any position in the table without rebuilding the table.
If a column is dropped, old records will contain garbage in that column's former position, and new records are written with values, empty strings, or dummy values.

This non-canonical storage format has the potential to incur some performance or storage overhead for all subsequent DML operations. If you notice some issues like this and you want to normalize a table's storage format to avoid this problem, then you can do so by forcing a table rebuild by executing with set to INPLACE:

However, keep in mind that there are certain scenarios where you may not be able to rebuild the table with set to INPLACE. See for more information on those cases. If you hit one of those scenarios, but you still want to rebuild the table, then you would have to do so with set to COPY.

Known Bugs

There are some known bugs that could lead to issues when an InnoDB DDL operation is performed using the algorithm. This algorithm will usually be chosen by default if the operation supports the algorithm.

The effect of many of these bugs is that the table seems to forget that its tablespace file is in the .

If you are concerned that a table may be affected by one of these bugs, then your best option would be to normalize the table structure. This can be done by rebuilding the table:

If you are concerned about these bugs, and you want to perform an operation that supports the algorithm, but you want to avoid using that algorithm, then you can set the algorithm to and add the FORCE keyword to the statement:

Closed Bugs

: This bug could cause a table to become corrupt if a column was added instantly. It is fixed in and .
: This bug could cause a table to become corrupt if a column was dropped instantly. It is fixed in .
: This bug could cause a table to become corrupt during page reorganization if a column was added instantly. It is fixed in and .

_{This page is licensed: CC BY-SA / Gnu FDL}

CONNECT XML Table Type

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Overview

CONNECT supports tables represented by XML files. For these tables, the standard input/output functions of the operating system are not used but the parsing and processing of the file is delegated to a specialized library. Currently two such systems are supported: libxml2, a part of the GNOME framework, but which does not require GNOME and, on Windows, MS-DOM (DOMDOC), the Microsoft standard support of XML documents.

DOMDOC is the default for the Windows version of CONNECT and libxml2 is always used on other systems. On Windows the choice can be specified using the XMLSUP list option, for instance specifyingoption_list='xmlsup=libxml2'.

Creating XML tables

First of all, it must be understood that XML is a very general language used to encode data having any structure. In particular, the tag hierarchy in an XML file describes a tree structure of the data. For instance, consider the file:

It represents data having the structure:

This structure seems at first view far from being tabular. However, modern database management systems, including MariaDB, implement something close to the relational model and work on tables that are structurally not hierarchical but tabular with rows and columns.

Nevertheless, CONNECT can do it. Of course, it cannot guess what you want to extract from the XML structure, but gives you the possibility to specify it when you create the table[].

Let us take a first example. Suppose you want to make a table from the above document, displaying the node contents.

For this, you can define a table xsamptag as:

It are displayed as:

AUTHOR

TITLE

TRANSLATOR

PUBLISHER

DATEPUB

Let us try to understand what happened. By default the column names correspond to tag names. Because this file is rather simple, CONNECT was able to default the top tag of the table as the root node <BIBLIO> of the file, and the row tags as the <BOOK> children of the table tag. In a more complex file, this should have been specified, as we will see later. Note that we didn't have to worry about the sub-tags such as <FIRSTNAME> or <LASTNAME> because CONNECT automatically retrieves the entire text contained in a tag and its sub-tags[].

Only the first author of the first book appears. This is because only the first occurrence of a column tag has been retrieved so the result has a proper tabular structure. We will see later what we can do about that.

How can we retrieve the values specified by attributes? By using a Coltype table option to specify the default column type. The value ‘@’ means that column names match attribute names. Therefore, we can retrieve them by creating a table such as:

This table returns the following:

ISBN

LANG

SUBJECT

Now to define a table that will give us all the previous information, we must specify the column type for each column. Because in the next statement the column type defaults to Node, the field_format column parameter was used to indicate which columns are attributes:

From Connect 1.7.0002

Before Connect 1.7.0002

Once done, we can enter the query:

This will return the following result:

SUBJECT

LANG

TITLE

AUTHOR

Note that we have been lucky. Because unlike SQL, XML is case sensitive and the column names have matched the node names only because the column names were given in upper case. Note also that the order of the columns in the table could have been different from the order in which the nodes appear in the XML file.

Using Xpaths with XML tables

Xpath is used by XML to locate and retrieve nodes. The table's main node Xpath is specified by the tabname option. If just the node name is given, CONNECT constructs an Xpath such as ‘BIBLIO’ in the example above that should retrieve the BIBLIO node wherever it is within the XML file.

The row nodes are by default the children of the table node. However, for instance to eliminate some children nodes that are not real row nodes, the row node name can be specified using the rownode sub-option of the option_list option.

The field_format options we used above can be specified to locate more precisely where and what information to retrieve using an Xpath-like syntax. For instance:

From Connect 1.7.0002

Before Connect 1.7.0002

This very flexible column parameter serves several purposes:

To specify the tag name, or the attribute name if different from the column name.
To specify the type (tag or attribute) by a prefix of '@' for attributes.
To specify the path for sub-tags using the '/' character.

This path is always relative to the current context (the column top node) and cannot be specified as an absolute path from the document root, therefore a leading '/' cannot be used. The path cannot be variable in node names or depth, therefore using '//' is not allowed.

The query:

replies:

ISBN

TITLE

TRANSLATED

TRANFN

TRANLN

LOCATION

Libxml2 default name space issue

An issue with libxml2 is that some files can declare a default name space in their root node. Because Xpath only searches in that name space, the nodes will not be found if they are not prefixed. If this happens, specify the tabname option as an Xpath ignoring the current name space:

This must also be done for the default of specified Xpath of the not attribute columns. For instance:

Note: This raises an error (and is useless anyway) with DOMDOC.

Direct access on XML tables

Direct access is available on XML tables. This means that XML tables can be sorted and used in joins, even in the one-side of the join.

However, building a permanent index is not yet implemented. It is unclear whether this can be useful. Indeed, the DOM implementation that is used to access these tables firstly parses the whole file and constructs a node tree in memory. This may often be the longest part of the process, so the use of an index would not be of great value. Note also that this limits the XML files to a reasonable size. Anyway, when speed is important, this table type is not the best to use. Therefore, in these cases, it is probably better to convert the file to another type by inserting the XML table into another table of a more appropriate type for performance.

Accessing tags with namespaces

With the Windows DOMDOC support, this can be done using the prefix in the tabname column option and/or xpath column option. For instance, given the file gns.xml:

and the defined CONNECT table:

Displays:

lon

lat

ele

time

Only the prefixed ‘ele’ tag is recognized.

However, this does not work with the libxml2 support. The solution is then to use a function ignoring the name space:

Then :

Displays:

lon

lat

ele

time

This time, all ‘ele` tags are recognized. This solution does not work with DOMDOC.

Having Columns defined by Discovery

It is possible to let the MariaDB discovery process do the job of column specification. When columns are not defined in the statement, CONNECT endeavours to analyze the XML file and to provide the column specifications. This is possible only for true XML tables, but not for HTML tables.

For instance, the xsamp table could have been created specifying:

Let’s check how it was actually specified using the SHOW CREATE TABLE statement:

It is equivalent except for the column sizes that have been calculated from the file as the maximum length of the corresponding column when it was a normal value. Also, all columns are specified as type because XML does not provide information about the node content data type. Nullable is set to true if the column is missing in some rows.

If a more complex definition is desired, you can ask CONNECT to analyse the XPATH up to a given level using the level option in the option list. The level value is the number of nodes that are taken in the XPATH. For instance:

This will define the table as:

From Connect 1.7.0002

Then if we ask:

Everything seems correct when we get the result:

SUBJECT

AUTHOR

TITLE

TRANSLATOR

PUBLISHER

However if we enter the apparently equivalent query on the xsampall table, based on the same file:

this returns an apparently wrong answer:

SUBJECT

AUTHOR

TITLE

TRANSLATOR

PUBLISHER

What happened here? Simply, because we used the xsamp table to do the Insert, what has been inserted within the XML file had the structure described for xsamp:

CONNECT cannot "invent" sub-tags that are not part of the xsamp table. Because these sub-tags do not exist, the xsampall table cannot retrieve the information that should be attached to them. If we want to be able to query the XML file by all the defined tables, the correct way to insert a new book to the file is to use the xsampall table, the only one that addresses all the components of the original document:

Now the added book, in the XML file, will have the required structure:

Note: We used a column list in the Insert statements when creating the table to avoid generating a <TRANSLATOR> node with sub-nodes, all containing null values (this works on Windows only).

Multiple nodes in the XML document

Let us come back to the above example XML file. We have seen that the author node can be "multiple" meaning that there can be more than one author of a book. What can we do to get the complete information fitting the relational model? CONNECT provides you with two possibilities, but is restricted to only one such multiple node per table.

The first and most challenging one is to return as many rows than there are authors, the other columns being repeated as if we had make a join between the author column and the rest of the table. To achieve this, simply specify the “multiple” node name and the “expand” option when creating the table. For instance, we can create the xsamp2 table like this:

In this statement, the Limit option specifies the maximum number of values that are expanded. If not specified, it defaults to 10. Any values above the limit are ignored and a warning message issued[]. Now you can enter a query such as:

This will retrieve and display the following result:

ISBN

SUBJECT

AUTHOR

TITLE

In this case, this is as if the table had four rows. However if we enter the query:

this time the result are:

ISBN

SUBJECT

TITLE

PUBLISHER

Because the author column does not appear in the query, the corresponding row was not expanded. This is somewhat strange because this would have been different if we had been working on a table of a different type. However, it is closer to the relational model for which there should not be two identical rows (tuples) in a table. Nevertheless, you should be aware of this somewhat erratic behavior. For instance:

This last query replies:

ISBN

SUBJECT

TITLE

PUBLISHER

Even though the author column does not appear in the result, the corresponding row was expanded because the multiple column was used in the where clause.

Intermediate multiple node

The "multiple" node can be an intermediate node. If we want to do the same expanding with the xsampall table, there are nothing more to do. The_xsampall2_ table can be created with:

From Connect 1.7.0002

Before Connect 1.7.0002

The only difference is that the "multiple" node is an intermediate node in the path. The resulting table can be seen with a query such as:

This query displays:

SUBJECT

LANG

TITLE

FIRST

LAST

YEAR

These composite tables, half array half tree, reserve some surprises for us when updating, deleting from or inserting into them. Insert just cannot generate this structure; if two rows are inserted with just a different author, two book nodes are generated in the XML file. Delete always deletes one book node and all its children nodes even if specified against only one author. Update is more complicated:

After these three updates, the first two responding "Affected rows: 1" and the last one responding "Affected rows: 2", the last query answers:

subject

lang

title

first

last

year

What must be understood here is that the Update modifies node values in the XML file, not cell values in the relational table. The first update worked normally. The second update changed the year value of the book and this shows for the two expanded rows because there is only one DATEPUB node for that book. Because the third update applies to a row having a certain date value, both author names were updated.

Making a List of Multiple Values

Another way to see multiple values is to ask CONNECT to make a comma separated list of the multiple node values. This time, it can only be done if the "multiple" node is not intermediate. For example, we can modify the xsamp2 table definition by:

This time 'Expand' is not specified, and Limit gives the maximum number of items in the list. Now if we enter the query:

We will get the following result:

ISBN

SUBJECT

AUTHOR(S)

TITLE

Note that updating the "multiple" column is not possible because CONNECT does not know which of the nodes to update.

This could not have been done with the xsampall2 table because the author node is intermediate in the path, and making two lists, one of first names and another one of last names would not make sense anyway.

What if a table contains several multiple nodes

This can be handled by creating several tables on the same file, each containing only one multiple node and constructing the desired result using joins.

Support of HTML Tables

Most tables included in HTML documents cannot be processed by CONNECT because the HTML language is often not compatible with the syntax of XML. In particular, XML requires all open tags to be matched by a closing tag while it is sometimes optional in HTML. This is often the case concerning column tags.

However, you can meet tables that respect the XML syntax but have some of the features of HTML tables. For instance:

Here the different column tags are included in <td></td> tags as for HTML tables. You cannot just add this tag in the Xpath of the columns, because the search is done on the first occurrence of each tag, and this would cause this search to fail for all columns except the first one. This case is handled by specifying the Colnode table option that gives the name of these column tags, for example:

From Connect 1.7.0002

Before Connect 1.7.0002

The table are displayed as:

Name

Origin

Description

However, you can deal with tables even closer to the HTML model. For example the coffee.htm file:

Here column values are directly represented by the TD tag text. You cannot declare them as tags nor as attributes. In addition, they are not located using their name but by their position within the row. Here is how to declare such a table to CONNECT:

You specify the fact that columns are located by position by setting the_Coltype_ option to 'HTML'. Each column position (0 based) are the value of the flag column parameter that is set by default in sequence. Now we are able to display the table:

Name

Cups

Type

Sugar

Note 1: We specified 'header=n' in the create statement to indicate that the first n rows of the table are not data rows and should be skipped.

Note 2: In this last example, we did not specify the node names using the Rownode and Colnode options because when Coltype is set to 'HTML' they default to 'Rownode=TR' and 'Colnode=TD'.

Note 3: The Coltype option is a word only the first character of which is significant. Recognized values are:

New file setting

Some create options are used only when creating a table on a new file, i. e. when inserting into a file that does not exist yet. When specified, the 'Header' option will create a header row with the name of the table columns. This is chiefly useful for HTML tables to be displayed on a web browser.

Some new list-options are used in this context:

Let us see for instance, the following create statement:

Supposing the table file does not exist yet, the first insert into that table, for instance by the following statement:

will generate the following file:

This file can be used to display the table on a web browser (encoding should beISO-8859-x)

handler

version

author

description

maturity

Note: The XML document encoding is generally specified in the XML header node and can be different from the DATA_CHARSET, which is always UTF-8 for XML tables. Therefore the table DATA_CHARSET character set should be unspecified, or specified as UTF8. The Encoding specification is useful only for new XML files and ignored for existing files having their encoding already specified in the header node.

Notes

CONNECT does not claim to be able to deal with any XML document. Besides, those that can usefully be processed for data analysis are likely to have a structure that can easily be transformed into a table.
With libxml2, sub tags text can be separated by 0 or several blanks depending on the structure and indentation of the data file.
This may cause some rows to be lost because an eventual where clause on the “multiple” column is applied only on the limited number of retrieved rows.

_{This page is licensed: CC BY-SA / Gnu FDL}

CONNECT JSON Table Type

The CONNECT storage engine has been deprecated.

This storage engine has been deprecated.

Overview

JSON (JavaScript Object Notation) is a lightweight data-interchange format widely used on the Internet. Many applications, generally written in JavaScript or PHP use and produce JSON data, which are exchanged as files of different physical formats. JSON data is often returned from REST queries.

It is also possible to query, create or update such information in a database-like manner. MongoDB does it using a JavaScript-like language. PostgreSQL includes these facilities by using a specific data type and related functions like dynamic columns.

The CONNECT engine adds this facility to MariaDB by supporting tables based on JSON data files. This is done like for XML tables by creating tables describing what should be retrieved from the file and how it should be processed.

Starting with 1.07.0002, the internal way JSON was parsed and handled was changed. The main advantage of the new way is to reduce the memory required to parse JSON. It was from 6 to 10 times the size of the JSON source and is now only 2 to 4 times. However, this is in Beta mode and JSON tables are still handled using the old mode. To use the new mode, tables should be created with TABLE_TYPE=BSON. Another way is the set the session variable to 1 or ON. Then all JSON tables are handled as BSON. Of course, this is temporary and when successfully tested, the new way will replace the old way and all tables be created as JSON.

Let us start from the file “biblio3.json” that is the JSON equivalent of the XML Xsample file described in the XML table chapter:

This file contains the different items existing in JSON.

Arrays: They are enclosed in square brackets and contain a list of comma separated values.
Objects: They are enclosed in curly brackets. They contain a comma separated list of pairs, each pair composed of a key name between double quotes, followed by a ‘:’ character and followed by a value.
Values: Values can be an array or an object. They also can be a string between double quotes, an integer or float number, a Boolean value or a null value. The simplest way for CONNECT to locate a table in such a file is by an array containing a list of objects (this is what MongoDB calls a collection of documents). Each array value are a table row and each pair of the row objects will represent a column, the key being the column name and the value the column value.

A first try to create a table on this file are to take the outer array as the table:

If we execute the query:

We get the result:

isbn

author

title

publisher

Note that by default, column values that are objects have been set to the concatenation of all the string values of the object separated by a blank. When a column value is an array, only the first item of the array is retrieved (This will change in later versions of Connect).

However, things are generally more complicated. If JSON files do not contain attributes (although object pairs are similar to attributes) they contain a new item, arrays. We have seen that they can be used like XML multiple nodes, here to specify several authors, but they are more general because they can contain objects of different types, even it may not be advisable to do so.

This is why CONNECT enables the specification of a column field_format option “JPATH” (FIELD_FORMAT until Connect 1.6) that is used to describe exactly where the items to display are and how to handles arrays.

Here is an example of a new table that can be created on the same file, allowing choosing the column names, to get some sub-objects and to specify how to handle the author array.

Until Connect 1.5:

From Connect 1.6:

From Connect 1.07.0002

Given the query:

The result is:

title

author

publisher

location

Note: The JPATH was not specified for column ISBN because it defaults to the column name.

Here is another example showing that one can choose what to extract from the file and how to “expand” an array, meaning to generate one row for each array value:

Until Connect 1.5:

From Connect 1.6:

From Connect 1.06.006:

From Connect 1.07.0002

It is displayed as:

ISBN

Title

AuthorFN

AuthorLN

Year

Note: The example above shows that the ‘$.’, that means the beginning of the path, can be omitted.

The Jpath Specification

From Connect 1.6, the Jpath specification has changed to be the one of the native JSON functions and more compatible with what is generally used. It is close to the standard definition and compatible to what MongoDB and other products do. The ‘:’ separator is replaced by ‘.’. Position in array is accepted MongoDB style with no square brackets. Array specification specific to CONNECT are still accepted but [*] is used for expanding and [x] for multiply. However, tables created with the previous syntax can still be used by adding SEP_CHAR=’:’ (can be done with alter table). Also, it can be now specified as JPATH (was FIELD_FORMAT) but FIELD_FORMAT is still accepted.

Until Connect 1.5, it is the description of the path to follow to reach the required item. Each step is the key name (case sensitive) of the pair when crossing an object, and the number of the value between square brackets when crossing an array. Each specification is separated by a ‘:’ character.

From Connect 1.6, It is the description of the path to follow to reach the required item. Each step is the key name (case sensitive) of the pair when crossing an object, and the position number of the value when crossing an array. Key specifications are separated by a ‘.’ character.

For instance, in the above file, the last name of the second author of a book is reached by:

$.AUTHOR[1].LASTNAME standard style &#xNAN;$AUTHOR.1.LASTNAME MongoDB style AUTHOR:[1]:LASTNAME old style when SEP_CHAR=’:’ or until Connect 1.5

The ‘$’ or “$.” prefix specifies the root of the path and can be omitted with CONNECT.

The array specification can also indicate how it must be processed:

For instance, in the above file, the last name of the second author of a book is reached by:

The array specification can also indicate how it must be processed:

Specification

Array Type

Limit

Description

Note 1: When the LIMIT restriction is applicable, only the first m array items are used, m being the value of the LIMIT option (to be specified in option_list). The LIMIT default value is 10.

Note 2: An alternative way to indicate what is to be expanded is to use the expand option in the option list, for instance:

AUTHOR is here the key of the pair that has the array as a value (case sensitive). Expand is limited to only one branch (expanded arrays must be under the same object).

Let us take as an example the file expense.json (). The table jexpall expands all under and including the week array:

From Connect 1.07.0002

From Connect.1.6

Until Connect 1.5:

WHO

WEEK

WHAT

AMOUNT

The table jexpw shows what was bought and the sum and average of amounts for each person and week:

From Connect 1.07.0002

From Connect 1.6:

Until Connect 1.5:

WHO

WEEK

WHAT

SUM

AVERAGE

Let us see what the table jexpz does:

From Connect 1.6:

From Connect 1.07.0002

Until Connect 1.5:

WHO

WEEKS

SUMS

SUM

AVGS

SUMAVG

AVGSUM

AVERAGE

For all persons:

Column 1 show the person name.
Column 2 shows the weeks for which values are calculated.
Column 3 lists the sums of expenses for each week.
Column 4 calculates the sum of all expenses by person.

It would be very difficult, if even possible, to obtain this result from table jexpall using an SQL query.

Handling of NULL Values

Json has a null explicit value that can be met in arrays or object key values. When regarding json as a relational table, a column value can be null because the corresponding json item is explicitly null, or implicitly because the corresponding item is missing in an array or object. CONNECT does not make any distinction between explicit and implicit nulls.

However, it is possible to specify how nulls are handled and represented. This is done by setting the string session variable . The default value of connect_json_null is “”; it can be changed, for instance, by:

This changes its representation when a column displays the text of an object or the concatenation of the values of an array.

It is also possible to tell CONNECT to ignore nulls by:

When doing so, nulls do not appear in object text or array lists. However, this does not change the behavior of array calculation nor the result of array count.

Having Columns defined by Discovery

It is possible to let the MariaDB discovery process do the job of column specification. When columns are not defined in the create table statement, CONNECT endeavors to analyze the JSON file and to provide the column specifications. This is possible only for tables represented by an array of objects because CONNECT retrieves the column names from the object pair keys and their definition from the object pair values. For instance, the jsample table could be created saying:

Let’s check how it was actually specified using the show create table statement:

It is equivalent except for the column sizes that have been calculated from the file as the maximum length of the corresponding column when it was a normal value. For columns that are json arrays or objects, the column is specified as a varchar string of length 256, supposedly big enough to contain the sub-object's concatenated values. Nullable is set to true if the column is null or missing in some rows or if its JPATH contains arrays.

If a more complex definition is desired, you can ask CONNECT to analyse the JPATH up to a given depth using the DEPTH or LEVEL option in the option list. Its default value is 0 but can be changed setting the session variable (in future versions the default are 5). The depth value is the number of sub-objects that are taken in the JPATH2 (this is different from what is defined and returned by the native function).

For instance:

This will define the table as:

From Connect 1.07.0002

From Connect 1.6:

Until Connect 1.5:

For columns that are a simple value, the Json path is the column name. This is the default when the Jpath option is not specified, so it was not specified for such columns. However, you can force discovery to specify it by setting the connect_all_path variable to 1 or ON. This can be useful if you plan to change the name of such columns and relieves you of manually specifying the path (otherwise it would default to the new name and cause the column to not or wrongly be found).

Another problem is that CONNECT cannot guess what you want to do with arrays. Here the AUTHOR array is set to 0, which means that only its first value are retrieved unless you also had specified “Expand=AUTHOR” in the option list. But of course, you can replace it with anything else.

This method can be used as a quick way to make a “template” table definition that can later be edited to make the desired definition. In particular, column names are constructed from all the object keys of their path in order to have distinct column names. This can be manually edited to have the desired names, provided their JPATH key names are not modified.

DEPTH can also be given the value -1 to create only columns that are simple values (no array or object). It normally defaults to 0 but this can be modified setting the variable.

Note: Since version 1.6.4, CONNECT eliminates columns that are “void” or whose type cannot be determined. For instance given the file sresto.json:

Previously, when using discovery, creating the table by:

The table was previously created as:

The column “grades” was added because of the void array in line 2. Now this column is skipped and does not appear anymore (unless the option Accept=1 is added in the option list).

JSON Catalogue Tables

Another way to see JSON table column specifications is to use a catalogue table. For instance:

which returns:

From Connect 1.07.0002:

column_name

type

size

jpath

From Connect 1.6:

column_name

type

size

jpath

Until Connect 1.5:

column_name

type

size

jpath

All this is mostly useful when creating a table on a remote file that you cannot easily see.

Finding the table within a JSON file

Given the file “facebook.json”:

The table we want to analyze is represented by the array value of the “data” object. Here is how this is specified in the create table statement:

From Connect 1.07.0002:

From Connect 1.6:

Until Connect 1.5:

This is the object option that gives the Jpath of the table. Note also an alternate way to declare the array to be expanded by the expand option of the option_list.

Because some string values contain a date representation, the corresponding columns are declared as datetime and the date format is specified for them.

The Jpath of the object option has the same syntax as the column Jpath but of course all array steps must be specified using the [n] (until Connect 1.5) or n (from Connect 1.6) format.

Note: This applies to the whole document for tables having PRETTY = 2 (see below). Otherwise, it applies to the document objects of each file records.

JSON File Formats

The examples we have seen so far are files that, even they can be formatted in different ways (blanks, tabs, carriage return and line feed are ignored when parsing them), respect the JSON syntax and are made of only one item (Object or Array). Like for XML files, they are entirely parsed and a memory representation is made used to process them. This implies that they are of reasonable size to avoid an out of memory condition. Tables based on such files are recognized by the option Pretty=2 that we did not specify above because this is the default.

An alternate format, which is the format of exported MongoDB files, is a file where each row is physically stored in one file record. For instance:

The original file, “cities.json”, has 29352 records. To base a table on this file we must specify the option Pretty=0 in the option list. For instance:

From Connect 1.07.0002:

From Connect 1.6:

Until Connect 1.5:

Note the use of [n] (until Connect 1.5) or n (from Connect 1.6) array specifications for the longitude and latitude columns.

When using this format, the table is processed by CONNECT like a DOS, CSV or FMT table. Rows are retrieved and parsed by records and the table can be very large. Another advantage is that such a table can be indexed, which can be of great value for very large tables. The “distrib” option of the “state” column tells CONNECT to use block indexing when possible.

For such tables – as well as for pretty=1 ones – the record size must be specified using the LRECL option. Be sure you don’t specify it too small as it is used to allocate the read/write buffers and the memory used for parsing the rows. If in doubt, be generous as it does not cost much in memory allocation.

Another format exists, noted by Pretty=1, which is similar to this one but has some additions to represent a JSON array. A header and a trailer records are added containing the opening and closing square bracket, and all records but the last are followed by a comma. It has the same advantages for reading and updating, but inserting and deleting are executed in the pretty=2 way.

Alternate Table Arrangement

We have seen that the most natural way to represent a table in a JSON file is to make it on an array of objects. However, other possibilities exist. A table can be an array of arrays, a one column table can be an array of values, or a one row table can be just one object or one value. Single row tables are internally handled by adding a one value array around them.

Let us see how to handle, for instance, a table that is an array of arrays. The file:

A table can be created on this file as:

From Connect 1.07.0002:

From Connect 1.6:

Until Connect 1.5:

Columns are specified by their position in the row arrays. By default, this is zero-based but for this table the base was set to 1 by the Base option of the option list. Another new option in the option list is Jmode=1. It indicates what type of table this is. The Jmode values are:

An array of objects. This is the default.
An array of Array. Like this one.
An array of values.

When reading, this is not required as the type of the array items is specified for the columns; however, it is required when inserting new rows so CONNECT knows what to insert. For instance:

After this, it is displayed as:

Unspecified array values are represented by their first element.

Getting and Setting JSON Representation of a Column

We have seen that columns corresponding to a Json object or array are retrieved by default as the concatenation of all its values separated by a blank. It is also possible to retrieve and display such column contains as the full JSON string corresponding to it in the JSON file. This is specified in the JPATH by a “*” where the object or array would be specified.

Note: When having columns generated by discovery, this can be specified by adding the STRINGIFY option to ON or 1 in the option list.

For instance:

From Connect 1.07.0002:

From Connect 1.6:

Until Connect 1.5:

Now the query:

will return and display :

json_Author

Note: Prefixing the column name by json_ is optional but is useful when using the column as argument to Connect UDF functions, making it to be surely recognized as valid Json without aliasing.

This also works on input, a column specified so that it can be directly set to a valid JSON string.

This feature is of great value as we will see below.

Create, Read, Update and Delete Operations on JSON Tables

The SQL commands INSERT, UPDATE and DELETE are fully supported for JSON tables except those returned by REST queries. For INSERT and UPDATE, if the target values are simple values, there are no problems.

However, there are some issues when the added or modified values are objects or arrays.

Concerning objects, the same problems exist that we have already seen with the XML type. The added or modified object will have the format described in the table definition, which can be different from the one of the JSON file. Modifications should be done using a file specifying the full path of modified objects.

New problems are raised when trying to modify the values of an array. Only updates can be done on the original table. First of all, for the values of the array to be distinct values, all update operations concerning array values must be done using a table expanding this array.

For instance, to modify the authors of the biblio.json based table, the jsampex table must be used. Doing so, updating and deleting authors is possible using standard SQL commands. For example, to change the first name of Knab from François to John:

However It would be wrong to do:

Because this would change the first name of both authors as they share the same ISBN.

Where things become more difficult is when trying to delete or insert an author of a book. Indeed, a delete command will delete the whole book and an insert command will add a new complete row instead of adding a new author in the same array. Here we are penalized by the SQL language that cannot give us a way to specify this. Something like:

However this does not exist in SQL. Does this mean that it is impossible to do it? No, but it requires us to use a table specified on the same file but adapted to this task. One way to do it is to specify a table for which the authors are no more an expanded array. Supposing we want to add an author to the “XML en Action” book. We will do it on a table containing just the author(s) of that book, which is the second book of the table.

From Connect 1.6:

Until Connect 1.5

The command:

replies:

FIRSTNAME

LASTNAME

It is a standard JSON table that is an array of objects in which we can freely insert or delete rows.

We can check that this was done correctly by:

This will display:

ISBN

Title

AuthorFN

AuthorLN

Year

Note: If this table were a big table with many books, it would be difficult to know what the order of a specific book is in the table. This can be found by adding a special ROWID column in the table.

However, an alternate way to do it is by using direct JSON column representation as in the JSAMPLE2 table. This can be done by:

Here, we didn't have to find the index of the sub array to modify. However, this is not quite satisfying because we had to manually write the whole JSON value to set to the json_Author column.

Therefore we need specific functions to do so. They are introduced now.

JSON User Defined Functions

Although such functions written by other parties do exist,[] CONNECT provides its own UDFs that are specifically adapted to the JSON table type and easily available because, being inside the CONNECT library or DLL, they require no additional module to be loaded (see to make these functions in a separate library module).

Here is the list of the CONNECT functions; more can be added if required.

Name

Type

Return

Description

Added

String values are mapped to JSON strings. These strings are automatically escaped to conform to the JSON syntax. The automatic escaping is bypassed when the value has an alias beginning with ‘json_’. This is automatically the case when a JSON UDF argument is another JSON UDF whose name begins with “json_” (not case sensitive). This is why all functions that do not return a Json item are not prefixed by “json_”.

Argument string values, for some functions, can alternatively be json file names. When this is ambiguous, alias them as jfile_. Full path should be used because UDF functions has no means to know what the current database is. Apparently, when the file name path is not full, it is based on the MariaDB data directory but I am not sure it is always true.

Numeric values are (big) integers, double floating point values or decimal values. Decimal values are character strings containing a numeric representation and are treated as strings. Floating point values contain a decimal point and/or an exponent. Integers are written without decimal points.

To install these functions execute the following commands :[]

Note

Json function names are often written on this page with leading upper case letters for clarity. It is possible to do so in SQL queries because function names are case insensitive. However, when creating or dropping them, their names must match the case they are in the library module, which is in lower case.

On Unix systems (from Connect 1.7.02):

On Unix systems (from Connect 1.6):

On Unix systems (until Connect 1.5):

On WIndows (from Connect 1.7.02):

On WIndows (from Connect 1.6):

On WIndows (until Connect 1.5):

Jfile_Bjson

MariaDB starting with

JFile_Bjson was introduced in MariaDB.

Converts the first argument pretty=0 json file to Bjson file. B(inary)json is a pre-parsed json format. It is described below in the Performance chapter (available in next Connect versions).

Jfile_Convert

MariaDB starting with

JFile_Convert was introduced in MariaDB.

Converts the first argument json file to another pretty=0 json file. The third integer argument is the record length to use. This is often required to process huge json files that would be very slow if they were in pretty=2 format.

This is done without completely parsing the file, is very fast and requires no big memory.

Jfile_Make

Jfile_Make was added in CONNECT 1.4

The first argument must be a json item (if it is just a string, Jfile_Make will try its best to see if it is a json item or an input file name). The following arguments are a string file name and an integer pretty value (defaulting to 2) in any order. This function creates a json file containing the first argument item.

The returned string value is the created file name. If not specified as an argument, the file name can in some cases be retrieved from the first argument; in such cases the file itself is modified.

This function can be used to create or format a json file. For instance, supposing we want to format the file tb.json, this can be done with the query:

The tb.json file are changed to:

Json_Array_Add

Note: The following describes this function for CONNECT version 1.4 only. The first argument must be a JSON array. The second argument is added as member of this array:

Array

Note: The first array is not escaped, its (alias) name beginning with ‘json_’.

Now we can see how adding an author to the JSAMPLE2 table can alternatively be done:

Note: Calling a column returning JSON a name prefixed by json_ (like json_author here) is good practice and removes the need to give it an alias to prevent escaping when used as an argument.

Additional arguments: If a third integer argument is given, it specifies the position (zero based) of the added value:

Array

If a string argument is added, it specifies the Json path to the array to be modified. For instance:

Json_Array_Add('{"a":1,"b":2,"c":[3, 4]}' json_, 5, 1, 'c')

Json_Array_Add_Values

Json_Array_Add_Values added in CONNECT 1.4 replaces the function Json_Array_Add of CONNECT version 1.3.

The first argument must be a JSON array string. Then all other arguments are added as members of this array:

Array

Json_Array_Delete

The first argument should be a JSON array. The second argument is an integer indicating the rank (0 based conforming to general json usage) of the element to delete:

Array

Now we can see how to delete the second author from the JSAMPLE2 table:

A Json path can be specified as a third string argument

Json_Array_Grp

This is an aggregate function that makes an array filled from values coming from the rows retrieved by a query. Let us suppose we have the pet table:

name

race

number

The query:

will return:

name

One problem with the JSON aggregate functions is that they construct their result in memory and cannot know the needed amount of storage, not knowing the number of rows of the used table.

Therefore, the number of values for each group is limited. This limit is the value of JsonGrpSize whose default value is 10 but can be set using the JsonSet_Grp_Size function. Nevertheless, working on a larger table is possible, but only after setting JsonGrpSize to the ceiling of the number of rows per group for the table. Try not to set it to a very large value to avoid memory exhaustion.

JsonContains

This function can be used to check whether an item is contained in a document. Its arguments are the same than the ones of the JsonLocate function; only the return value changes. The integer returned value is 1 is the item is contained in the document or 0 otherwise.

JsonContains_Path

This function can be used to check whether a Json path is contained in the document. The integer returned value is 1 is the path is contained in the document or 0 otherwise.

Json_File

The first argument must be a file name. This function returns the text of the file that is supposed to be a json file. If only one argument is specified, the file text is returned without being parsed. Up to two additional arguments can be specified:

A string argument is the path to the sub-item to be returned. An integer argument specifies the pretty format value of the file.

This function is chiefly used to get the json item argument of other json functions from a json file. For instance, supposing the file tb.json is:

Extracting a value from it can be done with a query such as:

This query returns:

Type

However, we’ll see that, most of the time, it is better to use Jbin_File or to directly specify the file name in queries. In particular this function should not be used for queries that must modify the json item because, even if the modified json is returned, the file itself would be unchanged.

Json_Get_Item

Json_Get_Item was added in CONNECT 1.4.

This function returns a subset of the json document passed as first argument. The second argument is the json path of the item to be returned and should be one returning a json item (terminated by a ‘*’). If not, the function will try to make it right but this is not foolproof. For instance:

The correct path should have been ‘second.*’), but in this simple case the function was able to make it right. The returned item:

item

Note: The array is aliased “json_second” to indicate it is a json item and avoid escaping it. However, the “json_” prefix is skipped when making the object and must not be added to the path.

JsonGet_Grp_Size

This function returns the JsonGrpSize value.

JsonGet_String / JsonGet_Int / JsonGet_Real

JsonGet_String, JsonGet_Int and JsonGet_Real were added in CONNECT 1.4.

The first argument should be a JSON item. If it is a string with no alias, it are converted as a json item. The second argument is the path of the item to be located in the first argument and returned, eventually converted according to the used function:

This query returns:

String

Int

Real

The function JsonGet_Real can be given a third argument to specify the number of decimal digits of the returned value. For instance:

This query returns:

String

The given path can specify all operators for arrays except the “expand” [*] operator). For instance:

The result:

Rank

Number

Concat

Sum

Avg

Json_Item_Merge

This function merges two arrays or two objects. For arrays, this is done by adding to the first array all the values of the second array. For instance:

The function returns:

Result

For objects, the pairs of the second object are added to the first object if the key does not yet exist in it; otherwise the pair of the first object is set with the value of the matching pair of the second object. For instance:

The function returns:

Result

JsonLocate

The first argument must be a JSON tree. The second argument is the item to be located. The item to be located can be a constant or a json item. Constant values must be equal in type and value to be found. This is "shallow equality" – strings, integers and doubles won't match.

This function returns the json path to the located item or null if it is not found:

This query returns:

Path

The path syntax is the same used in JSON CONNECT tables.

By default, the path of the first occurrence of the item is returned. The third parameter can be used to specify the occurrence whose path is to be returned. For instance:

first

second

wrong type

json

For string items, the comparison is case sensitive by default. However, it is possible to specify a string to be compared case insensitively by giving it an alias beginning by “ci”:

Path

Json_Locate_All

The first argument must be a JSON item. The second argument is the item to be located. This function returns the paths to all locations of the item as an array of strings:

This query returns:

All paths

The returned array can be applied other functions. For instance, to get the number of occurrences of an item in a json tree, you can do:

The displayed result:

Nb of occurs

If specified, the third integer argument set the depth to search in the document. This means the maximum items in the paths. This value defaults to 10 but can be increased for complex documents or reduced to set the maximum wanted depth of the returned paths.

Json_Make_Array

Json_Make_Array returns a string denoting a JSON array with all its arguments as members:

Json_Make_Array(56, 3.1416, 'My name is "Foo"',N ULL)

Note: The argument list can be void. If so, a void array is returned.

Json_Make_Object

Json_Make_Object returns a string denoting a JSON object. For instance:

The object is filled with pairs corresponding to the given arguments. The key of each pair is made from the argument (default or specified) alias.

Json_Make_Object(56, 3.1416, 'machin', NULL)

When needed, it is possible to specify the keys by giving an alias to the arguments:

Json_Make_Object(56 qty,3.1416 price,'machin' truc, NULL garanty)

If the alias is prefixed by ‘json_’ (to prevent escaping) the key name is stripped from that prefix.

This function is chiefly useful when entering values retrieved from a table, the key being by default the column name:

Json_Make_Object(matricule, nom, titre, salaire)

Json_Object_Add

The first argument must be a JSON object. The second argument is added as a pair to this object:

newobj

Note: If the specified key already exists in the object, its value is replaced by the new one.

The third string argument is a Json path to the target object.

Json_Object_Delete

The first argument must be a JSON object. The second argument is the key of the pair to delete:

newobj

The third string argument is a Json path to the object to be the target of deletion.

Json_Object_Grp

This function works like Json_Array_Grp. It makes a JSON object filled with value pairs whose keys are passed from its first argument and values are passed from its second argument.

This can be seen with the query:

This query returns:

name

json_object_grp(number,race)

Json_Object_Key

Return a string denoting a JSON object. For instance:

The object is filled with pairs made from each key/value arguments.

Json_Object_Key('qty', 56, 'price', 3.1416, 'truc', 'machin', 'garanty', NULL)

Json_Object_List

The first argument must be a JSON object. This function returns an array containing the list of all keys existing in the object:

Key List

Json_Object_Nonull

This function works like but “null” arguments are ignored and not inserted in the object. Arguments are regarded as “null” if they are JSON null values, void arrays or objects, or arrays or objects containing only null members.

It is mainly used to avoid constructing useless null items when converting tables (see later).

Json_Object_Values

The first argument must be a JSON object. This function returns an array containing the list of all values existing in the object:

Value List

JsonSet_Grp_Size

This function is used to set the JsonGrpSize value. This value is used by the following aggregate functions as a ceiling value of the number of items in each group. It returns the JsonGrpSize value that can be its default value when passed 0 as argument.

Json_Set_Item / Json_Insert_Item / Json_Update_Item

These functions insert or update data in a JSON document and return the result. The value/path pairs are evaluated left to right. The document produced by evaluating one pair becomes the new value against which the next pair is evaluated.

Json_Set_Item replaces existing values and adds non-existing values.
Json_Insert_Item inserts values without replacing existing values.
Json_Update_Item replaces only existing values.

Example:

This query returns:

Set

Insert

Update

JsonValue

Returns a JSON value as a string, for instance:

JsonValue(3.1416)

The “JBIN” return type

Almost all functions returning a json string - whose name begins with Json_ - have a counterpart with a name beginning with Jbin_. This is both for performance (speed and memory) as well as for better control of what the functions should do.

This is due to the way CONNECT UDFs work internally. The Json functions, when receiving json strings as parameters, parse them and construct a binary tree in memory. They work on this tree and before returning; serialize this tree to return a new json string.

If the json document is large, this can take up a large amount of time and storage space. It is all right when one simple json function is called – it must be done anyway – but is a waste of time and memory when json functions are used as parameters to other json functions.

To avoid multiple serializing and parsing, the Jbin functions should be used as parameters to other functions. Indeed, they do not serialize the memory document tree, but return a structure allowing the receiving function to have direct access to the memory tree. This saves the serialize-parse steps otherwise needed to pass the argument and removes the need to reallocate the memory of the binary tree, which by the way is 6 to 7 times the size of the json string. For instance:

This query returns:

Result

Here the binary json tree allocated by Jbin_Array is completed by Jbin_Array_Add and Json_Object and serialized only once to make the final result string. It would be serialized and parsed two more times if using “Json” functions.

Note that Jbin results are recognized as such because they are aliased beginning with “Jbin_”. This is why in the Json_Object function the alias is specified as “Jbin_foo”.

What happens if it is not recognized as such? These functions are declared as returning a string and to take care of this, the returned structure begins with a zero-terminated string. For instance:

This query replies:

Jbin_Array('a','b','c')

Note: When testing, the tree returned by a “Jbin” function can be seen using the Json_Serialize function whose unique parameter must be a “Jbin” result. For instance:

This query returns:

Json_Serialize(Jbin_Array('a','b','c'))

Note: For this simple example, this is equivalent to using the Json_Array function.

Using a file as json UDF first argument

We have seen that many json UDFs can have an additional argument not yet described. This is in the case where the json item argument was referring to a file. Then the additional integer argument is the pretty value of the json file. It matters only when the first argument is just a file name (to make the UDF understand this argument is a file name, it should be aliased with a name beginning with jfile_) or if the function modifies the file, in which case it are rewritten with this pretty format.

The json item is created by extracting the required part from the file. This can be the whole file but more often only some of it. There are two ways to specify the sub-item of the file to be used:

Specifying it in the Json_File or Jbin_File arguments.
Specifying it in the receiving function (not possible for all functions).

It doesn’t make any difference when the Jbin_File is used but it does with Json_File. For instance:

The second query returns:

Json_Array_Add(Json_File('test.json', 'b'), 66)

It just returns the – modified -- subset returned by the Json_File function, while the query:

returns what was received from Json_File with the modification made on the subset.

Json_Array_Add(Json_File('test.json'), 66, 'b')

Note that in both case the test.json file is not modified. This is because the Json_File function returns a string representing all or part of the file text but no information about the file name. This is all right to check what would be the effect of the modification to the file.

However, to have the file modified, use the Jbin_File function or directly give the file name. Jbin_File returns a structure containing the file name, a pointer to the file parsed tree and eventually a pointer to the subset when a path is given as a second argument:

This query returns:

Json_Array_Add(Jbin_File('test.json', 'b'), 66)

This time the file is modified. This can be checked with:

Json_File('test.json', 3)

The reason why the first argument is returned by such a query is because of tables such as:

In this table, the jfile_cols column just contains a file name. If we update it by:

This is the test.json file that must be modified, not the jfile_cols column. This can be checked by:

JsonGet_String(jfile_cols, '[1]:*')

Note: It was an important facility to name the second column of the table beginning by “jfile_” so the json functions knew it was a file name without obliging to specify an alias in the queries.

Using “Jbin” to control what the query execution does

This is applying in particular when acting on json files. We have seen that a file was not modified when using the Json_File function as an argument to a modifying function because the modifying function just received a copy of the json file. This is not true when using the Jbin_File function that does not serialize the binary document and make it directly accessible. Also, as we have seen earlier, json functions that modify their first file parameter modify the file and return the file name. This is done by directly serializing the internal binary document as a file.

However, the “Jbin” counterpart of these functions does not serialize the binary document and thus does not modify the json file. For example let us compare these two queries:

/* First query */

/* Second query */

Both queries return:

Result

In the first query Jbin_Object_Add does not serialize the document (no “Jbin” functions do) and Json_Object just returns a serialized modified tree. Consequently, the file bt2.json is not modified. This query is all right to copy a modified version of the json file without modifying it.

However, in the second query Json_Object_Add does modify the json file and returns the file name. The Json_Object function receives this file name, reads and parses the file, makes an object from it and returns the serialized result. This modification can be done willingly but can be an unwanted side effect of the query.

Therefore, using “Jbin” argument functions, in addition to being faster and using less memory, are also safer when dealing with json files that should not be modified.

Using JSON as Dynamic Columns

The JSON nosql language has all the features to be used as an alternative to dynamic columns. For instance, take the following example of dynamic columns:

/* Remove a column: */

/* Add a column: */

/* You can also list all columns, or get them together with their values in JSON format: */

The same result can be obtained with json columns using the json UDF’s:

/* JSON equivalent */

/* Remove a column: */

/* Add a column */

/* You can also list all columns, or get them together with their values in JSON format: */

However, using JSON brings features not existing in dynamic columns:

Use of a language used by many implementation and developers.
Full support of arrays, currently missing from dynamic columns.
Access of subpart of json by JPATH that can include calculations on arrays.
Possible references to json files.

With more experience, additional UDFs can be easily written to support new needs.

New Set of BSON Functions

All these functions have been rewritten using the new JSON handling way and are temporarily available changing the J starting name to B. Then Json_Make_Array new style is called using Bson_Make_Array. Some, such as Bson_Item_Delete, are new and some fix bugs found in their Json counterpart.

Converting Tables to JSON

The JSON UDF’s and the direct Jpath “*” facility are powerful tools to convert table and files to the JSON format. For instance, the file biblio3.json we used previously can be obtained by converting the xsample.xml file. This can be done like this:

From Connect 1.07.0002

Before Connect 1.07.0002

And then :

The xj1 table rows will directly receive the Json object made by the select statement used in the insert statement and the table file are made as shown (xj1 is pretty=2 by default) Its mode is Jmode=2 because the values inserted are strings even if they denote json objects.

Another way to do this is to create a table describing the file format we want before the biblio3.json file existed:

From Connect 1.07.0002

Before Connect 1.07.0002

and to populate it by:

This is a simpler method. However, the issue is that this method cannot handle the multiple column values. This is why we inserted from xsampall not from xsampall2. How can we add the missing multiple authors in this table? Here again we must create a utility table able to handle JSON strings. From Connect 1.07.0002

Before Connect 1.07.0002

Voilà !

Converting json files

We have seen that json files can be formatted differently depending on the pretty option. In particular, big data files should be formatted with pretty equal to 0 when used by a CONNECT json table. The best and simplest way to convert a file from one format to another is to use the Jfile_Make function. Indeed this function makes a file of specified format using the syntax:

The file name is optional when the json document comes from a Jbin_File function because the returned structure makes it available. For instance, to convert back the json file tb.json to pretty= 0, this can be simply done by:

Performance Consideration

MySQL and PostgreSQL have a JSON data type that is not just text but an internal encoding of JSON data. This is to save parsing time when executing JSON functions. Of course, the parse must be done anyway when creating the data and serializing must be done to output the result.

CONNECT directly works on character strings impersonating JSON values with the need of parsing them all the time but with the advantage of working easily on external data. Generally, this is not too penalizing because JSON data are often of some or reasonable size. The only case where it can be a serious problem is when working on a big JSON file.

Then, the file should be formatted or converted to pretty=0.

From Connect 1.7.002, this easily done using the Jfile_Convert function, for instance:

Such a json file should not be used directly by JSON UDFs because they parse the whole file, even when only a subset is used. Instead, it should be used by a JSON table created on it. Indeed, JSON tables do not parse the whole document but just the item corresponding to the row they are working on. In addition, indexing can be used by the table as explained previously on this page.

Generally speaking, the maximum flexibility offered by CONNECT is by using JSON tables and JSON UDFs together. Some things are better handled by tables, other by UDFs. The tools are there but it is up to you to discover the best way to resolve your problems.

Bjson files

Starting with Connect 1.7.002, pretty=0 json files can be converted to a binary format that is a pre-parsed representation of json. This can be done with the Jfile_Bjson UDF function, for instance:

Here the third argument, the record length, must 6 to 10 times larger than the lrecl of the initial json file because the parsed representation is bigger than the original json text representation.

Tables using such Bjson files must specify ‘Pretty=-1’ in the option list.

It is probably similar to the BSON used by MongoDB and PostgreSQL and permits to process queries up to 10 times faster than working on text json files. Indexing is also available for tables using this format making even more performance improvement. For instance, some queries on a json table of half a million rows, that were previously done in more than 10 seconds, took only 0.1 second when converted and indexed.

Here again, this has been remade to use the new way Json is handled. The files made using the bfile_bjson function are only from two to four times the size of the source files. This new representation is not compatible with the old one. Therefore, these files must be used with BSON tables only.

Specifying a JSON table Encoding

An important feature of JSON is that strings should in UNICODE. As a matter of fact, all examples we have found on the Internet seemed to be just ASCII. This is because UNICODE is generally encoded in JSON files using UTF8 or UTF16 or UTF32.

To specify the required encoding, just use the data_charset CONNECT option or the native DEFAULT CHARSET option.

Retrieving JSON data from MongoDB

Classified as a NoSQL database program, MongoDB uses JSON-like documents (BSON) grouped in collections. The simplest way, and only method available before Connect 1.6, to access MongoDB data was to export a collection to a JSON file. This produces a file having the pretty=0 format. Viewed as SQL, a collection is a table and documents are table rows.

Since CONNECT version 1.6, it is now possible to directly access MongoDB collections via their MongoDB C Driver. This is the purpose of the MONGO table type described later. However, JSON tables can also do it in a somewhat different way (providing MONGO support is installed as described for MONGO tables).

It is achieved by specifying the MongoDB connection URI while creating the table. For instance:

From Connect 1.7.002

Before Connect 1.7.002

In this statement, the file_name option was replaced by the connection option. It is the URI enabling to retrieve data from a local or remote MongoDB server. The tabname option is the name of the MongoDB collection that are used and the dbname option could have been used to indicate the database containing the collection (it defaults to the current database).

The way it works is that the documents retrieved from MongoDB are serialized and CONNECT uses them as if they were read from a file. This implies serializing by MongoDB and parsing by CONNECT and is not the best performance wise. CONNECT tries its best to reduce the data transfer when a query contains a reduced column list and/or a where clause. This way makes all the possibilities of the JSON table type available, such as calculated arrays.

However, to work on large JSON collations, using the MONGO table type is generally the normal way.

Note: JSON tables using the MongoDB access accept the specific MONGO options , and . They are described in the MONGO table chapter.

Summary of Options and Variables Used with Json Tables

Options and variables that can be used when creating Json tables are listed here:

Table Option

Type

Description

(*) For Json tables connected to MongoDB, Mongo specific options can also be used.

Other options must be specified in the option list:

Table Option

Type

Description

Column options:

Column Option

Type

Description

Variables used with Json tables are:

Notes

The value n can be 0 based or 1 based depending on the base table option. The default is 0 to match what is the current usage in the Json world but it can be set to 1 for tables created in old versions.
See for instance: , and
This will not work when CONNECT is compiled embedded

_{This page is licensed: CC BY-SA / Gnu FDL}

InnoDB System Variables

This page lists the system variables available for configuring InnoDB's behavior, performance, buffers, and logs.

This page documents system variables related to the InnoDB storage engine. For options that are not system variables, see InnoDB Options.

See Server System Variables for a complete list of system variables and instructions on setting them.

Also see the Full list of MariaDB options, system and status variables.

`have_innodb`

Description: If the server supports , are set to YES, otherwise are set to NO. Removed in , use the table or instead.
Scope: Global
Dynamic: No
Removed:

`ignore_builtin_innodb`

Description: Setting this to 1 results in the built-in InnoDB storage engine being ignored. In some versions of MariaDB, XtraDB is the default and is always present, so this variable is ignored and setting it results in a warning. From to , when InnoDB was the default instead of XtraDB, this variable needed to be set. Usually used in conjunction with the option to use the InnoDB plugin.
Command line: --ignore-builtin-innodb
Scope: Global

`innodb_adaptive_checkpoint`

Description: Replaced with . Controls adaptive checkpointing. InnoDB's fuzzy checkpointing can cause stalls, as many dirty blocks are flushed at once as the checkpoint age nears the maximum. Adaptive checkpointing aims for more consistent flushing, approximately modified age / maximum checkpoint age. Can result in larger transaction log files
- reflex Similar to flushing but flushes blocks constantly and contiguously based on the oldest modified age. If the age exceeds 1/2 of the maximum age capacity, flushing are weak contiguous. If the age exceeds 3/4, flushing are strong. Strength can be adjusted by the variable .

`innodb_adaptive_flushing`

Description: If set to 1, the default, the server will dynamically adjust the flush rate of dirty pages in the . This assists to reduce brief bursts of I/O activity. If set to 0, adaptive flushing will only take place when the limit specified by is reached.
Command line: --innodb-adaptive-flushing={0|1}
Scope: Global

`innodb_adaptive_flushing_lwm`

Description: Adaptive flushing is enabled when this low water mark percentage of the capacity is reached. Takes effect even if is disabled.
Command line: --innodb-adaptive-flushing-lwm=#
Scope: Global
Dynamic: Yes

`innodb_adaptive_flushing_method`

Description: Determines the method of flushing dirty blocks from the InnoDB . If set to native or 0, the original InnoDB method is used. The maximum checkpoint age is determined by the total length of all transaction log files. When the checkpoint age reaches the maximum checkpoint age, blocks are flushed. This can cause lag if there are many updates per second and many blocks with an almost identical age need to be flushed. If set to estimate or 1, the default, the oldest modified age are compared with the maximum age capacity. If it's more than 1/4 of this age, blocks are flushed every second. The number of blocks flushed is determined by the number of modified blocks, the LSN progress speed and the average age of all modified blocks. It's therefore independent of the for the 1-second loop, but not entirely so for the 10-second loop. If set to keep_average or 2, designed specifically for SSD cards, a shorter loop cycle is used in an attempt to keep the I/O rate constant. Removed in /XtraDB 5.6 and replaced with InnoDB flushing method from MySQL 5.6.

`innodb_adaptive_hash_index`

Description: If set to 1, the default until , the hash index is enabled. Based on performance testing (), the InnoDB adaptive hash index helps performance in mostly read-only workloads, and could slow down performance in other environments, especially , , , or operations.
Command line: --innodb-adaptive-hash-index={0|1}
Scope: Global

`innodb_adaptive_hash_index_partitions`

Description: Specifies the number of partitions for use in adaptive searching. If set to 1, no extra partitions are created. XtraDB-only. From (which uses InnoDB as default instead of XtraDB), this is an alias for to allow for easier upgrades.
Command line: innodb-adaptive-hash-index-partitions=#
Scope: Global

`innodb_adaptive_hash_index_parts`

Description: Specifies the number of partitions for use in adaptive searching. If set to 1, no extra partitions are created.
Command line: innodb-adaptive-hash-index-parts=#
Scope: Global
Dynamic: No

`innodb_adaptive_max_sleep_delay`

Description: Maximum time in microseconds to automatically adjust the value to, based on the workload. Useful in extremely busy systems with hundreds of thousands of simultaneous connections. 0 disables any limit. Deprecated and ignored from .
Command line: --innodb-adaptive-max-sleep-delay=#
Scope: Global

`innodb_additional_mem_pool_size`

Description: Size in bytes of the memory pool used for storing information about internal data structures. Defaults to 8MB, if your application has many tables and a large structure, and this is exceeded, operating system memory are allocated and warning messages written to the error log, in which case you should increase this value. Deprecated in and removed in along with InnoDB's internal memory allocator.
Command line: --innodb-additional-mem-pool-size=#
Scope: Global

`innodb_alter_copy_bulk`

Description: Allow bulk insert operation for copy alter operation.
Scope: Global
Dynamic: Yes
Data Type: boolean

`innodb_api_bk_commit_interval`

Description: Time in seconds between auto-commits for idle connections using the InnoDB memcached interface (not implemented in MariaDB).
Command line: --innodb-api-bk-commit-interval=#
Scope: Global
Dynamic: Yes

`innodb_api_disable_rowlock`

Description: For use with MySQL's memcached (not implemented in MariaDB)
Command line: --innodb-api-disable-rowlock={0|1}
Scope: Global
Dynamic: No

`innodb_api_enable_binlog`

Description: For use with MySQL's memcached (not implemented in MariaDB)
Command line: --innodb-api-enable-binlog={0|1}
Scope: Global
Dynamic: No

`innodb_api_enable_mdl`

Description: For use with MySQL's memcached (not implemented in MariaDB)
Command line: --innodb-api-enable-mdl={0|1}
Scope: Global
Dynamic: No

`innodb_api_trx_level`

Description: For use with MySQL's memcached (not implemented in MariaDB)
Command line: --innodb-api-trx-level=#
Scope: Global
Dynamic: Yes

`innodb_auto_lru_dump`

Description: Renamed since XtraDB 5.5.10-20.1, which was in turn replaced by in .
Command line: --innodb-auto-lru-dump=#
Removed: XtraDB 5.5.10-20.1

`innodb_autoextend_increment`

Description: Size in MB to increment an auto-extending shared tablespace file when it becomes full. If was set to 1, this setting does not apply to the resulting per-table tablespace files, which are automatically extended in their own way.
Command line: --innodb-autoextend-increment=#
Scope: Global

`innodb_autoinc_lock_mode`

Description: The lock mode that is used when generating values for InnoDB tables.
- Valid values are:
  - 0 is the traditional lock mode.

`innodb_background_scrub_data_check_interval`

Description: Check if spaces needs scrubbing every seconds. See . Deprecated and ignored from .
Command line: --innodb-background-scrub-data-check-interval=#
Scope: Global
Dynamic: Yes

`innodb_background_scrub_data_compressed`

Description: Enable scrubbing of compressed data by background threads (same as encryption_threads). See . Deprecated and ignored from .
Command line: --innodb-background-scrub-data-compressed={0|1}
Scope: Global
Dynamic: Yes

`innodb_background_scrub_data_interval`

Description: Scrub spaces that were last scrubbed longer than this number of seconds ago. See . Deprecated and ignored from .
Command line: --innodb-background-scrub-data-interval=#
Scope: Global
Dynamic: Yes

`innodb_background_scrub_data_uncompressed`

Description: Enable scrubbing of uncompressed data by background threads (same as encryption_threads). See . Deprecated and ignored from .
Command line: --innodb-background-scrub-data-uncompressed={0|1}
Scope: Global
Dynamic: Yes

`innodb_blocking_buffer_pool_restore`

Description: If set to 1 (0 is default), XtraDB will wait until the least-recently used (LRU) dump is completely restored upon restart before reporting back to the server that it has successfully started up. Available with XtraDB only, not InnoDB.
Command line: innodb-blocking-buffer-pool-restore={0|1}
Scope: Global

`innodb_buf_dump_status_frequency`

Description: Determines how often (as a percent) the buffer pool dump status should be printed in the logs. For example, 10 means that the buffer pool dump status is printed when every 10% of the number of buffer pool pages are dumped. The default is 0 (only start and end status is printed).
Command line: --innodb-buf-dump-status-frequency=#
Scope: Global

`innodb_buffer_pool_chunk_size`

Description: Chunk size used for dynamically resizing the . Note that changing this setting can change the size of the buffer pool. When is used this value is effectively rounded up to the next multiple of . See . From , the variable is autosized based on the .
Command line: --innodb-buffer-pool-chunk-size=#
Scope: Global

`innodb_buffer_pool_dump_at_shutdown`

Description: Whether to record pages cached in the on server shutdown, which reduces the length of the warmup the next time the server starts. The related specifies whether the buffer pool is automatically warmed up at startup.
Command line: --innodb-buffer-pool-dump-at-shutdown={0|1}
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_dump_now`

Description: Immediately records pages stored in the . The related does the reverse, and will immediately warm up the buffer pool.
Command line: --innodb-buffer-pool-dump-now={0|1}
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_dump_pct`

Description: Dump only the hottest N% of each .
Command line: --innodb-buffer-pool-dump-pct={0|1}
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_evict`

Description: Evict pages from the buffer pool. If set to "uncompressed" then all uncompressed pages are evicted from the buffer pool. Variable to be used only for testing. Only exists in DEBUG builds.
Command line: --innodb-buffer-pool-evict=#
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_filename`

Description: The file that holds the list of page numbers set by and .
Command line: --innodb-buffer-pool-filename=file
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_instances`

Description: If is set to more than 1GB, innodb_buffer_pool_instances divides the buffer pool into the specified number of instances. The default was 1 in , but for large systems with buffer pools of many gigabytes, many instances could help reduce contention concurrency through . The default is 8 in MariaDB 10 (except on Windows 32-bit, where it varies according to , or from , where it is set to 1 if < 1GB). Each instance manages its own data structures and takes an equal portion of the total buffer pool size, so for example if innodb_buffer_pool_size is 4GB and innodb_buffer_pool_instances is set to 4, each instance are 1GB. Each instance should ideally be at least 1GB in size. Starting with , performance improvements intended to reduce the overhead of context-switching between buffer pools changed the recommended number of innodb_buffer_pool_instances to one for every 128GB of buffer pool size. Based on these changes, the variable is deprecated and ignored from , where the buffer pool runs in a single instance regardless of size.
Command line: --innodb-buffer-pool-instances=#

`innodb_buffer_pool_load_abort`

Description: Aborts the process of restoring contents started by or .
Command line: --innodb-buffer-pool-load-abort={0|1}
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_load_at_startup`

Description: Specifies whether the is automatically warmed up when the server starts by loading the pages held earlier. The related specifies whether pages are saved at shutdown. If the buffer pool is large and taking a long time to load, increasing at startup may help.
Command line: --innodb-buffer-pool-load-at-startup={0|1}
Scope: Global

`innodb_buffer_pool_load_now`

Description: Immediately warms up the by loading the stored data pages. The related does the reverse, and immediately records pages stored in the buffer pool.
Command line: --innodb-buffer-pool-load-now={0|1}
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_load_pages_abort`

Description: Number of pages during a buffer pool load to process before signaling . Debug builds only.
Command line: --innodb-buffer-pool-load-pages-abort=#
Scope: Global
Dynamic: Yes

`innodb_buffer_pool_populate`

Description: When set to 1 (0 is default), XtraDB will preallocate pages in the buffer pool on starting up so that NUMA allocation decisions are made while the buffer cache is still clean. XtraDB only. This option was made ineffective in . Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-buffer-pool-populate={0|1}
Scope: Global

`innodb_buffer_pool_restore_at_startup`

Description: Time in seconds between automatic buffer pool dumps. If set to a non-zero value, XtraDB will also perform an automatic restore of the at startup. If set to 0, automatic dumps are not performed, nor automatic restores on startup. Replaced by in .
Command line: innodb-buffer-pool-restore-at-startup
Scope: Global

`innodb_buffer_pool_shm_checksum`

Description: Used with Percona's SHM buffer pool patch in XtraDB 5.5. Was shortly deprecated and removed in XtraDB 5.6. XtraDB only.
Command line: innodb-buffer-pool-shm-checksum={0|1}
Scope: Global
Dynamic: No

`innodb_buffer_pool_shm_key`

Description: Used with Percona's SHM buffer pool patch in XtraDB 5.5. Later deprecated in XtraDB 5.5, and removed in XtraDB 5.6.
Command line: innodb-buffer-pool-shm-key={0|1}
Scope: Global
Dynamic: No

`innodb_buffer_pool_size`

Description: InnoDB buffer pool size in bytes. The primary value to adjust on a database server with entirely/primarily tables, can be set up to 80% of the total memory in these environments. See the for more on setting this variable, and also if doing so dynamically.
Command line: --innodb-buffer-pool-size=#
Scope: Global

`innodb_buffer_pool_size_auto_min`

Description: Minimum innodb_buffer_pool_size in bytes for dynamic shrinking on memory pressure. Only affects Linux. If a memory pressure event is reported by Linux, the innodb_buffer_pool_size may be automatically shrunk towards this value. By default, set to , that is, memory pressure events will be ignored. 0 sets no minimum value.
Command line: --innodb-buffer-pool-size-auto-min=#
Scope: Global

`innodb_buffer_pool_size_max`

Description: Maximum innodb_buffer_pool_size value.
Command line: --innodb-buffer-pool-size-max=#
Scope: Global
Dynamic: No

`innodb_change_buffer_dump`

Description: If set, causes the contents of the InnoDB change buffer to be dumped to the server error log at startup. Only available in debug builds.
Scope: Global
Dynamic: No
Data Type: boolean

`innodb_change_buffer_max_size`

Description: Maximum size of the as a percentage of the total buffer pool. The default is 25%, and this can be increased up to 50% for servers with high write activity, and lowered down to 0 for servers used exclusively for reporting.
Command line: --innodb-change-buffer-max-size=#
Scope: Global
Dynamic: Yes

`innodb_change_buffering`

Description: Sets how change buffering is performed. See for details on the settings. Deprecated and ignored from .
Command line: --innodb-change-buffering=#
Scope: Global
Dynamic: Yes

`innodb_change_buffering_debug`

Description: If set to 1, an debug flag is set. 1 forces all changes to the change buffer, while 2 causes a crash at merge. 0, the default, indicates no flag is set. Only available in debug builds.
Command line: --innodb-change-buffering-debug=#

`innodb_checkpoint_age_target`

Description: The maximum value of the checkpoint age. If set to 0, has no effect. Removed in /XtraDB 5.6 and replaced with InnoDB flushing method from MySQL 5.6.
Command line: innodb-checkpoint-age-target=#
Scope: Global

`innodb_checksum_algorithm`

Description: Specifies how the InnoDB tablespace checksum is generated and verified.
- innodb: Backwards compatible with earlier versions (<= ). Deprecated in , , and removed in . If really needed, data files can still be converted with .
- crc32: A newer, faster algorithm, but incompatible with earlier versions. Tablespace blocks are converted to the new format over time, meaning that a mix of checksums may be present.

`innodb_checksums`

Description: By default, performs checksum validation on all pages read from disk, which provides extra fault tolerance. You would usually want this set to 1 in production environments, although setting it to 0 can provide marginal performance improvements. Deprecated and functionality replaced by in , and should be removed to avoid conflicts. ON is equivalent to --innodb_checksum_algorithm=innodb and OFF to --innodb_checksum_algorithm=none.
Command line: --innodb-checksums

`innodb_cleaner_lsn_age_factor`

Description: XtraDB has enhanced page cleaner heuristics, and with these in place, the default InnoDB adaptive flushing may be too aggressive. As a result, a new LSN age factor formula has been introduced, controlled by this variable. The default setting, high_checkpoint, uses the new formula, while the alternative, legacy, uses the original algorithm. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: --innodb-cleaner-lsn-age-factor=value
Scope: Global

`innodb_cmp_per_index_enabled`

Description: If set to ON (OFF is default), per-index compression statistics are stored in the table. These are expensive to record, so this setting should only be changed with care, such as for performance tuning on development or replica servers.
Command line: --innodb-cmp-per-index-enabled={0|1}
Scope: Global

`innodb_commit_concurrency`

Description: Limit to the number of transaction threads that can commit simultaneously. 0, the default, imposes no limit. While you can change from one positive limit to another at runtime, you cannot set this variable to 0, or change it from 0, while the server is running. Deprecated and ignored from .
Command line: --innodb-commit-concurrency=#
Scope: Global
Dynamic: Yes

`innodb_compression_algorithm`

Description: Compression algorithm used for . The supported values are:
- none: Pages are not compressed.
- zlib: Pages are compressed using the bundled compression algorithm.

`innodb_compression_default`

Description: Whether or not is enabled by default for new tables.
- The default value is OFF, which means new tables are not compressed.
- See for more information.
Command line:

`innodb_compression_failure_threshold_pct`

Description: Specifies the percentage cutoff for expensive compression failures during updates to a table that uses , after which free space is added to each new compressed page, dynamically adjusted up to the level set by . Zero disables checking of compression efficiency and adjusting padding.
- See for more information.
Command line: --innodb-compression-failure-threshold-pct=#

`innodb_compression_level`

Description: Specifies the default level of compression for tables that use .
- Only a subset of InnoDB page compression algorithms support compression levels. If an InnoDB page compression algorithm does not support compression levels, then the compression level value is ignored.
- The compression level can be set to any value between 1 and 9. The default compression level is 6. The range goes from the fastest to the most compact, which means that

`innodb_compression_pad_pct_max`

Description: The maximum percentage of reserved free space within each compressed page for tables that use . Reserved free space is used when the page's data is reorganized and might be recompressed. Only used when is not zero, and the rate of compression failures exceeds its setting.
- See for more information.
Command line: --innodb-compression-pad-pct-max=#

`innodb_concurrency_tickets`

Description: Number of times a newly-entered thread can enter and leave until it is again subject to the limitations of and may possibly be queued. Deprecated and ignored from .
Command line: --innodb-concurrency-tickets=#
Scope: Global
Dynamic: Yes

`innodb_corrupt_table_action`

Description: What action to perform when a corrupt table is found. XtraDB only.
- When set to assert, the default, XtraDB will intentionally crash the server when it detects corrupted data in a single-table tablespace, with an assertion failure.
- When set to warn, it will pass corruption as corrupt table instead of crashing, and disable all further I/O (except for deletion) on the table file.

`innodb_data_file_buffering`

Description: Whether to enable the file system cache for data files. Set to OFF by default, are set to ON if is set to fsync, littlesync, nosync, or (Windows specific) normal.
Command line: --innodb-data-file-buffering={0|1}

`innodb_data_file_path`

Description: Individual data files, paths and sizes. The value of is joined to each path specified by innodb_data_file_path to get the full directory path. If innodb_data_home_dir is an empty string, absolute paths can be specified here. A file size is specified (with K for kilobytes, M for megabytes and G for gigabytes). Also whether or not to autoextend the data file, and whether or not to on startup may also be specified.
Command line: --innodb-data-file-path=name
Scope: Global

`innodb_data_file_write_through`

Description: Whether writes to InnoDB data files (including the temporary tablespace) are write through. Set to OFF by default, are set to ON if is set to O_DSYNC. On systems that support FUA it may make sense to enable write-through, to avoid extra system calls.
Command line: --innodb-data-file-write-through={0|1}
Scope: Global

`innodb_data_home_dir`

Description: Directory path for all data files in the shared tablespace (assuming is not enabled). File-specific information can be added in , as well as absolute paths if innodb_data_home_dir is set to an empty string.
Command line: --innodb-data-home-dir=path
Scope: Global
Dynamic: No

`innodb_deadlock_detect`

Description: By default, the InnoDB deadlock detector is enabled. If set to off, deadlock detection is disabled and MariaDB will rely on instead. This may be more efficient in systems with high concurrency as deadlock detection can cause a bottleneck when a number of threads have to wait for the same lock.
Command line: --innodb-deadlock-detect
Scope: Global
Dynamic: Yes

`innodb_deadlock_report`

Description: How to report deadlocks (if ).
- off: Do not report any details of deadlocks.
- basic: Report transactions and waiting locks.

`innodb_default_page_encryption_key`

Description: Encryption key used for page encryption.
- See and for more information.
Command line: --innodb-default-page-encryption-key=#
Scope: Global

`innodb_default_encryption_key_id`

Description: ID of encryption key used by default to encrypt InnoDB tablespaces.
- See and for more information.
Command line: --innodb-default-encryption-key-id=#
Scope: Global, Session

`innodb_default_row_format`

Description: Specifies the default to be used for InnoDB tables. The compressed row format cannot be set as the default.
- See for more information.
Command line: --innodb-default-row-format=value

`innodb_defragment`

Description: When set to 1 (the default is 0), InnoDB defragmentation is enabled. When set to FALSE, all existing defragmentation are paused and new defragmentation commands will fail. Paused defragmentation commands will resume when this variable is set to true again. See .
Command line: --innodb-defragment={0|1}
Scope: Global

`innodb_defragment_fill_factor`

Description:. Indicates how full defragmentation should fill a page. Together with ensures defragmentation won’t pack the page too full and cause page split on the next insert on every page. The variable indicating more defragmentation gain is the one effective. See .
Command line: --innodb-defragment-fill-factor=#
Scope: Global
Dynamic: Yes

`innodb_defragment_fill_factor_n_recs`

Description: Number of records of space that defragmentation should leave on the page. This variable, together with , is introduced so defragmentation won't pack the page too full and cause page split on the next insert on every page. The variable indicating more defragmentation gain is the one effective. See .
Command line: --innodb-defragment-fill-factor-n-recs=#
Scope: Global

`innodb_defragment_frequency`

Description: Maximum times per second for defragmenting a single index. This controls the number of times the defragmentation thread can request X_LOCK on an index. The defragmentation thread will check whether 1/defragment_frequency (s) has passed since it last worked on this index, and put the index back in the queue if not enough time has passed. The actual frequency can only be lower than this given number. See .
Command line: --innodb-defragment-frequency=#
Scope: Global

`innodb_defragment_n_pages`

Description: Number of pages considered at once when merging multiple pages to defragment. See .
Command line: --innodb-defragment-n-pages=#
Scope: Global
Dynamic: Yes

`innodb_defragment_stats_accuracy`

Description: Number of defragment stats changes there are before the stats are written to persistent storage. Defaults to zero, meaning disable defragment stats tracking. See .
Command line: --innodb-defragment-stats-accuracy=#
Scope: Global
Dynamic: Yes

`innodb_dict_size_limit`

Description: Size in bytes of a soft limit the memory used by tables in the data dictionary. Once this limit is reached, XtraDB will attempt to remove unused entries. If set to 0, the default and standard InnoDB behavior, there is no limit to memory usage. Removed in /XtraDB 5.6 and replaced by MySQL 5.6's new implementation.
Command line: innodb-dict-size-limit=#
Scope: Global

`innodb_disable_sort_file_cache`

Description: If set to 1 (0 is default), the operating system file system cache for merge-sort temporary files is disabled.
Command line: --innodb-disable-sort-file-cache={0|1}
Scope: Global

`innodb_disallow_writes`

Description: Tell InnoDB to stop any writes to disk.
Command line: None
Scope: Global
Dynamic: Yes

`innodb_doublewrite`

Description: If set to ON, the default, to improve fault tolerance first stores data to a before writing it to data file. Disabling will provide a marginal performance improvement, and assumes that writes of are atomic. fast is available from , and is like ON, but writes are not synchronized to data files. The deprecated start-up parameter will cause innodb_doublewrite=ON to be changed to innodb_doublewrite=fast, which will prevent InnoDB from making any durable writes to data files. This would normally be done right before the log checkpoint LSN is updated. Depending on the file systems being used and their configuration, this may or may not be safe.
The value innodb_doublewrite=fast differs from the previous combination of innodb_doublewrite=ON and innodb_flush_method=O_DIRECT_NO_FSYNC by always invoking os_file_flush() on the doublewrite buffer itself in buf_dblwr_t::flush_buffered_writes_completed(). This should be safer when there are multiple doublewrite batches between checkpoints.
Typically, once per second, buf_flush_page_cleaner() would write out up to innodb_io_capacity pages and advance the log checkpoint. Also typically, innodb_io_capacity>128, which is the size of the doublewrite buffer in pages. Should os_file_flush_func() not be invoked between doublewrite batches, writes could be reordered in an unsafe way.

`innodb_doublewrite_file`

Description: The absolute or relative path and filename to a dedicated tablespace for the . In heavy workloads, the doublewrite buffer can impact heavily on the server, and moving it to a different drive will reduce contention on random reads. Since the doublewrite buffer is mostly sequential writes, a traditional HDD is a better choice than SSD. This Percona XtraDB variable has not been ported to XtraDB 5.6.
Command line: innodb-doublewrite-file=filename
Scope: Global

`innodb_empty_free_list_algorithm`

Description: XtraDB 5.6.13-61 introduced an algorithm to assist with reducing mutex contention when the buffer pool free list is empty, controlled by this variable. If set to backoff, the default until , the new algorithm are used. If set to legacy, the original InnoDB algorithm are used. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades. See for the reasons this was changed back to legacy in XtraDB 5.6.36-82.0. When upgrading from 10.0 to 10.1 (>= 10.1.24), for large buffer pools the default will remain backoff, while for small ones it are changed to legacy.
Command line: innodb-empty-free-list-algorithm=value

`innodb_enable_unsafe_group_commit`

Description: Unneeded after XtraDB 1.0.5. If set to 0, the default, InnoDB will keep transactions between the transaction log and s in the same order. Safer, but slower. If set to 1, transactions can be group-committed, but there is no guarantee of the order being kept, and a small risk of the two logs getting out of sync. In write-intensive environments, can lead to a significant improvement in performance.
Command line: --innodb-enable-unsafe-group-commit
Scope: Global

`innodb_encrypt_log`

Description: Enables encryption of the . This also enables encryption of some temporary files created internally by InnoDB, such as those used for merge sorts and row logs.
- See and for more information.
Command line: --innodb-encrypt-log

`innodb_encrypt_tables`

Description: Enables automatic encryption of all InnoDB tablespaces.
- OFF - Disables table encryption for all new and existing tables that have the table option set to DEFAULT.
- ON - Enables table encryption for all new and existing tables that have the table option set to DEFAULT

`innodb_encrypt_temporary_tables`

Description: Enables automatic encryption of the InnoDB .
- See and for more information.
Command line: --innodb-encrypt-temporary-tables={0|1}

`innodb_encryption_rotate_key_age`

Description: Re-encrypt in background any page having a key older than this number of key versions. When setting up encryption, this variable must be set to a non-zero value. Otherwise, when you enable encryption through MariaDB won't be able to automatically encrypt any unencrypted tables.
- See and for more information.
Command line: --innodb-encryption-rotate-key-age=#

`innodb_encryption_rotation_iops`

Description: Use this many iops for background key rotation operations performed by the background encryption threads.
- See and for more information.
Command line: --innodb-encryption-rotation_iops=#

`innodb_encryption_threads`

Description: Number of background encryption threads performing background key rotation and . When setting up encryption, this variable must be set to a non-zero value. Otherwise, when you enable encryption through MariaDB won't be able to automatically encrypt any unencrypted tables. Recommended never be set higher than 255.
- See and for more information.
Command line: --innodb-encryption-threads=#

`innodb_extra_rsegments`

Description: Removed in XtraDB 5.5 and replaced by . Usually there is one rollback segment protected by single mutex, a source of contention in high write environments. This option specifies a number of extra user rollback segments. Changing the default will make the data readable by XtraDB only, and is incompatible with InnoDB. After modifying, the server must be slow-shutdown. If there is existing data, it must be dumped before changing, and re-imported after the change has taken effect.
Command line: --innodb-extra-rsegments=#
Scope: Global

`innodb_extra_undoslots`

Description: Usually, InnoDB has 1024 undo slots in its rollback segment, so 1024 transactions can run in parallel. New transactions will fail if all slots are used. Setting this variable to 1 expands the available undo slots to 4072. Not recommended unless you get the Warning: cannot find a free slot for an undo log error in the error log, as it makes data files unusable for ibbackup, or MariaDB servers not run with this option. See also .
Command line: --innodb-extra-undoslots={0|1}
Scope: Global

`innodb_fake_changes`

Description: From until , XtraDB-only option that enables the fake changes feature. In , setting up or restarting a replica can cause a replication reads to perform more slowly, as MariaDB is single-threaded and needs to read the data before it can execute the queries. This can be speeded up by prefetching threads to warm the server, replaying the statements and then rolling back at commit. This however has an overhead from locking rows only then to undo changes at rollback. Fake changes attempts to reduce this overhead by reading the rows for INSERT, UPDATE and DELETE statements but not updating them. The rollback is then very fast with little or nothing to do. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades. Not present in and beyond.
Command line: --innodb-fake-changes={0|1}

`innodb_fast_checksum`

Description: Implements a more CPU efficient XtraDB checksum algorithm, useful for write-heavy loads with high I/O. If set to 1 on a server with tables that have been created with it set to 0, reads are slower, so tables should be recreated (dumped and reloaded). XtraDB will fail to start if set to 0 and there are tables created while set to 1. Replaced with in /XtraDB 5.6.
Command line: --innodb-fast-checksum={0|1}

`innodb_fast_shutdown`

Description: The shutdown mode.
- 0 - InnoDB performs a slow shutdown, including full purge (before , not always, due to ) and change buffer merge. Can be very slow, even taking hours in extreme cases.
- 1 - the default, performs a fast shutdown, not performing a full purge or an insert buffer merge.

`innodb_fatal_semaphore_wait_threshold`

Description: In MariaDB, the fatal semaphore timeout is configurable. This variable sets the maximum number of seconds for semaphores to time out in InnoDB.
Command line: --innodb-fatal-semaphore-wait-threshold=#
Scope: Global
Dynamic: No

`innodb_file_format`

Description: File format for new tables. Can either be Antelope, the default and the original format, or Barracuda, which supports . Note that this value is also used when a table is re-created with an which requires a table copy. See for more on the file formats. Removed in 10.3.1 and restored as a deprecated and unused variable in 10.4.3 for compatibility purposes.
Command line: --innodb-file-format=value
Scope: Global

`innodb_file_format_check`

Description: If set to 1, the default, checks the shared tablespace file format tag. If this is higher than the current version supported by XtraDB/InnoDB (for example Barracuda when only Antelope is supported), XtraDB/InnoDB will not start. If it the value is not higher, XtraDB/InnoDB starts correctly and the value is set to this value. If innodb_file_format_check is set to 0, no checking is performed. See for more on the file formats.
Command line: --innodb-file-format-check={0|1}
Scope: Global

`innodb_file_format_max`

Description: The highest file format. This is set to the value of the file format tag in the shared tablespace on startup (see ). If the server later creates a higher table format, innodb_file_format_max is set to that value. See for more on the file formats.
Command line: --innodb-file-format-max=value
Scope: Global

`innodb_file_per_table`

Description: If set to ON, then new tables are created with their own . If set to OFF, then new tables are created in the instead. is only available with file-per-table tablespaces. Note that this value is also used when a table is re-created with an which requires a table copy. Deprecated in as there's no benefit to setting to OFF, the original InnoDB default.
Command line: --innodb-file-per-table

`innodb_fill_factor`

Description: Percentage of B-tree page filled during bulk insert (sorted index build). Used as a hint rather than an absolute value. Setting to 70, for example, reserves 30% of the space on each B-tree page for the index to grow in future.
Command line: --innodb-fill-factor=#
Scope: Global

`innodb_flush_log_at_timeout`

Description: Interval in seconds to write and flush the . Before MariaDB 10, this was fixed at one second, which is still the default, but this can now be changed. It's usually increased to reduce flushing and avoid impacting performance of binary log group commit.
Scope: Global
Dynamic: Yes
Data Type: numeric

`innodb_flush_log_at_trx_commit`

Description: Set to 1, along with for the greatest level of fault tolerance. The value of determines whether this variable can be reset with a SET statement or not.
- 1 The default, the log buffer is written to the file and a flush to disk performed after each transaction. This is required for full ACID compliance.
- 0 Nothing is done on commit; rather the log buffer is written and flushed to the

`innodb_flush_method`

Description: flushing method. Windows always uses async_unbuffered and this variable then has no effect. On Unix, before , by default fsync() is used to flush data and logs. Adjusting this variable can give performance improvements, but behavior differs widely on different filesystems, and changing from the default has caused problems in some situations, so test and benchmark carefully before adjusting. In MariaDB, Windows recognises and correctly handles the Unix methods, but if none are specified it uses own default - unbuffered write (analog of O_DIRECT) + syncs (e.g FileFlushBuffers()) for all files.
- O_DSYNC - O_DSYNC is used to open and flush logs, and fsync() to flush the data files.
- O_DIRECT

`innodb_flush_neighbor_pages`

Description: Determines whether, when dirty pages are flushed to the data file, neighboring pages in the data file are flushed at the same time. If set to none, the feature is disabled. If set to area, the default, the standard InnoDB behavior is used. For each page to be flushed, dirty neighboring pages are flushed too. If there's little head seek delay, such as SSD or large enough write buffer, one of the other two options may be more efficient. If set to cont, for each page to be flushed, neighboring contiguous blocks are flushed at the same time. Being contiguous, a sequential I/O is used, unlike the random I/O used in area. Replaced by in /XtraDB 5.6.
Command line: innodb-flush-neighbor-pages=value

`innodb_flush_neighbors`

Description: Determines whether flushing a page from the will flush other dirty pages in the same group of pages (extent). In high write environments, if flushing is not aggressive enough, it can fall behind resulting in higher memory usage, or if flushing is too aggressive, cause excess I/O activity. SSD devices, with low seek times, would be less likely to require dirty neighbor flushing to be set. Since an attempt is made under Windows and Linux to determine SSD status which was exposed in . This variable is ignored for table spaces that are detected as stored on SSD (and the 0 behavior applies).
- 1: The default, flushes contiguous dirty pages in the same extent from the buffer pool.

`innodb_flush_sync`

Description: If set to ON, the default, the setting is ignored for I/O bursts occurring at checkpoints.
Command line: --innodb-flush-sync={0|1}
Scope: Global
Dynamic: Yes

`innodb_flushing_avg_loops`

Description: Determines how quickly adaptive flushing will respond to changing workloads. The value is the number of iterations that a previously calculated flushing state snapshot is kept. Increasing the value smooths and slows the rate that the flushing operations change, while decreasing it causes flushing activity to spike quickly in response to workload changes.
Command line: --innodb-flushing-avg-loops=#
Scope: Global

`innodb_force_load_corrupted`

Description: Set to 0 by default, if set to 1, are permitted to load tables marked as corrupt. Only use this to recover data you can't recover any other way, or in troubleshooting. Always restore to 0 when the returning to regular use. Given that in aims to allow any metadata for a missing or corrupted table to be dropped, and given that and and related tasks made DDL operations crash-safe, the parameter no longer serves any purpose and was removed in .
Command line: --innodb-force-load-corrupted

`innodb_force_primary_key`

Description: If set to 1 (0 is default) CREATE TABLEs without a primary or unique key where all keyparts are NOT NULL will not be accepted, and will return an error.
Command line: --innodb-force-primary-key
Scope: Global

`innodb_force_recovery`

Description: crash recovery mode. 0 is the default. The other modes are for recovery purposes only, and no data can be changed while another mode is active. Some queries relying on indexes are also blocked. See for more on mode specifics.
Command line: --innodb-force-recovery=#
Scope: Global

`innodb_foreground_preflush`

Description: Before XtraDB 5.6.13-61.0, if the checkpoint age is in the sync preflush zone while a thread is writing to the , it will try to advance the checkpoint by issuing a flush list flush batch if this is not already being done. XtraDB has enhanced page cleaner tuning, and may already be performing furious flushing, resulting in the flush simply adding unneeded mutex pressure. Instead, the thread now waits for the flushes to finish, and then has two options, controlled by this variable. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
- exponential_backoff - thread sleeps while it waits for the flush list flush to occur. The sleep time randomly progressively increases, periodically reset to avoid runaway sleeps.

`innodb_ft_aux_table`

Description: Diagnostic variable intended only to be set at runtime. It specifies the qualified name (for example test/ft_innodb) of an InnoDB table that has a , and after being set the INFORMATION_SCHEMA tables , , INNODB_FT_CONFIG, , and will contain search index information for the specified table.
Command line: --innodb-ft-aux-table=value
Scope: Global

`innodb_ft_cache_size`

Description: Cache size available for a parsed document while creating an InnoDB .
Command line: --innodb-ft-cache-size=#
Scope: Global
Dynamic: No

`innodb_ft_enable_diag_print`

Description: If set to 1, additional search diagnostic output is enabled.
Command line: --innodb-ft-enable-diag-print={0|1}
Scope: Global
Dynamic: Yes

`innodb_ft_enable_stopword`

Description: If set to 1, the default, a set of is associated with an InnoDB when it is created. The stopword list comes from the table set by the session variable , if set, otherwise the global variable , if that is set, or the if neither variable is set.
Command line: --innodb-ft-enable-stopword={0|1}
Scope: Global

`innodb_ft_max_token_size`

Description: Maximum length of words stored in an InnoDB . A larger limit will increase the size of the index, slowing down queries, but permit longer words to be searched for. In most normal situations, longer words are unlikely search terms.
Command line: --innodb-ft-max-token-size=#
Scope: Global
Dynamic: No

`innodb_ft_min_token_size`

Description: Minimum length of words stored in an InnoDB . A smaller limit will increase the size of the index, slowing down queries, but permit shorter words to be searched for. For data stored in a Chinese, Japanese or Korean , a value of 1 should be specified to preserve functionality.
Command line: --innodb-ft-min-token-size=#
Scope: Global

`innodb_ft_num_word_optimize`

Description: Number of words processed during each on an InnoDB . To ensure all changes are incorporated, multiple OPTIMIZE TABLE statements could be run in case of a substantial change to the index.
Command line: --innodb-ft-num-word-optimize=#
Scope: Global
Dynamic: Yes

`innodb_ft_result_cache_limit`

Description: Limit in bytes of the InnoDB query result cache per fulltext query. The latter stages of the full-text search are handled in memory, and limiting this prevents excess memory usage. If the limit is exceeded, the query returns an error.
Command line: --innodb-ft-result-cache-limit=#
Scope: Global
Dynamic: Yes

`innodb_ft_server_stopword_table`

Description: Table name containing a list of stopwords to ignore when creating an InnoDB , in the format db_name/table_name. The specified table must exist before this option is set, and must be an InnoDB table with a single column, a named VALUE. See also .
Command line: --innodb-ft-server-stopword-table=db_name/table_name
Scope: Global

`innodb_ft_sort_pll_degree`

Description: Number of parallel threads used when building an InnoDB . See also .
Command line: --innodb-ft-sort-pll-degree=#
Scope: Global
Dynamic: No

`innodb_ft_total_cache_size`

Description:Total memory allocated for the cache for all InnoDB tables. A force sync is triggered if this limit is exceeded.
Command line: --innodb-ft-total-cache-size=#
Scope: Global
Dynamic: No

`innodb_ft_user_stopword_table`

Description: Table name containing a list of stopwords to ignore when creating an InnoDB , in the format db_name/table_name. The specified table must exist before this option is set, and must be an InnoDB table with a single column, a named VALUE. See also .
Command line: --innodb-ft-user-stopword-table=db_name/table_name
Scope: Session

`innodb_ibuf_accel_rate`

Description: Allows the insert buffer activity to be adjusted. The following formula is used: [real activity] = [default activity] * (innodb_io_capacity/100) * (innodb_ibuf_accel_rate/100). As innodb_ibuf_accel_rate is increased from its default value of 100, the lowest setting, insert buffer activity is increased. See also . This Percona XtraDB variable has not been ported to XtraDB 5.6.
Command line: innodb-ibuf-accel-rate=#
Scope: Global

`innodb_ibuf_active_contract`

Description: Specifies whether the insert buffer can be processed before it's full. If set to 0, the standard InnoDB method is used, and the buffer is not processed until it's full. If set to 1, the default, the insert buffer can be processed before it is full. This Percona XtraDB variable has not been ported to XtraDB 5.6.
Command line: innodb-ibuf-active-contract=#
Scope: Global

`innodb_ibuf_max_size`

Description: Maximum size in bytes of the insert buffer. Defaults to half the size of the so you may want to reduce if you have a very large buffer pool. If set to 0, the insert buffer is disabled, which will cause all secondary index updates to be performed synchronously, usually at a cost to performance. This Percona XtraDB variable has not been ported to XtraDB 5.6.
Command line: innodb-ibuf-max-size=#
Scope: Global

`innodb_idle_flush_pct`

Description: Up to what percentage of dirty pages should be flushed when innodb finds it has spare resources to do so. Has had no effect since merging InnoDB 5.7 from mysql-5.7.9 (). Deprecated in , , and removed in .
Command line: --innodb-idle-flush-pct=#
Scope: Global

`innodb_immediate_scrub_data_uncompressed`

Description: Enable scrubbing of data. See .
Command line: --innodb-immediate-scrub-data-uncompressed={0|1}
Scope: Global
Dynamic: Yes

`innodb_import_table_from_xtrabackup`

Description: If set to 1, permits importing of .ibd files exported with the --export option. Previously named innodb_expand_import. Removed in /XtraDB 5.6 and replaced with MySQL 5.6's transportable tablespaces.
Command line: innodb-import-table-from-xtrabackup=#
Scope: Global

`innodb_instant_alter_column_allowed`

Description:
- If a table is altered using ALGORITHM=INSTANT, it can force the table to use a non-canonical format: A hidden metadata record at the start of the clustered index is used to store each column's DEFAULT value. This makes it possible to add new columns that have default values without rebuilding the table. Starting with , a BLOB in the hidden metadata record is used to store column mappings. This makes it possible to drop or reorder columns without rebuilding the table. This also makes it possible to add columns to any position or drop columns from any position in the table without rebuilding the table. If a column is dropped without rebuilding the table, old records will contain garbage in that column's former position, and new records are written with NULL values, empty strings, or dummy values.
- This is generally not a problem. However, there may be cases where you want to avoid putting a table into this format. For example, to ensure that future UPDATE operations after an ADD COLUMN are performed in-place, to reduce write amplification. (Instantly added columns are essentially always variable-length.) Also avoid bugs similar to

`innodb_instrument_semaphores`

Description: Enable semaphore request instrumentation. This could have some effect on performance but allows better information on long semaphore wait problems.
Command line: --innodb-instrument-semaphores={0|1}
Scope: Global
Dynamic: Yes

`innodb_io_capacity`

Description: Limit on I/O activity for InnoDB background tasks, including merging data from the insert buffer and flushing pages. Should be set to around the number of I/O operations per second that system can handle, based on the type of drive/s being used. You can also set it higher when the server starts to help with the extra workload at that time, and then reduce for normal use. Ideally, opt for a lower setting, as at higher value data is removed from the buffers too quickly, reducing the effectiveness of caching. See also .
- See for more information.
Command line: --innodb-io-capacity=#

`innodb_io_capacity_max`

Description: Upper limit to which InnoDB can extend in case of emergency. See for more information.
Command line: --innodb-io-capacity-max=#
Scope: Global
Dynamic: Yes

`innodb_kill_idle_transaction`

Description: Time in seconds before killing an idle XtraDB transaction. If set to 0 (the default), the feature is disabled. Used to prevent accidental user locks. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Scope: Global
Dynamic: Yes
Data Type:

`innodb_large_prefix`

Description: If set to 1, tables that use specific are permitted to have index key prefixes up to 3072 bytes (for 16k pages, ). If not set, the limit is 767 bytes.
- This applies to the and row formats.
- Removed in 10.3.1 and restored as a deprecated and unused variable in 10.4.3 for compatibility purposes.

`innodb_lazy_drop_table`

Description: Deprecated and removed in XtraDB 5.6. processing can take a long time when is set to 1 and there's a large . If innodb_lazy_drop_table is set to 1 (0 is default), XtraDB attempts to optimize processing by deferring the dropping of related pages from the until there is time, only initially marking them.
Command line: innodb-lazy-drop-table={0|1}

`innodb_lock_schedule_algorithm`

Description: Removed in due to problems with the VATS implementation (). Specifies the algorithm that InnoDB uses to decide which of the waiting transactions should be granted the lock once it has been released. The possible values are: FCFS (First-Come-First-Served) where locks are granted in the order they appear in the lock queue and VATS (Variance-Aware-Transaction-Scheduling) where locks are granted based on the Eldest-Transaction-First heuristic. Note that VATS should not be used with , and InnoDB will refuse to start if VATS is used with Galera. It is also not recommended to set to VATS even in the general case (). From , the value was changed to FCFS and a warning produced when using Galera.

`innodb_lock_wait_timeout`

Description: Time in seconds that an InnoDB transaction waits for an InnoDB record lock (or table lock) before giving up with the error ERROR 1205 (HY000): Lock wait timeout exceeded; try restarting transaction. When this occurs, the statement (not transaction) is rolled back. The whole transaction can be rolled back if the option is used. Increase this for data warehousing applications or where other long-running operations are common, or decrease for OLTP and other highly interactive applications. This setting does not apply to deadlocks, which InnoDB detects immediately, rolling back a deadlocked transaction. 0 means no wait. See . Setting to 100000000 or more (from , 100000000 is the maximum) means the timeout is infinite.
Command line: --innodb-lock-wait-timeout=#

`innodb_locking_fake_changes`

Description: From to , XtraDB-only option that if set to OFF, fake transactions (see ) don't take row locks. This is an experimental feature to attempt to deal with drawbacks in fake changes blocking real locks. It is not safe for use in all environments. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: --innodb-locking-fake-changes
Scope: Global

`innodb_locks_unsafe_for_binlog`

Description: Set to 0 by default, in which case XtraDB/InnoDB uses . If set to 1, gap locking is disabled for searches and index scans. Deprecated in , and removed in , use instead.
Command line: --innodb-locks-unsafe-for-binlog
Scope: Global

`innodb_log_arch_dir`

Description: The directory for archiving. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: --innodb-log-arch-dir=name
Scope: Global
Dynamic: No

`innodb_log_arch_expire_sec`

Description: Time in seconds since the last change after which the archived should be deleted. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: --innodb-log-arch-expire-sec=#
Scope: Global
Dynamic: Yes

`innodb_log_archive`

Description: Whether or not archiving is enabled. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: --innodb-log-archive={0|1}
Scope: Global
Dynamic: Yes

`innodb_log_block_size`

Description: Size in bytes of the records. Generally 512, the default, or 4096, are the only two useful values. If the server is restarted and this value is changed, all old log files need to be removed. Should be set to 4096 for SSD cards or if is set to ALL_O_DIRECT on ext4 filesystems. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-log-block-size=#

`innodb_log_buffer_size`

Description: Size in bytes of the buffer for writing files to disk. Increasing this means larger transactions can run without needing to perform disk I/O before committing.
Command line: --innodb-log-buffer-size=#
Scope: Global
Dynamic: No

`innodb_log_checkpoint_now`

Description: Write back dirty pages from the and update the log checkpoint. Prior to , , was only available in debug builds. Introduced in order to force checkpoints before a backup, allowing mariadb-backup to create much smaller incremental backups. However, this comes at the cost of heavy I/O usage and it is now disabled by default.
Command line: --innodb-log-checkpoint{=1|0}
Scope: Global

`innodb_log_checksum_algorithm`

Description: Experimental feature (as of ), this variable specifies how to generate and verify checksums. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
- none - No checksum. A constant value is instead written to logs, and no checksum validation is performed.
- innodb - The default, and the original InnoDB algorithm. This is inefficient, but compatible with all MySQL, MariaDB and Percona versions that don't support other checksum algorithms.

`innodb_log_checksums`

Description: If set to 1, the CRC32C for Innodb or innodb_log_checksum_algorithm for XtraDB algorithm is used for pages. If disabled, the checksum field contents are ignored. From , the variable is deprecated, and checksums are always calculated, as previously, the InnoDB redo log used the slow innodb algorithm, but with hardware or SIMD assisted CRC-32C computation being available, there is no reason to allow checksums to be disabled on the redo log.
Command line: innodb-log-checksums={0|1}
Scope: Global

`innodb_log_compressed_pages`

Description: Whether or not images of recompressed pages are stored in the . Deprecated and ignored from .
Command line: --innodb-log-compressed-pages={0|1}
Scope: Global
Dynamic: Yes

`innodb_log_file_buffering`

Description: Whether the file system cache for ib_logfile0 is enabled. In , MariaDB disabled the file system cache on the InnoDB write-ahead log file (ib_logfile0) by default on Linux. With in particular, writing to the log via the file system cache typically improves throughput, especially on slow storage or at a small number of concurrent transactions. For other values of innodb_flush_log_at_trx_commit, direct writes were observed to be mostly but not always faster. Whether it pays off to disable the file system cache on the log may depend on the type of storage, the workload, and the operating system kernel version. If the server is started up with , the value are changed to ON. Will be set to OFF if is set to O_DSYNC. On Linux, when the physical block size cannot be determined to be a power of 2 between 64 and 4096 bytes, the file system cache cannot be disabled, and innodb_log_file_buffering=ON cannot be changed. Linux and Windows only.
Command line: --innodb-log-file-buffering={0|1}

`innodb_log_file_mmap`

Description: Whether ib_logfile0 resides in persistent memory or should initially be memory-mapped. When using the default innodb_log_buffer_size=2m, mariadb-backup --backup would spend a lot of time re-reading and re-parsing the log. For reading the log file during mariadb-backup --backup, it is beneficial to memory-map the entire ib_logfile0 to the address space (typically 48 bits or 256 TiB) and read it from there, both during --backup and --prepare. OFF by default on most platforms, to avoid aggressive read-ahead of the entire ib_logfile0 in when only a tiny portion would be accessed. On Linux and FreeBSD the default is innodb_log_file_mmap=ON, because those platforms define a specific mmap(2) option for enabling such read-ahead and therefore it can be assumed that the default wouldbe on-demand paging. This parameter will only have impact on the initial InnoDB startup and recovery. Any writes to the log will use regular I/O, except when the ib_logfile0 is stored in a specially configured file system that is backed by persistent memory (Linux "mount -o dax").
Command line: --innodb-log-file-mmap{=0|1}

`innodb_log_file_size`

Description: Size in bytes of each file in the log group. The combined size can be no more than 512GB. Larger values mean less disk I/O due to less flushing checkpoint activity, but also slower recovery from a crash. In , crash recovery has been improved and shouldn't run out of memory, so the default has been increased. It can safely be set higher to reduce checkpoint flushing, even larger than .From the variable is dynamic, and the server no longer needs to be restarted for the resizing to take place. Unless the log is located in a persistent memory file system (PMEM), an attempt to innodb_log_file_size to less than are refused. Log resizing can be aborted by killing the connection that is executing the SET GLOBAL statement.
Command line: --innodb-log-file-size=#
Scope: Global

`innodb_log_file_write_through`

Description: Whether each write to ib_logfile0 is write through (disabling any caching, as in O_SYNC or O_DSYNC). Set to OFF by default, are set to ON if is set to O_DSYNC. On systems that support FUA it may make sense to enable write-through, to avoid extra system calls.
Command line: --innodb-log-file-write-through={0|1}
Scope: Global

`innodb_log_files_in_group`

Description: Number of physical files in the . Deprecated and ignored from
Command line: --innodb-log-files-in-group=#
Scope: Global
Dynamic: No

`innodb_log_group_home_dir`

Description: Path to the files. If none is specified, files named ib_logfile0 and so on, with a size of are created in the data directory.
Command line: --innodb-log-group-home-dir=path
Scope: Global
Dynamic: No

`innodb_log_optimize_ddl`

Description: Whether activity should be reduced when natively creating indexes or rebuilding tables. Reduced logging requires additional page flushing and interferes with . Enabling this may slow down backup and cause delay due to page flushing. Deprecated and ignored from . Deprecated (but not ignored) from , and .
Command line: --innodb-log-optimize-ddl={0|1}
Scope: Global

`innodb_log_spin_wait_delay`

Description: Delay between log buffer spin lock polls (0 to use a blocking latch). Specifically, enables a spin lock that will execute that many MY_RELAX_CPU() operations (such as the x86 PAUSE instruction) between successive attempts of acquiring the spin lock. On some hardware with certain workloads (observed on write intensive workloads on NUMA systems), the default setting results in a significant amount of time being spent in native_queued_spin_lock_slowpath() in the Linux kernel, plus context switching between user and kernel address space, in which case changing from the default (for example, setting to 50), may result in a performance improvement.
Command line: --innodb-log-spin-wait-delay=#
Scope: Global

`innodb_log_write_ahead_size`

Description: write ahead unit size to avoid read-on-write. Should match the OS cache block IO size. Removed in , and instead on Linux and Windows, the physical block size of the underlying storage is detected and used. Reintroduced in and later versions. On Linux and Windows, the default or the specified innodb_log_write_ahead_size are automatically adjusted to not be less than the physical block size (if it can be determined).
Command line: --innodb-log-write-ahead-size=#
Scope: Global

`innodb_lru_flush_size`

Description: Number of pages to flush on LRU eviction. Changes in , , , , and made this setting superfluous, and it is no longer used.
Command line: --innodb-lru-flush-size=#
Scope: Global

`innodb_lru_scan_depth`

Description: Specifies how far down the buffer pool least-recently used (LRU) list the cleaning thread should look for dirty pages to flush. This process is performed once a second. In an I/O intensive-workload, can be increased if there is spare I/O capacity, or decreased if in a write-intensive workload with little spare I/O capacity.
- See for more information.
Command line: --innodb-lru-scan-depth=#

`innodb_max_bitmap_file_size`

Description: Limit in bytes of the changed page bitmap files. For faster incremental backup with , XtraDB tracks pages with changes written to them according to the and writes the information to special changed page bitmap files. These files are rotated when the server restarts or when this limit is reached. XtraDB only. See also and .
- Deprecated and ignored in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-max-bitmap-file-size=#

`innodb_max_changed_pages`

Description: Limit to the number of changed page bitmap files (stored in the ). Zero is unlimited. See and . Previously named innodb_changed_pages_limit. XtraDB only.
- Deprecated and ignored in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-max-changed-pages=#

`innodb_max_dirty_pages_pct`

Description: Maximum percentage of unwritten (dirty) pages in the buffer pool.
- See for more information.
Command line: --innodb-max-dirty-pages-pct=#
Scope: Global

`innodb_max_dirty_pages_pct_lwm`

Description: Low water mark percentage of dirty pages that will enable preflushing to lower the dirty page ratio. The value 0 (default) means 'refer to '. (Note that 0 meant 0 in 10.5.7 to 10.5.8, but was then reverted back to "same as innodb_max_dirty_pages_pct" again in 10.5.9)
- See for more information.
Command line: --innodb-max-dirty-pages-pct-lwm=#

`innodb_max_purge_lag`

Description: When purge operations are lagging on a busy server, setting innodb_max_purge_lag can help. By default set to 0, no lag, the figure is used to calculate a time lag for each INSERT, UPDATE, and DELETE when the system is lagging. InnoDB keeps a list of transactions with delete-marked index records due to UPDATE and DELETE statements. The length of this list is purge_lag, and the calculation, performed every ten seconds, is as follows: ((purge_lag/innodb_max_purge_lag)×10)–5 microseconds.
Command line: --innodb-max-purge-lag=#
Scope: Global

`innodb_max_purge_lag_delay`

Description: Maximum delay in milliseconds imposed by the setting. If set to 0, the default, there is no maximum.
Command line: --innodb-max-purge-lag-delay=#
Scope: Global
Dynamic: Yes

`innodb_max_purge_lag_wait`

Description: Wait until History list length is below the specified limit.
Command line: --innodb-max-purge-wait=#
Scope: Global
Dynamic: Yes

`innodb_max_undo_log_size`

Description: If an undo tablespace is larger than this, it is marked for truncation if is set.
Command line: --innodb-max-undo-log-size=#
Scope: Global
Dynamic: Yes

`innodb_merge_sort_block_size`

Description: Size in bytes of the block used for merge sorting in fast index creation. Replaced in /XtraDB 5.6 by .
Command line: innodb-merge-sort-block-size=#
Scope: Global
Dynamic: Yes

`innodb_mirrored_log_groups`

Description: Unused. Restored as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Deprecated:
Removed: -

`innodb_mtflush_threads`

Description: Sets the number of threads to use in Multi-Threaded Flush operations. For more information, see .
- InnoDB's multi-thread flush feature was deprecated in and removed from . In later versions of MariaDB, use system variable instead.
- See for more information.

`innodb_monitor_disable`

Description: Disables the specified counters in the table.
Command line: --innodb-monitor-disable=string
Scope: Global
Dynamic: Yes

`innodb_monitor_enable`

Description: Enables the specified counters in the table.
Command line: --innodb-monitor-enable=string
Scope: Global
Dynamic: Yes

`innodb_monitor_reset`

Description: Resets the count value of the specified counters in the table to zero.
Command line: --innodb-monitor-reset=string
Scope: Global
Dynamic: Yes

`innodb_monitor_reset_all`

Description: Resets all values for the specified counters in the table.
Command line: ---innodb-monitor-reset-all=string
Scope: Global
Dynamic: Yes

`innodb_numa_interleave`

Description: Whether or not to use the NUMA interleave memory policy to allocate the . Before , required that MariaDB be compiled on a NUMA-enabled Linux system.
Command line: innodb-numa-interleave={0|1}
Scope: Global
Dynamic: No

`innodb_old_blocks_pct`

Description: Percentage of the to use for the old block sublist.
Command line: --innodb-old-blocks-pct=#
Scope: Global
Dynamic: Yes

`innodb_old_blocks_time`

Description: Time in milliseconds an inserted block must stay in the old sublist after its first access before it can be moved to the new sublist. '0' means "no delay". Setting a non-zero value can help prevent full table scans clogging the . See also .
Command line: --innodb-old-blocks-time=#
Scope: Global
Dynamic: Yes

`innodb_online_alter_log_max_size`

Description: The maximum size for temporary log files during online DDL (data and index structure changes). The temporary log file is used for each table being altered, or index being created, to store data changes to the table while the process is underway. The table is extended by up to the limit set by this variable. If this limit is exceeded, the online DDL operation fails and all uncommitted changes are rolled back. A lower value reduces the time a table could lock at the end of the operation to apply all the log's changes, but also increases the chance of the online DDL changes failing.
Command line: --innodb-online-alter-log-max-size=#
Scope: Global

`innodb_open_files`

Description: Maximum .ibd files MariaDB can have open at the same time. Only applies to systems with multiple XtraDB/InnoDB tablespaces, and is separate to the table cache and . The default, if is disabled, is 300 or the value of , whichever is higher. It will also auto-size up to the default value if it is set to a value less than 10.
Command line: --innodb-open-files=#
Scope: Global

`innodb_optimize_fulltext_only`

Description: When set to 1 (0 is default), will only process InnoDB data. Only intended for use during fulltext index maintenance.
Command line: --innodb-optimize-fulltext-only={0|1}
Scope: Global

`innodb_page_cleaners`

Description: Number of page cleaner threads. The default is 4, but the value are set to the number of if this is lower. If set to 1, only a single cleaner thread is used, as was the case until . Cleaner threads flush dirty pages from the , performing flush list and least-recently used (LRU) flushing. Deprecated and ignored from , as the original reasons for splitting the buffer pool have mostly gone away.
- See for more information.
Command line:

`innodb_page_size`

Description: Specifies the page size in bytes for all InnoDB tablespaces. The default, 16k, is suitable for most uses.
- A smaller InnoDB page size might work more effectively in a situation with many small writes (OLTP), or with SSD storage, which usually has smaller block sizes.
- A larger InnoDB page size can provide a larger .

`innodb_pass_corrupt_table`

Removed: XtraDB 5.5 - renamed .

`innodb_prefix_index_cluster_optimization`

Description: Enable prefix optimization to sometimes avoid cluster index lookups. Deprecated and ignored from , as the optimization is now always enabled.
Command line: --innodb-prefix-index-cluster-optimization={0|1}
Scope: Global
Dynamic: Yes

`innodb_print_all_deadlocks`

Description: If set to 1 (0 is default), all InnoDB transaction deadlock information is written to the .
Command line: --innodb-print-all-deadlocks={0|1}
Scope: Global

`innodb_purge_batch_size`

Description: Number of pages to purge in one batch from the history list. Together with has a small effect on tuning.
Command line: --innodb-purge-batch-size=#
Scope: Global
Dynamic: No

`innodb_purge_rseg_truncate_frequency`

Description: Frequency with which undo records are purged. Set by default to every 128 times, reducing this increases the frequency at which rollback segments are freed. See also . The motivation for introducing this in MySQL seems to have been to avoid stalls due to freeing undo log pages or truncating undo log tablespaces. In MariaDB, should be a much lighter operation because it will not involve any log checkpoint, hence this is deprecated and ignored from , , , , and . ()
Command line: -- innodb-purge-rseg-truncate-frequency=#
Scope: Global

`innodb_purge_threads`

Description: Number of background threads dedicated to InnoDB purge operations. The range is 1 to 32. At least one background thread is always used. Setting to a value greater than 1 creates that many separate purge threads. This can improve efficiency in some cases, such as when performing DML operations on many tables. See also .
Command line: --innodb-purge-threads=#
Scope: Global

`innodb_random_read_ahead`

Description: Originally, random read-ahead was always set as an optimization technique, but was removed in . innodb_random_read_ahead permits it to be re-instated if set to 1 (0) is default.
Command line: --innodb-random-read-ahead={0|1}
Scope: Global

`innodb_read_ahead`

Description: If set to linear, the default, XtraDB/InnoDB will automatically fetch remaining pages if there are enough within the same extent that can be accessed sequentially. If set to none, read-ahead is disabled. random has been removed and is now ignored, while both sets to both linear and random. Also see for more control on read-aheads. Removed in /XtraDB 5.6 and replaced by MySQL 5.6's .
Command line: innodb-read-ahead=value

`innodb_read_ahead_threshold`

Description: Minimum number of pages InnoDB must read sequentially from an extent of 64 before initiating an asynchronous read for the following extent.
Command line: --innodb-read-ahead-threshold=#
Scope: Global
Dynamic: Yes

`innodb_read_io_threads`

Description: Prior to , this was simply the number of I/O threads for InnoDB reads. From , asynchronous I/O functionality in the InnoDB Background Thread Pool replaces the old InnoDB I/O Threads. This variable is now multiplied by 256 to determine the maximum number of concurrent asynchronous I/O read requests that can be completed by the Background Thread Pool. The default is therefore 4*256 = 1024 conccurrent asynchronous read requests. You may on rare occasions need to reduce this default on Linux systems running multiple MariaDB servers to avoid exceeding system limits, or increase if spending too much time waiting on I/O requests.
Command line: --innodb-read-io-threads=#
Scope: Global

`innodb_read_only`

Description: If set to 1 (0 is default), the server are read-only. For use in distributed applications, data warehouses or read-only media.
Command line: --innodb-read-only={0|1}
Scope: Global

`innodb_read_only_compressed`

Description: If set (the default before ), tables are read-only. This was intended to be the first step towards removing write support and deprecating the feature, but this plan has been abandoned.
Command line: --innodb-read-only-compressed, --skip-innodb-read-only-compressed
Scope: Global

`innodb_recovery_stats`

Description: If set to 1 (0 is default) and recovery is necessary on startup, the server will write detailed recovery statistics to the error log at the end of the recovery process. This Percona XtraDB variable has not been ported to XtraDB 5.6.
Command line: No
Scope: Global

`innodb_recovery_update_relay_log`

Description: If set to 1 (0 is default), the relay log info file are overwritten on crash recovery if the information differs from the InnoDB record. Should not be used if multiple storage engine types are being replicated. Previously named innodb_overwrite_relay_log_info. Removed in /XtraDB 5.6 and replaced by MySQL 5.6's relay-log-recovery
Command line: innodb-recovery-update-relay-log={0|1}

`innodb_replication_delay`

Description: Time in milliseconds for the replica server to delay the replication thread if is reached. Deprecated and ignored from .
Command line: --innodb-replication-delay=#
Scope: Global
Dynamic: Yes

`innodb_rollback_on_timeout`

Description: InnoDB usually rolls back the last statement of a transaction that's been timed out (see ). If innodb_rollback_on_timeout is set to 1 (0 is default), InnoDB will roll back the entire transaction. Before , rolling back the entire transaction was the default behavior.
Command line: --innodb-rollback-on-timeout
Scope: Global
Dynamic: No

`innodb_rollback_segments`

Description: Specifies the number of rollback segments that XtraDB/InnoDB will use within a transaction (see ). Deprecated and replaced by in . Removed in as part of an InnoDB cleanup, as it makes sense to always create and use the maximum number of rollback segments. |
Command line: --innodb-rollback-segments=#
Scope: Global

`innodb_safe_truncate`

Description: Use a backup-safe implementation and crash-safe rename operations inside InnoDB. This is not compatible with hot backup tools other than . Users who need to use such tools may set this to OFF.
Command line: --innodb-safe-truncate={0|1}
Scope: Global

`innodb_scrub_log`

Description: Enable scrubbing. See . Deprecated and ignored from , as never really worked ( and ). If old log contents should be kept secret, then enabling or setting a smaller could help.
Command line: --innodb-scrub-log
Scope: Global

`innodb_scrub_log_interval`

Description: Used with in 10.1.3 only - replaced in 10.1.4 by . scrubbing interval in milliseconds.
Command line: --innodb-scrub-log-interval=#
Scope: Global
Dynamic: Yes

`innodb_scrub_log_speed`

Description: scrubbing speed in bytes/sec. See . Deprecated and ignored from .
Command line: --innodb-scrub-log-speed=#
Scope: Global
Dynamic: Yes

`innodb_sched_priority_cleaner`

Description: Set a thread scheduling priority for cleaner and least-recently used (LRU) manager threads. The range from 0 to 39 corresponds in reverse order to Linux nice values of -20 to 19. So 0 is the lowest priority (Linux nice value 19) and 39 is the highest priority (Linux nice value -20). XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.

`innodb_show_locks_held`

Description: Specifies the number of locks held for each InnoDB transaction to be displayed in output. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-show-locks-held=#
Scope: Global
Dynamic: Yes

`innodb_show_verbose_locks`

Description: If set to 1, and is also ON, the traditional InnoDB behavior is followed and locked records are shown in output. If set to 0, the default, only high-level information about the lock is shown. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-show-verbose-locks=#
Scope: Global

`innodb_simulate_comp_failures`

Description: Simulate compression failures. Used for testing robustness against random compression failures. XtraDB only.
Command line: None
Scope: Global
Dynamic: Yes

`innodb_snapshot_isolation`

Description: Use snapshot isolation (write-write conflict detection). If set, if an attempt to acquire a lock on a record that does not exist in the current read view is made, an error DB_RECORD_CHANGED (HA_ERR_RECORD_CHANGED, ER_CHECKREAD) are raised. This error are treated in the same way as a deadlock and the transaction are rolled back. When set, the default isolation level, arecome Snapshot Isolation. Prior to , the default is OFF for backwards compatibility.
Command line: --innodb-snapshot-isolation={0|1}
Scope: Global, Session

`innodb_sort_buffer_size`

Description: Size of the sort buffers used for sorting data when an InnoDB index is created, as well as the amount by which the temporary log file is extended during online DDL operations to record concurrent writes. The larger the setting, the fewer merge phases are required between buffers while sorting. When a or creates a new index, three buffers of this size are allocated, as well as pointers for the rows in the buffer.
Command line: --innodb-sort-buffer-size=#
Scope: Global

`innodb_spin_wait_delay`

Description: Maximum delay (not strictly corresponding to a time unit) between spin lock polls. Default changed from 6 to 4 in , as this was verified to give the best throughput by OLTP update index and read-write benchmarks on Intel Broadwell (2/20/40) and ARM (1/46/46).
Command line: --innodb-log-spin-wait-delay=#
Scope: Global

`innodb_stats_auto_recalc`

Description: If set to 1 (the default), persistent statistics are automatically recalculated when the table changes significantly (more than 10% of the rows). Affects tables created or altered with STATS_PERSISTENT=1 (see ), or when is enabled. determines how much data to sample when recalculating. See .
Command line: --innodb-stats-auto-recalc={0|1}
Scope: Global

`innodb_stats_auto_update`

Description: If set to 0 (1 is default), index statistics will not be automatically calculated except when an is run, or the table is first opened. Replaced by in /XtraDB 5.6.
Scope: Global
Dynamic: Yes

`innodb_stats_include_delete_marked`

Description: Include delete marked records when calculating persistent statistics.
Scope: Global
Dynamic: Yes
Data Type: boolean

`innodb_stats_method`

Description: Determines how NULLs are treated for InnoDB index statistics purposes.
- nulls_equal: The default, all NULL index values are treated as a single group. This is usually fine, but if you have large numbers of NULLs the average group size is slanted higher, and the optimizer may miss using the index for ref accesses when it would be useful.
- nulls_unequal: The opposite approach to nulls_equal is taken, with each NULL forming its own group of one. Conversely, the average group size is slanted lower, and the optimizer may use the index for ref accesses when not suitable.

`innodb_stats_modified_counter`

Description: The number of rows modified before we calculate new statistics. If set to 0, the default, current limits are used.
Command line: --innodb-stats-modified-counter=#
Scope: Global
Dynamic: Yes

`innodb_stats_on_metadata`

Description: If set to 1, the default, XtraDB/InnoDB updates statistics when accessing the INFORMATION_SCHEMA.TABLES or INFORMATION_SCHEMA.STATISTICS tables, and when running metadata statements such as or . If set to 0, statistics are not updated at those times, which can reduce the access time for large schemas, as well as make execution plans more stable.
Command line: --innodb-stats-on-metadata
Scope: Global

`innodb_stats_persistent`

Description: produces index statistics, and this setting determines whether they are stored on disk, or be required to be recalculated more frequently, such as when the server restarts. This information is stored for each table, and can be set with the STATS_PERSISTENT clause when creating or altering tables (see ). See .
Command line: --innodb-stats-persistent={0|1}
Scope: Global

`innodb_stats_persistent_sample_pages`

Description: Number of index pages sampled when estimating cardinality and statistics for indexed columns. Increasing this value will increases index statistics accuracy, but use more I/O resources when running . See .
Command line: --innodb-stats-persistent-sample-pages=#
Scope: Global
Dynamic: Yes

`innodb_stats_sample_pages`

Description: Gives control over the index distribution statistics by determining the number of index pages to sample. Higher values produce more disk I/O, but, especially for large tables, produce more accurate statistics and therefore make more effective use of the query optimizer. Lower values than the default are not recommended, as the statistics can be quite inaccurate.
- If is enabled, then the exact number of pages configured by this system variable are sampled for statistics.
- If is disabled, then the number of pages to sample for statistics is calculated using a logarithmic algorithm, so the exact number can change depending on the size of the table. This means that more samples may be used for larger tables.

`innodb_stats_traditional`

Description: This system variable affects how the number of pages to sample for transient statistics is determined, in particular how is used.
- If is enabled, then the exact number of pages configured by the system variable are sampled for statistics.
- If is disabled, then the number of pages to sample for statistics is calculated using a logarithmic algorithm, so the exact number can change depending on the size of the table. This means that more samples may be used for larger tables.

`innodb_stats_transient_sample_pages`

Description: Gives control over the index distribution statistics by determining the number of index pages to sample. Higher values produce more disk I/O, but, especially for large tables, produce more accurate statistics and therefore make more effective use of the query optimizer. Lower values than the default are not recommended, as the statistics can be quite inaccurate.
- If is enabled, then the exact number of pages configured by this system variable are sampled for statistics.
- If is disabled, then the number of pages to sample for statistics is calculated using a logarithmic algorithm, so the exact number can change depending on the size of the table. This means that more samples may be used for larger tables.

`innodb_stats_update_need_lock`

Description: Setting to 0 (1 is default) may help reduce contention of the &dict_operation_lock, but also disables the Data_free option in . This Percona XtraDB variable has not been ported to XtraDB 5.6.
Scope: Global
Dynamic: Yes

`innodb_status_output`

Description: Enable output to the .
Command line: --innodb-status-output={0|1}
Scope: Global
Dynamic: Yes

`innodb_status_output_locks`

Description: Enable output to the and . Also requires to enable output to the error log.
Command line: --innodb-status-output-locks={0|1}
Scope: Global
Dynamic: Yes

`innodb_strict_mode`

Description: If set to 1 (the default), InnoDB will return errors instead of warnings in certain cases, similar to strict SQL mode. See for details.
Command line: --innodb-strict-mode={0|1}
Scope: Global, Session

`innodb_support_xa`

Description: If set to 1, the default, are supported. XA support ensures data is written to the in the same order to the actual database, which is critical for and disaster recovery, but comes at a small performance cost. If your database is set up to only permit one thread to change data (for example, on a replication replica with only the replication thread writing), it is safe to turn this option off. Removed in , XA transactions are always supported.
Command line: --innodb-support-xa
Scope: Global, Session

`innodb_sync_array_size`

Description: By default 1, can be increased to split internal thread co-ordinating, giving higher concurrency when there are many waiting threads.
Command line: --innodb-sync-array-size=#
Scope: Global
Dynamic: No

`innodb_sync_spin_loops`

Description: The number of times a thread waits for an InnoDB mutex to be freed before the thread is suspended.
Command line: --innodb-sync-spin-loops=#
Scope: Global
Dynamic: Yes

`innodb_table_locks`

Description: If is set to 0 (1 is default), setting innodb_table_locks to 1, the default, will cause InnoDB to lock a table internally upon a .
Command line: --innodb-table-locks
Scope: Global, Session

`innodb_thread_concurrency`

Description: Once this number of threads is reached (excluding threads waiting for locks), XtraDB/InnoDB will place new threads in a wait state in a first-in, first-out queue for execution, in order to limit the number of threads running concurrently. A setting of 0, the default, permits as many threads as necessary. A suggested setting is twice the number of CPU's plus the number of disks. Deprecated and ignored from .
Command line: --innodb-thread-concurrency=#
Scope: Global

`innodb_thread_concurrency_timer_based`

Description: If set to 1, thread concurrency are handled in a lock-free timer-based manner rather than the default mutex-based method. Depends on atomic op builtins being available. This Percona XtraDB variable has not been ported to XtraDB 5.6.
Command line: innodb-thread-concurrency-timer-based={0|1}
Scope: Global

`innodb_thread_sleep_delay`

Description: Time in microseconds that InnoDB threads sleep before joining the queue. Setting to 0 disables sleep. Deprecated and ignored from
Command line: --innodb-thread-sleep-delay=#
Scope: Global

`innodb_temp_data_file_path`

Description: Path where to store data for temporary tables. Argument is filename:size followed by options separated by ':' Multiple paths can be given separated by ';' A file size is specified (with K for kilobytes, M for megabytes and G for gigabytes). Also whether or not to autoextend the data file, max size and whether or not to on startup may also be specified.
Command line: --innodb-temp-data-file-path=path

`innodb_tmpdir`

Description: Allows an alternate location to be set for temporary non-tablespace files. If not set (the default), files are created in the usual location. Alternate location must be outside of datadir
Command line: --innodb-tmpdir=path
Scope: Global

`innodb_track_changed_pages`

Description: For faster incremental backup with , XtraDB tracks pages with changes written to them according to the and writes the information to special changed page bitmap files. This read-only variable is used for controlling this feature. See also and . XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-track-changed-pages={0|1}
Scope: Global

`innodb_track_redo_log_now`

Description: Available on debug builds only. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-track-redo-log-now={0|1}
Scope: Global
Dynamic: Yes

`innodb_truncate_temporary_tablespace_now`

Description: Set to ON to shrink the temporary tablespace.
Command line: innodb-truncate-temporary-tablespace-now={0|1}
Scope: Global
Dynamic: Yes

`innodb_undo_directory`

Description: Path to the directory (relative or absolute) that InnoDB uses to create separate tablespaces for the . . (the default value before 10.2.2) leaves the undo logs in the same directory as the other log files. From , the default value is NULL, and if no path is specified, undo tablespaces are created in the directory defined by . Use together with and . Undo logs are most usefully placed on a separate storage device.
Command line: --innodb-undo-directory=name
Scope: Global

`innodb_undo_log_truncate`

Description: When enabled, that are larger than are marked for truncation. See also . Enabling this setting may cause stalls during heavy write workloads.
Command line: --innodb-undo-log-truncate[={0|1}]
Scope: Global
Dynamic: Yes

`innodb_undo_logs`

Description: Specifies the number of rollback segments that XtraDB/InnoDB will use within a transaction (or the number of active ). By default set to the maximum, 128, it can be reduced to avoid allocating unneeded rollback segments. See the status variable for the number of undo logs available. See also and . Replaced in . The contains information about the XtraDB rollback segments. Deprecated and ignored in , as it always makes sense to use the maximum number of rollback segments.
Command line: --innodb-undo-logs=#

`innodb_undo_tablespaces`

Description: Number of tablespaces files used for dividing up the . Zero (the default before ) means that undo logs are all part of the system tablespace, which contains one undo tablespace more than the innodb_undo_tablespaces setting. A value of 1 is reset to 0 as 2 or more are needed for separate tablespaces. When the undo logs can grow large, splitting them over multiple tablespaces will reduce the size of any single tablespace. Until , must be set before InnoDB is initialized, or else MariaDB will fail to start, with an error saying that InnoDB did not find the expected number of undo tablespaces. The files are created in the directory specified by , and are named undoN, N being an integer. The default size of an undo tablespace is 10MB.From , multiple undo tablespaces are enabled by default, and the default is changed to 3 so that the space occupied by possible bursts of undo log records can be reclaimed after is set. Before , must have a non-zero setting for innodb_undo_tablespaces to take effect.

`innodb_use_atomic_writes`

Description: Implement atomic writes on supported SSD devices. See for other variables affected when this is set.
Command line: innodb-use-atomic-writes={0|1}
Scope: Global
Dynamic: No

`innodb_use_fallocate`

Description: Preallocate files fast, using operating system functionality. On POSIX systems, posix_fallocate system call is used.
- Automatically set to 1 when is set - see .
- See for more information.

`innodb_use_global_flush_log_at_trx_commit`

Description: Determines whether a user can set the variable . If set to 1, a user cannot reset the value with a SET command, while if set to 1, a user can reset the value of innodb_flush_log_at_trx_commit. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: innodb-use-global-flush-log-at-trx_commit={0|1}

`innodb_use_mtflush`

Description: Whether to enable Multi-Threaded Flush operations. For more information, see Fusion.
- InnoDB's multi-thread flush feature was deprecated in and removed from . In later versions of MariaDB, use system variable instead.
- See for more information.
Command line:

`innodb_use_native_aio`

Description: For Linux systems only, specified whether to use Linux's asynchronous I/O subsystem. Set to ON by default, it may be changed to 0 at startup if InnoDB detects a problem, or from /, if a 5.11 - 5.15 Linux kernel is detected, to avoid an io-uring bug/incompatibility (). MariaDB-10.6.6/MariaDB-10.7.2 and later also consider 5.15.3+ as a fixed kernel and default to ON. To really benefit from the setting, the files should be opened in O_DIRECT mode (, default from ), to bypass the file system cache. In this way, the reads and writes can be submitted with DMA, using the InnoDB buffer pool directly, and no processor cycles need to be used for copying data.
Command line: --innodb-use-native-aio={0|1}

`innodb_use_purge_thread`

Description: Usually with InnoDB, data changed by a transaction is written to an undo space to permit read consistency, and freed when the transaction is complete. Many, or large, transactions, can cause the main tablespace to grow dramatically, reducing performance. This option, introduced in XtraDB 5.1 and removed for 5.5, allows multiple threads to perform the purging, resulting in slower, but much more stable performance.
Command line: --innodb-use-purge-thread=#
Scope: Global

`innodb_use_stacktrace`

Description: If set to ON (OFF is default), a signal handler for SIGUSR2 is installed when the InnoDB server starts. When a long semaphore wait is detected at sync/sync0array.c, a SIGUSR2 signal is sent to the waiting thread and thread that has acquired the RW-latch. For both threads a full stacktrace is produced as well as if possible. XtraDB only. Added as a deprecated and ignored option in (which uses InnoDB as default instead of XtraDB) to allow for easier upgrades.
Command line: --innodb-use-stacktrace={0|1}
Scope: Global

`innodb_use_sys_malloc`

Description: If set the 1, the default, XtraDB/InnoDB will use the operating system's memory allocator. If set to 0 it will use its own. Deprecated in and removed in along with InnoDB's internal memory allocator.
Command line: --innodb-use-sys-malloc={0|1}
Scope: Global

`innodb_use_sys_stats_table`

Description: If set to 1 (0 is default), XtraDB will use the SYS_STATS system table for extra table index statistics. When a table is opened for the first time, statistics will then be loaded from SYS_STATS instead of sampling the index pages. Statistics are designed to be maintained only by running an . Replaced by MySQL 5.6's Persistent Optimizer Statistics.
Command line: innodb-use-sys-stats-table={0|1}
Scope: Global

`innodb_use_trim`

Description: Use trim to free up space of compressed blocks.
- See for more information.
Command line: --innodb-use-trim={0|1}
Scope: Global

`innodb_version`

Description: InnoDB version number. From , as the InnoDB implementation in MariaDB has diverged from MySQL, the MariaDB version is instead reported. For example, the InnoDB version reported in (which is based on MySQL 5.6) included encryption and variable-size page compression before MySQL 5.7 introduced them. (based on MySQL 5.7) introduced persistent AUTO_INCREMENT () in a GA release before MySQL 8.0. (based on MySQL 5.7) introduced instant ADD COLUMN () before MySQL.
Scope: Global
Dynamic: No

`innodb_write_io_threads`

Description: Prior to , this was simply the number of I/O threads for InnoDB writes. From , asynchronous I/O functionality in the InnoDB Background Thread Pool replaces the old InnoDB I/O Threads. This variable is now multiplied by 256 to determine the maximum number of concurrent asynchronous I/O write requests that can be completed by the Background Thread Pool. The default is therefore 4*256 = 1024 conccurrent asynchronous write requests. You may on rare occasions need to reduce this default on Linux systems running multiple MariaDB servers to avoid exceeding system limits, or increase if spending too much time waiting on I/O requests.
Command line: --innodb-write-io-threads=#
Scope: Global

_{This page is licensed: CC BY-SA / Gnu FDL}

Storage Engines

Storage Engines Overview

Overview

Examples

Identify the Default Storage Engine

Set the Default Storage Engine

Configure the Default Storage Engine

Identify Available Storage Engines

Choose Storage Engine for a New Table

Resources

Engines for System Tables

FAQ

Can I use more than one storage engine on a server?

Can I use more than one storage engine in a single query?

What storage engine should I use for transactional or OLTP workloads?

What storage engine should I use for analytical or OLAP workloads?

What storage engine should I use if my application performs both transactional and analytical queries?

Reference

MariaDB Server Reference

ARCHIVE

ARIA

Aria Group Commit

Terminology

Non Group commit logic (aria_group_commit="none")

If hard group commit is enabled (aria_group_commit="hard")

If hard commit and aria_group_commit_interval=0

If hard commit and aria_group_commit_interval > 0

If soft group commit is enabled (aria_group_commit="soft")

Code

Aria Status Variables

Aria_pagecache_blocks_not_flushed

Aria_pagecache_blocks_unused

Aria_pagecache_blocks_used

Aria_pagecache_read_requests

Aria_pagecache_reads

Aria_pagecache_write_requests

Aria_pagecache_writes

Aria_transaction_log_syncs

Aria Storage Formats

Fixed-length

Dynamic

Page

Transactional

Aria Two-step Deadlock Detection

Description

How it Works

The Aria Name

Backstory

CONNECT

Introduction to the CONNECT Engine

Using CONNECT

Using CONNECT - Condition Pushdown

Using CONNECT - Exporting Data From MariaDB

Using CONNECT - General Information

Performance

Create Table statement

Drop Table statement

Alter Table statement

Update and Delete for File Tables

Using CONNECT - Importing File Data Into MariaDB Tables

USING CONNECT - Offline Documentation

CONNECT Table Types

CONNECT - NoSQL Table Types

CONNECT Table Types - OEM: Implemented in an External LIB

CONNECT VEC Table Type

CONNECT Security

Current Status of the CONNECT Handler

CSV

FederatedX

Differences Between FederatedX and Federated

New features in FederatedX

Different behavior

InnoDB

InnoDB Doublewrite Buffer

Doublewrite Buffer Settings

InnoDB Architecture for MariaDB Enterprise Server

MariaDB Enterprise Server InnoDB Background Thread Pool

Overview

InnoDB Online DDL

Storage Engines

`Aria_pagecache_blocks_not_flushed`

`Aria_pagecache_blocks_unused`

`Aria_pagecache_blocks_used`

`Aria_pagecache_read_requests`

`Aria_pagecache_reads`

`Aria_pagecache_write_requests`

`Aria_pagecache_writes`

`Aria_transaction_log_syncs`

`Aria_pagecache_blocks_not_flushed`

`Aria_pagecache_blocks_unused`

`Aria_pagecache_blocks_used`

`Aria_pagecache_read_requests`

`Aria_pagecache_reads`

`Aria_pagecache_write_requests`

`Aria_pagecache_writes`

`Aria_transaction_log_syncs`