1 of 9

ANALYZE and EXPLAIN Statements

Learn commands for query analysis. This section covers ANALYZE TABLE and EXPLAIN, used to view execution plans and optimize query performance.

ANALYZE FORMAT=JSON

Gain deep insight into query execution with JSON-formatted analysis. This command combines optimizer estimates with actual runtime statistics for precise performance tuning.

ANALYZE FORMAT=JSON is a mix of the EXPLAIN FORMAT=JSON and ANALYZE statement features. The ANALYZE FORMAT=JSON $statement will execute $statement, and then print the output of EXPLAIN FORMAT=JSON, amended with data from the query execution.

Basic Execution Data

You can get the following also from tabular ANALYZE statement form:

r_rows is provided for any node that reads rows. It shows how many rows were read, on average.
r_filtered is provided whenever there is a condition that is checked. It shows the percentage of rows left after checking the condition.

Advanced Execution Data

The most important data not available in the regular tabular ANALYZE statement are:

r_loops field. This shows how many times the node was executed. Most query plan elements have this field.
r_total_time_ms field. It shows how much time in total, in milliseconds, was spent executing this node. If the node has subnodes, their execution time is included.
- For UPDATE and DELETE statements, top-level query_block.r_total_time_ms does include the time to make row deletions/updates but does NOT include the time to commit the changes.

InnoDB engine statistics

Starting from , , , , , , and (), the following statistics are reported for InnoDB tables:

Only non-zero members are printed.

pages_accessed is the total number of buffer pool pages accessed when reading this table.
pages_updated is the total number of buffer pool pages that were modified during the execution of the statement.
pages_read_count is the number of pages that InnoDB had to read from disk for this table. If the query touches "hot" data in the InnoDB buffer pool, this value will be 0 and not present.

SHOW ANALYZE FORMAT=JSON

MariaDB starting with

SHOW ANALYZE FORMAT=JSON for <connection_id> extends ANALYZE [FORMAT=JSON] <select> to allow one to analyze a query currently running in another connection.

Data About Individual Query Plan Nodes

filesort node reports whether sorting was done with LIMIT n parameter, and how many rows were in the sort result.
block-nl-join node has r_loops field, which allows to tell whether Using join buffer was efficient.
range-checked-for-each-record

Use Cases

See .

_{This page is licensed: CC BY-SA / Gnu FDL}

ANALYZE FORMAT=JSON Examples

Review practical examples of ANALYZE FORMAT=JSON output. Learn to identify performance bottlenecks by comparing estimated costs against actual execution metrics.

Example #1

Customers who have ordered more than 1M goods.

ANALYZE FORMAT=JSON
SELECT COUNT(*)
FROM customer
WHERE
  (SELECT SUM(o_totalprice) FROM orders WHERE o_custkey=c_custkey) > 1000*1000;

The query takes 40 seconds over cold cache.

ANALYZE shows that 39.208 seconds were spent in the subquery, which was executed 150K times (for every row of outer table).

_{This page is licensed: CC BY-SA / Gnu FDL}

ANALYZE: Interpreting rows and filtered members

Understand the r_rows and r_filtered fields in analysis output. Learn how these actual runtime counters compare to the optimizer's rows and filtered estimates.

This article describes how to interpret r_rows and r_filtered members in ANALYZE FORMAT=JSON when an index-based access method is used.

Index-based access method

Index-based access method may employ some or all of the following:

attached_condition checking

Consider a table access which does all three:

The access is performed as follows:

Access diagram

Statistics values in MariaDB before 11.5

In MariaDB versions before 11.5, the counters were counted as follows:

that is,

r_rows is counted after Index Condition Pushdown check and Rowid Filter check.
r_filtered only counts selectivity of the attached_condition.
selectivity of the Rowid Filter is in rowid_filter.r_selectivity_pct.

Statistics values in and later versions

Starting from (), the row counters are:

r_index_rows counts the number of enumerated index tuples, before any checks are made
r_rows is the same as before - number of rows after index checks.

The selectivity counters are:

r_icp_filtered is the percentage of records left after pushed index condition check.
rowid_filter.r_selectivity_pct shows selectivity of Rowid Filter, as before.
r_filtered is the selectivity of attached_condition check, as before.

ANALYZE output members

in ANALYZE FORMAT=JSON output these members are placed as follows:

Whenever applicable, r_index_rows is shown. It is comparable with rows - both are numbers of rows to enumerate before any filtering is done. If r_index_rows is not shown, r_rows shows the number of records enumerated.

Then, filtering members:

filtered is comparable with r_total_filtered: both show total amount of filtering.

ICP and its observed filtering. The optimizer doesn't compute an estimate for this currently.

attached_condition and its observed filtering.

_{This page is licensed: CC BY-SA / Gnu FDL}

ANALYZE Statement

Learn to use the ANALYZE statement to execute a query and produce a performance report. This command reveals how close the optimizer's plan was to the actual execution.

Description

The ANALYZE statement is similar to the EXPLAIN statement. ANALYZE statement will invoke the optimizer, execute the statement, and then produce EXPLAIN output instead of the result set. The EXPLAIN output will be annotated with statistics from statement execution.

This lets one check how close the optimizer's estimates about the query plan are to the reality. ANALYZE produces an overview, while the command provides a more detailed view of the query plan and the query execution.

The syntax is

where the statement is any statement for which one can run .

Command Output

Consider an example:

Compared to EXPLAIN, ANALYZE produces two extra columns:

r_rows is an observation-based counterpart of the rows column. It shows how many rows were actually read from the table.
r_filtered is an observation-based counterpart of the filtered column. It shows which fraction of rows was left after applying the WHERE condition.

Interpreting the Output

Joins

Let's consider a more complicated example.

Here, one can see that

For table customer, customer.rows=149095, customer.r_rows=150000. The estimate for number of rows we will read was fairly precise
customer.filtered=18.08, customer.r_filtered=9.13. The optimizer somewhat overestimated the number of records that will match selectivity of condition attached to customer table (in general, when you have a full scan and r_filtered is less than 15%, it's time to consider adding an appropriate index).
For table orders, orders.rows=7, orders.r_rows=10. This means that on average, there are 7 orders for a given c_custkey, but in our case there were 10, which is close to the expectation (when this number is consistently far from the expectation, it may be time to run ANALYZE TABLE, or even edit the table statistics manually to get better query plans).

Meaning of NULL in r_rows and r_filtered

Let's modify the previous example slightly

The output of orders.r_rows=NULL and orders.r_filtered=NULL shows that the table orders was never scanned. Indeed, we can also see customer.r_filtered=0.00. This shows that a part of WHERE attached to table customer was never satisfied (or, satisfied in less than 0.01% of cases).

ANALYZE FORMAT=JSON

produces JSON output. It produces much more information than tabular ANALYZE.

Notes

ANALYZE UPDATE or ANALYZE DELETE will actually make updates/deletes (ANALYZE SELECT will perform the select operation and then discard the resultset).
PostgreSQL has a similar command, EXPLAIN ANALYZE.
The feature allows MariaDB to have ANALYZE

EXPLAIN ANALYZE

Understand the historical context of EXPLAIN ANALYZE in MariaDB. Learn how this syntax maps to the modern ANALYZE statement for profiling query execution.

Outdated syntax

The syntax for the EXPLAIN ANALYZE feature was changed to ANALYZE statement, available since . See ANALYZE statement.

_{This page is licensed: CC BY-SA / Gnu FDL}

EXPLAIN FORMAT=JSON

Get comprehensive query plans in JSON format. This output provides detailed optimizer data, including costs and attached conditions, not found in the tabular view.

Synopsis

EXPLAIN FORMAT=JSON is a variant of EXPLAIN command that produces output in JSON form. The output always has one row which has only one column titled "JSON". The contents are a JSON representation of the query plan:

Output is different from MySQL

The output of MariaDB's EXPLAIN FORMAT=JSON is different from EXPLAIN FORMAT=JSON in MySQL. The reasons for that are:

MySQL's output has deficiencies.
The output of MySQL's EXPLAIN FORMAT=JSON is not defined. Even MySQL Workbench has trouble parsing it (see this ).
MariaDB has query optimizations that MySQL does not have. This means that MariaDB generates query plans that MySQL does not generate.

Output Format

TODO: MariaDB's output format description.

EXPLAIN

Syntax

Description

The EXPLAIN statement can be used either as a synonym for or as a way to obtain information about how MariaDB executes a SELECT, UPDATE or DELETE statement:

'EXPLAIN tbl_name' is synonymous with'[DESCRIBE](../describe.md) tbl_name' or'[SHOW COLUMNS](../show/show-columns.md) FROM tbl_name'.
When you precede a SELECT, UPDATE or a DELETE statement with the keywordEXPLAIN, MariaDB displays information from the optimizer about the query execution plan. That is, MariaDB explains how it would process the SELECT

shows the output of a running statement. In some cases, its output can be closer to reality than EXPLAIN.

The runs a statement and returns information about its execution plan. It also shows additional columns, to check how much the optimizer's estimation about filtering and found rows are close to reality.

There is an online that you can use to share EXPLAIN and EXPLAIN EXTENDED output with others.

EXPLAIN can acquire metadata locks in the same way that SELECT does, as it needs to know table metadata and, sometimes, data as well.

Columns in EXPLAIN ... SELECT

Column name

Description

Here are descriptions of the values for some of the more complex columns in EXPLAIN ... SELECT:

"Select_type" Column

The select_type column can have the following values:

Value

Description

Comment

"Type" Column

This column contains information on how the table is accessed.

Value

Description

"Extra" Column

This column consists of one or more of the following values, separated by ';'

Note that some of these values are detected after the optimization phase.

The optimization phase can do the following changes to the WHERE clause:

Add the expressions from the ON and USING clauses to the WHERE clause.
Constant propagation: If there is column=constant, replace all column instances with this constant.
Replace all columns from 'const' tables with their values.

Value

Description

EXPLAIN EXTENDED

The EXTENDED keyword adds another column, filtered, to the output. This is a percentage estimate of the table rows that will be filtered by the condition.

An EXPLAIN EXTENDED will always throw a warning, as it adds extra Message information to a subsequent statement. This includes what the SELECT query would look like after optimizing and rewriting rules are applied and how the optimizer qualifies columns and tables.

Examples

As synonym for DESCRIBE or SHOW COLUMNS FROM:

A simple set of examples to see how EXPLAIN can identify poor index usage:

SELECT on a primary key:

The type is const, which means that only one possible result could be returned. Now, returning the same record but searching by their phone number:

Here, the type is All, which means no index could be used. Looking at the rows count, a full table scan (all six rows) had to be performed in order to retrieve the record. If it's a requirement to search by phone number, an index will have to be created.

example:

Example of `ref_or_null` Optimization

ref_or_null is something that often happens when you use subqueries with NOT IN as then one has to do an extra check for NULL values if the first value didn't have a matching row.

Using Buffer UPDATE Algorithm

Understand the 'Using buffer' strategy for UPDATE operations. Learn how MariaDB prevents infinite update loops when modifying indexed columns during a range scan.

This article explains the UPDATE statement's Using Buffer algorithm.

Consider the following table and query:

Name

Salary

Babatunde

1000

Jolana

1050

Pankaja

1300

UPDATE employees SET salary = salary+100 WHERE salary < 2000;

Suppose the employees table has an index on the salary column, and the optimizer decides to use a range scan on that index.

The optimizer starts a range scan on the salary index. We find the first record Babatunde, 1000. If we do an on-the-fly update, we immediately instruct the storage engine to change this record to be Babatunde, 1000+100=1100.

Then we proceed to search for the next record, and find Jolana, 1050. We instruct the storage engine to update it to be Jolana, 1050+100=1150.

Then we proceed to search for the next record ... and what happens next depends on the storage engine. In some storage engines, data changes are visible immediately, so we will find the Babatunde, 1100 record that we wrote at the first step, modifying it again, giving Babatunde an undeserved raise. Then we will see Babatunde again and again, looping continually.

In order to prevent such situations, the optimizer checks whether the UPDATE statement is going to change key values for the keys it is using. In that case, it will use a different algorithm:

Scan everyone with "salary<2000", remembering the rowids of the rows in a buffer.
Read the buffer and apply the updates.

This way, each row will be updated only once.

The Using buffer output indicates that the buffer as described above will be used.

_{This page is licensed: CC BY-SA / Gnu FDL}

EXPLAIN

Syntax

EXPLAIN tbl_name [col_name | wild]

EXPLAIN [EXTENDED | PARTITIONS | FORMAT=JSON] 
  {SELECT select_options | UPDATE update_options | DELETE delete_options}

EXPLAIN [FORMAT=JSON] FOR CONNECTION <connection_id>

Description

The EXPLAIN statement can be used either as a synonym for or as a way to obtain information about how MariaDB executes a SELECT, UPDATE or DELETE statement:

'EXPLAIN tbl_name' is synonymous with'[DESCRIBE](../describe.md) tbl_name' or'[SHOW COLUMNS](../show/show-columns.md) FROM tbl_name'.
When you precede a SELECT, UPDATE or a DELETE statement with the keywordEXPLAIN, MariaDB displays information from the optimizer about the query execution plan. That is, MariaDB explains how it would process the SELECT

shows the output of a running statement. In some cases, its output can be closer to reality than EXPLAIN.

There is an online that you can use to share EXPLAIN and EXPLAIN EXTENDED output with others.

EXPLAIN can acquire metadata locks in the same way that SELECT does, as it needs to know table metadata and, sometimes, data as well.

Columns in EXPLAIN ... SELECT

Column name

Description

Here are descriptions of the values for some of the more complex columns in EXPLAIN ... SELECT:

"Select_type" Column

The select_type column can have the following values:

Value

Description

Comment

"Type" Column

This column contains information on how the table is accessed.

Value

Description

"Extra" Column

This column consists of one or more of the following values, separated by ';'

Note that some of these values are detected after the optimization phase.

The optimization phase can do the following changes to the WHERE clause:

Add the expressions from the ON and USING clauses to the WHERE clause.
Constant propagation: If there is column=constant, replace all column instances with this constant.
Replace all columns from 'const' tables with their values.

Value

Description

EXPLAIN EXTENDED

The EXTENDED keyword adds another column, filtered, to the output. This is a percentage estimate of the table rows that will be filtered by the condition.

Examples

As synonym for DESCRIBE or SHOW COLUMNS FROM:

A simple set of examples to see how EXPLAIN can identify poor index usage:

SELECT on a primary key:

The type is const, which means that only one possible result could be returned. Now, returning the same record but searching by their phone number:

example:

Example of `ref_or_null` Optimization

ref_or_null is something that often happens when you use subqueries with NOT IN as then one has to do an extra check for NULL values if the first value didn't have a matching row.

ANALYZE and EXPLAIN Statements

ANALYZE FORMAT=JSON

Basic Execution Data

Advanced Execution Data

InnoDB engine statistics

SHOW ANALYZE FORMAT=JSON

Data About Individual Query Plan Nodes

Use Cases

ANALYZE FORMAT=JSON Examples

Example #1

ANALYZE: Interpreting rows and filtered members

Index-based access method

Access diagram

Statistics values in MariaDB before 11.5

Statistics values in and later versions

ANALYZE output members

ANALYZE Statement

Description

Command Output

Interpreting the Output

Joins

Meaning of NULL in r_rows and r_filtered

ANALYZE FORMAT=JSON

Notes

See Also

EXPLAIN ANALYZE

EXPLAIN FORMAT=JSON

Synopsis

Output is different from MySQL

Output Format

See Also

EXPLAIN

Syntax

Description

Columns in EXPLAIN ... SELECT

"Select_type" Column

"Type" Column

"Extra" Column

EXPLAIN EXTENDED

Examples

Example of ref_or_null Optimization

See Also

Using Buffer UPDATE Algorithm

ANALYZE and EXPLAIN Statements

ANALYZE FORMAT=JSON Examples

Example #1

ANALYZE FORMAT=JSON

Basic Execution Data

Advanced Execution Data

InnoDB engine statistics

SHOW ANALYZE FORMAT=JSON

Data About Individual Query Plan Nodes

Use Cases

EXPLAIN FORMAT=JSON

Synopsis

Output is different from MySQL

Output Format

See Also

Using Buffer UPDATE Algorithm

EXPLAIN ANALYZE

ANALYZE Statement

Description

Command Output

Interpreting the Output

Joins

Meaning of NULL in r_rows and r_filtered

ANALYZE FORMAT=JSON

Notes

See Also

EXPLAIN

Syntax

Description

Columns in EXPLAIN ... SELECT

"Select_type" Column

"Type" Column

"Extra" Column

EXPLAIN EXTENDED

Examples

Example of ref_or_null Optimization

See Also

Example of `ref_or_null` Optimization

Example of `ref_or_null` Optimization