1 of 76

Special Functions

Explore special functions in MariaDB Server. This section details unique SQL functions that provide specialized capabilities, often related to server internals, diagnostics, or specific data handling.

Dynamic Columns Functions

Manage schema-less data within relational tables. These functions, such as COLUMN_CREATE and COLUMN_GET, allow you to store and retrieve variable sets of columns in a single BLOB field.

COLUMN_ADD

Update dynamic columns. This function adds or updates values within a dynamic column blob, returning the new blob content.

Syntax

Description

Adds or updates .

COLUMN_CHECK

Verify dynamic column integrity. This function checks if a blob containing dynamic columns is valid and returns 1 if it is, 0 otherwise.

Syntax

Description

Check if dyncol_blob

COLUMN_CREATE

Create a dynamic column blob. This function generates a binary string containing specified column names and values for storage in a BLOB.

Syntax

Description

Returns a blob that stores the specified columns with values.

COLUMN_EXISTS

Check for a dynamic column. This function returns 1 if a specified column exists within a dynamic column blob, and 0 otherwise.

Syntax

COLUMN_EXISTS(dyncol_blob, column_nr)
COLUMN_EXISTS(dyncol_blob, column_name)

Description

Checks if a column with name column_name exists in dyncol_blob. If yes, return 1, otherwise return 0. See for more information.

_{This page is licensed: CC BY-SA / Gnu FDL}

COLUMN_GET

Retrieve a dynamic column value. This function extracts a specific column's value from a dynamic column blob, casting it to a specified type.

Syntax

Description

Gets the value of a by its name. If no column with the given name exists, NULL

COLUMN_JSON

Convert dynamic columns to JSON. This function returns a JSON string representation of the data stored in a dynamic column blob.

Syntax

Description

Returns a JSON representation of data in dyncol_blob

COLUMN_LIST

List dynamic column names. This function returns a comma-separated list of all column names contained within a dynamic column blob.

Syntax

COLUMN_LIST(dyncol_blob);

Description

Returns a comma-separated list of column names. The names are quoted with backticks.

See for more information.

_{This page is licensed: CC BY-SA / Gnu FDL}

Galera Functions

Interact with the Galera Cluster plugin. These functions provide internal status information and control mechanisms specific to synchronous multi-master replication nodes.

WSREP_LAST_SEEN_GTID

Return the GTID of the most recent write transaction. This function helps determine the transaction ID to use for synchronization waiting.

Syntax

WSREP_LAST_SEEN_GTID()

Description

Returns the Global Transaction ID of the most recent write transaction observed by the client.

The result can be useful to determine the transaction to provide to for waiting and unblocking purposes.

_{This page is licensed: CC BY-SA / Gnu FDL}

WSREP_LAST_WRITTEN_GTID

Return the GTID of the client's last write. This function identifies the specific transaction ID generated by the current client's most recent write operation.

Syntax

WSREP_LAST_WRITTEN_GTID()

Description

Returns the Global Transaction ID of the most recent write transaction performed by the client.

_{This page is licensed: CC BY-SA / Gnu FDL}

WSREP_SYNC_WAIT_UPTO_GTID

Wait for a specific GTID to be applied. This function blocks the client until the node has committed the transaction with the specified Global Transaction ID.

Syntax

WSREP_SYNC_WAIT_UPTO_GTID(gtid[,timeout])

Description

Blocks the client until the transaction specified by the given Global Transaction ID is applied and committed by the node.

The optional timeout argument can be used to specify a block timeout in seconds. If not provided, the timeout will be indefinite.

Returns the node that applied and committed the Global Transaction ID, ER_LOCAL_WAIT_TIMEOUT if the function is timed out before this, or ER_WRONG_ARGUMENTS if the function is given an invalid GTID.

The result from can be useful to determine the transaction to provide to WSREP_SYNC_WAIT_UPTO_GTID for waiting and unblocking purposes.

_{This page is licensed: CC BY-SA / Gnu FDL}

Geographic Functions

Process geospatial data. This collection of functions allows you to create, analyze, and manipulate geometric shapes like points, lines, and polygons within your database.

Geographic and geometry functions. See Geographic Features for a full discussion of MariaDB's spatial extensions.

JSON Functions

Manipulate JSON documents natively. Use these functions to extract values, modify structures, and query data stored in JSON format directly within your SQL statements.

JSONPath Expressions

Understand JSONPath syntax. This guide explains how to use JSONPath to select and extract specific elements, objects, or arrays from JSON documents.

A number of accept JSON Path expressions. MariaDB defines this path as follows:

JSON Path Syntax

The path starts with an optional path mode. At the moment, MariaDB supports only the "lax" mode, which is also the mode that is used when it is not explicitly specified.

The $ symbol represents the context item. The search always starts from the context item; because of that, the path always starts with $.

Differences between JSON_QUERY and JSON_VALUE

Learn the difference between JSON_QUERY and JSON_VALUE in MariaDB. JSON_QUERY returns objects or arrays, while JSON_VALUE extracts scalar values.

The primary difference between the two functions is that JSON_QUERY returns an object or an array, while JSON_VALUE returns a scalar.

Take the following JSON document as an example:

Note that data member "x" is an array, and data members "y" and "z" are strings. The following examples demonstrate the differences between the two functions.

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_ARRAY

Create a JSON array. This function evaluates a list of values and returns a JSON array containing those values.

Syntax

Description

Returns a JSON array containing the listed values. The list can be empty.

JSON_ARRAYAGG

Aggregate values into a JSON array. This function aggregates a result set column into a single JSON array.

JSON_ARRAYAGG is available from MariaDB 10.5.

Syntax

JSON_ARRAYAGG(column_or_expression)

Description

JSON_ARRAYAGG returns a JSON array containing an element for each value in a given set of JSON or SQL values. It acts on a column or an expression that evaluates to a single value.

The maximum returned length in bytes is determined by the server system variable.

Returns NULL in the case of an error, or if the result contains no rows.

JSON_ARRAYAGG cannot currently be used as a .

The full syntax is as follows:

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_ARRAY_APPEND

Explore JSON_ARRAY_APPEND in MariaDB. This function appends values to the end of specified arrays within a JSON document, returning the modified result.

Syntax

JSON_ARRAY_APPEND(json_doc, path, value[, path, value] ...)

Description

Appends values to the end of the specified arrays within a JSON document, returning the result, or NULL if any of the arguments are NULL.

Evaluation is performed from left to right, with the resulting document from the previous pair becoming the new value against which the next pair is evaluated.

If the json_doc is not a valid JSON document, or if any of the paths are not valid, or contain a * or ** wildcard, an error is returned.

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_ARRAY_INSERT

This function inserts values into a JSON document at a specified path, returning the modified document, and supports evaluating multiple path-value pairs sequentially.

Syntax

Description

Inserts a value into a JSON document, returning the modified document, or NULL

JSON_ARRAY_INTERSECT

Available from version 11.2, this function compares two JSON arrays and returns a new array containing only the items present in both.

JSON_ARRAY_INTERSECT is available from MariaDB 11.2.

Syntax

JSON_ARRAY_INTERSECT(arr1, arr2)

Description

Finds intersection between two json arrays and returns an array of items found in both array.

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_COMPACT

Learn about the JSON_COMPACT function in MariaDB. This function removes all unnecessary whitespace from a JSON document, minimizing its size for efficient storage and transmission.

Syntax

Description

Removes all unnecessary spaces so the json document is as short as possible.

JSON_CONTAINS

Check for JSON containment. This function returns 1 if a candidate JSON document is contained within a target JSON document, or 0 otherwise.

Syntax

JSON_CONTAINS(json_doc, val[, path])

Description

Returns whether or not the specified value is found in the given JSON document or, optionally, at the specified path within the document. Returns 1 if it does, 0 if not and NULL if any of the arguments are null. An error occurs if the document or path is not valid, or contains the * or ** wildcards.

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_CONTAINS_PATH

Check for a JSON path. This function returns 1 if a specified path exists within a JSON document, or 0 if it does not.

Syntax

Description

Indicates whether the given JSON document contains data at the specified path or paths. Returns 1

JSON_DEPTH

Discover JSON_DEPTH in MariaDB. This function returns the maximum depth of a JSON document, assigning a depth of 1 to scalars and empty structures, and higher values for nested data.

Syntax

Description

Returns the maximum depth of the given JSON document, or NULL if the argument is null. An error occurs if the argument is an invalid JSON document.

Scalar values or empty arrays or objects have a depth of 1.
Arrays with only scalar values and objects with only scalar values for all keys have depth of 1.
In all other cases, the depth can be 2 or greater.

There is no maximum depth level — it's unlimited.

For more information, see .

The maximum depth is 32.

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_DETAILED

Explore JSON_DETAILED in MariaDB. Also known as JSON_PRETTY, this function formats JSON documents with indentation and newlines to emphasize nested structures for readability.

Syntax

Description

Represents JSON in the most understandable way emphasizing nested structures.

JSON_EXISTS

Explore JSON_EXISTS in MariaDB. This function checks whether a specified JSON document contains an element at a given path, returning 1 for existence and 0 otherwise.

Syntax

Description

Determines whether json_doc

JSON_EXTRACT

Extract data from a JSON document. This function returns data from a JSON document selected by a given path.

Syntax

Description

Extracts data from a JSON document. The extracted data is selected from the parts matching the path arguments. Returns all matched values; either as a single matched value, or, if the arguments could return multiple values, a result autowrapped as an array in the matching order.

JSON_KEY_VALUE

Explore JSON_KEY_VALUE in MariaDB. Available from version 11.2, this function extracts key/value pairs from a JSON object, enabling easier data transformation and usage with JSON_TABLE.

JSON_KEY_VALUE is available from MariaDB 11.2.

Syntax

JSON_MERGE_PATCH

Learn about JSON_MERGE_PATCH in MariaDB. This RFC 7396-compliant function merges JSON documents by overwriting duplicate keys, serving as a modern replacement for the deprecated JSON_MERGE.

Syntax

Description

Merges the given JSON documents, returning the merged result, or NULL

JSON_OBJECT

Create a JSON object. This function evaluates a list of key-value pairs and returns a JSON object containing those pairs.

Syntax

JSON_OBJECT([key, value[, key, value] ...])

Description

Returns a JSON object containing the given key/value pairs. The key/value list can be empty.

An error will occur if there are an odd number of arguments, or any key name is NULL.

Example

JSON_OBJECT_FILTER_KEYS

Discover JSON_OBJECT_FILTER_KEYS in MariaDB. Available from version 11.2, this function returns a new JSON object containing only the key-value pairs where the keys match those provided in a specified

JSON_OBJECT_FILTER_KEYS is available from MariaDB 11.2.

Syntax

JSON_OBJECT_FILTER_KEYS(obj, array_keys)

Description

JSON_OBJECT_FILTER_KEYS returns a JSON object with keys from the object that are also present in the array as string. It is used when one wants to get key-value pair such that the keys are common but the values may not be common.

Example

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_OBJECT_TO_ARRAY

Explore JSON_OBJECT_TO_ARRAY in MariaDB. Available from version 11.2, this function converts JSON objects into arrays of key-value pairs, facilitating operations like finding common values when used w

JSON_OBJECT_TO_ARRAY is available from MariaDB 11.2.

Syntax

JSON_OBJECTAGG

Aggregate key-value pairs into a JSON object. This function aggregates two columns or expressions into a single JSON object.

JSON_OBJECTAGG is available from MariaDB 10.5.

Syntax

JSON_OVERLAPS

Discover JSON_OVERLAPS in MariaDB. Available from version 10.9, this function compares two JSON documents, returning true if they share at least one common key-value pair, array element, or scalar val

JSON_OVERLAPS is available from MariaDB 10.9.

Syntax

JSON_PRETTY

Learn about JSON_PRETTY in MariaDB. This function is an alias for JSON_DETAILED, serving to format JSON documents in a human-readable way by adding indentation and newlines.

JSON_PRETTY is available from MariaDB 10.10.3, 10.9.5, 10.8.7, 10.7.8, 10.6.12, 10.5.19, and 10.4.28.

JSON_PRETTY is an alias for .

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_QUERY

Extract a JSON object or array. This function extracts data from a JSON document at a given path, returning a JSON object or array.

Syntax

JSON_QUERY(json_doc, path)

Description

Given a JSON document, returns an object or array specified by the path. Returns NULL if not given a valid JSON document, or if there is no match.

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_QUOTE

Quote a string as a JSON value. This function wraps a string with double quotes and escapes special characters to create a valid JSON string literal.

Syntax

JSON_QUOTE(json_value)

Description

Quotes a string as a JSON value, usually for producing valid JSON string literals for inclusion in JSON documents. Wraps the string with double quote characters and escapes interior quotes and other special characters, returning a utf8mb4 string.

Returns NULL if the argument is NULL.

Examples

_{This page is licensed: CC BY-SA / Gnu FDL}

JSON_REMOVE

Remove data from a JSON document. This function removes data from a JSON document at a specified path and returns the result.

Syntax

Description

Removes data from a JSON document returning the result, or NULL

JSON_REPLACE

Replace values in a JSON document. This function replaces existing values in a JSON document and returns the result.

Syntax

Description

Replaces existing values in a JSON document, returning the result, or NULL

JSON_TYPE

Return the type of a JSON value. This function returns a string indicating the type of a JSON value (e.g., OBJECT, ARRAY, INTEGER).

Syntax

Description

Returns the type of a JSON value (as a string), or NULL

Window Functions

Explore window functions in MariaDB Server. This section details SQL functions that perform calculations across a set of table rows related to the current row, enabling advanced analytical queries.

NTH_VALUE

Return the value from the N-th row. This function retrieves the value of an expression from the N-th row within the window frame.

Syntax

Description

The NTH_VALUE

CREATE TABLE obj_table(val_obj JSON CHECK(JSON_SCHEMA_VALID('{ "type":"object", "properties": { "number1":{ "type":"number", "maximum":5, "const":4 }, "string1":{ "type":"string", "maxLength":5, "minLength":3 }, "object1":{ "type":"object", "properties":{ "key1": {"type":"string"}, "key2":{"type":"array"}, "key3":{"type":"number", "minimum":3} }, "dependentRequired": { "key1":["key3"] } } }, "required":["number1","object1"] }', val_obj))); INSERT INTO obj_table VALUES( '{"number1":4, "string1":"abcd", "object1":{"key1":"val1", "key2":[1,2,3, "string1"], "key3":4}}' ); INSERT INTO obj_table VALUES( '{"number1":3, "string1":"abcd", "object1":{"key1":"val1", "key2":[1,2,3, "string1"], "key3":4}}' ); ERROR 4025 (23000): CONSTRAINT `obj_table.val_obj` failed for `test`.`obj_table` SELECT * FROM obj_table; +--------------------------------------------------------------------------------------------------+ | val_obj | +--------------------------------------------------------------------------------------------------+ | {"number1":4, "string1":"abcd", "object1":{"key1":"val1", "key2":[1,2,3, "string1"], "key3":4}} | +--------------------------------------------------------------------------------------------------+ SET @schema= '{ "properties" : { "number1":{ "maximum":10 }, "string1" : { "maxLength": 3} } }'; SELECT JSON_SCHEMA_VALID(@schema, '{ "number1":25, "string1":"ab" }'); +----------------------------------------------------------------+ | JSON_SCHEMA_VALID(@schema, '{ "number1":25, "string1":"ab" }') | +----------------------------------------------------------------+ | 0 | +----------------------------------------------------------------+ SELECT JSON_SCHEMA_VALID(@schema, '{ "number1":10, "string1":"ab" }'); +----------------------------------------------------------------+ | JSON_SCHEMA_VALID(@schema, '{ "number1":10, "string1":"ab" }') | +----------------------------------------------------------------+ | 1 | +----------------------------------------------------------------+

CREATE TABLE book_rating (name CHAR(30), star_rating TINYINT); INSERT INTO book_rating VALUES ('Lord of the Ladybirds', 5); INSERT INTO book_rating VALUES ('Lord of the Ladybirds', 3); INSERT INTO book_rating VALUES ('Lady of the Flies', 1); INSERT INTO book_rating VALUES ('Lady of the Flies', 2); INSERT INTO book_rating VALUES ('Lady of the Flies', 5); SELECT name, PERCENTILE_CONT(0.5) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+--------------+ | name | pc | +-----------------------+--------------+ | Lord of the Ladybirds | 4.0000000000 | | Lord of the Ladybirds | 4.0000000000 | | Lady of the Flies | 2.0000000000 | | Lady of the Flies | 2.0000000000 | | Lady of the Flies | 2.0000000000 | +-----------------------+--------------+ SELECT name, PERCENTILE_CONT(1) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+--------------+ | name | pc | +-----------------------+--------------+ | Lord of the Ladybirds | 5.0000000000 | | Lord of the Ladybirds | 5.0000000000 | | Lady of the Flies | 5.0000000000 | | Lady of the Flies | 5.0000000000 | | Lady of the Flies | 5.0000000000 | +-----------------------+--------------+ SELECT name, PERCENTILE_CONT(0) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+--------------+ | name | pc | +-----------------------+--------------+ | Lord of the Ladybirds | 3.0000000000 | | Lord of the Ladybirds | 3.0000000000 | | Lady of the Flies | 1.0000000000 | | Lady of the Flies | 1.0000000000 | | Lady of the Flies | 1.0000000000 | +-----------------------+--------------+ SELECT name, PERCENTILE_CONT(0.6) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+--------------+ | name | pc | +-----------------------+--------------+ | Lord of the Ladybirds | 4.2000000000 | | Lord of the Ladybirds | 4.2000000000 | | Lady of the Flies | 2.6000000000 | | Lady of the Flies | 2.6000000000 | | Lady of the Flies | 2.6000000000 | +-----------------------+--------------+

CREATE TABLE book_rating (name CHAR(30), star_rating TINYINT); INSERT INTO book_rating VALUES ('Lord of the Ladybirds', 5); INSERT INTO book_rating VALUES ('Lord of the Ladybirds', 3); INSERT INTO book_rating VALUES ('Lady of the Flies', 1); INSERT INTO book_rating VALUES ('Lady of the Flies', 2); INSERT INTO book_rating VALUES ('Lady of the Flies', 5); SELECT name, PERCENTILE_DISC(0.5) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+------+ | name | pc | +-----------------------+------+ | Lord of the Ladybirds | 3 | | Lord of the Ladybirds | 3 | | Lady of the Flies | 2 | | Lady of the Flies | 2 | | Lady of the Flies | 2 | +-----------------------+------+ 5 rows in set (0.000 sec) SELECT name, PERCENTILE_DISC(0) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+------+ | name | pc | +-----------------------+------+ | Lord of the Ladybirds | 3 | | Lord of the Ladybirds | 3 | | Lady of the Flies | 1 | | Lady of the Flies | 1 | | Lady of the Flies | 1 | +-----------------------+------+ 5 rows in set (0.000 sec) SELECT name, PERCENTILE_DISC(1) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+------+ | name | pc | +-----------------------+------+ | Lord of the Ladybirds | 5 | | Lord of the Ladybirds | 5 | | Lady of the Flies | 5 | | Lady of the Flies | 5 | | Lady of the Flies | 5 | +-----------------------+------+ 5 rows in set (0.000 sec) SELECT name, PERCENTILE_DISC(0.6) WITHIN GROUP (ORDER BY star_rating) OVER (PARTITION BY name) AS pc FROM book_rating; +-----------------------+------+ | name | pc | +-----------------------+------+ | Lord of the Ladybirds | 5 | | Lord of the Ladybirds | 5 | | Lady of the Flies | 2 | | Lady of the Flies | 2 | | Lady of the Flies | 2 | +-----------------------+------

create table t1 ( pk int primary key, a int, b int ); insert into t1 values ( 1 , 0, 10), ( 2 , 0, 10), ( 3 , 1, 10), ( 4 , 1, 10), ( 8 , 2, 10), ( 5 , 2, 20), ( 6 , 2, 20), ( 7 , 2, 20), ( 9 , 4, 20), (10 , 4, 20); select pk, a, b, rank() over (order by a) as rank, percent_rank() over (order by a) as pct_rank, cume_dist() over (order by a) as cume_dist from t1; +----+------+------+------+--------------+--------------+ | pk | a | b | rank | pct_rank | cume_dist | +----+------+------+------+--------------+--------------+ | 1 | 0 | 10 | 1 | 0.0000000000 | 0.2000000000 | | 2 | 0 | 10 | 1 | 0.0000000000 | 0.2000000000 | | 3 | 1 | 10 | 3 | 0.2222222222 | 0.4000000000 | | 4 | 1 | 10 | 3 | 0.2222222222 | 0.4000000000 | | 5 | 2 | 20 | 5 | 0.4444444444 | 0.8000000000 | | 6 | 2 | 20 | 5 | 0.4444444444 | 0.8000000000 | | 7 | 2 | 20 | 5 | 0.4444444444 | 0.8000000000 | | 8 | 2 | 10 | 5 | 0.4444444444 | 0.8000000000 | | 9 | 4 | 20 | 9 | 0.8888888889 | 1.0000000000 | | 10 | 4 | 20 | 9 | 0.8888888889 | 1.0000000000 | +----+------+------+------+--------------+--------------+ select pk, a, b, percent_rank() over (order by pk) as pct_rank, cume_dist() over (order by pk) as cume_dist from t1 order by pk; +----+------+------+--------------+--------------+ | pk | a | b | pct_rank | cume_dist | +----+------+------+--------------+--------------+ | 1 | 0 | 10 | 0.0000000000 | 0.1000000000 | | 2 | 0 | 10 | 0.1111111111 | 0.2000000000 | | 3 | 1 | 10 | 0.2222222222 | 0.3000000000 | | 4 | 1 | 10 | 0.3333333333 | 0.4000000000 | | 5 | 2 | 20 | 0.4444444444 | 0.5000000000 | | 6 | 2 | 20 | 0.5555555556 | 0.6000000000 | | 7 | 2 | 20 | 0.6666666667 | 0.7000000000 | | 8 | 2 | 10 | 0.7777777778 | 0.8000000000 | | 9 | 4 | 20 | 0.8888888889 | 0.9000000000 | | 10 | 4 | 20 | 1.0000000000 | 1.0000000000 | +----+------+------+--------------+--------------+ select pk, a, b, percent_rank() over (partition by a order by a) as pct_rank, cume_dist() over (partition by a order by a) as cume_dist from t1; +----+------+------+--------------+--------------+ | pk | a | b | pct_rank | cume_dist | +----+------+------+--------------+--------------+ | 1 | 0 | 10 | 0.0000000000 | 1.0000000000 | | 2 | 0 | 10 | 0.0000000000 | 1.0000000000 | | 3 | 1 | 10 | 0.0000000000 | 1.0000000000 | | 4 | 1 | 10 | 0.0000000000 | 1.0000000000 | | 5 | 2 | 20 | 0.0000000000 | 1.0000000000 | | 6 | 2 | 20 | 0.0000000000 | 1.0000000000 | | 7 | 2 | 20 | 0.0000000000 | 1.0000000000 | | 8 | 2 | 10 | 0.0000000000 | 1.0000000000 | | 9 | 4 | 20 | 0.0000000000 | 1.0000000000 | | 10 | 4 | 20 | 0.0000000000 | 1.0000000000 | +----+------+------+--------------+--------------+

Window Functions Overview

Understand window function concepts. These functions operate on a set of rows (window) defined by an OVER clause, returning a value for each row without collapsing results.

Introduction

Window functions allow calculations to be performed across a set of rows related to the current row.

Syntax

Description

Window functions perform calculations across a set of rows (in a defined window).

Dedicated window functions include

that can also be used as window functions include

Window function queries are characterised by the OVER keyword, following which the set of rows used for the calculation is specified. By default, the set of rows used for the calculation (the "window) is the entire dataset, which can be ordered with the ORDER BY clause. The PARTITION BY clause is used to reduce the window to a particular group within the dataset.

Consider the following data:

The following two queries return the average partitioned by test and by name respectively:

It is also possible to specify which rows to include for the window function (for example, the current row and all preceding rows). See for more details.

Scope

Window functions were introduced in SQL:2003, and their definition was expanded in subsequent versions of the standard. The last expansion was in the latest version of the standard, SQL:2011.

Most database products support a subset of the standard, they implement some functions defined as late as in SQL:2011, and at the same time leave some parts of SQL:2008 unimplemented.

MariaDB:

Supports ROWS and RANGE-type frames
- All kinds of frame bounds are supported, including RANGE PRECEDING|FOLLOWING n frame bounds (unlike PostgreSQL or MS SQL Server)
- Does not yet support DATE[TIME] datatype and arithmetic for RANGE-type frames ()

Examples

Given the following sample data:

First, let's order the records by email alphabetically, giving each an ascending rnum value starting with 1. This will make use of the window function:

We can generate separate sequences based on account type, using the PARTITION BY clause:

Given the following structure and data, we want to find the top 5 salaries from each department.

We could do this without using window functions, as follows:

This has a number of disadvantages:

If there is no index, the query could take a long time if the employee_salary_table is large.
Adding and maintaining indexes adds overhead, and even with indexes on dept and salary, each subquery execution adds overhead by performing a lookup through the index.

Let's try achieve the same with window functions. First, generate a rank for all employees, using the function.

Each department has a separate sequence of ranks due to the PARTITION BY clause. This particular sequence of values for rank() is given by the ORDER BY clause inside the window function’s OVER clause. Finally, to get our results in a readable format we order the data by dept and the newly generated ranking column.

Now, we need to reduce the results to find only the top 5 per department. Here is a common mistake:

Trying to filter only the first 5 values per department by putting a where clause in the statement does not work, due to the way window functions are computed. The computation of window functions happens after all WHERE, GROUP BY and HAVING clauses have been completed, right before ORDER BY, so the WHERE clause has no idea that the ranking column exists. It is only present after we have filtered and grouped all the rows.

To counteract this problem, we need to wrap our query into a derived table. We can then attach a where clause to it:

ColumnStore Window Functions

Review window function support in ColumnStore. This page details the specific window functions and limitations applicable when using the ColumnStore storage engine.

Introduction

MariaDB ColumnStore provides support for window functions broadly following the SQL 2003 specification. A window function allows for calculations relating to a window of data surrounding the current row in a result set. This capability provides for simplified queries in support of common business questions such as cumulative totals, rolling averages, and top 10 lists.

Aggregate functions are utilized for window functions however differ in behavior from a group by query because the rows remain ungrouped. This provides support for cumulative sums and rolling averages, for example.

Two key concepts for window functions are Partition and Frame:

A Partition is a group of rows, or window, that have the same value for a specific column, for example a Partition can be created over a time period such as a quarter or lookup values.
The Frame for each row is a subset of the row's Partition. The frame typically is dynamic allowing for a sliding frame of rows within the Partition. The Frame determines the range of rows for the windowing function. A Frame could be defined as the last X rows and next Y rows all the way up to the entire Partition.

Window functions are applied after joins, group by, and having clauses are calculated.

Syntax

A window function is applied in the select clause using the following syntax:

where window_definition is defined as:

PARTITION BY:

Divides the window result set into groups based on one or more expressions.
An expression may be a constant, column, and non window function expressions.
A query is not limited to a single partition by clause. Different partition clauses can be used across different window function applications.
The partition by columns do not need to be in the select list but do need to be available from the query result set.

ORDER BY:

Defines the ordering of values within the partition.
Can be ordered by multiple keys which may be a constant, column or non window function expression.
The order by columns do not need to be in the select list but need to be available from the query result set.
Use of a select column alias from the query is not supported.

and the optional frame_clause is defined as:

and the optional frame_start and frame_end are defined as (value being a numeric expression):

RANGE/ROWS:

Defines the windowing clause for calculating the set of rows that the function applies to for calculating a given rows window function result.
Requires an ORDER BY clause to define the row order for the window.
ROWS specify the window in physical units, i.e. result set rows and must be a constant or expression evaluating to a positive numeric value.

Supported Functions

Function

Description

Examples

Example Schema

The examples are all based on the following simplified sales opportunity table:

Some example values are (thanks to for sample data generation):

accountName

name

owner

amount

closeDate

stageName

The schema, sample data, and queries are available as an attachment to this article.

Cumulative Sum and Running Max Example

Window functions can be used to achieve cumulative / running calculations on a detail report. In this case a won opportunity report for a 7 day period adds columns to show the accumulated won amount as well as the current highest opportunity amount in preceding rows.

with example results:

owner

accountName

CloseDate

amount

cumeWon

runningMax

Partitioned Cumulative Sum and Running Max Example

The above example can be partitioned, so that the window functions are over a particular field grouping such as owner and accumulate within that grouping. This is achieved by adding the syntax "partition by" in the window function clause.

With example results:

owner

accountName

CloseDate

amount

cumeWon

runningMax

Ranking / Top Results

The rank window function allows for ranking or assigning a numeric order value based on the window function definition. Using the Rank() function will result in the same value for ties / equal values and the next rank value skipped. The Dense_Rank() function behaves similarly except the next consecutive number is used after a tie rather than skipped. The Row_Number() function will provide a unique ordering value. The example query shows the Rank() function being applied to rank sales reps by the number of opportunities for Q4 2016.

with example results (note the query is technically incorrect by using closeDate < '2016-12-31' however this creates a tie scenario for illustrative purposes):

owner

wonCount

rank

If the dense_rank function is used the rank values would be 1,2,3,3,4 and for the row_number function the values would be 1,2,3,4,5.

First and Last Values

The first_value and last_value functions allow determining the first and last values of a given range. Combined with a group by this allows summarizing opening and closing values. The example shows a more complex case where detailed information is presented for first and last opportunity by quarter.

with example results:

year

quarter

firstAccountName

firstOwner

firstAmount

lastAccountName

lastOwner

lastAmount

Prior and Next Example

Sometimes it useful to understand the previous and next values in the context of a given row. The lag and lead window functions provide this capability. By default the offset is one providing the prior or next value but can also be provided to get a larger offset. The example query is a report of opportunities by account name showing the opportunity amount, and the prior and next opportunity amount for that account by close date.

With example results:

accountName

closeDate

currentOppAmount

priorAmount

nextAmount

Quartiles Example

The NTile window function allows for breaking up a data set into portions assigned a numeric value to each portion of the range. NTile(4) breaks the data up into quartiles (4 sets). The example query produces a report of all opportunities summarizing the quartile boundaries of amount values.

With example results:

quartile

min

max

Percentile Example

The percentile functions have a slightly different syntax from other window functions as can be seen in the example below. These functions can be only applied against numeric values. The argument to the function is the percentile to evaluate. Following 'within group' is the sort expression which indicates the sort column and optionally order. Finally after 'over' is an optional partition by clause, for no partition clause use 'over ()'. The example below utilizes the value 0.5 to calculate the median opportunity amount in the rows. The values differ sometimes because percentile_cont will return the average of the 2 middle rows for an even data set while percentile_desc returns the first encountered in the sort.

With example results:

owner

accountName

CloseDate

amount

pct_cont

pct_disc

_{This page is licensed: CC BY-SA / Gnu FDL}