
General Resources

About

About MariaDB Documentation

The documentation for MariaDB products is

  • written in standard American English,

  • using Markdown format,

  • stored in Git.

It is maintained by a team of technical writers from MariaDB plc and the MariaDB Foundation.

Until June 2025, the documentation was hosted in the MariaDB KnowledgeBase (KB). With a few exceptions, mostly concerning outdated modules and functionality, all pages were migrated to a new platform, GitBook.

Screenshot of the MariaDB documentation start page, listing the categories like MariaDB Server, Enterprise Operator, MaxScale, ColumnStore, and Connectors.
The MariaDB KnowledgeBase, which contained the MariaDB documentation until June 2025

Contributing Documentation

Instructions on how to contribute to MariaDB documentation

The documentation for MariaDB products is

  • written in standard American English,

  • using Markdown format,

  • stored in Git.

While the documentation is mainly maintained by the documentation team at MariaDB plc, we are happy to receive contributions. Because the documentation is stored in Git, anyone can contribute. You need a GitHub account, basic knowledge of Markdown, and some expertise in what you're writing about. You also have to agree to our contributor agreement.

Contributing is as simple as this:

  • Access this repository and log in with your GitHub account.

  • Find the page in the documentation that you want to edit, correct, or amend.

  • Make a pull request, edit, and submit.

The MariaDB documentation team will review your edits, smooth out any rough edges (language and style), and incorporate your contribution into the documentation. Don’t be afraid to submit imperfect contributions, as long as they’re factually correct.

Before you start making larger contributions, make yourself familiar with the basics of technical writing (a 1-minute read). This is about using proper tone and content, language and grammar, as well as formatting, punctuation, and organization.

The source format of pages is Markdown. Text formatting is described on this GitBook page. A Markdown cheat sheet with a 10-minute tutorial and a Markdown "playground" can be found here.

Read our style guide, too. (It's short!)

Also see the About Links page. It has useful information for when you are adding links to other pages.

Documentation Style Guide

Style Guide

We adhere to the Google developer documentation style guide. Here are some links to particularly important resources from that style guide:

  • Word list – an alphabetically ordered list that lets you quickly look up which words and terms to use, and which to avoid.

  • Accessibility – write inclusive documentation for a global audience.

  • Timeless documentation – avoid words and terms like currently, existing, in the future, now, or soon.

  • Capitalization – when to use the Shift key, and how to format headings and titles. We use upper case for headings, which is about the only deviation from the Google style guide.

  • Abbreviations – how to use acronyms and initialisms.

  • Punctuation – how to properly use colons, commas, dashes, etc.

  • Formatting and organization – how to write dates and times, headings and titles, and when to use lists rather than tables.

  • Cross-references – how to properly write links.

  • Code samples – how to write and format code blocks.

  • Example domain names, IP numbers, and person names – and how to use filenames and trademarks.

Additional Information

Version-specific information: We refer to software versions (like "MariaDB 10.6") only for products (like MariaDB Server or MaxScale) or features (like replication or authentication) that follow the N-1 rule, where N is the oldest version still under maintenance. (At the time of writing, that's MariaDB 10.6 for the Server.) In other words, we'd mention MariaDB 10.5, but not versions older than that. This is the general principle; we deviate from it only when there's a valid reason to do so.

Reporting Documentation Issues

Bug Reports

If you find issues in the documentation, please report them:

  1. Report only one issue per request. If you find multiple issues, report them one by one. Only report documentation issues, not software issues or software help requests.

  2. Provide the URL of the page that has an issue, for example https://mariadb.com/docs/general-resources/about/readme/reporting-documentation-bugs. ℹ️ When reporting via the rating system, the URL of the page you're on will automatically be included in your response, so there's no need to include it.

  3. Indicate the nature of the issue:

    1. Typo, for example "known bucks should be known bugs".

    2. Wrong information. Provide details of what's wrong. Ideally, point out what the right information should be.

    3. Missing information. Provide details of what's missing.

    4. Unclear information. Provide details of what's unclear. Ideally, provide a clarification.

Reporting Channels

Use one of the following channels to report documentation issues. Please don't report software issues via those channels — instructions for doing that are on this page.

Screenshot of the GitBook rating system, with sample text in the text field, and a Submit button.

Rating system in the right sidebar

This is a super quick way to provide feedback or report issues in the documentation. However, it's one-way communication — we can't provide feedback to you, since we don't know who you are. 😇 ℹ️ Don't paste the URL of the page you're reporting from, since it will automatically be included.

MariaDB Community Slack

Join the #documentation channel in MariaDB Community Slack. This allows for more detailed feedback or reports, and naturally provides two-way communication.

Screenshot of the MariaDB Community Slack, with the #documentation channel highlighted.

This page is licensed: CC BY-SA / Gnu FDL

About Links

This page is for contributors to the MariaDB Documentation and goes into detail on the internals of links. This page is not about MariaDB. If you're interested in contributing to the MariaDB Docs, please also see the Contributing Documentation and Documentation Style Guide pages.

There are three types of links in the MariaDB docs: external, relative, and space. The general rules for when to use each are:

  • If the link is outside of https://mariadb.com/docs/ → Use an External Link

  • If the link is to a page in the same space → Use a Relative Link

  • If the link is to a page in another space → Use a Space Link

See About Spaces for information on what Spaces are.
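The three rules above can be sketched as a small decision function. This is a hypothetical illustration only; the helper name and space-name arguments are not part of any real tooling:

```python
DOCS_ROOT = "https://mariadb.com/docs/"

def link_type(target_url: str, current_space: str, target_space: str) -> str:
    """Pick which kind of link the docs conventions call for (illustrative)."""
    if not target_url.startswith(DOCS_ROOT):
        return "external"   # target is outside https://mariadb.com/docs/
    if target_space == current_space:
        return "relative"   # same space: use a relative link
    return "space"          # different space: use a space link

# From a page in the General Resources space:
print(link_type("https://example.com", "general-resources", ""))  # external
print(link_type("https://mariadb.com/docs/server/security/securing-mariadb",
                "general-resources", "server"))                   # space
```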

About Spaces

In GitBook (our documentation system), Spaces are the main sections of the site you see along the top of every docs page:

Which space you are in determines whether you need to use a Relative or a Space link. GitBook identifies Spaces via a unique space identifier. See the Space Links section for more details. We also have a handy list of Space prefixes for use when creating space links in Markdown.

External Links

External links are the easiest: they point to pages outside the https://mariadb.com/docs site. Some examples of external links in Markdown syntax:

* [Example Website](https://example.com)
* [MariaDB Corp Blog](https://mariadb.com/blog)
* [MariaDB JIRA](https://jira.mariadb.org)

Technically, you can use external links for docs content by putting in the full URI of the page you want to link to. However, if you do that, we lose the ability to automatically update the link if the target page is moved or renamed. So for links to docs content, we prefer Relative Links or Space Links.

Relative Links

Relative links are links to a page in the same space, relative to the page you are editing. For example, a relative link to the Joining the Community page, from this page, looks like this in Markdown:

[Joining the Community](../../community/joining-the-community.md)

One big limitation of relative links is that they cannot cross Space boundaries.

The page you are currently reading is in the General Resources space, so we can use relative links to link to other pages under https://mariadb.com/docs/general-resources/. If we want to link to a page in another space, we need to use Space Links.
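As a sketch, the relative path in the example above can be derived mechanically from the two page paths. The repository file paths used here are assumptions for illustration; the real file layout may differ:

```python
import posixpath

# Hypothetical in-repository paths (assumed for illustration).
current_page = "about/about-links/README.md"
target_page = "community/joining-the-community.md"

# A relative link is the path from the current page's directory to the target.
rel_link = posixpath.relpath(target_page, start=posixpath.dirname(current_page))
print(rel_link)  # ../../community/joining-the-community.md
```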

Space Links

To link to pages in other Spaces we need to use special Space Links which use an internal identifier so that GitBook knows exactly what page you are pointing to.

A space link begins with https://app.gitbook.com/s/, followed by a unique alphanumeric space identifier (in this doc we'll call these two parts together the space prefix), and finally the path to the page.

The path is everything after the space name in a full page URI. For example, take the following full URI for the Securing MariaDB page:

https://mariadb.com/docs/server/security/securing-mariadb

In this URI, the space name is server and the path, if you were creating a space link, is:

/security/securing-mariadb

To convert that into a space link we need to get the Server space identifier and combine it with the path. Rather than list out just the identifiers for our spaces, we have a List of Space Prefixes that you can copy from when creating space links.

Continuing with our example, a full space link in Markdown for the Securing MariaDB page is: space prefix (for the Server space) + path:

[Securing MariaDB](https://app.gitbook.com/s/SsmexDFPv2xG2OTyO5yV/security/securing-mariadb)

See the List of Space Prefixes section for a list of all of our space prefixes.
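The prefix + path composition can be sketched in a few lines. The dictionary below copies the Server space prefix from the List of Space Prefixes; the helper function name is our own invention:

```python
# Space prefixes (see the List of Space Prefixes); only one shown here.
SPACE_PREFIXES = {
    "server": "https://app.gitbook.com/s/SsmexDFPv2xG2OTyO5yV",
}

def space_link(space: str, path: str) -> str:
    """Join a space prefix and a page path into a space link."""
    return SPACE_PREFIXES[space] + path

print(space_link("server", "/security/securing-mariadb"))
# https://app.gitbook.com/s/SsmexDFPv2xG2OTyO5yV/security/securing-mariadb
```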

List of Space Prefixes

A handy list of all space prefixes for the MariaDB Docs:

MariaDB Platform space prefix

https://app.gitbook.com/s/JqgUabdZsoY5EiaJmqgn

Server space prefix

https://app.gitbook.com/s/SsmexDFPv2xG2OTyO5yV

MaxScale space prefix

https://app.gitbook.com/s/0pSbu5DcMSW4KwAkUcmX

ColumnStore space prefix

https://app.gitbook.com/s/rBEU9juWLfTDcdwF3Q14

Galera Cluster space prefix

https://app.gitbook.com/s/3VYeeVGUV4AMqrA3zwy7

Connectors space prefix

https://app.gitbook.com/s/CjGYMsT2MVP4nd3IyW2L

Tools space prefix

https://app.gitbook.com/s/kuTXWg0NDbRx6XUeYpGD

Release Notes space prefix

https://app.gitbook.com/s/aEnK0ZXmUbJzqQrTjFyb

General Resources space prefix

https://app.gitbook.com/s/WCInJQ9cmGjq1lsTG91E

Space Link Examples

Here are some examples of Markdown links to various pages using space links:

Options, System & Status Variables in the Server space

[Options, System & Status Variables](https://app.gitbook.com/s/SsmexDFPv2xG2OTyO5yV/reference/full-list-of-mariadb-options-system-and-status-variables)

MariaDB 12.1 Changes & Improvements in the Release Notes space

[MariaDB 12.1 Changes & Improvements](https://app.gitbook.com/s/aEnK0ZXmUbJzqQrTjFyb/community-server/release-notes-mariadb-12.1-rolling-releases/changes-and-improvements-in-mariadb-12.1)

List of MariaDB Connector/C Releases in the Connectors space

[List of MariaDB Connector/C Releases](https://app.gitbook.com/s/CjGYMsT2MVP4nd3IyW2L/mariadb-connector-c/list-of-mariadb-connector-c-releases)

When Space Links are rendered to the public site, GitBook handles translating Space Links into a link to the correct page. And if a page is moved or renamed then the link will be automatically updated on every page it appears on.

About MariaDB

About MariaDB Server

MariaDB Server is a leading open source relational database, developed by the original creators of MySQL. Renowned for its speed, scalability, and robustness, it serves a wide range of applications, from banking to websites, and powers notable platforms like Wikipedia and WordPress.com. Originally designed as a drop-in replacement for MySQL, MariaDB Server offers an enhanced feature set, including a rich ecosystem of storage engines and plugins. Its versatility is further extended by features like GIS, Oracle and MySQL compatibility, JSON functionality, and vector search in recent versions. MariaDB Server utilizes a standard SQL interface for data access and benefits from continuous open-source development, ensuring its reliability and innovation.

Learn how to get started with MariaDB Server.

About MariaDB plc

MariaDB plc is the lead developer behind the popular open source database MariaDB Server, which has been downloaded over 1 billion times. The company also develops an enterprise version, MariaDB Enterprise Platform, that delivers the reliability, security, and scalability needed to run production workloads with peace of mind. MariaDB plc delivers long-term releases for customers and provides world-class support and enterprise-level SLAs.

For more information, visit mariadb.com.

About MariaDB Foundation

The MariaDB Foundation is a non-profit organization that acts as the global contact point for collaboration on MariaDB Server. The MariaDB Foundation's mission centers on Openness, Adoption, and Continuity:

  • Ensuring the MariaDB Server code base remains open for usage and contributions on technical merits.

  • Striving to increase adoption by users and across use cases, platforms, and means of deployment.

  • Providing continuity to the MariaDB Server ecosystem, independent of any commercial entities.

For more information, visit mariadb.org.

Contact Us

Explore these resources to optimize your use of MariaDB, including documentation, enterprise solutions and community engagement.

  • Documentation: Feedback & issue reporting

  • Enterprise products & support: Contact us

  • Chat & mailing lists: Joining the Community

Community

Joining the Community

For discussions, release announcements, and development updates, engage with the MariaDB community through various chat platforms and mailing lists. Participate in Zulip, Slack, IRC channels, or subscribe to mailing lists like announce, discuss, docs, developers, commits, and packagers for extensive community interactions.

Chat

MariaDB developers and community members can be found on these chat channels:

  • MariaDB Zulip Chat: https://mariadb.zulipchat.com/

  • MariaDB Slack Channel: https://r.mariadb.com/join-community-slack

  • MariaDB IRC Chat: https://kiwiirc.com/nextclient/irc.libera.chat/#maria

  • More ways to connect: Connect with the Community | MariaDB

Mailing lists

  • The announce list is a low-volume list for release announcements. The archive can be found here.

  • The discuss mailing list is for MariaDB users and general discussion. The archive can be found here.

  • The docs mailing list is for those interested in documentation for MariaDB. The archive can be found here.

  • The developers mailing list is for those who want to contribute code to the project or otherwise closely monitor MariaDB development. The archive can be found here.

  • The commits mailing list is for commit messages and patches. The archive can be found here.

  • The packagers mailing list is for those packaging MariaDB in distros.

GitHub

The MariaDB project is hosted on GitHub.

Feature requests

If you have a feature request, you can file it in JIRA. If you are prepared to pay for the feature, you can contact MariaDB Corporation to get a quote. You can of course also implement it yourself and become a MariaDB contributor/developer.

Social media

A list of MariaDB-related social media links and pages. Note that many of them are community groups not officially affiliated with MariaDB.

YouTube

  • MariaDB plc

  • MariaDB Foundation

LinkedIn

  • MariaDB plc

  • MariaDB Foundation

X

  • MariaDB plc: @mariadb

  • MariaDB Foundation: @mariadb_org

Reddit

  • r/mariadb

Facebook

  • MariaDB plc

  • MariaDB Foundation

Instagram

  • MariaDB Foundation

Fosstodon

  • MariaDB Foundation: @mariadb_org


MariaDB Community

MariaDB Server is an open-source project developed by MariaDB plc and stewarded by the MariaDB Foundation. Anyone can participate in its development.

This section provides information to help you participate in making MariaDB Server and other MariaDB products a success.

If you're interested in contributing to the documentation, see this page.

Bug Tracking

Bug Tracking

How and where the community can report bugs and file feature requests, and how bug reports are processed for community users. MariaDB plc provides SLAs for customer issues (see Technical Support Services).

To report documentation issues, please review this page.

Reporting Software Bugs

For reporting documentation bugs specifically, see Reporting Documentation Bugs.

MariaDB's bug and feature tracker is found at jira.mariadb.org.

This page contains general guidelines for the community for reporting bugs and feature requests in MariaDB products. If you want to discuss a problem or a new feature with other MariaDB developers, you can find the email lists and forums here.

Known Issues

First, check that the bug or feature request isn't already filed in the MariaDB bugs database.

For the MariaDB issue database, use JIRA search to check if a report you are going to submit already exists. You are not expected to be a JIRA search guru, but please at least make some effort.

  • Choose Issues => Search for issues;

  • If the form opens for you with a long blank line at top, press Basic on the right to switch to a simpler mode;

  • In the Project field, choose the related project (MDEV for the generic MariaDB server and clients);

  • In the Contains text field, enter the most significant keywords from your future report;

  • Press Enter or the magnifying glass icon to search.

If you see issue reports which are already closed, pay attention to the 'Fix Version/s' field -- it is possible that they were addressed in an upcoming release. If they are said to be addressed in the release you are currently using or an earlier one, you can ignore them and file a new report (though please mention in your report that you found them; it might be useful).

If you find an open issue report, please vote/add a comment that the issue also interests you along with any additional information you have that may help us to find and address the issue.

If the issue is not in the MariaDB issue database yet, then it's time to file a report. If you're filing a report about an issue that's already in the MySQL issue database, please indicate so at the start of the report. Filing issue reports from MySQL in the MariaDB issue database makes sense, because:

  • It shows the MariaDB team that there is interest in having this issue addressed in MariaDB.

  • It allows work to start on addressing the issue in MariaDB - assigning versions, assigning MariaDB developers to the issue, etc.

Reporting an Issue

Bugs and feature requests are reported to the MariaDB issue tracker.

JIRA Privacy

Please note that our JIRA entries are public, and JIRA is very good at keeping a record of everything that has been done. What this means is that if you ever include confidential information in the description there will be a log containing it, even after you've deleted it. The only way to get rid of it will be removing the JIRA entry completely.

Attachments in JIRA are also public.

Access to a comment can be restricted to a certain group (e.g. Developers only), but the existing groups are rather wide, so you should not rely on it either.

If you have private information -- SQL fragments, logs, database dumps, etc. -- that you are willing to share with MariaDB team, but not with the entire world, put it into a file, compress if necessary, upload to the mariadb-ftp-server, and just mention it in the JIRA description. This way only the MariaDB team will have access to it.

Reporting Security Vulnerabilities

As explained above, all JIRA issues are public. If you believe you have found a security vulnerability, send an email to security@mariadb.org instead; please do not use JIRA for that. We will enter it in JIRA ourselves, following responsible disclosure practices.

Contents of a Good Bug Report

Below is the information we need to be able to fix bugs. The more information we get and the easier we can repeat the bug, the faster it will be fixed.

A good bug report consists of:

  1. The environment (Operating system, hardware and MariaDB version) where the bug happened.

  2. Any related errors or warnings from the server error log file. Normally this is the hostname.err file in your database directory, but it can differ depending on the distribution and version; if you cannot find it, run SELECT @@log_error on the running server. If either the variable or the file it points at is empty, the error log most likely goes to your system log. On systemd you can get the last 50 lines of the MariaDB log with journalctl -n 50 -u mariadb.service. If possible, attach the full unabridged error log, at least from the last server restart to the end of the log.

  3. If the problem is related to MariaDB updates, or otherwise changing the version of the server, recovery from a previous crash, and such, then include the previous versions used, and the error log from previous server sessions.

  4. The content of your my.cnf file or alternatively the output from mariadbd --print-defaults or SHOW VARIABLES.

  5. Any background information you can provide (stack trace, tables, table definitions (SHOW CREATE TABLE {tablename}), data dumps, query logs).

  6. If the bug is about server producing wrong query results: the actual result (what you are getting), the expected result (what you think should be produced instead), and, unless it is obvious, the reason why you think the current result is wrong.

  7. If the bug is about a performance problem, e.g. a certain query is slower on one version than on another, include the output of EXPLAIN EXTENDED <query> on both servers. If it's a SELECT query, use ANALYZE FORMAT=JSON.

  8. A test case or some other way to repeat the bug. This should preferably be in plain SQL or in mysqltest format. See mysqltest/README for information about this.

  9. If it's impossible to create a test case, then providing us with backtrace information would be of great help.

JIRA Fields

The sections below describe which JIRA fields need to be populated when filing reports, and what should be put there. Apart from what's mentioned below, you don't need to fill in or change any other fields when creating a new bug report.

Project

If you are filing a report for the MariaDB server, client programs, or MariaDB Galera Cluster, the target project is MDEV. Connectors and MaxScale have separate projects with corresponding names. If you choose the wrong project, bug processing can be delayed, but there is no reason to panic -- we'll correct it. If you inform us about the mistake, we'll change it faster.

Some project names include:

  • CONC - MariaDB Connector/C

  • CONCPP - MariaDB Connector/C++

  • CONJ - MariaDB Connector/J

  • CONJS - MariaDB Connector/node.js

  • CONPY - MariaDB Connector/Python

  • MCOL - ColumnStore

  • MDBF - MariaDB Foundation Development (anything related to the mariadb.org domain)

  • MDEV - MariaDB server, client programs, or MariaDB Galera Cluster

  • MXS - MaxScale

  • ODBC - MariaDB Connector/ODBC

  • R2DBC - MariaDB Connector/R2DBC

Type

Feature requests are not the same as bug reports. Choose the New Feature type for feature requests in JIRA, and the Bug type for bug reports. As with the Project field, choosing the wrong type will put the request into the wrong queue and can delay its processing, but eventually it will be noticed and amended.

See also the plans for the next release for things we are considering including in the next MariaDB release.

Summary

Please make sure the summary line is informative and distinctive. It should always be easy to recognize your report among other similar ones, otherwise a reasonable question arises -- why are they not duplicates?

Examples:

  • good summary: Server crash with insert statement containing DEFAULT into view

  • not a good summary: mariadbd crash

Generally, we try not to change the original summary without a good reason to do it, so that you can always recognize your own reports easily.

Priority

We do not have separate Severity and Priority fields in JIRA, so the Priority field serves a double purpose. For original reports, it indicates the importance of the problem from the reporter's point of view. The default is 'Major'; there are two lower and two higher values. Please set the value accurately. While we do take it into account during initial processing, inflating the value beyond reason won't do any good; the only effect will be time wasted while somebody tries to understand why a trivial problem got such a high priority. After that, the value will be changed, and the report will be processed in due time anyway.

Affected Versions

Put everything you know about which versions are affected. Both major versions (10.6, 10.5, etc.) and minor versions (10.5.9, 10.4.12, etc.) are available to choose from. Please always specify the exact version(s) (X.Y.Z) that you are working with and where you experience the problem.

Additionally, if you know the exact version where the problem appeared, please put that as well. If, as far as you know, the problem has been present in all previous releases, you can also put the major version there, e.g. 10.0. Alternatively, you can mention all of this in the description or comments.

Please also note in the description or comments which versions you know as not affected. This information will help to shorten further processing.

Environment

Put here environment-related information that might be important for reproducing or analyzing the problem: operating system, hardware, related 3rd-party applications, compilers, etc.

Description

The most important part of the description is the steps to reproduce the problem. See more details about bug report contents above, in the section Contents of a Good Bug Report.

If in the process of reproducing, you executed some SQL, don't describe it in words such as "I created a table with text columns and date columns and populated it with some rows" -- instead, whenever possible, put the exact SQL queries that you ran. The same goes for problems that you encountered: instead of saying "it did not work, the query failed, I got an error", always paste the exact output that you received.

Use {noformat}...{noformat} and {code}...{code} blocks for code and console output in the description.

Attachments

If you have SQL code, a database dump, a log, etc. of reasonable size, attach it to the report (archive it first if necessary). If the files are too big, you can upload them to ftp.askmonty.org/private. It is always a good idea to attach your cnf file(s), unless it is absolutely clear from the nature of the report that the configuration is irrelevant.

Links

If you found or filed a related bug report in the MariaDB, MySQL, or Percona bug databases, you can put it in the Links section; the same goes for any external links to 3rd-party resources you find important to mention. Alternatively, you can just mention them in the description or comments.

Tags

You don't have to set any tags, but if you want to use some for your convenience, feel free to do so. However, please avoid overly generic values -- for example, the tag mariadb is meaningless, because everything there is about MariaDB. Don't be surprised if some tags are removed later during report processing.

Bugs that also Affect MySQL or Percona

Our normal practice is to report a bug upstream if it's applicable to their version. While we can do it on your behalf, it is always better if you do it yourself -- it will be easier for you to track it further.

If the bug affects MySQL, it should also be reported in the MySQL bugs database. If the bug affects Percona Server and not MySQL, it should go to Percona Launchpad.

Collecting Additional Information for a Bug Report

Getting a Stack Trace with Details

See the article How to produce a stack trace from a core file.

Extracting a Portion of a Binary Log

See the article here.

Getting Help with your Servers

If you require personalized assistance, want to ensure that the bug is fixed with high priority, or want someone to login to your server to find out what's wrong, you can always purchase a Support contract from MariaDB plc or use their consulting services.

This page is licensed: CC BY-SA / Gnu FDL

Bug Processing

This page describes how community bug reports are processed among our products and explains what you need to notice while tracking bugs.

Commitments

MariaDB does not have any SLA or guaranteed reaction times for bugs in Jira. While we take bugs reported by the community very seriously and aim to respond to and handle issues as fast as possible, MariaDB does not have a dedicated bug verification team; this activity is performed on a best-effort basis.

To make sure your bug report will be confirmed and moved forward faster, please follow the guidelines about creating bug reports.

Bug Verification Routine

The initial bug processing routine in MariaDB is not strictly formalized. This section describes the de facto process rather than a formal policy.

The process is different for bug reports (Bug type in JIRA) vs feature requests (Task type). The process described below is related to bug reports.

Incoming Queue

All new bug reports go to the waiting list, to be reproduced and confirmed by a member of the team. The bug stays in the queue until one or more of the conditions below are met:

  • Bug report is assigned to a developer;

  • Bug report gets status 'Confirmed';

  • Bug report gets the label 'upstream';

  • Bug report is closed (for whatever reason).

Other things being equal, bug reports in the queue are initially handled in FIFO order; however, various factors make things not equal.

Bug Processing Order

The first thing taken into account is Priority. That does not mean everything should be filed as Critical; on the contrary, it means that Priority should be chosen wisely. Although a report with a higher Priority will be looked at sooner, as soon as it becomes clear that the Priority is set higher than the problem deserves, it will be amended and the report put back into the queue. However, if the high priority is justified, we will try to process the report as fast as possible.

Another important factor is the quality of the report itself.

  • If the report is written well and has all information, including a reproducible test case, it can be verified and moved forward quickly.

  • If the report is written clearly enough but does not have all the necessary information, it will get a reasonably fast first response in which we request the missing details; but the subsequent exchange can take a long time, until we get everything we need to work on the issue.

  • Finally, reports written in a tangled and incomprehensible manner get the slowest response, because even if it eventually turns out that they contain all the required information, it is difficult and time-consuming to extract and process, so they can be put aside for some time.

First Response

Complete processing of a reported bug can be complicated and time-consuming, especially the reproducing part. We do not want our users to wait long without knowing whether their bug report has even been noticed, so we try to provide a first response sooner than that.

The first response to a bug, which we try to provide as quickly as possible, is one of the following:

  • If we can reproduce the problem based on the information that was provided in the initial description, the report gets the status Confirmed.

  • If it is obvious from the initial description that the bug report is a duplicate of an existing one, or the problem has already been fixed in later releases or in the upcoming release, or the described behavior is not a bug, or, in very rare cases, it is admitted to be a bug, but it is not going to be fixed, the report gets closed with the corresponding Resolution value and a comment with the explanation.

  • If the bug report at least appears to describe a real bug, but we do not have enough information to proceed, we will request the information from the reporter, and the report will go to the Need feedback list.

  • If for some reason it is clear from the bug report that it will be very difficult to reproduce based on the information from the user, but there is reason to believe that the problem can be analyzed by code inspection, the bug report can be assigned to a developer who is an expert in the corresponding area for analysis.

We realize that "as quickly as possible" is a relative term. The dream scenario is that all reports are responded to in a matter of hours; however, more realistically, it can take a few days, and in some cases, when the team is overly busy with a big upcoming release or some other extraordinary events, it can even be weeks.

Need Feedback

When a report does not have all the information needed to reproduce the problem right away (which is quite often the case), we will ask the reporter to provide the missing information. Usually it takes more than one iteration to get everything right, so it is important that you respond to the questions as precisely as you can. Please make sure that you answer all the questions (or, if you cannot answer some of them, say so; otherwise we will have to ask again, and more time will be wasted).

There is no status "Need Feedback" in our JIRA; instead, we are using the label need_feedback. As long as the report has this label, it remains on the "Waiting for feedback" list. The label is set and removed manually by whoever asks for the feedback and receives it; so it can happen that the reporter has provided the response, but it remained unnoticed and the bug keeps waiting. It will be our fault, but human errors happen; it would help a lot if the reporter removed the label along with providing the feedback.

E-mail Notifications

This question arises fairly often, so it deserves mentioning.

As already said before, the need_feedback label is set and removed manually. JIRA e-mail updates about it can be confusing when you look at them quickly. For example, when someone removes the label, the email looks like this:

Elena Stepanova updated MDEV-9791:
----------------------------------
    Labels:   (was: need_feedback)

What it says is that the Labels field has become empty, while before it contained need_feedback. People often misread it and ask "What else do you need from me? I've answered your questions". This update means that at the moment we don't need anything: your report is back in the incoming queue, and your feedback will be analyzed as soon as possible. Then we will possibly ask more questions and set the label again, and the notification will look like this:

Elena Stepanova updated MDEV-9801:
----------------------------------
    Labels: need_feedback  (was: )

Successful Outcome

If the feedback exchange was fruitful and we received enough information to proceed, the bug report will go through the normal verification steps.

Incomplete Reports

Reports do not stay open on the "Need Feedback" list forever. After a month of waiting, if we do not get a response from the reporter, and still cannot proceed without it, we close the report as Incomplete with the corresponding comment. This state is not irreversible: you can still add comments and provide the information even when the report is closed as Incomplete, and it will be re-opened.

Worst Case Scenario

Sometimes it happens that after several iterations of feedback requests we run out of ideas about what else to ask from the reporter and still cannot verify the bug; or the reporter is willing to collaborate with us, but cannot provide the necessary information for objective reasons (usually when the problem happens on a production instance). In some cases we might close the report as "Cannot reproduce", which we consider our loss; but more often we want to keep it open, in the hope that more information arrives, maybe from a different source, and together with this report it will help us get to the bottom of the problem. In that case, the report gets assigned to somebody without being confirmed, just so that it remains at least on somebody's radar, and it will stay open for a long time. This does not mean it is forgotten; it means that for the time being we have hit a wall. You are very welcome to comment on such reports whenever you think you might have something to add, because this is exactly what we are waiting for.

Bug Verification

Normally the bug report has to go through the following steps before it is moved forward to fixing:

  • the described problem needs to be reproduced;

  • it needs to be checked against all active post-Beta versions of MariaDB where it is theoretically applicable (as of the moment of writing this article, it is 5.5, 10.0, 10.1);

  • in case it is a relatively recent regression, the guilty change needs to be found;

  • the component or functional area should be determined, so that the bug gets assigned to the right person.

After that the bug is ready for fixing.

Bug Fixing Routine

Sometimes it seems hard to understand from the outside how the MariaDB development team chooses which bugs to fix in a particular release, why some bugs are fixed faster than others, or why critical bugs stay untouched for a long time.

Sprint Model for Bug Fixing

MariaDB currently uses a 1- or 2-week sprint model for server development and bugfixing. It would need a separate article to describe in more detail, but for bugfixing, in a nutshell, it means the following.

  • one or two weeks before a scheduled release the team creates a new sprint and evaluates existing bugs which affect this release;

  • the selected bugs are added to the new sprint;

  • during the active sprint, the developer is supposed to work on the tasks which are part of the sprint, unless a true emergency arises.

There are two important consequences of this model which sometimes cause confusion:

  1. If the current sprint is for one version, e.g. 10.0, and you file a bug for another version, e.g. 10.1, then, even if the bug is really critical, it won't be jumped on right away: it makes no sense, because 10.1 is not going to be released next week anyway, while 10.0 will be. When the 10.0 sprint finishes and the 10.1 sprint starts, your bug will be picked up for that sprint and fixed then.

  2. If the current sprint for 10.1 is already in progress, newly created 10.1 reports normally won't be included into it, unless they are considered so extremely important that the developer is allowed to ignore the sprint plan.

Bugs Chosen for a Sprint

When a new sprint is created, bugs which affect the scheduled release are evaluated.

  • each developer chooses, from all such bugs assigned to them, the ones they are able to work on during the given time interval;

  • bug priority plays the most significant role in this process, but this is not the only factor.

    • Blocker bugs must be either fixed or downgraded before the release goes out;

    • Critical bugs should be chosen above other bugs, except for Blockers;

    • among Major bugs,

      • bugs with patches, either external, or upstream, or internal, are usually prioritized above ordinary bug reports;

      • external reports (community reports) are ranked higher than bugs reported by the development team;

      • bugs which can realistically be fixed in the given time interval are chosen more frequently than those that are likely to take several cycles;

      • bugs which affect the reporter more severely get more attention than those that have viable workarounds;

    • Minor bugs are usually fixed when there are no more urgent tasks.

Tracking Progress

If a bug report has passed through the verification stage, either being confirmed or pushed forward to development-level analysis as is, there can be various updates on it. It is important to understand what they mean.

JIRA Fields to Watch

All JIRA fields are public, but some of them are mainly used for internal development process, while others are more user-facing. This article describes which fields should be populated during the initial report submission. There is a different set of fields important for tracking purposes.

Resolution vs. Status

It might seem counter-intuitive, but in the existing JIRA structure, the Status field does not mean much for the user; it is mainly used for development and management purposes. On the contrary, the Resolution field is entirely user-facing: it does not participate in planning or development. It remains 'Unresolved' for the whole life of the report, and is only changed when the bug gets closed, indicating the reason why it was closed.

Resolution

  • Unresolved - the bug report remains open, the work has not been finished.

  • Fixed - the bug has been fixed, see Fix version/s and possibly comments to the report for more information. This is almost always a terminal state, we do not re-open fixed bugs even if they later re-appear; please create a new one instead. The only case when it can be re-opened is when the 'Fix version/s' have not been released yet.

  • Duplicate - the bug report is identical to an already existing open (or recently fixed) report, which will be quoted in the comments and/or links. It is usually a terminal state, unless it is proven later that the report was not a duplicate after all.

  • Not a bug - the described behavior is not a bug, there will be a comment explaining why. It is usually a terminal state, unless you object and show why it is a bug. If the report is in fact a feature request, then rather than closing it as 'Not a bug', we will switch the type to 'Task'.

  • Incomplete - we had requested feedback from the user and waited for 1 month, but did not receive it. It is a pseudo-terminal state, the report can be re-opened any time when the requested information is provided.

  • Cannot reproduce - a rather rarely used "resolution", which means we could not find a way to confirm the problem described by the reporter, and ran out of ideas about what other information to request from the reporter in order to reproduce it.

  • Won't fix - another rarely used "resolution", which means that the bug is admitted, but we have no intention to fix it. Usually it happens when the bug only affects old versions, and is not important enough to fix in the old versions; or, when it is related to systems or architectures we don't officially support.

Status

  • Open, Confirmed - this distinction is used in our internal queues, but from the user's perspective the difference is slim: setting the bug report to 'Confirmed' does mean that we are satisfied with the information provided in the report, but the user will also know about it from our comments and other updates. Otherwise, bugs in both statuses can be considered for fixing.

  • In Progress, Stalled - different intermediate states of bugs which help developers to filter their lists and management to gather a picture of the current activity. For the user, there is no important difference -- despite the negative semantics, 'Stalled' does not mean that something is wrong with the bug report, only that the developer is not working on it actively at the moment.

  • In review - means, literally, that a peer review has been requested.

  • Closed - means that the bug report is closed, for whatever reason. The real reason is in the 'Resolution' field.

Fix Versions

This is an important field for progress tracking. After the bug is confirmed or otherwise acknowledged, this field is populated with a set of major versions where we intend to fix it. E.g. if the field is set to 10.0 10.1, it means that at the moment we consider it for fixing in some future 10.0 release (not necessarily the next one), and the bugfix will be merged into the next 10.1 release after that; but we do not consider it for fixing in 5.5, even if 5.5 is also affected.

To some extent, you can influence the initial plans: if you see that the fix is not targeted for versions where you think it should be, you can comment on the report, and if you provide convincing arguments and make your case, it can be reconsidered.

The value of the field is not a promise to fix the bug in the mentioned releases. It can be changed both ways: during further analysis, the developer can find out that it can be safely fixed in an earlier release, or, on the contrary, that it cannot be safely fixed in the GA release, and the fix can only go to the next versions which are currently under development.

After the bug is fixed, the value of the field is changed to the exact versions, e.g. 10.0.25 10.1.14. It means that the patch has been pushed into the 10.0 branch, and will be released with 10.0.25 release; it also means that the patch will be merged to 10.1 tree and released with 10.1.14 release, but it does not mean that it is already in the 10.1 branch.

Priority

As the other article says, the Priority field serves two purposes. During the initial bug creation, it indicates the importance of the bug report from the user's perspective (in other bug tracking systems it is called 'Severity' or similar). After the bug has been confirmed, the same field is used for development purposes, to prioritize bug fixing (real 'Priority'). While we take into account the reporter's view on the matter, we can change the initial priority both ways, depending on the information revealed during the problem analysis, the versions affected, etc.

The value of the field normally means the following:

  • Blocker - we currently think that the bug must be fixed before the next release(s) set in the 'Fix version/s' field;

  • Critical - the bug should be picked up for fixing earlier than any other bugs apart from blockers;

  • Major - the bug will be present in the main queue for fixing in the upcoming 'Fix version/s', although only a part of such bugs will be fixed in every release;

  • Minor, Trivial - the bugs will be picked up when the assignee does not have more pressing issues for the upcoming release.

Please note that the Priority field only demonstrates our intentions at the moment, it does not guarantee that things will happen according to these intentions.

Labels

Labels are mostly used for more convenient filtering and don't carry much importance otherwise. However, there are a few that affect the processing of a bug report:

  • need_feedback - its role during the initial bug processing was already described above. However, after a bug is confirmed and queued for fixing, it should not appear anymore; and even if it's left by mistake, it won't affect the progress.

  • upstream - the label means that the bug also exists in the upstream version of the corresponding component - normally, in MySQL server or a client program, but it can also be in Percona's XtraDB or TokuDB. Normally there should also be a link to the upstream bug report. Setting this label means that we might want to wait for a while and see whether the bug is fixed in the upstream version before we fix it in MariaDB directly. This was usual for 5.5, less usual for 10.x where bugfixes, apart from InnoDB, are not merged automatically. The label is still set, but it is more for informational purposes than to affect the priority.

  • upstream-fixed - the label means that the bug used to exist in the upstream version, but no longer does. It means that there is nothing more to wait for; moreover, it might be worth picking up the bug soon and at least evaluating the upstream bugfix.

Bug Reports with Patches

MariaDB encourages contributors to provide bug fixes, so bug reports which come with fixes in general have a quicker turnaround. The bug fix can come in the form of a Git pull request or, in simple cases, as a diff pasted into or attached to the bug report itself.

Principles for External Bug Reports

There are some basic rules for bugs, particularly for setting the Resolution value, which we want to stick to and which might differ from procedures you have come across in other projects. This mainly concerns external bugs (those that come from the community); for internal ones we can cut corners more freely.

All of this is easier to understand if one remembers that the Resolution field (or its analogues in other bug-tracking systems) is user-facing, as already mentioned above, and that it relates more to the report than to the bug itself.

Duplicate

An older bug report cannot be a duplicate of a newer one; it is nonsensical. The only possible exception is when an older bug has no useful information whatsoever and the reporter does not provide any helpful feedback, while a newer report was not closed as a duplicate right away and got some useful updates. The common example of such an exception is when the first report is just an optimized stack trace, no query, no data, nothing to work with, while the second report has a test case. But if the first reporter at least makes an effort to collaborate, the report deserves to be treated with respect.

Bug reports which have essentially different descriptions and/or test cases should not be duplicates. The common example is this: a developer creates a bug saying something like "this and that pieces of code are wrong, it should be so and so"; and then a user files a bug saying "this SQL produces a wrong result on this data set". Even if they are about the same error in the code at the end, they are not duplicate bug reports.

Obviously, a report can never be a duplicate of anything private (luckily it does not concern MariaDB server so far, as the bug reports are public).

In general, a bug report is a duplicate of another one if, and only if, the new reporter could find the existing report just by a reasonable JIRA search.

Cannot Reproduce

A bug report should not be closed as "cannot reproduce" if it was once verified/confirmed, but disappeared in later versions. It's unfair to the reporter, and also dangerous to the product. We should know why a bug stopped being reproducible -- either we find when and how it was fixed (and close the report as "Fixed in version X by a patch for Y"), or we discover that it wasn't in fact fixed, but just masked. The simplest example is a change of execution plan in optimizer: server would crash on a particular query, then due to a change in optimizer it started using a different plan for the same query, so it wouldn't go through the crashing path anymore. The crash is still there, though.

In general, the "cannot reproduce" resolution is a last resort. Usually if we can't reproduce something, it means that either the reporter did not provide required information (and then the resolution should be "Incomplete"), or we don't know what to request from the reporter, and then we should keep thinking, rather than close it. Of course, it happens that the bug is genuinely not reproducible, but it shouldn't be decided lightly.

This page is licensed: CC BY-SA / Gnu FDL

Security Bug Fixing

See MariaDB Security Policy.

See Also

  • MariaDB Maintenance Policy

  • MariaDB Deprecation Policy

  • MariaDB plc Legal

This page is licensed: CC BY-SA / Gnu FDL

Building MariaDB Server for Debugging

Instructions on how to build a mysqld that contains all the information we need to fix problems you encounter. (A more detailed explanation can be found here.)

  • Add the --core-file option to your ~/.my.cnf or /etc/my.cnf file under the [mysqld] group.

  • Get the latest MariaDB code from GitHub.

  • Compile MariaDB with the -g compiler flag (Unix).

  • Optionally, for more checking: Compile MariaDB for debugging - this will cause a slowdown.

  • Shut down your old mysqld server.

  • Install the newly compiled mysqld binary. Note that if you are compiling the same version of MariaDB that you already have installed, it's enough to just copy this one binary!

  • Restart mysqld.

Compiling with -g should not cause any notable slowdown of the server.
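The compile steps above can be sketched as follows. This is a minimal sketch, assuming a fresh in-source checkout and a CMake Debug build (which implies the -g compiler flag); adjust paths, branches, and options to your environment:

```shell
# Fetch the latest MariaDB source (assumes git, cmake and a C/C++ toolchain are installed).
git clone https://github.com/MariaDB/server.git mariadb-server
cd mariadb-server

# Configure a debug build; CMAKE_BUILD_TYPE=Debug adds the -g compiler flag.
cmake . -DCMAKE_BUILD_TYPE=Debug

# Build using all available cores; this can take a while.
make -j"$(nproc)"
```

The resulting mysqld binary is typically found under the sql/ directory of the build tree and can be copied over the installed one as described above.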

You can of course also do make install, but the above way allows you to go back to your old binary if needed.

If you get any errors about a wrong number of error messages, you can fix that by copying the corresponding language files from sql/share over your old ones (this should be reasonably safe to do).

cp sql/share/english/* mariadb-install-dir/share/mysql/english

What to Do When You Get a Crash After Installing a Debug Binary

Now when you get a crash do the following:

  • Create a README file that describes the problem. You can use the mysqlbug script to generate a template for this.

  • Create a tar file containing the core, the mysqld binary and README. If possible, also add any database files that could help us repeat the problem!

sh> tar cvfz /tmp/mariadb-bug-'short-description'.tgz mariadb-data-dir/core* mariadb-install-dir/libexec/mysqld README
  • Send it to our secure ftp server:

sh> ftp -a ftp.askmonty.org
ftp> cd private
ftp> binary 
ftp> put /tmp/mariadb-bug-'short-description'.tgz
ftp> quit
  • To be able to follow the progress, create a bug report in JIRA about this. This should be easy to do based on the information you have in your README file.

See Also

  • Compiling MariaDB from Source.

  • Compiling MariaDB for debugging

  • How to produce a stack trace from a core file

This page is licensed: CC BY-SA / Gnu FDL

Extracting Entries from the Binary Log

Instructions to narrow down problems on a replication replica.

Sometimes a binary log event causes an error of some sort. Sharing a whole binary log file is sometimes impractical for size or sensitivity reasons.

Step 1: Copy the binary log locally

This is just in case you don't quite extract the right information the first time. If the binlog has expired and you haven't kept the right information, your bug report may not be easily reproducible.

sudo cp /var/lib/mysql/mysql-bin.000687 ~/
sudo chown $USER: ~/mysql-bin.000687

Step 2: Create an extract header

Binary logs have a header portion. Without the header, mariadb-binlog won't be able to read the log. The header also contains valuable session information.

We look at the binary log to see how big the header and session information is:

mariadb-binlog --base64-output=decode-rows --verbose mysql-bin.000687 | more
/*!50530 SET @@SESSION.PSEUDO_SLAVE_MODE=1*/;
/*!40019 SET @@session.max_insert_delayed_threads=0*/;
/*!50003 SET @OLD_COMPLETION_TYPE=@@COMPLETION_TYPE,COMPLETION_TYPE=0*/;
DELIMITER /*!*/;
# at 4
#150323 22:45:58 server id 76  end_log_pos 245  Start: binlog v 4, server v 5.5.39-MariaDB-log created 150323 22:45:58
# at 245
#150323 22:45:58 server id 76  end_log_pos 328  Query   thread_id=9709067       exec_time=0     error_code=0
SET TIMESTAMP=1427116558.923924/*!*/;
SET @@session.pseudo_thread_id=9709067/*!*/;
SET @@session.foreign_key_checks=1, @@session.sql_auto_is_null=0, @@session.unique_checks=1, @@session.autocommit=1/*!*/;
SET @@session.sql_mode=0/*!*/;
SET @@session.auto_increment_increment=1, @@session.auto_increment_offset=1/*!*/;
/*!\C utf8 *//*!*/;
SET @@session.character_set_client=33,@@session.collation_connection=33,@@session.collation_server=8/*!*/;
SET @@session.time_zone='SYSTEM'/*!*/;
SET @@session.lc_time_names=0/*!*/;
SET @@session.collation_database=DEFAULT/*!*/;
BEGIN
/*!*/;
# at 328

We see that the session information ends at 328 because of the last line, so we extract to that point.

dd if=mysql-bin.000687 of=mysql-bin.000687-extract-offset-129619 bs=1 count=328

We need to find out at what offset the entry at 129619 ends, and it might be useful to extract some previous entries as well.

mariadb-binlog --base64-output=decode-rows --verbose mysql-bin.000687 | grep  '^# at ' |  grep -C 10 '^# at 129619$'
# at 127602
# at 127690
# at 128201
# at 128290
# at 128378
# at 128829
# at 128918
# at 129006
# at 129459
# at 129548
# at 129619
# at 129647
# at 130070
# at 130097
# at 130168
# at 130196
# at 130738
# at 130942
# at 130969
# at 131040
# at 131244

Take a look at those entries with:

mariadb-binlog --base64-output=decode-rows --verbose --start-position 129006  --stop-position 130168  mysql-bin.000687 | more

Now let's assume we want to start at our original 129619 and finish before 130168:

dd if=mysql-bin.000687 bs=1 skip=129619 count=$(( 130168 - 129619 ))  >> mysql-bin.000687-extract-offset-129619

Check the extract:

mariadb-binlog mysql-bin.000687-extract-offset-129619

Upload this to the private uploads area, or attach it to the public bug report if there is nothing sensitive in it.
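The extraction steps above can be consolidated into one short script. This is a hedged sketch using the example offsets from this walkthrough (328 for the header, 129619 to 130168 for the wanted events); substitute your own values:

```shell
# Example offsets taken from the walkthrough above.
BINLOG=mysql-bin.000687
HEADER_END=328        # end of the header/session portion ("# at 328")
START=129619          # offset of the first event to extract
END=130168            # offset right after the last wanted event
OUT="${BINLOG}-extract-offset-${START}"

# Copy the header, then append the event range.
dd if="$BINLOG" of="$OUT" bs=1 count="$HEADER_END"
dd if="$BINLOG" bs=1 skip="$START" count=$(( END - START )) >> "$OUT"
```

The resulting file should again be verified with mariadb-binlog before uploading.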

This page is licensed: CC BY-SA / Gnu FDL

Profiling Memory Usage

Profiling the memory usage can be useful for finding out why a program appears to use more memory than it should. It is especially helpful for analyzing OOM situations or other cases where the memory grows linearly and causes problems.

To profile the memory usage of a program, there are multiple options. The following sections describe the methods that are available.

If a problem in memory usage is identified and it appears to be due to a bug, please open a new bug report on the MariaDB Jira under the correct project and include the relevant memory profiling output in it. Refer to How to Write a Good Bug Report for more details.

Known issues that can cause extended memory usage

Transparent huge pages (THP)

Transparent huge pages (THP), which is enabled by default in many newer Linux distributions, can cause out-of-memory issues for MariaDB, as THP is not suitable for databases. This is described in MDEV-33279.

MariaDB Community Server 10.6.17, MariaDB Enterprise Server 10.6.16-11 and all other MariaDB server releases after these have THP disabled.

System malloc is not good if there are a lot of allocations of different sizes

If Memory_used and information_schema.processlist do not show an increase in memory, but the process still increases in size, then a likely problem is the system memory allocation library (malloc). Replacing malloc with tcmalloc or jemalloc should fix the issue in this case.

Profiling with the MariaDB server

Recent MariaDB versions have a global status variable Memory_used that shows how much memory the MariaDB server has allocated. By monitoring this variable, one can find out if the memory allocated by MariaDB grows.
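Since Memory_used is a status variable, it can be checked like this (a minimal sketch):

```sql
-- Total memory currently allocated by the server, in bytes.
SHOW GLOBAL STATUS LIKE 'Memory_used';
```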

One can also check memory usage per user with the information_schema.processlist:

SELECT id, MEMORY_USED, MAX_MEMORY_USED FROM information_schema.processlist;

This shows the current memory used per connection and the maximum memory they have used since the user connected.

The Performance Schema can also be used to find out what is allocating memory and for what purpose.

Note that one can also set the max_session_mem_used variable to restrict a user's memory usage.
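As a hedged illustration (the value here is an arbitrary example of 1 GiB, expressed in bytes):

```sql
-- Limit the memory a single session may allocate to about 1 GiB.
SET GLOBAL max_session_mem_used = 1073741824;
```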

BPF Compiler Collection (bcc)

The BPF Compiler Collection (bcc) toolkit comes with the memleak program that traces outstanding memory allocations. This is a very convenient way of debugging high memory usage, as it immediately shows where the memory is allocated.

By default the tool will print output once every five seconds with the stacktraces that have the most open allocations. Ctrl+C can be used to interrupt the collection of the traces.

The profiling interval and the profiling duration can be passed as arguments to memleak. The first argument is how often a sample is taken and the second argument is how long to sample for. To help analyze excessive memory usage, collect the output of the memleak program for at least 60 seconds. The longer the profiling can be left on, the more accurate the information will be.

The overhead of the profiling can be large enough that it affects production workloads negatively. To reduce the overhead, the sampling frequency of memory allocations can be lowered using the --sample-rate option:

-s SAMPLE_RATE, --sample-rate SAMPLE_RATE
                        sample every N-th allocation to decrease the overhead

For example, -s 10 will sample only 10% of memory allocations, which may miss memory leaks from individual allocations; but the longer the system is left running, the more likely it is that a leaking memory allocation is sampled. This means that even with a lower sampling rate, the source of the memory leak will eventually be found.

RHEL, CentOS, Rocky Linux and Fedora

On RHEL based systems, the package is named bcc-tools. After installing it, use the following command to profile the memory usage 5 times per second over a window of 60 seconds:

sudo /usr/share/bcc/tools/memleak -p $(pidof mariadbd) 5 60 | tee memleak.log

Ubuntu and Debian

On Ubuntu/Debian the package is named bpfcc-tools. After installing it, use the following command to profile the memory usage 5 times per second over a window of 60 seconds:

sudo memleak-bpfcc -p $(pidof mariadbd) 5 60 | tee memleak.log

Jemalloc Heap Profiling

Jemalloc is an alternative to the default glibc memory allocator. It is capable of analyzing the heap memory usage of a process which allows it to be used to detect all sorts of memory usage problems with a lower overhead compared to tools like Valgrind. Unlike the ASAN and LSAN sanitizers, it is capable of detecting cases where memory doesn't actually leak but keeps growing with no upper limit (e.g. items get appended to a list but are never removed).

Ubuntu and Debian

To enable jemalloc, the packages for it must be first installed from the system repositories. Ubuntu 20.04 requires the following packages to be installed for jemalloc profiling:

apt-get -y install libjemalloc2 libjemalloc-dev binutils

RHEL, CentOS, Rocky Linux and Fedora

The version of jemalloc that is available in most Red Hat repositories is not compiled with memory profiling support enabled. For RHEL based distributions, the only option is to build jemalloc from source.

Configuring Jemalloc for Heap Profiling

Once installed, edit the systemd service file with systemctl edit mariadb.service and add the following lines into it. The path to the libjemalloc.so file is OS-specific so make sure it points to the correct file. The example here is for Ubuntu and Debian environments.

[Service]
Environment=MALLOC_CONF=prof:true,prof_leak:true,prof_gdump:true,lg_prof_sample:18,prof_prefix:/var/lib/mysql/jeprof/jeprof
Environment=LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libjemalloc.so.2

Then create the directory for the profile files:

mkdir /var/lib/mysql/jeprof/
chown mysql:mysql /var/lib/mysql/jeprof/

And finally restart MariaDB with systemctl restart mariadb.service.

The /var/lib/mysql/jeprof/ directory will start to fill with versioned files with a .heap suffix. Every time the virtual memory usage reaches a new high, a file will be created. Initially, the files will be created very often, but eventually the pace will slow down. Once the problematic memory usage has been identified, the latest .heap file can be analyzed with the jeprof program.

The simplest method is to generate a text report with the following command.

jeprof --txt /usr/sbin/mariadbd $(ls -1 /var/lib/mysql/jeprof/*.heap|sort -V|tail -n 1) > heap-report.txt

A better way to look at the generated heap profile is with the PDF output. However, this requires the installation of extra packages (apt -y install graphviz ghostscript gv). To generate the PDF report of the latest heap dump, run the following command:

jeprof --pdf /usr/sbin/mariadbd $(ls -1 /var/lib/mysql/jeprof/*.heap|sort -V|tail -n 1) > heap-report.pdf

The generated heap-report.pdf will contain a breakdown of the memory usage.

Note that the report generation with the jeprof program must be done on the same system where the profiling was done. If done elsewhere, the binaries do not necessarily match and can cause the report generation to fail.

Tcmalloc Heap Profiling

Similarly to the jemalloc memory allocator, the tcmalloc memory allocator comes with a leak checker and heap profiler.

Installation

RHEL, CentOS and Rocky Linux

On RHEL based systems, the gperftools package is in the EPEL repositories. These must first be enabled by installing the epel-release package.

sudo dnf -y install epel-release

After this, the gperftools package can be installed.

sudo dnf -y install gperftools

Ubuntu 20.04

sudo apt -y install google-perftools

Service file configuration

Once tcmalloc is installed, edit the systemd service file with systemctl edit mariadb.service and add the following lines into it.

Note: Make sure to use the correct path and library name for the tcmalloc library in LD_PRELOAD. The following example uses the Debian location of the library; on RHEL systems, the file is usually located at /usr/lib64/libtcmalloc_and_profiler.so.4. The version number of the library can also change, which might require further adjustments to the library path.

[Service]
Environment=LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libtcmalloc_and_profiler.so.4
Environment=HEAPPROFILE=/var/lib/mysql/pprof/mariadbd.prof
Environment=HEAPCHECK=normal
Environment=HEAP_CHECK_AFTER_DESTRUCTORS=true

Then create the directory for the profile files:

mkdir /var/lib/mysql/pprof/
chown mysql:mysql /var/lib/mysql/pprof/

And finally restart MariaDB with systemctl restart mariadb.service.

Configuring Heap Dump Frequency

The heap profiling is configured using environment variables. The details can be found in the Modifying Runtime Behavior section of the gperftools documentation: heapprofile.html

By default, tcmalloc dumps the heap profile every time 1GiB of memory has been allocated (HEAP_PROFILE_ALLOCATION_INTERVAL) or whenever the high-water memory usage mark increases by 100MiB (HEAP_PROFILE_INUSE_INTERVAL). If there's no activity, no memory dumps will be generated.
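Both intervals take a byte count and can be adjusted in the same systemd override as the other variables. As a sketch, the following illustrative values would dump after every 512MiB of allocations or every 50MiB increase of the in-use high-water mark:

```
[Service]
Environment=HEAP_PROFILE_ALLOCATION_INTERVAL=536870912
Environment=HEAP_PROFILE_INUSE_INTERVAL=52428800
```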

To trigger a memory dump based on a time interval, set the HEAP_PROFILE_TIME_INTERVAL environment variable to the number of seconds between each dump. For example, with Environment=HEAP_PROFILE_TIME_INTERVAL=3600 there will be one heap dump per hour.

Report generation

Depending on which OS you are using, the report generation program is named either pprof (RHEL) or google-pprof (Debian/Ubuntu).

It is important to pick the latest .heap file to analyze. The following command generates the heap-report.pdf from the latest heap dump. The file will show the breakdown of the memory usage.

pprof --pdf /usr/sbin/mariadbd $(ls /var/lib/mysql/pprof/*.heap|sort -V|tail -n 1) > heap-report.pdf

See also

  • Using MariaDB with tcmalloc or jemalloc

This page is licensed: CC BY-SA / Gnu FDL

Legal Documents

There are several legal aspects of MariaDB related to licensing and contributing code.

MariaDB Contributor Agreement

Questions? See the MCA FAQ page.

MariaDB Contributor Agreement (MCA)

These terms apply to your contribution of materials to a product or project owned or managed by us ('project'), and set out the intellectual property rights you grant to us (The MariaDB Foundation) in the contributed material. If this contribution is on behalf of a company, the term 'you' will also mean the company you identify below. If you agree to be bound by these terms, fill in the information requested below and provide your signature. Read this agreement carefully before signing.

  • The term 'contribution' means any source code, object code, patch, tool, sample, graphic, specification, manual, documentation, or any other material posted, committed or submitted by you to a project. Each submission must explicitly be marked that it's donated under the MCA.

  • With respect to any worldwide copyrights, or copyright applications and registrations, in your contribution:

    • you hereby assign to us joint ownership, and to the extent that such assignment is or becomes invalid, ineffective or unenforceable, you hereby grant to us a perpetual, irrevocable, non-exclusive, worldwide, no-charge, royalty-free, unrestricted license to exercise all rights under those copyrights. This includes, at our option, the right to sub-license these same rights to third parties through multiple levels of sub-licensees or other licensing arrangements; you agree that each of us can do all things in relation to your contribution as if each of us were the sole owners, and if one of us makes a derivative work of your contribution, the one who makes the derivative work (or has it made) will be the sole owner of that derivative work;

    • you agree that you will not assert any moral rights in your contribution against us, our licensees or transferees;

    • you agree that we may register a copyright in your contribution and exercise all ownership rights associated with it; and

    • you agree that neither of us has any duty to consult with, obtain the consent of, pay or render an accounting to the other for any use or distribution of your contribution.

  • With respect to any patents you grant licenses to without payment to any third party, which would restrict us from using the contributed code as if we would own a shared copyright to it, you hereby grant to us a perpetual, irrevocable, non-exclusive, worldwide, no-charge, royalty-free license to: make, have made, use, sell, offer to sell, import, and otherwise transfer your contribution in whole or in part, alone or in combination with or included in any product, work or materials arising out of the project to which your contribution was submitted, and

    • at our option, to sub-license these same rights to third parties through multiple levels of sub-licensees or other licensing arrangements.

    • Note that you don't give us rights to the patent in any other way than to give us the right to freely use the contributed code in the above ways.

  • Except as set out above, you keep all right, title, and interest in your contribution. The rights that you grant to us under these terms are effective on the date you first submitted a contribution to us, even if your submission took place before the date you sign these terms. Any contribution we make available under any license will also be made available under a suitable FSF (Free Software Foundation) or OSI (Open Source Initiative) approved license.

  • With respect to your contribution, you represent that:

    • it is an original work and that you can legally grant the rights set out in these terms;

    • it does not to the best of your knowledge violate any third party's copyrights, trademarks, patents, or other intellectual property rights; and

    • you are authorized to sign this contract on behalf of your company (if identified below).

  • These terms will be governed by the laws of Finland. Any choice of law rules will not apply.

Signatory Information

Your user name on GitHub:



Your contact information (Please print clearly): 

Your name:



Your company's name (if applicable):



Mailing address:



Telephone:



Email:

 

Your signature:



Date:

To deliver these terms to us, scan and email to

f o u n d a t i o n [at] m a r i a d b (dot) o r g

MariaDB Contributor Agreement – version 1.0

Questions? See the MCA FAQ page.

This page is licensed: CC BY-SA / Gnu FDL

MariaDB Contributor Agreement FAQ

What is the Intention of the MCA?

  1. The original copyright holder and the MariaDB Foundation both get (shared) copyright for the code.

  2. Both parties can use their copy independently of each other for any purpose (as if each copy was completely separate and unique).

What if I Don't Want to Submit Code to You Under the MCA?

If you don't like the MCA for code that you want to contribute to MariaDB, the other option is to submit your code under the BSD-new license.

What is the difference between MCA and BSD-new?

The main difference is that MCA is a copyright assignment agreement; you assign the copyright to the MariaDB Foundation (but you retain it too, on your copy of the contribution), while BSD-new is a license; you allow the MariaDB Foundation to use your contribution under specific (albeit rather broad) terms.

When should I use MCA and when should I use BSD-new?

With MCA, the MariaDB Foundation becomes a copyright holder, and it can defend the GPL and pursue violators, protecting its and your rights. This is the same reason why the Free Software Foundation requires copyright assignment too.

On the other hand, BSD-new is a well known FSF-approved Open Source license.

If you contribute on behalf of your employer, your employer's legal department is most probably already familiar with the BSD-new license. It might be much easier and faster to get them to approve your contribution if you contribute under this license.

If you are an individual contributor (or if your employer is fine with MCA), it makes sense to contribute under MCA, as it will allow the MariaDB Foundation to defend its Open Source way when needed.

For most other practical purposes there is no difference between these two ways of contributing. You keep the copyright to your contribution, and the MariaDB Foundation will be able to use your contribution in MariaDB Server, relicense it (for example, as GPLv2 or GPLv3), or use it separately as a patch.

What Will the MariaDB Foundation Do With the Code I Contribute?

The MariaDB Foundation agrees that when it dual licenses code, it will not restrict the way the third party licensee uses the licensed copy of the code nor restrict how they use their own code.

Where Did the MCA Come From?

The MCA is based on Sun's Contributor Agreement. We think it is one of the best contributor agreements around!

What's the Difference Between Sun's SCA and the MCA?

Sun's SCA was unclear in a couple of points which we have fixed in the MCA:

  • The patent grant has been made clear that it only affects the donated code, not any other code.

  • We added a clarification of how code is donated: "Each submission must explicitly be marked that it's donated under the MCA"

  • We have made a promise in this FAQ how the MariaDB Foundation will Dual-license code. See Monty's blog post "Thoughts about Dual-licensing Open Source software" for a discussion about this.

What Happened to the Monty Program Contributor Agreement?

With the establishment of the MariaDB Foundation, Monty Program Ab transferred all existing signed contributor agreements to the Foundation.

Other Questions?

Question not answered? Subscribe to the maria-developers mailing list and ask there, or visit us on our Zulip channel or the #maria IRC channel.

This page is licensed: CC BY-SA / Gnu FDL

Frequently Asked Questions

There are many questions that are frequently asked about MariaDB, the organizations behind it, and the software. They have been grouped based on common topics. Click on a topic to find the question you may have. If you don't see it, please click on the button in the left margin labeled "Ask a Question Here". We'll try to answer your question quickly.

MariaDB Community Server

  • Q: What is MariaDB Community Server and its main purpose? A: MariaDB Community Server is a popular free and open-source relational database management system (RDBMS), originally forked from MySQL. It is actively developed and maintained by MariaDB plc and a global community of contributors. It's known for its strong performance, stability, and a rich feature set suitable for a wide range of applications.

  • Q: Can MariaDB Community Server be used as a direct MySQL replacement? A: Yes, in many scenarios and for numerous versions, MariaDB Community Server is designed to be a compatible, drop-in replacement for MySQL, facilitating easy migration for users. However, as both MariaDB and MySQL evolve independently, some differences may exist in newer or more advanced features.

  • Q: What are the key benefits and features of MariaDB Community Server? A: Key benefits of MariaDB Community Server include its true open-source nature (GPLv2 license), a broader selection of advanced storage engines (like Aria, MyRocks, and Spider, in addition to the standard InnoDB), continuous feature enhancements, performance improvements, and the support of a vibrant, active global community.

  • Q: What license is MariaDB Community Server distributed under? A: MariaDB Community Server is licensed under the GNU General Public License version 2 (GPLv2), ensuring it remains free and open source.

  • Q: Where can I download MariaDB Community Server? A: You can download official MariaDB Community Server packages from the MariaDB Foundation website (mariadb.org). Additionally, it is readily available in the software repositories of most major Linux distributions for easy installation.

The mariadb-backup Tool

  • Q: What is mariadb-backup and its primary use? A: mariadb-backup is a command-line utility included with MariaDB Community Server. Its primary use is to perform physical "hot" backups of MariaDB database instances, especially those utilizing transactional storage engines like InnoDB and XtraDB. This tool is based on the well-regarded Percona XtraBackup technology.

  • Q: What are the main advantages of using mariadb-backup for database backups? A: The main advantages of mariadb-backup are its ability to perform online (hot) backups with minimal locking. This means your MariaDB database remains largely available for both read and write operations during the backup process. For large databases, mariadb-backup is generally much faster than logical backup methods (like mariadb-dump) and, when used in conjunction with MariaDB's binary logs, it enables precise point-in-time recovery (PITR).

  • Q: How does the mariadb-backup utility work to create backups? A: mariadb-backup works by copying the physical data files from the MariaDB data directory while the server is running. It continuously monitors for changes made to these files during the backup operation and records these modifications in a separate log file. In a subsequent "prepare" phase, these logged changes are applied to the copied data files to ensure they are brought to a transactionally consistent state, ready for restoration.

  • Q: Is mariadb-backup a suitable backup solution for all MariaDB storage engines? A: mariadb-backup is primarily designed for, and works most effectively with, transactional storage engines such as InnoDB and XtraDB, for which it can guarantee consistency and perform online backups. While it might be able to copy data files for other storage engines like Aria or MyISAM, it cannot ensure the same level of transactional consistency or perform true online backups for them. For non-transactional engines, mariadb-dump or filesystem-level snapshots might be more appropriate choices.

  • Q: Is the mariadb-backup tool free to use? A: Yes, mariadb-backup is an open-source tool that is included as a standard utility with MariaDB Community Server and is completely free to use.
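As a sketch of the workflow described above, a full backup is taken and then prepared with two separate invocations (the target directory and credentials below are placeholders; adjust them to your environment and required privileges):

```shell
# Copy phase: physical data files are copied while the server
# keeps running; concurrent changes are captured in a redo log.
mariadb-backup --backup \
  --target-dir=/backup/full \
  --user=backup_user --password=backup_password

# Prepare phase: apply the captured changes so the copied files
# reach a transactionally consistent, restorable state.
mariadb-backup --prepare --target-dir=/backup/full
```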

Community Questions

How Can I Contribute to MariaDB?

There are many ways to contribute to or participate in the development of MariaDB. You don't have to be a developer (though we always welcome more of those); you just have to be willing to help make MariaDB better. For example, we are looking for writers or translators of KB articles and for people to help set up MariaDB discussion groups.

Contributing to the MariaDB Project is the page that gathers together everything you need to get started.

The community category contains a lot of interesting links about how to participate.

You can also consider sponsoring a feature.

Welcome to the MariaDB community!

This page is licensed: CC BY-SA / Gnu FDL

DBA Questions

Developer Questions

How can a VBA Application Connect to MariaDB?

How can I Learn about Developing MariaDB?

See MariaDB Development for this.

This page is licensed: CC BY-SA / Gnu FDL

How many JOIN Clauses are Allowed in a Query?

According to MySQL docs: joins-limits.html

The maximum number of tables that can be referenced in a single join is 61. This also applies to the number of tables that can be referenced in the definition of a view.

Does the same maximum (61) apply to MariaDB, too?

If yes, are there plans for raising this limit?

This page is licensed: CC BY-SA / Gnu FDL

Remote connection work but often get error HY000/2002 connection timed out

Hi all,

I have created a remote connection between two Linux servers running MariaDB. One runs CentOS Stream 9 with 10.5.16-MariaDB and the second runs CentOS 7 with 5.5.68-MariaDB. Through a PHP script automated with cron I run queries, and it works.

The problem is that I often get the error HY000/2002 connection timed out. The error occurs at random times, whether run via cron or launched manually. For example, the script works 10 times and on the 11th run I receive the error (HY000/2002 connection timed out), which can occur one or more times consecutively before the script works again on a later execution. The number of times the script works or fails is always different.

I can't find the cause of the error. Any suggestions?

Thanks

This page is licensed: CC BY-SA / Gnu FDL

Tools Available for Developing on the MariaDB Code

The code is hosted on GitHub. You can branch the latest code from there, and you can also push your own changes as a new branch that can be shared with others.

Building the code from source is done with standard Unix tools: CMake (or autotools for MariaDB versions below 5.5), Gnu Make, GCC (or other C/C++ compiler on some systems). On Windows, CMake and Visual Studio are used.

The current state of the source with respect to build/test failures can be seen in buildbot.

For project management and bug tracking, we use JIRA.

The source page has links to instructions on setting up a full development environment, if you are interested.

This page is licensed: CC BY-SA / Gnu FDL

Where is the table_cache in MariaDB?

All versions of MariaDB are based on MySQL 5.1 or greater; thus, the table_cache option is deprecated in favor of table_open_cache. This is also documented at: mariadbd Options.

For further reading, please refer to the MySQL manual: How MySQL Opens and Closes Tables.

Examples of use cases:

MariaDB [(none)]> SHOW GLOBAL STATUS LIKE 'opened_tables';
+---------------+--------+
| Variable_name | Value  |
+---------------+--------+
| Opened_tables | 354858 |
+---------------+--------+
1 row in set (0.00 sec)

MariaDB [(none)]> SELECT @@table_open_cache;
+--------------------+
| @@table_open_cache |
+--------------------+
|                400 |
+--------------------+
1 row in set (0.00 sec)
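In the output above, Opened_tables is large relative to a 400-entry cache; if that counter keeps climbing during normal operation, the cache may be too small. A possible adjustment (the value 2000 is illustrative, not a recommendation):

```sql
-- Raise the open-table cache at runtime; add the setting to the
-- server's option file as well so it survives a restart.
SET GLOBAL table_open_cache = 2000;

-- Verify the new value.
SELECT @@table_open_cache;
```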

This page is licensed: CC BY-SA / Gnu FDL

Why is ORDER BY in a FROM Subquery Ignored?

A query with an ORDER BY in a FROM subquery produces an unordered result. Is this a bug? Below is an example of this:

SELECT field1, field2 FROM ( SELECT field1, field2 FROM table1 ORDER BY field2 ) ALIAS

returns a result set that is not necessarily ordered by field2. This is not a bug.

A "table" (and subquery in the FROM clause too) is - according to the SQL standard - an unordered set of rows. Rows in a table (or in a subquery in the FROM clause) do not come in any specific order. That's why the optimizer can ignore the ORDER BY clause that you have specified. In fact, the SQL standard does not even allow the ORDER BY clause to appear in this subquery (we allow it, because ORDER BY ... LIMIT ... changes the result, the set of rows, not only their order).

You need to treat the subquery in the FROM clause as a set of rows in some unspecified and undefined order, and put the ORDER BY on the top-level SELECT.

Source: MDEV-3926, Comment by Sergei Golubchik
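Applied to the example at the top of this page, the rewrite looks like this: the subquery loses its ORDER BY, and the ordering moves to the outermost SELECT, which is the only place the optimizer must honor it:

```sql
SELECT field1, field2 FROM ( SELECT field1, field2 FROM table1 ) ALIAS ORDER BY field2
```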

See also

  • MDEV-3926, MDEV-5007, MDEV-3795.

  • SELECT

  • ORDER BY

This page is licensed: CC BY-SA / Gnu FDL

General Questions

Frequently Asked Questions for MariaDB that also apply to MySQL

Does the Password Prompt Appear Before or After Establishing a Connection?

Question: When I use the mysql client as in mysql -u root -p -h <host>, does the password-enter prompt popup before or after the client has established a connection with the server?

Answer: Before the client has established a connection to the server.

How do we derive the answer?

lovegood:~ byte$ mysql -uroot -p -hlocalhost
Enter password: 
Welcome to the MariaDB monitor.  Commands end with ; or \g.
Your MariaDB connection id is 1
Server version: 5.2.8-MariaDB Source distribution

This software comes with ABSOLUTELY NO WARRANTY. This is free software,
and you are welcome to modify and redistribute it under the GPL v2 license

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

MariaDB [(none)]> \q
Bye
lovegood:~ byte$ mysql.server stop
Shutting down MySQL
. SUCCESS! 
lovegood:~ byte$ mysql -uroot -p -hlocalhost
Enter password: 
ERROR 2002 (HY000): Can't connect to local MySQL server through socket '/tmp/mysql.sock' (2)

This page is licensed: CC BY-SA / Gnu FDL

What is the Goal of MariaDB?

To provide a community developed, stable, and always Free DBMS that is, on the user level, broadly compatible with MySQL.

We strive for interoperability with both our own, and our upstream, communities.

Until MariaDB 5.5, MariaDB was kept up to date with the latest MySQL release from the same branch. For example MariaDB 5.1.47 was up to date with MySQL 5.1.47, and so on.

We did a merge from the main MySQL branch for every new MySQL release or when there was some critical bugfix applied to the main branch.

Since MariaDB 10.0, MariaDB includes backported features from MySQL as well as entirely new features not found anywhere else, but does not necessarily include all MySQL features.

We strive to keep our main trees as free from bugs as possible. It should be reasonably safe to pull from our trees at any time.

This page is licensed: CC BY-SA / Gnu FDL

Why is the Software Called MariaDB?

The 'MySQL' name is trademarked by Oracle, and they have chosen to keep that trademark to themselves. The name MySQL (just like the MyISAM storage engine) comes from Monty's first daughter My. The first part of 'MySQL' is pronounced like the English adjective, even if this doesn't match the correct pronunciation of the Finnish name.

MariaDB continues this tradition by being named after his younger daughter, Maria.

The name Maria was initially given to a storage engine. After MariaDB was started, to avoid confusion, it was renamed to Aria. The new name was decided as a result of a contest.

This page is licensed: CC BY-SA / Gnu FDL

High Availability Questions

Licensing Questions

Here you can find answers to questions about licensing MariaDB, MySQL, plugins, etc.

Licensing FAQ

In this article we will explain how the MariaDB and MySQL server and client library licenses affect other applications and what steps you can take to comply with the GPL or avoid having to change your applications to GPL.

In the following text we talk about MariaDB. The guidelines also apply to MySQL in general, at least up to 5.1.55 which was the latest stable release at the time of writing.

Licenses used by MariaDB

MariaDB is distributed under the GPL license, version 2.

The MariaDB client libraries for C, Java and ODBC are distributed under the LGPL license, version 2.1 or later. The LGPL license allows you to distribute these MariaDB client libraries freely with any application.

The MariaDB client library included with the MariaDB server is also GPL version 2, but has a FLOSS exception that allows you to combine it with most other open source software without conflicting with their license, even if that license is incompatible with the GPL. We do, however, recommend that you use the new client libraries for any non-GPL application.

Internal usage is free

The GPL license only affects code that you distribute to other parties.

Internal usage within an organization is totally free and not subject to any conditions. There is no such thing as 'internal distribution' that would restrict the usage of your code by requiring it to be GPLed.

Connecting to a remote service that runs MariaDB (or any other GPL software) in the background is also free.

For internal programs for which you own all the copyright(s), there is essentially no risk in using GPL software. The argument you can use in your defense is that if the software became GPL as part of the distribution, you as the copyright holder could immediately revert your part back to its original copyright. No one has the right to require you to reveal or redistribute your code outside your organization, even if you have distributed it internally linked with GPL software!

If your lawyers are concerned about distributions of software linked with GPL libraries between different legal entities within your organization, you can solve this by distributing your components and the GPL software separately, and have your other entity combining them. You can also switch to use the new LGPL client libraries.

Distributing an application with a MariaDB connector/client

This section is for those that want to distribute the MariaDB client library code, but not the server, with their applications.

Free software/open source applications

If your application is Free software/open source and uses one of the licenses listed in the FLOSS exception, the GPL in the client library does not affect your application.

In other cases we recommend you to use the new LGPL client libraries.

Using a connector that is not GPL

If you are using a connector that is not GPL, you are only bound by the license of that connector. Some examples are:

  • MySQL native driver for PHP - mysqlnd

  • ruby-mysql

  • LGPL client libraries for C, Java and ODBC.

The above have licenses that allow you to use them freely, without you being bound by the GPL.

Using a database source independent framework

If you are using a framework that allows you to connect dynamically to different RDBMS systems, any GPL licensed module loaded by the framework will not affect the application. Such frameworks are

  • ODBC (Open Database Connectivity)

  • JDBC (Java Database connectivity)

  • Perl

  • PHP PDO MySQL driver

The reason the GPL in the MySQL client library would not affect your application in this case is that the GPL client is supporting a standard interface and is thus merely an optional component among many. Your application could just as easily use the framework to connect to a RDBMS other than MariaDB or MySQL.

Any software can be connected to the GPL v2 licensed MySQL Connector/ODBC, without the need for that software to be GPLed. This is because there is a piece of general management software, the ODBC manager, between the GPLed MySQL Connector/ODBC and your software. If any logic required the software that interfaces with MySQL Connector/ODBC to be GPL, then that would also apply to the ODBC manager itself. Yet the ODBC manager is not GPL, neither on Windows nor on Linux. As a consequence, no one would be allowed to use the MySQL ODBC driver for anything.

Using the MariaDB client library for C

If your application is using a license that is not covered by the FLOSS exception, then you should use the new LGPL client libraries for C.

The LGPL license allows you to distribute these MariaDB client libraries freely with any application. If you modify a client library, you need to publish the new source code.

Distributing a proprietary application with the MariaDB / MySQL server

When you are distributing your application together with MariaDB or MySQL you are bound (or can be seen to be bound by some lawyers) by the GPL if some of the following statements apply:

  • You are using GPL code from MySQL linked directly to your application. (Like the MySQL GPL client library).

  • Your application requires the MariaDB server to work and without the MariaDB server it doesn't start or it has very limited functionality.

The problem with the client library can be avoided by using one of the solutions mentioned earlier.

If your application works with many databases, either natively or by using one of the database source independent frameworks, then you can freely distribute the MariaDB server with your application without being affected by the GPL. The reason for this is that MariaDB would only be an optional, independent component in your software distribution, and section 2 of the GPL explicitly allows this:

"In addition, mere aggregation of another work not based on
the Program with the Program (or with a work based on the Program) on
a volume of a storage or distribution medium does not bring the other
work under the scope of this License."

You also have the option to buy licenses for MySQL from Oracle to get MySQL under other copyright terms. If you would like to later be able to use MariaDB instead of MySQL, please ensure that your license agreement allows you to make changes to the MySQL code! (This is something that you should ensure in all cases as otherwise you may run into bugs that Oracle will not fix, you are not allowed to fix and could make MySQL software unusable for you!)

The rights to use the MariaDB code changes in your application can be requested from SkySQL.

Legal notice

The text above is written by Michael "Monty" Widenius, who is not a lawyer and you should not regard any statements of the above as 'ultimate truth' in all scenarios. On the other hand, it was David and Monty who together decided to make MySQL GPL and also decided and openly declared the intentions behind this license change, so there is some merit to information in this article.

If you want a second opinion on how the GPL works in this case, you can contact the Software Freedom Law Center or the Free Software Foundation. Neither party has had anything to do with this article, but they are the default authorities to turn to when you want to know more about the GPL or LGPL.

This page is licensed: CC BY-SA / Gnu FDL

MariaDB Licenses

MariaDB Software Questions

What is MariaDB?

MariaDB is a backwards-compatible, drop-in replacement for the MySQL® Database Server. It includes all major open source storage engines.

You can see a list of all current and old releases as well as a feature comparison to MySQL.

The source code for MariaDB is publicly available on GitHub. Binaries and packages are also available.

This page is licensed: CC BY-SA / Gnu FDL

What is MariaDB's Release Policy and Schedule?

The MariaDB Development Plans page links to plans for future versions of MariaDB.

The release schedule for upcoming MariaDB releases can be found on the MariaDB Jira release page.

See also the release criteria page.

This page is licensed: CC BY-SA / Gnu FDL

What will be in MariaDB version X?

You can read about the features and improvements included in the current stable release of MariaDB on the MariaDB versus MySQL page.

The MariaDB Development Plans page links to our plans for future versions of MariaDB.

This page is licensed: CC BY-SA / Gnu FDL

When will MariaDB version X be released?

The downloads page has the current stable and development releases.

See the MariaDB Development Plans page for plans for future versions of MariaDB.

This page is licensed: CC BY-SA / Gnu FDL

Migrating to MariaDB Questions

How can I Import Only a Table's Structure?

How do I migrate a MySQL database to Amazon RDS?

Amazon Relational Database Service (Amazon RDS) is part of Amazon Web Services and allows one to set up and run a relational database in the cloud. It supports running MariaDB in addition to MySQL, PostgreSQL, and other common relational databases.

Gee-Hwan Chuang of Listia has written a guide titled Moving a Production MySQL Database to Amazon RDS with Minimal Downtime. He recommends using mysqldump.

This page is licensed: CC BY-SA / Gnu FDL

Red Hat 6 Process Limit Errors

When upgrading from MySQL 5.1 to MariaDB 5.5, you may encounter a process limit problem with your MariaDB installation. This is not a fault in MariaDB. The issue is likely triggered by high connection counts to the database and possibly exacerbated by not using thread pooling.

You might encounter the following error message if you are not out of available memory:

SQLSTATE[HY000] [1135] Can't create a new thread (errno 11);

You can consult the manual for a possible OS-dependent bug when trying to connect. There is a very good explanation, troubleshooting steps, and a workaround on the Percona MySQL Performance Blog.

Review your connection counts and thread consumption prior to upgrading to avoid this problem.
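On RHEL 6 the per-user process limit that commonly triggers this error can be checked and raised as sketched below. The file name and limit values are typical examples only, not taken from this page; adjust them for your system:

```shell
# Sketch only: check the current per-user process/thread limit.
ulimit -u

# RHEL 6 caps non-root users via /etc/security/limits.d/90-nproc.conf
# (the packaged default is often 1024). Raise the limit for the mysql
# user with a drop-in file:
cat > 90-nproc-mysql.conf <<'EOF'
mysql soft nproc 10240
mysql hard nproc 10240
EOF
# Install the file as /etc/security/limits.d/90-nproc-mysql.conf, then
# log the mysql user out and in again (or restart mysqld) to apply it.
```

The value 10240 is an illustrative starting point; size it to your expected connection count.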

This page is licensed: CC BY-SA / Gnu FDL

Public Questions

Questions asked by the public about MariaDB and related software. Note that bugs, problems and feature requests should be submitted to the MariaDB bug database so that they can be processed properly by the MariaDB developer and QA teams.

Support Questions

Getting Help With MariaDB

If you can't find help in the MariaDB documentation, you can also subscribe to the mailing lists or Zulip to communicate with MariaDB users and developers.

There's an active help community on StackOverflow for programming with SQL and indexing questions, and DBA StackExchange for non-programming questions, for example tuning, backups, architecture, migration, and replication. Use the mariadb tag so questions are searchable by those who can help you.

If you have a question about a feature that is not properly documented or something that is not working as expected, go to the corresponding documentation page and ask your question there.

You can report and check on bugs which apply to MariaDB in JIRA.

MariaDB plc offers commercial support.

This page is licensed: CC BY-SA / Gnu FDL

Supported Software & Features

Here you can ask whether MariaDB supports a certain feature that doesn't fit into any specific category.

Before asking a question about a non-existing feature, please first check JIRA to see if there are already plans for that feature.

Contributing & Participating

There are many ways to contribute to MariaDB.

Cover

Contributing Code

For contributors interested in MariaDB development, explore open projects via JIRA and check for beginner-friendly tasks. Engage with the community on the maria-developers mailing list, Slack, Zulip, or IRC channel for guidance.

Cover

Contributing to the MariaDB Project

The success of MariaDB relies heavily on community involvement. You can contribute in various ways, even if you are not a developer.

Contributing Code

For contributors interested in MariaDB development, explore open projects via JIRA and check for beginner-friendly tasks. Engage with the community on the maria-developers mailing list, Slack, Zulip, or IRC channel for guidance.

General information about contributing to MariaDB (for developers and non-developers) can be found on the Contributing to the MariaDB Project page.

Finding Development Projects to Work on

There are many open development projects for MariaDB which you can contribute to (in addition to any ideas you may have yourself).

  • We are using JIRA to manage the MariaDB project. Go to jira.mariadb.org and click on "Projects" to get to the MariaDB project. Browse around the unresolved and unassigned issues to see if there is something that interests you. Some issues have sponsors and you can be paid for doing them!

  • A list of beginner friendly tasks is also available.

  • Check the development plans for the next MariaDB version.

  • Join maria-developers and ask for suggestions of tasks you could do. In your email, please include your programming experience, your knowledge of the MariaDB source, and how much you know about using MySQL/MariaDB, so that we know which tasks to suggest to you.

  • If this is your first project, check out the Suggested Development page. It lists projects that will make a good start.

  • Join MariaDB's Zulip instance at mariadb.zulipchat.com and ask for suggestions.

  • Join #maria on IRC and ask for suggestions.

If you have your own ideas, please submit them to JIRA so other MariaDB developers can comment on them and suggest how to implement them. You can of course also use the maria-developers list for this.

What to Expect From a MariaDB Server Developer

This section is mainly directed to developers with commit rights to the MariaDB git repository. However, we hope it’s also useful for anyone wanting to contribute code to MariaDB to know what a reviewer will expect from them.

This is not about coding style or if one should prefer C instead of C++. That would be a separate topic that should be created sooner or later.

The Basics

When coding, try to create code that 'never has to be changed again'. Try to make the code as performant as possible. In general it is acceptable to spend 50% more time to make the code 15% faster than what you originally intended. Take that into account when you plan your time estimates! That said, don't try to add classes or functionality that is not yet used.

The code should be easy to read and follow the coding standards of the project. Patches that are smaller and simpler are often better than complex solutions. Don't make the server depend on new external libraries without first checking with Sergei or Monty!

Add code comments for anything that is not obvious. When possible, use assertions within the code to document expectations of arguments etc. In general, if the code requires complex comments, think if there is a better way to structure the logic. Simpler is often better and with fewer bugs.

What to Have in a Commit Comment

  • Jira issue number and summary, e.g.: MDEV-23839 innodb_fast_shutdown=0 hang on change buffer merge

  • An empty line

  • A short description of the problem

  • A description of the solution

  • Any extra information needed to understand the patch

  • The commit message should be self-contained; preferably, the reviewer shouldn't have to look at Jira at all to understand the commit. This doesn't mean that the commit message should include all the background and the different design options considered; that is what Jira should contain.

  • The names of all reviewers and authors should be clear from the commit message. The preferred format is (one line per person):

    • Reviewed-by: email

    • Co-authored-by: email

    • See creating-a-commit-with-multiple-authors for details

  • The default is that all code should be reviewed. Only in really extraordinary cases, like a merge (where the original code was already reviewed), can it be self-reviewed, and this should be clear from the commit message. In such a case, the code should of course be tested extra carefully, both locally and in buildbot, before pushing.
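Putting these points together, a commit message might look like the following sketch. The MDEV number, summary, problem description, and email addresses are made-up placeholders:

```
MDEV-12345 Server hangs on shutdown during change buffer merge

The shutdown code waited for the change buffer merge to complete
while holding a mutex that the merge thread also needed, causing
a deadlock.

Release the mutex before waiting for the merge thread, and
re-acquire it afterwards.

Reviewed-by: reviewer@example.org
Co-authored-by: coauthor@example.org
```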

Testing

  • All code should have a test case that shows that the new code works or, in the case of a bug fix, that the problem is fixed! It should fail with an unpatched server and work with the new version. In the extreme case that a test case is practically impossible to create, there needs to be documentation (in the commit message, optionally also in Jira) of how the code was tested.

  • The test case should have a reference to the Jira issue, if one exists.

  • Patches related to performance should be tested either by the developer (for simple commits) or by performance testers. The result should be put in Jira with a summary in the commit.

  • Complex patches should be tested by QA in a bb- branch before pushing. The Jira entry should include information that this has been done and what kind of test has been run.

    • Example: git push --force origin HEAD:bb-11.8-MDEV-1234

  • For anything non-trivial, one should run either Valgrind or ASAN/MSAN on the new code. (Buildbot will do this for you if you can't get valgrind or ASAN to work.) At least the added test case should be tested by one of them. If the developer cannot do that for some reason, they should check the buildbot builders that do this and ensure that at least their test case doesn't give any warnings about use of uninitialized memory or other failures.

  • For complex code the developer should preferably use gcov or some similar tool to ensure that at least all non-error branches are executed. “mtr --gcov” or “dgcov.pl” can help you with this.

Getting Your Code into the Main MariaDB Tree

All code in MariaDB comes from one of the following sources:

  • MySQL

  • Code developed by people employed by the MariaDB Foundation.

  • Code developed by people employed by MariaDB Corporation.

  • Code shared with the MariaDB Foundation under the MCA.

  • Code with a known origin that is under a permissive license (BSD or public domain).

If you want the code to be part of the main MariaDB tree, you also have to give the MariaDB Foundation a shared copyright to your code. This is needed so that the foundation can offer the code to other projects (like MySQL).

You do this by either:

  1. Signing the MariaDB Contributor Agreement (MCA) and then scanning and sending it to the foundation.

  2. Sending an email to maria-developers where you say that your patch and all fixes to it are provided to the MariaDB Foundation under the MCA.

  3. Licensing your code using the BSD license.

We need shared copyright for the following reasons:

  1. to defend the copyright or GPL if someone breaks it (this is the same reason why the Free Software Foundation also requires copyright assignment for its code)

  2. to be able to donate code to MySQL (for example to fix security bugs or new features)

  3. to allow people who have a non-free license to the MySQL code to also use MariaDB (the MCA/BSD allows us to give those companies the rights to all changes between MySQL and MariaDB so they can use MariaDB instead of MySQL)

More information about the MCA can be found on the MCA FAQ page.

Before Pushing Code to a Stable Branch

  • Ensure that you have compiled everything for your new code in a debug server (configured with cmake -DCMAKE_BUILD_TYPE=Debug), including embedded and all plugins that may be affected by your code change.

  • Run the mysql-test-run (mtr) test suite locally with your debug server.

  • For anything complex the full test suite should be run.

  • For something absolutely trivial, at least the main suite must be run.

  • Always push first to a bb- branch to test the code. When the bb- branch is green in buildbot, you can push to the main branch. Take care to check that the Windows build compiles (take extra care with this, as it often fails) and that the valgrind and msan builds don't show any problems with your new test cases.

    • You can find your push at the link similar to grid?branch=your-branch-name.

  • If you have to do a rebase before pushing, you have to start from the beginning again.

  • When porting code from third parties (such as MySQL), make sure to attribute copyright to the right owner, in the header of each modified file.

  • For example: Copyright © 2000, 2018, Oracle and/or its affiliates. Copyright © 2009, 2020, MariaDB

  • The only exception is that if the changes are trivial, the rebase was trivial, and the local mysql-test-run worked, then you can push directly to the main branch. Only do this if you are 99% sure there are no issues! Please don't make us regret that we have made this one exception! Once we have protected git branches, the above rule will be enforced automatically, as the protection will take care of this.
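The steps above can be sketched roughly as follows. This assumes you are inside a MariaDB source checkout; the branch name, version, and parallelism values are examples only:

```shell
# Sketch only; assumes a MariaDB source tree. Build a debug server:
cmake -DCMAKE_BUILD_TYPE=Debug . && make -j4

# Run the test suite locally with the debug server: at minimum the
# main suite, the full suite for anything complex.
cd mysql-test
./mysql-test-run.pl --parallel=4 --suite=main

# Push to a bb- buildbot branch first; push to the main branch only
# after the bb- branch is green in buildbot.
git push --force origin HEAD:bb-11.8-MDEV-1234
```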

Working on a New Project

  • First create a Jira entry that explains the problem and the different solutions that can be used to solve it. If there is new syntax, include examples of queries and results.

  • After getting an agreement of the to-be-used solution, update the Jira entry with the detailed architecture of the suggested solution.

  • When the architecture is reviewed, the assigned developer can start coding.

  • When the code is ready, the Jira entry should be updated with the reviewer.

  • The reviewer checks the code and either approves it to be pushed or gives the developer comments on what should be fixed. In the latter case, the developer updates the code and gives it back to the reviewer. This continues until the code is approved.

  • If the design changes during the project, the design in Jira needs to be updated.

Working on a Bug Fix

  • Ensure that the Jira issue is up to date.

  • For complex bugs that require redesign, follow the process in "Working on a new project"

  • For simpler bugs, one can skip the listing of different solutions and architecture. However, one should still document the reason for the bug and how it is fixed or to be fixed, in a JIRA comment.

Making Things Easier for Reviewers

  • Ensure that the code compiles and all MTR tests pass before asking for a review.

  • Try to split a bigger project into smaller, self-contained change sets.

  • Mechanical changes, like renames of classes, variables, functions, etc., are better done in a separate commit.

When Reviewing Code

Remember that the stability and security of any project depend heavily on the reviewers. If there is something wrong with an accepted patch, it's usually the reviewer who is to blame, as the reviewer was the one who allowed it to go in!

  • Ensure that the code is licensed under New BSD or another approved license for MariaDB (basically any open source license not conflicting with GPL) or that the contributor has signed the MCA.

  • GPL is only allowed for code from MySQL (as MariaDB is already depending on MySQL code).

  • Ensure that commits are not too large. If the code is very large, give suggestions how to split it into smaller pieces. Merge commits, when rebasing is possible, are not allowed, to keep history linear.

  • Check that the commit message describes the commit properly. For code that improves performance, ensure that Jira and the commit message contain information about the improvements.

  • Check that there are no unexplained changes in old tests.

  • Check the quality of the code (no obvious bugs, right algorithms used)

  • Check if any code can be simplified or optimized. Using already existing functions, are loops optimal, are mutexes used correctly etc.

  • Check that there is an appropriate test case for the code. See ‘testing’ for what is required!

  • Ensure the code follows the coding standard for MariaDB. This document should be created shortly; in the meantime, ask an experienced MySQL/MariaDB developer if you are unsure.

  • Ensure that the code follows the architecture agreed on in Jira (if it's in Jira).

  • Code should be easy to understand (good code comments, good function and variable names etc).

  • Ensure you understand every single line of code that is reviewed. If not, ask the developer to add more comments to get things clear, or ask for help from another reviewer.

  • No performance degradations for all common cases.

  • Any code that touches any sensitive area (files, communication, login, encryption or security) needs to have another reviewer that is an expert in this area.

See Also

  • Getting Started For Developers (mariadb.org)

  • Get the Code, Build It, Test It (mariadb.org)

  • Writing Good Test Cases for MariaDB Server (mariadb.org)

  • Submitting a Pull Request (mariadb.org)

  • Contributing to the MariaDB Project (for non-developers)

<<nowiki>>
<<style class="redbox centered">>
**The MariaDB source is now hosted on Github:  https://github.com/MariaDB/server. \\See [[using-git|Using Git]]. The information below is outdated and will be rewritten at some point.**
<</style>>

== Prerequisites

You need [[http://bazaar-vcs.org|Bazaar]] for revision control.

=== bzr Login Setup

* Get a launchpad account at https://launchpad.net/+login
* When logged in, setup your SSH keys
**  Click your name (top right corner)
**  Click Change Details link (upper right)
**  Click SSH Keys button (middle of page)
**  Upload your public SSH key
([[https://help.launchpad.net/YourAccount/CreatingAnSSHKeyPair|How do I get a public key?]])
* Register yourself with launchpad from your local commandline: <<fixed>>bzr launchpad-login [yourloginid]<</fixed>>

=== Getting the MariaDB Code

# First, [[getting-the-mariadb-source-code|get a clean copy of the MariaDB code]]
(for some, the fastest way to get up and running is to follow the
instructions in the "Source Tree Tarball" section).

#decimal.2 Once you have a clean copy of the source, create a working copy for
  your changes:
<<code lang=sh inline=false indent=1>>
cd $maria-repo # ex: ~/repos/maria
bzr branch trunk maria-fix-bugNNNNNN
cd maria-fix-bugNNNNNN
<</code>>

<<style class="bluebox">>
**Tip:** //Use descriptive names such as maria-fix-bugNNNNNN (where NNNNNN is the bug # of course).//
<</style>>

#decimal.3 You should now be ready to [[generic-build-instructions|Compile MariaDB]]. It's
a good idea to compile at this stage to confirm the source tree you are going
to be hacking on works . . . before you make any changes.

The following pages will help you get up and running with the MariaDB source
code:

* [[getting-the-mariadb-source-code|Getting the MariaDB Source Code]]
* Setup your build environment:
** [[build-environment-setup-for-linux|Linux]]
** [[build-environment-setup-for-mac|Mac]]
** [[building-mariadb-on-windows|Windows]]
* [[compiling-mariadb-from-source|Building MariaDB]]

== Setting up tests

The Maria test suite is contained in the <<code>>./mysql-test/<</code>>
subdirectory of the source tree. The mysql-test directory has two
subdirectories of utmost concern to you, the bug fixer: the <<code>>t/<</code>>
directory and the <<code>>r/<</code>> directory (for "tests" and "results"
respectively). Be sure to check and see if your bug already has a test too.

All the tests are found in the <<code>>t/<</code>> directory. Open up the file
that corresponds to the functionality you are changing (or add a new file) and
add the commands that will reproduce the bug or validate the new functionality.

For example, the test below creates a new test table "t1"; shows us the result
of that CREATE TABLE statement; and lastly, we cleanup the test by dropping the
test table:

<<sql>>
#
# Bug #XXXXXX: INET_ATON() returns signed, not unsigned
#
create table t1 select INET_ATON('255.255.0.1') as `a`;
show create table t1;
drop table t1;
<</sql>>

By adding your test first, it will remind you to re-record the test output file
later (and inform future efforts about your expected output, of course). Now
it's time to make your changes.

Examine existing tests to get a better idea of how you should write your test.

We are always on the lookout for better tests, so if you create new test or
improve an existing test, please upload it to the "private" folder on our
[[FTP]] server and then ping us on [[IRC]] or send a note to the
Maria-Developers mailing list to let us know about it.

== Editing and Adding to Your Contribution

With a working version, you can commence making changes in your new branch,
committing code regularly to your local working copy; feel free to commit early
and often as you will formalize your contribution later with a push and
proposal.
 
<<code lang=sh>>
cd $maria-repo/maria-fix-bugNNNNNN
# Make Changes
bzr commit -m "Merge comment"
<</code>>

Prior to publishing your completed work, you need to confirm that your branch
works as expected and update the test runs.

To allow others to see your commits, you should configure
[[setting-up-and-using-bazaar-bzr-with-mariadb|bzr]] to send its commit emails
to
[[http://lists.askmonty.org/cgi-bin/mailman/listinfo/commits|commits 'at' mariadb 'dot' org]]
email list.

== Testing Your Branch

Make sure you have at least libtool 1.5.22
([[http://www.gnu.org/software/libiconv/|found here]]).

First, check to see that all the tests pass (remember the test you set up
earlier? It should fail. That's ok, you will re-record it momentarily):

<<code lang=sh>>
cd $maria-repo/mysql-test
./mysql-test-run
<</code>>

Any that fail will need to be re-recorded (assuming the new result is correct).

<<code lang=sh>>
cd $maria-repo/mysql-test
./mysql-test-run --record $test # where $test is the name of the test that failed 
<</code>>

You are now ready to merge into trunk.

== Merging Recent Changes

It is important to merge any changes from trunk into your branch before pushing
or publishing.

Update your local trunk.

<<code lang=sh>>
cd $maria-repo 
cd trunk
bzr pull   
<</code>>

Updating your local branch.

<<code lang=sh>>
cd $maria-repo
cd maria-fix-bugNNNNNN
bzr merge ../trunk
bzr commit -m "Merged from trunk"
<</code>>

Conflicts can be resolved in bazaar via:

<<code lang=sh>>
bzr resolve $filename
<</code>>

To revert to your last commit on your branch use:

<<code lang=sh>>
bzr revert $filename
<</code>>

(Note you will need to remerge with trunk before pushing)

Verify differences carefully

<<code lang=sh>>
bzr diff
<</code>>

== Publish Your Branch

When all changes are merged and your changes are all consistent you can push
your branch to LaunchPad

<<code lang=sh inline=false>>
cd $maria-repos/$your-branch # where $your-branch is the branch you want to push (ex: maria-bugNNNN)
bzr push lp:~[yourloginid]/maria/$your-branch
<</code>>

If you find that this takes a very long time (eg. >30 minutes), you may want to
try using '<<code>>bzr init-repo --format=1.9<</code>>' to initialize a new
repo and merge your work into it, then push again.

== How to Propose Branch for Merging
 
On your Launchpad Code page
{{{https://code.launchpad.net/~{yourloginid}/maria/{branch-name} }}} click the
**Propose for merging into another branch** link to propose branch to the
maintainers to be merged into the main trunk.

== Fix Branch (if needed)

If fixes are needed on your branch you will need to: make the changes, re-merge
any new changes to trunk, commit and re-push; you do not need to re-propose.
After the push, LaunchPad will pick up the changes automagically.

Please be aware that changes can take a few minutes for LaunchPad to merge your
new changes into your proposal.

== How to Re-Submit a Proposal

At the moment, this is a tricky process, and none of the "Request another
review" links on Launchpad work.

To resubmit a merge proposal, follow these steps:
# On the main page of the merge proposal, the top line will be something like
  "Status: Needs Review". Just to the right of this is a small button; click on
  this to change the status.
# Select "Resubmit" from the drop down menu and click "Change Status".
# The next page should prompt you to resubmit the merge proposal and inform you
  that the new proposal will supersede the old one
# Click "Resubmit" to finish.

A couple of easy ways to get attention to your proposed merge are:

* Join the #maria [[IRC]] channel on [[https://libera.chat/]] ask people to review/discuss your
  merge.
* Subscribe to and send an email to the
  [[http://launchpad.net/~maria-developers|maria-developers]] group on
  Launchpad.
<</nowiki>>

This page is licensed: CC BY-SA / Gnu FDL

Contributing to the MariaDB Project

How to Contribute to MariaDB

The success of MariaDB relies heavily on community involvement. You can contribute in various ways, even if you are not a developer:

  • Bug Reporting: Create an account to report bugs.

  • Knowledge Sharing: Answer questions in the documentation or on IRC and Zulip.

  • Testing: Upload bug tests to the FTP server for private testing.

  • Documentation: Write or translate articles and documentation for MariaDB.

  • Advocacy: Promote MariaDB and participate in open source events.

  • Support: Donate time or resources, or engage your company in supporting MariaDB.

Explore more ways to get involved on the MariaDB Foundation page.

The success of MariaDB depends on the participation of the community. MariaDB would not be as good as it is today without the participation of the entire MariaDB community.

Getting Started

There are several ways to contribute besides Contributing Code, and not all of them require strong C/C++ skills. Areas for non-developers include:

  • Reporting Bugs

    • To report a bug you'll need to sign up for an account by clicking on the Create an account link below the login fields

  • Suggesting Features

  • Helping other people by answering problems or even fixing their bugs on IRC in the #maria channel on Libera.Chat or on MariaDB's Zulip instance at mariadb.zulipchat.com.

  • Testing and Benchmarking

    • Bug tests can be uploaded to the 'private' directory of our FTP server.

  • Creating documentation for MariaDB, or translating existing documentation.

  • Advocating MariaDB in your area.

  • Participate in open source events and talk about MariaDB.

  • Running a BuildBot on a platform which does not currently have builders.

  • Donate time or money to the MariaDB project.

  • Ask your company to sponsor a feature.

  • MariaDB Foundation page on getting involved

MariaDB Email Lists

A great way to get started in MariaDB is to participate in e-mail discussions via our mailing lists (whichever list best matches your interests):

  • developers

  • discuss

  • docs

Sensitive security issues can be sent directly to the persons responsible for MariaDB security: security [AT] mariadb (dot) org.

You can find additional email addresses, email archives, and ways to connect with MariaDB people here.

All MariaDB contributors are expected to follow the Ubuntu Code Of Conduct.

Getting Started for Developers

  • Contributing Code

See also the pages for new developers on the MariaDB Foundation website:

  • Getting Started For Developers

  • Get the Code, Build It, Test It

  • Writing Good Test Cases for MariaDB Server

  • Submitting a Pull Request

This page is licensed: CC BY-SA / Gnu FDL

MariaDB Public FTP Server

MariaDB provides a secure FTP, SFTP and WebDAV server where you can upload files to be used by MariaDB developers, for example table structures and data for bug reports.

The folder tree consists of:

  • The public folder for files that the MariaDB developers want to give the public access to (patches, samples etc).

  • The private folder for uploads. Files uploaded there can only be accessed by MariaDB developers. You will not be able to see your upload, and this folder does not allow downloads. This is done to protect any sensitive information which may be in test results, mysqld, and core files. Upload those into this folder.

  • The secret folder is for private downloads. Files in this folder are not visible so you will need the complete filename to successfully download a file from this folder.

To share files with MariaDB developers, upload them into the private directory using one of the following:

  • SFTP client (scp), enter 'anonymous' as the password:

scp MDEV-XXXXX.tgz anonymous@ftp.mariadb.org:private/
(anonymous@ftp.mariadb.org) Password: 
MDEV-XXXXX.tgz                                                          100%  152KB 218.8KB/s   00:00    
scp: remote fsetstat: Permission denied

You can ignore the 'fsetstat: Permission denied' error.

  • WebDAV client (curl):

curl -T MDEV-XXXXX.tgz -u anonymous:anonymous https://webdav.mariadb.org/private/MDEV-XXXXX.tgz
Created
  • FTP client (lftp); enter 'anonymous' as the password:

lftp -u anonymous -e 'put  MDEV-XXXXX.tgz' ftp://ftp.mariadb.org/private/
Password: 
cd ok, cwd=/private                                            
put: Access failed: 550 Issue during transfer: network error: error transferring data: read tcp
[...] read: connection reset by peer (MDEV-XXXXX.tgz)

You can ignore the 'network error'.

Note for MariaDB developers: if you do not already have access to the SFTP service, please request it at ftp@mariadb.org (provide your public SSH key and username). You will then be able to access the service with:

sftp user@ftp.mariadb.org

or with HTTPS at https://ftp.mariadb.org.

Sponsoring the MariaDB Project

Sponsorships are crucial for ongoing and future development of the MariaDB project! There are a number of easy ways for you to help the project:

  • Fund the development of a specific feature. You can find a list of suggested features to sponsor here or in JIRA. Feel free to sign in and add more projects to either place!

  • Contribute with developer time. If your organization has talented developers familiar with MariaDB or MySQL codebase they can become part of the MariaDB team and contribute to the development of the MariaDB project.

  • Hire a developer that you dedicate to work on the MariaDB project.

  • A pure donation with no strings attached

This page is licensed: CC BY-SA / Gnu FDL

Log of MariaDB Contributions

How to Document Contributions

Add the following in the global comment for each contribution:

Patch: Name, url or where we got patch
Author:   ....
License:  MCA or BSD
Reviewer: ....

For those cases where this is not done, please add to this page a short line for each push into MariaDB that includes code from contributors not employed by the MariaDB Foundation or the MariaDB Corporation. The purpose of this is to properly track that all such patches are submitted either under the MCA or BSD-new, and to ensure that developers get credit for their work.

Example:

Feature/Patch name
* Author(s)
* Author has signed MCA on "date" | Patch was licensed under BSD

(Please enhance the example with anything that makes sense.)

MariaDB 11.7 Log of Contributions

  • MariaDB 11.7.1 contributors

MariaDB 11.6 Log of Contributions

  • MariaDB 11.6.2 contributors

  • MariaDB 11.6.1 contributors

MariaDB 11.5 Log of Contributions

  • MariaDB 11.5.2 contributors

  • MariaDB 11.5.1 contributors

MariaDB 11.4 Log of Contributions

  • MariaDB 11.4.5 contributors

  • MariaDB 11.4.4 contributors

  • MariaDB 11.4.3 contributors

  • MariaDB 11.4.2 contributors

  • MariaDB 11.4.1 contributors

  • Binary log writing speed was improved by moving checksum calculations out of the global binlog mutex. This is a contribution by Kristian Nielsen (MDEV-31273)

  • New system variable max_binlog_total_size enables binary log purging when the total size of all binary logs exceeds the specified threshold. The implementation is based on the patch from Percona (MDEV-31404)

  • FULL_NODUP is a new value for the binlog_row_image system variable. It essentially works like FULL, that is all columns are included in the event, but it takes less space, because the after image omits columns that were not changed by the UPDATE statement, and have same values as in the before image. This is a contribution from Alibaba (MDEV-32589)

MariaDB 11.3 Log of Contributions

  • MariaDB 11.3.2 contributors

  • MariaDB 11.3.1 contributors

MariaDB 11.2 Log of Contributions

  • MariaDB 11.2.6 contributors

  • MariaDB 11.2.5 contributors

  • MariaDB 11.2.4 contributors

  • MariaDB 11.2.3 contributors

  • MariaDB 11.2.2 contributors

  • MariaDB 11.2.1 contributors

MariaDB 11.1 Log of Contributions

  • MariaDB 11.1.6 contributors

  • MariaDB 11.1.5 contributors

  • MariaDB 11.1.4 contributors

  • MariaDB 11.1.3 contributors

  • MariaDB 11.1.2 contributors

  • MariaDB 11.1.1 contributors

MariaDB 11.0 Log of Contributions

  • MariaDB 11.0.6 contributors

  • MariaDB 11.0.5 contributors

  • MariaDB 11.0.4 contributors

  • MariaDB 11.0.3 contributors

  • MariaDB 11.0.2 contributors

  • MariaDB 11.0.1 contributors

MariaDB 10.11 Log of Contributions

  • MariaDB 10.11.11 contributors

  • MariaDB 10.11.10 contributors

  • MariaDB 10.11.9 contributors

  • MariaDB 10.11.8 contributors

  • MariaDB 10.11.7 contributors

  • MariaDB 10.11.6 contributors

  • MariaDB 10.11.5 contributors

  • MariaDB 10.11.4 contributors

  • MariaDB 10.11.3 contributors

  • MariaDB 10.11.2 contributors

  • MariaDB 10.11.1 contributors

MariaDB 10.10 Log of Contributions

  • MariaDB 10.10.7 contributors

  • MariaDB 10.10.6 contributors

  • MariaDB 10.10.5 contributors

  • MariaDB 10.10.4 contributors

  • MariaDB 10.10.3 contributors

  • MariaDB 10.10.2 contributors

  • MariaDB 10.10.1 contributors

MariaDB 10.9 Log of Contributions

  • MariaDB 10.9.8 contributors

  • MariaDB 10.9.7 contributors

  • MariaDB 10.9.6 contributors

  • MariaDB 10.9.5 contributors

  • MariaDB 10.9.4 contributors

  • MariaDB 10.9.3 contributors

  • MariaDB 10.9.2 contributors

  • MariaDB 10.9.1 contributors

MariaDB 10.8 Log of Contributions

  • MariaDB 10.8.8 contributors

  • MariaDB 10.8.7 contributors

  • MariaDB 10.8.6 contributors

  • MariaDB 10.8.5 contributors

  • MariaDB 10.8.4 contributors

  • MariaDB 10.8.3 contributors

  • MariaDB 10.8.2 contributors

  • MariaDB 10.8.1 contributors

MariaDB 10.7 Log of Contributions

  • MariaDB 10.7.8 contributors

  • MariaDB 10.7.7 contributors

  • MariaDB 10.7.6 contributors

  • MariaDB 10.7.5 contributors

  • MariaDB 10.7.4 contributors

  • MariaDB 10.7.3 contributors

  • MariaDB 10.7.2 contributors

  • MariaDB 10.7.1 contributors

MariaDB 10.6 Log of Contributions

  • MariaDB 10.6.21 contributors

  • MariaDB 10.6.20 contributors

  • MariaDB 10.6.19 contributors

  • MariaDB 10.6.18 contributors

  • MariaDB 10.6.17 contributors

  • MariaDB 10.6.16 contributors

  • MariaDB 10.6.15 contributors

  • MariaDB 10.6.14 contributors

  • MariaDB 10.6.13 contributors

  • MariaDB 10.6.12 contributors

  • MariaDB 10.6.11 contributors

  • MariaDB 10.6.10 contributors

  • MariaDB 10.6.9 contributors

  • MariaDB 10.6.8 contributors

  • MariaDB 10.6.7 contributors

  • MariaDB 10.6.6 contributors

  • MariaDB 10.6.5 contributors

  • MariaDB 10.6.4 contributors

  • MariaDB 10.6.3 contributors

  • MariaDB 10.6.2 contributors

  • MariaDB 10.6.1 contributors

  • MariaDB 10.6.0 contributors

MariaDB 10.5 Log of Contributions

  • MariaDB 10.5.28 contributors

  • MariaDB 10.5.27 contributors

  • MariaDB 10.5.26 contributors

  • MariaDB 10.5.25 contributors

  • MariaDB 10.5.24 contributors

  • MariaDB 10.5.23 contributors

  • MariaDB 10.5.22 contributors

  • MariaDB 10.5.21 contributors

  • MariaDB 10.5.20 contributors

  • MariaDB 10.5.19 contributors

  • MariaDB 10.5.18 contributors

  • MariaDB 10.5.17 contributors

  • MariaDB 10.5.16 contributors

  • MariaDB 10.5.15 contributors

  • MariaDB 10.5.14 contributors

  • MariaDB 10.5.13 contributors

  • MariaDB 10.5.12 contributors

  • MariaDB 10.5.11 contributors

  • MariaDB 10.5.10 contributors

  • MariaDB 10.5.9 contributors

  • MariaDB 10.5.8 contributors

  • MariaDB 10.5.7 contributors

  • MariaDB 10.5.6 contributors

  • MariaDB 10.5.5 contributors

  • MariaDB 10.5.4 contributors

  • MariaDB 10.5.3 contributors

  • MariaDB 10.5.2 contributors

  • MariaDB 10.5.1 contributors

  • MariaDB 10.5.0 contributors

MariaDB 10.4 Log of Contributions

  • MariaDB 10.4.34 contributors

  • MariaDB 10.4.33 contributors

  • MariaDB 10.4.32 contributors

  • MariaDB 10.4.31 contributors

  • MariaDB 10.4.30 contributors

  • MariaDB 10.4.29 contributors

  • MariaDB 10.4.28 contributors

  • MariaDB 10.4.27 contributors

  • MariaDB 10.4.26 contributors

  • MariaDB 10.4.25 contributors

  • MariaDB 10.4.24 contributors

  • MariaDB 10.4.23 contributors

  • MariaDB 10.4.22 contributors

  • MariaDB 10.4.21 contributors

  • MariaDB 10.4.20 contributors

  • MariaDB 10.4.19 contributors

  • MariaDB 10.4.18 contributors

  • MariaDB 10.4.17 contributors

  • MariaDB 10.4.16 contributors

  • MariaDB 10.4.15 contributors

  • MariaDB 10.4.14 contributors

  • MariaDB 10.4.13 contributors

  • MariaDB 10.4.12 contributors

  • MariaDB 10.4.11 contributors

  • MariaDB 10.4.10 contributors

  • MariaDB 10.4.9 contributors

  • MariaDB 10.4.8 contributors

  • MariaDB 10.4.7 contributors

  • MariaDB 10.4.6 contributors

  • MariaDB 10.4.5 contributors

  • MariaDB 10.4.4 contributors

  • MariaDB 10.4.3 contributors

  • MariaDB 10.4.2 contributors

  • MariaDB 10.4.1 contributors

  • MariaDB 10.4.0 contributors

MariaDB 10.3 Log of Contributions

Instant ADD COLUMN (MDEV-11369)

  • Tencent Game DBA Team, developed by vinchen.

UPDATE statements with the same source and target (MDEV-12874)

  • Jerome Brauge.

Per-engine mysql.gtid_slave_pos tables (MDEV-12179)

  • Kristian Nielsen funded by Booking.com.

The MariaDB Foundation website provides a more detailed list of contributors by release, starting from MariaDB 10.3.5

  • MariaDB 10.3.39 contributors

  • MariaDB 10.3.38 contributors

  • MariaDB 10.3.37 contributors

  • MariaDB 10.3.36 contributors

  • MariaDB 10.3.35 contributors

  • MariaDB 10.3.34 contributors

  • MariaDB 10.3.33 contributors

  • MariaDB 10.3.32 contributors

  • MariaDB 10.3.31 contributors

  • MariaDB 10.3.30 contributors

  • MariaDB 10.3.29 contributors

  • MariaDB 10.3.28 contributors

  • MariaDB 10.3.27 contributors

  • MariaDB 10.3.26 contributors

  • MariaDB 10.3.25 contributors

  • MariaDB 10.3.24 contributors

  • MariaDB 10.3.23 contributors

  • MariaDB 10.3.22 contributors

  • MariaDB 10.3.21 contributors

  • MariaDB 10.3.20 contributors

  • MariaDB 10.3.19 contributors

  • MariaDB 10.3.18 contributors

  • MariaDB 10.3.17 contributors

  • MariaDB 10.3.16 contributors

  • MariaDB 10.3.15 contributors

  • MariaDB 10.3.14 contributors

  • MariaDB 10.3.13 contributors

  • MariaDB 10.3.12 contributors

  • MariaDB 10.3.11 contributors

  • MariaDB 10.3.10 contributors

  • MariaDB 10.3.9 contributors

  • MariaDB 10.3.8 contributors

  • MariaDB 10.3.7 contributors

  • MariaDB 10.3.6 contributors

  • MariaDB 10.3.5 contributors

MariaDB 10.2 Log of Contributions

New variable read_binlog_speed_limit permits restricting the speed at which the slave reads the binlog from the master (MDEV-11064)

  • Tencent Game DBA Team, developed by chouryzhou.

Compression of events in the binary log (MDEV-11065)

  • Tencent Game DBA Team, developed by vinchen.

No Pad collations (MDEV-9711)

  • Daniil Medvedev

Flashback

  • Lixun Peng, Alibaba

Implement non-recursive common table expressions (MDEV-8789) Implement recursive common table expressions (MDEV-9864) Pushdown conditions into non-mergeable views/derived tables (MDEV-9197)

  • Galina Shalygina

Backporting Delayed replication (MDEV-7145) from MySQL 5.6

  • Kristian Nielsen funded by Booking.com

The MariaDB Foundation website provides a more detailed list of contributors by release, starting from MariaDB 10.2.14

  • MariaDB 10.2.43 contributors

  • MariaDB 10.2.42 contributors

  • MariaDB 10.2.41 contributors

  • MariaDB 10.2.40 contributors

  • MariaDB 10.2.39 contributors

  • MariaDB 10.2.38 contributors

  • MariaDB 10.2.37 contributors

  • MariaDB 10.2.36 contributors

  • MariaDB 10.2.35 contributors

  • MariaDB 10.2.34 contributors

  • MariaDB 10.2.33 contributors

  • MariaDB 10.2.32 contributors

  • MariaDB 10.2.31 contributors

  • MariaDB 10.2.30 contributors

  • MariaDB 10.2.29 contributors

  • MariaDB 10.2.28 contributors

  • MariaDB 10.2.27 contributors

  • MariaDB 10.2.26 contributors

  • MariaDB 10.2.25 contributors

  • MariaDB 10.2.24 contributors

  • MariaDB 10.2.23 contributors

  • MariaDB 10.2.22 contributors

  • MariaDB 10.2.21 contributors

  • MariaDB 10.2.20 contributors

  • MariaDB 10.2.19 contributors

  • MariaDB 10.2.18 contributors

  • MariaDB 10.2.17 contributors

  • MariaDB 10.2.16 contributors

  • MariaDB 10.2.15 contributors

  • MariaDB 10.2.14 contributors

MariaDB 10.1 Log of Contributions

Replication, optimizer, security, speed enhancements, bug fixing, etc

  • MariaDB Corporation

Power8 optimization

  • MariaDB Foundation

  • Stewart Smith

  • In cooperation with IBM

Documentation

  • MariaDB Foundation

Query timeouts

  • MariaDB Foundation

Character set enhancements and speedups

  • MariaDB Foundation

Upgraded regexp library

  • MariaDB Foundation

Reviews for replication, encryption, compression, Galera, Connect storage engine, Mroonga storage engine, Spider, OR REPLACE etc.

  • MariaDB Foundation

Encryption, scrubbing, enhanced semisync, dump thread enhancements, thd_specifics plugin service

  • Google

Table level encryption, plugin for secure encryption

  • Eperi GmbH

Defragmentation, online alter progress monitoring

  • Kakao Inc

Galera (wsrep patches)

  • Codership

Compound statements

  • Antony Curtis

CREATE OR REPLACE/IF NOT EXISTS

  • Sriram Patil

New status variables for replication

  • Daniel Black

RESET MASTER TO #

  • Daniël van Eeden

Atomic writes, page compression, trim, multi-threaded flush for XtraDB/InnoDB

  • In cooperation with FusionIO

The MariaDB Foundation website provides a more detailed list of contributors by release, starting from MariaDB 10.1.32

  • MariaDB 10.1.48 contributors

  • MariaDB 10.1.47 contributors

  • MariaDB 10.1.46 contributors

  • MariaDB 10.1.45 contributors

  • MariaDB 10.1.44 contributors

  • MariaDB 10.1.43 contributors

  • MariaDB 10.1.42 contributors

  • MariaDB 10.1.41 contributors

  • MariaDB 10.1.40 contributors

  • MariaDB 10.1.39 contributors

  • MariaDB 10.1.38 contributors

  • MariaDB 10.1.37 contributors

  • MariaDB 10.1.36 contributors

  • MariaDB 10.1.35 contributors

  • MariaDB 10.1.34 contributors

  • MariaDB 10.1.33 contributors

  • MariaDB 10.1.32 contributors

Also Used Code Snippets by

Facebook

  • Defragmentation, prefix index queries optimization, lazy flushing, buffer pool list scan optimization, configurable long semaphore wait timeout

Percona

  • SET STATEMENT, enforce_storage_engine

Oracle

  • UNION ALL optimization, default_tmp_storage_engine

MariaDB 10.0 Log of Contributions

Per thread memory counting and usage

  • Base code and idea by Lixun Peng, Taobao

  • License: BSD

Multi-source replication

  • Base code by Lixun Peng, Taobao

  • License: BSD

GET_LOCK

  • Code by Konstantin "Kostja" Osipov, mail.ru

  • License: BSD

CONNECT storage engine

  • Code by Olivier Bertrand

  • License: GPL

Spider storage enginemetadata_lock_info Information schema

  • Code by Kentoku Shiba, Spiral Arms

  • License: GPL

Roles

  • Code by Vicentiu Ciorbaru, Google Summer of Code 2013

  • License: BSD

PCRE Regular Expressions

  • Code by Sudheera Palihakkara, Google Summer of Code 2013

  • License: BSD

Global Transaction IDs

  • Some patches by Pavel Ivanov, Google

The MariaDB Foundation website provides a more detailed list of contributors by release, starting from MariaDB 10.0.35

  • MariaDB Galera Cluster 10.0.38 contributors

  • MariaDB 10.0.38 contributors

  • MariaDB Galera Cluster 10.0.37 contributors

  • MariaDB 10.0.37 contributors

  • MariaDB Galera Cluster 10.0.36 contributors

  • MariaDB 10.0.36 contributors

  • MariaDB Galera Cluster 10.0.35 contributors

  • MariaDB 10.0.35 contributors

MariaDB 5.5 Log of Contributions

Function last_value(), which returns the last value but evaluates all arguments as a side effect.

  • Original patch by Eric Herman, Booking.com

  • License: BSD

nowatch option for mysqld_safe (allow systemd)

  • Based on code from Maarten Vanraes

  • License: BSD

Security fixes, patches

  • Work by Honza Horak, Red Hat

Coverity scans

  • Work by Christian Convey

The MariaDB Foundation website provides a more detailed list of contributors by release, starting from MariaDB 5.5.60

  • MariaDB 5.5.68 contributors

  • MariaDB 5.5.67 contributors

  • MariaDB 5.5.66 contributors

  • MariaDB 5.5.65 contributors

  • MariaDB 5.5.64 contributors

  • MariaDB Galera Cluster 5.5.63 contributors

  • MariaDB 5.5.63 contributors

  • MariaDB Galera Cluster 5.5.62 contributors

  • MariaDB 5.5.62 contributors

  • MariaDB Galera Cluster 5.5.61 contributors

  • MariaDB 5.5.61 contributors

  • MariaDB Galera Cluster 5.5.60 contributors

  • MariaDB 5.5.60 contributors

MariaDB 5.2 Log of Contributions

Virtual Columns

  • MySQL_virtual_columns_ref_manual

  • mysql-6.0-wl1075-wl411

  • Andrey Zhakov (modified by Sanja and Igor)

  • Author has signed MCA

Declaring many CHARSET objects as const.

  • Antony T Curtis (LinkedIn)

  • License: BSD

Extended user statistics

  • Authors: People at Google, Facebook and Percona. This code owes a special thanks to Mark Callaghan!

  • License: BSD

Segmented MyISAM Key Cache

  • Fredrik Nylander from Stardoll.com

  • License: MCA

The OQGRAPH storage engine

  • Created by Arjen Lentz, Open Query

  • License GPL

The Sphinx storage engine

  • Created by Andrew Aksyonoff.

  • License: GPL

Pluggable Authentication

  • RJ Silk

  • License: MCA

Various bug fixes

  • Stewart Smith, Percona

MariaDB 5.1 Log of Contributions (Outside of Monty Program Ab)

Microsecond precision in process list

  • microsec_process.patch

  • Percona Inc

  • Patch was licensed to Monty Program under BSD (new).

Slow Query Log Extended Statistics

  • microslow.patch

  • Percona Inc

  • Patch was licensed to Monty Program under BSD (new).

The PBXT storage engine

  • Created by Paul McCullagh

  • License: GPL

The FederatedX storage engine

  • All changes are made by Patrick Galbraith and Antony Curtis and are given to us under BSD-new.

  • In addition we are allowed to promote FederatedX.

Windows enhancements and various bug fixes

  • Alex Budovski, under MCA

Creation of MariaDB packages

  • Arjen Lentz, Open Query

Various bug fixes

  • Stewart Smith, Percona

Sponsored Features

Google has sponsored:

  • Parallel Replication

  • Google tests GTID, parallel replication and lots more on the mailing list

Facebook has sponsored many features, including:

  • LIMIT ROWS EXAMINED

  • The non-blocking client library

  • Facebook employees do frequent the mailing list

See Also

  • SHOW CONTRIBUTORS lists all members and sponsors of the MariaDB Foundation and other sponsors.

  • SHOW AUTHORS lists the authors of MariaDB (including documentation, QA etc).

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code

MariaDB has participated in Google Summer of Code since 2013. This section contains pages providing information for each year.

We believe we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, ODBC, Java, Node.js) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. Lately, we also have MariaDB ColumnStore, a columnar storage engine designed to process petabytes of data with real-time response to analytical queries.

Google Summer of Code 2025

This year we are again participating in the Google Summer of Code. We, joined with the MariaDB Foundation, believe we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, C++, ODBC, Java, Node.js) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip to mingle with the community. You should also subscribe to the developers mailing list (this is the main list where we discuss development - there are also other mailing lists).

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

Also see the List of beginner friendly issues from the MariaDB Issue Tracker.

List of Tasks

MariaDB Server

MDEV-28395 LOAD DATA plugins

Full-time project 350h

LOAD DATA INFILE can flexibly load data into a table from CSV-like files accessible by the mariadbd process. LOAD XML INFILE can do it for XML files. LOAD DATA LOCAL INFILE and LOAD XML LOCAL INFILE can do it with files accessible by the client, but not by the server. But there are requests to support loading more file formats and from other locations, for example, from S3.

This project is to implement support for LOAD plugins and refactor the current LOAD code accordingly. There are two kinds of plugins: data parser plugins (CSV-like and XML) and transfer plugins (file and LOCAL). Implementing new plugins is not in the scope of this task; it is mainly about moving existing code around, creating the possibility for new plugins (like JSON or S3).

Skills needed: C++, bison. Mentors: Sergei Golubchik

MDEV-36100 Generate vector embeddings automatically on INSERT

Full-time project 350h

Implement a syntax and a plugin API that the server will use to generate embeddings for documents that the user stores in the database. This should significantly simplify the vector pipeline. mariadbd will not generate embeddings internally; it will invoke a plugin to do that.

Skills needed: C++. Mentors: Sergei Golubchik

MDEV-36107 expressions in mysqltest

Part-time project 175h

Extend the mysqltest language to support:

  • standard arithmetic +, -, *, /, %

  • comparisons ==, !=, <, <=, >, >=

  • boolean &&, ||, maybe ? :

  • if possible: string repetition, Perl-style x (to replace SELECT REPEAT() in test files)

This should work in the if and while commands.

Can be done together with MDEV-36108 as a full-time project.

Skills needed: C++. Mentors: Sergei Golubchik

MDEV-36108 variable substitutions in mysqltest

Part-time project 175h

Extend the mysqltest language to support bash-like substitutions:

  • ${var}

  • ${parameter:offset:length}

  • ${#parameter}

  • ${parameter/pattern/string/flags}

  • maybe ${parameter^}, ${parameter^^}, ${parameter,}, ${parameter,,}

  • maybe ${parameter@function} with functions like u, U, Q, etc.

Recursive expansion:

  • ${${var}}

Can be done together with MDEV-36107 as a full-time project.

Skills needed: C++. Mentors: Sergei Golubchik

MDEV-18827 Create utility to parse frm files and print their DDL

Full-time project - potential part-time (175 - 350h, depending on scope)

FRM files are what MariaDB uses to store metadata about tables. These files can be used to generate DDL statements (CREATE TABLE ...). We are lacking a utility to parse these files, which could in turn make DBAs' lives easier. The task of this project is to implement this utility, making use of MariaDB's FRM parsing logic. You may have to carry out some refactoring to extract the parsing code into a reusable library, used once by MariaDB Server and once by the FRM parsing tool.

Skills needed: C/C++, understanding libraries and APIs. Mentors: Vicențiu Ciorbaru / Sergei Golubchik

MDEV-9345 Replication to enable filtering on master

Part-time project 175h

The current methods of filtering replication events are limited to either 1) at binlog-write time, which can break point-in-time recovery because some committed transactions will be missing from the binary log, or 2) on the replica, which forces all events on the primary server to always be sent to the replica, which can be a security concern and is also not efficient. This task aims to eliminate these limitations by adding in another point at which replication filtering occurs: on the binlog dump threads. This would allow users to both maintain a consistent binary log, and minimize network traffic by guarding events which are never intended for replication.

Skills needed: C++. Mentors: Brandon Nesterenko

Buildbot build statistics dashboard

Part-time project 175h. TODO: a more detailed description will be created.

Skills needed: TBD. Mentors: Vlad Radu

MCOL-4889 Manual extent vacuuming

Full-time project 350h

Here an extent is a group of columnar values, and a partition is a group of extents that stores all column values for a specific portion of a table. MCS has a notion of an empty value for columnar segment/token files and dictionaries. Empty values are marked with a bit in a special 1-byte auxiliary column that is created for every table. When DELETE removes records from a table, the records are marked with the empty bit in the auxiliary column. The deleted records become wasted disk space. The goal of the project is to reclaim the wasted disk space, either by re-creating the whole partition or by moving partition values.
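As a toy model of the idea, reclaiming the space means rewriting a partition while skipping rows whose empty bit is set in the auxiliary column. The list-of-values layout and the vacuum function below are illustrative only, not MCS's on-disk format:

```python
EMPTY_BIT = 0x01  # illustrative: bit marking a deleted/empty record

def vacuum_partition(values, aux):
    """Rewrite a partition keeping only live rows; returns the new
    column values and a fresh auxiliary column with no empty bits set."""
    live = [v for v, flags in zip(values, aux) if not (flags & EMPTY_BIT)]
    return live, bytearray(len(live))

values = [10, 20, 30, 40]
aux = bytearray([0, EMPTY_BIT, 0, EMPTY_BIT])  # rows 20 and 40 were DELETEd
new_values, new_aux = vacuum_partition(values, aux)
print(new_values)  # [10, 30]
```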

Skills needed: modern C++. Mentors: Roman Nozdrin

MCOL-5142 Support for recursive CTE

Full-time project 350h

MariaDB ColumnStore lacks recursive CTE handling, so as of now ColumnStore hands the processing back to MariaDB Server if a query contains a recursive CTE.

Here is the info about the feature: recursive-common-table-expressions-overview

Skills needed: modern C++. Mentors: Leonid Fedorov

MCOL-5598 Support for EXCEPT and INTERSECT SQL expressions

Full-time project 350h

MariaDB ColumnStore lacks EXCEPT and INTERSECT handling, so as of now ColumnStore hands the processing back to MariaDB Server if a query contains EXCEPT or INTERSECT.

Here is the info about the feature: exceptintersect

Skills needed: modern C++. Mentors: Alexey Antipovsky

MCOL-XXX Bloom-filters for data scanning

Full-time project 350h

MariaDB ColumnStore lacks indexes, so it reads a lot of extra data from disk. This project introduces Bloom filters to reduce the data read from disk during the most I/O-heavy operation, scanning.

Skills needed: modern C++. Mentors: Roman Nozdrin

MCOL-5758 Reduce the computations in JOINS by simpler Bloom-filter-based pre-joins

Full-time project 350h

Joins are very heavy algorithms, both in computation and in memory use. They need to hold a substantial amount of data in memory and perform hashing and other operations on that data. Joins can overflow memory limits, and keeping the balance between memory use and performance is tricky. Thus we have to filter the information that goes into joins as much as possible. ColumnStore already does great work in that regard, pushing WHERE filters before joins. This particular task is also concerned with that: adding Bloom filter operations that approximate JOIN results and perform a secondary read so that the data fed into a join is highly likely to be used in it.
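The pre-join idea can be sketched generically in Python: build a small Bloom filter over the build side's join keys, then use it to discard probe-side rows that cannot possibly match before running the real join. This illustrates the technique only, not ColumnStore's implementation:

```python
import hashlib

class BloomFilter:
    def __init__(self, size_bits=1 << 16, num_hashes=3):
        self.size, self.num_hashes = size_bits, num_hashes
        self.bits = bytearray(size_bits // 8)

    def _positions(self, key):
        # derive num_hashes bit positions from a keyed hash
        for i in range(self.num_hashes):
            digest = hashlib.blake2b(repr((i, key)).encode(), digest_size=8).digest()
            yield int.from_bytes(digest, "big") % self.size

    def add(self, key):
        for p in self._positions(key):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, key):
        # no false negatives, occasional false positives
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(key))

# Pre-join: filter the probe side against the build side's keys.
build_keys = [2, 4, 6, 8]
bf = BloomFilter()
for k in build_keys:
    bf.add(k)
probe_rows = [(k, "row-%d" % k) for k in range(100)]
prefiltered = [row for row in probe_rows if bf.might_contain(row[0])]
```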

Skills needed: modern C++. Mentors: Sergey Zefirov

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know in the comments below (Click 'Login' on the top of the page first)!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2024

In 2024, MariaDB again participated in the Google Summer of Code. We believe we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, C++, ODBC, Java, Node.js) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip to mingle with the community. You should also subscribe to the developers mailing list (this is the main list where we discuss development - there are also other mailing lists).

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

Also see the List of beginner friendly issues from the MariaDB Issue Tracker.

List of Tasks

MariaDB Server

Implement IVFFlat indexing strategy for MariaDB Vector and evaluate performance

Part-time (175h) or full-time project (350h) - depending on scope

MariaDB Vector is coming to MariaDB Server to serve AI workloads. The current indexing strategy will use HNSW, but IVFFlat is a possible alternative that costs fewer resources to create. Having it as an option is desirable.
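For orientation, here is the IVFFlat idea in a toy Python sketch: k-means partitions the vectors into coarse cells, and a query probes only the closest cells, scanning them exactly ("flat"). The Euclidean distance and helper names are assumptions of this illustration, not MariaDB code:

```python
import math, random

def dist(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest(v, centroids):
    return min(range(len(centroids)), key=lambda c: dist(v, centroids[c]))

def ivfflat_index(vectors, nlists=4, iters=10, seed=0):
    """Cluster vectors into nlists cells (plain k-means) and store each
    vector in its nearest cell's inverted list."""
    centroids = random.Random(seed).sample(vectors, nlists)
    for step in range(iters + 1):
        lists = [[] for _ in range(nlists)]
        for v in vectors:
            lists[nearest(v, centroids)].append(v)
        if step < iters:  # final pass only assigns against the final centroids
            centroids = [tuple(sum(x) / len(l) for x in zip(*l)) if l else centroids[i]
                         for i, l in enumerate(lists)]
    return centroids, lists

def ivfflat_search(query, centroids, lists, nprobe=1):
    """Scan only the nprobe cells whose centroids are closest to the query."""
    order = sorted(range(len(centroids)), key=lambda c: dist(query, centroids[c]))
    candidates = [v for c in order[:nprobe] for v in lists[c]]
    return min(candidates, key=lambda v: dist(query, v))
```

Raising nprobe trades speed for recall; with nprobe equal to nlists the search degenerates to an exact scan.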

MDEV-17398 Spatial (GIS) functions in MariaDB

Part-time (175h) or full-time project (350h) - depending on scope

Our GIS functionality is limited compared to other DBMSes. Given that MariaDB looks to facilitate migration from MySQL, we should be on par. We have a list of functions that are missing in MariaDB compared to MySQL, as described in function-differences-between-mariadb-10-10-and-mysql-8-0. Our goal is to have as many of these functions as possible available within MariaDB. Some of the functionality can be ported from MySQL, while other parts might require implementation from scratch.

Skills needed: Understanding of C++ development. Ability to navigate a large codebase (with help from mentor). Mentors: Anel Husakovic (primary) / Vicențiu Ciorbaru (secondary)

MDEV-16482 MariaDB Oracle mode misses Synonyms

Full-time project 350h

Synonyms are an important feature, particularly as it helps smooth migration from other databases. While the initial project scope seems straightforward, there are a number of aspects that must be considered:

  1. Grammar extension

  2. Where will the synonyms definitions be stored?

  3. How do synonyms map to the underlying privilege system? Who can create a synonym? Who can access a synonym?

  4. Do we require the underlying object to exist before creating a synonym? What if the underlying object gets dropped?

  5. What kind of error messages do we present to the user in various corner cases?

  6. How do synonyms interact with replication (row based vs statement based)

  7. How do synonyms interact with views (and views execution)

  8. How to present synonyms to users (as part of INFORMATION_SCHEMA for instance?)

  9. Performance considerations for multiple connections to the database.

Skills needed: Understanding of C++ development. Able to write and discuss various tradeoffs such that we achieve a feature set that makes sense given the database's priorities. Mentors: Vicențiu Ciorbaru (primary) / Michael Widenius (secondary)

MDEV-30645 CREATE TRIGGER FOR { STARTUP | SHUTDOWN }

Full-time project 350h

Support generalized triggers like

CREATE TRIGGER ... AFTER STARTUP ...
CREATE TRIGGER ... BEFORE SHUTDOWN ...
CREATE TRIGGER ... ON SCHEDULE ...

the latter being a synonym for CREATE EVENT.

  • Should STARTUP/SHUTDOWN triggers run exclusively? That is, is a STARTUP trigger run before any connection is allowed, or in parallel with connections? Same for SHUTDOWN.

Skills needed: Understanding of C++ development. Able to write and discuss various tradeoffs such that we achieve a feature set that makes sense given the database's priorities. Mentors: Sergei Golubchik

MDEV-21978 make my_vsnprintf to use gcc-compatible format extensions

Part-time project 175h

my_vsnprintf() is used internally in the server as a portable printf replacement. And it's also exported to plugins as a service.

It supports a subset of printf formats and a few extensions:

  • %`s means that a string should be quoted like an `identifier`

  • %b means that it's a binary string, not zero-terminated; printing won't stop at \0, so one should always specify the field width (like %.100b)

  • %M is used in error messages and prints the integer (errno) and the corresponding strerror() for it

  • %T takes a string and prints it like %s, but if the string has to be truncated it puts "..." at the end

gcc knows printf formats, checks whether actual arguments match the format string, and issues a warning if they don't. Unfortunately there seems to be no easy way to teach gcc our extensions, so for now we have to disable printf format checks.

A better approach would be to use gcc-compatible format extensions, like the Linux kernel does. We should migrate to a different syntax for our extensions:

  • %sI to mean "print as an identifier"

  • %sB to mean "print a binary string"

  • %uE to mean "print an errno"

  • %sT to put a "..." as truncation indicator

Old formats can still be supported, or they can be removed; in the latter case the major version of the service should be increased to signal an incompatible change.

All error messages and all usages of my_vsnprintf should be changed to use the new syntax. One way to do it is to conditionally disable the old syntax, only in debug builds. All gcc printf format checks should be enabled.
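To make the extension semantics concrete, here is a toy Python emulation of what the proposed gcc-compatible specifiers would print. The helper below is hypothetical and for illustration only; it is not the C implementation of my_vsnprintf:

```python
import os

def fmt_ext(spec, value, width=None):
    """Toy emulation of the proposed extensions: sI, sB, uE, sT."""
    if spec == "sI":  # print as a quoted identifier
        return "`%s`" % str(value).replace("`", "``")
    if spec == "sB":  # binary string: honor an explicit width, don't stop at \0
        return value[:width].decode("latin-1")
    if spec == "uE":  # errno plus the corresponding strerror() text
        return '%d "%s"' % (value, os.strerror(value))
    if spec == "sT":  # truncate with a "..." indicator
        s = str(value)
        return s if width is None or len(s) <= width else s[: width - 3] + "..."
    raise ValueError("unknown extension: %r" % spec)

print(fmt_ext("sI", "my`table"))         # `my``table`
print(fmt_ext("sT", "long message", 7))  # long...
```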

Skills needed: Understanding of C development. Mentors: Sergei Golubchik

MCOL-5688 Parallel CSV read leveraging Apache Arrow

Full-time project 350h

cpimport is a binary that ingests data into MCS efficiently, reducing ingest times significantly while preserving transaction isolation levels.

cpimport is a relatively complex facility that reads data from a local file or S3, parses it, converts it, and puts it into MCS-specific files. cpimport is unable to read a single big CSV file from disk in parallel. Apache Arrow has a CSV read facility that can do parallel CSV reads. The goal of the project is to replace the existing homebrew CSV parser implemented in cpimport with the one from Apache Arrow.

Skills needed: modern C++. Mentors: Leonid Fedorov

MCOL-4889 Manual vacuum cleaning for on-disk data empty records

Full-time project 350h

Here an extent is a group of columnar values, and a partition is a group of extents that stores all column values for a specific portion of a table. MCS has a notion of an empty value for columnar segment/token files and dictionaries. Empty values are marked with a bit in a special 1-byte auxiliary column that is created for every table. When DELETE removes records from a table, the records are marked with the empty bit in the auxiliary column. The deleted records become wasted disk space. The goal of the project is to reclaim the wasted disk space, either by re-creating the whole partition or by moving partition values.

Skills needed: modern C++. Mentors: Roman Nozdrin

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know in the comments below (Click 'Login' on the top of the page first)!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2023

In 2023, MariaDB participated in the Google Summer of Code. The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, C++, ODBC, Java, Node.js) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip to mingle with the community. You should also subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

Also see the List of beginner friendly issues from the MariaDB Issue Tracker.

List of Tasks

MariaDB Server

MDEV-30610 Update RocksDB to the latest upstream version

Full-time project 350h

Our version of RocksDB is lagging behind the current upstream and needs to be updated. This isn't a case of simply updating the submodule; there have been significant API changes. It will likely require porting the latest MyRocks code over to the MariaDB storage API.

Skills needed: Understanding of C/C++ development. Preferably some experience with the MariaDB or MySQL codebase (but not essential). Mentor: Andrew Hutchings

MDEV-17398 Spatial (GIS) functions in MariaDB

Part-time (175h) or full-time project (350h) - depending on scope

Our GIS functionality is limited compared to other DBMSes. Given that MariaDB looks to facilitate migration from MySQL, we should be on par. We have a list of functions that are missing in MariaDB compared to MySQL, as described in function-differences-between-mariadb-10-10-and-mysql-8-0. Our goal is to have as many of these functions as possible available within MariaDB. Some of the functionality can be ported from MySQL, while other parts might require implementation from scratch.

Skills needed: Understanding of C++ development. Ability to navigate a large codebase (with help from mentor). Mentors: Anel Husakovic (primary) / Vicențiu Ciorbaru (secondary)

MDEV-16482 MariaDB Oracle mode misses Synonyms

Full-time project 350h

Synonyms are an important feature, particularly as it helps smooth migration from other databases. While the initial project scope seems straightforward, there are a number of aspects that must be considered:

  1. Grammar extension

  2. Where will the synonyms definitions be stored?

  3. How do synonyms map to the underlying privilege system? Who can create a synonym? Who can access a synonym?

  4. Do we require the underlying object to exist before creating a synonym? What if the underlying object gets dropped?

  5. What kind of error messages do we present to the user in various corner cases?

  6. How do synonyms interact with replication (row-based vs statement-based)?

  7. How do synonyms interact with views (and view execution)?

  8. How do we present synonyms to users (as part of INFORMATION_SCHEMA, for instance)?

  9. Performance considerations for multiple connections to the database.

Skills needed: Understanding of C++ development. Able to write about and discuss various tradeoffs such that we achieve a feature set that makes sense given the database's priorities. Mentors: Vicențiu Ciorbaru (primary) / Michael Widenius (secondary)

GH-457 MariaDB ColumnStore in Docker Official OCI Image

Part-time project 175h / Full-time project 350h - depending on scope

MariaDB ships with ColumnStore as a storage engine. However, the architecture of ColumnStore is not like that of a traditional storage engine; instead, it relies on multiple database nodes working in unison. This means that starting up a ColumnStore-enabled MariaDB service is not a trivial endeavor. This project seeks to create the necessary tooling for starting MariaDB with ColumnStore inside OCI containers. You will be writing Dockerfiles and configuration files as well as bash scripts to achieve this.

The challenge of this project lies in:

  1. Limited documentation around ColumnStore. There will be some time spent on the discovery process.

  2. Formulating a clear plan to facilitate:

     • Starting MariaDB with ColumnStore

     • Upgrading MariaDB with ColumnStore on a version upgrade

     • Creating health checks to validate the health of the ColumnStore service nodes

     • Setting configuration variables via environment switches where appropriate

     • Declaratively (likely via a docker-compose YAML file) stating the system's architecture

     • Documenting the necessary steps to deployment

     • Producing a blog post on its operation

     • Optionally enabling deployment via Kubernetes

  3. Implementing the plan and creating a CI/CD pipeline for testing.

Skills needed: Ability to develop durable bash scripts, understanding of container runtimes, and ability to conform to container best practices. Able to incrementally develop and test functionality. Mentors: Daniel Black (primary - containers) / Andrew Hutchings (secondary - ColumnStore)

MDEV-6166 LOAD DATA INFILE - store all warnings

Part-time project 175h

The main focus of this project is the developer / sysadmin experience. We want to improve the quality of life of those using MariaDB. Migrating large datasets is one of these challenges. As described in the MDEV, a simple limitation related to LOAD DATA INFILE can severely hamper developer productivity. A related problem is discussed in MDEV-13046.

The goal of this project is to come up with a solution for storing warnings during LOAD DATA INFILE. This will require modifying the existing server codebase to create an SQL interface for processing the generated warnings.

Challenges:

  • LOAD DATA INFILE can process large datasets. That means the server must not simply store all warnings in memory. You will need to make use of already existing mechanisms (creating temporary tables) so warnings can spill to disk.
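As a toy illustration of the spill-to-disk idea (hypothetical Python names; the real implementation would reuse the server's internal temporary-table mechanism rather than a flat file):

```python
import tempfile

class WarningStore:
    """Keep the first `mem_limit` warnings in memory; spill the rest to disk."""

    def __init__(self, mem_limit=1000):
        self.mem_limit = mem_limit
        self.in_memory = []
        self.spill_file = None
        self.total = 0

    def add(self, warning):
        self.total += 1
        if len(self.in_memory) < self.mem_limit:
            self.in_memory.append(warning)
        else:
            if self.spill_file is None:
                # Anything beyond the in-memory cap goes to a temp file.
                self.spill_file = tempfile.TemporaryFile(mode="w+")
            self.spill_file.write(warning + "\n")

    def all_warnings(self):
        """Iterate all warnings: in-memory first, then the spilled ones."""
        yield from self.in_memory
        if self.spill_file is not None:
            self.spill_file.seek(0)
            for line in self.spill_file:
                yield line.rstrip("\n")

store = WarningStore(mem_limit=2)
for i in range(5):
    store.add(f"Row {i}: data truncated")
print(store.total, len(list(store.all_warnings())))  # prints "5 5"
```

The SQL interface mentioned in the task would then be a layer that exposes this iteration to the client.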

Skills needed: Understanding of C++ development. Mentors: Anel Husakovic (primary) / Daniel Black (secondary)

Buildbot (CI/CD)

MDBF-320 Better Grid view for buildbot.mariadb.org (Python / Javascript / Web Dev)

Full-time project 350h

Our CI/CD infrastructure uses a recent version of Buildbot. The grid view plugin that comes with Buildbot is not adequate for our needs. In this project, you will discuss with your mentor as well as other MariaDB developers how to best improve the user experience of Buildbot's grid view for what MariaDB developers need to accomplish.

Skills needed: Understanding of web-dev technologies like Angular, React, and Javascript-related libraries. Python may also be required. Mentor: Vlad Bogolin

MariaDB Columnstore

MariaDB ColumnStore is a columnar engine for MariaDB Server for OLAP workloads. MCS is also a distributed, multithreaded application written in C++; C++20 is currently used for development. There are a number of interesting MCS projects to be part of, in both research and production programming areas.

JIT compiled SQL expressions

Part-time project 175h / Full-time project 350h - depending on scope

MCS uses interpreted execution to calculate SQL expression results. Here is an example of a SQL expression: 'table1.col1 + FLOOR(table2.col1)'. Given that table1.col1 is DECIMAL and table2.col1 is DOUBLE, there are a number of conditions that drive the calculation of this relatively simple example at runtime. Given that the SQL types and the expression tree are known before the query begins, it is possible to replace interpretation with JIT compilation to produce specialized compiled bytecode that is:

  1. small

  2. has no or almost no branches

  3. optimized for the specific platform it runs on

This is a mostly-research project whose goal is to produce a set of microbenchmarks that:

  1. leverage any JIT compiler available, e.g. LLVM, MIR

  2. demonstrate the negative and positive effects of using JIT
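A language-neutral toy sketch of the interpretation-vs-compilation trade-off (plain Python closures stand in for real JIT-compiled code; none of this is MCS code):

```python
import math

# Expression tree for: col_a + FLOOR(col_b)
tree = ("add", ("col", "a"), ("floor", ("col", "b")))

def interpret(node, row):
    """Tree-walking interpreter: re-dispatches on node type for every row."""
    op = node[0]
    if op == "col":
        return row[node[1]]
    if op == "floor":
        return math.floor(interpret(node[1], row))
    if op == "add":
        return interpret(node[1], row) + interpret(node[2], row)
    raise ValueError(op)

def compile_expr(node):
    """'JIT' stand-in: build one closure per node before the query starts,
    so the per-row cost is plain calls with no type/branch dispatch."""
    op = node[0]
    if op == "col":
        name = node[1]
        return lambda row: row[name]
    if op == "floor":
        arg = compile_expr(node[1])
        return lambda row: math.floor(arg(row))
    if op == "add":
        left, right = compile_expr(node[1]), compile_expr(node[2])
        return lambda row: left(row) + right(row)
    raise ValueError(op)

row = {"a": 2, "b": 3.7}
fn = compile_expr(tree)  # compiled once, before execution
assert interpret(tree, row) == fn(row) == 5
```

A real microbenchmark would time both paths over many rows; with LLVM or MIR the compiled path becomes native code instead of closures.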

Skills needed:

  1. C++

  2. at least basic knowledge of compiler internals

Mentor: Roman Nozdrin

Parquet support in cpimport

Full-time project 350h

cpimport in MCS is a standalone tool that does bulk ingestion outside the SQL interface. It takes source data as input and puts it into an MCS cluster. This put is an atomic operation that supports rollback. The sources can be either local files or files on S3. The only format cpimport currently reads is CSV, with custom:

  1. delimiters

  2. quotation signs

  3. NULL symbol

The goal of this project is either to teach cpimport to support the Parquet format as an input format, or to introduce a modular framework for adding input formats. This project is mostly about reading/writing production code, where the challenges are:

  1. to learn a codebase

  2. produce a feature

  3. support the feature with unit and integration tests using existing frameworks

cpimport consists of:

  1. a buffer where parsed data lines go, in the form of a low-level representation of SQL datatypes

  2. a set of parser threads that populate the buffer

  3. a set of writer threads that take the values that make up a single SQL record and put them into the corresponding files

The parser threads currently have a fixed Delimiter-Separated Values parser that can be parametrized only with:

  1. escape character

  2. 'enclosed by' characters

The suggested approach is to replace this DSV parser with a modular one that understands how to read popular formats, e.g. Parquet, Arrow, Avro, JSON.
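A toy sketch of what such a modular framework could look like (hypothetical Python names for illustration; the real framework would be C++ inside cpimport):

```python
import csv
import io
import json

# Hypothetical registry: each input format implements parse(stream) -> rows.
PARSERS = {}

def register(name):
    def wrap(cls):
        PARSERS[name] = cls
        return cls
    return wrap

@register("dsv")
class DsvParser:
    """Delimiter-separated values, parametrized like today's cpimport parser."""
    def __init__(self, delimiter=",", quotechar='"'):
        self.delimiter, self.quotechar = delimiter, quotechar
    def parse(self, stream):
        yield from csv.reader(stream, delimiter=self.delimiter,
                              quotechar=self.quotechar)

@register("json")
class JsonLinesParser:
    """One JSON object per line; columns emitted in sorted-key order."""
    def parse(self, stream):
        for line in stream:
            obj = json.loads(line)
            yield [obj[k] for k in sorted(obj)]

def load(fmt, stream, **opts):
    """Look up the format by name and hand rows to the (omitted) writer stage."""
    return list(PARSERS[fmt](**opts).parse(stream))

rows = load("dsv", io.StringIO("1|x\n2|y\n"), delimiter="|")
```

A Parquet parser would be one more `@register("parquet")` class; the rest of the pipeline (buffer, writer threads) stays unchanged.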

Skills needed:

  1. C++

  2. production development tooling like git, CMake

Mentor: Gagan Goel

SIMD for SQL expressions

Part-time project 175h / Full-time project 350h - depending on scope

MCS uses scalar processing to calculate SQL expression results. The expressions can be in the projection or filtering part of a SQL query. Here is an example of a SQL expression: 'table1.col1 + FLOOR(table2.col1)'. In most cases, scalar processing can be replaced with vectorized execution, which reduces the number of cycles needed to render the result of an expression. The challenge of this project is that the in-memory representation can be both vertical and horizontal.

This is a mostly-research project whose goal is to produce a set of microbenchmarks that:

  1. unveil limitations or problems in applying vectorization to expressions

  2. compare performance for these cases:

     • vectorized execution with vertical data

     • vectorized execution with horizontal data

     • scalar execution with horizontal data
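A toy Python sketch of the two layouts (plain list operations stand in for SIMD; the point is only the data arrangement, and the names are illustrative):

```python
import math

# Horizontal (row-major) layout: one tuple per record.
rows = [(1.5, 2.9), (2.25, 3.1), (4.0, 0.7)]

# Vertical (columnar) layout: one contiguous array per column.
col1 = [r[0] for r in rows]
col2 = [r[1] for r in rows]

# Scalar execution over horizontal data: per-row dispatch.
horizontal = [a + math.floor(b) for a, b in rows]

# "Vectorized" execution over vertical data: whole-column operations.
# Contiguous columns are what make real SIMD loads/stores effective.
floored = list(map(math.floor, col2))
vertical = [a + b for a, b in zip(col1, floored)]

assert horizontal == vertical == [3.5, 5.25, 4.0]
```

The microbenchmarks in this task would measure the same expression over each layout with actual SIMD intrinsics rather than Python lists.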

Skills needed:

  1. C++

  2. ASM knowledge to manage the low-level part of this project

Mentor: Andrey Piskunov

Fuzzing infrastructure for Columnstore engine.

Full-time project 350h - depending on scope

Fuzzing is a well-known technique for finding various types of bugs. This task is to integrate libFuzzer, sanitizers (ASan, TSan, UBSan), and ColumnStore into one fuzzing pipeline and create a fuzzing infrastructure. This task requires:

  1. Adding support to ColumnStore for building with sanitizers (ASan, TSan, UBSan).

  2. Writing code which integrates the C++ MariaDB Connectors and libFuzzer.

  3. Preparing a valid corpus of SQL scripts suitable for ColumnStore.

  4. Creating the fuzzing infrastructure.

Skills needed:

  1. Basic knowledge of how to work with C++ build tools (CMake, clang, ld, rtld).

  2. Basic C++.

Mentor: Denis Khalikov

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2022

In 2022, we again participated in the Google Summer of Code. The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, C++, ODBC, Java, Node.js) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip to mingle with the community. You should also subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

Also see the List of beginner friendly issues from the MariaDB Issue Tracker.

List of Tasks

MariaDB Server

  • MDEV-21978 Make my_vsnprintf use gcc-compatible format extensions (Part-time project - 175h) MariaDB has its own implementation of most standard C library functions. This is to ensure compatibility across different platforms. Over time this library has evolved and currently does not behave exactly like the POSIX standard library. Thus we want to attain the principle of "least surprise" with this library: everything that is supported by the standard printf functions should work the same with MariaDB's compatibility library extension.

    • Skills needed: C/C++. Project difficulty: easy. Mentor: Sergei Golubchik

  • MDEV-19160 JSON_DETAILED output unnecessarily verbose (Part-time project - 175h) As explained in detail in the MDEV, we want to improve the JSON_DETAILED function to better suit our development and debugging purposes. This project will aim to clean up the function's implementation, introduce test cases, and add potential "nice-to-have" features to make developers' lives easier.

    • Skills needed: C/C++, understanding of JSON, OOP. Project difficulty: easy. Mentors: Vicențiu Ciorbaru / Sergei Petrunia

  • MDEV-18827 Create utility to parse frm files and print their DDL (Full-time project - potential part-time (175 - 350h, depending on scope)) FRM files are what MariaDB uses to store metadata about tables. These files can be used to generate DDL statements (CREATE TABLE ...). We are lacking a utility to parse these files, which could in turn make DBAs' lives easier. The task of this project is to implement this utility, making use of MariaDB's FRM parsing logic. You may have to carry out some refactoring to extract the parsing code into a reusable library, used once by MariaDB Server and once by the FRM parsing tool.

    • Skills needed: C/C++, understanding of libraries and APIs. Project difficulty: medium to hard, depending on time allocated. Mentors: Vicențiu Ciorbaru / Sergei Golubchik / Monty Widenius

  • MDEV-17467 Add linear regression functions (Full-time project - 350h) This project consists of implementing dedicated regression functions within MariaDB. The specification of each function will be decided during the project, based on what other competing databases are offering. We will choose an implementation that best matches user expectations. It is the student's job to perform research into at least one other database and come up with exact semantics for each one of the functions in the MDEV.

    • Skills needed: C/C++, understanding of regression functions and mathematics, APIs, OOP. Project difficulty: medium. Mentor: Vicențiu Ciorbaru

  • MDEV-23251 Client-compatible delimiter for mysqltest (Full-time project - potential part-time (175 - 350h, depending on scope)) We have a DELIMITER command that has a different syntax in the client and in mysqltest: mysqltest needs an additional (previous) delimiter at the end of the DELIMITER expression, which is confusing and makes it hard to copy and paste simple scripts with stored procedures from one to the other. We would like to have a new command (--delimiter=client|mysqltest) with the current behavior as the default.

    • Expected outcomes: You will learn the ins and outs of C command-line tool development. You'll get familiar with a part of the MariaDB server infrastructure -- the testing framework, which is written in C and Perl.

    • Skills required: good C knowledge; ability to use a console terminal, ability to build from the console. Project difficulty: easy. Mentor: Nikita Malyavin

  • MDEV-25774 Improve build speed of the server code base (Full-time project - 350h) We have already learned that precompiling the headers improves the build speed five times; however, the standard CMake solution doesn't fit our development comfort: CMake PCH generates one "header of headers" and pre-includes it into each of the compilation units. This makes everything that wasn't included by the unit itself available across the precompiled set. There are alternative ways of precompiling units: clang modules and gcc .gch files. We want to prefer these on a per-compiler basis.

    • Expected outcomes: You will make a strong practical impact of a high need and show off your mix of analysis and programming skills.

    • Skills required: good C knowledge, some CMake knowledge; ability to use a console terminal, ability to build from the console. Project difficulty: medium. Mentor: Nikita Malyavin

  • MDEV-12130 Improve the mysqltest language (Full-time project - 350h) mysqltest has a lot of historical problems: an ad hoc parser, weird limitations, commands added as needed with no view of the total language structure, etc. The purpose of this work would be the improvement of the language.

    • Expected outcomes: Rewrite the mysqltest interpreter using either a real parser generator, e.g. bison, or a cleanly hand-written parser, e.g. recursive descent, that can be easily extended with new functionality. Add missing control structures, for example an "else" for the existing "if". Add simple expression evaluation without contacting the server, i.e. math and string comparisons. Add functionality for minimal string manipulation, e.g. a substr function.

    • Skills required: good C/C++ knowledge, interest in parsers/interpreters. Project difficulty: medium. Mentor: Vladislav Vaintroub

  • MDEV-26182 Create a function to check for JSON intersection (Part-time project - 175h) This project aims at implementing a JSON_INTERSECT() function between two JSON objects or two JSON arrays. If the two documents are JSON arrays, we want to return all the common elements between the two arrays. In the case of objects, we want to return all common key/value pairs.

    • Skills required: C/C++, OOP, basic understanding of JSON. Project difficulty: easy. Mentors: Rucha Deodhar / Oleksandr Byelkin
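A Python sketch of the intended semantics (a model only; the real function would be implemented in C++ inside the server):

```python
import json

def json_intersect(doc1, doc2):
    """Model of JSON_INTERSECT for two arrays or two objects."""
    a, b = json.loads(doc1), json.loads(doc2)
    if isinstance(a, list) and isinstance(b, list):
        # Common elements between the two arrays (order from the first array).
        return json.dumps([x for x in a if x in b])
    if isinstance(a, dict) and isinstance(b, dict):
        # Common key/value pairs.
        return json.dumps({k: v for k, v in a.items() if b.get(k) == v})
    return None  # mixed or scalar inputs: no intersection modeled here

print(json_intersect('[1, 2, 3]', '[2, 3, 4]'))        # prints [2, 3]
print(json_intersect('{"a": 1, "b": 2}', '{"b": 2}'))  # prints {"b": 2}
```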

ColumnStore

  • MCOL-4995 Research/implement basic vectorized filtering for ARM platforms (Full-time project - 350h) As of 6.2.2, ColumnStore (MCS) supports vectorization on x86_64 platforms only. The goal of the project is to implement vectorized low-level filtering for ARM platforms using the 128-bit ARM NEON extension (SVE is optional). Low-level filtering in this context means the simple predicate WHERE filters, e.g. WHERE c1 = 5 AND c2 IN (10, 25). Please see the corresponding Jira issue for details.

    • Skills needed: C/C++, understanding of low-level platform specifics. Project difficulty: medium. Mentor: Roman Nozdrin

  • MCOL-4994 Build/run ColumnStore on macOS (Part-time project - 175h) As of ColumnStore (MCS) 6.2.2 there is no way to compile/use the MCS engine on macOS. The goal of this project is to be able to bootstrap MariaDB + basic (maybe rudimentary) MCS on macOS. There are a number of known issues that prevent MCS compilation on macOS: a number of offending preprocessor macros/definitions specific to the Linux x86_64 combination, and macOS doesn't provide syslog, which MCS uses as its only log message sink. Please see the corresponding Jira issue for details.

    • Skills needed: C/C++, macOS specifics. Project difficulty: easy. Mentor: Roman Nozdrin

  • MCOL-785 Implement DISTRIBUTED JSON functions (Full-time project - 350h) As of ColumnStore 6.2.2 there are two query execution modes: the relatively slow but compatible Table mode and the fast Select Handler (SH) mode. Table mode execution supports all JSON_* functions, while SH mode doesn't support any. We want to add support for the JSON_* function family in the SH query execution mode. Please see the corresponding Jira issue for details.

    • Skills needed: C/C++, JSON format. Project difficulty: easy. Mentor: Roman Nozdrin

Buildbot (CI/CD)

  • MDBF-320 Better Grid view for buildbot.mariadb.org (Python / Javascript / Web Dev) (Full-time project - 350h) Our CI/CD infrastructure uses a recent version of Buildbot. The grid view plugin that comes with Buildbot is not adequate for our needs. In this project, you will discuss with your mentor as well as other MariaDB developers how to best improve the user experience of Buildbot's grid view for what MariaDB developers need to accomplish.

    • Skills needed: Understanding of web-dev technologies like Angular, React, and Javascript-related libraries. Python may also be required. Mentors: Vlad Bogolin / Andreia Hendea

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2021

In 2021, we again participated in the Google Summer of Code. The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, ODBC, Java, Node.js) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip to mingle with the community. You should also subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

Also see the List of beginner friendly issues and issues labelled gsoc21 from the MariaDB Issue Tracker.

List of Tasks

Support for GTID in mysqlbinlog

The mysqlbinlog client program needs to be updated to support GTID. Here is a suggested list of things to be done:

  • The --start-position and --stop-position options should be able to take GTID positions; or maybe there should be new --start-gtid and --stop-gtid options. Like --start-gtid=0-1-100,1-2-200,2-1-1000.

  • A GTID position means the point just after that GTID. So starting from GTID 0-1-100 and stopping at GTID 0-1-200, the first GTID output will probably be 0-1-101 and the last one 0-1-200. Note that if some domain is not specified in the position, it means to start from the beginning, or respectively stop immediately, in that domain.

  • Starting and stopping by GTID should work both with local files and with --read-from-remote-server. For the latter, there are a couple of extra things that need doing in the master-slave protocol; see get_master_version_and_clock() in sql/slave.cc.

  • At the end of the dump, put these statements, to reduce the risk of those session variables incorrectly spilling into subsequent statements run in the same session:

SET session.server_id = @@global.server_id,
       session.gtid_domain_id=@@global.gtid_domain_id;

Probably some more things will come up during the work, but this looks like a reasonable start.
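The start/stop semantics above can be modeled in a few lines of Python (a sketch of the intended behavior, not mysqlbinlog code; all names are illustrative):

```python
def parse_gtid_pos(s):
    """Parse '0-1-100,1-2-200' into {domain_id: (server_id, seq_no)}."""
    pos = {}
    for part in s.split(","):
        domain, server, seq = (int(x) for x in part.split("-"))
        pos[domain] = (server, seq)
    return pos

def include_event(gtid, start, stop):
    """Decide whether an event with this GTID falls inside the dump range.

    A position means the point just AFTER that GTID, so the start GTID
    itself is excluded and the stop GTID is included. A domain missing
    from the start position starts from the beginning; a domain missing
    from the stop position stops immediately.
    """
    domain, _server, seq = gtid
    if domain in start and seq <= start[domain][1]:
        return False
    if domain not in stop or seq > stop[domain][1]:
        return False
    return True

start = parse_gtid_pos("0-1-100")
stop = parse_gtid_pos("0-1-200")
assert not include_event((0, 1, 100), start, stop)  # at the start position
assert include_event((0, 1, 101), start, stop)      # first GTID output
assert include_event((0, 1, 200), start, stop)      # last GTID output
assert not include_event((1, 2, 5), start, stop)    # domain absent from stop
```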

Details: MDEV-4989
Mentor: Brandon Nesterenko

Granted to PUBLIC

Implement the standard behavior for

GRANT xxx TO PUBLIC;
REVOKE xxx FROM PUBLIC;

Also, this statement is supposed to work:

SHOW GRANTS FOR PUBLIC;

And these should not

CREATE ROLE PUBLIC;
DROP ROLE PUBLIC;
SET ROLE PUBLIC;
GRANT PUBLIC TO xxx;
REVOKE PUBLIC FROM xxx;

Note that

SHOW GRANTS FOR xxx;

should not list roles and privileges granted to PUBLIC (unless granted to xxx too), but

SHOW GRANTS;

should, arguably, list them.

Details: MDEV-5215
Mentor: Oleksandr Byelkin

Control over memory allocated for SP/PS

SP/PS (Stored Procedures / Prepared Statements) allocate memory until the PS cache or the SP is destroyed. There is no way to see how much memory is allocated and whether it grows with each execution (the first two executions can lead to new memory allocation, but not more). Task minimum: status variables which count the memory used/allocated for SP/PS by thread and/or for the server. Other ideas:

  • Automatically stop allocation in the debug version after the second execution, and raise an exception on any further allocation attempt.

  • An Information Schema view, by thread and SP/PS, with information about allocated and used memory

Information can be collected in the MEM_ROOTs of the SP/PS. By storing info about the status of the mem_root before execution and then checking it afterwards, newly allocated memory can be found. MEM_ROOT can be changed to have a debug mode which makes it read-only, which can be switched on after the second execution.

Details: MDEV-14959
Mentor: Oleksandr Byelkin

Add JSON_NORMALIZE function to normalize JSON values

Background is this question on Stack Exchange: mariadb-compare-json. The task is to provide a function that can be used to compare two JSON documents for equality; the name could be, e.g., JSON_NORMALIZE. JSON_COMPACT already takes care of removing spaces, but this is not sufficient. Keys need to be (recursively) sorted, and if spaces are removed, then documents can be compared as binary strings.
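The proposed normalization can be sketched in Python (a model of the semantics, not server code):

```python
import json

def json_normalize(doc):
    """Recursively sort object keys, then dump without extra spaces,
    so two equal documents compare equal as binary strings."""
    def sort_keys(v):
        if isinstance(v, dict):
            return {k: sort_keys(v[k]) for k in sorted(v)}
        if isinstance(v, list):
            return [sort_keys(x) for x in v]
        return v
    return json.dumps(sort_keys(json.loads(doc)), separators=(",", ":"))

a = json_normalize('{"b": 1, "a": {"d": 2, "c": 3}}')
b = json_normalize('{ "a" : {"c": 3, "d": 2}, "b" : 1 }')
assert a == b == '{"a":{"c":3,"d":2},"b":1}'
```

Note that array order is preserved: `[1,2]` and `[2,1]` normalize to different strings, which matches JSON semantics.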

Details: MDEV-16375
Mentor: Vicențiu Ciorbaru

Add linear regression functions

The following linear regression functions exist in a number of other DBMSs, such as Oracle and PostgreSQL:

REGR_SLOPE
    REGR_INTERCEPT
    REGR_COUNT
    REGR_R2
    REGR_AVGX
    REGR_AVGY
    REGR_SXX
    REGR_SYY
    REGR_SXY

Some have also been added to Columnstore.

Details: MDEV-17467
Mentor: Nikita Malyavin

Create utility to parse frm files and print their DDL

It would be useful if MariaDB had a utility that was able to parse frm files and print the DDL associated with the table. For example, it would be useful for users who performed a partial backup with mariadb-backup (partial-backup-and-restore-with-mariadb-backup) but forgot to also back up the table DDL, so they can't restore the tables using the following process: innodb-file-per-table-tablespaces. mysqlfrm (mysqlfrm.py) is a tool that already exists and does similar things, but it doesn't seem very user-friendly. It needs to be able to contact the local MariaDB server, it also needs to be able to spawn a server instance, and it seems to need to create a bunch of files during this process, e.g.:

[ec2-user@ip-172-30-0-249 ~]$ cd /tmp
[ec2-user@ip-172-30-0-249 tmp]$ sudo mysqlfrm --server=root:@localhost:3306 /var/lib/mysql/db1/tab.frm --port=12345 --user=mysql
# Source on localhost: ... connected.
# Spawning server with --user=mysql.
# Starting the spawned server on port 12345 ... done.
# Reading .frm files
#
# Reading the tab.frm file.
#
# CREATE statement for /var/lib/mysql/db1/tab.frm:
#

CREATE TABLE `db1`.`tab` (
  `id` int(11) NOT NULL,
  `str` varchar(50) DEFAULT NULL,
  PRIMARY KEY (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1

#...done.
Details: MDEV-18827
Mentor: Vicențiu Ciorbaru

JSON_DETAILED output unnecessarily verbose

The JSON_DETAILED function (json_detailed) is described as:

Represents JSON in the most understandable way emphasizing nested structures.

We now have a use case for it: Optimizer Trace output. The optimizer trace is too large to be copied in full; instead, we use expressions like

SELECT 
JSON_DETAILED(JSON_EXTRACT(trace, '$**.analyzing_range_alternatives')) 
FROM INFORMATION_SCHEMA.OPTIMIZER_TRACE;

Our experience is that JSON_DETAILED has some room for improvement when it comes to the quality of automatic JSON formatting. Example:

source mdev19160-data.sql
mysql> select JSON_DETAILED(JSON_EXTRACT(a, '$**.analyzing_range_alternatives')) from t200\G
*************************** 1. row ***************************
JSON_DETAILED(JSON_EXTRACT(a, '$**.analyzing_range_alternatives')): [

    {
        "range_scan_alternatives": 
        [

            {
                "index": "a_b",
                "ranges": 
                [
                    "2 <= a <= 2 AND 4 <= b <= 4"
                ],
                "rowid_ordered": true,
                "using_mrr": false,
                "index_only": true,
                "rows": 1,
                "cost": 1.1752,
                "chosen": true
            }
        ],
        "analyzing_roworder_intersect": 
        {
            "cause": "too few roworder scans"
        },
        "analyzing_index_merge_union": 
        [
        ]
    }
]

Things to note:

  • empty lines at the start (right before/after the "range_scan_alternatives")

  • "analyzing_index_merge_union":[] occupies 3 lines where one would be sufficient.

  • the same goes for "ranges"

One can look at the JSON pretty-printer that is used by EXPLAIN FORMAT=JSON and the optimizer trace. It produces a better result (but it has room for improvement, too). Extra: in MySQL, the function is called JSON_PRETTY. We should add the ability to use this name as an alias.
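The desired formatting can be sketched in Python (a toy pretty-printer that keeps empty containers on one line; the real fix would be in the server's C++ formatting code):

```python
import json

def pretty(v, indent=0):
    """Pretty-print JSON, but keep empty arrays/objects on a single line."""
    pad = "  " * indent
    if isinstance(v, dict):
        if not v:
            return "{}"
        items = (f'{pad}  "{k}": {pretty(x, indent + 1)}' for k, x in v.items())
        return "{\n" + ",\n".join(items) + "\n" + pad + "}"
    if isinstance(v, list):
        if not v:
            return "[]"  # one line instead of three
        items = (f"{pad}  {pretty(x, indent + 1)}" for x in v)
        return "[\n" + ",\n".join(items) + "\n" + pad + "]"
    return json.dumps(v)

doc = {"ranges": ["2 <= a <= 2"], "analyzing_index_merge_union": []}
print(pretty(doc))
```

With this rule, `"analyzing_index_merge_union": []` from the example above collapses to a single line, and no blank lines are emitted.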

Details: MDEV-19160
Mentor: Vicențiu Ciorbaru

Histograms: use JSON as on-disk format

Currently, histograms are stored as an array of 1-byte bucket bounds (SINGLE_PREC_HB) or 2-byte bucket bounds (DOUBLE_PREC_HB). The table storing the histograms supports different histogram formats but limits them to 255 bytes (hist_size is a tinyint).

CREATE TABLE mysql.column_stats (
  min_value varbinary(255) DEFAULT NULL, 
  max_value varbinary(255) DEFAULT NULL, 
  ...
  hist_size TINYINT UNSIGNED, 
  hist_type ENUM('SINGLE_PREC_HB','DOUBLE_PREC_HB'), 
  histogram varbinary(255), 
  ...

This prevents us from supporting other kinds of histograms. The first low-hanging fruit would be to store the histogram bucket bounds precisely (like MySQL and PostgreSQL do, for example). The idea of this MDEV is to switch to JSON as storage format for histograms. If we do that, it will:

  • Improve the histogram precision

  • Allow the DBAs to examine the histograms

  • Enable other histogram types to be collected/used.

Milestone-1: Let histogram_type have another possible value, tentatively named "JSON". When that is set, let the ANALYZE TABLE syntax collect a JSON "histogram":

{ "hello":"world"}

that is, the following should work:

SET histogram_type='json';
ANALYZE TABLE t1 PERSISTENT FOR ALL;
SELECT histogram FROM mysql.column_stats WHERE TABLE_NAME='t1' ;

This should produce {"hello":"world"}.

Milestone-2: Produce JSON with histogram(). The exact format is not specified; for now, print the bucket endpoints and produce output like this:

[
  "value1",
  "value2",
  ...
]
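As a toy model of collecting such a histogram (Python; the equi-height endpoint scheme and the output format shown are assumptions, since the MDEV leaves the format unspecified):

```python
import json

def histogram_endpoints(values, buckets):
    """Equi-height histogram: bucket endpoints taken from the sorted values."""
    data = sorted(values)
    n = len(data)
    # Endpoint of bucket i is the value at roughly the (i+1)/buckets quantile.
    ends = [data[min(n - 1, ((i + 1) * n) // buckets - 1)] for i in range(buckets)]
    return json.dumps([str(e) for e in ends], indent=2)

print(histogram_endpoints(range(1, 101), 4))  # endpoints "25","50","75","100"
```

Storing endpoints as strings keeps the precise values (the point of the MDEV), unlike the current fraction-based 1- or 2-byte encodings.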

Milestone-2, part #2: Make mysql.column_stats.histogram a blob.

Milestone-3: Parse the JSON back into an array. Figure out how to use the JSON parser, and parse the JSON data produced in Milestone-2 back. For now, just print the parsed values to stderr. (Additional input was provided on Zulip regarding parsing valid/invalid JSON histograms.)

Milestone-4: Make the code support different kinds of histograms. Currently, there's only one type of histogram. A smaller issue: histogram lookup functions assume the histogram stores fractions, not values. A bigger issue: memory allocation for histograms is de-coupled from reading the histograms; see alloc_statistics_for_table, read_histograms_for_table. The histogram object lives in a data structure that is bzero'ed first and then filled later (IIRC there was a bug, since fixed, where the optimizer attempted to use a bzero'ed histogram). Can histograms be collected or loaded in parallel by several threads? This was an (unintentional?) possibility, but then it was disabled (see the TABLE_STATISTICS_CB object and its use).

Step #0: Make Histogram a real class. Here's the commit: 3ac32917ab6c42a5a0f9ed817dd8d3c7e20ce34d

Step 1: Separate classes for binary and JSON histograms. Need to introduce

class Histogram -- interface, no data members.
class Histogram_binary : public Histogram
class Histogram_json : public Histogram

and a factory function

Histogram *create_histogram(Histogram_type)

For now, let Histogram_json::point_selectivity() and Histogram_json::range_selectivity() return 0.1 and 0.5, respectively.

Step 2: Demonstrate saving/loading of histograms. The code already can:

  • collect a JSON histogram and save it.

  • when loading a histogram, figure out from the histogram_type column that a JSON histogram is being loaded, create a Histogram_json, and invoke the parse function. The parse function at the moment only prints to stderr; however, we should catch parse errors and make sure they are reported to the client. The test may look like this:

INSERT INTO mysql.column_stats VALUES('test','t1','column1', .... '[invalid, json, data']);
FLUSH TABLES;
# this should print some descriptive text
--error NNNN
SELECT * FROM test.t1;

Milestone-5: Parse the JSON data into a structure that allows lookups. The structure is

std::vector<std::string>

and it holds the data in KeyTupleFormat (see the comments for the reasoning; there was a suggestion to use in_vector, which is what IN subqueries use, but it didn't work out).

Milestone 5.1 (aka Milestone 44): Make a function to estimate selectivity using the data structure specified in the previous milestone.

Make range_selectivity() accept key_range parameters (currently, it accepts fractions, which are only suitable for binary histograms). This means Histogram_binary will need to have access to min_value and max_value to compute the fractions.

Details: MDEV-21130
Mentor: Sergei Petrunia

Make my_vsnprintf use gcc-compatible format extensions

my_vsnprintf() is used internally in the server as a portable printf replacement, and it's also exported to plugins as a service. It supports a subset of printf formats and three extensions:

  • %`s means that a string should be quoted like an identifier

  • %b means that it's a binary string, not zero-terminated; printing won't stop at \0, so one should always specify the field width (like %.100b)

  • %M is used in error messages and prints the integer (errno) and the corresponding strerror() for it

gcc knows the printf formats, checks whether the actual arguments match the format string, and issues a warning if they don't. Unfortunately there seems to be no easy way to teach gcc our extensions, so for now we have to disable printf format checks. A better approach would be to use gcc-compatible format extensions, as the Linux kernel does. We should migrate to a different syntax for our extensions:

  • %sI to mean "print as an identifier"

  • %sB to mean "print a binary string"

  • %uE to mean "print an errno"

  • %sT to put a "..." as truncation indicator

Old formats can still be supported, or they can be removed; in the latter case, the major version of the service should be increased to signal an incompatible change. All error messages and all usages of my_vsnprintf should be changed to use the new syntax, and gcc printf format checks should be enabled.
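The migration of format strings is mechanical enough to sketch (a toy Python rewriter; it assumes the identifier-quoting extension is spelled %`s, and the real change also involves the error-message files and the gcc format attributes):

```python
import re

def migrate_format(fmt):
    """Rewrite an old-style my_vsnprintf format string to the proposed syntax."""
    # %`s (quote as identifier) -> %sI, %M (errno + strerror) -> %uE
    fmt = fmt.replace("%`s", "%sI").replace("%M", "%uE")
    # %b / %.Nb (binary string) -> %sB / %.NsB, preserving the field width
    fmt = re.sub(r"%(\.\d+)?b", lambda m: "%" + (m.group(1) or "") + "sB", fmt)
    return fmt

print(migrate_format("Unknown table %`s, data: %.100b, error %M"))
# prints: Unknown table %sI, data: %.100sB, error %uE
```

Standard specifiers pass through untouched, which is the point of keeping the new extensions gcc-parsable.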

Details: MDEV-21978
Mentor: Sergei Golubchik

Add JSON_EQUALS function to check JSON equality

JSON_CONTAINS can be used to test for JSON object equality in some cases, but we seem to lack a clear JSON_EQUALS function.

Details:
Mentor:

Details:

MDEV-23143

Mentor:

Vicențiu Ciorbaru

Concurrent multi-reader, multi-writer buffer for IO_CACHE

IO_CACHE has basically three read/write modes: read-only, write-only, and a sequential read/write FIFO mode, SEQ_READ_APPEND. Some performance-sensitive places, like the replication slave thread, use SEQ_READ_APPEND, and that may be a bottleneck, since reads and writes are sequential (and co-sequential, i.e. reads and writes block each other). The task is to implement a non-blocking mode for the multi-reader, multi-writer use case through a concurrent ring buffer implementation.

Possible approaches

Lock-free n-consumer, m-producer ring buffer

This implementation requires limiting the number of simultaneous accessors and reserving slots for them. Lock-free implementations can contain busy waits, but no locks, except when the number of consumers or producers is exceeded. This can be controlled by a semaphore with a capacity equal to the number of cores. This is the ideal way, but it can be an overhaul because of complicated busy loops and slot management. It is also hard because writes can be bigger than the buffer; see "Buffer excess" below.

Simple rwlock-based non-blocking approach

The bottleneck basically occurs because SEQ_READ_APPEND blocks for the whole duration of the buffer copy. We can avoid that by moving the pointers first, thus allocating a place for copying, and then making the copy from/to the buffer without a lock. An rwlock will be used to access the pointers, i.e. readers access IO_CACHE::end_of_file with a read lock to ensure the borders, while writers access it with a write lock.

Buffer excess

Excesses make things work sequentially. When the buffer is full, a separate write buffer is created. When the write buffer is full, a flush happens. Flushes wait for all writers to finish first, then lock the write buffer for flushing. The read buffer can be flushed in a more relaxed way: there is no need to lock for flushing, but we have to lock for buffer allocation and wait for all writers. Waiting for writers can be done with another rwlock.

Single-readerness

The real-world cases are mostly single-consumer, and that is essential for IO_CACHE: it is variable-length and has no underlying data format, so the reader always has to make at least two sequential reads (one to read the size and another to read the body). Single-reader considerations can relax some conditions and ease the implementation.

io_cache_reserve API

We can add a function to reserve space for writing, for the case of writing big objects (both those bigger than the write cache, and those smaller than it but big enough not to fit into the external buffer), for cases like copying one cache to another. The function should return a future-like object, since we have to notify IO_CACHE back that the writing is finished (to trigger a flush, for example).
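The rwlock-based "move pointers first, copy without a lock" idea can be sketched as follows. This is a simplified model (class name hypothetical): it ignores wrap-around and commit gaps between concurrent writers, and uses a condition variable where the real implementation would use an rwlock around the pointers:

```python
import threading

class ReserveBuffer:
    """Sketch: a writer takes the lock only long enough to reserve a
    slice (move the pointer), then fills it without holding the lock;
    readers wait until the slice is published (the 'committed' border,
    analogous to reading IO_CACHE::end_of_file under a read lock)."""
    def __init__(self, size):
        self.buf = bytearray(size)
        self.write_pos = 0                    # next free byte, moved under lock
        self.committed = 0                    # bytes safely readable
        self.lock = threading.Lock()
        self.ready = threading.Condition(self.lock)

    def append(self, data):
        with self.lock:                       # short critical section: pointers only
            start = self.write_pos
            self.write_pos += len(data)
        self.buf[start:start + len(data)] = data   # bulk copy outside the lock
        with self.ready:                      # publish the new readable border
            self.committed = max(self.committed, start + len(data))
            self.ready.notify_all()

    def read(self, offset, n):
        with self.ready:
            while self.committed < offset + n:
                self.ready.wait()             # block only until data is published
            return bytes(self.buf[offset:offset + n])
```

The point of the sketch is that the buffer copy itself never happens under the lock, which is exactly what SEQ_READ_APPEND currently cannot do.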

Details:
Mentor:

Details:

MDEV-24676

Mentor:

Nikita Malyavin

Custom formatting of strings in MariaDB queries

Formatting more complex strings in a SELECT statement can get awkward when there are many concat(), format(), etc. calls involved. It would be very cool and helpful to have a function that takes an input string and a formatting specification and returns a string formatted using the rules the user passed in the specification. A great example of such a function is the classic C printf function, which, in this context, would look something like:

SELECT printf('%s %s, %s', first_name, last_name, job_title) FROM employees;

But it doesn't necessarily need to look this way. An alternative syntax could be Python-ish, which would leverage the fact that the server already knows the datatype of each field used in the formatting scheme:

SELECT sformat('arg1: {}, arg2: {}', col1, col2) FROM table;

In that syntax one passes formatting options within the curly braces:

-- Print 'arg1: col1, arg2: col2'  where col1 from table is of datetime type and should be printed as: 'Sunday November 2021'
SELECT sformat('arg1: {%W %M %Y}, arg2: {}', col1, col2) FROM table;

Ideally, this new function should use, behind the scenes, the existing builtin formatting functions in MariaDB (e.g. date_format(), format()) and even future formatting functions (e.g. MySQL's format_bytes(), format_pico_time()), so the syntax has to be designed in a smart way to accommodate easily future additions.
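As a rough model of the Python-ish syntax above, here is a toy sformat in Python. The mapping from MariaDB date_format() specifiers to strftime() ones is illustrative, not a definitive design:

```python
import datetime
import re

# Hypothetical mapping from MariaDB date_format() specifiers to strftime().
MARIADB_TO_STRFTIME = {'%W': '%A', '%M': '%B', '%Y': '%Y'}

def sformat(template, *args):
    """Toy sformat(): '{}' prints the value as-is; '{%W %M %Y}' applies
    a date_format-like specification to a datetime argument."""
    it = iter(args)
    def repl(m):
        spec, val = m.group(1), next(it)
        if not spec:                      # empty braces: default rendering
            return str(val)
        for md, py in MARIADB_TO_STRFTIME.items():
            spec = spec.replace(md, py)
        return val.strftime(spec)         # datetime-typed argument assumed
    return re.sub(r'\{([^}]*)\}', repl, template)
```

For the example in the task, sformat('arg1: {%W %M %Y}, arg2: {}', ...) with a datetime of 2021-11-07 renders the first argument as 'Sunday November 2021', because the server-side type information tells us to treat it as a date.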

Details:
Mentor:

Details:

MDEV-25015

Mentor:

Sergei Golubchik

Add autocompletion capabilities to the MariaDB Jupyter kernel

As part of the Jupyter Messaging protocol, the Jupyter frontend sends a complete_request message to the MariaDB kernel when the user invokes the code completer in a Jupyter notebook. This message is handled in the do_complete function of the MariaDBKernel class. In simpler words, whenever the user hits the key shortcut for code autocompletion in a notebook, the MariaDB kernel's do_complete function is called with a number of arguments that help the kernel understand what the user wants to autocomplete. So the autocompletion infrastructure in the MariaDB kernel is already kindly provided by Jupyter; we only need to send back to Jupyter a list of suggestions based on the arguments that do_complete receives :-). Ideally we should aim to enable at least database, table and column name completion, and also SQL keyword completion. But no worries, there are plenty of possibilities to extend the functionality even more if the accepted student turns out to be very productive :D
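The shape of the reply is fixed by the Jupyter messaging protocol. A minimal sketch of a do_complete that suggests SQL keywords for the token under the cursor (the keyword list is illustrative) could look like:

```python
# Illustrative subset; a real kernel would also pull database, table and
# column names from the connected server.
SQL_KEYWORDS = ['SELECT', 'SHOW', 'SET', 'INSERT', 'UPDATE', 'DELETE']

def do_complete(code, cursor_pos):
    """Return a complete_reply-shaped dict: candidate completions for the
    token ending at cursor_pos, plus the span they should replace."""
    start = cursor_pos
    while start > 0 and (code[start - 1].isalnum() or code[start - 1] == '_'):
        start -= 1                       # walk back to the start of the token
    prefix = code[start:cursor_pos].upper()
    matches = [k for k in SQL_KEYWORDS if prefix and k.startswith(prefix)]
    return {'matches': matches,
            'cursor_start': start,       # Jupyter replaces this span
            'cursor_end': cursor_pos,
            'metadata': {},
            'status': 'ok'}
```

Typing "se" and invoking the completer would then offer SELECT and SET, with cursor_start/cursor_end telling the frontend which characters to replace.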

Details:
Mentor:

Details:

Project Issue

Mentor:

Robert Bindar

Implement interactive editing of result sets in the MariaDB Jupyter kernel

At this moment the MariaDB kernel is only capable of getting the result sets from the MariaDB client in HTML format and packing them in a Jupyter-compatible format. Jupyter then displays them in notebooks like it would display Python Pandas dataframes. Sure, the users can easily write SQL code to modify the content of a table, like they would in a classical command-line database client. But we want to go a bit further: we would love to have the capability to edit a result set returned by a SELECT statement (i.e. double-click on table cells and edit) and have a button that users can press to generate a SQL statement that will update the content of the table via the MariaDB server. Apart from interacting with the Jupyter frontend to provide this UI capability, we also have to implement field integrity functionality, so that we make sure users can't enter data that is not compatible with the datatype of the column as it is seen by the MariaDB server. The project should start with a fair bit of research to understand how we can use the Jupyter Messaging protocol to create the UI functionality, and also to check other Jupyter kernels and understand the right and best approach for tackling this.

Details:
Mentor:

Details:

Project Issue

Mentor:

Andreia Hendea

Make the MariaDB Jupyter kernel capable of dealing with huge SELECTs

Currently the MariaDB kernel doesn't impose any internal limit on the number of rows a user can SELECT in a notebook cell. Internally the kernel gets the result set from MariaDB and stores it in a pandas DataFrame, so users can use it with magic commands to chart data. But this DataFrame is stored in memory, so if you SELECT a huge number of rows, say 500k or 1M, it's probably not a very good idea to create such a huge DataFrame. We tested with 500k rows, and the DataFrame itself is not the biggest problem; it consumed around 500MB of memory. The problem is the number of rows the browser needs to render: for 500k rows the browser tab with the notebook consumes around 2GB of memory, so the Jupyter frontend (JupyterLab, Jupyter Notebook) slows down considerably. A potential solution is to introduce two new config options which would specify:

  • a limit for the number of rows the Jupyter notebook should render; a reasonable default value for this could be 50 rows (display_max_rows)

  • a limit for each SELECT statement, limit_max_rows, that the kernel would use to determine whether it should store the result set in memory in a DataFrame or store the result set on disk. A reasonable default value might be 100k rows.

The trickiest part of the project, though, is that once the kernel writes a result set to disk, the charting magic commands need to detect that the data is not in memory but on disk, and they should find a smart mechanism for generating the chart from the on-disk data without loading the entire data set into memory (which would defeat the whole purpose of the project). This might involve finding a new Python plotting library (instead of the current matplotlib) that can accomplish the job.
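Charting from on-disk data without loading it can work by streaming the file and keeping only running aggregates. A sketch (the column name and function are hypothetical) that buckets a numeric column in one pass over a CSV stream, which is all a bar chart needs:

```python
import csv
import io

def column_histogram(fileobj, column, bins, lo, hi):
    """One pass over a CSV stream: per-bucket counts for a numeric
    column, never materializing the full result set in memory."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for row in csv.DictReader(fileobj):
        v = float(row[column])
        i = min(int((v - lo) / width), bins - 1)   # clamp the max value into the last bin
        counts[i] += 1
    return counts

# Stand-in for a result set the kernel spilled to disk.
data = "amount\n10\n20\n30\n95\n"
print(column_histogram(io.StringIO(data), 'amount', 2, 0, 100))
```

Memory use stays proportional to the number of buckets, not the number of rows, regardless of how large the spilled SELECT was.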

Details:
Mentor:

Details:

Project Issue

Mentor:

Vlad Bogolin

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2020

We participated in the Google Summer of Code 2020. The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, ODBC, Java) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip and on IRC to mingle with the community. Don't forget to subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

A few handy tips for any interested students who are unsure which projects to choose: Blog post from a former GSoC student & mentor

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

Also see the List of beginner friendly issues and issues labelled gsoc20 from the MariaDB Issue Tracker.

List of Tasks

MariaDB Server: Optimizer

Evaluate subquery predicates earlier or later depending on their SELECTIVITY

(Based on a conversation with Igor) There are a lot of subquery conditions out there that are inexpensive to evaluate and have good selectivity. If we just implement MDEV-83, we may get regressions. We need to take the subquery condition's selectivity into account. It is difficult to get a meaningful estimate for an arbitrary, correlated subquery predicate. One possible solution is to measure selectivity during execution and reattach predicates on the fly. We don't want to change the query plan all the time; one way to dynamically move items between item trees is to wrap them inside Item_func_trig_cond, so we can switch them on and off.

Details:
Mentor:

Details:

MDEV-383

Mentor:

Igor Babaev

Histograms with equal-width bins in MariaDB

Histograms with equal-width bins are easy to construct using samples. For this it's enough to look through the given sample set and, for each value, figure out which bin the value can be placed in. Each bin requires only one counter. Let f be a column of a table with N rows and n be the number of samples by which the equal-width histogram of k bins for this column is constructed. Suppose that, after looking through all sample rows, the counters created for the histogram bins contain the numbers c[1],..,c[k]. Then m[i] = c[i]/n * 100 is the percentage of the rows whose values of f are expected to be in the interval

[min(f) + (max(f)-min(f))/k * (i-1), min(f) + (max(f)-min(f))/k * i).

It means that if the sample rows have been chosen randomly, the expected number of rows with values of f from this interval can be approximated by m[i]/100 * N. To collect such statistics it is suggested to use the following variant of the ANALYZE TABLE command:

ANALYZE FAST TABLE tbl [ WITH n ROWS ] [SAMPLING p PERCENTS ]
   PERSISTENT FOR COLUMNS (col1 [IN RANGE r] [WITH k INTERVALS],...)

Here:

  • 'WITH n ROWS' provides an estimate for the number of rows in the table in the case when this estimate cannot be obtained from statistical data.

  • 'SAMPLING p PERCENTS' provides the percentage of sample rows to collect statistics. If this is omitted the number is taken from the system variable samples_ratio.

  • 'IN RANGE r' sets the range of equal-width bins of the histogram built for the column col1. If this is omitted and the min and max values for the column can be read from the statistical data, then the histogram is built for the range [min(col1), max(col1)]. Otherwise the range [MIN_type(col1), MAX_type(col1)] is considered. Values beyond the given range, if any, are also taken into account in two additional bins.

  • WITH k INTERVALS says how many bins are included in the histogram. If it is omitted this value is taken from the system variable histogram_size.
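The estimate described above can be sketched in Python (function name hypothetical): count samples per equal-width bin, then scale each counter to the table size via m[i] = c[i]/n * 100 and m[i]/100 * N:

```python
def estimate_rows(samples, N, k, lo, hi):
    """Equal-width histogram from a sample: c[i] counts samples per bin;
    the expected rows in bin i is approximated by c[i]/n * N."""
    counts = [0] * k
    width = (hi - lo) / k
    for v in samples:
        i = min(int((v - lo) / width), k - 1)   # clamp hi into the last bin
        counts[i] += 1
    n = len(samples)
    return [c / n * N for c in counts]
```

For example, 4 samples [1, 2, 3, 9] over the range [0, 10) with k=2 put three samples in the first bin and one in the second, so for a table of N=100 rows the estimate is 75 and 25 rows respectively.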

Details:
Mentor:

Details:

MDEV-12313

Mentor:

Vicentiu Ciorbaru

Add FULL OUTER JOIN to MariaDB

Add support for FULL OUTER JOIN. One way to implement it is to rewrite the query

SELECT t1.*, t2.* FROM t1 FULL OUTER JOIN t2 ON P(t1,t2)

into the following union all:

SELECT t1.*, t2.* FROM t1 LEFT OUTER JOIN t2 ON P(t1,t2) 
UNION ALL 
SELECT t1.*,t2.* FROM t2 LEFT OUTER JOIN t1 ON P(t1,t2) WHERE t1.a IS NULL

Here t1.a is some non-nullable column of t1 (e.g. the column of single column primary key).

Details:
Mentor:

Details:

MDEV-13648

Mentor:

Igor Babaev

Recursive CTE support for UPDATE (and DELETE) statements

CREATE TABLE tree (
       `Node` VARCHAR(3),
       `ParentNode` VARCHAR(3),
       `EmployeeID` INTEGER,
       `Depth` INTEGER,
       `Lineage` VARCHAR(16)
     );

     INSERT INTO tree
       (`Node`, `ParentNode`, `EmployeeID`, `Depth`, `Lineage`)
     VALUES
       ('100', NULL, '1001', 0, '/'),
       ('101', '100', '1002', NULL, NULL),
       ('102', '101', '1003', NULL, NULL),
       ('103', '102', '1004', NULL, NULL),
       ('104', '102', '1005', NULL, NULL),
       ('105', '102', '1006', NULL, NULL);
 
     WITH RECURSIVE prev AS (
     SELECT * FROM tree WHERE ParentNode IS NULL
     UNION
     SELECT t.Node,t.ParentNode,t.EmployeeID,p.Depth + 1 AS Depth, CONCAT(p.Lineage, t.ParentNode, '/')
     FROM tree t JOIN prev p ON t.ParentNode = p.Node
     )
     SELECT * FROM prev;

     WITH RECURSIVE prev AS (
     SELECT * FROM tree WHERE ParentNode IS NULL
     UNION
     SELECT t.Node,t.ParentNode,t.EmployeeID,p.Depth + 1 AS Depth, CONCAT(p.Lineage, t.ParentNode, '/')
     FROM prev p JOIN tree t ON t.ParentNode = p.Node
     )
     UPDATE tree t, prev p SET t.Depth=p.Depth, t.Lineage=p.Lineage WHERE t.Node=p.Node; 

You have an error in your SQL syntax; check the manual that corresponds to your MariaDB server version for the right syntax to use near 'UPDATE tree t, prev p SET t.Depth=p.Depth, t.Lineage=p.Lineage WHERE t.Node=p.No' at line 7

This is supported in MySQL 8.0 and MSSQL.

Details:
Mentor:

Details:

MDEV-18511

Mentor:

Igor Babaev

MariaDB Server: others

Support for GTID in mysqlbinlog

The mysqlbinlog client program needs to be updated to support GTID. Here is a suggested list of things to be done:

  • The --start-position and --stop-position options should be able to take GTID positions; or maybe there should be new --start-gtid and --stop-gtid options. Like --start-gtid=0-1-100,1-2-200,2-1-1000.

  • A GTID position means the point just after that GTID. So starting from GTID 0-1-100 and stopping at GTID 0-1-200, the first GTID output will probably be 0-1-101 and the last one 0-1-200. Note that if some domain is not specified in the position, it means to start from the beginning, or respectively to stop immediately, in that domain.

  • Starting and stopping GTID should work both with local files, and with --read-from-remote-server. For the latter, there are a couple of extra things that need doing in the master-slave protocol, see get_master_version_and_clock() in sql/slave.cc.

  • At the end of the dump, put these statements, to reduce the risk of those session variables incorrectly spilling into subsequent statements run in the same session:

SET session.server_id = @@global.server_id,
       session.gtid_domain_id=@@global.gtid_domain_id;
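The start/stop semantics described above can be sketched as follows (helper names hypothetical): parse a position list like 0-1-100,1-2-200, then decide per event whether it lies after the start position, remembering that a position means the point just after that GTID and that an unlisted domain starts from the beginning:

```python
def parse_gtid_list(s):
    """'0-1-100,1-2-200' -> {domain: (server_id, seq_no)}."""
    pos = {}
    for part in s.split(','):
        domain, server, seq = (int(x) for x in part.split('-'))
        pos[domain] = (server, seq)
    return pos

def after_start(gtid, start):
    """True if the event with this (domain, server_id, seq_no) should be
    output when starting from position 'start': a position is the point
    just AFTER that GTID, so seq_no must exceed it; a domain missing
    from the position starts from the beginning."""
    domain, _server, seq = gtid
    if domain not in start:
        return True
    return seq > start[domain][1]
```

With --start-gtid=0-1-100, event 0-1-100 itself is excluded and 0-1-101 is the first one output, matching the description above.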
Details:
Mentor:

Details:

MDEV-4989

Mentor:

Andrei Elkin

connection encryption plugin support

As a follow-on to MDEV-4691 we would like GSSAPI encryption (in addition to authentication) support in MariaDB. I am told that the current plan is to create a plugin interface and then we can build GSSAPI encryption on top of that, so here is a ticket for that. From having written GSSAPI for the internal interface, there were a couple things I would like to see in the plugin encryption interface.

First, GSSAPI is weird in that it does authentication before encryption (TLS/SSL are the other way around, establishing an encrypted channel and then doing authentication over it). Of course support for this is needed, but more importantly, packets must be processed in a fully serialized fashion. This is because encrypted packets may be queued while one end of the connection is still finishing up processing the authentication handshake. One way to do this is registering "handle" callbacks with connection-specific state, but there are definitely others.

Additionally, for whatever conception there ends up being of authentication and encryption, it needs to be possible to share more data than just a socket between them. The same context will be used for authentication and encryption, much as an SSL context is (except of course we go from authentication to encryption and not the other way around). This ties into an issue of dependency. If authentication plugins are separate entities from encryption plugins in the final architecture, it might make sense to do mix-and-match authentication with encryption. However, there are cases - and GSSAPI is one - where doing encryption requires a certain kind of authentication (or vice versa). You can't do GSSAPI encryption without first doing GSSAPI authentication. (Whether or not it's permitted to do GSSAPI auth->encryption all over a TLS channel, for instance, is not something I'm concerned about.)

Finally, encrypted messages are larger than their non-encrypted counterparts. The transport layer should cope with this so that plugins don't have to think about reassembly, keeping in mind that there may not be a way to get the size of a message when encrypted without first encrypting it. It's unfortunately been a little while since I wrote that code, but I think those were the main things that we'll need for GSSAPI. Thanks!

Details:
Mentor:

Details:

MDEV-9090

Mentor:

Vicențiu Ciorbaru

Aggregate Window Functions

With a few exceptions, most native aggregate functions are supported as window functions. In MDEV-7773, support for creating custom aggregate functions was added. This task proposes to extend that feature and allow custom aggregate functions to be used as window functions. An example of creating a custom aggregate function is given below:

CREATE AGGREGATE FUNCTION agg_sum(x INT) RETURNS DOUBLE
BEGIN
  DECLARE z DOUBLE DEFAULT 0;
  DECLARE CONTINUE HANDLER FOR NOT FOUND RETURN z;
  LOOP
    FETCH GROUP NEXT ROW;
    SET z = z + x;
  END LOOP;
END|

This function can be used in the following query:

CREATE TABLE balances (id INT, amount INT);
INSERT INTO balances VALUES (1, 10), (2, 20), (3, 30);
 
SELECT agg_sum(amount) FROM balances;

After this task is complete the following must also work:

SELECT agg_sum(amount) OVER (ORDER BY id);
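The semantics the window form has to produce — applying the aggregate to the default frame, from the first row up to the current row in ORDER BY order — can be sketched in Python, with the custom agg_sum written as a plain function:

```python
def agg_sum(values):
    # Same accumulation loop the CREATE AGGREGATE FUNCTION example performs.
    z = 0
    for x in values:
        z += x
    return z

def over_order_by(rows, key, agg, col):
    """Apply agg over the default window frame (unbounded preceding to
    current row) after ordering by key. Illustration of the expected
    semantics, not the server's execution strategy."""
    ordered = sorted(rows, key=lambda r: r[key])
    return [agg([r[col] for r in ordered[:i + 1]])
            for i in range(len(ordered))]

balances = [{'id': 1, 'amount': 10},
            {'id': 2, 'amount': 20},
            {'id': 3, 'amount': 30}]
print(over_order_by(balances, 'id', agg_sum, 'amount'))
```

For the balances table above this prints the running sums 10, 30, 60 — one aggregate result per row, which is exactly what distinguishes the window form from the plain GROUP BY-style call.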
Details:
Mentor:

Details:

MDEV-11263

Mentor:

Varun Gupta

Improve mysqltest language

mysqltest has a lot of historical problems:

  • ad hoc parser, weird limitations

  • commands added as needed with no view over the total language structure

  • historical code issues (e.g. casts that became unnecessary 10 years ago), etc.

A lot can be done to improve it. Ideas:

  • control structures, else in if, break and continue in while, for (or foreach) loop

  • proper expression support in let, if, etc

  • rich enough expressions to make resorting to sql unnecessary in most cases

  • remove unused and redundant commands (e.g. system vs exec, query_vertical vs vertical_results ONCE)

  • remove complex commands that do many sql statements under the hood, if they can be scripted, e.g. sync_slave_with_master

  • remove over-verbose treatment of rpl test failures

  • scoped variables

  • parameters for the source command

  • remove dead code

Details:
Mentor:

Details:

MDEV-12130

Mentor:

Sergei Golubchik

Implement multiple-table UPDATE/DELETE returning a result set

A multiple-table UPDATE first performs join operations, then it updates the matching rows. A multiple-table UPDATE returning a result set does the following:

  • first performs join operations

  • for each row of the result of the join it calculates some expressions over the columns of the join and forms from them a row of the returned result set

  • after this it updates the matching rows

A multiple-table DELETE first performs join operations, then it deletes the matching rows. A multiple-table DELETE returning a result set does the following:

  • first performs join operations

  • for each row of the result of the join it calculates some expressions over the columns of the join and forms from them a row of the returned result set

  • after this it deletes the matching rows

Details:
Mentor:

Details:

MDEV-12326

Mentor:

Igor Babaev

sort out the compression library chaos

As MariaDB is getting more storage engines, and as they're getting more features, MariaDB can optionally use more and more compression libraries for various purposes. InnoDB, TokuDB, RocksDB — they all can use different sets of compression libraries. Compiling them all in would result in a lot of run-time/rpm/deb dependencies, most of which will never be used by most of the users. Not compiling them in would result in requests to compile them in. While most users don't use all of these libraries, many users use some of them. A solution could be to load these libraries on request, without creating a packaging dependency. There are different ways to do it:

  • hide all compression libraries behind a single unified compression API. Either develop our own or use something like Squash. This would require changing all engines to use this API

  • use the same approach as in server services — create a service per compression library; a service implementation will just return an error code for any function invocation if the corresponding library is not installed. This way — maybe — we could avoid modifying all affected storage engines

Details:
Mentor:

Details:

MDEV-12933

Mentor:

Sergei Golubchik

Control over memory allocated for SP/PS

SP/PS (Stored Procedures / Prepared Statements) allocate memory until the PS cache or the SP is destroyed. There is no way to see how much memory is allocated and whether it grows with each execution (the first two executions can lead to new memory allocations, but later ones should not).

Task minimum:

Status variables which count the memory used/allocated for SP/PS by thread and/or for the server.

Other ideas:

  • Automatically stop allocation in the debugging version after the second execution, and raise an exception on any further attempt.

  • An Information Schema table listing threads and SP/PS, with information about allocated and used memory

Information can be collected in the MEM_ROOTs of the SP/PS. By storing info about the status of the mem_root before execution and checking it afterwards, newly allocated memory can be found. MEM_ROOT can be changed to have a debug mode which makes it read-only, and which can be switched on after the second execution.
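The per-execution accounting and the read-only-after-second-execution idea can be sketched as follows (class name hypothetical; the real mechanism would live inside MEM_ROOT):

```python
class TrackedRoot:
    """Counts bytes allocated across executions; in debug mode any
    allocation after the second execution raises, mimicking the
    proposed read-only MEM_ROOT switch."""
    def __init__(self, debug=False):
        self.debug = debug
        self.execution = 0       # how many executions have started
        self.allocated = 0       # total bytes handed out (the status variable)

    def start_execution(self):
        self.execution += 1

    def alloc(self, nbytes):
        if self.debug and self.execution > 2:
            # The first two executions may legitimately allocate; later
            # ones should reuse what is already there.
            raise RuntimeError('allocation after second execution')
        self.allocated += nbytes
        return bytearray(nbytes)
```

The allocated counter is what the proposed status variables would expose per thread or server-wide; the raise is the "call exception on attempt" behavior for debug builds.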

Details:
Mentor:

Details:

MDEV-14959

Mentor:

Oleksandr Byelkin

MariaDB ColumnStore

Full DECIMAL support in ColumnStore

MariaDB ColumnStore supports DECIMAL with some limitations:

  1. We do not support the full DECIMAL range that is in MariaDB

  2. In several places in the code we convert the DECIMAL to DOUBLE during execution, therefore losing precision

Implementing this will likely require the following:

  • Implementation of methods to handle MariaDB's DECIMAL format

  • Support for a longer than 8-byte numeric column type (there is an InfiniDB tree with work for this already)

  • Modification of the primitives processor for the math

  • Modification of the function expression processor to handle the new type

  • Version upgrade support for DECIMAL from the current form to the new form

Details
Mentor:

Details

MCOL-641

Mentor:

Andrew Hutchings

Replace glibc with Google's re2 for regex processing

CS as of 1.4.2 relies on glibc for regex processing. We need to replace glibc with re2 for LIKE, REGEX and other facilities, as this affects performance.

  1. Identify places with glibc regex functions invocations

  2. Pick the invocations that significantly affect the timing of queries

  3. Replace glibc regex calls with the appropriate re2 ones

Details
Mentor:

Details

MCOL-3778

Mentor:

Roman

Speed up RowGroup data access methods by reducing the level of indirection

RowGroup is the unit of data sent around the CS cluster. It is basically a set of records for fixed-size data types, plus an additional storage area for binary data, e.g. strings. Right now there are lots of RowGroup methods that access RowGroup fixed-size data using an extra level of indirection. Here is an example:

return ((int64_t) &data[offsets[colIndex]]);

This expression uses an extra assembly instruction to calculate the effective address. We want to remove the 'offsets[colIndex]' part of this and similar expressions in the RowGroup code.

Details
Mentor:

Details

MCOL-3759

Mentor:

Roman

Parallel sorting 2nd phase

As of 1.4.1 CS uses two-phase sorting. Here are the phases:

  • Presort partial runs of data.

  • Merge the presorted partial runs produced during the 1st phase.

Here is a more detailed explanation of how sorting works as of 1.4.1:

CS gets a portion of record data from the previous steps of the query execution (an RGData instance from the ring buffer of RGData-s) and produces a sorting run out of it using the existing sorting class LimitedOrderBy. If the query contains LIMIT, then we apply it at this phase. This allows significantly reducing the data set cardinality. If the query contains LIMIT + OFFSET, then CS builds a sorted run of records that is up to LIMIT+OFFSET in size. CS does this step in parallel, dividing the whole data set into k runs, where k is governed by a session variable, infinidb/columnstore_orderby_threads. At this phase CS tries to preallocate memory in QUEUE_RESERVE_SIZE batches. CS then merges and sorts the k presorted partial runs produced by the previous phase in a single thread. If the query contains the DISTINCT keyword, CS rebuilds a hash map to preserve uniqueness. We want to make the 2nd phase also parallel, using range partitioning of the presorted runs produced by the 1st phase. After the 1st phase finishes, we know the distribution of the sorting key values and thus can divide the key value range into regions - buckets. Every 2nd-phase thread takes values from the corresponding region buckets (covering the same value region) from every 1st-phase sorted run. Then all 2nd-phase threads sort their runs in parallel. In the end we put the sorted regions, in ascending order of the key values, into the output stream.
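The proposed 2nd-phase range partitioning can be sketched as follows: each presorted run is cut at common boundary values taken from the observed key distribution, and each bucket is then merged independently (in a real implementation, by a separate thread). Names are hypothetical:

```python
import bisect
import heapq

def merge_by_ranges(runs, boundaries):
    """runs: presorted lists from phase 1; boundaries: split points
    chosen from the observed key distribution. Each bucket collects the
    matching slice of every run and can be merged on its own thread;
    concatenating the merged buckets yields the full sorted output."""
    edges = list(boundaries)
    buckets = [[] for _ in range(len(edges) + 1)]
    for run in runs:
        prev = 0
        for b, edge in enumerate(edges):
            cut = bisect.bisect_left(run, edge, prev)   # runs are sorted
            buckets[b].append(run[prev:cut])
            prev = cut
        buckets[-1].append(run[prev:])                  # tail region
    # Merge each bucket of presorted slices independently; buckets are
    # disjoint key regions, so their outputs concatenate in order.
    return [x for parts in buckets for x in heapq.merge(*parts)]
```

Because the buckets cover disjoint key regions, no coordination between them is needed after partitioning, which is what makes the 2nd phase parallelizable.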

Details
Mentor:

Details

MCOL-3758

Mentor:

Roman

Add a bitmap for NULL/empty data

We need a bitmap to store NULL/empty values instead of in-column values for this.

Details
Mentor:

Details

MCOL-3754

Mentor:

Anderw Hutchings

Engine independent statistics for Columnstore

CS now has very rudimentary query optimization capabilities, and we want to improve the situation. We are considering using the Server's optimizer for this purpose, but the Server needs statistics, namely value distribution histograms and Number of Distinct Values (NDV) histograms. There are different levels of complexity for the task:

  • implement standalone segment files reader that in the end populates both mysql.column_stats and mysql.table_stats using out of band mariadb client connection

  • implement ANALYZE TABLE functionality for Columnstore engine

  • implement ANALYZE TABLE and histograms with equal-width bins for value distribution histograms (similar to MDEV-12313), together with NDV histograms, to decrease I/O

We expect to have both unit and regression tests, but this is optional.

Details
Mentor:

Details

MCOL-2223

Mentor:

Roman

mcsapi needs a new read API design

We need an ORM-style NoSQL read API to go along with the bulk write API of mcsapi. This will likely take the form of:

  1. A reader in ExeMgr which will convert messages from mcsapi into jobs

  2. Code in mcsapi to send/receive the messages

Although ExeMgr can already receive messages with an execution plan, the format is very complex and the ABI breaks easily (we often send whole C++ objects). We should look at other ORM frameworks for inspiration for the API design. This task is to do the design for this API.

Details:
Mentor:

Details:

MCOL-1151

Mentor:

Andrew Hutchings

Use JIT for aggregation functions

CS uses the Volcano processing approach, working on one value at a time. This is a very inefficient way to handle analytics workloads, which usually use lots of aggregation functions in projections, filtering or sorting. We are interested in using JIT for basic aggregation functions: sum, avg, count, min, max. The patch must compile and run a program that processes and returns the aggregation function result. We wrote this description with LLVM in mind, as it is widespread and has lots of examples in the wild. I suggest starting with RowAggregation::addRowGroup() from ./utils/rowgroup/rowaggregation.cpp to see what it takes to compute the avg() function result. Here is the link on how to quickly build a CS developer environment.

Details:
Mentor:

Details:

MCOL-2222

Mentor:

Roman


Histograms: use JSON as on-disk format

Currently, histograms are stored as an array of 1-byte bucket bounds (SINGLE_PREC_HB) or 2-byte bucket bounds (DOUBLE_PREC_HB). The table storing the histograms supports different histogram formats, but limits them to 255 bytes (hist_size is a TINYINT):

CREATE TABLE mysql.column_stats (
  min_value varbinary(255) DEFAULT NULL, 
  max_value varbinary(255) DEFAULT NULL, 
  ...
  hist_size TINYINT UNSIGNED, 
  hist_type ENUM('SINGLE_PREC_HB','DOUBLE_PREC_HB'), 
  histogram varbinary(255), 
  ...

This prevents us from supporting other kinds of histograms. The first low-hanging fruit would be to store the histogram bucket bounds precisely (like MySQL and PostgreSQL do, for example). The idea of this MDEV is to switch to JSON as the storage format for histograms.
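A sketch of what JSON storage could look like: precise bucket bounds serialized to text, plus a simple range-selectivity estimate over them. The field names are hypothetical, and the estimate assumes equal-height buckets (each bucket holds the same fraction of rows):

```python
import json

def histogram_to_json(bounds):
    """Store precise bucket bounds instead of 1- or 2-byte fractions;
    JSON text has no fixed 255-byte ceiling."""
    return json.dumps({'histogram_type': 'JSON_HB',   # hypothetical tag
                       'bucket_bounds': bounds})

def selectivity(hist_json, lo, hi):
    """Estimated fraction of rows with value in [lo, hi): count the
    buckets the interval overlaps, each worth 1/nbuckets of the rows."""
    bounds = json.loads(hist_json)['bucket_bounds']
    nbuckets = len(bounds) - 1
    hit = sum(1 for i in range(nbuckets)
              if bounds[i] < hi and bounds[i + 1] > lo)
    return hit / nbuckets
```

With bounds [0, 10, 20, 30, 40], a predicate covering [0, 10) overlaps one of the four buckets, giving an estimated selectivity of 0.25; a JSON format leaves room to later store per-bucket frequencies for a finer estimate.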

Details:
Mentor:

Details:

MDEV-21130

Mentor:

Sergei Petrunia

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2019

We participated in the Google Summer of Code 2019. The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, ODBC, Java) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to Start

Please join us on Zulip and on IRC to mingle with the community. Don't forget to subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

A few handy tips for any interested students who are unsure which projects to choose: Blog post from a former GSoC student & mentor

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

List of beginner friendly bugs

List of Tasks

Loaded from the MariaDB issue tracker

MariaDB Server: Optimizer

Evaluate subquery predicates earlier or later depending on their SELECTIVITY

(Based on a conversation with Igor) There are a lot of subquery conditions out there that are inexpensive to evaluate and have good selectivity. If we just implement MDEV-83, we may get regressions, so we need to take the subquery condition's selectivity into account. It is difficult to get a meaningful estimate for an arbitrary, correlated subquery predicate. One possible solution is to measure selectivity during execution and reattach predicates on the fly. We don't want to change the query plan all the time; one way to dynamically move items between item trees is to wrap them inside Item_func_trig_cond so we can switch them on and off.
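The wrapping idea can be sketched as follows. This is a rough Python sketch; TrigCond and its fields are illustrative stand-ins, not the server's actual classes.

```python
# Sketch of the Item_func_trig_cond idea: a wrapper that lets the
# executor switch a predicate on and off without restructuring the
# item tree.
class TrigCond:
    def __init__(self, pred):
        self.pred = pred      # the wrapped subquery predicate
        self.enabled = True

    def __call__(self, row):
        # When switched off, the condition behaves as always-true, so
        # the predicate is effectively detached at this point.
        return self.pred(row) if self.enabled else True

rows = list(range(10))
cond = TrigCond(lambda r: r % 2 == 0)

# Measure selectivity during execution:
passed = sum(1 for r in rows if cond.pred(r))
selectivity = passed / len(rows)      # 0.5: the predicate filters half

# If the predicate turned out expensive or unselective, the executor
# could switch it off here and re-evaluate it at a later plan node:
cond.enabled = False
all_pass = all(cond(r) for r in rows)  # disabled wrapper passes everything
```
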

Details: MDEV-383
Mentor: Igor Babaev

Add support for Indexes on Expressions

An index on an expression means something like

CREATE TABLE t1 (a INT, b INT, INDEX (a/2+b));
...
SELECT * FROM t1 WHERE a/2+b=100;

in this case the optimizer should be able to use the index. This task naturally splits into two steps:

  1. add expression matching into the optimizer and use it for generated columns, as in CREATE TABLE t1 (a INT, b INT, v INT GENERATED ALWAYS AS (a/2+b), INDEX (v));

  2. support the syntax to create an index on an expression directly; this will automatically create a hidden generated column under the hood

The original task description is visible in the history.

Details: MDEV-6017
Mentor: Sergei Golubchik

Histograms with equal-width bins in MariaDB

Histograms with equal-width bins are easy to construct using samples. For this it's enough to look through the given sample set and, for each value, figure out what bin the value falls into. Each bin requires only one counter. Let f be a column of a table with N rows and n be the number of samples by which the equal-width histogram of k bins for this column is constructed. Suppose that after looking through all sample rows the counters created for the histogram bins contain the numbers c[1],..,c[k]. Then m[i] = c[i]/n * 100 is the percentage of the rows whose values of f are expected to be in the interval

((max(f)-min(f))/k * (i-1), (max(f)-min(f))/k * i).

It means that if the sample rows have been chosen randomly, the expected number of rows with values of f from this interval can be approximated by m[i]/100 * N. To collect such statistics it is suggested to use the following variant of the ANALYZE TABLE command:

ANALYZE FAST TABLE tbl [ WITH n ROWS ] [SAMPLING p PERCENTS ]
   PERSISTENT FOR COLUMNS (col1 [IN RANGE r] [WITH k INTERVALS],...)

Here:

  • 'WITH n ROWS' provides an estimate for the number of rows in the table in the case when this estimate cannot be obtained from statistical data.

  • 'SAMPLING p PERCENTS' provides the percentage of sample rows to collect statistics. If this is omitted the number is taken from the system variable samples_ratio.

  • 'IN RANGE r' sets the range of equal-width bins of the histogram built for the column col1. If this is omitted and the min and max values for the column can be read from the statistical data, then the histogram is built for the range [min(col1), max(col1)]. Otherwise the range [MIN_type(col1), MAX_type(col1)] is considered. The values beyond the given range, if any, are also taken into account in two additional bins.

  • 'WITH k INTERVALS' says how many bins are included in the histogram. If it is omitted, this value is taken from the system variable histogram_size.
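The counting scheme described above can be sketched as follows. This is a Python sketch with illustrative names; it follows the formulas above: c[i] sample counts, m[i] percentages, and m[i]/100 * N estimated rows.

```python
# Equal-width histogram construction from a sample of column f.
def equal_width_histogram(samples, lo, hi, k):
    counters = [0] * k
    width = (hi - lo) / k
    for v in samples:
        # clamp v == hi into the last bin
        i = min(int((v - lo) / width), k - 1)
        counters[i] += 1
    return counters

samples = [1, 2, 2, 3, 5, 6, 7, 8, 8, 9]   # n = 10 sampled values
c = equal_width_histogram(samples, lo=0, hi=10, k=5)

n, N = len(samples), 1000                  # N = estimated table row count
m = [100 * ci / n for ci in c]             # percentage per bin
estimated_rows = [mi / 100 * N for mi in m]
```
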

Details: MDEV-12313
Mentor: Vicentiu Ciorbaru

Add FULL OUTER JOIN to MariaDB

Add support for FULL OUTER JOIN. One way to implement it is to rewrite the query

SELECT t1.*, t2.* FROM t1 FULL OUTER JOIN t2 ON P(t1,t2)

into the following UNION ALL:

SELECT t1.*, t2.* FROM t1 LEFT OUTER JOIN t2 ON P(t1,t2) 
UNION ALL 
SELECT t1.*,t2.* FROM t2 LEFT OUTER JOIN t1 ON P(t1,t2) WHERE t1.a IS NULL

Here t1.a is some non-nullable column of t1 (e.g. the column of a single-column primary key).
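The rewrite's semantics can be checked with a small simulation. This is a Python sketch over hypothetical tables, not the server implementation: the left-join part plus an anti-join part for the t2 rows with no match in t1.

```python
# Simulate FULL OUTER JOIN as LEFT JOIN UNION ALL anti-joined LEFT JOIN.
t1 = [(1, 'a'), (2, 'b')]
t2 = [(2, 'x'), (3, 'y')]
match = lambda r1, r2: r1[0] == r2[0]        # the join predicate P(t1,t2)

# LEFT OUTER JOIN t1 -> t2:
left = [(r1, r2) for r1 in t1 for r2 in t2 if match(r1, r2)]
left += [(r1, None) for r1 in t1 if not any(match(r1, r2) for r2 in t2)]

# LEFT OUTER JOIN t2 -> t1 ... WHERE t1.a IS NULL (the anti-join part):
right_only = [(None, r2) for r2 in t2
              if not any(match(r1, r2) for r1 in t1)]

full_outer = left + right_only
```
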

Details: MDEV-13648
Mentor: Igor Babaev

Recursive CTE support for UPDATE (and DELETE) statements

CREATE TABLE tree (
  `Node` VARCHAR(3),
  `ParentNode` VARCHAR(3),
  `EmployeeID` INTEGER,
  `Depth` INTEGER,
  `Lineage` VARCHAR(16)
);

INSERT INTO tree
  (`Node`, `ParentNode`, `EmployeeID`, `Depth`, `Lineage`)
VALUES
  ('100', NULL, '1001', 0, '/'),
  ('101', '100', '1002', NULL, NULL),
  ('102', '101', '1003', NULL, NULL),
  ('103', '102', '1004', NULL, NULL),
  ('104', '102', '1005', NULL, NULL),
  ('105', '102', '1006', NULL, NULL);

WITH RECURSIVE prev AS (
SELECT * FROM tree WHERE ParentNode IS NULL
UNION
SELECT t.Node,t.ParentNode,t.EmployeeID,p.Depth + 1 AS Depth, CONCAT(p.Lineage, t.ParentNode, '/')
FROM tree t JOIN prev p ON t.ParentNode = p.Node
)
SELECT * FROM prev;

WITH RECURSIVE prev AS (
SELECT * FROM tree WHERE ParentNode IS NULL
UNION
SELECT t.Node,t.ParentNode,t.EmployeeID,p.Depth + 1 AS Depth, CONCAT(p.Lineage, t.ParentNode, '/')
FROM prev p JOIN tree t ON t.ParentNode = p.Node
)
UPDATE tree t, prev p SET t.Depth=p.Depth, t.Lineage=p.Lineage WHERE t.Node=p.Node;

The UPDATE statement above currently fails with:

You have an error in your SQL syntax; check the manual that corresponds to your MariaDB server version for the right syntax to use near 'UPDATE tree t, prev p SET t.Depth=p.Depth, t.Lineage=p.Lineage WHERE t.Node=p.No' at line 7

This is supported in MySQL 8.0 and MSSQL.

Details: MDEV-18511
Mentor: Igor Babaev

Implement EXCEPT ALL and INTERSECT ALL operations

The SQL Standard allows the use of EXCEPT ALL and INTERSECT ALL as set operations, but MariaDB Server currently does not support them. The goal of this task is to support EXCEPT ALL and INTERSECT ALL

  1. at the syntax level: allow the operators EXCEPT ALL and INTERSECT ALL in a query expression body

  2. at the execution level: implement these operations employing temporary tables (the implementation could use an idea similar to that used for the existing implementation of the INTERSECT operation).
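The counter-based execution idea can be sketched like this. It is a Python sketch, not the server implementation: per-row counters stand in for a temporary table with a counter column.

```python
# Bag (multiset) semantics of INTERSECT ALL / EXCEPT ALL via counters.
from collections import Counter

def intersect_all(a, b):
    ca, cb = Counter(a), Counter(b)
    # a row appears min(count in a, count in b) times
    return [row for row in ca for _ in range(min(ca[row], cb[row]))]

def except_all(a, b):
    ca, cb = Counter(a), Counter(b)
    # a row appears max(count in a - count in b, 0) times
    return [row for row in ca for _ in range(max(ca[row] - cb[row], 0))]

a = [1, 1, 2, 3]
b = [1, 2, 2]
inter = intersect_all(a, b)   # one 1, one 2
diff = except_all(a, b)       # one 1, one 3
```
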

Details: MDEV-18844
Mentor: Igor Babaev

MariaDB Server: others

Implement UPDATE with result set

Add an UPDATE operation that returns a result set of the changed rows to the client.

UPDATE [LOW_PRIORITY] [IGNORE] tbl_name
    SET col_name1={expr1|DEFAULT} [, col_name2={expr2|DEFAULT}] ...
    [WHERE where_condition]
    [ORDER BY ...]
    [LIMIT row_count]
RETURNING select_expr [, select_expr ...]

I'm not exactly sure what the corresponding multiple-table syntax should look like, or if it is possible at all. But having it for single-table updates alone would already be a nice feature.

Details: MDEV-5092
Mentor: Igor Babaev

Automatic provisioning of slave

Idea

The purpose of this task is to create an easy-to-use facility for setting up a new MariaDB replication slave.

Setting up a new slave currently involves: 1) installing MariaDB with the initial database; 2) pointing the slave to the master with CHANGE MASTER TO; 3) copying the initial data from the master to the slave; and 4) starting the slave with START SLAVE. The idea is to automate step (3), which currently needs to be done manually.

The syntax could be something as simple as

LOAD DATA FROM MASTER

This would then connect to the master that is currently configured. It will load a snapshot of all the data on the master, and leave the slave position at the point of the snapshot, ready for START SLAVE to continue replication from that point.

Implementation:

The idea is to do this non-blocking on the master, in a way that works for any storage engine. It will rely on row-based replication being used between the master and the slave.

At the start of LOAD DATA FROM MASTER, the slave will enter a special provisioning mode. It will start replicating events from the master at the master's current position.

The master dump thread will send binlog events to the slave as normal. But in addition, it will interleave a dump of all the data on the master contained in tables, views, or stored functions. Whenever the dump thread would normally go to sleep waiting for more data to arrive in the binlog, it will instead send another chunk of data in the binlog stream for the slave to apply.

A "chunk of data" can be:

  • A CREATE OR REPLACE TABLE / VIEW / PROCEDURE / FUNCTION

  • A range of N rows (N=100, for example). Each successive chunk will do a range scan on the primary key from the end position of the last chunk.

Sending data in small chunks avoids the need for long-lived table locks or transactions that could adversely affect master performance.

The slave will connect in GTID mode. The master will send dumped chunks in a separate domain id, allowing the slave to process chunks in parallel with normal data.

During the provisioning, all normal replication events from the master will arrive on the slave, and the slave will attempt to apply them locally. Some of these events will fail to apply, since the affected table or row may not yet have been loaded. In the provisioning mode, all such errors will be silently ignored. Proper locking (isolation mode, e.g.) must be used on the master when fetching chunks, to ensure that updates for any row will always be applied correctly on the slave, either in a chunk, or in a later row event.

In order to make the first version of this feature feasible to implement in a reasonable amount of time, it should set a number of reasonable restrictions (which could be relaxed in a later version of the feature):

  • Give up with an error if the slave is not configured for GTID mode (MASTER_USE_GTID != NO).

  • Give up with error if the slave receives any event in statement-based binlogging (so the master must be running in row-based replication mode, and no DDL must be done while the provisioning is running).

  • Give up with an error if the master has a table without primary key.

  • Secondary indexes will be enabled during the provisioning; this means that tables with large secondary indexes could be expensive to provision.
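The chunked range-scan dump described above can be sketched as follows. This is a Python sketch; next_chunk is a hypothetical helper, and the in-memory table stands in for a primary-key-ordered scan.

```python
# Each chunk range-scans the primary key starting just after the last
# key sent, so no long-lived locks or transactions are needed.
def next_chunk(table, last_pk, n=100):
    """table: list of (pk, row) sorted by pk; returns (chunk, new_last_pk)."""
    chunk = [(pk, row) for pk, row in table
             if last_pk is None or pk > last_pk][:n]
    return chunk, (chunk[-1][0] if chunk else last_pk)

table = [(i, f"row{i}") for i in range(250)]
last_pk, chunks = None, []
while True:
    chunk, last_pk = next_chunk(table, last_pk, n=100)
    if not chunk:
        break
    chunks.append(chunk)   # interleaved into the binlog stream
# 250 rows are sent as chunks of 100, 100, 50
```
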

Details: MDEV-7502
Mentor: Andrei Elkin

connection encryption plugin support

As a follow-on to MDEV-4691 we would like GSSAPI encryption (in addition to authentication) support in MariaDB. I am told that the current plan is to create a plugin interface and then we can build GSSAPI encryption on top of that, so here is a ticket for that.

From having written GSSAPI for the internal interface, there were a couple things I would like to see in the plugin encryption interface.

First, GSSAPI is weird in that it does authentication before encryption (TLS/SSL are the other way around, establishing an encrypted channel and then doing authentication over it). Of course support for this is needed, but more importantly, packets must be processed in a fully serialized fashion. This is because encrypted packets may be queued while one end of the connection is still finishing up processing the authentication handshake. One way to do this is registering "handle" callbacks with connection-specific state, but there are definitely others.

Additionally, for whatever conception there ends up being of authentication and encryption, it needs to be possible to share more data than just a socket between them. The same context will be used for authentication and encryption, much as an SSL context is (except of course we go from authentication to encryption and not the other way around).

This ties into an issue of dependency. If authentication plugins are separate entities from encryption plugins in the final architecture, it might make sense to do mix-and-match authentication with encryption. However, there are cases - and GSSAPI is one - where doing encryption requires a certain kind of authentication (or vice versa). You can't do GSSAPI encryption without first doing GSSAPI authentication. (Whether or not it's permitted to do GSSAPI auth->encryption all over a TLS channel, for instance, is not something I'm concerned about.)

Finally, encrypted messages are larger than their non-encrypted counterparts. The transport layer should cope with this so that plugins don't have to think about reassembly, keeping in mind that there may not be a way to get the size of a message when encrypted without first encrypting it.

It's unfortunately been a little while since I wrote that code, but I think those were the main things that we'll need for GSSAPI. Thanks!

Details: MDEV-9090
Mentor:

Add RETURNING to INSERT

Please add a RETURNING option to INSERT. Example from PostgreSQL

postgres=# CREATE TABLE t1 (id SERIAL, name VARCHAR(100));
CREATE TABLE
postgres=# INSERT INTO t1(name) VALUES('test') RETURNING id;
 id 
----
  1
(1 row)

INSERT 0 1

This could make it easier to write statements which work with both MariaDB and PostgreSQL, and it might improve compatibility with Oracle RDBMS.

Details: MDEV-10014
Mentor: Oleksandr Byelkin

Aggregate Window Functions

With a few exceptions, most native aggregate functions are supported as window functions. In MDEV-7773, support for creating custom aggregate functions was added. This task proposes to extend that feature and allow custom aggregate functions to be used as window functions. An example of creating a custom aggregate function is given below:

CREATE AGGREGATE FUNCTION agg_sum(x INT) RETURNS DOUBLE
BEGIN
  DECLARE z DOUBLE DEFAULT 0;
  DECLARE CONTINUE HANDLER FOR NOT FOUND RETURN z;
  LOOP
    FETCH GROUP NEXT ROW;
    SET z = z + x;
  END LOOP;
END|

This function can be used in the following query:

CREATE TABLE balances (id INT, amount INT);
INSERT INTO balances VALUES (1, 10), (2, 20), (3, 30);
 
SELECT agg_sum(amount) FROM balances;

After this task is complete the following must also work:

SELECT agg_sum(amount) OVER (ORDER BY id);
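
What that window invocation should compute can be illustrated with a small simulation. This is Python; agg_sum here is a stand-in for the stored aggregate, re-evaluated over the growing default frame of ORDER BY id.

```python
# Rows of the balances table from the example: (id, amount).
rows = [(1, 10), (2, 20), (3, 30)]

def agg_sum(values):
    # Mirrors the stored aggregate's FETCH GROUP NEXT ROW loop.
    z = 0
    for x in values:
        z += x
    return z

# agg_sum(amount) OVER (ORDER BY id): aggregate over rows up to and
# including the current one, i.e. a running sum.
running = [agg_sum([amt for _, amt in rows[:i + 1]])
           for i in range(len(rows))]
# running is the per-row window result: 10, 30, 60
```
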

Details: MDEV-11263
Mentor: Varun Gupta

True ALTER LOCK=NONE on slave

Currently no true LOCK=NONE exists on the slave: ALTER TABLE is first committed on the master, then replicated on the slaves. The purpose of this task is to create a true LOCK=NONE.

Implementation Idea

The master will write a BEGIN_DDL_EVENT to the binlog when it reaches ha_prepare_inplace_alter_table. Then the master will write a QUERY_EVENT to the binlog with the actual ALTER query. On commit/rollback the master will write a COMMIT_DDL_EVENT/ROLLBACK_DDL_EVENT. On the slave there will be a pool of threads (a configurable global variable) which will apply these DDLs. On receiving a BEGIN_DDL_EVENT, the slave SQL thread will pass the QUERY_EVENT to one of the worker threads. The worker thread will execute until ha_inplace_alter_table; the actual commit_inplace_alter will be called by the SQL thread. If the SQL thread receives some kind of rollback event, it will signal the worker thread to stop executing the ALTER. If none of the worker threads are available, the event will be enqueued; if a rollback event then arrives, the event is simply discarded from the queue, and if a commit event arrives, the SQL thread will process the DDL event synchronously.

Details: MDEV-11675
Mentor: Sachin Setiya

Improve mysqltest language

mysqltest has a lot of historical problems:

  • ad hoc parser, weird limitations

  • commands added as needed with no view over the total language structure

  • historical code issues (e.g. casts that became unnecessary 10 years ago), etc.

A lot can be done to improve it. Ideas

  • control structures: else in if, break and continue in while, a for (or foreach) loop

  • proper expression support in let, if, etc

  • rich enough expressions to make resorting to sql unnecessary in most cases

  • remove unused and redundant commands (e.g. system vs exec, query_vertical vs vertical_results ONCE)

  • remove complex commands that do many sql statements under the hood, if they can be scripted, e.g. sync_slave_with_master

  • remove over-verbose treatment of rpl test failures

  • scoped variables

  • parameters for the source command

  • remove dead code

Details: MDEV-12130
Mentor: Sergei Golubchik

Implement multiple-table UPDATE/DELETE returning a result set

A multiple-table UPDATE first performs join operations, then it updates the matching rows. A multiple-table UPDATE returning a result set does the following:

  • first performs join operations

  • for each row of the result of the join it calculates some expressions over the columns of the join and forms from them a row of the returned result set

  • after this it updates the matching rows

A multiple-table DELETE first performs join operations, then it deletes the matching rows. A multiple-table DELETE returning a result set does the following:

  • first performs join operations

  • for each row of the result of the join it calculates some expressions over the columns of the join and forms from them a row of the returned result set

  • after this it deletes the matching rows
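The order of operations described above can be sketched with a small simulation. This is a Python sketch over hypothetical tables: join first, compute the returned expressions from the join result, and only then apply the updates.

```python
# Multi-table UPDATE ... RETURNING simulated over two toy tables.
t1 = {1: 10, 2: 20}            # pk -> value
t2 = {1: 5, 3: 7}

# Step 1: perform the join.
joined = [(k, t1[k], t2[k]) for k in sorted(t1) if k in t2]

# Step 2: compute the returned result set from the join rows.
returned = [(k, v1 + v2) for k, v1, v2 in joined]

# Step 3: update the matching rows afterwards.
for k, v1, v2 in joined:
    t1[k] = v1 + v2
```
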

Details: MDEV-12326
Mentor: Igor Babaev

sort out the compression library chaos

As MariaDB is getting more storage engines, and as they're getting more features, MariaDB can optionally use more and more compression libraries for various purposes. InnoDB, TokuDB, RocksDB — they can all use different sets of compression libraries. Compiling them all in would result in a lot of run-time/rpm/deb dependencies, most of which would never be used by most users. Not compiling them in would result in requests to compile them in. While most users don't use all of these libraries, many users use some of them. A solution could be to load these libraries on request, without creating a packaging dependency. There are different ways to do it:

  • hide all compression libraries behind a single unified compression API. Either develop our own or use something like Squash. This would require changing all engines to use this API

  • use the same approach as in server services: create a service per compression library, where the service implementation just returns an error code for any function invocation if the corresponding library is not installed. This way, perhaps, we could avoid modifying all affected storage engines
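The "service per library" fallback could be sketched as below. This is a Python sketch; the class, registry, and error code are made up for illustration, with zlib standing in for one really-loaded library.

```python
import zlib

ER_COMPRESSION_NOT_LOADED = 1    # hypothetical error code

class CompressionService:
    """Stub service: returns an error code when the library is absent."""
    def __init__(self, impl=None):
        self.impl = impl             # None until the library is installed

    def compress(self, data):
        if self.impl is None:
            return ER_COMPRESSION_NOT_LOADED, None
        return 0, self.impl(data)

services = {
    "zlib": CompressionService(zlib.compress),  # library present
    "lz4": CompressionService(),                # not installed: stub only
}

# An engine calling a missing library gets an error code, not a crash:
err, out = services["lz4"].compress(b"data")
# A present library works normally:
err2, out2 = services["zlib"].compress(b"data")
```
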

Details: MDEV-12933
Mentor: Sergei Golubchik

Control over memory allocated for SP/PS

SP/PS (Stored Procedures / Prepared Statements) allocate memory until the SP cache / the PS is destroyed. There is no way to see how much memory is allocated, or whether it grows with each execution (the first two executions can lead to new memory allocation, but later ones should not).

Task minimum:

Status variables which count the memory used/allocated for SP/PS by thread and/or for the server.

Other ideas:

  • Automatically stop allocation in the debug version after the second execution, and raise an exception on any further allocation attempt.

  • An Information Schema table listing threads and SP/PS with information about allocated and used memory

Information can be collected in the MEM_ROOTs of the SP/PS: store info about the status of the mem_root before execution, then check it afterwards so that newly allocated memory can be found. MEM_ROOT can be changed to have a debug mode which makes it read-only, to be switched on after the second execution.

Details: MDEV-14959
Mentor: Oleksandr Byelkin

MariaDB ColumnStore

Full DECIMAL support in ColumnStore

MariaDB ColumnStore supports DECIMAL with some limitations:

  1. We do not support the full DECIMAL range that is in MariaDB

  2. In several places in the code we convert the DECIMAL to DOUBLE during execution, therefore losing precision

Implementing this will likely require the following:

  • Implementation of methods to handle MariaDB's DECIMAL format

  • Support for a longer than 8-byte numeric column type (there is an InfiniDB tree with work for this already)

  • Modification of the primitives processor for the math

  • Modification of the function expression processor to handle the new type

  • Version upgrade support for DECIMAL from the current form to the new form
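The precision loss from the DECIMAL-to-DOUBLE conversion is easy to demonstrate. The Python sketch below uses Python's Decimal type as a stand-in for a wide DECIMAL value.

```python
from decimal import Decimal

# A double carries only ~15-16 significant decimal digits, so converting
# a wide DECIMAL to DOUBLE, as the current code path does, is lossy.
exact = Decimal("12345678901234567890.12")   # a wide DECIMAL value
as_double = float(exact)                     # what the conversion does
round_trip = Decimal(repr(as_double))        # no longer the same value

# Decimal arithmetic is exact where binary doubles are not:
decimal_sum = Decimal("0.1") + Decimal("0.2")
double_sum = 0.1 + 0.2
```
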

Details: MCOL-641
Mentor: Andrew Hutchings

mcsapi needs a new read API Design

We need an ORM-style NoSQL read API to go along with the bulk write API of mcsapi. This will likely take the form of:

  1. A reader in ExeMgr which will convert messages from mcsapi into jobs

  2. Code in mcsapi to send/receive the messages

Although ExeMgr can already receive messages with an execution plan, the format is very complex and the ABI breaks easily (we often send whole C++ objects). We should look at other ORM frameworks for inspiration for the API design. This task is to do the design for this API.

Details: MCOL-1151
Mentor: Andrew Hutchings

Use JIT for aggregation functions

ColumnStore uses a Volcano processing approach, working on one value at a time. This is a very inefficient way to serve analytics workloads, which usually use lots of aggregation functions in projections, filtering, or sorting. We are interested in using JIT compilation for the basic aggregation functions: sum, avg, count, min, max. The patch must compile and run a program that processes and returns the aggregation function result. We wrote this description with LLVM in mind, as it is widespread and has lots of examples in the wild. I suggest starting by looking at RowAggregation::addRowGroup() in ./utils/rowgroup/rowaggregation.cpp to see what it takes to get the avg() function result. Here is the link on how to quickly build a ColumnStore developer environment.

Details: MCOL-2222
Mentor: Roman

Engine independent statistics for Columnstore

ColumnStore currently has very rudimentary query optimization capabilities, and we want to improve the situation. We are considering using the Server's optimizer for this purpose, but the Server needs statistics, namely values distribution histograms and number-of-distinct-values (NDV) histograms. There are different levels of complexity for the task:

  • implement a standalone segment files reader that in the end populates both mysql.column_stats and mysql.table_stats using an out-of-band MariaDB client connection

  • implement ANALYZE TABLE functionality for the ColumnStore engine

  • implement ANALYZE TABLE and histograms with equal-width bins for values distribution histograms (similar to MDEV-12313), together with NDV histograms, to decrease I/O

We expect both unit and regression tests, but this is optional.

Details: MCOL-2223
Mentor: Roman

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2018

We participated in the Google Summer of Code 2018. The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, ODBC, Java) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. And we have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to start

Please join us at irc.freenode.net at #maria to mingle with the community. Don't forget to subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

A few handy tips for any interested students who are unsure which projects to choose: Blog post from a former GSoC student & mentor

To improve your chances of being accepted, it is a good idea to submit a pull request with a bug fix to the server.

List of beginner friendly bugs

List of tasks

Loaded from the MariaDB issue tracker

Full DECIMAL support in ColumnStore

MariaDB ColumnStore supports DECIMAL with some limitations:

  1. We do not support the full DECIMAL range that is in MariaDB

  2. In several places in the code we convert the DECIMAL to DOUBLE during execution therefore losing precision

Implementing this will likely require the following:

  • Implementation of methods to handle MariaDB's DECIMAL format

  • Support for a longer than 8-byte numeric column type (there is an InfiniDB tree with work for this already)

  • Modification of the primitives processor for the math

  • Modification of the function expression processor to handle the new type

  • Version upgrade support for DECIMAL from the current form to the new form

Details: MCOL-641
Mentor: Andrew Hutchings

mcsapi needs a new read API Design

We need an ORM-style NoSQL read API to go along with the bulk write API of mcsapi.

This will likely take the form of:

  1. A reader in ExeMgr which will convert messages from mcsapi into jobs

  2. Code in mcsapi to send/receive the messages

Although ExeMgr can already receive messages with an execution plan the format is very complex and ABI breaks easily (we often send whole C++ objects).

We should look at other ORM frameworks for inspiration as the API design.

This task is to do the design for this API.

Details: MCOL-1151
Mentor: Andrew Hutchings

Support for GTID in mysqlbinlog

The mysqlbinlog client program needs to be updated to support GTID.

Here is a suggested list of things to be done:

  • The --start-position and --stop-position options should be able to take GTID positions; or maybe there should be new --start-gtid and --stop-gtid options, like --start-gtid=0-1-100,1-2-200,2-1-1000.

  • A GTID position means the point just after that GTID. So starting from GTID 0-1-100 and stopping at GTID 0-1-200, the first GTID output will probably be 0-1-101 and the last one 0-1-200. Note that if some domain is not specified in the position, it means to start from the beginning, respectively to stop immediately, in that domain.

  • Starting and stopping GTID should work both with local files and with --read-from-remote-server. For the latter, there are a couple of extra things that need doing in the master-slave protocol; see get_master_version_and_clock() in sql/slave.cc.

  • At the end of the dump, put these statements, to reduce the risk of those session variables incorrectly spilling into subsequent statements run in the same session:

SET session.server_id = @@global.server_id,
      session.gtid_domain_id=@@global.gtid_domain_id;

Probably some more things will come up during the work, but this looks like a reasonable start.
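Parsing such a GTID position list is straightforward. The Python sketch below uses a hypothetical helper, parse_gtid_pos, to split "domain-server-sequence" triples into a per-domain map; domains not listed start from the beginning (respectively stop immediately).

```python
# Parse a GTID position list like "0-1-100,1-2-200,2-1-1000" into
# {domain_id: (server_id, seq_no)}.
def parse_gtid_pos(s):
    pos = {}
    for gtid in s.split(","):
        domain, server, seq = (int(p) for p in gtid.split("-"))
        pos[domain] = (server, seq)
    return pos

start = parse_gtid_pos("0-1-100,1-2-200,2-1-1000")
# domain 0 starts just after 0-1-100, domain 1 after 1-2-200, etc.
```
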

Details: MDEV-4989
Mentor: Andrei Elkin

Implement UPDATE with result set

Add an UPDATE operation that returns a result set of the changed rows to the client.

UPDATE [LOW_PRIORITY] [IGNORE] tbl_name
    SET col_name1={expr1|DEFAULT} [, col_name2={expr2|DEFAULT}] ...
    [WHERE where_condition]
    [ORDER BY ...]
    [LIMIT row_count]
RETURNING select_expr [, select_expr ...]

I'm not exactly sure what the corresponding multiple-table syntax should look like, or if it is possible at all. But having it for single-table updates alone would already be a nice feature.

Details: MDEV-5092
Mentor: Igor Babaev

optimizer trace

In MySQL, the optimizer trace is a JSON object recording the execution path through the optimizer: the decisions that were made and the reasons for them. See the optimizer trace documentation in the MySQL manual.

Users have been asking for MariaDB to have a similar feature.

Details: MDEV-6111
Mentor: Varun Gupta

Automatic provisioning of slave

Idea

The purpose of this task is to create an easy-to-use facility for setting up a new MariaDB replication slave.

Setting up a new slave currently involves: 1) installing MariaDB with the initial database; 2) pointing the slave to the master with CHANGE MASTER TO; 3) copying the initial data from the master to the slave; and 4) starting the slave with START SLAVE. The idea is to automate step (3), which currently needs to be done manually.

The syntax could be something as simple as

LOAD DATA FROM MASTER

This would then connect to the master that is currently configured. It will load a snapshot of all the data on the master, and leave the slave position at the point of the snapshot, ready for START SLAVE to continue replication from that point.

Implementation:

The idea is to do this non-blocking on the master, in a way that works for any storage engine. It will rely on row-based replication to be used between the master and the slave.

At the start of LOAD DATA FROM MASTER, the slave will enter a special provisioning mode. It will start replicating events from the master at the master's current position.

The master dump thread will send binlog events to the slave as normal. But in addition, it will interleave a dump of all the data on the master contained in tables, views, or stored functions. Whenever the dump thread would normally go to sleep waiting for more data to arrive in the binlog, the dump thread will instead send another chunk of data in the binlog stream for the slave to apply.

A "chunk of data" can be:

  • A CREATE OR REPLACE TABLE / VIEW / PROCEDURE / FUNCTION

  • A range of N rows (N=100, for example). Each successive chunk will do a range scan on the primary key from the end position of the last chunk.

Sending data in small chunks avoids the need for long-lived table locks or transactions that could adversely affect master performance.

The slave will connect in GTID mode. The master will send dumped chunks in a separate domain id, allowing the slave to process chunks in parallel with normal data.

During the provisioning, all normal replication events from the master will arrive on the slave, and the slave will attempt to apply them locally. Some of these events will fail to apply, since the affected table or row may not yet have been loaded. In the provisioning mode, all such errors will be silently ignored. Proper locking (isolation mode, eg.) must be used on the master when fetching chunks, to ensure that updates for any row will always be applied correctly on the slave, either in a chunk, or in a later row event.

In order to make the first version of this feature feasible to implement in a reasonable amount of time, it should set a number of reasonable restrictions (which could be relaxed in a later version of the feature):

  • Give up with an error if the slave is not configured for GTID mode (MASTER_USE_GTID != NO).

  • Give up with error if the slave receives any event in statement-based binlogging (so the master must be running in row-based replication mode, and no DDL must be done while the provisioning is running).

  • Give up with an error if the master has a table without primary key.

  • Secondary indexes will be enabled during the provisioning; this means that tables with large secondary indexes could be expensive to provision.

Details: MDEV-7502
Mentor: Andrei Elkin

connection encryption plugin support

As a follow-on to MDEV-4691 we would like GSSAPI encryption (in addition to authentication) support in MariaDB. I am told that the current plan is to create a plugin interface and then we can build GSSAPI encryption on top of that, so here is a ticket for that.

From having written GSSAPI for the internal interface, there were a couple things I would like to see in the plugin encryption interface.

First, GSSAPI is weird in that it does authentication before encryption (TLS/SSL are the other way around, establishing an encrypted channel and then doing authentication over it). Of course support for this is needed, but more importantly, packets must be processed in a fully serialized fashion. This is because encrypted packets may be queued while one end of the connection is still finishing up processing the authentication handshake. One way to do this is registering "handle" callbacks with connection-specific state, but there are definitely others.

Additionally, for whatever conception there ends up being of authentication and encryption, it needs to be possible to share more data than just a socket between them. The same context will be used for authentication and encryption, much as an SSL context is (except of course we go from authentication to encryption and not the other way around).

This ties into an issue of dependency. If authentication plugins are separate entities from encryption plugins in the final architecture, it might make sense to do mix-and-match authentication with encryption. However, there are cases - and GSSAPI is one - where doing encryption requires a certain kind of authentication (or vice versa). You can't do GSSAPI encryption without first doing GSSAPI authentication. (Whether or not it's permitted to do GSSAPI auth->encryption all over a TLS channel, for instance, is not something I'm concerned about.)

Finally, encrypted messages are larger than their non-encrypted counterparts. The transport layer should cope with this so that plugins don't have to think about reassembly, keeping in mind that there may not be a way to get the size of a message when encrypted without first encrypting it.

It's unfortunately been a little while since I wrote that code, but I think those were the main things that we'll need for GSSAPI. Thanks!

Details:
Mentor:

Details:

MDEV-9090

Mentor:

Vladislav Vaintroub

Aggregate Window Functions

Currently only a few aggregate functions are supported as window functions; the list can be found at aggregate-functions-as-window-functions.

In MDEV-7773, support for creating custom aggregate functions was added. This task would extend that feature so that custom aggregate functions can also be used as window functions.

An example of creating a custom aggregate function is given below:

CREATE AGGREGATE FUNCTION agg_sum(x INT) RETURNS DOUBLE
BEGIN
  DECLARE z DOUBLE DEFAULT 0;
  DECLARE CONTINUE HANDLER FOR NOT FOUND RETURN z;
  LOOP
    FETCH GROUP NEXT ROW;
    SET z = z + x;
  END LOOP;
END|
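
Once this task is complete, such a custom aggregate should be usable with an OVER clause like any built-in window function. A minimal sketch of the intended usage (the table t1 and its columns are made up for illustration; this syntax is the goal of the task, not an existing feature):

```sql
-- Hypothetical usage once custom aggregates work as window functions:
-- agg_sum computes a running total within each partition.
SELECT k, x,
       agg_sum(x) OVER (PARTITION BY k ORDER BY id) AS running_total
FROM t1;
```
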
Details: MDEV-11263
Mentor: Varun Gupta

True ALTER LOCK=NONE on slave

Currently no true LOCK=NONE exists on the slave: ALTER TABLE is first committed on the master and only then replicated to the slaves. The purpose of this task is to create a true LOCK=NONE.

Implementation Idea

The master will write a BEGIN_DDL_EVENT in the binlog when it hits ha_prepare_inplace_alter_table. Then the master will write a QUERY_EVENT in the binlog with the actual ALTER query. On commit/rollback the master will write a COMMIT_DDL_EVENT/ROLLBACK_DDL_EVENT.

On the slave there will be a pool of threads (a configurable global variable) which will apply these DDLs. On receiving a BEGIN_DDL_EVENT, the slave thread will pass the QUERY_EVENT to one of the worker threads. The worker thread will execute up to ha_inplace_alter_table. The actual commit_inplace_alter will be called by the SQL thread. If the SQL thread receives some kind of rollback event, it will signal the worker thread to stop executing the ALTER. If none of the worker threads is available, the event will be enqueued; if we then receive a rollback event, we simply discard the event from the queue, and if we receive a commit event, the SQL thread will process the DDL event synchronously.

Details: MDEV-11675
Mentor: Sachin Setiya

Improve mysqltest language

mysqltest has a lot of historical problems:

  • ad hoc parser, weird limitations

  • commands added as needed with no view over the total language structure

  • historical code issues (e.g. casts that became unnecessary 10 years ago), etc.

A lot can be done to improve it.

Ideas

  • control structures, else in if, break and continue in while, for (or foreach) loop

  • proper expression support in let, if, etc

  • rich enough expressions to make resorting to sql unnecessary in most cases

  • remove unused and redundant commands (e.g. system vs exec, query_vertical vs vertical_results ONCE)

  • remove complex commands that do many sql statements under the hood, if they can be scripted, e.g. sync_slave_with_master

  • remove over-verbose treatment of rpl test failures

  • scoped variables

  • parameters for the source command

  • remove dead code

Details: MDEV-12130
Mentor: Sergei Golubchik

Cassandra Storage Engine v2, based on DataStax C++ driver

MariaDB 10.1 had the Cassandra Storage Engine, which was developed for Cassandra 1.1.x. Back then, Cassandra provided a Thrift API, and that was what Cassandra-SE used.

Then, Cassandra 2.0 switched to using a different network protocol (and also changed the data model).

This task is to develop a Cassandra Storage Engine V2 using DataStax's C++ client library (cpp-driver).

See also: MDEV-8947 was a previous attempt to implement this engine. Unfortunately it didn't even produce a skeleton engine.

Details: MDEV-12296
Mentor: Sergei Golubchik

Histograms with equal-width bins in MariaDB

Histograms with equal-width bins are easy to construct using samples. For this it is enough to look through the given sample set and, for each value from it, figure out what bin this value can be placed in. Each bin requires only one counter. Let f be a column of a table with N rows and n be the number of samples by which the equal-width histogram of k bins for this column is constructed. Let, after looking through all sample rows, the counters created for the histogram bins contain the numbers c[1],..,c[k]. Then m[i] = c[i]/n * 100 is the percentage of the rows whose values of f are expected to be in the interval

(min(f) + (max(f)-min(f))/k * (i-1), min(f) + (max(f)-min(f))/k * i).

It means that if the sample rows have been chosen randomly, the expected number of rows with values of f from this interval can be approximated by m[i]/100 * N.

To collect such statistics it is suggested to use the following variant of the ANALYZE TABLE command:

ANALYZE FAST TABLE tbl [ WITH n ROWS ] [SAMPLING p PERCENTS ]
   PERSISTENT FOR COLUMNS (col1 [IN RANGE r] [WITH k INTERVALS],...)

Here:

  • 'WITH n ROWS' provides an estimate for the number of rows in the table in the case when this estimate cannot be obtained from statistical data.

  • 'SAMPLING p PERCENTS' provides the percentage of sample rows to collect statistics. If this is omitted the number is taken from the system variable samples_ratio.

  • 'IN RANGE r' sets the range of the equal-width bins of the histogram built for the column col1. If this is omitted and the min and max values for the column can be read from the statistical data, then the histogram is built for the range [min(col1), max(col1)]. Otherwise the range [MIN_type(col1), MAX_type(col1)] is considered. The values beyond the given range, if any, are also taken into account in two additional bins.

  • WITH k INTERVALS says how many bins are included in the histogram. If it is omitted this value is taken from the system variable histogram_size.
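
For illustration, a command following the suggested grammar above might look like this (the table, column, and all numbers are made up; the syntax is the proposal of this task, not an implemented feature):

```sql
-- Build a 10-bin equal-width histogram for t1.col1 from a 5% sample,
-- assuming an estimated 1,000,000 rows and an explicit value range.
ANALYZE FAST TABLE t1 WITH 1000000 ROWS SAMPLING 5 PERCENTS
   PERSISTENT FOR COLUMNS (col1 IN RANGE (0, 1000) WITH 10 INTERVALS);
```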

Details: MDEV-12313
Mentor: Vicentiu Ciorbaru

Implement multiple-table UPDATE/DELETE returning a result set

A multiple-table UPDATE first performs join operations, then it updates the matching rows. A multiple-table UPDATE returning a result set does the following:

  • first performs join operations

  • for each row of the result of the join it calculates some expressions over the columns of the join and forms from them a row of the returned result set

  • after this it updates the matching rows

A multiple-table DELETE first performs join operations, then it deletes the matching rows. A multiple-table DELETE returning a result set does the following:

  • first performs join operations

  • for each row of the result of the join it calculates some expressions over the columns of the join and forms from them a row of the returned result set

  • after this it deletes the matching rows
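
As a sketch of what such a statement might look like (hypothetical syntax for the multi-table case; the tables and columns are made up for illustration):

```sql
-- Hypothetical multi-table DELETE returning a result set:
-- one row is returned per deleted match, computed over the join.
DELETE t1 FROM t1 JOIN t2 ON t1.id = t2.t1_id
WHERE t2.expired = 1
RETURNING t1.id, CONCAT(t1.name, '/', t2.name);
```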

Details: MDEV-12326
Mentor: Igor Babaev

Blacklist for access control a.k.a. "negative grants"

Currently, the MariaDB privilege system only performs whitelist checks for access control to a given database, table, or column. This makes it difficult to block access to a certain database/table/column while allowing access to all others. A good solution would be to allow REVOKE of anything that a user is able to do: not only exactly those grants that were granted to the user, but also a subset of them. Like this:

GRANT SELECT ON some_database.* TO 'a_user'@'%';
REVOKE SELECT ON some_database.secret_table FROM 'a_user'@'%';
Details: MDEV-14443
Mentor: Vicentiu Ciorbaru

Add syntax to manually encrypt/decrypt InnoDB's system tablespace

Currently, the InnoDB system tablespace can only be automatically encrypted/decrypted by the background encryption threads if innodb_encrypt_tables=ON|FORCE, innodb_encryption_threads>0, and innodb_encryption_rotate_key_age>0. There is no way to manually encrypt/decrypt the tablespace.

File-per-table tablespaces can be manually encrypted with:

ALTER TABLE tab ENCRYPTION=YES;

File-per-table tablespaces can be manually decrypted with:

ALTER TABLE tab ENCRYPTION=NO;

Some users want a similar method that would allow them to manually encrypt/decrypt the InnoDB system tablespace.
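
One conceivable shape for such a command, sketched here purely for illustration (this is not an implemented or agreed-upon design; the statement and tablespace name are hypothetical), would mirror the file-per-table commands:

```sql
-- Hypothetical: manually encrypt or decrypt the InnoDB system tablespace.
ALTER TABLESPACE innodb_system ENCRYPTION=YES;
ALTER TABLESPACE innodb_system ENCRYPTION=NO;
```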

This is loosely related to MDEV-14571, since both issues are related to the fact that the system tablespace can only be encrypted/decrypted by the background threads.

Details: MDEV-14610
Mentor: Jan Lindström

Control over memory allocated for SP/PS

SPs/PSs (Stored Procedures / Prepared Statements) allocate memory until the PS cache or the SP is destroyed. There is no way to see how much memory is allocated and whether it grows with each execution (the first two executions can lead to new memory allocation, but later ones should not).

Task minimum:

Status variables which count the memory used/allocated for SP/PS, per thread and/or for the whole server.

Other ideas:

  • Automatically stop allocation in the debugging version after the second execution and raise an exception on any further allocation attempt.

  • Information Schema tables, by thread and SP/PS, with information about allocated and used memory

The information can be collected in the MEM_ROOTs of the SP/PS: store info about the status of the mem_root before execution, then check after execution whether newly allocated memory can be found.

MEM_ROOT can be changed to have a debug mode which makes it read-only; this mode can be switched on after the second execution.

Details: MDEV-14959
Mentor: Oleksandr Byelkin

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2017

We participated in the Google Summer of Code 2017 (we have participated previously in 2016, 2015, 2014, and 2013). The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently C, ODBC, Java) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. Lately, we also have MariaDB ColumnStore, which is a columnar storage engine, designed to process petabytes of data with real-time response to analytical queries.

Where to start

Please join us at irc.freenode.net at #maria to mingle with the community. Don't forget to subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

A few handy tips for any interested students who are unsure which projects to choose: Blog post from former GSoC student & mentor

List of tasks

The complete list of tasks suggested for GSoC 2017 is located in the MariaDB Jira. A subset is listed below.

Support for GTID in mysqlbinlog

The mysqlbinlog tool needs to be updated to understand the replication feature called Global Transaction IDs (GTIDs) in MariaDB 10. The current version does not support GTIDs and the MySQL variant does not speak MariaDB 10's GTIDs.

Details: MDEV-4989
Skills: C/C++
Mentor:
Students Interested: 2

Automatic provisioning of slave

The purpose of this task is to create an easy-to-use facility for setting up a new MariaDB replication slave.

Details: MDEV-7502
Skills: C/C++
Mentor:
Students Interested: 3

GIS enhancements

GIS enhancements for 10.1 that we want to work on include adding support for altitude (the third coordinate), converters (e.g. ST_GeomFromGeoJSON - ST_AsGeoJSON, ST_GeomFromKML - ST_AsKML, etc.), getting data from the SHP format (a shp2sql converter), as well as making sure we are fully OpenGIS compliant.

Details: MDEV-5813
Skills: C/C++
Mentor: Holyfoot
Students Interested: 1

mysqltest improvements

mysqltest is a client utility that runs tests in the mysql-test framework. It sends sql statements to the server, compares the results with the expected results, and uses a special small DSL for loops, assignments, and so on. It's pretty old and very ad hoc with many strange limitations. It badly needs a proper parser and a consistent logical grammar.

Details: MDEV-12130
Skills: C/C++
Mentor: Sergei Golubchik
Students Interested: 3

Allow multiple alternative authentication methods for the same user

Currently one can specify only one authentication method per user. It would make a lot of sense to support multiple authentication methods per user. PAM-style. For example, one may want to authenticate using unix_socket when connecting locally, but ask for a password if connecting remotely or if unix_socket authentication failed.
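
For illustration, the desired behavior might be expressed with syntax along these lines (hypothetical at the time of this task; the account name and the OR combinator are illustrative):

```sql
-- Hypothetical: try unix_socket authentication first,
-- and fall back to a password if it is unavailable or fails.
CREATE USER 'alice'@'localhost'
  IDENTIFIED VIA unix_socket
       OR mysql_native_password USING PASSWORD('secret');
```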

Details: MDEV-11340
Skills: C/C++
Mentor: Sergei Golubchik
Students Interested: 4

connection encryption plugins

Encrypting the client-server communications is closely related to authentication. Normally SSL is used for the on-the-wire encryption, and SSL can be used to authenticate the client too. GSSAPI can be used for authentication, and it has support for on-the-wire encryption. This task is about making on-the-wire encryption pluggable.

Details: MDEV-9090
Skills: C/C++
Mentor: Sergei Golubchik
Students Interested: 2

OSS-Fuzz configuration for MariaDB

This would involve randomizing a bunch of queries (RQG based?), configurations and replication setups to search for segfaults, race conditions and perhaps invalid results.

Details: MDEV-12178
Skills: C, SQL
Mentor: Daniel Black
Students Interested: 1

NUMA for MariaDB

This would involve organising a bunch of memory and threads to run on the same NUMA node. Attention to detail is needed to ensure no additional race conditions get added in the process. A good understanding of systems programming would be useful. The ability to implement Windows NUMA support at the same time as Linux NUMA support would be advantageous.

Details: MDEV-12213
Skills: C, locking and threads; Windows system programming and InnoDB internals would be a plus.
Mentor: Daniel Black
Students Interested: 1

Cassandra Storage Engine V2

The current Cassandra Storage Engine was developed against Cassandra 1.1 and uses the Thrift API to communicate with Cassandra. However, starting from Cassandra 1.2, the preferred way to access a Cassandra database is to use CQL (Cassandra Query Language) and the DataStax C++ Driver (cpp-driver). Thrift-based access is deprecated and places heavy constraints on the schema.

This task is about re-implementing Cassandra Storage Engine using DataStax C++ Driver and CQL.

Details: MDEV-12296
Skills: C/C++
Mentor: Sergei Petrunia
Students Interested: 1

ColumnStore: Add proper NULL support

At the moment, NULL is just the maximum integer value for a column (or the empty string for VARCHAR/CHAR). We need a mechanism to store NULLs separately, to give us full type ranges.

ColumnStore: Add full support for DECIMAL

Right now DECIMAL is cast to double, which is not great for obvious reasons. Supporting it properly will mean modifying a lot of ColumnStore's versions of MariaDB's function implementations and allowing column files to store more than 8 bytes per field.

Details: MCOL-641
Skills: C/C++
Mentor: Andrew Hutchings
Students Interested: 1

ColumnStore: Full UTF8 support.

This includes collations and anything that works on the length of the string.

Suggest a task

Do you have an idea of your own, not listed above or in Jira? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2016

We participated in the Google Summer of Code 2016 (we have participated previously in 2015, 2014, and 2013). The MariaDB Foundation believes we are making a better database that remains application compatible with MySQL. We also work on making LGPL connectors (currently in C, Java, C++ in development) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. Lately, we also have MariaDB MaxScale which is a pluggable database proxy.

Where to start

Please join us at irc.freenode.net at #maria to mingle with the community. Don't forget to subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

Please keep in mind that in April we travel a lot (conferences, busy time focusing on a release), so if you have a question and nobody on IRC answers, don't feel disappointed, ask in an email to maria-developers@lists.launchpad.net. Asking on the mailing list means others benefit from your Q&A too!

The complete list of tasks suggested for GSoC 2016 is located in the MariaDB Jira. A subset is listed below.

Support for GTID in mysqlbinlog

The mysqlbinlog tool needs to be updated to understand the replication feature called Global Transaction IDs (GTIDs) in MariaDB 10. The current version does not support GTIDs and the MySQL variant does not speak MariaDB 10's GTIDs.

Details: MDEV-4989
Skills: C/C++
Mentor: Kristian Nielsen
Students Interested: 2

Aggregate stored functions

With CREATE FUNCTION one can create functions in SQL, but this syntax doesn't allow one to create an aggregate function (like SUM, AVG, etc). This task is to add support for aggregate stored functions.

Details: MDEV-7773
Skills: C/C++
Mentor: Sergei Golubchik
Students Interested: 6

GIS enhancements

GIS enhancements for 10.1 that we want to work on include adding support for altitude (the third coordinate), converters (e.g. ST_GeomFromGeoJSON - ST_AsGeoJSON, ST_GeomFromKML - ST_AsKML, etc.), getting data from the SHP format (a shp2sql converter), as well as making sure we are fully OpenGIS compliant.

Details: MDEV-5813
Skills: C/C++
Mentor: Holyfoot
Students Interested: 2

Indexes for BLOBs (in MyISAM and Aria)

MyISAM and Aria support special kinds of indexes that only store the hash of the data in the index tree. When two hashes match in the index, the engine compares actual row data to find whether the rows are identical. This is used in internal temporary tables that the optimizer creates to resolve SELECT DISTINCT queries. Normal unique indexes cannot always be used here, because the select list can be very long or include very long strings.

This task is to provide a direct SQL interface to this feature and to allow users to create these indexes explicitly. This way we can have unique constraints for blobs and very long strings.
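
A sketch of the intended SQL interface (hypothetical at the time of this task; the engine would store only a hash of b in the index and compare full row values on hash collisions):

```sql
-- Hypothetical: a unique constraint on a BLOB backed by a hash index.
CREATE TABLE t1 (
  id INT PRIMARY KEY,
  b  BLOB,
  UNIQUE KEY (b) USING HASH
) ENGINE=MyISAM;
```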

Details: MDEV-371
Skills: C/C++
Mentor: Sergei Golubchik
Students Interested: 3

Provide GTID support for MHA

MySQL Master HA (MHA) is a tool to assist with automating master failover and slave promotion within short downtime, without suffering from replication consistency problems, and without performance penalty. We would like to have this tool support MariaDB 10 GTIDs.

Skills: Perl
Mentor: Colin Charles
Students Interested: 2

Import and export popular data formats from and to dynamic columns

Provide import and export functions for popular data formats like JSON, XML (limited), PHP, ... for Connector/C and MariaDB Server (which use the same code base for dynamic columns).

Details: CONC-125
Skills: C
Mentor: Oleksandr Byelkin, Georg Richter
Students Interested: 2

MaxScale filter to capture incoming operations for consumption in external sources

Design a filter that will capture incoming inserts, updates, and deletes for specified tables (given as a regex) in a separate log file that is consumable in JSON or CSV form, so that external ETL processes can process it for data uploading into a DWH or big data platform. Optionally, a plugin that feeds this log into a Kafka broker, which can put this data on a Hadoop node, can be developed as a next step.

Details: MXS-2
Skills: C/C++
Mentor: Markus Makela
Students Interested: 2

MaxScale filter to translate Microsoft SQL Server syntax

Develop a MaxScale filter that will translate SQL Server syntax to MariaDB syntax. Develop a SQL Server client protocol plugin.

Details: MXS-1
Skills: C/C++
Mentor: Markus Makela & Massimiliano Pinto
Students Interested: 1

Additional libraries for MaxScale's experimental Luafilter

Create additional entry points into MaxScale that the Lua side scripts can use. Various types of functions can be added ranging from SQL processing functions to utility functions which communicate with MaxScale.

Details: MXS-593
Skills: C/C++
Mentor: Markus Makela
Students Interested: 2

Query Injection Filter

Create a filter which can inject queries before the client executes any queries. This filter could be used for various purposes, for example auditing.

Details: MXS-591
Skills: C/C++
Mentor: Markus Makela
Students Interested: 1

Cassandra Storage Engine V2

The current Cassandra Storage Engine was developed against Cassandra 1.1 and uses the Thrift API to communicate with Cassandra. However, starting from Cassandra 1.2, the preferred way to access a Cassandra database is to use CQL (Cassandra Query Language) and the DataStax C++ Driver (cpp-driver). Thrift-based access is deprecated and places heavy constraints on the schema.

This task is about re-implementing Cassandra Storage Engine using DataStax C++ Driver and CQL.

Details: MDEV-8947
Skills: C/C++
Mentor: Sergei Petrunia
Students Interested: 3

NO PAD collations

Currently MariaDB ignores trailing spaces when comparing values of the CHAR, VARCHAR, TEXT data types. In some cases it would be nice to take trailing spaces into account. This task will introduce a set of new collations that will make this possible.
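
With today's PAD SPACE collations, trailing spaces are ignored in comparisons; a NO PAD collation would change that. A sketch of the difference (the collation name utf8_nopad_bin is illustrative of what this task would add):

```sql
-- With a PAD SPACE collation, trailing spaces are ignored,
-- so this comparison is true:
SELECT 'abc' = 'abc   ';
-- With a NO PAD collation (name illustrative), trailing spaces
-- would count, and the same comparison would be false:
SELECT 'abc' = 'abc   ' COLLATE utf8_nopad_bin;
```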

Details: MDEV-9711
Skills: C/C++
Mentor: Alexander Barkov
Students Interested: 2

Suggest a task

Are you a student interested in working on something? Let us know here.

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2015

We participated in the Google Summer of Code 2015. MariaDB and the MariaDB Foundation believe we are making a better database that remains a drop-in replacement to MySQL. We also work on making LGPL connectors (currently in C, Java, C++ in development) and on MariaDB Galera Cluster, which allows you to scale your reads & writes. Lately, we also have MariaDB MaxScale which is a pluggable database proxy.

Where to start

Please join us at irc.freenode.net at #maria to mingle with the community. Don't forget to subscribe to maria-developers@lists.launchpad.net (this is the main list where we discuss development).

Please keep in mind that in April we travel a lot (conferences, busy time focusing on a release), so if you have a question and nobody on IRC answers — do not feel disappointed, ask in an email to maria-developers@lists.launchpad.net. Asking on the mailing list means others benefit from your Q&A too!

At the moment, tasks that may be suitable for GSoC 2015 are listed in the MariaDB Issue Tracker under «GSoC 2015 tasks»

Some suggested projects

Enhancing mysqlbinlog

This project consists of two parts -- it can either be performed by 2 students or 1 student with the relevant skills:

Support for GTID in mysqlbinlog

The mysqlbinlog tool needs to be updated to understand the replication feature called Global Transaction IDs (GTIDs) in MariaDB 10. The current version does not support GTIDs and the MySQL variant does not speak MariaDB 10's GTIDs.

Details: MDEV-4989
Skills: C/C++
Mentor: Kristian Nielsen

Remote backup of binary logs

mysqlbinlog in MySQL 5.6 also supports streaming binlog servers for backups. This is important as the MHA tool can also use this feature.

Details: MDEV-5993
Skills: C/C++
Mentor: Sergey Vojtovich

Indexes on virtual columns

We have the concept of virtual (non-materialized) columns, and currently to have an index on a virtual column one has to materialize it. To support indexes on fully virtual columns, a storage engine must call back into the server to calculate the value of the virtual column.

Details: MDEV-5800
Skills: C/C++
Mentor: Sergei Golubchik

Table functions

User functions (UDF-like) that return a table, rows and columns. It should be possible to use it in other statements. A possible implementation could be: the function exports a generator, we create a handler of the hidden "storage engine" class, no indexes, and convert this generator to rnd_init/rnd_next. Need to disable rnd_pos somehow. Alternatively, it can materialize the result set in a temporary table (like information_schema does), then this table can be used normally.

Details: MDEV-5199
Skills: C/C++
Mentor: Sergei Golubchik

Aggregate stored functions

With CREATE FUNCTION one can create functions in SQL, but this syntax doesn't allow one to create an aggregate function (like SUM, AVG, etc). This task is to add support for aggregate stored functions.

Details: MDEV-7773
Skills: C/C++
Mentor: Sergei Golubchik

GIS enhancements

GIS enhancements for 10.1 that we want to work on include adding support for altitude (the third coordinate), converters (e.g. ST_GeomFromGeoJSON - ST_AsGeoJSON, ST_GeomFromKML - ST_AsKML, etc.), getting data from the SHP format (a shp2sql converter), as well as making sure we are fully OpenGIS compliant.

Details: MDEV-5813
Skills: C/C++
Mentor: Holyfoot

Port InnoDB memcached interface to MariaDB

MySQL 5.6 has a memcached plugin to InnoDB. MySQL 5.7 has improved performance of this. The task would be to port this to run against MariaDB, and make it work against XtraDB/InnoDB for the 10.2 series of MariaDB.

Details: MDEV-4674
Skills: C/C++
Mentor: Colin Charles

Automatic provisioning of slave

The purpose of this task is to create an easy-to-use facility for setting up a new MariaDB replication slave.

Details: MDEV-7502
Skills: C/C++
Mentor: Kristian Nielsen

Indexes for BLOBs (in MyISAM and Aria)

MyISAM and Aria support special kinds of indexes that only store the hash of the data in the index tree. When two hashes match in the index, the engine compares actual row data to find whether the rows are identical. This is used in internal temporary tables that the optimizer creates to resolve SELECT DISTINCT queries. Normal unique indexes cannot always be used here, because the select list can be very long or include very long strings.

This task is to provide a direct SQL interface to this feature and to allow users to create these indexes explicitly. This way we can have unique constraints for blobs and very long strings.

Details: MDEV-371
Skills: C/C++
Mentor: Sergei Golubchik

Improved temporary tables

It is a well-known and very old MySQL/MariaDB limitation that a temporary table can only be used once in any query; for example, one cannot join a temporary table to itself. This task is about removing this limitation.
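
The limitation is easy to demonstrate (the table is made up for illustration; the error is ER_CANT_REOPEN_TABLE):

```sql
CREATE TEMPORARY TABLE tmp (id INT);
INSERT INTO tmp VALUES (1), (2);
-- Currently fails with ERROR 1137 (HY000): Can't reopen table: 'a'
SELECT * FROM tmp AS a JOIN tmp AS b ON a.id = b.id;
```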

Details: MDEV-5535
Skills: C/C++
Mentor: Sergei Golubchik

Provide GTID support for MHA

MySQL Master HA (MHA) is a tool to assist with automating master failover and slave promotion within short downtime, without suffering from replication consistency problems, and without performance penalty. We would like to have this tool support MariaDB 10 GTIDs.

Skills: Perl
Mentor: Colin Charles

Import and export popular data formats from and to dynamic columns

Provide import and export functions for popular data formats like JSON, XML (limited), PHP, ... for Connector/C and MariaDB Server (which use the same code base for dynamic columns).

Details: CONC-125
Skills: C
Mentor: Oleksandr Byelkin, Georg Richter

MaxScale filter to capture incoming operations for consumption in external sources

Design a filter that will capture incoming inserts, updates, and deletes for specified tables (given as a regex) in a separate log file that is consumable in JSON or CSV form, so that external ETL processes can process it for data uploading into a DWH or big data platform. Optionally, a plugin that feeds this log into a Kafka broker, which can put this data on a Hadoop node, can be developed as a next step.

Details: MXS-2
Skills: C/C++
Mentor: Markus Makela

MaxScale filter to translate Microsoft SQL Server syntax

Develop a MaxScale filter that will translate SQL Server syntax to MariaDB syntax. Develop a SQL Server client protocol plugin.

Details: MXS-1
Skills: C/C++
Mentor: Markus Makela & Massimiliano Pinto

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2014

We participated in Google Summer of Code 2014. MariaDB and the MariaDB Foundation believe we are making a better database that remains a drop-in replacement to MySQL. We also work on making LGPL connectors (currently in C, Java, C++ in development) and on MariaDB Galera Cluster, which allows you to scale your reads & writes.

Where to start

Please join us at irc.freenode.net at #maria to mingle with the community. Or subscribe to maria-developers@lists.launchpad.net. Or both.

Please keep in mind that in April we travel a lot (conferences, busy time), so if you have a question and nobody on IRC answers — do not feel disappointed, ask in an email to maria-developers@lists.launchpad.net. Asking on the mailing list means others benefit from your Q&A too!

LDAP authentication plugin

We would like the authentication system to be able to authenticate against an LDAP Directory Server.

See pluggable authentication.

Skills: C, working knowledge of LDAP

Mentor: Sergei Golubchik

Self-Tuning Optimizer

this project is taken

One of the reasons for bad query plans is inadequate cost estimation of individual operations. A cost of reading a row in one engine might be a lot higher than in some other, but the optimizer cannot know it. Also, it uses hard-coded constants, assuming, for example, that evaluating a WHERE clause is 5 times cheaper than reading a row from a table.

Obviously, some kind of calibration procedure is needed to get these cost estimates to be relatively correct. It is not easy, because the estimates depend on the actual hardware where MariaDB is run (the cost of a row read is different on HDD and SSD), and also — somewhat — on the application.

A simple and low-maintenance solution would be to use self-tuning cost coefficients. They measure the timing and adjust automatically to the configuration where MariaDB is run.

See MDEV-350.

Skills: C/C++

Mentor: Sergei Golubchik

Port InnoDB memcached interface to MariaDB

MySQL 5.6 has a memcached plugin to InnoDB. MySQL 5.7 has improved performance of this. The task would be to port this to run against MariaDB, and make it work against XtraDB/InnoDB for the 10.1 series of MariaDB.

See MDEV-4674 for more.

Skills: C/C++

Mentor: Colin Charles

GIS enhancements to MariaDB

GIS enhancements for 10.1 that we want to work on include adding support for altitude (the third coordinate), as well as making sure we are fully OpenGIS compliant. MDEV-5813

Skills: C

Mentor: Holyfoot

User defined events

User-defined events are supported by several other databases, with differing forms and semantics. Events are used to signal a named event in the database. Applications can use named events instead of polling, which consumes more resources.

See MDEV-5532 for more.

Skills: C/C++

Mentor: Jan Lindstrom, Sergei Golubchik

Indexes for BLOBs (in MyISAM and Aria)

MyISAM and Aria support special kinds of indexes that only store the hash of the data in the index tree. When two hashes match in the index, the engine compares actual row data to find whether the rows are identical. This is used in internal temporary tables that the optimizer creates to resolve SELECT DISTINCT queries. Normal unique indexes cannot always be used here, because the select list can be very long or include very long strings.

This task is to provide a direct SQL interface to this feature and to allow users to create these indexes explicitly. This way we can have unique constraints for blobs and very long strings.
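The matching logic described above can be sketched in a few lines; the hash function (cksum) and the values here are illustrative stand-ins, not the engine's actual hash:

```shell
# Sketch: the index stores only a hash; when two hashes match, the
# actual values are compared to rule out a hash collision.
a="some very long string"
b="some very long string"
ha=$(printf '%s' "$a" | cksum | cut -d' ' -f1)
hb=$(printf '%s' "$b" | cksum | cut -d' ' -f1)
if [ "$ha" = "$hb" ] && [ "$a" = "$b" ]; then
  result="duplicate"
else
  result="distinct"
fi
echo "$result"
```

Only the cheap hash comparison happens inside the index tree; the expensive full comparison runs only on a hash match.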

Skills: C++

Mentor: Sergei Golubchik

CREATE OR REPLACE, CREATE IF NOT EXISTS, and DROP IF EXISTS

this project is taken

This task is to add support for OR REPLACE and IF EXISTS / IF NOT EXISTS to all CREATE and DROP variants for all objects (where it makes sense). MDEV-5359

Skills: C++

Mentor: Sergei Golubchik

Statistically optimize mysql-test runs by running less tests

this project is taken

This is a research task, not a coding task. See MDEV-5776.

Skills: SQL, Perl/Python or other language of your choice, mathematical statistics

Mentor: Elena Stepanova, Sergei Golubchik

Improved temporary tables

It is a well-known and very old MySQL/MariaDB limitation that temporary tables can only be used once in any query; for example, one cannot join a temporary table to itself. This task is about removing this limitation. MDEV-5535

Skills: C++

Mentor: Sergei Golubchik

Table UDFs

Implement a new plugin type that adds support for table UDFs — loadable User-Defined Functions that return tables, not scalar values.

Skills: C++

Mentor: Sergei Golubchik

GTID support in mysqlbinlog

The mysqlbinlog tool needs to understand global transaction ids. In particular, it should be possible to start and end the dump at the specified GTID. Both when reading binlog files and when connecting to a running server. See MDEV-4989.

If time permits, other client programs could be extended similarly, like mysqldump --master-data or the --sync-with-master command in mysqltest.

Skills: C++

Mentor:

See Also

  • See also the «GSoC 2014 tasks» list in Jira.

This page is licensed: CC BY-SA / Gnu FDL

Google Summer of Code 2013

We participated in Google Summer of Code 2013. MariaDB and the MariaDB Foundation believe we are making a better database that remains a drop-in replacement for MySQL. We also work on making LGPL connectors (currently in C and Java, with C++ in development) and on MariaDB Galera Cluster, which allows you to scale your reads & writes.

Where to start

Please join us at irc.freenode.net at #maria to mingle with the community. Or subscribe to maria-developers@lists.launchpad.net. Or both.

Please keep in mind that in April we travel a lot (conferences, busy time), so if you have a question and nobody on IRC answers — do not feel disappointed, ask in an email to maria-developers@lists.launchpad.net.

LDAP authentication plugin

We would like the authentication system to be able to authenticate against an LDAP Directory Server.

See pluggable authentication.

Skills: C, working knowledge of LDAP

Mentor: Sergei Golubchik

Kerberos authentication plugin

this project is taken

Kerberos is a security mechanism used in a lot of financial institutions. A MySQL plugin that allows authentication against Kerberos is the goal here.

See pluggable authentication.

Skills: C/C++, working knowledge of Kerberos

Mentor: Sergei Golubchik

Active Directory authentication plugin

The Microsoft Windows world is all about Active Directory, and upstream MySQL Enterprise already has this feature (though it's a paid offering). It would be great to have an open source equivalent.

See pluggable authentication.

Skills: C/C++, working knowledge of Active Directory/SAMBA, Windows-based development environment

Mentor: Sergei Golubchik, Vladislav Vaintroub

Keystone authentication plugin

Keystone is the OpenStack Identity Service. The idea would be to ensure that MariaDB can authenticate to Keystone directly.

Skills: Python, C/C++

Mentor: Mark Riddoch

Regex enhancements

this project is taken

MySQL and MariaDB use an old regex library that works bytewise and thus only supports single-byte character sets. It needs to be replaced by a modern, multi-byte character-set-aware regex library.

Additionally, a much-requested REGEX_REPLACE function should be implemented. (See also mysql-udf-regexp for some UDF code that could be used as a starting point for this.)

Detailed task description: MDEV-4425

Skills: C/C++

Mentor: Alexander Barkov

Self-Tuning Optimizer

One of the reasons for bad query plans is inadequate cost estimation of individual operations. A cost of reading a row in one engine might be a lot higher than in some other, but the optimizer cannot know it. Also, it uses hard-coded constants, assuming, for example, that evaluating a WHERE clause is 5 times cheaper than reading a row from a table.

Obviously, some kind of calibration procedure is needed to get these cost estimates to be relatively correct. It is not easy, because the estimates depend on the actual hardware where MariaDB is run (a cost of a row read is different on HD and SSD), and also — somewhat — on the application.

A simple and low-maintenance solution would be to use self-tuning cost coefficients. They measure the timing and adjust automatically to the configuration where MariaDB is run.

See MDEV-350.

Skills: C/C++

Mentor: Sergei Golubchik

Roles

this project is taken

Roles, close to SQL:2003 standard. See MDEV-4397.

Skills: C/C++

Mentor: Sergei Golubchik

Potential list

suggested development

This page is licensed: CC BY-SA / Gnu FDL

Google Season of Docs


MariaDB applied to participate in the first Google Season of Docs.

Google Season of Docs 2019


MariaDB applied to participate in the first Google Season of Docs. Unfortunately, as this was a pilot program, Google was only able to accept a limited number of applications, and we were unsuccessful.

Where to Start

Please join us on Zulip to mingle with the community. You can also subscribe to maria-docs@lists.launchpad.net, the documentation mailing list.

List of beginner friendly bugs

List of Tasks

Stored Procedures Documentation

The Stored Procedures capabilities of MariaDB Server are critical to producing large-scale applications. The current documentation lacks sufficient examples, and the examples warrant testing. Getting Started content would make this easier to adopt. MariaDB's implementation is based on ISO SQL/PSM.

Getting Started with Connector/C and Connector/J

MariaDB Connector/C and MariaDB Connector/J provide the ability to access a MariaDB Server from applications built in C/C++ and Java, respectively. The current documentation for these connectors lacks Getting Started guides.

Spider Documentation

The Spider Storage Engine uses partitioning to provide data sharding through multiple servers. This task involves greatly expanding the existing documentation, including more detail about when to use Spider, basic usage, updating the Spider Feature Matrix, detailed examples of the effects of the server system variables and table system variables, as well as the Spider functions. You will also ensure changes in the most recent Spider releases are properly documented.

Mroonga Documentation

Mroonga is a full-text search storage engine based on Groonga, an open-source, CJK-ready full-text search engine built on a column store. This project involves greatly expanding the existing MariaDB documentation on the use of this storage engine, including a detailed tutorial and user guide with examples of the various Mroonga user-defined functions, the system variables and the effects of changing their settings, as well as the parser and parser settings.

Translation

You will choose a major language and ensure that a substantial subsection of the documentation is translated into that language. See translations.

Migration Documentation

You will choose a major relational DBMS, and, focusing on the most recent stable releases, document the process to migrate to MariaDB, including MariaDB equivalents to features in that system, and a detailed list of features that exist in one but not the other, as well as possible workarounds. For an example, see the documentation on migrating from MySQL to MariaDB (bearing in mind that MariaDB is a MySQL fork, and is substantially more similar to MySQL than to other systems).

Additional tasks are loaded from the MariaDB issue tracker: [Jira](https://jira.mariadb.org/issues/?jql=labels=gsdocs19 and project=mdev order by key).

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Google Season of Docs 2020

MariaDB applied to participate in the 2020 Google Season of Docs, but was unsuccessful.

Where to Start

Please join us on Zulip to mingle with the community. You can also subscribe to maria-docs@lists.launchpad.net, the documentation mailing list.

List of Tasks

Migration Documentation

You will choose a major relational DBMS, and, focusing on the most recent stable releases, document the process to migrate to MariaDB, including MariaDB equivalents to features in that system, and a detailed list of features that exist in one but not the other, as well as possible workarounds. For an example, see the work-in-progress Migrating from SQL Server to MariaDB as well as the documentation on migrating from MySQL to MariaDB (bearing in mind that MariaDB is a MySQL fork, and is substantially more similar to MySQL than to other systems).

Stored Procedures Documentation

The Stored Procedures capabilities of MariaDB Server are critical to producing large-scale applications. The current documentation lacks sufficient examples, and the examples warrant testing. Getting Started content would make this easier to adopt. MariaDB's implementation is based on ISO SQL/PSM.

Getting Started with Connector/C and Connector/J

MariaDB Connector/C and MariaDB Connector/J provide the ability to access a MariaDB Server from applications built in C/C++ and Java, respectively. The current documentation for these connectors lacks Getting Started guides.

Spider Documentation

The Spider Storage Engine uses partitioning to provide data sharding through multiple servers. This task involves greatly expanding the existing documentation, including more detail about when to use Spider, basic usage tutorials, updating the Spider Feature Matrix, detailed examples of the effects of the server system variables and table system variables, as well as the Spider functions. You will also ensure changes in the most recent Spider releases are properly documented.

Mroonga Documentation

Mroonga is a full-text search storage engine based on Groonga, an open-source, CJK-ready full-text search engine built on a column store. This project involves greatly expanding the existing MariaDB documentation on the use of this storage engine, including a detailed tutorial and user guide with examples of the various Mroonga user-defined functions, the system variables and the effects of changing their settings, as well as the parser and parser settings.

Translation

You will choose a major language and ensure that a substantial subsection of the documentation is translated into that language. See translations.

Additional tasks are loaded from the MariaDB issue tracker: [Jira](https://jira.mariadb.org/issues/?jql=labels=gsdocs19 and project=mdev order by key).

Suggest a Task

Do you have an idea of your own, not listed above? Do let us know!

This page is licensed: CC BY-SA / Gnu FDL

Product Development

Articles of interest to MariaDB developers, including contributors from the MariaDB Community

MariaDB Quality Development Rules

These are quality-improving rules that everyone with write access to the MariaDB repository is expected to follow:

  • Respect previews

    • A feature can be pushed into an RC release X.Y.1 only after it was in an earlier preview. Normally, in X.Y.0, but generally any earlier preview will do.

  • Tester sign-off for all new features

    • A feature being in the preview is a necessary but not sufficient condition. It needs to be tested (by a dedicated tester, not a developer), and the tester has to say it's good enough.

    • Testing might discover bugs; that's normal. They have to be fixed before the feature is pushed (or, at the tester's discretion, after the push, if they're minor).

    • For the tester's sake, develop in a dedicated branch with the version and the issue number in the branch name, for example, bb-11.1-MDEV-11111, and let the release master know when it's ready, so that they can cherry-pick it into a preview.

  • Features must not be pushed directly into the GA release bypassing the above

    • Keep an eye on the release schedule (jira.mariadb.org) to know when the next release is due

    • Or simply remember that previews happen in mid-March, mid-June, mid-September, and mid-December, and innovation releases in early February, early May, early August, and early November; see mariadb-release-model.

  • Get your commit reviewed.

    • If you don't know who would be the best reviewer for your PR, assign whoever GitHub suggests; this person can reassign if GitHub was wrong.

  • Don't push into a branch that is red in Buildbot

    • Fix failures first (or make sure they're fixed)

    • Eventually, Buildbot will evolve to simply not let you

  • Blocker issues block a release

    • We don't release if there's a Blocker bug open; that's why they're called blockers.

    • So fix them ASAP, as your first priority; you don't want all the users to wait specifically for you.

  • There's no penalty for breaking these rules; we hope that everyone wants quality, bug-free releases anyway.

    • But if there are serial violators, some kind of penalty may be introduced later.

This page is licensed: CC BY-SA / Gnu FDL

Uploading Package to PPA

After creating a Launchpad account:

Docker build, cloning the MariaDB repository and mapping it to the docker container

  1. mkdir mariadb-source

  2. cd mariadb-source

  3. vi Dockerfile

  4. Copy the following contents to Dockerfile:

# MariaDB 10.3 Ubuntu 17.10 build environment
# Published as mariadb-10-3-ubuntu-17.10-build-env
FROM ubuntu:17.10
RUN DEBIAN_FRONTEND=noninteractive apt-get update && \
apt-get install -y --no-install-recommends \
systemd \
devscripts \
build-essential \
lsb-release \
equivs \
git \
curl \
git-buildpackage \
nano \
vim \
pristine-tar

RUN curl -skO https://raw.githubusercontent.com/ottok/mariadb-10.1/ubuntu-17.10/debian/control

ENV GIT_SSL_NO_VERIFY true

RUN mk-build-deps -t 'apt-get -y -o Debug::pkgProblemResolver=yes --no-install-recommends' -i control

ENV container docker
ENV DEBIAN_FRONTEND noninteractive
ENV LANG C.UTF-8
ENV LC_ALL C.UTF-8

  1. Run docker build . -t ubuntu-17.10-packaging

  2. Do git clone of the latest repository:

cd && mkdir debian-packaging && cd debian-packaging && git clone https://salsa.debian.org/mariadb-team/mariadb-10.1.git

Generate, publish and upload PGP key

  1. Generate OpenPGP key with the following command:

$ gpg --gen-key

  • select (1) RSA and RSA

  • Keysize: accept 2048

  • Valid for: accept 0

  • Type name, email and comment (comment is optional)

  • Type 0

  • Type passphrase (twice)

  • Follow instructions to help generate a random key

  • Keep the Key ID and fingerprint text, they are needed in the next step

Set generated key as default in ~/.bashrc

$ nano ~/.bashrc

[.. add this ..]

export GPGKEY=<key_id>

Restart the GPG agent and source ~/.bashrc, or restart your session.

  1. Publish the key to the key server:

Run gpg --keyserver keyserver.ubuntu.com --send-keys 12345678, substituting 12345678 with your key's ID.

  • If this gives timeout error, keep re-trying after a while

  1. Upload the key's fingerprint on your Launchpad profile page (+editpgpkeys).

  2. gpg --export [your-key-id] > ~/debian-packaging/pub.key

  3. gpg --export-secret-key [your-key-id] > ~/debian-packaging/secret.key

gpg -k

The command above should list the key.

  1. How to upload: see How-to-upload-to-Launchpad-PPA-repository-(.deb-packages).

  2. Open /etc/devscripts.conf

And look for this line:

DEBSIGN_MAINT

Uncomment it and add your name there, then set your email address:

export DEBEMAIL=[your-email-id]

From inside the container

  1. docker run -v ~/debian-packaging/:/repo -it ubuntu-17.10-packaging bash

  2. apt-get install devscripts

  3. gpg --import pub.key

  4. gpg --import secret.key

  5. gpg -k

  6. cd /repo/mariadb-10.1 && git fetch && git checkout pristine-tar && git checkout ubuntu-17.10

  7. git clean -dffx && git reset --hard HEAD

  8. export DEB_BUILD_OPTIONS="parallel=10 nocheck" or export DEB_BUILD_OPTIONS="parallel=5 nocheck"

  9. Go to /repo folder (Inside docker) and delete all the files except mariadb-10.1 folder:

rm *

gbp buildpackage
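As an optional variation on step 8, the parallel level can be derived from the machine's CPU count instead of being hard-coded (nproc is part of GNU coreutils):

```shell
# Derive the parallel build level from the number of available CPUs.
export DEB_BUILD_OPTIONS="parallel=$(nproc) nocheck"
echo "$DEB_BUILD_OPTIONS"
```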

For re-running the set up container

  1. Run docker commit <container-id>. This will generate an ID.

For restarting the same container again, use this ID: docker run -v ~/debian-packaging/:/repo -it <ID> bash

  1. Last command for uploading package to PPA:

backportpackage -u <your-ppa-address> -d <ubuntu-version-to-backport-to> -S ~<a-version-suffix-name-for-this-package> <the-most-recent-dsc-file>

Example:

backportpackage -u ppa:cvicentiu/mariadb-10.0-dev2 -d bionic -S ~testtry mariadb-10.1_10.1.30-0ubuntu0.17.10.1.dsc

Run this command in the /repo folder, where the .dsc file is located. It should ask for the GPG key passphrase again.


This page is licensed: CC BY-SA / Gnu FDL

Debugging MariaDB

This section is for articles on debugging MariaDB

Compiling MariaDB for Debugging

Compiling MariaDB for Debugging Using the CMAKE_BUILD_TYPE Option

This option enables multiple debug instrumentation aspects within the MariaDB server that provide more detailed information about complex parts of the server, and it can be used to implement and run tests where the concurrent execution of multiple threads must be controlled to achieve a specific state. If you are not doing this, the following option is sufficient.

Compiling MariaDB with full debug information includes all code symbols and also new code to do internal testing of structures and to allow tracing of MariaDB execution. A full debug binary will be notably slower than a normal binary (about 30%). Most of this overhead can be removed by setting -DWITH_DBUG_TRACE=OFF.

You can configure a debug build by executing cmake and setting the CMAKE_BUILD_TYPE option to Debug. For example:

cmake -DCMAKE_BUILD_TYPE=Debug source_directory

To compile:

cmake --build .

You can find a list of the needed packages/libraries for building on Linux here.

Building Optimized Build With Debug Symbols

To build MariaDB with debug symbols, so that you get better stack traces and can debug the binary with gdb, you need to supply the -Og and -g3 options to the gcc compiler. This produces a debuggable binary with only limited optimization tradeoffs.

Pass these options to cmake like this:

cmake -DCMAKE_C_FLAGS='-Og -g3' -DCMAKE_CXX_FLAGS='-Og -g3' source_directory

To compile:

cmake --build .

Doing a Debug Build on Debian/Ubuntu

To build a "mariadbd" binary with debugging enabled that uses the same parameters as the ones used in Debian/Ubuntu binary packages, do as follows (you must have a deb-src line for one of the MariaDB repositories in your /etc/apt/sources.list in order to do this):

apt install build-essential devscripts fakeroot dpkg-dev
apt-get build-dep mariadb-server
apt-get source mariadb-server
cd mariadb-*
 DEB_CFLAGS_APPEND='-Og -g3' DEB_CXXFLAGS_APPEND='-Og -g3' debuild -us -uc

The packages created will have these flags set.

Temporarily Installing your Debug Build

The commands shown below replace the release mariadbd binary with the debug mariadbd binary that you compiled. Most importantly, they replace the binary in a way which makes it trivial to revert back to the original release mariadbd binary.

First, stop MariaDB.

Then, use the mv utility to rename the release mariadbd binary:

sudo mv /usr/sbin/mariadbd /usr/sbin/mariadbd-orig

Note: Do not use the cp utility because that will change the file modification timestamp.

Then, install the debug mariadbd binary from your source tree:

sudo install ~/mariadb-*/sql/mariadbd /usr/sbin/mariadbd

Then, start MariaDB.

Be sure to replace /usr/sbin/mariadbd with the path to your mariadbd binary and to also replace ~/mariadb-*/sql/mariadbd with the path to your debug mariadbd binary.

Reinstalling your Release Build

If you want to restore your original mariadbd binary, you can do so with the following process:

First, stop MariaDB.

Then, execute the following command to move the debug binary out of the way:

sudo mv /usr/sbin/mariadbd /usr/sbin/mariadbd-debug

Then, execute the following command to move the original mariadbd release binary back into place:

sudo mv /usr/sbin/mariadbd-orig /usr/sbin/mariadbd

Then, start MariaDB.

Be sure to replace /usr/sbin/mariadbd with the path to your mariadbd binary

The debug mariadbd-debug binary is still present if it is needed again in the future.

Different Compilation Options

Changing DBUG_ASSERT to Print to Error Log

A debug binary has lots of code checks and asserts that are not present in production builds. This is done to get more performance when running in production. In some cases, when one is trying to find a hard-to-repeat bug, it can be beneficial to have these checks in production builds too.

Compiling with -DDBUG_ASSERT_AS_PRINTF will change DBUG_ASSERT() to print any failed check to the error log.

cmake . -DDBUG_ASSERT_AS_PRINTF=ON

Enabling the above option should not have any notable impact on performance (probably < 1% slowdown). This is achieved by grouping asserts in MariaDB server code into two groups:

  • Fast checks, using DBUG_ASSERT(): These are converted to printing to error log.

  • Slow checks, using DBUG_SLOW_ASSERT(): These are always removed in production builds.

See Also

  • Build environment setup for Linux

  • Debugging MariaDB with a debugger

  • Creating a trace file

  • Using ASAN with MariaDB

This page is licensed: CC BY-SA / Gnu FDL

Creating a Trace File

If mysqld is crashing, creating a trace file is a good way to find the issue.

A mysqld binary that has been compiled with debugging support can create trace files using the DBUG package created by Fred Fish. To find out if your mysqld binary has debugging support, run mysqld -V on the command line. If the version number ends in -debug then your mysqld binary was compiled with debugging support.
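The check can be scripted; a minimal sketch, where the version_string value is a hard-coded stand-in for the live mysqld -V output:

```shell
# A -debug suffix in the version string marks a debug build.
# version_string is a stand-in for the output of: mysqld -V
version_string="mysqld  Ver 10.5.4-MariaDB-debug for Linux on x86_64"
case "$version_string" in
  *-debug*) build_type="debug" ;;
  *)        build_type="release" ;;
esac
echo "$build_type"
```

On a release binary the version string carries no -debug suffix, so the check falls through to "release".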

See Compiling MariaDB for debugging for instructions on how to create your own mysqld binary with debugging enabled.

To create the trace log, start mysqld like so:

mysqld --debug

Without options, --debug writes the trace file to /tmp/mysqld.trace in MySQL and in MariaDB versions before 10.5, and to /tmp/mariadbd.trace starting from MariaDB 10.5.
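As a sketch, the default trace path as a function of the server version (the version value here is an example):

```shell
# Pick the default trace path for a given server version string.
version="10.5.4"
major=${version%%.*}
rest=${version#*.}; minor=${rest%%.*}
if [ "$major" -gt 10 ] || { [ "$major" -eq 10 ] && [ "$minor" -ge 5 ]; }; then
  trace_file=/tmp/mariadbd.trace
else
  trace_file=/tmp/mysqld.trace
fi
echo "$trace_file"
```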

On Windows, the debug mysqld is called mysqld-debug and you should also use the --standalone option. So the command on Windows will look like:

mysqld-debug --debug --standalone

Once the server is started, use the regular mysql command-line client (or another client) to connect and work with the server.

After you are finished debugging, stop the server with:

mysqladmin shutdown

DBUG Options

Trace files can grow to a significant size. You can reduce their size by telling the server to only log certain items.

The --debug flag can take extra options in the form of a colon (:) delimited string of options. Individual options can have comma-separated sub-options.

For example:

mysqld --debug=d,info,error,query:o,/tmp/mariadbd.trace

The 'd' option limits the output to the named DBUG_ macros. In the above example, the /tmp/mariadbd.trace tracefile will contain output from the info, error, and query DBUG macros. A 'd' by itself (with no sub-options) will select all DBUG_ macros.

The 'o' option redirects the output to a file (/tmp/mariadbd.trace in the example above) and overwrites the file if it exists.
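To make the layout explicit, here is how the example option string above splits into its parts:

```shell
# Colon-separated options, comma-separated sub-options.
debug_spec="d,info,error,query:o,/tmp/mariadbd.trace"
macros=${debug_spec%%:*}     # the 'd' option and its macro list
output=${debug_spec#*:}      # the 'o' option and its file argument
echo "macros: ${macros#d,}"
echo "output: ${output#o,}"
```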

See Also

  • Options for --debug

This page is licensed: CC BY-SA / Gnu FDL

Debugging a Running Server (on Linux)

Even if you don't have a server that is compiled for debugging, there are still ways to get more information out from it if things go wrong.

When things go wrong, it's always better to have a version of the mysqld daemon that is not stripped.

shell> file /usr/sbin/mysqld

If the output says 'not stripped', you are fine. If it says 'stripped', you should either download a binary with debugging information or compile it yourself without stripping the binary.
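That interpretation can be sketched as a small check; the file_output value is a hard-coded stand-in for real `file` output:

```shell
# file_output is a stand-in for the output of: file /usr/sbin/mysqld
file_output="/usr/sbin/mysqld: ELF 64-bit LSB executable, x86-64, not stripped"
case "$file_output" in
  *"not stripped"*) symbols="present" ;;
  *stripped*)       symbols="missing" ;;
  *)                symbols="unknown" ;;
esac
echo "debug symbols: $symbols"
```

On a stripped binary the first pattern does not match, and the check reports missing symbols.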

Debugging Memory Consumption With tcmalloc

Read the Profiling Memory Usage page for more information on how to debug high memory consumption.

If you have a problem with a mysqld process that keeps on growing, you can use tcmalloc to find out what is allocating memory:

Depending on the system you have to install the tcmalloc (OpenSuse) or the google-perftools-lib (RedHat, Centos) package.

The following set of commands starts mysqld with memory profiling and if you kill it with SIGABRT, you will get a core dump that you can examine:

HEAPPROFILE=/tmp/mysqld.prof /usr/sbin/mysqld_safe --malloc-lib=tcmalloc --core-file-size=unlimited --core-file

or if you prefer to invoke mysqld directly:

ulimit -c unlimited
LD_PRELOAD=/usr/lib64/libtcmalloc.so.4 HEAPPROFILE=/tmp/mysqld.prof /usr/sbin/mysqld --core-file

You can of course add other mysqld options to the end of the above line.

Now start your client/application that uses MariaDB. You can find where memory is allocated in the /tmp/mysqld.prof file. If you find any memory issues, please report this in the MariaDB bug tracker!

ptrace Protection and Attaching GDB to a mysqld Instance

New Ubuntu releases do not allow one process to examine the memory of an arbitrary user's process. As a result, when trying to attach GDB to a running MariaDB (or any other process) instance, one gets the following error in GDB:

ptrace: Operation not permitted

More details are available in the Ubuntu Wiki.

To allow GDB to attach, one needs to edit the /proc/sys/kernel/yama/ptrace_scope sysctl value.

  • To change it temporarily, open a root shell and issue:

echo 0 > /proc/sys/kernel/yama/ptrace_scope
  • To change it permanently, edit as root:

/etc/sysctl.d/10-ptrace.conf

and set the value to 0.
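With that edit, the file would contain a line like this:

```
kernel.yama.ptrace_scope = 0
```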

Debugging a Server That Hangs

If your mysqld server hangs, you may want to debug it to know what happened.

Preferably the server should be compiled for debugging, but it's not strictly necessary:

cmake -DCMAKE_BUILD_TYPE=Debug -DWITH_VALGRIND=ON .
make -j4

To know what the server is doing:

  • Find out the process number of mysqld

ps -edalf | grep mysqld
  • Attach to the process and get a back trace:

gdb -p 'pid of mysqld' path-to-mysqld
set height 0
set logging file /tmp/mysqld.log
set logging on
thread apply all backtrace full

After the above, you have a full backtrace, including all local variables, in the mysqld.log file. Note that you will only get all variables if the server is not stripped.

This page is licensed: CC BY-SA / Gnu FDL

Debugging MariaDB With a Debugger

If you have MariaDB compiled for debugging you can both use it in a debugger, like ddd or gdb, and get comprehensive trace files of the execution of MariaDB. The trace files allow you to both see the flow of the code and to see the differences in execution by comparing two trace files.

Core dumps are also much easier to investigate if they come from a debug binary.

Note that a binary compiled for debugging and tracing is about 10-20% slower than a normal binary. If you just compile a binary for debugging (option -g with gcc) the speed difference compared to a normal binary is negligible.

Checking That MariaDB is Compiled For Debugging

Execute:

mariadbd --debug --help

If you are using MariaDB before 10.5, then you should use mysqld instead of mariadbd!

If you get the error unknown option '--debug', then MariaDB is not compiled for debugging and tracing.

Building MariaDB for Debugging Starting from 5.5

On Unix you need to pass -DCMAKE_BUILD_TYPE=Debug to cmake to compile with debug information.

Building MariaDB 5.3 and Older

Here is how you compile with debug on older versions:

Use the scripts in the BUILD directory that will compile MariaDB with most common debug options and plugins, for example:

./BUILD/compile-pentium64-debug-max

For the most common configurations there exists a fine-tuned script in the BUILD directory.

If you want to use valgrind, a very good memory instrumentation tool and memory overrun checker, you should use

./BUILD/compile-pentium64-valgrind-max

Some recommended debugging scripts for Intel/AMD are:

BUILD/compile-pentium64-debug-max
BUILD/compile-pentium64-valgrind-max

This is an example of how to compile MariaDB for debugging in your home directory with MariaDB 5.2.9 as an example:

cd ~
mkdir mariadb
cd mariadb
tar xvf mariadb-5.2.9.tar.gz
ln -s mariadb-5.2.9 current
cd current
./BUILD/compile-pentium64-debug-max

The last command will produce a debug version of sql/mysqld.

Debugging MariaDB From the Source Directory

Creating the MariaDB Database Directory

The following example creates the MariaDB databases in /data.

./scripts/mariadb-install-db --srcdir=. --datadir=/data

Running MariaDB in a Debugger

The following example is using ddd, an excellent graphical debugger in Linux. If you don't have ddd installed, you can use gdb instead.

cd sql
ddd ./mariadbd &

In ddd or gdb

run --datadir=/data --language=./share/english --gdb

You can set the options in your ~/.my.cnf file so as not to have to repeat them on the run line.

If you run mysqld with --debug, you will get a trace file in /tmp/mysqld.trace that shows what is happening.

Note that you can have different options in the configuration file for each MariaDB version (like having a specific language directory).

Debugging MariaDB Server with mariadb-test-run

If you get a crash while running mariadb-test-run you can debug this in a debugger by using one of the following options:

mariadb-test-run --gdb failing-test-name

or if you prefer the ddd debugger:

mariadb-test-run --ddd failing-test-name

Sample .my.cnf file to Make Debugging Easier

[client-server]
socket=/tmp/mysql-dbug.sock
port=3307

[mariadb]
datadir=/my/data
loose-innodb_file_per_table
server_id= 1
log-basename=master
loose-debug-mutex-deadlock-detector
max-connections=20
lc-messages=en_us

[mariadb-10.0]
lc-messages-dir=/my/maria-10.0/sql/share

[mariadb-10.1]
lc-messages-dir=/my/maria-10.1/sql/share

[mariadb-10.2]
lc-messages-dir=/my/maria-10.2/sql/share

[mariadb-10.3]
lc-messages-dir=/my/maria-10.3/sql/share

The above .my.cnf file:

  • Uses an explicit socket for both client and server.

  • Assumes the server source is in /my/maria-xxx. You should change this to point to where your sources are located.

  • Has a unique path for each MariaDB version so that one doesn't have to specify --lc-messages-dir or --language even when switching between debugging different MariaDB versions.

See Also

  • Creating a trace file

  • Configuring MariaDB with my.cnf

  • Running mariadbd from the build directory

This page is licensed: CC BY-SA / Gnu FDL

Debugging Memory Usage

Debugging memory usage on CentOS 7.

This page describes how to debug MariaDB's memory usage. It uses CentOS 7 but can be applied to other systems as well.

The idea is to employ the Google PerfTools heap profiler.

On CentOS:

sudo yum install gperftools 
service mariadb stop
systemctl edit mariadb

This will open an editor.

Add this content and close the file:

[Service]
Environment="HEAPPROFILE=/tmp/heap-prof-1"
Environment="HEAP_PROFILE_ALLOCATION_INTERVAL=10737418240"
Environment="HEAP_PROFILE_INUSE_INTERVAL=1073741824"
Environment="LD_PRELOAD=/usr/lib64/libtcmalloc.so.4"

Then run

service mariadb start

Then, run the workload. When memory consumption becomes large enough, run

ls -la /tmp/heap-prof-*

This should show several files.

Copy away the last one of them:

cp /tmp/heap-prof-1.0007.heap .

Then, run

pprof --dot /usr/sbin/mysqld heap-prof-1.0007.heap  > 7.dot

(Note: this produces a lot of statements like /bin/addr2line: Dwarf Error: ..., likely because it cannot resolve locations in the plugin .so files loaded by mariadbd. This is not a showstopper at the moment.)
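If you want to inspect the profile yourself before sending it, the .dot file can be rendered to an image with Graphviz (this assumes the graphviz package is installed):

```shell
# Render the allocation call graph to a PNG for visual inspection
dot -Tpng 7.dot -o 7.png
```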

Then, please send us the 7.dot file.

This page is licensed: CC BY-SA / Gnu FDL

Enabling Core Dumps

Enabling in an Option File

Core dumps are enabled by default on Windows, so this step can be skipped there. See MDEV-18439 for more information.

In order to enable core dumps, you need to set the core_file system variable either on the command-line or in a relevant server option group in an option file. For example:

[mariadb]
...
core_file

You can check your current value by executing:

my_print_defaults --mariadbd

core_file is a system variable. Its value can be checked at runtime by executing the following:

SHOW GLOBAL VARIABLES LIKE 'core_file';

Core Files on Linux

There are some additional details related to using core files on Linux.

Disabling Core File Size Restrictions on Linux

On some systems there is a limit on the sizes of core files that can be dumped. You can check the system's current system-wide limit by executing the following:

ulimit -c

You can check the current limit of the mariadbd process specifically by executing the following:

sudo cat /proc/$(pidof mariadbd)/limits | grep "core file"

If you need to change the core size limit, the method you use depends on how you start mariadbd. See the sections below for more details.

The resource limits for the mariadbd process are printed to the error log when the mariadbd process crashes. That way, users can confirm whether the process may have been allowed to dump a core file. See MDEV-15051 for more information.

Running mariadbd Using mysqld_safe

If you are starting MariaDB by running mysqld_safe, then configuring the following in the [mysqld_safe] option group in an option file should allow for unlimited sized core files:

[mysqld_safe]
...
core_file_size=unlimited

You can check your current values by executing:

my_print_defaults mysqld_safe

See mysqld_safe: Configuring the Core File Size for more details.

Note: If you are using mysqld_safe and running mariadbd as the root user, then no core file is created on some systems. The solution is to run mariadbd as another user.

Running mariadbd Manually

If you are starting mariadbd manually or in a custom script, then you can allow for unlimited sized core files by executing the following in the same shell or script in which mariadbd is executed:

ulimit -c unlimited

Running mariadbd Using systemd

If you are starting mariadbd using systemd, then you may need to customize the MariaDB service to allow for unlimited size core files. For example, you could execute the following:

Using sudo systemctl edit mariadb.service add the contents:

[Service]

LimitCORE=infinity

See systemd: Configuring the Core File Size for more details.

Running MariaDB Containers

To get a core dump in a mariadb container, the host's sysctl kernel.core_pattern must not begin with a pipe to an executable that doesn't exist in the container. Setting it to a plain core is recommended.
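For example, on the container host (not inside the container), you can check the pattern and switch to a plain file-based core; note that this changes a host-wide setting:

```shell
# Check the host's current core pattern
sysctl kernel.core_pattern

# If it pipes to a handler that doesn't exist in the container,
# switch to a plain "core" file (affects the whole host)
sudo sysctl kernel.core_pattern=core
```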

Also see Container with Debug Symbols.

Changing the System-Wide Limit

If you want to change the system-wide limit to allow for unlimited size core files for the mysql user account, then you can do so by adding the following lines to a file in /etc/security/limits.d/. For example:

sudo tee /etc/security/limits.d/mariadb_core.conf <<EOF
mysql soft core unlimited
mysql hard core unlimited
EOF

The system would have to be restarted for this change to take effect.

See Configuring Linux for MariaDB: Configuring the Core File Size for more details.

Setting the Path on Linux

If you are using Linux, then it can be helpful to change a few settings to alter where the core files is written and what file name is used. This is done by setting the kernel.core_pattern and kernel.core_uses_pid attributes. You can check the current values by executing the following:

sysctl kernel.core_pattern
sysctl kernel.core_uses_pid

If you are using mysql-test-run and want to have the core as part of the test result, the optimal setting is probably the following (store cores in the current directory as core.number-of-process-id):

sudo sysctl kernel.core_pattern=core.%p kernel.core_uses_pid=0

If you are using a production system, you probably want to have the core files in a specific directory, not in the data directory. The place to store cores can be temporarily altered using the sysctl utility, but it is often more common to alter it via the /proc file system. See the following example:

sudo mkdir /tmp/corefiles
sudo chmod 777 /tmp/corefiles
sudo sysctl kernel.core_pattern=/tmp/corefiles/core
sudo sysctl kernel.core_uses_pid=1

The above commands will tell the system to put core files in /tmp/corefiles, and it also tells the system to put the process ID in the file name.

If you want to make these changes permanent, then you can add the following to a file in /etc/sysctl.d/. For example:

sudo tee /etc/sysctl.d/mariadb_core.conf <<EOF
kernel.core_pattern=/tmp/corefiles/core
kernel.core_uses_pid=1
EOF

Note: if you are using containers, the pid is always going to be 1, so this may not be a useful setting. Appending an identifier like %t to the kernel.core_pattern will generate more unique files.

The value of kernel.core_pattern is printed to the error log when the mariadbd process crashes. That way, users can determine where the process may have dumped a core file. See MDEV-15051 for more information.

Note: Ensure that you have enough free disk space in the path pointed to by kernel.core_pattern.

Extracting Linux core dumps with systemd-coredump

Core dump management can be automated using systemd, which then centrally manages all core dump files and provides information about detected core dumps and access to collected core files using the coredumpctl utility.

This is enabled by default on Red Hat Enterprise Linux 8 and CentOS 8, and possibly on other contemporary Linux distribution releases by now, too. It can easily be checked by looking at the kernel.core_pattern setting. If it looks like the following, systemd-coredump is enabled:

# sysctl kernel.core_pattern
kernel.core_pattern = |/usr/lib/systemd/systemd-coredump %P %u %g %s %t %c %h %e

On other distributions like Ubuntu (at least up to 21.10) it is not enabled by default, but can be set up manually.

To see all recent core dumps on the system you can then simply run

# coredumpctl list

Or you can check for MariaDB Server core dumps specifically with:

# coredumpctl list /usr/sbin/mariadbd

If an actual core file got stored, you'll see "present" in the COREFILE column of the output. You can then extract the core file with:

# coredumpctl dump -o mariadbd.core ...PID...

using the process id number from the PID column, or when you just want to retrieve the latest MariaDB Server related entry:

# coredumpctl dump -o mariadb.core /usr/sbin/mariadbd

Starting with systemd 248 it is also possible to invoke the gdb debugger directly using the new --debugger-arguments=... option, e.g. making the extraction of all thread backtraces from the most recent MariaDB server crash a one liner without even having to extract the core dump file first (requires gdb to be installed):

# coredumpctl debug --debugger-arguments="-batch -ex 'thread apply all bt full'" /usr/sbin/mariadbd

So far none of the long-term-support Linux distribution releases have a new enough systemd version for this; the (as of this writing) still upcoming Ubuntu 22.04 "Jammy Jellyfish" will probably be the first to support it.

Core Dumps and setuid on Linux

Since mariadbd executes setuid, you may have to set fs.suid_dumpable=2 to allow core dumps on Linux. You can check the current fs.suid_dumpable value by using the sysctl utility. For example:

sysctl fs.suid_dumpable

You can temporarily set it to 2 by using the sysctl utility. For example:

sudo sysctl -w fs.suid_dumpable=2

Or you can temporarily set it to 2 by writing to the /proc file system. For example:

echo 2 | sudo tee /proc/sys/fs/suid_dumpable

If you want to permanently set it to 2, then you can add the following to a file in /etc/sysctl.d/:

sudo tee /etc/sysctl.d/mariadb_fs_suid_dumpable.conf <<EOF
fs.suid_dumpable=2
EOF

Note: If you don't want to change fs.suid_dumpable, then another solution is to start mariadbd directly as the mysql user, so that the setuid call is not needed.

Forcing a Core File on Linux

You can get a core dump from a running server with:

sudo gcore -o filename $(pidof mariadbd)

This will store a core dump in filename.pid, where pid is the process ID of mariadbd. mariadbd will continue running and will not be affected by gcore.

Another method is to force a core file for mariadbd by sending the process the SIGABRT signal, which has the signal number 6. This is very useful to get the state of an unresponsive mariadbd process. However, this will cause mariadbd to crash, and crash recovery will be run on restart.

You can send the signal with the kill command. For example:

sudo kill -6 $(pidof mariadbd)

As an alternative to $(pidof mariadbd), you can find the process ID either by using the ps utility or by checking the file defined by the pid_file system variable.
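For example, either of the following finds the process ID (the PID file path below is an assumption; it defaults to hostname.pid in the data directory and is controlled by the pid_file system variable):

```shell
# Using ps to list the mariadbd process ID
ps -C mariadbd -o pid=

# Or read the PID file named by the pid_file system variable
cat /var/lib/mysql/$(hostname).pid
```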

Core Files on Windows

Core dumps are enabled by default on Windows. See MDEV-18439 for more information.

There are some additional details related to using core files on Windows.

Minidump Files on Windows

On Windows, the core file is created as a minidump file.

For details on how to configure and read the minidump file, see How to read the small memory dump file that is created by Windows if a crash occurs.

Core Files on Kubernetes

See the IBM Core Dump Handler project.

Core Files and Address Sanitizer (ASAN)

If your mariadbd binary is built with Address Sanitizer (ASAN) then it will not be able to generate a core file.

What's Included in Core Files

Core files usually contain a dump of all memory in the process's full address space. This means that if a server has some large buffers configured (such as a large InnoDB buffer pool), then the server's core files can get very large.

Some large buffers have been excluded from core files on some systems as a way to reduce the size.

The following buffers are excluded:

  • InnoDB buffer pool

  • InnoDB log buffer

  • InnoDB Redo log buffer (fixed 2M)

  • Query cache

The buffers are only excluded on Linux when using kernel version 3.4 and above and when using a non-debug build of mariadbd. Some Linux kernel versions have a bug which would cause the following warning to be printed to the log:

Sep 25 10:41:19 srv1 mysqld: 2018-09-25 10:41:19 0 [Warning] InnoDB: Failed to set memory to DODUMP: Invalid argument ptr 0x2aaac3400000 size 33554432

In those cases, the core dump may exclude some additional data. If that is not a concern, then the warning can be ignored. The problem can be fixed by upgrading to a Linux kernel version in which the bug is fixed.

See Also

  • How to Produce a Full Stack Trace for mariadbd

  • HowTo: Debug Crashed Linux Application Core Files Like A Pro

  • A Nice Feature in MariaDB 10.3: no InnoDB Buffer Pool in Core Dumps

  • Getting MySQL Core file on Linux

This page is licensed: CC BY-SA / Gnu FDL

How to Produce a Full Stack Trace for mariadbd

Partial Stack Traces in the Error Log

When mariadbd crashes, it will write a stack trace in the error log by default. This is because the stack_trace option defaults to ON. With a normal release build, this stack trace in the error log may look something like this:

2019-03-28 23:31:08 0x7ff4dc62d700  InnoDB: Assertion failure in file /home/buildbot/buildbot/build/mariadb-10.2.23/storage/innobase/rem/rem0rec.cc line 574
InnoDB: We intentionally generate a memory trap.
InnoDB: Submit a detailed bug report to https://jira.mariadb.org/
InnoDB: If you get repeated assertion failures or crashes, even
InnoDB: immediately after the mysqld startup, there may be
InnoDB: corruption in the InnoDB tablespace. Please refer to
InnoDB: https://mariadb.com/kb/en/library/innodb-recovery-modes/
InnoDB: about forcing recovery.
190328 23:31:08 [ERROR] mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
 
To report this bug, see https://mariadb.com/kb/en/reporting-bugs
 
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed, 
something is definitely wrong and this may fail.
 
Server version: 10.2.23-MariaDB-10.2.23+maria~stretch
key_buffer_size=134217728
read_buffer_size=131072
max_used_connections=234
max_threads=752
thread_count=273
It is possible that mysqld could use up to 
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 1783435 K  bytes of memory
Hope that's ok; if not, decrease some variables in the equation.
 
Thread pointer: 0x7ff4d8001f28
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x7ff4dc62ccc8 thread_stack 0x49000
*** buffer overflow detected ***: /usr/sbin/mysqld terminated
======= Backtrace: =========
/lib/x86_64-linux-gnu/libc.so.6(+0x70bfb)[0x7ffa09af5bfb]
/lib/x86_64-linux-gnu/libc.so.6(__fortify_fail+0x37)[0x7ffa09b7e437]
/lib/x86_64-linux-gnu/libc.so.6(+0xf7570)[0x7ffa09b7c570]
/lib/x86_64-linux-gnu/libc.so.6(+0xf93aa)[0x7ffa09b7e3aa]
/usr/sbin/mysqld(my_addr_resolve+0xe2)[0x55ca42284922]
/usr/sbin/mysqld(my_print_stacktrace+0x1bb)[0x55ca4226b1eb]
/usr/sbin/mysqld(handle_fatal_signal+0x41d)[0x55ca41d0a01d]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x110e0)[0x7ffa0b4180e0]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0xcf)[0x7ffa09ab7fff]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x16a)[0x7ffa09ab942a]
/usr/sbin/mysqld(+0x40f971)[0x55ca41ab8971]
/usr/sbin/mysqld(+0x887df6)[0x55ca41f30df6]
/usr/sbin/mysqld(+0x863673)[0x55ca41f0c673]
/usr/sbin/mysqld(+0x96648e)[0x55ca4200f48e]
/usr/sbin/mysqld(+0x89b559)[0x55ca41f44559]
/usr/sbin/mysqld(+0x8a15e4)[0x55ca41f4a5e4]
/usr/sbin/mysqld(+0x8a2187)[0x55ca41f4b187]
/usr/sbin/mysqld(+0x8b1a20)[0x55ca41f5aa20]
/usr/sbin/mysqld(+0x7f5c04)[0x55ca41e9ec04]
/usr/sbin/mysqld(_ZN7handler12ha_write_rowEPh+0x107)[0x55ca41d140d7]
/usr/sbin/mysqld(_Z12write_recordP3THDP5TABLEP12st_copy_info+0x72)[0x55ca41b4b992]
/usr/sbin/mysqld(_Z12mysql_insertP3THDP10TABLE_LISTR4ListI4ItemERS3_IS5_ES6_S6_15enum_duplicatesb+0x1206)[0x55ca41b560f6]
/usr/sbin/mysqld(_Z21mysql_execute_commandP3THD+0x3f68)[0x55ca41b6bee8]
/usr/sbin/mysqld(_Z11mysql_parseP3THDPcjP12Parser_statebb+0x28a)[0x55ca41b70e4a]
/usr/sbin/mysqld(+0x4c864f)[0x55ca41b7164f]
/usr/sbin/mysqld(_Z16dispatch_command19enum_server_commandP3THDPcjbb+0x1a7c)[0x55ca41b737fc]
/usr/sbin/mysqld(_Z10do_commandP3THD+0x176)[0x55ca41b748a6]
/usr/sbin/mysqld(_Z24do_handle_one_connectionP7CONNECT+0x25a)[0x55ca41c3ec0a]
/usr/sbin/mysqld(handle_one_connection+0x3d)[0x55ca41c3ed7d]
/usr/sbin/mysqld(+0xb75791)[0x55ca4221e791]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x74a4)[0x7ffa0b40e4a4]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)[0x7ffa09b6dd0f]

If you plan to report a bug about the problem, then this information can be very useful for MariaDB's developers to track down the root cause. However, notice that some of the function names in the call stack are missing. In some cases, this partial stack trace may not be enough to find out exactly where the problem is.

A full stack trace can only be produced if you have debugging symbols for your mariadbd binary.

Obtaining Debugging Symbols for Your mariadbd executable

Debug information is used by debugging tools to produce a meaningful stack trace. Importantly, these packages do not replace any existing production executables or in any way interfere with how the production server ran before they were installed.

If you are obtaining a backtrace for a core dump, you can move the core dump to a different server that has the identical mariadb-server and debuginfo packages, and perform the backtrace there with no loss of information.
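A sketch of that workflow (the hostname and core file path are placeholders):

```shell
# Copy the core file to an analysis host that has the identical
# mariadb-server and debuginfo packages installed
scp /var/lib/mysql/core.932 analysis-host:/tmp/

# Then analyze it there, e.g.:
#   gdb /usr/sbin/mariadbd /tmp/core.932
```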

Installing Debug Info Packages on Linux

On some Linux distributions, you may be able to install debuginfo packages that contain debugging symbols.

Note that debuginfo packages alone may not allow the server to print a nice stack trace in the error log. However, they do allow users to extract full stack traces from core dumps. See MDEV-20738 for more information.

Installing Debug Info Packages with yum/dnf

The MariaDB yum repository contains debuginfo packages.

On RHEL, CentOS, Fedora, and other similar Linux distributions, it is highly recommended to install the relevant RPM package from MariaDB's repository using yum or dnf. Starting with RHEL 8 and Fedora 22, yum has been replaced by dnf, which is the next major version of yum. However, yum commands still work on many systems that use dnf. For example:

sudo yum install MariaDB-server-debuginfo

See Installing MariaDB with yum/dnf: Installing Debug Info Packages with YUM for more information.

Installing Debug Info Packages with zypper

The MariaDB zypper repository contains debuginfo packages.

On SLES, OpenSUSE, and other similar Linux distributions, it is highly recommended to install the relevant RPM package from MariaDB's repository using zypper. For example:

sudo zypper install MariaDB-server-debuginfo

See Installing MariaDB with zypper: Installing Debug Info Packages with ZYpp for more information.

Installing Debug Info Packages from MariaDB's Debian or Ubuntu repository

These are for when you already installed MariaDB from a MariaDB mirror.

For Ubuntu an additional repository step is needed:

sudo add-apt-repository 'deb [arch=amd64,arm64,ppc64el,s390x]  https://ftp.osuosl.org/pub/mariadb/repo/10.5/ubuntu focal main/debug'

Adjust 10.5 to the major version you are debugging and focal to the required distribution.

apt-get update && apt-get install -y mariadb-server-core-10.5-dbgsym

From MariaDB 10.9 the version isn't included in the package name and mariadb-server-core-dbgsym can be used as the package name.

Installing Debug Info Packages packaged by Ubuntu or Debian

If you used the MariaDB versions provided by Debian or Ubuntu see the following links.

For Debian, see the AutomaticDebugPackages wiki page.

For Ubuntu, see the Debug Symbol Packages wiki page.

Installing Debugging Symbols on Windows

Debugging symbols are available to install on Windows.

Installing Debugging Symbols with the MSI Installer on Windows

Debugging symbols can be installed with the MSI installer. Debugging symbols are not installed by default. You must perform a custom installation and explicitly choose to install debugging symbols.

The MSI installer can be downloaded from the MariaDB downloads page.

Installing Debugging Symbols with the ZIP Package on Windows

MariaDB also provides a ZIP package that contains debugging symbols on Windows.

The ZIP package that contains debugging symbols can be downloaded from the MariaDB downloads page.

Containers with Debug Symbols

Prebuilt Debug Containers

These are currently only provided per major release version and are generated from CI. They are always built from the latest version of the main branch on GitHub.

They are available at quay.io/mariadb-foundation/mariadb-debug (see the repository's tags list).

Use the container name quay.io/mariadb-foundation/mariadb-debug:VERSION.

Where VERSION corresponds to the major version you wish to test with.

Creating a Container with Debugging Symbols

Build using a Containerfile/Dockerfile:

ARG VERSION=10.11
FROM mariadb:$VERSION

RUN apt-get update \
        && apt-get install -y gdb mariadb-server-core-dbgsym=$(dpkg -s mariadb-server-core |  awk '/Version/{print $NF}') ; \
        rm -rf /var/lib/apt/lists/*

Build with:

buildah bud --tag mariadb_debug --build-arg VERSION=10.11.6 -f Containerfile .

Then you will have a mariadb_debug container.

Example use:

podman run --entrypoint gdb --user mysql --volume mariadb_data:/var/lib/mysql mariadb_debug -ex r --args /usr/sbin/mariadbd

Enabling Core Dumps

To enable core dumps, see Enabling Core Dumps for details.

Where is the Core File on Linux?

At the bottom of the error log there will be some text about the core location including:

Writing a core file...
Working directory at /var/lib/mariadb
Resource Limits:
Limit                     Soft Limit           Hard Limit           Units     
...
Max core file size        unlimited            unlimited            bytes     
...
Core pattern: |/usr/lib/systemd/systemd-coredump %P %u %g %s %t %c %h

Kernel version: Linux version 6.0.0-0.rc2.19.fc38.x86_64 (mockbuild@bkernel01.iad2.fedoraproject.org) (gcc (GCC) 12.2.1 20220819 (Red Hat 12.2.1-1), GNU ld version 2.39-2.fc38) #1 SMP PREEMPT_DYNAMIC Mon Aug 22 12:52:40 UTC 2022

If there was a core limit in the resource limits, there may be limited or no core file information.

If the core pattern begins with a |, then what follows is the executable that handled the core file during the crash. The following sections show a few techniques to access the core, depending on the pattern. If another program is used, look at its manual page to see how to get access to the core file.

If a plain core filename is in the "Core pattern" there's a good chance it will be in the Working directory location. It might have a .{process number} suffix on the filename.
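For example, to look for such a core file in the working directory (the directory shown here is the one from the log excerpt above):

```shell
# List candidate core files, newest first; names may be
# "core" or "core.<pid>"
ls -lt /var/lib/mariadb/core* 2>/dev/null
```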

Extracting a core file from a container

If you are running MariaDB in a container, the locations where the core dump can be generated are limited. The error log information will likely be in the container log. The "Core pattern" of a Linux system is a global, system-wide value. The consequence is that if the core pattern refers to a program, that program likely isn't in the container and won't be executed on the crash.

The system-wide crash handler can be changed with sysctl kernel.core_pattern=core to set this back to a file-based core dump. With this, the core file should appear in the working directory, normally the /var/lib/mysql data directory of the container volume.
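For example (a host-wide, temporary change; it reverts on reboot):

```shell
# Dump plain "core" files into the crashing process's working directory
sudo sysctl kernel.core_pattern=core

# Verify the setting
sysctl kernel.core_pattern
```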

Extracting a core file from systemd-coredump

For systemd-coredump, there is a program coredumpctl to manage access.

coredumpctl list
TIME                            PID   UID   GID SIG     COREFILE EXE                                                                                        >
Fri 2022-09-09 14:16:37 AEST 213571  1000  1000 SIGSEGV present  /usr/sbin/mariadbd

To load the most recent core dump in gdb, run coredumpctl debug (it defaults to the last crash). Then use the instructions in the next section for extracting information.

See also: extracting core dumps with systemd-coredump.

Extract a core file from abrt

A core pattern of |/usr/libexec/abrt-hook-ccpp indicates the abrt system is used.

abrt-cli is a command-line user interface for accessing the core file.

Extract a core file from apport

A core pattern of |/usr/share/apport/apport indicates apport.

For more information see Apport Project Wiki.

apport-retrace allows you to "Examine Locally" and run a gdb session. Once you have gdb started, the instructions in the next section can be used for extracting information.

Analyzing a Core File with gdb on Linux

To analyze the core file on Linux, you can use gdb.

For example, to open a core file with gdb, you could execute the following:

sudo gdb /usr/sbin/mariadbd  /var/lib/mysql/core.932

Be sure to replace /usr/sbin/mariadbd with the path to your mariadbd binary (might be mysqld on MariaDB 10.4 and earlier) and to also replace /var/lib/mysql/core.932 with the path to your core file.

Once gdb has opened the core file, if you want to log all output to a file, then you could execute the following commands:

set logging file /tmp/gdb_output.log
set logging on

If you do not execute set logging file, then the set logging on command creates a gdb.txt in your current working directory. Redirecting the output to a file is useful, because it can make it easier to analyze. It also makes it easier to send the information to a MariaDB developer, if that becomes necessary.

Do any commands that you would like to do. For example, you could get the backtraces.

Once you are done, you can exit gdb by executing the quit command.

Getting Backtraces with gdb on Linux

On Linux, once you have debugging symbols for your mariadbd binary, you can use the gdb utility to get backtraces, which are what gdb calls stack traces. Backtraces can be obtained from a core file or from a running mariadbd process.

Full backtraces are preferred and will contain function arguments, which can contain useful information such as query strings, so it can make the information easier to analyze.

To get a full backtrace of the main thread, then you could execute the following:

bt -frame-arguments all full

If you want to get a full backtrace of all threads, then you could execute the following:

thread apply all bt -frame-arguments all full

If you want to get a full backtrace to a file to report a bug, the recommended way is to use gdb:

set logging on
set pagination off
set print frame-arguments all
thread apply all bt full
set logging off

This will write the full backtrace into the file gdb.txt.

Getting Full Backtraces For All Threads From a Core File

Sometimes it can be helpful to get full backtraces for all threads. The full backtraces will contain function arguments, which can contain useful information such as query strings, so it can make the information easier to analyze.

To get full backtraces for all threads from a mariadbd core file, execute a command like the following:

sudo gdb --batch --eval-command="set print frame-arguments all" --eval-command="thread apply all bt full" /usr/sbin/mariadbd /var/lib/mysql/core.932  > mariadbd_full_bt_all_threads.txt

Be sure to replace /usr/sbin/mariadbd with the path to your mariadbd binary and to also replace /var/lib/mysql/core.932 with the path to your core dump.

The backtraces will be output to the file mariadbd_full_bt_all_threads.txt.

Getting Full Backtraces For All Threads From a Running mariadbd Process

Sometimes it can be helpful to get full backtraces for all threads. The full backtraces will contain function arguments, which can contain useful information such as query strings, so it can make the information easier to analyze.

To get full backtraces for all threads from a running mariadbd process, execute a command like the following:

sudo gdb --batch --eval-command="set print frame-arguments all"  --eval-command="thread apply all bt full" /usr/sbin/mariadbd $(pgrep -xn mariadbd)  > mariadbd_full_bt_all_threads.txt

Be sure to replace /usr/sbin/mariadbd with the path to your mariadbd binary.

The backtraces will be output to the file mariadbd_full_bt_all_threads.txt.

Sometimes very busy systems are too busy to batch obtain the backtrace. If this is the case, gcore $(pidof mariadbd) can save the core and then obtain the backtrace out of the dumped core.
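A sketch of that two-step approach (the output paths are assumptions):

```shell
# 1. Snapshot a core from the running server; gcore appends the PID
#    to the -o prefix and does not stop the server
sudo gcore -o /tmp/mariadbd.core $(pidof mariadbd)

# 2. Extract the backtraces from the saved core offline
sudo gdb --batch --eval-command="set print frame-arguments all" \
    --eval-command="thread apply all bt full" \
    /usr/sbin/mariadbd /tmp/mariadbd.core.$(pidof mariadbd) \
    > mariadbd_full_bt_all_threads.txt
```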

Getting a Full Backtrace out of a Container

If the crash or assertion is repeatable, it may be easiest to run mariadbd under gdb.

The container image name here can be a prebuilt one from quay.io/mariadb-foundation/mariadb-debug or an explicit version built yourself as above.

docker run -v datavolume:/var/lib/mysql/ --rm --user mysql  quay.io/mariadb-foundation/mariadb-debug:10.11 gdb -ex r  -ex 'thread apply all bt -frame-arguments all full'  --args mariadbd

In docker-compose.yml form this looks like:

services:
  mariadb:
    image: quay.io/mariadb-foundation/mariadb-debug:10.11
    volumes:
      - mariadbdata:/var/lib/mysql
    environment:
      - MARIADB_ROOT_PASSWORD=bob
    command: gdb -ex r  -ex 'thread apply all bt -frame-arguments all full'  --args mariadbd
    user: mysql
volumes:
  mariadbdata: {}

Note: the data volume is assumed to be already initialized. Omit command and user if it isn't.

If you wish to attach to an existing process in a container, the container needs to be started with the SYS_PTRACE capability. The sysctl kernel.yama.ptrace_scope that allows this should also be set to 0.
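For example, on the host:

```shell
# Check the current Yama ptrace restriction
sysctl kernel.yama.ptrace_scope

# Temporarily allow attaching to non-child processes (host-wide)
sudo sysctl kernel.yama.ptrace_scope=0
```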

$ podman run -v data:/var/lib/mysql/ --cap-add SYS_PTRACE --name mtest -d quay.io/mariadb-foundation/mariadb-debug:11.2
$ podman exec --user mysql mtest gdb -p 1 -ex  'thread apply all bt -frame-arguments all full'

Note: podman has the same arguments and behaviour as docker if you'd rather use that.

or in compose:

cap_add:
    - SYS_PTRACE

The container process is always PID 1, and here we use c, then thread apply all bt -frame-arguments all full, as the pre-loaded gdb commands. When a particular signal like an assertion failure or SEGV is triggered, the backtrace will be displayed.

Letting a Container coredump

First, the sysctl kernel.core_pattern needs to be core. If it starts with a pipe character, the kernel will try to execute the named handler, which likely doesn't exist within the container. This is a kernel-wide setting and cannot be applied to a specific container. The bottom of a crash report will show you what it is set to.

With this set, just running a container with debug info is sufficient. The core should be dumped on the data directory volume, where analysis can occur like this:

podman run --rm --user mysql --volume data:/var/lib/mysql -i mariadb_debug gdb --batch --eval-command="set print frame-arguments all" --eval-command="thread apply all bt full" /usr/sbin/mariadbd /var/lib/mysql/core | tee mariadbd_full_bt_all_threads.txt

Running a Copy of the Database Directory

If you are concerned with debuggers running on your production database you can also copy the database to another location.

This is useful when you know which statement crashed the server.
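A minimal sketch of making such a copy (the paths are placeholders; stop the server first, or copy from a consistent backup, so the files are not changing underneath you):

```shell
sudo systemctl stop mariadb
sudo cp -a /var/lib/mysql /copy-of-original-data-directory
sudo systemctl start mariadb
```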

Just start mariadbd with the following options:

mariadbd --datadir=/copy-of-original-data-directory --core-file --stack-trace --socket=/tmp/mariadbd-alone.sock --skip-networking

Disabling Stack Traces in the Error Log

In order to disable stack traces in the error log, you can configure the skip_stack_trace option either on the command-line or in a relevant server option group in an option file. For example:

[mariadb]
...
skip_stack_trace

Reporting the Problem

If you encounter a problem in MariaDB, then MariaDB's developers would appreciate it if you would report a bug at the MariaDB JIRA bug tracker. Please include the following information:

  • Your full stack trace.

  • Your error log.

  • Your option files.

  • How to reproduce the problem.

  • SHOW ENGINE INNODB STATUS

  • SHOW CREATE TABLE {table} output (for each table in the query) and EXPLAIN {query} output, if the crash is related to a query.

A MariaDB FTP server is available for large and/or sensitive information. Please upload it as a .tar.gz or .zip archive.
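As a sketch of packaging the files for upload (all paths below are placeholders; substitute your actual error log, option files, and core dump):

```shell
# Collect the relevant files into one directory (paths are examples).
mkdir -p mdb-report
cp /var/lib/mysql/*.err mdb-report/ 2>/dev/null || true
cp /etc/my.cnf mdb-report/ 2>/dev/null || true

# Create a .tar.gz archive suitable for the FTP server.
tar -czf mdb-report.tar.gz mdb-report
```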

For very difficult or critical errors, you should consider uploading the following information to the MariaDB FTP server:

  • Your build of mariadbd (if you compiled it), otherwise version information on the mariadb-server package.

  • Your core file.

  • Your contact information.

  • The associated JIRA issue identifier for the bug, if you reported a bug.

This information will allow MariaDB developers to analyze it and try to create a fix.

This page is licensed: CC BY-SA / Gnu FDL

How to Use procmon to Trace mysqld.exe Filesystem Access

This article provides a walkthrough on using the Process Monitor on Windows, tracing file system access by mysqld.exe during the "install plugin" call.

Download

Process Monitor is an advanced monitoring tool for Windows that shows real-time file system, registry, and process/thread activity. It is part of the Sysinternals suite developed by Mark Russinovich and Bryce Cogswell. Process Monitor can be downloaded directly from ProcessMonitor.zip. More information can be found at https://learn.microsoft.com/en-us/sysinternals/downloads/procmon

Installation

There is no installation necessary; the single executable can be used after unpacking. I suggest putting procmon into some directory in the PATH environment variable.

Example of taking a mysqld.exe trace

The purpose of the following exercise is to learn how to use procmon to trace mysqld.exe calls to the filesystem.

We assume that mysqld.exe is already started.

  1. Start procmon.exe. A dialog will pop up that offers to set a filter. Use this dialog to set the filter to "Process name" "is" "mysqld.exe", as shown in the screenshot below.

Filter Setup

Click the "Add" button to include mysqld.exe in the filter, then "Apply" and "OK".

  2. Capture events (menu File => Capture Events, Ctrl+E).

  3. Start the mysql command-line client and connect to the server. Execute

mysql> install plugin blackhole soname 'ha_blackhole.dll';
Query OK, 0 rows affected (0.03 sec)

  4. Save the trace

Back in the Process Monitor window, you should see the filesystem events initiated by the "INSTALL PLUGIN" operation

Process Monitor Events

To save it, choose File/Save.

(Advanced) Seeing stack traces corresponding to events

It is also possible to see stack traces corresponding to the events. For this to work, symbol support needs to be configured. This only needs to be done once.

  1. Install Debugging Tools for Windows (google on how to do that).

  2. Switch to Process Monitor's menu Options => Configure symbols.

  3. Add dbghelp.dll from your installation of Debugging Tools into the "dbghelp.dll path" input field. On my system it is C:\Program Files\Debugging Tools for Windows (x64)\dbghelp.dll

  4. In the "symbol path" input field, add srv*C:\symbols*https://msdl.microsoft.com/download/symbols;<path\to\your\installation\bin> (substitute the last path element with the real path to your installation)

This is how it looks on my machine:

Symbol Config

Once symbols are configured, you'll get a stack trace corresponding to a filesystem event by simply double-clicking the line corresponding to the event. This is what I see after clicking on the first event of my tracing session (it corresponds to opening the my.ini file)

Callstack

It is also possible to save the whole trace with callstacks as text (File/Save, choose XML, include callstack + resolve callstack).

This page is licensed: CC BY-SA / Gnu FDL

Limitations/Differences with a MariaDB Server Compiled for Debugging

A MariaDB server configured with --with-debug=full has the following differences from a normal MariaDB server:

  • You can have a maximum of 1000 tables locked at the same time in one statement (defined by MAX_LOCKS in mysys/thr_lock.c). This is to detect loops in the used lists.

  • You can have a maximum of 1000 threads locking the same table (defined by MAX_THREADS in mysys/thr_lock.c). This is to detect loops in the used lists.

  • Deadlock detection for mutexes is done at runtime. If wrong mutex handling is found, an error is printed to the error log. (Defined by SAFE_MUTEX)

  • Memory overruns/underruns and memory that is not freed will be reported to the error log. (Defined by SAFEMALLOC)

  • You can get a trace of what mysqld (and most other binaries) is doing by starting it with the --debug option. The trace is usually written to /tmp or C:\

This page is licensed: CC BY-SA / Gnu FDL

MariaDB Development

MariaDB Server Roadmap

This page talks in general about the MariaDB Server roadmap, and how it is formed. The roadmap it describes is located in our JIRA issue tracker: jira.mariadb.org

MariaDB was designed as a drop-in replacement of MySQL with more features, new storage engines, fewer bugs, and better performance. We aim to make upgrading from MySQL to MariaDB extremely easy.

The roadmap for MariaDB Server is formed by its engineers and product managers, with input from the community.

MariaDB Server developers work with several storage engine vendors and developers to get the best storage engines into MariaDB Server.

The collected information is then turned into specific tasks which are then prioritized and added to the roadmap in the JIRA issue tracker at jira.mariadb.org

Goals and Quality Standards

The primary goal of MariaDB Server is for it to be a practical database developed in the direction users and developers want it to be developed. Each feature should ideally be developed by or for users who want to test and put the feature into production ASAP — this helps ensure we don't implement features no one wants or needs.

We are also putting a lot of effort into speeding up MariaDB Server, and to keep it stable and easy to use!

The MariaDB Server source tree is maintained by MariaDB the company and its developers. They are the primary contributors to the MariaDB Server project and the ones who are ultimately responsible for the quality of the code.

MariaDB Server 5.1, MariaDB Server 5.2, and MariaDB Server 5.3 were built off of MySQL 5.1.

MariaDB Server 5.5 was a combination of MariaDB Server 5.3 and MySQL 5.5.

MariaDB Server 10.0, and later build off of the previous MariaDB Server releases with backported features from MySQL and entirely new features not found anywhere else.

Short descriptions of the various MariaDB Server releases and their main new features can be found on the MariaDB Server Releases page.

Plans

MariaDB 11.8 is an upcoming major development release.

JIRA

We manage our development plans in JIRA, so the definitive list will be there. This search shows what we currently plan for 11.8. It shows all tasks with the Fix-Version being 11.8. Not all these tasks will really end up in 11.8 but tasks with the "red" priorities have a much higher chance of being done in time for 11.8. Practically, you can think of these tasks as "features that will be in 11.8". Tasks with the "green" priorities probably won't be in 11.8. Think of them as "bonus features that would be nice to have in 11.8".

Contributing

If you want to be part of developing any of these features, see Contributing to the MariaDB Project. You can also add new features to this list or to JIRA.

See Also

  • Current tasks for 11.8

  • 11.8 Features/fixes by vote

  • What is MariaDB 11.8?

  • Plans for MariaDB 11.7

  • Plans for MariaDB 11.6

This page is licensed: CC BY-SA / Gnu FDL

Quality Assurance

This section collects articles related to MariaDB quality assurance efforts

Code Coverage

We are working on getting more of the MariaDB source covered by our mysql-test-run (MTR) test suite. This is an ongoing (and slow) task as there is still a lot of old code with not very good coverage.

Goals for new code

For new code in MariaDB, we aim much higher. The goals are:

  1. All new lines of code should ideally be tested by MTR.

  2. Code which cannot reasonably be tested by MTR needs to be tested by another tool and those code lines marked with /* purecov: tested */.

  • In this case the tool used for testing should be documented in the worklog entry for the code or in the commit message.

  3. Code that can't reasonably be tested (such as error conditions) should be marked with /* purecov: inspected */ so that a reviewer of the code can easily spot this code.

  4. Code that is suspected to be dead code should have a 'DBUG_ASSERT(0)' or be marked with /* purecov: deadcode */ so that we have a chance to notice if the code is ever executed.

The reason we are using 'purecov' to mark lines is as a tribute to the purecov tool we originally used for code coverage in the early years of MySQL.

Markers

The recommended markers are:

/* purecov: tested */

  • For code lines that are tested by something other than mysql-test-run:

/* purecov: inspected */

  • For code lines that are hard to test but for which one has read the line multiple times to ensure it is correct. A code reviewer should also inspect these lines with care as they have not been properly tested.

/* purecov: deadcode */

  • For code lines that one suspects will never be called. Having this marker allows us to generate a warning during mysql-test-run code coverage if this line is executed.

The comment must be placed on the line/lines that are affected.

For code blocks larger than 1 line one can use the block syntax:

/* purecov: begin tested */
....
/* purecov: end */

Running mysql-test-run with gcov

Prerequisites

  1. First make sure that gcov 4.9 or later is installed. Older versions of the gcov library (libgcov) can't handle running several instances of a program in parallel. This causes the generated .gcov files to not contain all executed lines when running mysql-test-run with the --parallel option or when running tests that start several mysqld servers, like replication or spider tests.

  2. Compile MariaDB with BUILD/compile-pentium64-gcov (if your machine does not have a pentium CPU, hack this script, or just live with the pentium-specific stuff)

Running mysql-test-run

To be able to see the level of coverage within the current test suite, do the following:

  1. In the mysql-test directory, run this command: ./mysql-test-run -gcov

  2. To see which lines are not yet covered, look at source_file_name.gcov in the source tree. In MariaDB 10.1 or below it's in the CMakeFiles directory where the object files are stored. In MariaDB 10.2 it's stored together with the source files.

  3. Think hard about a test case which will cover those lines that are not tested, and write one.

Tools

  • You can use the dgcov tool to check the coverage for new code. It is especially written and maintained for MariaDB.

  • For code coverage you can also use the lcov tool.

Code coverage in buildbot

Buildbot, the MariaDB build system, does automatic coverage testing for each push.

This page is licensed: CC BY-SA / Gnu FDL

Code Coverage with dgcov

The dgcov tool helps you check the coverage for new code. The dgcov.pl script is part of the mariadb-test framework (and any packages that include mariadb-test).

Overview

The dgcov program runs gcov for code coverage analysis, aggregates the coverage data, and (optionally) reports coverage only for those lines that are changed by the commit(s). Commits are specified in the git diff format.

If no commits are specified, the default is to work on all uncommitted changes, if any, otherwise on the last commit (in other words, on git diff HEAD or git diff HEAD^).

It's recommended that a developer runs dgcov on their new code before pushing it into a MariaDB repository.

Usage

./dgcov.pl --help
./dgcov.pl [options] [<commit> [<commit>]]

Options and Variables

| Short Option | Long Option | Description |
| --- | --- | --- |
| -h | --help | Print help and exit |
| -v | --verbose | Show commands run |
| -p | --purge | Delete all test coverage information, to prepare for a new coverage test |
| -o | --only-gcov | Stop after running gcov, don't run git |
| -s | --skip-gcov | Do not run gcov, assume .gcov files are already in place |
| -g | --generate | Create .dgcov files for all source files |

How to Prepare the Code for dgcov

Prior to running this tool, MariaDB should be built with

cmake -DENABLE_GCOV=ON

and the testsuite should be run. dgcov will report the coverage for all lines modified in the specified commits.

Output

Output .dgcov files have a conventional gcov format: lines not covered are prefixed with #####, lines without generated code are prefixed with -, and other lines are prefixed with the number of times they were executed. See info gcov for more information.

The patch-like coverage for commits uses the gcov format (as above) for lines changed in these commits, and no prefix at all for lines that were not changed.
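To illustrate the format described above, here is a hypothetical .dgcov excerpt (the line numbers, counts, and code are invented), filtered for uncovered lines the way a reviewer might scan it:

```shell
# Write a hypothetical .dgcov excerpt: executed lines carry a count,
# '#####' marks uncovered lines, '-' marks lines with no generated code.
cat > sample.dgcov <<'EOF'
        5:  101:  if (a > b)
    #####:  102:    return error_path();
        -:  103:  /* comment only, no code generated */
        5:  104:  return ok;
EOF

# Count the uncovered lines.
grep -c '#####' sample.dgcov   # prints 1
```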

Examples

Checking the coverage for all unpushed commits:

dgcov.pl @{u} HEAD

Checking the coverage for all uncommitted changes:

dgcov.pl HEAD

Checking the coverage for a specific commit 1234567:

dgcov.pl 1234567^ 1234567

mariadb-test-run can invoke dgcov automatically:

./mtr --gcov

In the latter case, the coverage for the uncommitted changes (or the last commit) will not be printed to stdout, but will be put into the var/last_changes.dgcov file.

Caveats

Note that to be able to run gcov with the mariadb-test framework you need to have gcc version 4.8 or newer.

References

  • dgcov was created by Kristian Nielsen and was first announced here.

  • dgcov was re-implemented to aggregate the data and to work for git and cmake by Sergei Golubchik.

This page is licensed: CC BY-SA / Gnu FDL

Optimizer Quality

Generating the Queries

As of Nov 2010, there are 5 primary SQL grammars available for testing the Optimizer:

  • optimizer_nosubquery.yy generates random queries with no subselects, with up to 3-way joins and with various SQL clauses such as aggregates, GROUP BY, HAVING, and LIMIT;

  • optimizer_subquery.yy generates queries with subselects with up to 2 levels of nesting. Subqueries are placed in various locations, such as in the SELECT list, in the ON clause, etc. Aggregates, LIMIT, HAVING, etc. are used if allowed by the server;

  • outer_join.yy generates outer joins of various depths;

  • range_optimizer.yy tests the range optimizer by joining several tables and generating various conditions on which range optimization is likely to be applied;

  • range_optimizer2.yy tests the range optimizer by generating single-table queries that contain a lot of range-optimizable clauses. Avoiding joins allows the single table to be arbitrarily large, thus allowing for more interesting range overlaps;

Validating the Results

As of Nov 2010, the RQG has two primary modes for validating the results:

  • by using a reference implementation. This can be PostgreSQL, JavaDB, or another version or flavor of MySQL/Maria/Drizzle. Testing this way requires trusting external software that is not under our control. Also, it is sometimes difficult to determine which implementation has returned the correct result. Technically, 3 implementations can "vote" as to which is the correct result, but this is not reliable if the implementations all derive from one another.

  • by executing the generated query using a different execution plan. This is usually achieved by disabling particular optimizations and thus "downgrading" the plan to a more basic, presumed stable one. It is assumed that a nested-loop-join that takes no advantage of indexes would always provide the correct result. The advantage of this approach is that there is no need for a reference implementation and the source of the wrong result can be obtained by diffing the original and the downgraded execution plan.

In addition to result set validation, there is a module which executes each generated SELECT in various contexts, such as part of a union, stored procedure, trigger, etc. and makes sure that the query returns a correct result. This is most often used for testing subselects.

Running a Complete Test Cycle

A test cycle is described in a configuration file called the CC file. The CC file contains a list of mysqld options to use, the list of grammars to use and other settings (e.g. Engines, table sizes, etc.). The testing framework will then take a random permutation from the settings described in the file and run them as a RQG test for a predefined time, such as 10 minutes. This is repeated up to 100 times, each with a different random permutation. The PRNG seed for each run will also be different, so different queries will be generated for each run, in addition to using different mysqld options, engine, etc.

By default, all cycles include MyISAM, Aria and InnoDB, and some percentage are run under Valgrind. Cycles run with both NULL and NOT NULL fields and with and without simple views.

Configuration for Join Cache Testing

outer_join_with_cache is always ON. --join_cache_level varies from 0 to 8. --join_buffer_size varies between 1, 100, 1K, 10K and 100K. The optimizer_no_subquery.yy, outer_join.yy and range_access.yy grammars are used. Once semijoin is stable, join_cache + semijoin will be tested with optimizer_subquery.yy.

Configuration for MRR/ICP/DS-MRR-CPK Testing

--optimizer_use_mrr is ON, mrr_sort_keys is both ON and OFF, index_condition_pushdown is both ON and OFF, join_cache_level is between 0 and 8, join_buffer_size and mrr_buffer_size are 1, 100, 1K, 10K and 100K. The optimizer_no_subquery.yy, outer_join.yy, range_access.yy and range_access2.yy grammars are used.

Configuration for Subquery Testing

The optimizer_subquery.yy grammar is used. Each individual optimizer_switch flag related to subquery optimization may be disabled so that the "second best" plan is generated.

Testing MWL#89

When testing MWL#89, the following optimizer_switch combinations are used: in_to_exists=ON,materialization=OFF; in_to_exists=OFF,materialization=ON; and in_to_exists=ON,materialization=ON. In addition, semijoin is always OFF to force more queries to use materialization/in_to_exists. subquery_cache is OFF to prevent subquery cache bugs from showing up during the test.

See Also

  • RQG Documentation

  • RQG Performance Comparisons

  • RQG Extensions for MariaDB Features

  • QA Tools

  • Worklog Quality Checklist Template

This page is licensed: CC BY-SA / Gnu FDL

QA - Aria Recovery

General Principles

Recovery is tested via the RQG, which provides a random workload against the server, and then uses kill -9 to kill the process. After that, recovery is attempted both by using maria_read_log and by restarting the mysqld process. Once the server has started up, the tables are verified in various ways, including ALTER|OPTIMIZE|ANALYZE|REPAIR TABLE as well as SELECT queries that read the table back and forth using various access methods.

A combinations .CC file named lp:randgen/conf/engines/maria/maria_recovery.cc is used to define various mysqld options and RQG parameters that are relevant to recovery. Then, RQG's combinations.pl script is used to run hundreds of individual test runs. Each run uses a random permutation of the settings in the .CC file in order to generate a unique workload that is then validated via the Recovery RQG Reporter.

Individual tests

The following are the individual tests or test runs that must be completed or created in order to ensure that Aria recovery is solid.

Standard kill -9 testing

Done 2011-02-28. The standard conf/engines/maria/maria_recovery.cc passes with no failures when run with hundreds of trials.

Testing with small block sizes

On hold pending 2 bug fixes related to --maria-block-size=1K and --maria-block-size=2K

Testing with small page cache size

Done 2011-03-04. Completed 400 rounds with

'--mysqld=--maria-block-size=4K --mysqld=--maria-pagecache-buffer-size=128K',
'--mysqld=--maria-block-size=16K --mysqld=--maria-pagecache-buffer-size=256K',
'--mysqld=--maria-block-size=32K --mysqld=--maria-pagecache-buffer-size=512K'

Two pre-recovery crashes were filed; no recovery issues were found.

Killing and restarting the recovery process itself

In Progress

The AriaDoubleRecovery reporter currently attempts double recovery via maria_read_log. The first invocation of maria_read_log is killed halfway through the process and the second invocation is left to complete the recovery.

Future testing will involve doing the same with the mysqld server in place of maria_read_log.

Another realistic workload

The usefulness of the SMF workload, derived from the SimpleMachines forum application, means that another such workload is required in order to make sure no residual recovery bugs remain. Hopefully something can be cooked up using Wikipedia so that longer fields and blobs are exercised.

Transactional consistency

A transactional grammar that simulates transactions using LOCK TABLES is required. The RecoveryConsistency Reporter can then be used to validate that no partial transactions appear in the database after recovery.

See also

  • RQG Performance Comparisons

  • RQG Extensions for MariaDB Features

  • Optimizer Quality

  • QA Tools

  • Worklog Quality Checklist Template

This page is licensed: CC BY-SA / Gnu FDL

QA Metrics

Code coverage

The gcov/lcov reports are run by BuildBot and can be viewed here. The tests that are run to obtain the coverage percentage are listed under the f_gcov Factory in the BuildBot configuration file.

Bugs

The list of MariaDB bugs is available on JIRA.

This page is licensed: CC BY-SA / Gnu FDL