MyRocks and Bloom Filters

Bloom filters are used to reduce read amplification. Bloom filters can be set on a per-column family basis (see myrocks-column-families).

Bloom Filter Parameters

  • How many bits to use
  • whole_key_filtering=true/false (TODO what is that?)
  • Whether the bloom filter is for the entire key or for the prefix. In case of a prefix, you need to look at the index definition and compute the desired prefix length.

Computing Prefix Length

  • It's 4 bytes for index_nr
  • Then it is key_length as shown by EXPLAIN.

Configuring Bloom Filter

To enable 10-bit bloom filter for 4-byte prefix length for column family "cf1", put this into my.cnf:


and restart the server.

Check if the column family actually uses the bloom filter:

select * 
from information_schema.rocksdb_cf_options 
  cf_name='cf1' and
| CF_NAME | OPTION_TYPE                  | VALUE                      |
| cf1     | PREFIX_EXTRACTOR             | rocksdb.CappedPrefix.4     |
| cf1     | TABLE_FACTORY::FILTER_POLICY | rocksdb.BuiltinBloomFilter |

Checking if Bloom Filter is Useful

Watch these status variables:

show status like '%bloom%';
| Variable_name                       | Value |
| Rocksdb_bloom_filter_prefix_checked | 1     |
| Rocksdb_bloom_filter_prefix_useful  | 0     |
| Rocksdb_bloom_filter_useful         | 0     |

Other useful variables are:

  • rocksdb_force_flush_memtable_now - bloom filter is only used when reading data from disk. If you are doing testing, flush the data to disk first.
  • rocksdb_skip_bloom_filter_on_read - skip using the bloom filter (default is FALSE).


Comments loading...