Often, the first step in evaluating and deploying a database is
to load an existing dataset into the database. In the latest
version, TokuDB makes use of multi-core parallelism to speed up
loading (and new index creation). Using the loader, MySQL tables
using TokuDB load 5x-8x faster than with previous versions of
TokuDB.
Measuring Load Performance
We generated several different datasets to measure the
performance of TokuDB when doing a LOAD DATA INFILE … command. To
characterize performance, we vary
- rows to load
- keys per row
- row length (including keys)
All generated keys, including the primary, are random, 8-byte
values. The remaining data, needed to pad out the row length to
specified length, is text.
Two files files are produced as part of data generation.
- data file, containing ‘|’ separated fields
- sql file, containing the …
[Read more]