One of the hot mutexes in InnoDB is the buffer pool mutex.
Among other things this mutex protects the page hash where
pages reside when they are in the cache.
There is already a number of variants of how to split out
this mutex. Here follows a short description of the various
approaches.
1) Google v3 approach
Ben Hardy at Google took the approach of using an array of
mutexes (64 mutexes) and this mutex only protects the
actual read, insert and delete from the page hash table.
This has the consequence of a very simple patch, it means
also that when the block has been locked one has to check
that the owner of the block hasn't changed since we didn't
protect the block between the read of the hash and the
locking of the block, thus someone is capable of coming in
between and grabbing the block for another page before we
get to lock the …
When I worked with Dimitri on the analysis of the
Split Rollback Segment Mutex he came up with numbers
on InnoDB Thread Concurrency set to 16 and 32 and I was
curious
to see if 24 was the optimal setting. So he made some new runs
and
some new graphs that I found interesting.
The first graph analyses behaviour of MySQL 5.4.0 on a
SPARC
Server using InnoDB Thread Concurrency set to 0, 16, 24 and
32.
Interestingly for both readonly and readwrite benchmarks
the
optimal setting for concurrency is 16 whereas the top
numbers
(at 32 threads) is achieved with concurrency set to 24 or
32.
…
When I read the blog about Split Rollback Segment Mutex,
I was interested to verify those results in the context of MySQL
5.4.0.
The patch can be found here.
We've analysed this patch both on a large SPARC system and on
my
benchmark x86/Linux machine. Our results tend to be positive
for
readwrite benchmarks but sometimes negative for readonly
benchmarks. Also the gain is much smaller than found in the
blog.
Also this patch has two negative effects, the first is that
it
provides an upgrade problem, this can probably be handled in
the
InnoDB code, but requires quite some digging. The other is
that
instead of writing UNDO results to one UNDO log, we write …
One of the goals we had originally with the MySQL 5.4
development was to improve scaling from 4 cores to
8 cores. So in my early testing I ran comparisons of
the Google SMP + IO + tcmalloc patches on 4, 8 and 12
cores to see how it behaved compared with a stock
MySQL 5.1.28 version (Note the comparison here was
done on a very early version of 5.4, 5.4.0 have a
set of additional patches applied to it).
What we can see here is that the Google SMP patch and use
of tcmalloc makes a difference already on a 4-core server
using 4 threads. On 1 and 2 threads the difference is only
on the order of 1-2% so not really of smaller significance.
An interesting note in the graph is that 8-core numbers
using
the Google …
Louis I | |
Louis II | |
Louis III | |
Louis IV | |
Louis V | |
Louis VI | |
Louis VII | |
Louis VIII | |
Louis IX | |
Louis X (dit le Hutin) | (the Quarrelsome) |
Louis XI | |
Louis XII | |
Louis XIII | |
Louis XIV | |
Louis XV | …
If you like to sift through tons of benchmark data about
various
MySQL versions, Dimitri at the Sun Benchmark Labs have
published
a serious amount of benchmark data in a report published here.
The report shows that the new MySQL 5.4.0 release
have a very good performance. The report also shows how the
day
of a developer of performance improvements and the massive
amount
of benchmark data that needs to be analysed and sifted
through
to understand the impact of new performance improvements.
I personally met Dimitri the first time in 2002 when I was
working
together with him for a couple of weeks on a benchmark on NDB
Cluster
(the storage engine of MySQL Cluster). Our goal then was to
perform
1 million reads per second on a 72-cpu SPARC box with …
In InnoDB there is an implementation of both mutexes
and RW-locks. The RW-locks implementation have been
improved by the Google SMP patches. Both of these
implementation relies on spin-loops as part of their
implementation. The defaults in InnoDB is to check
the condition, if it's not ok to enter to spin for
about 5 microseconds and then come back to check the
condition again.
If one reads the Intel manual how to do spin-loops
they propose to use a PAUSE instruction and then
check the condition again, so a much more active
checking of the condition. When we tried this out
using the sysbench benchmark we found that using
the Intel approach worsened performance. So instead
we tried an approach of putting the PAUSE instruction
into the InnoDB spinloop instead.
This approach turned out to be a success. Even on
machines with only one …
Currently I'm working hard to find and remove scalability
bottlenecks in the MySQL Server. MySQL was acquired by Sun
10 months ago by now. Many people have in blogs wondered
what
the impact has been from this acquisition. My personal
experience is that I now have a chance to work with Sun
experts in DBMS performance. As usual it takes time when
working on new challenges before the flow of inspiration
starts flowing. However I've seen this flow of inspiration
starting to come now, so the fruit of our joint work is
starting to bear fruit. I now have a much better
understanding
of MySQL Server performance than I used to have. I know
fairly
well where the bottlenecks are and I've started looking
into how they can be resolved.
Another interesting thing with Sun is the innovations they
have
done in a number of areas. One such area is DTrace. This is
a …