ChainDB: batch garbage collections #1932

mrBliss · 2020-04-10T06:57:28Z

Previously, we scheduled a garbage collection for each block for 10 seconds in
the future. This meant that our scheduled GCs queue was blocks/s * 10 long.
When tracing the queue length on my machine, it hovered between 5000 and 6000
entries. Moreover, a VolatileDB garbage collection is triggered at blocks/s,
which should result in a lot of contention for the VolatileDB state.

Even worse is that a 10 second delay is too short to reliably ensure the block
will have been flushed to disk (in the ImmutableDB) before it is garbage
collected. However, increasing this delay would make the queue significantly
longer.

To fix these issues, we introduce a GC interval (in seconds). We batch all GCs
in the same interval together. This means that the queue length is now at most
⌈delay / interval⌉ + 1, e.g., 60s / 10s = 7, which is much shorter than
5000-6000. Moreover, there will be at most one GC every interval seconds,
e.g., 10s.

The cost of switching to a longer GC delay is that the in-memory index of the
VolatileDB will be larger, making operations on it, such as lookups, more
expensive (most operations are O(n*log(n))). See the docstring of
'defaultSpecificArgs' for what the new default values of gcDelay and
gcInterval mean in practice.

Previously, we scheduled a garbage collection for each block for 10 seconds in the future. This meant that our scheduled GCs queue was blocks/s * 10 long. When tracing the queue length on my machine, it hovered between 5000 and 6000 entries. Moreover, a VolatileDB garbage collection is triggered at blocks/s, which should result in a lot of contention for the VolatileDB state. Even worse is that a 10 second delay is too short to reliably ensure the block will have been flushed to disk (in the ImmutableDB) before it is garbage collected. However, increasing this delay would make the queue significantly longer. To fix these issues, we introduce a GC interval (in seconds). We batch all GCs in the same interval together. This means that the queue length is now at most ⌈delay / interval⌉ + 1, e.g., 60s / 10s = 7, which is much shorter than 5000-6000. Moreover, there will be at most one GC every `interval` seconds, e.g., 10s. The cost of switching to a longer GC delay is that the in-memory index of the VolatileDB will be larger, making operations on it, such as lookups, more expensive (most operations are `O(n*log(n))`). See the docstring of 'defaultSpecificArgs' for what the new default values of `gcDelay` and `gcInterval` mean in practice.

mrBliss · 2020-04-10T07:01:24Z

bors merge

iohk-bors · 2020-04-10T08:03:29Z

Build succeeded

1932: ChainDB: batch garbage collections r=mrBliss a=mrBliss Previously, we scheduled a garbage collection for each block for 10 seconds in the future. This meant that our scheduled GCs queue was blocks/s * 10 long. When tracing the queue length on my machine, it hovered between 5000 and 6000 entries. Moreover, a VolatileDB garbage collection is triggered at blocks/s, which should result in a lot of contention for the VolatileDB state. Even worse is that a 10 second delay is too short to reliably ensure the block will have been flushed to disk (in the ImmutableDB) before it is garbage collected. However, increasing this delay would make the queue significantly longer. To fix these issues, we introduce a GC interval (in seconds). We batch all GCs in the same interval together. This means that the queue length is now at most ⌈delay / interval⌉ + 1, e.g., 60s / 10s = 7, which is much shorter than 5000-6000. Moreover, there will be at most one GC every `interval` seconds, e.g., 10s. The cost of switching to a longer GC delay is that the in-memory index of the VolatileDB will be larger, making operations on it, such as lookups, more expensive (most operations are `O(n*log(n))`). See the docstring of 'defaultSpecificArgs' for what the new default values of `gcDelay` and `gcInterval` mean in practice. Co-authored-by: Thomas Winant <[email protected]>

mrBliss added 3 commits April 9, 2020 10:03

Test.Util.QuickCheck: add le and gt

c661d71

Add Condense Time instance

f9cc04f

mrBliss added the consensus issues related to ouroboros-consensus label Apr 10, 2020

mrBliss requested a review from edsko April 10, 2020 06:57

edsko approved these changes Apr 10, 2020

View reviewed changes

iohk-bors bot merged commit 5fc0470 into master Apr 10, 2020

iohk-bors bot deleted the mrBliss/batch-gc branch April 10, 2020 08:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChainDB: batch garbage collections #1932

ChainDB: batch garbage collections #1932

mrBliss commented Apr 10, 2020

mrBliss commented Apr 10, 2020

iohk-bors bot commented Apr 10, 2020

ChainDB: batch garbage collections #1932

ChainDB: batch garbage collections #1932

Conversation

mrBliss commented Apr 10, 2020

mrBliss commented Apr 10, 2020

iohk-bors bot commented Apr 10, 2020

Build succeeded