
Allocate fewer objects #4447

Merged · 4 commits · Mar 17, 2014
Conversation

nnethercote
Contributor

I have a hacked version of Firefox that identifies where object allocations in JS code occur. With it, I've created a series of patches that reduce the number of allocations that occur when loading/viewing a PDF by up to 50%. All of the eliminated allocations are of small, short-lived objects, which might not seem important, but it is: it reduces pressure on the garbage collector.

Of the eight documents I've tested, five of them had their peak RSS reduced by 15--40 MiB. And all the changes are pretty simple.

@brendandahl
Contributor

Needs a rebase.

@timvandermeij
Contributor

@nnethercote Is #3769 related to this PR (seeing nnethercote@42bb602)?

@nnethercote
Contributor Author

Tim: the first rule of microbenchmarks: you shouldn't trust microbenchmarks. It's hard to write good ones, and they rarely measure what they're meant to measure.

Here's an uninteresting question: is `a.length = 0` slower than `a = []` in a microbenchmark?

Here's an interesting question: does the use of `a.length = 0` rather than `a = []` make pdf.js noticeably slower?

I'm confident the answer is "no", though pdf.js lacks good performance benchmarks (AIUI), so it's hard to say for certain. But from my understanding of JS engines, zeroing the length should be no slower -- you're possibly deallocating a slots array, but that would happen later during GC anyway. And avoiding the allocation saves memory and reduces the amount of garbage, thus delaying the time until the next GC occurs.

@timvandermeij
Contributor

That sounds good. Thank you for the explanation; that adds context to #3769 which I found interesting.

@nnethercote
Contributor Author

I've used the length=0 trick in some of my other patches. In principle I don't think there's anything wrong with #3769's patch, but I haven't seen that code show up as hot in any of my profiling.

@nnethercote
Contributor Author

I rebased and repushed.

@timvandermeij
Contributor

@nnethercote Travis is complaining a bit:

src/core/evaluator.js: line 1441, col 39, Expected an assignment or function call and instead saw an expression.
src/core/evaluator.js: line 1441, col 41, Missing semicolon.

@yurydelendik
Contributor

As in #4368, I would not recommend touching the code near calcRenderParams: we will eventually use all of the arguments in the matrix, so temporarily commenting them out doesn't solve the problem going forward. I'm not sure about the preprocessor args optimization either, for the same reason; it also hurts the readability of the evaluator loops, which makes them unmaintainable imho. I do like the getBytes2/getBytes4 stuff, but let's rename them to getUint16 and getInt32. @brendandahl ?

@nnethercote
Contributor Author

The patches are ordered by effectiveness. The calcRenderParams one isn't so important, but the Preprocessor.read() one is -- even after applying the patch, which reduces the object allocations from those two loops by 75%, the remaining 25% still represents the biggest single source of allocations in the entire codebase. In some documents it accounts for over 100,000 object allocations, typically 10--15% of all object allocations. Remove that optimization and the number quadruples.

This is achieved by adding getBytes2() and getBytes4() to streams, and by
changing int16() and int32() to take multiple scalar args instead of an array
arg.
@nnethercote
Contributor Author

I renamed getBytes{2,4}() as getUint{16,32}, and removed the patches modifying
calcRenderParams() and EvaluatePreprocessor_read().

W.r.t. the latter, I have a plan to restructure the way data is passed from the
worker to the main thread via fnArray/argsArray. If it works out, it will avoid
all the allocations in and around EvaluatePreprocessor_read(), plus all those
for the small arrays within argsArray.

@brendandahl
Contributor

/botio test

@pdfjsbot

From: Bot.io (Linux)


Received

Command cmd_test from @brendandahl received. Current queue size: 0

Live output at: http://107.21.233.14:8877/320c74ce660ee00/output.txt

@pdfjsbot

From: Bot.io (Windows)


Received

Command cmd_test from @brendandahl received. Current queue size: 1

Live output at: http://107.22.172.223:8877/141c7086fb7f371/output.txt

@pdfjsbot

From: Bot.io (Linux)


Success

Full output at http://107.21.233.14:8877/320c74ce660ee00/output.txt

Total script time: 25.76 mins

  • Font tests: Passed
  • Unit tests: Passed
  • Regression tests: Passed

@pdfjsbot

From: Bot.io (Windows)


Success

Full output at http://107.22.172.223:8877/141c7086fb7f371/output.txt

Total script time: 37.02 mins

  • Font tests: Passed
  • Unit tests: Passed
  • Regression tests: Passed

brendandahl added a commit that referenced this pull request Mar 17, 2014
@brendandahl brendandahl merged commit 1802fff into mozilla:master Mar 17, 2014
@nnethercote nnethercote deleted the object-reduction branch March 21, 2014 03:59