Improve how DecodeStream handles empty buffers. #5025

nnethercote · 2014-07-03T01:59:50Z

DecodeStream currently initializes its |buffer| field to |null|, which
is reasonable, because lots of DecodeStreams never need to instantiate a
buffer. But this requires various special cases in the code.

This patch changes it so that DecodeStreamClosure has a single empty
Uint8Array which every DecodeStream's |buffer| field points to upon
initialization. This avoids the special cases.

DecodeStream.prototype.ensureBuffer() is really hot, and this removes a
test from the fast path. For one 226-page scanned document this sped up
rendering by about 2%.
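
To illustrate the idea described above, here is a minimal sketch rather than the actual patch. Only DecodeStream, DecodeStreamClosure, |buffer| and ensureBuffer() come from the description; the emptyBuffer name, the 512-byte starting size and the doubling strategy are assumptions made for the example.

```javascript
// Hypothetical sketch of the shared-empty-buffer pattern; not the real pdf.js code.
var DecodeStream = (function DecodeStreamClosure() {
  // Assumed name: one zero-length array shared by every stream that has
  // not needed a real buffer yet.
  var emptyBuffer = new Uint8Array(0);

  function DecodeStream() {
    // Previously this would have been null, forcing null checks elsewhere.
    this.buffer = emptyBuffer;
  }

  DecodeStream.prototype.ensureBuffer = function (requested) {
    var buffer = this.buffer;
    // Fast path: no null test needed, because byteLength is 0 for the
    // shared empty buffer and the comparison simply fails.
    if (requested <= buffer.byteLength) {
      return buffer;
    }
    // Slow path: allocate a private buffer on demand (assumed growth policy).
    var size = 512;
    while (size < requested) {
      size *= 2;
    }
    var grown = new Uint8Array(size);
    grown.set(buffer); // copying from the empty buffer copies nothing
    return (this.buffer = grown);
  };

  return DecodeStream;
})();
```

In this sketch, two fresh streams share the same zero-length array, and the first ensureBuffer() call that asks for space replaces only that stream's |buffer| with a newly allocated one.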


CodingFabian · 2014-07-07T15:07:45Z

What is the risk of accidentally writing to the empty buffer?
How does one do immutability in JavaScript? Could you unset the modifying methods of the instance?

nnethercote · 2014-07-07T22:48:17Z

In a typed array, if you read from out-of-bounds elements, you get undefined. If you write to out-of-bounds elements, nothing happens. Those operations shouldn't be happening, but even if they do, it's not a problem (well, not a problem beyond the fact that there's an underlying defect anyway).
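
A small sketch of that behavior on a zero-length typed array follows. The variable names are made up for illustration. The last part uses the standard set() method, which, unlike indexed writes, reports an out-of-range copy by throwing a RangeError.

```javascript
// Illustration only: out-of-bounds access on a zero-length Uint8Array.
var empty = new Uint8Array(0);

// Reading any index of an empty typed array yields undefined.
var readResult = empty[0];

// An indexed write past the end is silently dropped; the array stays empty.
empty[0] = 42;
var afterWrite = empty[0];
var lengthAfterWrite = empty.length;

// A bulk copy that does not fit is rejected with a RangeError instead.
var setThrew = false;
try {
  empty.set([1, 2, 3]);
} catch (e) {
  setThrew = e instanceof RangeError;
}

console.log(readResult, afterWrite, lengthAfterWrite, setThrew);
```

So a stray indexed write cannot put data into the shared empty array, and an oversized set() fails loudly rather than corrupting it.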

Firefox's nsTArray has a similar optimization -- there's a single shared header used for empty arrays. That's where I got the idea.

CodingFabian · 2014-07-07T23:09:41Z

OK, got it: you are saying that because it was initialized with zero length, it will never contain data. That's good.
So the PR eliminates some checks which are no longer needed, because at least the empty array always exists, and the other places which deal with existing content would put a new, appropriately sized array into the variable.
Looks good to me. 2% sounds like quite a lot for this change. What kind of document would benefit from this, so I can try to verify the improvement?

nnethercote · 2014-07-07T23:42:10Z

That's right; if a bigger buffer is needed it will be allocated on demand.

Any document that uses CCITTFaxStreams heavily would be a good test, because ensureBuffer() is called for every input byte in such streams. I tested with http://njn.valgrind.org/Decontamination.pdf.

CodingFabian · 2014-07-09T19:04:11Z

I like that change.
I can confirm the improvement (using pages 1-200 of your document, in 10 rounds):

browser	stat	Count	Baseline(ms)	Current(ms)	+/-	%	Result(P<.05)
chrome35	Overall	2000	141	137	-4	-2.71	faster
chrome35	Page Request	2000	3	3	0	4.83
chrome35	Rendering	2000	138	134	-4	-2.88	faster

Snuffleupagus · 2014-07-13T08:20:09Z

/botio test

pdfjsbot · 2014-07-13T08:20:11Z

From: Bot.io (Linux)

Received

Command cmd_test from @Snuffleupagus received. Current queue size: 0

Live output at: http://107.21.233.14:8877/984acca3150d725/output.txt

pdfjsbot · 2014-07-13T08:20:11Z

From: Bot.io (Windows)

Received

Command cmd_test from @Snuffleupagus received. Current queue size: 0

Live output at: http://107.22.172.223:8877/6c5380298dfb162/output.txt

pdfjsbot · 2014-07-13T08:56:59Z

From: Bot.io (Linux)

Success

Full output at http://107.21.233.14:8877/984acca3150d725/output.txt

Total script time: 36.79 mins

Font tests: Passed
Unit tests: Passed
Regression tests: Passed

Improve how DecodeStream handles empty buffers.

Snuffleupagus · 2014-07-13T10:13:14Z

Thanks for the patch!

Snuffleupagus added a commit that referenced this pull request Jul 13, 2014

Merge pull request #5025 from nnethercote/share-zero-length-buffers

0237d50

Improve how DecodeStream handles empty buffers.

Snuffleupagus merged commit 0237d50 into mozilla:master Jul 13, 2014

nnethercote deleted the share-zero-length-buffers branch August 6, 2014 00:42
