Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263) #8321

Snuffleupagus · 2017-04-20T08:24:59Z

Please refer to the individual commit messages.

Tentatively fixes #8263.

…nce the `URL` polyfill have made them redundant Also, this changes `createBlob` to throw when `Blob` isn't supported.

Snuffleupagus · 2017-04-20T08:26:56Z

/botio unittest

pdfjsbot · 2017-04-20T08:26:56Z

From: Bot.io (Linux)

Received

Command cmd_unittest from @Snuffleupagus received. Current queue size: 0

Live output at: http://107.21.233.14:8877/796b50f18362b4d/output.txt

pdfjsbot · 2017-04-20T08:26:57Z

From: Bot.io (Windows)

Received

Command cmd_unittest from @Snuffleupagus received. Current queue size: 0

Live output at: http://54.215.176.217:8877/b4df5fd0700655b/output.txt

pdfjsbot · 2017-04-20T08:29:58Z

From: Bot.io (Linux)

Success

Full output at http://107.21.233.14:8877/796b50f18362b4d/output.txt

Total script time: 3.04 mins

Unit Tests: Passed

pdfjsbot · 2017-04-20T08:33:30Z

From: Bot.io (Windows)

Success

Full output at http://54.215.176.217:8877/b4df5fd0700655b/output.txt

Total script time: 6.55 mins

Unit Tests: Passed

yurydelendik

Looks good with the change below

yurydelendik · 2017-04-20T15:41:22Z

web/ui_utils.js

-  if (typeof defaultFilename === 'undefined') {
-    defaultFilename = 'document.pdf';
+function getPDFFileNameFromURL(url, defaultFilename = 'document.pdf') {
+  if (url.indexOf('data:') === 0) {


Per https://tools.ietf.org/html/rfc3986#section-3.1 , schemes are case-insensitive. I recommend to create the isDataSchema(url) function above and also skip allowed whitespaces, something like:

var i = 0; while (i < url.length && url[i].trim() == '') i++; return url.substr(i, 5).toLowerCase() === 'data:';

Thank you for the review!
I've made the requested changes, and also added a few unit-tests for the whitespace case.

… reasons (issue 8263) The patch also changes the `defaultFilename` to use the ES6 default parameter notation, and fixes the formatting of the JSDoc comment. Finally, since `getPDFFileNameFromURL` currently has no unit-tests, a few basic ones are added to avoid regressions.

…er, instead of downloading them, when `PDFJS.disableCreateObjectURL = false` This prevents issues with the filename detection being skipped, when trying to download the opened PDF attachment, since `getPDFFileNameFromURL` ignores `data:` URLs for performance reasons.

Snuffleupagus · 2017-04-20T16:28:54Z

/botio unittest

pdfjsbot · 2017-04-20T16:28:54Z

From: Bot.io (Linux)

Received

Command cmd_unittest from @Snuffleupagus received. Current queue size: 0

Live output at: http://107.21.233.14:8877/458d3baa21701a5/output.txt

pdfjsbot · 2017-04-20T16:28:55Z

From: Bot.io (Windows)

Received

Command cmd_unittest from @Snuffleupagus received. Current queue size: 0

Live output at: http://54.215.176.217:8877/f9ccb988b4e6a21/output.txt

pdfjsbot · 2017-04-20T16:31:53Z

From: Bot.io (Linux)

Success

Full output at http://107.21.233.14:8877/458d3baa21701a5/output.txt

Total script time: 2.98 mins

Unit Tests: Passed

pdfjsbot · 2017-04-20T16:35:32Z

From: Bot.io (Windows)

Success

Full output at http://54.215.176.217:8877/f9ccb988b4e6a21/output.txt

Total script time: 6.62 mins

Unit Tests: Passed

Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263)

Remove the URL checks in the createObjectURL utility function, si…

3888a99

…nce the `URL` polyfill have made them redundant Also, this changes `createBlob` to throw when `Blob` isn't supported.

Snuffleupagus added the viewer label Apr 20, 2017

Snuffleupagus changed the title ~~Change getPDFFileNameFromURL to ignore data: URLs for performance… … reasons (issue 8263)~~ Change getPDFFileNameFromURL to ignore data: URLs for performance reasons (issue 8263) Apr 20, 2017

yurydelendik approved these changes Apr 20, 2017

View reviewed changes

Snuffleupagus added 2 commits April 20, 2017 18:21

Snuffleupagus merged commit c44fd3d into mozilla:master Apr 20, 2017

Snuffleupagus deleted the issue-8263 branch April 20, 2017 17:46

movsb pushed a commit to movsb/pdf.js that referenced this pull request Jul 14, 2018

Merge pull request mozilla#8321 from Snuffleupagus/issue-8263

80e1c47

Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263) #8321

Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263) #8321

Snuffleupagus commented Apr 20, 2017

Snuffleupagus commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

yurydelendik left a comment

yurydelendik Apr 20, 2017

Snuffleupagus Apr 20, 2017 •

edited

Loading

Snuffleupagus commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

Change getPDFFileNameFromURL to ignore data: URLs for performance reasons (issue 8263) #8321

Change getPDFFileNameFromURL to ignore data: URLs for performance reasons (issue 8263) #8321

Conversation

Snuffleupagus commented Apr 20, 2017

Snuffleupagus commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

From: Bot.io (Linux)

Received

pdfjsbot commented Apr 20, 2017

From: Bot.io (Windows)

Received

pdfjsbot commented Apr 20, 2017

From: Bot.io (Linux)

Success

pdfjsbot commented Apr 20, 2017

From: Bot.io (Windows)

Success

yurydelendik left a comment

Choose a reason for hiding this comment

yurydelendik Apr 20, 2017

Choose a reason for hiding this comment

Snuffleupagus Apr 20, 2017 • edited Loading

Choose a reason for hiding this comment

Snuffleupagus commented Apr 20, 2017

pdfjsbot commented Apr 20, 2017

From: Bot.io (Linux)

Received

pdfjsbot commented Apr 20, 2017

From: Bot.io (Windows)

Received

pdfjsbot commented Apr 20, 2017

From: Bot.io (Linux)

Success

pdfjsbot commented Apr 20, 2017

From: Bot.io (Windows)

Success

Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263) #8321

Change `getPDFFileNameFromURL` to ignore `data:` URLs for performance reasons (issue 8263) #8321

Snuffleupagus Apr 20, 2017 •

edited

Loading