Process subresource link headers #1409

noamr · 2022-03-09T06:49:09Z

In conjunction with whatwg/html#7691
(see there for implementers/tests etc)

At least two implementers are interested (and none opposed):
- …
- …
Tests are written and can be reviewed and commented upon at:
- …
Implementation bugs are filed:
- Chrome: …
- Firefox: …
- Safari: …
- Deno (not for CORS changes): …

(See WHATWG Working Mode: Changes for more details.)

💥 Error: 500 Internal Server Error 💥

PR Preview failed to build. (Last tried on Mar 9, 2022, 3:32 PM UTC).

More

PR Preview relies on a number of web services to run. There seems to be an issue with the following one:

🚨 CSS Spec Preprocessor - CSS Spec Preprocessor is the web service used to build Bikeshed specs.

🔗 Related URL

If you don't have enough information above to solve the error by yourself (or to understand to which web service the error is related to, if any), please file an issue.

annevk · 2022-03-10T13:01:51Z

Do we only test this for font resources currently? I saw https://github.com/web-platform-tests/wpt/blob/master/preload/link-header-on-subresource.html linked from the HTML PR.

This would also mean that pretty much any request you make can result in further requests, including a simple fetch(). Is that really what is implemented?

Also, it seems this should be processed as part of the same task that will run "process response". I don't think we want to introduce another moment in time for this.

noamr · 2022-03-10T13:15:16Z

Do we only test this for font resources currently? I saw https://github.com/web-platform-tests/wpt/blob/master/preload/link-header-on-subresource.html linked from the HTML PR.

I will add more, you are right.

This would also mean that pretty much any request you make can result in further requests, including a simple fetch(). Is that really what is implemented?

I need to double check. I believe so.

Also, it seems this should be processed as part of the same task that will run "process response". I don't think we want to introduce another moment in time for this.

OK

annevk · 2022-03-11T11:00:30Z

I see, I raised this before in w3c/preload#148 but that got closed by whatwg/html#7622 which doesn't quite address this. (E.g., I think as defined a Link header on a style sheet creates different kind of fetches (with respect to Referer for instance) than subresources the style sheet might link.)

I'm rather concerned about a subresource fetch resulting in further fetches. That's simply not a side effect you'd expect when pulling a resource from elsewhere. @yoavweiss do you know where this feature got discussed before including the security implications, how this should relate to CSP, Referer headers, etc?

cc @domenic

noamr · 2022-03-11T12:45:31Z

I see, I raised this before in w3c/preload#148 but that got closed by whatwg/html#7622 which doesn't quite address this. (E.g., I think as defined a Link header on a style sheet creates different kind of fetches (with respect to Referer for instance) than subresources the style sheet might link.)

I'm rather concerned about a subresource fetch resulting in further fetches. That's simply not a side effect you'd expect when pulling a resource from elsewhere. @yoavweiss do you know where this feature got discussed before including the security implications, how this should relate to CSP, Referer headers, etc?

I would expect a stylesheet I fetch to import other stylesheets and continue to do so recursively, same with scripts... I don't see what makes link headers different in that regard.

annevk · 2022-03-11T12:50:57Z

Right, for style sheets and scripts the main concern I have is around fetching logic (do all the bits and in particular Referer end up being set correctly). For fetch() (and also <img>) the concern is more about application security. In that there are now side effects where previously there were none.

noamr · 2022-03-11T13:07:14Z

Right, for style sheets and scripts the main concern I have is around fetching logic (do all the bits and in particular Referer end up being set correctly). For fetch() (and also <img>) the concern is more about application security. In that there are now side effects where previously there were none.

Though that's less specific to the recursive aspect, isn't it? An image with a link header would have a side-effect regardless of it being recursive.
Perhaps resources should only be allowed to trigger preloads to the kinds of resources it might end up fetching? (maybe this means that only styles & scripts can have preload headers)

noamr · 2022-03-13T08:22:17Z

I've added tests.

The current situation:

Both Safari & Chrome support recursive subresource link headers
Safari doesn't support subresource link headers on fonts
Firefox doesn't support subresource link headers (the current relevant test fails)

I'd love to be able to reach some consensus on how to proceed with this.

yoavweiss · 2022-03-14T07:30:46Z

@yoavweiss do you know where this feature got discussed before including the security implications, how this should relate to CSP, Referer headers, etc?

@annevk - this was discussed at the time (~2015, IIRC), but no particular concerns were raised. (some concerns were raised later)
If there are security/privacy issues with this, we can rediscuss. May be interesting to see how often this is used in Chromium, but in any case, breaking this is unlikely to result in compat issues, as Link headers can't define load/error event handlers.

With regards to why this is supported, I can see a clear use case for active content preloading depedent subresources (e.g. a script loading a dependent script it knows it'll need, or a CSS preloading a dependent BG image of font). I see less of a use case for passive content (e.g. images), so would be more open to disabling preloads there.

noamr · 2022-03-14T12:01:01Z

@yoavweiss do you know where this feature got discussed before including the security implications, how this should relate to CSP, Referer headers, etc?

@annevk - this was discussed at the time (~2015, IIRC), but no particular concerns were raised. (some concerns were raised later) If there are security/privacy issues with this, we can rediscuss. May be interesting to see how often this is used in Chromium, but in any case, breaking this is unlikely to result in compat issues, as Link headers can't define load/error event handlers.

With regards to why this is supported, I can see a clear use case for active content preloading depedent subresources (e.g. a script loading a dependent script it knows it'll need, or a CSS preloading a dependent BG image of font). I see less of a use case for passive content (e.g. images), so would be more open to disabling preloads there.

I can see how this would be a security/privacy concern. Thinking of a document with no-cors images only and no CSP, images might generate link headers to CORS resources, causing unexpected fetches and perhaps exfiltration.

yoavweiss · 2022-04-08T07:33:16Z

@annevk - do you agree with my argument that this is fine for active resources (e.g. scripts, styles)? If so, that could be a path forward and I can see if there's implementation interest on the Chromium side to remove that support for passive resources.

noamr · 2022-04-09T07:34:17Z

@annevk - do you agree with my argument that this is fine for active resources (e.g. scripts, styles)? If so, that could be a path forward and I can see if there's implementation interest on the Chromium side to remove that support for passive resources.

I can see how we would specify it as a map of which request destination can load which other request destinations.
Something like:
script / '' (fetch/XHR etc)-> any
style -> style / font / image

annevk · 2022-04-20T11:20:52Z

@yoavweiss I think it might be possible to make that split, yeah. See #1409 (comment) for the remaining issues.

Perhaps that also argues for making the processing happen on the endpoints and not in Fetch directly?

noamr · 2022-04-20T11:22:56Z

@yoavweiss I think it might be possible to make that split, yeah. See #1409 (comment) for the remaining issues.

Perhaps that also argues for making the processing happen on the endpoints and not in Fetch directly?

Yes, maybe fetch doesn't need to be involved at all. Will play around with doing this at the caller sites

yoavweiss · 2022-04-20T11:25:33Z

@annevk - good point in the comment above about setting the CSS resource as the Referer in case of resources preloaded as Link headers on style resources. I don't think that's how this is currently implemented, but it probably should.

noamr · 2022-04-20T11:36:57Z

So it's mainly the call sites for script/style. But I wonder about the call site at fetch / xhr - it's plausible that a fetch() call would have a side effect of fetching other resources - but do we consider it a subresource or should we keep it simple with only scripts / styles? Maybe this can be done iteratively

annevk · 2022-04-20T11:50:07Z

I don't think fetch() (or XMLHttpRequest) should end up fetching other resources besides the one requested (except perhaps as part of a protocol feature, such as H/2 Push). That seems rather unexpected.

yoavweiss · 2022-04-20T11:50:44Z

Yeah, fetch() is a theoretical use case, but I'm not sure I have clear examples of it that can't be resolved by having the script that triggers the first fetch simply trigger other fetches in parallel.

noamr · 2022-04-20T14:13:25Z

Yeah, fetch() is a theoretical use case, but I'm not sure I have clear examples of it that can't be resolved by having the script that triggers the first fetch simply trigger other fetches in parallel.

I was thinking of scenarios where you fetch() the script before executing it, fetch a JSON file that's going to eventually have side-effects that would have you load other resources, have a binary response for WASM that's going to end up downloading more resources in the future etc.

But I agree that starting with scripts & style is good (and having module scripts serve subresource modulepreload)

annevk · 2022-04-20T15:50:48Z

To be clear, I think fetch() acting on Link headers in such a way would be a rather egregious layering violation. Doing it for a select set of endpoints that already can end up resulting in multiple fetches is quite reasonable however.

noamr · 2022-04-20T16:18:30Z

To be clear, I think fetch() acting on Link headers in such a way would be a rather egregious layering violation. Doing it for a select set of endpoints that already can end up resulting in multiple fetches is quite reasonable however.

I'm not sure it's a layering violation but I also don't think it's a strong enough use case, so the end result is a consensus AFAIC :)

domenic · 2022-04-21T18:41:59Z

fetch.bs

@@ -4272,6 +4273,11 @@ steps:
  </ol>
 </li>

+ <li><p>If <var>request</var> is a <a>subresource request</a> and <var>request</var>'s
+ <a for=request>window</a> is an <a>environment settings object</a>, then
+ <a>queue a fetch task</a> to run <span>process subresource link headers</span> given


~~"process subresource link headers" is not defined anywhere.~~ I see, it is in the HTML PR. Relatedly, span does not cause cross-linking in Bikeshed specs.

I am going to do an overhaul of this PR, ignore for now.

Any idea when this will be overhauled?

I'm planning to do a follow-up change: I'd like to pass to the "process subresource link headers" algorithm whether the initiator subresource is render-blocking or not, so that we can create a render-blocking chain. It will depend on how this PR proceeds, though.

I will get to it once whatwg/html#7866 is in.
Also, I'm not sure how well tested the current support for blocking is in link headers, and I'm sure it's not spec'ed for early hints. I'm not sure if you wanted your follow up to include early hints, but if you do, first early hints have to support blocking. I would suggest to have the WPTs ready for those use-cases before we get to subresources.

@xiaochengh do/can you have an open issue for the render-blocking chain?
I have some questions/doubts about it, it would save us time later if we could discuss them now :)

I'll file an issue for render-blocking chain, and also see how it should work with early hints. Thanks for the note!

I filed whatwg/html#7899

Regarding early hints: I think blocking should be ignored on early hints, because there's no document to block when early hints are processed. It suffices as long as the actual response makes the document block on the early-hint-preloaded resources.

noamr · 2022-04-24T16:56:21Z

Thinking about this again, I believe this should be done inside fetch and not at the call sites, as this should also work with preloads - <link rel=preload as=style> where the response has link headers should work the same as <link rel=stylesheet> and @import in CSS, and putting this in each of the call sites would require something like "fetch stylesheet" and above fetch that preload knows about.

Also, we need to be careful not to allow preload semantics that styles are not capable of creating themselves - e.g. the blocking attribute or loading fonts with a crossorigin attribute that's not anonymous.

annevk · 2022-04-26T14:48:04Z

Sigh.

So that means preload is not a generic fetch, but would be a highly-specific fetch based on as. That also argues for reverting whatwg/html#7799 I think.

I think it would actually be better if we had "fetch a style sheet" and "fetch a script" wrappers if we decided to go down that road. Offloading all the complexity into Fetch isn't great long term.

cc @domenic @hiroshige-g

domenic · 2022-04-26T14:56:54Z

Yeah, I am really surprised at the claims like

should work the same as <link rel=stylesheet> and @import in CSS,

and

we need to be careful not to allow preload semantics that styles are not capable of creating themselves

Why can't we make Link headers a generic fetch primitive? Why do they need these constraints?

annevk · 2022-04-26T15:06:40Z

@domenic see above, all fetches acting on Link headers would be a layering violation. That's not what you want for a low-level primitive. Link headers are an application concern.

domenic · 2022-04-26T15:32:22Z

I don't think I really bought that argument, but even if we go that direction, I don't understand why it has to be so type-specific. Basically I would not expect the details of CSS to leak into rel=preload; I would expect them to be in something like rel=stylesheetpreload, but not rel=preload.

noamr · 2022-04-26T15:50:01Z

I don't think I really bought that argument, but even if we go that direction, I don't understand why it has to be so type-specific.

I agree that you wouldn't expect that <img src=""> would have the side effect of sending additional fetches.

Basically I would not expect the details of CSS to leak into rel=preload; I would expect them to be in something like > rel=stylesheetpreload, but not rel=preload.

Yes I can see that. Perhaps we could start by allowing this in modulepreload and defer the stylesheet issue. @yoavweiss?

domenic · 2022-04-26T15:54:31Z

To be clear, my proposal (not sure if it's helpful) is that rel=preload not have any restrictions. blocking="" always works, crossorigin="" can be any value, etc. If this allows you to create fetches you could not create with @import or @font-face, that's fine. Maybe that'll cause some cache misses, oh well.

noamr · 2022-04-26T16:00:19Z

To be clear, my proposal (not sure if it's helpful) is that rel=preload not have any restrictions. blocking="" always works, crossorigin="" can be any value, etc. If this allows you to create fetches you could not create with @import or @font-face, that's fine. Maybe that'll cause some cache misses, oh well.

I see, I'm Ok with this. But are you proposing to also allow that for things that are not styles/scripts?

domenic · 2022-04-26T16:15:09Z

But are you proposing to also allow that for things that are not styles/scripts?

My preference would be to allow it for all fetches, but I understand @annevk is against it, and I don't feel strongly. (Especially since it's always easier to start conservative.)

So the question is which fetches allow it. I can think of a few options:

Everything but fetch()
Navigations + <link> + <script>
Navigations + <link rel=preload as=stylesheet> + <link rel=stylesheet>
... various other permutations ...

I'm not sure which option the various participants in this discussion want to go for. Or what is most useful. Or how that plays out in terms of spec layering. But maybe nailing that down is the next step? I guess you made an initial proposal at #1409 (comment) but I'm not sure if everyone got on board with that... apologies if they did and I'm just confusing matters.

noamr · 2022-04-26T16:57:17Z

But are you proposing to also allow that for things that are not styles/scripts?

My preference would be to allow it for all fetches, but I understand @annevk is against it, and I don't feel strongly. (Especially since it's always easier to start conservative.)

So the question is which fetches allow it. I can think of a few options:

Everything but fetch()

Navigations + <link> + <script>

Navigations + <link rel=preload as=stylesheet> + <link rel=stylesheet>

... various other permutations ...

I'm not sure which option the various participants in this discussion want to go for. Or what is most useful. Or how that plays out in terms of spec layering. But maybe nailing that down is the next step? I guess you made an initial proposal at #1409 (comment) but I'm not sure if everyone got on board with that... apologies if they did and I'm just confusing matters.

I can go with an option where if the destination of the request is script it can process any link header, and if it's style it can process any as=font/as=img/as=style link header, allowing all the link semantics. It's not more layer-violating than CSP as it only deals with request destinations.

xiaochengh · 2022-04-26T21:52:44Z

Just a side note that this will be very useful for blocking=render: If a stylesheet can render-blockingly preload its font faces, then we can fully eliminate font-caused FOUT and layout shifts; And this works particularly well for 3rd party font providers, in which case developers only have urls of the font stylesheets but not the exact font files.

So I would love to see this getting landed soon.

domenic · 2022-04-28T19:05:35Z

I can go with an option where if the destination of the request is script it can process any link header, and if it's style it can process any as=font/as=img/as=style link header, allowing all the link semantics. It's not more layer-violating than CSP as it only deals with request destinations.

OK, so concretely, Fetch would contain this logic, which dispatches to HTML's "process link headers for subresources" which just assumes that if it's called it's allowed to do full Link processing. (Maybe it doesn't even need to be subresource-specific.)

That sounds pretty elegant to me. @annevk are you on board?

noamr · 2022-04-29T07:44:33Z

I can go with an option where if the destination of the request is script it can process any link header, and if it's style it can process any as=font/as=img/as=style link header, allowing all the link semantics. It's not more layer-violating than CSP as it only deals with request destinations.

OK, so concretely, Fetch would contain this logic, which dispatches to HTML's "process link headers for subresources" which just assumes that if it's called it's allowed to do full Link processing. (Maybe it doesn't even need to be subresource-specific.)

process link headers for subresources would need a list of allowed destinations, but otherwise that's the idea.
Perhaps this can still be totally inside HTML, make this check on style/script/preload-as-script/style response (with a suitable as). It does create some exception to the rule that preload is network only, but maybe that's OK.

annevk · 2022-04-29T07:47:52Z

Can the processing of these Link elements have any other side effects that need to be dealt with by the original caller of fetch? E.g., for documents Link: rel=stylesheet affects document.styleSheets. Based on that, it seems better if the caller of fetch is responsible for processing these, as they are not really a pure networking feature.

noamr · 2022-04-29T07:52:36Z

Can the processing of these Link elements have any other side effects that need to be dealt with by the original caller of fetch? E.g., for documents Link: rel=stylesheet affects document.styleSheets. Based on that, it seems better if the caller of fetch is responsible for processing these, as they are not really a pure networking feature.

Right, they feel more like an HTML feature. I'll prepare an HTML patch to handle these at a few of the call sites.

noamr · 2022-04-29T09:22:47Z

I want whatwg/html#7866 to go in first, and to start relying on the "pending document" concept where all link header processing has a "document promise" of sorts which it awaits at the moment where it really needs a document.

Otherwise I need to keep special-casing early hints and it's getting a bit out of hand

annevk · 2023-02-08T12:55:32Z

@noamr am I correct in assuming that this can now be closed?

noamr · 2023-02-08T12:57:01Z

@noamr am I correct in assuming that this can now be closed?

No. The spec still doesn't deal with subresource link headers while implementations do something with them.
I am not currently pursuing this though, not sure how I feel about this feature and whether it's actually helpful.

noamr · 2023-02-08T13:35:49Z

Closing for now, if someone wants to pursue spec'ing this feature it would probably require a new PR anyway.

Process subresource link headers

c642815

noamr mentioned this pull request Mar 9, 2022

Process subresource link headers whatwg/html#7691

Closed

3 tasks

noamr closed this Mar 9, 2022

noamr reopened this Mar 9, 2022

noamr mentioned this pull request Mar 13, 2022

Multi-spec non-feature TODOs for web performance WG w3c/web-performance#38

Closed

20 tasks

annevk added security/privacy There are security or privacy implications addition/proposal New features or enhancements labels Mar 15, 2022

yoavweiss mentioned this pull request Apr 20, 2022

Test different scenarios with subresource Link headers web-platform-tests/wpt#33167

Open

noamr mentioned this pull request Apr 21, 2022

Putting an upper limit on preload/prefetch chains w3c/preload#130

Closed

domenic reviewed Apr 21, 2022

View reviewed changes

noamr closed this Feb 8, 2023

Process subresource link headers #1409

Process subresource link headers #1409

Conversation

noamr commented Mar 9, 2022 • edited by pr-preview bot Loading

💥 Error: 500 Internal Server Error 💥

annevk commented Mar 10, 2022

noamr commented Mar 10, 2022

annevk commented Mar 11, 2022

noamr commented Mar 11, 2022

annevk commented Mar 11, 2022

noamr commented Mar 11, 2022

noamr commented Mar 13, 2022

yoavweiss commented Mar 14, 2022

noamr commented Mar 14, 2022

yoavweiss commented Apr 8, 2022

noamr commented Apr 9, 2022 • edited Loading

annevk commented Apr 20, 2022

noamr commented Apr 20, 2022

yoavweiss commented Apr 20, 2022

noamr commented Apr 20, 2022

annevk commented Apr 20, 2022

yoavweiss commented Apr 20, 2022

noamr commented Apr 20, 2022

annevk commented Apr 20, 2022

noamr commented Apr 20, 2022

domenic Apr 21, 2022 • edited Loading

Choose a reason for hiding this comment

noamr Apr 21, 2022

Choose a reason for hiding this comment

xiaochengh May 3, 2022

Choose a reason for hiding this comment

noamr May 4, 2022

Choose a reason for hiding this comment

noamr May 6, 2022 • edited Loading

Choose a reason for hiding this comment

xiaochengh May 6, 2022

Choose a reason for hiding this comment

xiaochengh May 6, 2022

Choose a reason for hiding this comment

noamr commented Apr 24, 2022

annevk commented Apr 26, 2022 • edited Loading

domenic commented Apr 26, 2022

annevk commented Apr 26, 2022

domenic commented Apr 26, 2022

noamr commented Apr 26, 2022

domenic commented Apr 26, 2022

noamr commented Apr 26, 2022

domenic commented Apr 26, 2022

noamr commented Apr 26, 2022

xiaochengh commented Apr 26, 2022

domenic commented Apr 28, 2022

noamr commented Apr 29, 2022

annevk commented Apr 29, 2022

noamr commented Apr 29, 2022

noamr commented Apr 29, 2022

annevk commented Feb 8, 2023

noamr commented Feb 8, 2023

noamr commented Feb 8, 2023

noamr commented Mar 9, 2022 •

edited by pr-preview bot

Loading

noamr commented Apr 9, 2022 •

edited

Loading

domenic Apr 21, 2022 •

edited

Loading

noamr May 6, 2022 •

edited

Loading

annevk commented Apr 26, 2022 •

edited

Loading