Crates.io #67

Open
nuxeh opened this issue Oct 21, 2018 · 4 comments
Labels
bug Something isn't working

Comments

@nuxeh
Owner

nuxeh commented Oct 21, 2018

https://crates.io/crates/webpage
https://crates.io/crates/openssl
and probably others...

Requests for these pages produce 404s:

ERROR Inner { kind: ClientError(404), url: Some("https://crates.io/crates/openssl") }

Curling the page produces what looks like JSON:

$ curl -s https://crates.io/crates/openssl
{"errors":[{"detail":"Not Found"}]}

nuxeh added the bug label Oct 21, 2018

@benjamb
Contributor

benjamb commented Oct 22, 2018

Upstream bug: rust-lang/crates.io#788

@benjamb
Contributor

benjamb commented Oct 22, 2018

tl;dr: specify that the content we want is (text/)html. If a <noscript> tag is present, it's because we don't support JavaScript. Could maybe make use of https://github.com/jonhoo/fantoccini?
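
A minimal sketch of what "specify the content we want as (text/)html" could look like on the wire, using only the Rust standard library. The function name and its `host` and `path` parameters are made up for illustration, and the bot's actual HTTP client is not shown in this thread. With a real client, the equivalent would be setting an `Accept: text/html` header on the request.

```rust
/// Builds a plain HTTP/1.1 GET request that explicitly asks for HTML.
/// `build_html_request`, `host` and `path` are hypothetical names,
/// not taken from the bot's code.
fn build_html_request(host: &str, path: &str) -> String {
    format!(
        "GET {path} HTTP/1.1\r\n\
         Host: {host}\r\n\
         Accept: text/html\r\n\
         Connection: close\r\n\
         \r\n",
        path = path,
        host = host,
    )
}
```

Calling `build_html_request("crates.io", "/crates/openssl")` yields the request text only. Since crates.io is served over HTTPS, it would still need to be sent through a TLS-capable client.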

@nuxeh
Owner Author

nuxeh commented Oct 22, 2018

Interesting stuff, but seems like it relies on a running (possibly headless) instance of a web browser? Could be heavy.

@benjamb
Contributor

benjamb commented Oct 22, 2018

@nuxeh Hrm, I didn't escape the tag I was referring to (<noscript>). Aye, it could be heavy, though it would only be needed in the noscript case. I think just specifying the content we want as html should at least cause the bot to output something.
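
The "only in the case of noscript" idea could be sketched roughly as below: inspect the fetched body and fall back to a browser-backed fetch only when a `<noscript>` tag shows up. This is an assumption about how the check might be structured, not the bot's code. `FetchPlan` and `plan_for_body` are hypothetical names, and the JSON check simply matches the `{"errors":...}` shape seen in the curl output earlier in the thread.

```rust
/// What to do with a fetched response body. All names are hypothetical.
#[derive(Debug, PartialEq)]
enum FetchPlan {
    /// Body looks like ordinary HTML; parse it directly.
    ParseHtml,
    /// Body carries a <noscript> tag, so the page likely needs JavaScript
    /// and a browser-backed fetch (e.g. via fantoccini) may be required.
    NeedsBrowser,
    /// Body looks like a JSON error such as {"errors":[...]}.
    JsonError,
}

/// Chooses a plan from the raw body text using simple string checks.
fn plan_for_body(body: &str) -> FetchPlan {
    if body.trim_start().starts_with("{\"errors\"") {
        FetchPlan::JsonError
    } else if body.to_ascii_lowercase().contains("<noscript") {
        // Case-insensitive match, since HTML tag names are not case-sensitive.
        FetchPlan::NeedsBrowser
    } else {
        FetchPlan::ParseHtml
    }
}
```

A substring check like this is deliberately crude. It keeps the heavy browser path out of the common case, at the cost of occasionally flagging pages that include `<noscript>` but render fine without JavaScript.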
