-
Notifications
You must be signed in to change notification settings - Fork 10.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Text search API #819
Comments
I'm wondering how this will look from an infrastructure point of view. We can reuse the IR extracted by the PartialEvaluator and look for Calling such a |
That sounds good. It'd be nice if we could offer a more friendly wrapper in How about |
We have getTextContent (see https://github.com/mozilla/pdf.js/blob/master/src/api.js#L394). Closing as resolved |
@notmasteryet As I understand it you'll be tackling text search soon. @hubgit and I have been talking about extracting text from PDFs, and it'd be nice if we have an API for that.
Does it make sense to build the search feature on top of a
PDFDoc()
API that extracts the text from each individual page?@hubgit Feel free to chime in on your needs here.
The text was updated successfully, but these errors were encountered: