Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jobs export download #30

Open
wants to merge 8 commits into
base: flask-cors-fix
Choose a base branch
from
Open

Conversation

Miclin1024
Copy link
Contributor

This PR integrates and extends the previous download feature to use a more structured output format. Including a .csv file with metadata of all the URLs scraped, and files/ and images/ subdirectories for larger media files. The PR also includes several bug fixes within the Scrapy pipeline and GridFS.

@Miclin1024 Miclin1024 changed the base branch from main to flask-cors-fix April 13, 2022 08:20
@Miclin1024 Miclin1024 requested review from jhaber-zz and hwarnuh April 13, 2022 08:20
@Miclin1024 Miclin1024 requested a review from ThaiHipster April 13, 2022 20:36
Configurations for Conda Environment
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants