Better error handling for scraper #222

ericyoondotcom · 2024-02-05T08:38:42Z

When the scraper encounters any sort of error (e.g. a blank HTTP response), the whole script hangs. Add some exception handling so the script can continue and just skip over that one page.

Also, from #223

because the different parts of the scraper mostly run asynchronously, errors are not being properly thrown from the appropriate thread. This leads to errors that can be difficult to debug, because the apparent cause of a crash will be the fact that one thread doesn't return a students list for example, when in fact the actual root cause is something more specific that happened in that thread several thousand lines of logs ago.

ErikBoesen · 2024-02-05T19:47:52Z

To clarify, I believe this is in reference to the Departmental portion of the scraper specifically.

ericyoondotcom added good first issue Good for newcomers scraper labels Feb 5, 2024

ErikBoesen added the scraper-departmental label Feb 5, 2024

ericyoondotcom added the will be fixed during rewrite label Oct 28, 2024

ericyoondotcom mentioned this issue Oct 28, 2024

Throw errors more gracefully from scraper #223

Closed

ericyoondotcom removed the scraper-departmental label Oct 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better error handling for scraper #222

Better error handling for scraper #222

ericyoondotcom commented Feb 5, 2024 •

edited

Loading

ErikBoesen commented Feb 5, 2024

Better error handling for scraper #222

Better error handling for scraper #222

Comments

ericyoondotcom commented Feb 5, 2024 • edited Loading

ErikBoesen commented Feb 5, 2024

ericyoondotcom commented Feb 5, 2024 •

edited

Loading