Produce per-scraper-version, per-website parsing success statistics #21

jayaddison · 2020-07-23T09:32:02Z

Is your feature request related to a problem? Please describe.
The crawler and underlying recipe-scrapers codebases will evolve over time, and as with any software project, bugs may be introduced or fixed over time.

In addition, the content of recipe websites may change over time too, as websites decide to reformat their contents or rebrand their page look-and-feel.

It would be useful to continuously track the performance of scraper versions against real recipe website content.

Describe the solution you'd like
It should be possible to record historical statistics regarding the success/failure rate of recipe content crawling.

It should be possible to break this down by crawler version, by recipe-scrapers version, by website and also by time interval.

This data should be exposed via the diagnostics service and made available via the corresponding diagnostics component of the frontend application.

Describe alternatives you've considered
Real-time alerting on crawler failures (per-domain and overall) could also be beneficial, but is a slightly different use case and can be considered separately.

The text was updated successfully, but these errors were encountered:

jayaddison · 2022-11-05T17:12:37Z

Resolving in favour of openculinary/tardir#1.

jayaddison added the enhancement New feature or request label Jul 23, 2020

jayaddison mentioned this issue Jul 23, 2020

Handle additional ISO8601 date parsing case hhursev/recipe-scrapers#192

Merged

jayaddison closed this as completed Nov 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Produce per-scraper-version, per-website parsing success statistics #21

Produce per-scraper-version, per-website parsing success statistics #21

jayaddison commented Jul 23, 2020

jayaddison commented Nov 5, 2022

Produce per-scraper-version, per-website parsing success statistics #21

Produce per-scraper-version, per-website parsing success statistics #21

Comments

jayaddison commented Jul 23, 2020

jayaddison commented Nov 5, 2022