Hi @anjackson. Things have changed a bit since my comment in December. Discovered URLs are not expected to have a nextFetchDate (the term is now refetchable_from_date); they are expected to be fetched as soon as possible, FIFO-style. Only updated ('known') URLs can have a refetchable_from_date, and they don't have to have one; if they don't, they are considered finished. You could, however, manage that with an external cron job that pushes an update for the URL with a fetch date to resuscitate it. Changing the priority of URLs at query time is indeed more flexible and could be done via metadata. Would that just be a matter of adding an optional param to GetParams to specify one or more metadata keys to be used to sort URLs within a queue? It doesn't solve the problem of whether backends support the functionality, but at least for those that do, it provides a mechanism.
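To make the proposal a bit more concrete, here is a rough sketch of what sorting within a queue by metadata keys could look like on the backend side. It is only an illustration, not the URL Frontier API: `QueuedURL`, `is_due`, `get_urls` and the `sort_keys` parameter are hypothetical names, metadata values are simplified to numbers, lower values are assumed to sort first, and URLs missing a key are assumed to sort last.

```python
from dataclasses import dataclass, field
from typing import Optional, Sequence


@dataclass
class QueuedURL:
    """Hypothetical in-memory stand-in for a URL held in one queue."""
    url: str
    discovered: bool  # True for newly discovered URLs, False for updated ('known') ones
    refetchable_from_date: Optional[int] = None  # epoch seconds; None on a known URL means finished
    metadata: dict = field(default_factory=dict)  # simplified: key -> number


def is_due(item: QueuedURL, now: int) -> bool:
    """Discovered URLs are always due; known URLs only once their date has passed."""
    if item.discovered:
        return True
    if item.refetchable_from_date is None:
        return False  # known URL without a date: considered finished
    return item.refetchable_from_date <= now


def get_urls(queue: Sequence[QueuedURL], now: int,
             sort_keys: Sequence[str] = (), max_urls: int = 10) -> list:
    """Return due URLs, optionally ordered by the given metadata keys.

    Without sort_keys the queue order (FIFO) is kept. With sort_keys,
    list.sort is stable, so ties still come out in FIFO order.
    """
    due = [item for item in queue if is_due(item, now)]
    if sort_keys:
        def sort_key(item: QueuedURL):
            # (missing?, value) per key: items lacking a key sort after the others
            return tuple((key not in item.metadata, item.metadata.get(key, 0))
                         for key in sort_keys)
        due.sort(key=sort_key)
    return due[:max_urls]
```

A backend that cannot sort could simply ignore the optional keys and fall back to plain FIFO, which is the behaviour of `get_urls` when `sort_keys` is empty.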
Copied from @anjackson, "Some feedback on the URL Frontier API" (#11), and from #2.