Hi, is there a way to prevent crawling other domains when a URL redirects to a different domain? Right now, if the crawler hits a URL that redirects to a different domain, it proceeds to crawl that domain as well, even though I use a strategy for enqueueing links. For example, www.somelink.com/github redirects to their GitHub profile, which leads to crawling every URL on that page and then to endless crawling of GitHub. My code:
Thanks in advance
Replies: 1 comment
I made a workaround by putting the enqueue links call in a conditional block, so that links are only enqueued if the domain is the same.
The crawler still follows redirects to URLs outside the original domain, but it doesn't start crawling any URLs under those other domains, which is sufficient.
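
For anyone landing here later, the workaround could look roughly like the sketch below. It is not the exact code from my project: `sameHostname`, `PageContext`, and `handlePage` are hypothetical names, and the `enqueueLinks` callback just stands in for whatever your crawler provides. The idea is to compare the hostname of the start URL with the hostname of the page that actually loaded after redirects, and skip enqueueing when they differ.

```typescript
// Hypothetical helper (not part of any library): true when both URLs
// share the same hostname, ignoring a leading "www.".
function sameHostname(startUrl: string, loadedUrl: string): boolean {
    const normalize = (u: string): string =>
        new URL(u).hostname.replace(/^www\./, "");
    return normalize(startUrl) === normalize(loadedUrl);
}

// Assumed shape of what the crawler hands to a page handler.
interface PageContext {
    startUrl: string;                  // URL the crawl started from
    loadedUrl: string;                 // URL after any redirects
    enqueueLinks: () => Promise<void>; // stand-in for the crawler's enqueue call
}

// Enqueue links only when the loaded page is still on the original domain.
// Returns true if links were enqueued, false if the page was skipped.
async function handlePage(ctx: PageContext): Promise<boolean> {
    if (!sameHostname(ctx.startUrl, ctx.loadedUrl)) {
        return false; // redirected off-site: process nothing further
    }
    await ctx.enqueueLinks();
    return true;
}
```

Note that this compares exact hostnames (minus `www.`), so other subdomains of the same site count as a different domain; loosen the check if you want to follow those too.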