Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NGA部分版面的帖子无法获得 #17303

Open
1 task done
Bianca2000 opened this issue Oct 25, 2024 · 3 comments
Open
1 task done

NGA部分版面的帖子无法获得 #17303

Bianca2000 opened this issue Oct 25, 2024 · 3 comments
Labels
anti-crawler The site have strict anti-crawler policies

Comments

@Bianca2000
Copy link

Routes

/nga/post/:tid/:authorId?

Full routes

/nga/post/38508595

Related documentation

https://docs.rsshub.app/zh/routes/bbs#nga

What is expected?

獲取帖子內容

What is actually happening?

FetchError: [GET] "https://nga.178.com/read.php?tid=38508595&page=3656&rand=440.52058106010605#": 403 Forbidden

Deployment information

Self-hosted

Deployment information (for self-hosted)

https://rss.peachyjoy.top/

Additional info

Service Unavailable

This is not a duplicated issue

  • I have searched existing issues to ensure this bug has not already been reported
@Bianca2000 Bianca2000 added the RSS bug Something isn't working label Oct 25, 2024
Copy link
Contributor

Searching for maintainers:

To maintainers: if you are not willing to be disturbed, list your username in scripts/workflow/test-issue/call-maintainer.js. In this way, your username will be wrapped in an inline code block when tagged so you will not be notified.

If all routes can not be found, the issue will be closed automatically. Please use NOROUTE for a route-irrelevant issue or leave a comment if it is a mistake.
如果所有路由都无法匹配,issue 将会被自动关闭。如果 issue 和路由无关,请使用 NOROUTE 关键词,或者留下评论。我们会重新审核。

@pseudoyu pseudoyu added the anti-crawler The site have strict anti-crawler policies label Oct 29, 2024
@pseudoyu
Copy link
Collaborator

The route works well and the 403 error is caused by the nga anti-crawler/restrictions. May be you can decrease the fetching interval and try again.

@pseudoyu pseudoyu removed the RSS bug Something isn't working label Oct 29, 2024
@RikaCelery
Copy link

RikaCelery commented Nov 12, 2024

虽然但是你给的那个帖子可以正常访问

我在测试/nga/post/41312589的时候遇到了这个问题

只有rsshub没法正常获取内容
curl/burp suite/chrome啥的都没问题
burp suite拦截请求再发送的时候也没问题
但是如果不拦截请求直接放行就403
image
感觉是HTTP/2的神奇问题,重试一次就能正常访问了
你可以修改这里lib/utils/ofetch.ts让ofetch自动重试403页面,我这里重试到第二次就可以正常访问了

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
anti-crawler The site have strict anti-crawler policies
Projects
None yet
Development

No branches or pull requests

3 participants