Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

【UK Parliament】只能抓到3条内容,网页上单页15条 #17106

Closed
KwToPA opened this issue Oct 13, 2024 · 7 comments · Fixed by #17129
Closed

【UK Parliament】只能抓到3条内容,网页上单页15条 #17106

KwToPA opened this issue Oct 13, 2024 · 7 comments · Fixed by #17129
Labels
RSS bug Something isn't working

Comments

@KwToPA
Copy link

KwToPA commented Oct 13, 2024

路由地址

/parliament.uk/commonslibrary/type/:topic?

完整路由地址

/parliament.uk/commonslibrary/type/:topic?

相关文档

https://docs.rsshub.app/routes/government#uk-parliament

预期是什么?

能够抓取更多的内容,按照时间倒序排列

实际发生了什么?

1、只能抓3条,Commonlibrary 和 House of Lords Library 都是这样
2、路由对应页面错误,两院全部都是 https://commonslibrary.parliament.uk/type ,指向type 这个错误页面

路由的内容摘录如下

<link>https://commonslibrary.parliament.uk/type</link>
<atom:link href="http://rsshub.app/parliament.uk/commonslibrary/type/data-dashboard" rel="self" type="application/rss+xml"/>

应该是https://commonslibrary.parliament.uk/type/data-dashboard/https://commonslibrary.parliament.uk/type/research-briefing/https://lordslibrary.parliament.uk/type/research-briefing/

部署

RSSHub 演示 (https://rsshub.app)

部署相关信息

No response

额外信息

麻烦 @AntiKnot

在网页中,我通过查看源代码,发现对应每个超链接的结构如下,搜 <div class="l-box l-box--no-border card__text"> 能搜到15个结果

<div class="l-box l-box--no-border card__text">

                                <p class="card__type">Data dashboards</p>

                    <h2 class="card__heading font-size-epsilon">
                <a class="card__link" href="https://commonslibrary.parliament.uk/constituency-data-lgbt-people-2021-census/">Constituency data: LGBT+ people</a>
            </h2>
        

        <p class="card__date">
            <time datetime="2024-09-20">Friday, 20 September, 2024</time>
        </p>

还有

    <div class="l-box l-box--no-border card__text">
        <h2 class="card__heading font-size-gamma">
            <a class="card__link" href="https://commonslibrary.parliament.uk/research-briefings/sn07158/">City Deals</a></h2>
        <p class="card__date"><time datetime="2024-10-11" >Friday, 11 October, 2024</time></p>


### 这不是重复的 issue

- [X] 我已经搜索了 [现有 issue](https://github.com/DIYgod/RSSHub/issues),以确保该错误尚未被报告。
@KwToPA KwToPA added the RSS bug Something isn't working label Oct 13, 2024

This comment has been minimized.

4 similar comments

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Copy link
Contributor

Searching for maintainers:
  • /parliament.uk/commonslibrary/type/:topic?: @AntiKnot

To maintainers: if you are not willing to be disturbed, list your username in scripts/workflow/test-issue/call-maintainer.js. In this way, your username will be wrapped in an inline code block when tagged so you will not be notified.

If all routes can not be found, the issue will be closed automatically. Please use NOROUTE for a route-irrelevant issue or leave a comment if it is a mistake.
如果所有路由都无法匹配,issue 将会被自动关闭。如果 issue 和路由无关,请使用 NOROUTE 关键词,或者留下评论。我们会重新审核。

@AntiKnot
Copy link
Contributor

@KwToPA

  1. 头版消息和历史消息,我没有进行解析拼接处理,只处理了最新的头版3条信息。
  2. 现有路由通过了测试,请给出相关的path,需要更详细的信息。
  3. 如果有type相关的错误,请给出相关信息。

本地正常 http://localhost:1200/parliament.uk/commonslibrary/type/data-dashboard
follow正常 rsshub://parliament.uk/commonslibrary/type/data-dashboard

@KwToPA
Copy link
Author

KwToPA commented Oct 14, 2024

@KwToPA

  1. 头版消息和历史消息,我没有进行解析拼接处理,只处理了最新的头版3条信息。
  2. 现有路由通过了测试,请给出相关的path,需要更详细的信息。
  3. 如果有type相关的错误,请给出相关信息。

本地正常 http://localhost:1200/parliament.uk/commonslibrary/type/data-dashboard follow正常 rsshub://parliament.uk/commonslibrary/type/data-dashboard

这三个路由

https://rsshub.app/parliament.uk/commonslibrary/type/research-briefing

https://rsshub.app/parliament.uk/commonslibrary/type/data-dashboard

https://rsshub.app/parliament.uk/lordslibrary/type/research-briefing

对应两个链接 <link>https://lordslibrary.parliament.uk/type</link><link>https://commonslibrary.parliament.uk/type</link>

其中一个例子见截图
image

实际的对应网页应该是这三个

https://commonslibrary.parliament.uk/type/data-dashboard/

https://commonslibrary.parliament.uk/type/research-briefing/

https://lordslibrary.parliament.uk/type/research-briefing/

这个type网站打不开
image

我用ttrss订阅,有个右键打开路由对应的网页,本来应该是打开上面三个,结果是后缀为type的这个网页。

另外 单页15个内容,只抓了三条,网页上应该只有样式的分别,不是历史内容和头版内容的分别,见图

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
RSS bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants