Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(route): add query keyword parse of cool paper #17894

Merged
merged 17 commits into from
Dec 25, 2024
Merged
6 changes: 3 additions & 3 deletions lib/routes/papers/index.ts
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ export const handler = async (ctx) => {

const rootUrl = 'https://papers.cool';
const currentUrl = new URL(category, rootUrl).href;
const feedUrl = new URL(`${category}/feed`, rootUrl).href;
const feedUrl = new URL(`arxiv/${category}/feed`, rootUrl).href;

const site = category.split(/\//)[0];
const apiKimiUrl = new URL(`${site}/kimi?paper=`, rootUrl).href;
Expand Down Expand Up @@ -76,15 +76,15 @@ export const handler = async (ctx) => {
};

export const route: Route = {
path: '/:category{.+}?',
path: '/arxiv/:category{.+}?',
name: 'Topic',
url: 'papers.cool',
maintainers: ['nczitzk', 'Muyun99'],
handler,
example: '/papers/arxiv/cs.AI',
parameters: { category: 'Category, arXiv Artificial Intelligence (cs.AI) by default' },
description: `:::tip
If you subscribe to [arXiv Artificial Intelligence (cs.AI)](https://papers.cool/arxiv/cs.AI)where the URL is \`https://papers.cool/arxiv/cs.AI\`, extract the part \`https://papers.cool/\` to the end, and use it as the parameter to fill in. Therefore, the route will be [\`/papers/arxiv/cs.AI\`](https://rsshub.app/papers/arxiv/cs.AI).
If you subscribe to [arXiv Artificial Intelligence (cs.AI)](https://papers.cool/arxiv/cs.AI), where the URL is \`https://papers.cool/arxiv/cs.AI\`, extract the part \`https://papers.cool/\` to the end, and use it as the parameter to fill in. Therefore, the route will be [\`/papers/arxiv/cs.AI\`](https://rsshub.app/papers/arxiv/cs.AI).
:::

| Category | id |
Expand Down
118 changes: 118 additions & 0 deletions lib/routes/papers/query.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,118 @@
import { Route } from '@/types';
import { getCurrentPath } from '@/utils/helpers';
const __dirname = getCurrentPath(import.meta.url);

import { parseDate } from '@/utils/parse-date';
import { art } from '@/utils/render';
import path from 'node:path';
import parser from '@/utils/rss-parser';

const pdfUrlGenerators = {
arxiv: (id: string) => `https://arxiv.org/pdf/${id}.pdf`,
};

export const handler = async (ctx) => {
const { keyword = 'query/Detection' } = ctx.req.param();
const limit = ctx.req.query('limit') ? Number.parseInt(ctx.req.query('limit'), 10) : 150;

const rootUrl = 'https://papers.cool';
const query = keyword.split(/\//)[1];
Muyun99 marked this conversation as resolved.
Show resolved Hide resolved
const currentUrl = new URL(`arxiv/search?highlight=1&query=${query}&sort=0`, rootUrl).href;
const feedUrl = new URL(`arxiv/search/feed?query=${query}`, rootUrl).href;

const site = keyword.split(/\//)[0];
const apiKimiUrl = new URL(`${site}/kimi?paper=`, rootUrl).href;
const feed = await parser.parseURL(feedUrl);

const language = 'en';

const items = feed.items.slice(0, limit).map((item) => {
const title = item.title;
const guid = item.guid;

const id = item.link?.split(/\//).pop() ?? '';
const kimiUrl = new URL(id, apiKimiUrl).href;
const pdfUrl = Object.hasOwn(pdfUrlGenerators, site) ? pdfUrlGenerators[site](id) : undefined;

const authorString = item.author;
const description = art(path.join(__dirname, 'templates/description.art'), {
pdfUrl,
siteUrl: item.link,
kimiUrl,
authorString,
summary: item.summary,
});

return {
title,
description,
pubDate: parseDate(item.pubDate ?? ''),
link: item.link,
category: item.categories,
author: authorString,
doi: `${site}${id}`,
guid,
id: guid,
content: {
html: description,
text: item.content,
},
language,
enclosure_url: pdfUrl,
enclosure_type: 'application/pdf',
enclosure_title: title,
};
});

return {
title: feed.title,
Muyun99 marked this conversation as resolved.
Show resolved Hide resolved
description: feed.description,
link: currentUrl,
item: items,
allowEmpty: true,
image: feed.image?.url,
language: feed.language,
};
};

export const route: Route = {
path: '/query/:keyword{.+}?',
name: 'Topic',
url: 'papers.cool',
maintainers: ['Muyun99'],
handler,
example: '/papers/query/Detection',
parameters: { keyword: 'Keyword to search for papers, e.g., Detection, Segmentation, etc.' },
description: `:::tip
If you subscibe to [arXiv Paper queryed by Detection](https://papers.cool/arxiv/search?highlight=1&query=Detection), where the URL is \`https://papers.cool/arxiv/search?highlight=1&query=Detection\`, extract the part \`https://papers.cool/\` to the end, and use it as the parameter to fill in. Therefore, the route will be [\`/papers/query/Detection\`](https://rsshub.app/papers/query/Detection).
:::

| Category | id |
| ----------------------------------------------------- | ------------------- |
| arXiv Paper queryed by Detection | query/Detection |
| arXiv Paper queryed by Segmentation | query/Segmentation |
`,
categories: ['journal'],

features: {
requireConfig: false,
requirePuppeteer: false,
antiCrawler: false,
supportRadar: true,
supportBT: false,
supportPodcast: false,
supportScihub: true,
},
radar: [
{
title: 'arXiv Paper queryed by Detection',
source: ['papers.cool/arxiv/search?highlight=1&query=Detection&sort=0`'],
target: '/papers/query/Detection',
},
{
title: 'arXiv Paper queryed by Segmentation',
source: ['papers.cool/arxiv/search?highlight=1&query=Segmentation&sort=0`'],
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

target: '/papers/query/Segmentation',
},
],
};
Loading