Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle HTML input #15

Open
Keyrxng opened this issue Sep 17, 2024 · 3 comments
Open

Handle HTML input #15

Keyrxng opened this issue Sep 17, 2024 · 3 comments

Comments

@Keyrxng
Copy link
Contributor

Keyrxng commented Sep 17, 2024

Input should be properly sanitized and then stored according to the database schema. It appears that markdown is handled correctly but HTML is not and it should be as it's a supported GitHub comment format.

ubq-testing#5

image

@0x4007
Copy link
Member

0x4007 commented Sep 17, 2024

We just should rename the column to markup then

We need to test if embeddings work better with plaintext or all the markup context.

@Keyrxng
Copy link
Contributor Author

Keyrxng commented Sep 17, 2024

Well We aren't using GPT for creating embeddings so idk but GPT likes markdown and they used to write the system message in markdown so I'd assume it's better than plaintext as you can signify inner context with block quotes, headings etc but plaintext over HTML all day long if that was in the mix too.

Copy link

Note

The following contributors may be suitable for this task:

zugdev

1% Match ubiquity/ts-template#87

gentlementlegen

0% Match ubiquity-os/ubiquity-os-kernel#125

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants