Skip to content

zzstoatzz/raggy

Repository files navigation

raggy

a Python library for scraping and document processing

installation

pip install raggy

add extras to load different document types:

pip install raggy[chroma]     # ChromaDB support
pip install raggy[tpuf]       # TurboPuffer support
pip install raggy[pdf]        # PDF processing

read the docs

what is it?

a simple-to-use Python library for:

  • scraping the web to produce rich documents
  • putting these documents in vectorstores
  • querying the vectorstores to find documents similar to a query

Tip

See this example to chat with any website, or this example to chat with any GitHub repo.

license

this project is licensed under the Apache 2.0 License - see the LICENSE file for details.

contributing

I welcome contributions! See the contributing guide for details.