a Python library for scraping and document processing
pip install raggy
add extras for vectorstore backends and additional document types:
pip install "raggy[chroma]" # ChromaDB support
pip install "raggy[tpuf]" # TurboPuffer support
pip install "raggy[pdf]" # PDF processing
read the docs
a simple-to-use Python library for:
- scraping the web to produce rich documents
- putting these documents in vectorstores
- querying the vectorstores to find documents similar to a query
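to make the three steps above concrete, here is a minimal, self-contained sketch of the general idea using only the Python standard library. It is not raggy's API: `Document`, `ToyVectorstore`, `embed`, and `cosine` are hypothetical names invented for illustration, the documents are hard-coded rather than scraped, and the word-count "embedding" stands in for a real embedding model.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Document:
    """Hypothetical stand-in for a scraped document (not raggy's class)."""
    text: str
    source: str


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts instead of a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorstore:
    """In-memory store illustrating add + similarity query (assumed interface)."""

    def __init__(self) -> None:
        self._items: list[tuple[Counter, Document]] = []

    def add(self, documents: list[Document]) -> None:
        # Embed each document once, at insertion time.
        for doc in documents:
            self._items.append((embed(doc.text), doc))

    def query(self, text: str, n_results: int = 1) -> list[Document]:
        # Rank stored documents by similarity to the query, best first.
        query_vector = embed(text)
        ranked = sorted(
            self._items,
            key=lambda item: cosine(query_vector, item[0]),
            reverse=True,
        )
        return [doc for _, doc in ranked[:n_results]]


# Step 1 (scraping) is replaced here by hard-coded example documents.
docs = [
    Document("Prefect is a workflow orchestration tool", "https://example.com/prefect"),
    Document("Chroma is an open source vector database", "https://example.com/chroma"),
]

# Step 2: put the documents in a vectorstore.
store = ToyVectorstore()
store.add(docs)

# Step 3: query the vectorstore for the most similar document.
top = store.query("which vector database is open source?")
print(top[0].source)  # → https://example.com/chroma
```

a real setup would swap the toy pieces for a web loader, an embedding model, and a backend such as ChromaDB or TurboPuffer; see the docs for the actual interfaces.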
this project is licensed under the Apache 2.0 License - see the LICENSE file for details.
I welcome contributions! See the contributing guide for details.