Dataprep Microservice

The Dataprep Microservice preprocesses data from various sources (structured or unstructured) into text, converts the text to embedding vectors, and stores the vectors in the database.
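The flow described above (text, then embedding vectors, then storage) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not this microservice's actual code: `chunk_text`, `toy_embed`, `InMemoryStore`, and `ingest` are hypothetical names, the "embedding" is a simple character-count vector standing in for a real embedding model, and the store is a plain in-memory list rather than a real database.

```python
from dataclasses import dataclass, field


def chunk_text(text: str, size: int = 40) -> list[str]:
    """Split text into fixed-size character chunks (hypothetical strategy)."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def toy_embed(chunk: str, dim: int = 8) -> list[float]:
    """Stand-in for a real embedding model: bucketed character counts."""
    vec = [0.0] * dim
    for ch in chunk:
        vec[ord(ch) % dim] += 1.0
    return vec


@dataclass
class InMemoryStore:
    """Stand-in for a vector database: keeps (chunk, vector) pairs in a list."""
    rows: list[tuple[str, list[float]]] = field(default_factory=list)

    def add(self, chunk: str, vector: list[float]) -> None:
        self.rows.append((chunk, vector))


def ingest(text: str, store: InMemoryStore, size: int = 40) -> int:
    """Chunk the text, embed each chunk, store the result; return chunk count."""
    chunks = chunk_text(text, size)
    for chunk in chunks:
        store.add(chunk, toy_embed(chunk))
    return len(chunks)
```

A real deployment would replace `toy_embed` with a call to an embedding model and `InMemoryStore` with the configured vector database; the three-step shape is the only part taken from the description above.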

Install Requirements

apt-get update
apt-get install libreoffice

Use LVM (Large Vision Model) for Summarizing Image Data

Unstructured data occasionally contains images. To convert image data to text, an LVM can be used to summarize each image. To use LVM, refer to this readme to start the LVM microservice first, then set the environment variable below before starting any dataprep microservice.

export SUMMARIZE_IMAGE_VIA_LVM=1
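
As a rough illustration of how a service might consult this flag at startup, here is a minimal Python sketch. The helper name `lvm_summarization_enabled` is hypothetical, and treating only the value `1` as "enabled" is an assumption; this README does not state how the dataprep code actually parses the variable.

```python
import os


def lvm_summarization_enabled(env=None) -> bool:
    """Return True when SUMMARIZE_IMAGE_VIA_LVM is set to "1".

    Assumption: any other value, or an unset variable, means disabled.
    """
    env = os.environ if env is None else env
    return env.get("SUMMARIZE_IMAGE_VIA_LVM", "0") == "1"
```

Passing a dictionary instead of reading `os.environ` directly makes the check easy to exercise without changing the real environment, for example `lvm_summarization_enabled({"SUMMARIZE_IMAGE_VIA_LVM": "1"})`.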