
OpenAI compatible API support #5

Open
Zane-XY opened this issue Jul 14, 2024 · 9 comments
Labels
requirement Requirement from the end-user when it does not match 1-1 to the feature request.

Comments


Zane-XY commented Jul 14, 2024

I noticed that the url in this crate is hardcoded, which currently only supports the official API endpoints of each AI service provider.

Would there be a plan to make the endpoint configurable? For example, allowing users to specify Azure OpenAI endpoints through configuration would greatly enhance flexibility.

@jeremychone (Owner)

@Zane-XY yes, endpoints will be configurable per adapter kind. I need to find the right way to do it (e.g., host/port vs. path ...)
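The "host/port vs. path" split could look something like the following. This is a hypothetical sketch only (`AdapterKind`, `EndpointConfig`, and `chat_url` are illustrative names, not the genai crate's actual API): the base URL is the user-overridable part, while the request path stays fixed per adapter.

```rust
// Hypothetical sketch -- NOT the genai crate's actual types.
// The overridable base URL (host/port) is separated from the fixed request path.

#[derive(Debug, Clone, Copy)]
enum AdapterKind {
    OpenAI,
    Ollama,
}

struct EndpointConfig {
    /// Base URL, e.g. "https://api.openai.com"; this is what users would override.
    base_url: String,
    /// Request path, e.g. "/v1/chat/completions"; usually fixed per adapter.
    chat_path: String,
}

impl EndpointConfig {
    /// Default endpoint per adapter kind.
    fn default_for(kind: AdapterKind) -> Self {
        let (base_url, chat_path) = match kind {
            AdapterKind::OpenAI => ("https://api.openai.com", "/v1/chat/completions"),
            AdapterKind::Ollama => ("http://localhost:11434", "/v1/chat/completions"),
        };
        EndpointConfig {
            base_url: base_url.to_string(),
            chat_path: chat_path.to_string(),
        }
    }

    /// Full URL for a chat request, tolerating a trailing slash on the base.
    fn chat_url(&self) -> String {
        format!("{}{}", self.base_url.trim_end_matches('/'), self.chat_path)
    }
}

fn main() {
    // Point the OpenAI adapter at an OpenAI-compatible service instead
    // (hypothetical host, e.g. an Azure or enterprise deployment).
    let mut cfg = EndpointConfig::default_for(AdapterKind::OpenAI);
    cfg.base_url = "https://my-ai.example.com/".to_string();
    println!("{}", cfg.chat_url()); // https://my-ai.example.com/v1/chat/completions
}
```

With this shape, Azure OpenAI, Ollama's compatibility layer, or an enterprise deployment only differ in `base_url`, while each adapter keeps its own default path.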

jeremychone added the requirement label on Jul 14, 2024
@jeremychone (Owner)

@Zane-XY btw, feel free to explain your particular use case. I will make sure it gets covered.


Zane-XY commented Jul 14, 2024

In my use case, the service URL and models are different, but the service is OpenAI-compatible. Really appreciate the fast response!

@jeremychone (Owner)

@Zane-XY thanks. Is this AWS Bedrock / Google Vertex AI, or a custom service somewhere?
Also, is it an Ollama server? (Their OpenAI compatibility layer requires some custom behaviors.)


Zane-XY commented Jul 15, 2024

It's an enterprise hosted AI service.

@jeremychone (Owner)

Ok, that will probably be a Custom Adapter then. I will get to it; genai should support this use case.


Boscop commented Sep 29, 2024

👍 +1

I'm using Jan.ai, TabbyML, and LM Studio to run local models, each with a local API server exposing an OpenAI-compatible API.
I would like to use this crate to make requests to them (also for embeddings) 🙂


InAnYan commented Oct 29, 2024

Hi! +1 for this feature.

Basically, it would be enough to make the API base URL a variable that can be set in the constructor.

That's what I did to use Ollama (in another project, not genai).

  • A configurable base URL also makes testing easier: you can spin up a mock web server, take the URL it gives you, and supply that URL to the client instead of the real endpoint.
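The mock-server idea can be sketched with only the Rust standard library (the helper name `spawn_mock_server` is illustrative; genai itself is not involved): bind an ephemeral port, answer one request with a canned OpenAI-style body, and hand the resulting URL to whatever client accepts a configurable base URL.

```rust
use std::io::{Read, Write};
use std::net::{TcpListener, TcpStream};
use std::thread;

/// Minimal mock server: binds an ephemeral port, answers one HTTP request
/// with a canned body, and returns the base URL it is listening on.
fn spawn_mock_server(body: &'static str) -> String {
    let listener = TcpListener::bind("127.0.0.1:0").expect("bind failed");
    let addr = listener.local_addr().expect("no local addr");
    thread::spawn(move || {
        if let Ok((mut stream, _)) = listener.accept() {
            let mut buf = [0u8; 1024];
            let _ = stream.read(&mut buf); // read (and ignore) the request
            let response = format!(
                "HTTP/1.1 200 OK\r\nContent-Length: {}\r\nConnection: close\r\n\r\n{}",
                body.len(),
                body
            );
            let _ = stream.write_all(response.as_bytes());
        }
    });
    format!("http://{}", addr)
}

fn main() {
    // The mock stands in for an OpenAI-compatible endpoint.
    let base_url = spawn_mock_server(r#"{"choices":[]}"#);

    // A client that takes `base_url` in its constructor would be pointed here;
    // below we issue a raw request just to show the round trip works.
    let addr = base_url.trim_start_matches("http://");
    let mut stream = TcpStream::connect(addr).expect("connect failed");
    stream
        .write_all(b"POST /v1/chat/completions HTTP/1.1\r\nHost: mock\r\nConnection: close\r\n\r\n")
        .expect("write failed");
    let mut response = String::new();
    stream.read_to_string(&mut response).expect("read failed");
    assert!(response.contains(r#""choices""#));
    println!("mock at {base_url} answered OK");
}
```

In a real test, crates like `httpmock` or `wiremock` do this more conveniently, but the principle is the same: the mock hands you a URL, and a configurable base URL lets you inject it.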

@yihong0618


5 participants