Draft: add embed endpoint #151

rohanshah18 · 2024-09-10T20:00:53Z

Problem

Describe the purpose of this change. What problem is being solved and why?

Solution

Describe the approach you took. Link to any relevant bugs, issues, docs, or other resources.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update
Infrastructure change (CI configs, etc)
Non-code change (docs, etc)
None of the above: (explain here)

Test Plan

Describe specific steps for validating this change.

ssmith-pc · 2024-09-10T20:32:56Z

src/main/java/io/pinecone/clients/Inference.java

+    public EmbeddingsList embed(String model, String truncate, String inputType, List<String> inputs) throws ApiException {
+        EmbedRequestParameters embedRequestParameters = new EmbedRequestParameters().inputType(inputType);
+        if(truncate != null && !truncate.isEmpty())
+            embedRequestParameters.truncate(truncate);
+        List<EmbedRequestInputsInner> EmbedRequestInputsInnerList = convertInputStringToEmbedRequestInputsInner(inputs);
+        EmbedRequest embedRequest = new EmbedRequest()
+                .model(model)
+                .parameters(embedRequestParameters)
+                .inputs(EmbedRequestInputsInnerList);


Other models might support different parameters, so these would be better as a map.

Can you please elaborate on this? Are you suggesting that List<String> inputs should be a map instead? If so, how would the key-value pairs be assigned to the inputs, since inputs is of type List<EmbedRequestInputsInner> and EmbedRequestInputsInner has only one member, text, of type String?

I mean for the params like truncate and input_type -- those apply to the multilingual-e5-large model, but they might not apply to the next embedding model we may add.

You can use the Python impl as a reference, where it takes params as basically an optional map (I know, Java optionals are not a thing)

That makes sense, thank you!
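A minimal sketch of the map-based signature discussed above, not the SDK's actual implementation. EmbedRequestSketch, InferenceSketch, and buildEmbedRequest are hypothetical stand-ins for the generated request classes, and the parameter values in main are illustrative only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a generated request model; not the SDK's real class.
final class EmbedRequestSketch {
    final String model;
    final Map<String, Object> parameters;
    final List<String> inputs;

    EmbedRequestSketch(String model, Map<String, Object> parameters, List<String> inputs) {
        this.model = model;
        this.parameters = parameters;
        this.inputs = inputs;
    }
}

final class InferenceSketch {
    // Parameters arrive as a map, so model-specific keys such as "truncate" or
    // "input_type" need no dedicated method arguments. A null map means "no parameters".
    static EmbedRequestSketch buildEmbedRequest(String model,
                                                List<String> inputs,
                                                Map<String, Object> parameters) {
        Map<String, Object> copy = new LinkedHashMap<>();
        if (parameters != null) {
            for (Map.Entry<String, Object> entry : parameters.entrySet()) {
                // Skip null values so unset options are not sent to the API.
                if (entry.getValue() != null) {
                    copy.put(entry.getKey(), entry.getValue());
                }
            }
        }
        return new EmbedRequestSketch(model, copy, new ArrayList<>(inputs));
    }

    public static void main(String[] args) {
        Map<String, Object> params = new LinkedHashMap<>();
        params.put("input_type", "query"); // illustrative value
        params.put("truncate", "END");     // illustrative value
        EmbedRequestSketch request =
                buildEmbedRequest("multilingual-e5-large", Arrays.asList("hello"), params);
        System.out.println(request.model + " " + request.parameters + " " + request.inputs);
    }
}
```

With this shape, a future model that takes different options would only change the keys callers put in the map, not the method signature.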

ssmith-pc · 2024-09-10T20:34:39Z

src/main/java/io/pinecone/clients/Pinecone.java

+    public Inference getInference() {
+        return new Inference();
+    }


How heavy is it to create this object, or should we cache Inference in Pinecone?

e.g. if someone effectively did this every time:

pc.getInference().embed(...);
pc.getInference().embed(...);
pc.getInference().embed(...);
pc.getInference().embed(...);

Yeah, I see your point! Do you feel like Java users are traditionally used to getting the Inference object and reusing it, rather than instantiating it every single time they call embed()?

I don't think there's a Java convention for this, so I'm just thinking about the ergonomics of how we deliver the API to them. If there's something here that's heavy to construct/connect/etc., then we should try to lazily construct it and reuse it across requests.

Maybe calling it getInferenceClient() would resolve the ambiguity and encourage use like:

client = pc.getInferenceClient();
client.embed(...);
client.embed(...);

Yes, I like this approach. If we hear about performance issues in the future from having a lot of unused objects (which should usually be cleaned up by the garbage collector), then we can think about caching the object and reusing it.

I'm not worried about the extra objects if they're lightweight to construct. So I'm asking how expensive are they to construct?
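If construction did turn out to be expensive, one option would be to build the client lazily and reuse it. The sketch below assumes hypothetical stand-in classes (PineconeSketch, InferenceClientSketch) rather than the real Pinecone and Inference classes, and uses double-checked locking so that concurrent callers share one instance.

```java
// Hypothetical stand-in for the Inference client; the real class holds more state.
final class InferenceClientSketch {
    InferenceClientSketch() {
        // Any expensive setup (HTTP client, connection pool, ...) would happen here.
    }
}

// Hypothetical stand-in for the Pinecone entry point.
final class PineconeSketch {
    // volatile is required for double-checked locking to be safe.
    private volatile InferenceClientSketch inferenceClient;

    InferenceClientSketch getInferenceClient() {
        InferenceClientSketch local = inferenceClient;
        if (local == null) {
            synchronized (this) {
                local = inferenceClient;
                if (local == null) {
                    // Constructed at most once, on first use.
                    local = new InferenceClientSketch();
                    inferenceClient = local;
                }
            }
        }
        return local;
    }

    public static void main(String[] args) {
        PineconeSketch pc = new PineconeSketch();
        // Both calls return the same cached instance.
        System.out.println(pc.getInferenceClient() == pc.getInferenceClient());
    }
}
```

If the object is cheap to construct, returning a new instance per call stays simpler and avoids the synchronization.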


ssmith-pc · 2024-09-10T20:35:59Z

src/main/java/io/pinecone/clients/Pinecone.java

@@ -871,6 +871,10 @@ public AsyncIndex getAsyncIndexConnection(String indexName) throws PineconeValid
        return new AsyncIndex(connection, indexName);
    }

+    public Inference getInference() {


This feels a little odd to me as a getter. Wondering if just pc.inference().embed(...) is better?

I think pc.inference().embed(...) definitely feels more like Python usage, and it aligns with the rest of our SDKs, but given it's Java, do you feel like users will directly call pc.inference().embed()? And pc.inference() will still return the Inference object regardless, so the naming does make sense if not compared with other SDKs.

Also, for data plane operations, we have getIndexConnection() and getAsyncIndexConnection().

ssmith-pc · 2024-09-10T20:37:49Z

src/main/java/io/pinecone/clients/Inference.java

+    private List<EmbedRequestInputsInner> convertInputStringToEmbedRequestInputsInner(List<String> inputs) {
+        List<EmbedRequestInputsInner> embedRequestInputsInnerList = new ArrayList<EmbedRequestInputsInner>();
+        for(String input:inputs) {
+            embedRequestInputsInnerList.add(new EmbedRequestInputsInner().text(input));
+        }
+        return embedRequestInputsInnerList;
+    }


This would be better using the stream API:

inputs.stream()
    .map(input -> new EmbedRequestInputsInner().text(input))
    .collect(Collectors.toList());

Why do you think using a stream is better here? Wouldn't it create a new object for the intermediate step?

I'd have to look more closely at the underlying impl, but I think using the stream impl gives java flexibility to optimize how/when the underlying list actually gets created, and it may never have to allocate a new list at all depending on the usage.

There are a couple of points we are looking at:

A stream will first create the stream itself, then the processing happens via the map function, and finally the result is collected into a new list. The for-loop approach creates the list and adds the elements to it directly, so there is no intermediate step that allocates extra memory.

I think using the stream impl gives java flexibility to optimize how/when the underlying list actually gets created, and it may never have to allocate a new list at all depending on the usage.

The only relevant optimization I can think of is lazy evaluation, but in this case, with collect(), the list will still be created as a last step, right? inputs() accepts a List as an argument.

This is your call, and I don't think it'll make much difference in how fast things run on such small datasets. I just find the stream API more pleasant to use, and it gives Java an opportunity to optimize what's happening under the hood more so than if you explicitly instantiate a list. But this is not a showstopper by any means.

I agree that the dataset is going to be small, so creating intermediate objects is not going to affect performance, and yes, streams are easier to read. I just wanted to make sure I didn't skip any optimization step, but I can change this to streams for readability.
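For comparison, a small self-contained sketch of the two variants discussed in this thread. TextInputSketch is a hypothetical stand-in for EmbedRequestInputsInner. Both methods return a list with the same elements in the same order; the stream version still materializes a list when collect() runs, so the difference here is mainly one of readability.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical stand-in for EmbedRequestInputsInner: a single text field.
final class TextInputSketch {
    final String text;

    TextInputSketch(String text) {
        this.text = text;
    }
}

final class ConvertSketch {
    // Explicit loop: allocate the result list, then append each converted element.
    static List<TextInputSketch> withLoop(List<String> inputs) {
        List<TextInputSketch> result = new ArrayList<>();
        for (String input : inputs) {
            result.add(new TextInputSketch(input));
        }
        return result;
    }

    // Stream pipeline: map() is lazy, and collect() is the terminal operation
    // that builds the result list.
    static List<TextInputSketch> withStream(List<String> inputs) {
        return inputs.stream()
                .map(TextInputSketch::new)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> inputs = Arrays.asList("first", "second");
        System.out.println(withLoop(inputs).size() + " " + withStream(inputs).size());
    }
}
```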

Draft: add embed endpoint

c7d3831

rohanshah18 requested a review from ssmith-pc September 10, 2024 20:16

ssmith-pc reviewed Sep 10, 2024


rohanshah18 added 2 commits September 17, 2024 10:39

Merge branch 'main' into rshah/inference

a61f2c4

update function name and update parameters with map

aa274d8

rohanshah18 closed this Sep 20, 2024
