# .NET: Allow `ISKFunction` to return `IAsyncEnumerable` (#1298)
Or, to put it another way: is there any way to get the GPT client to use the streaming endpoint and emit an event for each message?
We added support for streaming chat messages semi-recently, although I'm not sure this is what you are asking for: https://github.com/microsoft/semantic-kernel/blob/main/samples/dotnet/kernel-syntax-examples/Example33_StreamingChat.cs

Can you be a little more specific about the types of events you are looking to consume?
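For reference, the consumption pattern that sample demonstrates looks roughly like this. This is a minimal sketch based on the pre-v1 `IChatCompletion` API of that era; the builder and method names shifted between releases, so treat it as illustrative rather than exact:

```csharp
using Microsoft.SemanticKernel;
using Microsoft.SemanticKernel.AI.ChatCompletion;

var kernel = new KernelBuilder()
    .WithAzureChatCompletionService("deployment-name", "https://contoso.openai.azure.com/", "api-key")
    .Build();

var chatCompletion = kernel.GetService<IChatCompletion>();
var chat = chatCompletion.CreateNewChat("You are a helpful assistant.");
chat.AddUserMessage("Write a haiku about streaming.");

// Tokens are printed as they arrive instead of waiting for the full reply.
await foreach (string token in chatCompletion.GenerateMessageStreamAsync(chat))
{
    Console.Write(token);
}
```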
So, for example, in the copilot example I would want the results as a stream, even if that meant subscribing to a SignalR stream. That's OK, because it makes sense to have some of the data in a single payload, but I would want to see the chat response as soon as it started generating, as opposed to waiting for the whole thing to complete. Does that make sense?

Thank you :)
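As an aside, relaying such a stream over SignalR is straightforward once a token stream exists, since ASP.NET Core SignalR hub methods can return `IAsyncEnumerable<T>` directly. A hedged sketch, with a hypothetical hub that is not part of Semantic Kernel:

```csharp
using System.Runtime.CompilerServices;
using Microsoft.AspNetCore.SignalR;
using Microsoft.SemanticKernel.AI.ChatCompletion;

// Hypothetical hub for illustration -- not part of Semantic Kernel.
public class ChatHub : Hub
{
    private readonly IChatCompletion _chatCompletion;

    public ChatHub(IChatCompletion chatCompletion) => _chatCompletion = chatCompletion;

    // SignalR streams each yielded item to the caller as soon as it is produced.
    public async IAsyncEnumerable<string> StreamReply(
        string prompt,
        [EnumeratorCancellation] CancellationToken cancellationToken)
    {
        var chat = _chatCompletion.CreateNewChat();
        chat.AddUserMessage(prompt);

        await foreach (var token in _chatCompletion
                           .GenerateMessageStreamAsync(chat)
                           .WithCancellation(cancellationToken))
        {
            yield return token;
        }
    }
}
```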
@RogerBarreto has some ideas in the works to make the requested behavior possible. TBD: when we get those changes in, let's make sure to tag this issue.
Happy to help out any way I can. Thank you :)
Since you don't have this yet in Semantic Kernel, let me give my view. I'm working on my own generative LLM integration here: https://github.com/MithrilMan/AIdentities

It works, but I keep hitting a problem around `IAsyncEnumerable`: when the API fails in the middle of generating streamed content, it fails badly, because the exception is thrown only once you start consuming the iterator, and .NET doesn't have a nice way to handle such cases. If you want to implement a retry pattern, for example with Polly (and we've seen how important that is with OpenAI, which has lately had big problems serving its API reliably), you can't just wrap the call to the stream method, because the exception of course arises only when you materialize the enumerator.

Also, when you stream chunks of text, what do you do if something goes wrong in the API? I haven't yet figured out how to handle such scenarios properly and effectively; in my case, when it fails, it fails badly, and this makes me wonder whether it's worth using streaming calls in skills at all. Of course it's cool to see my skill producing text as a stream so the user can start reading as soon as it's generated, and what I call an AIdentity can generate different kinds of "thoughts" that can be streamed too (think of it as a kind of log in some scenarios). But if it breaks in the middle, you'll have big trouble rolling back whatever you did with the partially generated text, not to mention the minor problems it causes if you try to parse that text in a markdown viewer, for example (which is why my chat section now builds the message in a plain div and switches to markdown once it's complete). I'm starting to think that maybe all this complexity isn't worth the effort.

I'm a bit torn as to whether implementing streaming at the function level is something I'd want to deal with. Maybe just being able to signal consumer events like "StartingTextGeneration" / "EndingTextGeneration" would be enough.
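To make the failure mode above concrete: a Polly policy wraps a single awaited call, but with `IAsyncEnumerable` the fault surfaces inside the consumer's `await foreach`, outside anything the policy executed. One blunt workaround is to retry the whole stream and only release items once a full pass succeeds. This is a sketch with a hypothetical helper, not an SK or Polly API:

```csharp
using System.Runtime.CompilerServices;

public static class StreamingRetry
{
    // Hypothetical helper -- not an SK or Polly API. It retries the *entire*
    // stream when enumeration faults: output is buffered and only yielded
    // once a full pass succeeds, trading away streaming latency for atomicity.
    public static async IAsyncEnumerable<T> RetryWholeStreamAsync<T>(
        Func<IAsyncEnumerable<T>> streamFactory,
        int maxAttempts = 3,
        [EnumeratorCancellation] CancellationToken ct = default)
    {
        for (var attempt = 1; ; attempt++)
        {
            var buffer = new List<T>();
            try
            {
                // The fault surfaces here, during enumeration -- not when
                // streamFactory() is invoked -- which is why wrapping only
                // the call in a retry policy never observes it.
                await foreach (var item in streamFactory().WithCancellation(ct))
                {
                    buffer.Add(item);
                }
            }
            catch when (attempt < maxAttempts)
            {
                continue; // discard the partial buffer and retry from scratch
            }

            foreach (var item in buffer)
            {
                yield return item;
            }
            yield break;
        }
    }
}
```

Note the trade-off: buffering until success sacrifices exactly the latency benefit streaming was supposed to provide, which is the tension described above, while the eager alternative (yield as you go) forces consumers to cope with partial output on failure.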
Make child of #1649
## Context and Problem Statement

Resolves #1649
Resolves #1298

It is quite common in co-pilot implementations to have a streamed output of messages from the LLM (large language model), and currently that is not possible while using the `ISKFunction.InvokeAsync` or `Kernel.RunAsync` methods. This forces users to work around the Kernel and Functions and use the `ITextCompletion` and `IChatCompletion` services directly, as those are the only interfaces that currently support streaming.

Streaming is a capability that not all providers support, so as part of our design we try to ensure the services have the proper abstractions to support streaming not only of text but also of other types of data like images, audio, video, etc. It also needs to be clear to the SK developer when they are attempting to get streaming data.

## Decision Drivers

1. The SK developer should be able to get streaming data from the Kernel and Functions using the `Kernel.RunAsync` or `ISKFunction.InvokeAsync` methods.
2. The SK developer should be able to get the data in a generic way, so the Kernel and Functions can stream data of any type, not limited to text.
3. The SK developer using streaming against a model that does not support streaming should still be able to use it, with a single streaming update representing the whole data.

## User Experience Goal

```csharp
// (providing the type as a generic parameter)

// Getting raw streaming data from the Kernel
await foreach (byte[] update in kernel.StreamingRunAsync<byte[]>(function, variables)) { }

// Getting a string as streaming data from the Kernel
await foreach (string update in kernel.StreamingRunAsync<string>(function, variables)) { }

// Getting a StreamingResultUpdate as streaming data from the Kernel
await foreach (StreamingResultUpdate update in kernel.StreamingRunAsync<StreamingResultUpdate>(variables, function))
// OR
await foreach (StreamingResultUpdate update in kernel.StreamingRunAsync(function, variables)) // defaults to the generic above
{
    Console.WriteLine(update);
}
```

## Out of Scope

- Streaming with plans will not be supported in this phase. Attempting to do so will throw an exception.
- Kernel streaming will not support multiple functions (pipeline).

### Contribution Checklist

- [x] The code builds clean without any errors or warnings
- [x] The PR follows the [SK Contribution Guidelines](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md) and the [pre-submission formatting script](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md#development-scripts) raises no violations
- [x] All unit tests pass, and I have added new tests where possible
- [ ] I didn't break anyone 😄
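On decision driver 3, a common way to present a non-streaming backend through the same streaming surface is to adapt the one-shot result into a single-update stream. A minimal sketch with hypothetical names, not the final SK API:

```csharp
// Hypothetical adapter illustrating decision driver 3 -- the names are
// illustrative, not the final SK API. A backend with no native streaming
// support is exposed as a stream yielding exactly one update that carries
// the whole result.
public static class SingleUpdateStream
{
    public static async IAsyncEnumerable<string> AsSingleUpdateStreamAsync(
        Func<Task<string>> completeAsync)
    {
        // One await, one yield: the consumer's await-foreach body runs once.
        yield return await completeAsync();
    }
}
```

A consumer could then wrap a non-streaming completion call, e.g. `SingleUpdateStream.AsSingleUpdateStreamAsync(() => GetCompletionAsync(prompt))`, and iterate it with the same `await foreach` used for genuinely streaming providers, keeping the calling code identical across providers.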
Important
Labeled High because, while it will not require a breaking change, it is very important to complete by v1.0.0.
Is there any way currently to allow `ISKFunction` to return an `IAsyncEnumerable`? If not, is there any plan to?