Merge pull request #389 from eric-catalysis/main

Updated 2.4, draft version for 2.5
Azure · Oct 28, 2024 · c6e4bbf
2 parents 479e191 + ecddf74
commit c6e4bbf

Showing 14 changed files with 291 additions and 3 deletions.
diff --git a/website/blog-30-days-of-ia-2024/2024-10-17/evaluate-with-ai.md b/website/blog-30-days-of-ia-2024/2024-10-17/evaluate-with-ai.md
@@ -87,9 +87,9 @@ Let's take a quick look at the [**default quality metrics**](https://learn.micro
 | Metric | What does it assess? | How does it work? | When should you use it? | Inputs Needed |
 |:--|:--|:--|:--|:--|
 | **Groundedness** <br/> 1=ungrounded <br/> 5=grounded | How well do the model's generated answers align with information from the source data ("context")? | Checks whether the response corresponds _verifiably_ to the source context | When factual correctness and contextual accuracy are key, e.g., is it grounded in "my" product data? | Question, Context, Generated Response |
-| **Relevance** <br/> 1=bad <br/> 5=good | Are the model's generated responses pertinent, and directly related, to the given queries? | Assesses ability of responses to capture the key points of context that relate to the query | When evaluating your application's ability to understand the inputs and generate _contextually-relevant_ responses | |
-| **Groundedness**| Given support knowledge, does the ANSWER use the information provided by the CONTEXT? | | | |
-| **Relevance**| How well does the ANSWER address the main aspects of the QUESTION, based on the CONTEXT? | | | |
+| **Relevance** <br/> 1=bad <br/> 5=good | Are the model's generated responses pertinent, and directly related, to the given queries? | Assesses ability of responses to capture the key points of context that relate to the query | When evaluating your application's ability to understand the inputs and generate _contextually-relevant_ responses | Question, Answer|
+| **Fluency** <br/> 1=bad <br/> 5=fluent | How grammatically and linguistically correct is the model's predicted answer? | Checks the quality of individual sentences in the ANSWER: are they well-written and grammatically correct? | When evaluating your application's ability to generate _readable_ responses | Question, Answer |
+| **Coherence** <br/> 1=bad <br/> 5=good | Measures the quality of all sentences in the model's predicted answer and how naturally they fit together. | Checks how well all sentences in the ANSWER fit together: do they sound natural when taken as a whole? | When the _readability_ of the response is important | Question, Answer |
 
 To create these custom evaluators, we need to do three things:
 

diff --git a/website/blog-30-days-of-ia-2024/2024-10-18/deploy-with-aca.md b/website/blog-30-days-of-ia-2024/2024-10-18/deploy-with-aca.md
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-aca-default.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-aca-default.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-aca-fastapi.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-aca-fastapi.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-deploy-aca.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-deploy-aca.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-developer-workflow.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-developer-workflow.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-codespace.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-codespace.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-console.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-console.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-docs.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-docs.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-local.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-local.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-response.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-response.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-test.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/05-fastapi-test.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/app_insights.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/app_insights.png
diff --git a/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/app_insights_logic.png b/website/static/img/30-days-of-ia-2024/blogs/2024-10-18/app_insights_logic.png