
Setup model routing config and plan routing to o1 #6189

Open · ryanhoangt wants to merge 17 commits into main

Conversation

@ryanhoangt (Contributor) commented Jan 10, 2025

End-user friendly description of the problem this fixes or functionality that this introduces

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Give a summary of what the PR does, explaining any non-trivial design decisions

This PR is to:

  • Set up config for model routing-related features.
  • Implement a prototype for routing to reasoning models when appropriate. The criteria are based on this paper; a sketch of the overall shape follows below.
[Screenshot 2025-01-10 at 17 22 40]
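For context, a minimal sketch of the shape this could take; `ModelRoutingConfig`, `ReasoningRouter`, and `select_model` are illustrative names, not necessarily the PR's actual API:

```python
from dataclasses import dataclass


@dataclass
class ModelRoutingConfig:
    """Hypothetical config group for model routing (field names are assumptions)."""

    enabled: bool = False
    # Model to switch to when a prompt is judged to need step-by-step planning.
    reasoning_model: str = 'o1-preview-2024-09-12'


class ReasoningRouter:
    """Routes a turn to the reasoning model when a judge deems the prompt complex."""

    def __init__(self, config: ModelRoutingConfig):
        self.config = config

    def select_model(self, default_model: str, is_complex: bool) -> str:
        # Keep the agent's default model unless routing is enabled and the
        # LLM judge flagged the prompt as requiring a detailed plan.
        if self.config.enabled and is_complex:
            return self.config.reasoning_model
        return default_model
```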

Link of any specific issues this addresses

@xingyaoww (Collaborator) left a comment

Awesome! This is a great start for model routing and LGTM!

Router that routes prompts judged by an LLM to be complex and to require a step-by-step plan.
"""

JUDGE_MODEL = 'gpt-4o'
Collaborator:

Would be interesting to see if we can experiment with a cheaper model for that 🤔

* Translating high-level requirements into detailed implementation steps and ensuring consistency.

=== BEGIN USER MESSAGE ===
{message}
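A rough sketch of how such a judge call could work, assuming a litellm-style completion API; `JUDGE_PROMPT` here is a stand-in for the PR's actual prompt file, and the yes/no parsing is an assumption:

```python
import litellm

# Per the comment above, a cheaper judge model could be swapped in here.
JUDGE_MODEL = 'gpt-4o'

# Stand-in for the PR's prompt template; the real criteria list is longer.
JUDGE_PROMPT = (
    'Decide whether the user message below is complex enough to require '
    'a step-by-step plan. Answer only "yes" or "no".\n\n'
    '=== BEGIN USER MESSAGE ===\n{message}\n=== END USER MESSAGE ==='
)


def needs_plan(message: str, judge_model: str = JUDGE_MODEL) -> bool:
    # Ask the judge model for a binary verdict on prompt complexity.
    response = litellm.completion(
        model=judge_model,
        messages=[{'role': 'user', 'content': JUDGE_PROMPT.format(message=message)}],
    )
    return response.choices[0].message.content.strip().lower().startswith('yes')
```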
Collaborator:

We could also experiment with sending o1 the last 5–10 actions/observations 🤔 in case some deep reasoning is required to figure out the error, etc.
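A sketch of that windowing idea; the plain-string events are a simplification of OpenHands' actual action/observation objects:

```python
def recent_context(events: list[str], window: int = 10) -> str:
    # Give the reasoning model only the last `window` action/observation
    # entries, enough to diagnose a recent error without the full history.
    return '\n\n'.join(events[-window:])
```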

)

# Replace the model with the reasoning model
kwargs['model'] = self.model_routing_config.reasoning_model
Collaborator:

Is the model alone enough, or do we also need a custom provider and base URL?

Collaborator:

Could we design the reasoning model not as part of an LLM instance, but as a second LLM instance in the agent?

@ryanhoangt (Contributor, Author) commented Jan 12, 2025

> Is the model alone enough, or do we also need a custom provider and base URL?

Yeah, I think we also need to allow users to set these, especially if they don't go through an LLM proxy 🤔
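A sketch of what a fuller override could look like; the `reasoning_model_*` field names are hypothetical and, per the discussion below, might be better replaced by a named [llm.*] section:

```python
def apply_routing_overrides(kwargs: dict, cfg) -> dict:
    # Swap in the reasoning model, plus its endpoint and credentials for
    # users who call the provider directly rather than through an LLM proxy.
    kwargs['model'] = cfg.reasoning_model
    if getattr(cfg, 'reasoning_model_base_url', None):
        kwargs['base_url'] = cfg.reasoning_model_base_url
    if getattr(cfg, 'reasoning_model_api_key', None):
        kwargs['api_key'] = cfg.reasoning_model_api_key
    return kwargs
```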

Collaborator:

Using [llm.reasoning_model] will do it implicitly!

[model_routing]

# The reasoning model to use for plan generation
reasoning_model = "o1-preview-2024-09-12"
Collaborator:

Suggested change:
- reasoning_model = "o1-preview-2024-09-12"
+ [llm.reasoning_model]
+ model = "o1-preview-2024-09-12"
+ ...

@ryanhoangt (Contributor, Author) commented Jan 12, 2025

Yeah, that's another approach. My thinking is that, for now, we only use reasoning models specifically for model routing, so I put it in this config group (along with other values in the future). When we also use them for other purposes, we can probably move to LLM-specific groups?

Collaborator:

My point is that we can reuse the way we already define a model (which implicitly takes care of correct loading and initializes base_url etc.).

That doesn't dictate which component of OpenHands loads the definition of [llm.reasoning_model]; it can be the routing component.

Collaborator:

To clarify: if a user wants to use a reasoning model for the agent today, they can already do so. They just choose a reasoning model and configure it. The ability to use one isn't new.

We can simply avoid duplicating LLMConfig settings ("reasoning_model", "reasoning_model_base_url", "reasoning_model_api_key", "reasoning_model_aws...something", etc.) into the new routing section; instead we can reference existing configurations.

@ryanhoangt (Contributor, Author):

Yeah, that sounds good to me, thanks for the suggestion! I'll try to address this after getting the routing behavior to work.

@enyst (Collaborator) left a comment

I'm so happy to see this, thank you! I do think we are missing some minimal framework to experiment with reasoning models.

About the way to choose another model:
We already have the ability to choose, configure, and use an arbitrary model, for example in evals: we can write the model configuration in toml, in a custom named LLM config section, [llm.o1], load it with a utility function, and instantiate an LLM from it.

We can use that here. Names are user-defined, and we can, if we want, set in stone a particular name for the reasoning model, e.g. [llm.reasoning_model], or [llm.oh_reasoning_model], or [llm.blueberry] (or strawberry for that matter), whatever name.
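Concretely, the eval-style pattern looks roughly like this; the import paths and utility name are from memory and should be treated as approximate:

```python
from openhands.core.config import get_llm_config_arg
from openhands.llm.llm import LLM

# Load the custom named section [llm.reasoning_model] from config.toml
# and build a second LLM instance from it for the routing component.
reasoning_config = get_llm_config_arg('reasoning_model')
reasoning_llm = LLM(config=reasoning_config)
```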

# Pad multiline values with newlines so each tag renders on its own line
param_value = '\n' + param_value + '\n' if is_multiline_value else param_value
tool_call_str += f'<parameter={param_name}>{param_value}</parameter>\n'
tool_call_str += '</function>'
return tool_call_str
Collaborator:

Nit: it feels like this code is also in fn_call_converter.py?

@ryanhoangt (Contributor, Author) commented Jan 14, 2025

Yeah, but it doesn't contain extra info like the turn number, I think 🤔 Maybe we can still reuse it.

@ryanhoangt (Contributor, Author):

And I'm also planning to add extra markup indicating whether the turn was routed or not.
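A sketch of what that markup could look like; the tag and attribute names are hypothetical:

```python
def format_turn(turn_number: int, routed: bool, tool_call_str: str) -> str:
    # Wrap each serialized turn with its number and whether it was routed
    # to the reasoning model, so later prompts can reference specific turns.
    flag = 'true' if routed else 'false'
    return f'<turn number={turn_number} routed={flag}>\n{tool_call_str}\n</turn>'
```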
