Remove restoring token limit on opening settings #3

Open

jks-liu wants to merge 2 commits into master
Conversation

@jks-liu (Contributor) commented Oct 16, 2024

1. Resetting the token limit to a default value every time the settings are opened is very strange behavior (sketched below).
2. The default value is not suitable for the num_ctx parameter, because it is the maximum context length supported by the model. For llama3.1, for example, this value is more than 100k, and passing it to num_ctx will degrade performance. (The default value for Ollama is only 2048.)

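To illustrate the change in (1), here is a minimal Python sketch; the tokens table, settings dict, and function names are hypothetical, not this repo's actual code:

```python
# Hypothetical sketch of the behavior change proposed in this PR.

MODEL_MAX_TOKENS = {"llama3.1": 131072}  # per-model maximums (the "tokens table")
OLLAMA_DEFAULT_NUM_CTX = 2048            # Ollama's built-in default

def open_settings(saved_settings: dict, model: str) -> dict:
    # Old behavior: overwrite the limit with the model's maximum on every open.
    # saved_settings["num_ctx"] = MODEL_MAX_TOKENS[model]

    # Proposed behavior: keep whatever the user saved, and fall back to a
    # conservative default only when nothing has been saved yet.
    saved_settings.setdefault("num_ctx", OLLAMA_DEFAULT_NUM_CTX)
    return saved_settings
```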
@tcsenpai (Owner)

Do you mean that with your PR we no longer use the tokens table and instead rely only on the user's settings (or the default)?

Regarding (2), I did not find any drawbacks to this. Can you provide an example of why it would be detrimental to set num_ctx to, say, 100k?

@jks-liu (Contributor, Author) commented Oct 17, 2024

1. Yes, I think the token limit should be decided by the user, because users have different hardware.
2. On my RTX 4090, llama3.1 becomes slow if num_ctx is set to more than 10k (see the example below).

More discussion about num_ctx:
ollama/ollama#4790
Mintplex-Labs/anything-llm#1991
https://www.reddit.com/r/LocalLLaMA/comments/1dxi6cf/today_i_learned_the_context_length_num_ctx/
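For reference, num_ctx is passed as a per-request option in Ollama. A minimal example using the official ollama Python client; the model name and the 8192 value are illustrative:

```python
import ollama

# num_ctx sets the context window for this request. A larger window grows
# the KV cache roughly linearly, which can overflow VRAM and slow generation
# down, as described above for llama3.1 on an RTX 4090.
response = ollama.chat(
    model="llama3.1",
    messages=[{"role": "user", "content": "Hello"}],
    options={"num_ctx": 8192},  # a user-chosen limit, not the model's 128k maximum
)
print(response["message"]["content"])
```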

@tcsenpai (Owner)

Do you think it is still worth making the edit in (2) now that I have merged the other PR?
