Option to remove part of chat history when prompt limit is reached to prevent reprocessing every generation? #2356
Replies: 3 comments 1 reply
- Yes, after a while it slowly starts to say stranger and stranger things the longer the chat history gets. It goes back to normal after deleting the chat history, but then the AI also loses all context of the discussion...
- What about this? It seems like quite a good idea.
- In the Parameters tab, see if you have a "Truncate the prompt up to this length" option and whether it does what you want. If it does crop the top of the prompt, you might lose some initial instructions, and I don't know whether the system re-inserts them automatically somewhere. And yes, if that's what it does, it will essentially make the AI "forget" the earlier parts of the conversation.
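  For illustration only, here is a rough sketch of that idea done safely: rebuild the prompt inside a token budget while always re-inserting the initial instructions at the top, so cropping old history cannot drop them. This is not the webui's actual code; `build_prompt` and `count_tokens` are made-up names, and the whitespace tokenizer is just a stand-in for the model's real tokenizer.

  ```python
  # A rough sketch, not the webui's actual code: keep the instructions, crop old turns.

  def count_tokens(text: str) -> int:
      # Placeholder tokenizer: real code would call the model's own tokenizer.
      return len(text.split())

  def build_prompt(system_prompt: str, history: list[str], max_tokens: int) -> str:
      budget = max_tokens - count_tokens(system_prompt)
      kept: list[str] = []
      # Walk from the newest message backwards, keeping as much recent history as fits.
      for message in reversed(history):
          cost = count_tokens(message)
          if cost > budget:
              break
          kept.append(message)
          budget -= cost
      # The instructions are re-inserted here, so only old chat turns are forgotten.
      return "\n".join([system_prompt] + list(reversed(kept)))
  ```

  Built this way, only the earlier parts of the conversation are forgotten, never the instructions themselves.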
- When the token limit is reached, the prompt gets truncated on every generation, which takes forever because the cache can no longer be used and all tokens have to be reprocessed each time. I have very low-end hardware.
  Is it possible to automatically remove the beginning of the history down to a set number of tokens once the limit is reached, so llama.cpp's caching can work again?
  I hope this is understandable, as I'm not a programmer.
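  For what it's worth, a rough sketch of what is being asked for (not actual webui or llama.cpp code; `trim_history` and its parameters are hypothetical): once the limit is hit, drop the oldest messages in one large chunk rather than sliding the window a little every turn, so the remaining prefix stays identical across many generations and a prompt-prefix cache can keep being reused.

  ```python
  # A rough sketch of the idea, assuming a message-list history and some tokenizer.
  # Dropping old messages in one big chunk keeps the remaining prefix unchanged for
  # many generations, so cached prompt processing can keep being hit instead of
  # reprocessing every token each time.

  def trim_history(history: list[str], count_tokens, max_tokens: int,
                   keep_ratio: float = 0.5) -> list[str]:
      """Drop whole old messages until the history fits in keep_ratio * max_tokens."""
      total = sum(count_tokens(m) for m in history)
      if total <= max_tokens:
          return history                      # still under the limit: change nothing
      target = int(max_tokens * keep_ratio)   # free a large chunk in one go
      trimmed = list(history)
      while trimmed and total > target:
          total -= count_tokens(trimmed.pop(0))
      return trimmed
  ```

  Trimming down to roughly half the limit at once means the prompt prefix only changes occasionally instead of on every single turn, which is what would let the cache keep working between trims.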