read chat prompts from a template file #1196


Merged: 1 commit into ggml-org:master on May 3, 2023

Conversation

khimaros (Contributor)

This is a simple improvement to the chat-13B example: it reads prompts from a text file instead of inlining them in the script. I've used this with Vicuna 13B with excellent results.

DannyDaemonic (Contributor) commented Apr 27, 2023

I think the nice thing about this example is that it basically creates a chat bot out of the standard LLaMA model. If you have the official models, there's nothing else to download or figure out to get a back-and-forth chat.

I'd probably prefer a single-file example that people can edit and that also works with a base model. That said, I'm not against breaking out the prompt and passing it in separately, but it's going to be a bit of work keeping the same chat in three different formats, and then maybe adding more formats as new models come along.

If we did want to support prompts for different chat models, we might want to have the chat script also convert regular chat-style conversations to alpaca/vicuna-1.0 style prompting (with something like sed 's/^\(\[\[.*\]\]:\)/### \1/' prompt.txt), and then prepend the "infinite length chat" stuff when using a standard LLaMA model, or somehow mark the extended chat so it can be cut out for Vicuna or GPT-like models.
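
For illustration only, a rough sketch of that conversion; the file names here are hypothetical, and the sed expression is the one quoted above:

# rewrite "[[USER_NAME]]: ..." speaker lines as "### [[USER_NAME]]: ..." so the
# same transcript can be reused for alpaca/vicuna-1.0 style prompting
sed 's/^\(\[\[.*\]\]:\)/### \1/' prompt.txt > prompt-instruct.txt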

Edit: Keep in mind there's also a chat-13B.bat that gets left behind by these changes and could particularly benefit from prompt separation.

khimaros (Contributor, Author) commented May 3, 2023

This has now been rebased onto master.

N.B.: occasionally with this patch, encountering the reverse prompt does not halt text generation. I believe this is triggered by the trailing whitespace I've added to the reverse prompt.

Is this expected? If so, I can roll back the whitespace addition. Including the trailing whitespace is a slightly better user experience, but not if it causes the issue above.

I tested this PR with the following invocations:

MODEL=./models/ggml-vicuna-13b-1.1-q4_0.bin ./examples/chat-13B.sh
MODEL=./models/ggml-vicuna-13b-1.1-q4_0.bin USER_NAME=USER AI_NAME=VICUNA PROMPT_TEMPLATE=./prompts/chat-with-vicuna-v1.txt ./examples/chat-13B.sh

ggerganov (Member)

Yes, this is expected. There should be no whitespace at the end.
See this comment for more information: #1297 (comment)
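
For context, a minimal sketch of the relevant invocation (not the exact chat-13B.sh contents; MODEL, PROMPT_FILE, and USER_NAME are placeholders). Roughly speaking, the reverse prompt has to match the generated text exactly, and a trailing space tends to end up attached to the start of the next token instead, so "USER: " may never match while "USER:" does:

# pass the reverse prompt without a trailing space so generation actually stops there
./main -m "$MODEL" -f "$PROMPT_FILE" -i --reverse-prompt "${USER_NAME}:"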

khimaros (Contributor, Author) commented May 3, 2023

@ggerganov thank you, I've fixed the whitespace in the reverse prompt.

I think this is ready to merge now.

ggerganov merged commit 6daa09d into ggml-org:master on May 3, 2023
Comment on lines +26 to +30
sed -e "s/\[\[USER_NAME\]\]/$USER_NAME/g" \
-e "s/\[\[AI_NAME\]\]/$AI_NAME/g" \
-e "s/\[\[DATE_TIME\]\]/$DATE_TIME/g" \
-e "s/\[\[DATE_YEAR\]\]/$DATE_YEAR/g" \
$PROMPT_TEMPLATE > $PROMPT_FILE
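
For reference, an illustrative (not verbatim) excerpt of the kind of template file this substitution operates on, using the same [[...]] placeholders:

Transcript of a dialog between [[USER_NAME]] and an AI assistant named [[AI_NAME]].
The current time is [[DATE_TIME]] and the year is [[DATE_YEAR]].

[[USER_NAME]]: Hello, [[AI_NAME]]!
[[AI_NAME]]: Hello [[USER_NAME]], how can I help you today?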
Collaborator


# export the variables so envsubst can see them in the environment
export USER_NAME AI_NAME DATE_TIME DATE_YEAR
# expand $VAR references in a template while preserving escaped "\\" and "\$"
configure() {
  local file="$1"
  # hide literal "\\" and "\$" behind placeholder variables, then let envsubst
  # expand everything; the placeholders restore the literal "\" and "$"
  sed 's/\\\\/\${__envsubst_b}/g;s/\\\$/\${__envsubst_d}/g' "${file}" | __envsubst_b=\\ __envsubst_d=\$ envsubst
}
configure "$PROMPT_TEMPLATE" > "$PROMPT_FILE"
