add save_load_state example #1150
Conversation
@@ -0,0 +1,133 @@
#include <vector>
#include <iostream>
Suggested change:
-#include <iostream>
+#include <cstdio>

This is better.
auto n_prompt_tokens = llama_tokenize(ctx, params.prompt.c_str(), tokens.data(), tokens.size(), true);

if (n_prompt_tokens < 1) {
    cout << "Failed to tokenize prompt" << endl;
Suggested change:
-    cout << "Failed to tokenize prompt" << endl;
+    fprintf(stderr, "%s : failed to tokenize prompt\n", __func__);
etc.
I changed cout to printf / fprintf in this last commit.
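(For anyone skimming the thread: the pattern being converged on is roughly the following minimal sketch; the helper name here is hypothetical, not from the PR.)

```cpp
// Hypothetical helper illustrating the suggested C-style reporting:
// write to stderr, prefixed with the enclosing function's name.
#include <cstdio>

static bool check_tokenization(int n_prompt_tokens) {
    if (n_prompt_tokens < 1) {
        fprintf(stderr, "%s : failed to tokenize prompt\n", __func__);
        return false;
    }
    return true;
}
```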
Sorry, one more thing - all filenames use dashes
Sure, I changed it :)
This is the save_load_state example script from #730 (comment).
It loads the model with params read from the command line,
then evaluates and prints the initial prompt and saves the state.
n_predict tokens are generated and printed.
The model is freed and loaded again in a new context.
The saved state is restored, and n_predict tokens are generated and printed again.
The two runs of n_predict generated tokens should be the same.
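For readers following along, here is a condensed sketch of that flow against the llama.cpp C API as it stood around this PR (llama_init_from_file, llama_eval, llama_get_state_size, llama_copy_state_data, llama_set_state_data); these signatures have since changed. The hard-coded prompt, thread count, and elided sampling loops are placeholders, not the merged example's exact code.

```cpp
// Sketch of the save/load flow described above, against the
// llama.cpp C API of this era; sampling loops are elided.
#include <cstdint>
#include <cstdio>
#include <vector>

#include "llama.h"

int main(int argc, char ** argv) {
    if (argc < 2) {
        fprintf(stderr, "usage: %s <model-path>\n", argv[0]);
        return 1;
    }

    auto lparams = llama_context_default_params();

    // load the model and tokenize the prompt
    auto * ctx = llama_init_from_file(argv[1], lparams);
    std::vector<llama_token> tokens(lparams.n_ctx);
    const int n_prompt_tokens = llama_tokenize(
        ctx, "The quick brown fox", tokens.data(), (int) tokens.size(), true);
    if (n_prompt_tokens < 1) {
        fprintf(stderr, "%s : failed to tokenize prompt\n", __func__);
        return 1;
    }

    // evaluate (and, in the real example, print) the initial prompt
    int n_past = 0;
    llama_eval(ctx, tokens.data(), n_prompt_tokens, n_past, /*n_threads*/ 4);
    n_past += n_prompt_tokens;

    // save the state: snapshot the context into a plain byte buffer
    std::vector<uint8_t> state_mem(llama_get_state_size(ctx));
    llama_copy_state_data(ctx, state_mem.data());

    // ... sample, print, and evaluate n_predict tokens from ctx ...

    // free the model and load it again in a new context
    llama_free(ctx);
    auto * ctx2 = llama_init_from_file(argv[1], lparams);

    // restore the state; generation resumes without re-evaluating
    // the prompt and should reproduce the same n_predict tokens
    llama_set_state_data(ctx2, state_mem.data());

    // ... sample, print, and evaluate n_predict tokens from ctx2 ...

    llama_free(ctx2);
    return 0;
}
```

As I understand the state API of that time, the copied buffer covers everything needed to resume (KV cache, logits, and RNG state), which is why the second run can match the first token for token.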