feat: add "stop" keywords as alternative to eot token #769

longregen · 2023-04-04T20:42:57Z

This is a rewrite of #365 by @joshmackwilliams (which addresses #57), updated to the new folder/file structure because the original PR may have been abandoned.

From the original author:

Implements #57.
Stop keywords can be specified using the "--stop" parameter. When one of these keywords appears in the generated output, the model terminates generation immediately. Like reverse prompts, multiple stop keywords can be given by repeating the --stop argument.
The implementation is heavily based on the reverse prompt implementation to keep things simple. Tested using 7B (quantized) in both interactive and non-interactive modes.
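The check described above could be sketched roughly as follows. This is a minimal illustration rather than the PR's actual code: `find_stop_keyword` and its parameters are hypothetical names, and the input is assumed to be the already detokenized tail of the output (the role `last_output` plays in the reverse-prompt code).

```cpp
#include <string>
#include <vector>

// Hypothetical helper, not part of llama.cpp.
// Returns the index of the first stop keyword that occurs in `recent_output`,
// or -1 if none of them occurs.
// `recent_output` is assumed to be the detokenized text of the last few tokens.
int find_stop_keyword(const std::string & recent_output,
                      const std::vector<std::string> & stop_keywords) {
    for (size_t i = 0; i < stop_keywords.size(); ++i) {
        const std::string & kw = stop_keywords[i];
        // Skip empty keywords: std::string::find("") would match everywhere.
        if (kw.empty()) {
            continue;
        }
        if (recent_output.find(kw) != std::string::npos) {
            return static_cast<int>(i);
        }
    }
    return -1;
}
```

A generation loop would call such a helper after each sampled token and break out when the result is not -1. Each `--stop` occurrence on the command line would simply append one entry to `stop_keywords`.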

howard0su · 2023-04-05T15:56:15Z

examples/main/main.cpp

                for (auto id : last_n_tokens) {
                    last_output += llama_token_to_str(ctx, id);
                }
+            }
+
+            // Check for stop keywords, a configurable alternative to the end-of-text token


We should check the token id instead of the string for stop keywords.

For context, the precedent set by #330 is to check for the string (in reverse prompts). I think checking for tokens caused a bug, or at least unintuitive behavior (#292).

Indeed, this should work in the same way as "antiprompts" to provide a better UX. Users should be able to pass --stop "### Assistant" on the command line (for example, in the spirit of the trained vicuna model) or --stop PAUSE (for example, to implement Simon Willison's ReAct Python example), even though these are multi-token markers.

Do we know why? It would help me to understand why checking tokens does not work. Is it because one stop string can be generated by different tokens?

Yes, see #292 (comment)
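To make the point concrete, here is a small sketch of why comparing token ids can miss a stop string. The vocabulary and ids below are invented for illustration and do not correspond to any real model; `detokenize`, `ends_with_tokens`, and `ends_with_text` are hypothetical helpers.

```cpp
#include <algorithm>
#include <string>
#include <unordered_map>
#include <vector>

// Toy vocabulary (assumption): "PAUSE" exists both as a single token
// and as the two pieces "PA" + "USE".
static const std::unordered_map<int, std::string> toy_vocab = {
    {1, "PAUSE"}, {2, "PA"}, {3, "USE"}, {4, " "},
};

// Concatenate the text of each token id; ids missing from the toy vocabulary are skipped.
std::string detokenize(const std::vector<int> & ids) {
    std::string out;
    for (int id : ids) {
        auto it = toy_vocab.find(id);
        if (it != toy_vocab.end()) {
            out += it->second;
        }
    }
    return out;
}

// Token-level check: true only if `output` ends with exactly the ids in `stop`.
bool ends_with_tokens(const std::vector<int> & output, const std::vector<int> & stop) {
    if (stop.empty() || stop.size() > output.size()) {
        return false;
    }
    return std::equal(stop.rbegin(), stop.rend(), output.rbegin());
}

// String-level check: true if the detokenized output ends with the text `stop`,
// regardless of how that text was split into tokens.
bool ends_with_text(const std::vector<int> & output, const std::string & stop) {
    const std::string text = detokenize(output);
    return !stop.empty() && text.size() >= stop.size() &&
           text.compare(text.size() - stop.size(), stop.size(), stop) == 0;
}
```

With this toy vocabulary, the output `{4, 2, 3}` decodes to " PAUSE". The string-level check matches "PAUSE", while a token-level check against the single id `{1}` does not.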

I also noticed that the stop words weren't always consistently formatted. Sometimes the model would emit STOP instead of stop. I'd appreciate a normalize function that lowercases both sides before comparing.
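One possible shape for such a normalization step is sketched below. It is not taken from the PR: `to_lower_ascii` and `contains_ignore_case` are hypothetical names, and the lowercasing is ASCII-only, so keywords with non-ASCII letters would need a different approach.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper: returns an ASCII-lowercased copy of `s`.
std::string to_lower_ascii(std::string s) {
    // Go through unsigned char, since std::tolower is undefined for negative values.
    std::transform(s.begin(), s.end(), s.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    return s;
}

// Hypothetical helper: true if `keyword` occurs in `text`, ignoring ASCII case.
// An empty keyword never matches.
bool contains_ignore_case(const std::string & text, const std::string & keyword) {
    if (keyword.empty()) {
        return false;
    }
    return to_lower_ascii(text).find(to_lower_ascii(keyword)) != std::string::npos;
}
```

Whether case-insensitive matching should be the default is a design choice: it catches STOP versus stop, but it could also trigger on ordinary words that happen to match a keyword in another case.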

joshmackwilliams · 2023-04-05T16:41:28Z

Yes, I got busy and no longer have time to maintain that PR. Thanks for rewriting!

ggerganov

I haven't looked at this in detail.
Merge it if you have confirmed that it works as expected.

longregen · 2023-04-14T11:53:39Z

While #863 appears to be more polished, I'd merge these changes since it's uncertain when that one will be reviewed and integrated. Plus, it's likely that any significant use of this code would involve utilizing the library rather than relying on this example binary.

ggerganov · 2023-04-14T14:49:57Z

@longregen
#863 is now ready to merge. Is there anything that this PR adds that is not covered by #863 ?
If yes, maybe we can create a separate PR after the merge?

longregen · 2023-04-14T15:40:42Z

> @longregen #863 is now ready to merge. Is there anything that this PR adds that is not covered by #863 ? If yes, maybe we can create a separate PR after the merge?

#863 is a superset of this PR. I closed this one because #863 got merged. Thank you for your awesome work!

KevinColemanInc · 2023-04-14T21:43:35Z

@longregen The PR you mentioned has been reverted (c85e03d). Is the plan to wait for that to be tested more or should we reopen this PR?

howard0su reviewed Apr 5, 2023

Claude Doppler added 2 commits April 8, 2023 00:09

feat: add "stop" keywords as alternative to eot token

9fd062f

fix endline

67a0878

joshmackwilliams mentioned this pull request Apr 10, 2023

Stop keywords #365

Closed

ggerganov approved these changes Apr 13, 2023

longregen closed this Apr 14, 2023

longregen deleted the stop-keywords branch April 14, 2023 15:40

ejones mentioned this pull request May 10, 2023

main : add stop keywords #1387

Closed

Review thread comments, in order: howard0su (Apr 5, 2023), joshmackwilliams (Apr 5, 2023), longregen (Apr 5, 2023), howard0su (Apr 6, 2023), j-f1 (Apr 6, 2023), KevinColemanInc (Apr 10, 2023).
