The Gemma model is trained to wrap its JSON responses in code blocks, but our service does not account for this behavior. As a result, these responses are often treated as bad responses. Currently, the system retries when it encounters this issue, and if the retry also fails, the response is returned as is, leading users to think the output is incorrect even when the JSON inside the code block could be valid.
To address this, we should improve response handling by detecting and trimming code blocks around JSON outputs automatically. This approach would prevent the system from misclassifying these responses as failures and reduce unnecessary retries.
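One possible shape for this handling is sketched below. It is only an illustration, not the service's actual code: the names `strip_code_fence` and `parse_json_response` are hypothetical, and it assumes the whole response is a single fenced block with an optional language tag such as `json` after the opening fence. Responses without a fence are passed through unchanged.

```python
import json
import re

# The fence marker is built at runtime so this example contains no literal fence.
FENCE = "`" * 3

# Matches a response that consists entirely of one fenced block, with an
# optional language tag (for example "json") after the opening fence.
_FENCED = re.compile(
    r"^\s*" + FENCE + r"[A-Za-z0-9_-]*[ \t]*\r?\n(.*?)\r?\n?[ \t]*" + FENCE + r"\s*$",
    re.DOTALL,
)


def strip_code_fence(text: str) -> str:
    """Return the body of a fenced block, or the trimmed text if no fence wraps it."""
    match = _FENCED.match(text)
    return match.group(1).strip() if match else text.strip()


def parse_json_response(text: str):
    """Trim a surrounding code fence (if any) and parse the remainder as JSON.

    Raises json.JSONDecodeError if the remainder is still not valid JSON,
    so the caller can decide whether a retry is actually needed.
    """
    return json.loads(strip_code_fence(text))
```

With something like this in place, the retry path would only be taken when `parse_json_response` raises, rather than whenever the raw response starts with a fence.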