Annotation Reply
Last updated
Last updated
The annotated replies feature provides customizable high-quality question-and-answer responses through manual editing and annotation.
Applicable scenarios:
Customized Responses for Specific Fields: In customer service or knowledge base scenarios for enterprises, government, etc., service providers may want to ensure that certain specific questions are answered with definitive results. Therefore, it is necessary to customize the output for specific questions. For example, creating "standard answers" for certain questions or marking some questions as "unanswerable."
Rapid Tuning for POC or DEMO Products: When quickly building prototype products, customized responses achieved through annotated replies can efficiently enhance the expected generation of Q&A results, thereby improving customer satisfaction.
The annotated replies feature essentially provides another set of retrieval-enhanced systems, allowing you to bypass the LLM generation phase and avoid the hallucination issues of RAG.
After enabling the annotated replies feature, you can annotate the responses from LLM conversations. You can add high-quality answers from LLM responses directly as annotations or edit a high-quality answer according to your needs. These edited annotations will be saved persistently.
When a user asks a similar question again, the system will vectorize the question and search for similar annotated questions.
If a match is found, the corresponding answer from the annotation will be returned directly, bypassing the LLM or RAG process.
If no match is found, the question will continue through the regular process (passing to LLM or RAG).
Once the annotated replies feature is disabled, the system will no longer match responses from annotations.
Enable the annotated replies switch by navigating to “Orchestrate -> Add Features”:
When enabling, you need to set the parameters for annotated replies, which include: Score Threshold and Embedding Model.
Score Threshold: This sets the similarity threshold for matching annotated replies. Only annotations with scores above this threshold will be recalled.
Embedding Model: This is used to vectorize the annotated text. Changing the model will regenerate the embeddings.
Click save and enable, and the settings will take effect immediately. The system will generate embeddings for all saved annotations using the embedding model.
You can directly add or edit annotations on the model response information in the debug and preview pages.
Edit the response to the high-quality reply you need and save it.
Re-enter the same user question, and the system will use the saved annotation to reply to the user's question directly.
Enable the annotated replies switch by navigating to “Logs & Ann. -> Annotations”:
The parameters that can be set for annotated replies include: Score Threshold and Embedding Model.
Score Threshold: This sets the similarity threshold for matching annotated replies. Only annotations with scores above this threshold will be recalled.
Embedding Model: This is used to vectorize the annotated text. Changing the model will regenerate the embeddings.
In the bulk import feature, you can download the annotation import template, edit the annotated Q&A pairs according to the template format, and then import them in bulk.
Through the bulk export feature, you can export all saved annotated Q&A pairs in the system at once.
In the annotation hit history feature, you can view the edit history of all hits on the annotation, the user's hit questions, the response answers, the source of the hits, the matching similarity scores, the hit time, and other information. You can use this information to continuously improve your annotated content.