For example, "history" seems to be unable to obtain the previous "question", and the line response = outputs[0]["generated_text"][len(prompt):] seems to be easily affected by apply_chat_template, resulting in the response not being the actual response
For example, "history" seems to be unable to obtain the previous "question", and the line
response = outputs[0]["generated_text"][len(prompt):]seems to be easily affected by apply_chat_template, resulting in the response not being the actual response