Can I set a limit on the maximum number of tokens allowed in a request? I.e., once the lexer knows the token count, fail the request if `count > N`.
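
To make the intent concrete, here is a minimal sketch of the behavior I have in mind. The toy tokenizer, `check_token_limit`, and `TokenLimitExceeded` are all hypothetical names made up for illustration; they are not part of any real library's API.

```python
import re
from typing import Iterator


class TokenLimitExceeded(Exception):
    """Raised when a request contains more tokens than allowed (hypothetical)."""


# Toy token pattern (an assumption): identifiers, integers, or single punctuation characters.
_TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|[^\s\w]")


def lex(source: str) -> Iterator[str]:
    """Yield tokens lazily so counting can stop as soon as the limit is passed."""
    for match in _TOKEN_RE.finditer(source):
        yield match.group()


def check_token_limit(source: str, max_tokens: int) -> int:
    """Return the token count, or raise once the count exceeds max_tokens."""
    count = 0
    for _ in lex(source):
        count += 1
        if count > max_tokens:
            raise TokenLimitExceeded(f"request exceeds {max_tokens} tokens")
    return count
```

With this sketch, `check_token_limit(request_body, N)` would run before parsing, and the request would be rejected whenever it raises. Counting inside the lexing loop means an oversized request fails as soon as token `N + 1` is seen, rather than after the whole input has been lexed.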