update Chat model input max size
What does this merge request do and why?
This MR syncs the maximum input token length to 4_096
to be in line with Claude 3 models in Chat endpoint.
See https://gitlab.slack.com/archives/C06LWENL58F/p1718799874330299 for more context.
Edited by Tan Le