[InferenceClient] Add support for adapter_id
(text-generation) and response_format
(chat-completion)
#6249
Job | Run time |
---|---|
5m 53s | |
1m 42s | |
2m 37s | |
50s | |
1m 36s | |
2m 22s | |
1m 0s | |
36s | |
1m 22s | |
53s | |
1m 0s | |
51s | |
20m 42s |