Commit Graph

20 Commits (b63a6853869ffb3035287c44b333bcd091ca302e)

Author SHA1 Message Date
takatost c4d8bdc3db
fix: hf hosted inference check (#1128) 3 years ago
takatost d75e8aeafa
feat: disable anthropic retry (#1067) 3 years ago
takatost 2eba98a465
feat: optimize anthropic connection pool (#1066) 3 years ago
takatost 417c19577a
feat: add LocalAI local embedding model support (#1021)
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
3 years ago
takatost 0796791de5
feat: hf inference endpoint stream support (#1028) 3 years ago
Uranus 2d9616c29c
fix: xinference last token being ignored (#1013) 3 years ago
takatost 9ae91a2ec3
feat: optimize xinference request max token key and stop reason (#998) 3 years ago
takatost bd3a9b2f8d
fix: xinference-chat-stream-response (#991) 3 years ago
takatost 18d3877151
feat: optimize xinference stream (#989) 3 years ago
takatost a76fde3d23
feat: optimize hf inference endpoint (#975) 3 years ago
takatost 78d3aa5fcd
fix: embedding init err (#956) 3 years ago
takatost 4f3053a8cc
fix: xinference chat completion error (#952) 3 years ago
takatost 866ee5da91
fix: openllm generate cutoff (#945) 3 years ago
takatost e0a48c4972
fix: xinference chat support (#939) 3 years ago
takatost 6c832ee328
fix: remove openllm pypi package because of this package too large (#931) 3 years ago
takatost 0cc0b6e052
fix: error raise status code not exist (#888) 3 years ago
takatost f42e7d1a61
feat: add spark v2 support (#885) 3 years ago
takatost c4d759dfba
fix: wenxin error not raise when stream mode (#884) 3 years ago
takatost cc52cdc2a9
Feat/add free provider apply (#829) 3 years ago
takatost 5fa2161b05
feat: server multi models support (#799) 3 years ago