Commit Graph

652 Commits (fix/iteration-thread-pool-error)

Author SHA1 Message Date
yihong 02572e8cca
fix: claude can not handle empty string (#11238)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 239bf97b47
fix: nvidia special embedding model payload close #11193 (#11239)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Shota Totsuka 594666eb61
fix: use Gemini response metadata for token counting (#11226) 1 year ago
liujiamingtiny 6f9ce6a199
fix: fix azure open-4o-08-06 when enable json schema cant process content = "" (#11204)
Co-authored-by: jiaming.liu <jiaming.liu@zkh.com>
1 year ago
yihong 40fc6f529e
fix: gitee ai wrong default model, and better para (#11168)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Tao Wang aa135a3780
Add TTS to OpenAI_API_Compatible (#11071) 1 year ago
-LAN- 5b7b328193
feat: Allow to contains files in the system prompt even model not support. (#11111) 1 year ago
-LAN- 1db14793fa
fix(anthropic_llm): Ignore non-text parts in the system prompt. (#11107) 1 year ago
fengjiajie ab6dcf7032
fix: update the max tokens configuration for Azure GPT-4o (2024-08-06) to 16384 (#11074) 1 year ago
yihong 8aae235a71
fix: int None will cause error for context size (#11055)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Tao Wang 1065917872
Add grok-vision-beta to xAI + Update grok-beta Features (#11004) 1 year ago
yihong 2e00829b1e
fix: drop useless and wrong code for zhipu embedding (#11069)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
SiliconFlow, Inc a4fc057a1c
ISSUE=11042: add tts model in siliconflow (#11043) 1 year ago
Tao Wang aae29e72ae
Fix Deepseek Function/Tool Calling (#11023) 1 year ago
cyflhn 03ba4bc760
fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
Bowen Liang 6c8e208ef3
chore: bump minimum supported Python version to 3.11 (#10386) 1 year ago
kenwoodjw 096c0ad564
feat: Add support for TEI API key authentication (#11006)
Signed-off-by: kenwoodjw <blackxin55+@gmail.com>
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
Kazuhisa Wada 16c41585e1
Fixing #11005: Incorrect max_tokens in yaml file for AWS Bedrock US Cross Region Inference version of 3.5 Sonnet v2 and 3.5 Haiku (#11013) 1 year ago
yihong 448a19bf54
fix: fish audio wrong validate credentials interface (#11019)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
非法操作 08ac36812b
feat: support LLM process document file (#10966)
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
-LAN- c5f7d650b5
feat: Allow using file variables directly in the LLM node and support more file types. (#10679)
Co-authored-by: Joel <iamjoel007@gmail.com>
1 year ago
CXwudi d9579f418d
chore: Added the new gemini exp-1121 and learnlm-1.5 models (#10963) 1 year ago
Agung Besti e8868a7fb9
feat: add gpt-4o-2024-11-20 (#10951)
Co-authored-by: akubesti <agung.besti@insignia.co.id>
1 year ago
LastHopeOfGPNU 1a6b961b5f
Resolve 8475 support rerank model from infinity (#10939)
Co-authored-by: linyanxu <linyanxu2@qq.com>
1 year ago
-LAN- 82575a7aea
fix(gpt-4o-audio-preview): Remove the vision feature (#10932) 1 year ago
yihong 80da0c5830
fix: default max_chunks set to 1 as other providers (#10937)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 0067b16d1e
fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
-LAN- 4d6b45427c
Support streaming output for OpenAI o1-preview and o1-mini (#10890) 1 year ago
ybalbert001 c3d11c8ff6
fix: aws presign url is not workable remote url (#10884)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
GeorgeCaoJ fbfc811a44
feat: support function call for ollama block chat api (#10784) 1 year ago
Ding Jiatong 3087913b74
Fix the situation where output_tokens/input_tokens may be None in response.usage (#10728) 1 year ago
Jyong bd05df5cc5
fix tongyi embedding endpoint return None output (#10857) 1 year ago
非法操作 bc1013dacf
feat: support json schema for gemini models (#10835) 1 year ago
非法操作 ba537d657f
feat: add gemini-exp-1114 (#10779) 1 year ago
Bowen Liang 51db59622c
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
Bowen Liang 365cb4b368
chore(lint): bump ruff from 0.6.9 to 0.7.3 (#10714) 1 year ago
SiliconFlow, Inc e61242a337
feat: add vlm models from siliconflow (#10704) 1 year ago
orangeclk 317ae9233e
feat: add json response format for siliconflow models (#10657) 1 year ago
xiandan-erizo 5b8f03cd9d
add abab7-chat-preview model (#10654)
Co-authored-by: xiandan-erizo <xiandan-erizo@outlook.com>
1 year ago
方程 ef8022f715
Gitee AI Qwen2.5-72B model (#10595) 1 year ago
Kevin9703 e03ec0032b
fix: Azure OpenAI o1 max_completion_token error (#10593) 1 year ago
-LAN- 867bf70f1a
fix(model_runtime): ensure compatibility with O1 models by adjusting token parameters (#10537) 1 year ago
Jyong 0c1307b083
add jina rerank http timout parameter (#10476) 1 year ago
fdb02983rhy 05d43a4074
Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) 1 year ago
larcane97 aa895cfa9b
fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417)
Co-authored-by: moon <moon@vessl.ai>
1 year ago
非法操作 033ab5490b
feat: support LLM understand video (#9828) 1 year ago
Bowen Liang 574c4a264f
chore(lint): Use logging.exception instead of logging.error (#10415) 1 year ago
Matsuda 1e8457441d
fix(model_runtime): remove vision from features for Claude 3.5 Haiku (#10360) 1 year ago
Infinitnet 5a9448245b
fix: remove unsupported vision in OpenRouter Haiku 3.5 (#10364) 1 year ago
Bowen Liang d45d90e8ae
chore: lazy import sagemaker (#10342) 1 year ago