Commit Graph

802 Commits (9889aa10bd5bc6bf65265d93780ec39f0770cacc)

Author SHA1 Message Date
-LAN- 455791b710
fix(model_runtime): make invoke as ValueError (#11929)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
Kalo Chin 2681bafb76
fix: handle document fetching from URL in Anthropic LLM model, solving base64 decoding error (#11858) 1 year ago
yihong 7b03a0316d
fix: better memory usage from 800+ to 500+ (#11796)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
-LAN- 996a9135f6
feat(llm_node): support order in text and files (#11837)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
Dr.MerdanBay bb2f46d7cc
fix: add safe dictionary access for bedrock credentials (#11860) 1 year ago
yihong 463fbe2680
fix: better gard nan value from numpy for issue #11827 (#11864)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
非法操作 9d93ad1f16
feat: add gemini-2.0-flash-thinking-exp-1219 (#11863) 1 year ago
yihong 12d45e9114
fix: silicon change its model fix #11844 (#11847)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
barabicu d057067543
fix: remove ruff ignore SIM300 (#11810) 1 year ago
sino 560d375e0f
feat(ark): add doubao-pro-256k and doubao-embedding-large (#11831) 1 year ago
Agung Besti 3388d6636c
add-model-azure-gpt-4o-2024-11-20 (#11803)
Co-authored-by: agungbesti <agung.besti@insignia.co.id>
1 year ago
xander-art 56434db4f5
feat:add hunyuan model(hunyuan-role, hunyuan-large, hunyuan-large-rol… (#11766)
Co-authored-by: xanderdong <xanderdong@tencent.com>
1 year ago
-LAN- a5db7c9acb
feat: add openai o1 & update pricing and max_token of other models (#11780)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
非法操作 9048832a9a
chore: improve gemini models (#11745) 1 year ago
Shota Totsuka 7d5a385811
feat: use Gemini response metadata for token counting (#11743) 1 year ago
sino 99430a5931
feat(ark): support doubao vision series models (#11740) 1 year ago
非法操作 c9b4029ce7
chore: the consistency of MultiModalPromptMessageContent (#11721) 1 year ago
呆萌闷油瓶 cd4310df25
chore:update azure api version (#11711) 1 year ago
非法操作 74fdc16bd1
feat: enhance gemini models (#11497) 1 year ago
方程 fc8fdbacb4
feat: add gitee ai vl models (#11697)
Co-authored-by: 方程 <fangcheng@oschina.cn>
1 year ago
zhongliliu-butterfly daccb10d8c
fix: volcengine_maas and baichuan message error (#11625)
Co-authored-by: zhongliliu <liuzlx@digitalchina.com>
1 year ago
zhaobingshuang 79801f5c30
fix: deepseek reports an error when using Response Format #11677 (#11678)
Co-authored-by: zhaobs <zhaobs@cailian.net>
1 year ago
非法操作 cf0ff88120
feat: add grok-2-1212 and grok-2-vision-1212 (#11672) 1 year ago
yihong 7e154a467b
fix: better error message for stream (#11635)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
JasonVV bb3bc60f83
feat(model): add vertex_ai Gemini 2.0 Flash Exp (#11604) 1 year ago
crazywoola e7a4cfac4d
fix: name of llama-3.3-70b-specdec (#11596) 1 year ago
Alok Shrivastwa 6478aa1c9d
Added new models and Removed the deleted ones for Groq #11455 (#11456)
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: Alok Shrivastwa <Alok.Shrivastwa@microland.com>
1 year ago
Warren Chen 7b5839335a
[ref] use one method to get boto client for aws bedrock (#11506) 1 year ago
非法操作 926f604f09
feat: add gemini-2.0-flash-exp (#11570) 1 year ago
Tommy 42d986b96d
[Pixtral] Add new model ; add vision (#11231) 1 year ago
zkyTech fbc4ca980c
fix: Remove duplicate 'response_format' parameter from model YAML files (#11531)
Co-authored-by: zhangkunyuan <zhangkunyuan@cmhi.chinamobile.com>
1 year ago
Paul van Oorschot 80c52e0ea4
feat: Add llama-3.3 models for Groq (#11533) 1 year ago
orangeclk ec00b25793
feat: add siliconflow qwq and llama3.3 model (#11492) 1 year ago
Yingchun Lai 32f8439143
fix: add the missing abab6.5t-chat model of Minimax (#11484) 1 year ago
Kazuki Takamatsu 4d7cfd0de5
Fix model provider of vertex ai (#11437) 1 year ago
非法操作 7e1184c071
feat: support json_schema for ollama models (#11449) 1 year ago
非法操作 1ce51e57ab
feat: add gemini exp 1206 (#11444) 1 year ago
非法操作 142b4fd699
feat: add zhipu glm_4v_flash (#11440) 1 year ago
shirochan 5093337de1
FEAT: cohere rerank 3.5 model added (#11289) 1 year ago
Matsuda f54225568c
fix(model_runtime): add vision to Amazon Nova Lite and Pro (#11398) 1 year ago
Warren Chen 631cbcd781
[fix] rename yaml files to fit windows (#11379) 1 year ago
yihong 5669cac16d
fix: some typos using typos (#11374)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Warren Chen 376726cf90
[feat] Add AWS Bedrock rerank (#11349)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
yihong 961e25f608
fix: better bedrock message handler close #10976 (#11317)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
ybalbert001 1bae9b8ff7
update pricing for bedrock nova LLM models (#11336)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
非法操作 91e1ff5e30
chore: improve zhipu LLM (#11321) 1 year ago
ybalbert001 5908e10549
integrate amazon nove llms to dify (#11324)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
yihong e39e776d03
fix: better wenxin rerank handler, close #11252 (#11283)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Bowen Liang e79eac688a
chore(lint): sort __all__ definitions (#11243) 1 year ago
-LAN- 643a90c48d
fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
yihong 02572e8cca
fix: claude can not handle empty string (#11238)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 239bf97b47
fix: nvidia special embedding model payload close #11193 (#11239)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Shota Totsuka 594666eb61
fix: use Gemini response metadata for token counting (#11226) 1 year ago
liujiamingtiny 6f9ce6a199
fix: fix azure open-4o-08-06 when enable json schema cant process content = "" (#11204)
Co-authored-by: jiaming.liu <jiaming.liu@zkh.com>
1 year ago
yihong 40fc6f529e
fix: gitee ai wrong default model, and better para (#11168)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Tao Wang aa135a3780
Add TTS to OpenAI_API_Compatible (#11071) 1 year ago
-LAN- 5b7b328193
feat: Allow to contains files in the system prompt even model not support. (#11111) 1 year ago
-LAN- 1db14793fa
fix(anthropic_llm): Ignore non-text parts in the system prompt. (#11107) 1 year ago
fengjiajie ab6dcf7032
fix: update the max tokens configuration for Azure GPT-4o (2024-08-06) to 16384 (#11074) 1 year ago
yihong 8aae235a71
fix: int None will cause error for context size (#11055)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Tao Wang 1065917872
Add grok-vision-beta to xAI + Update grok-beta Features (#11004) 1 year ago
yihong 2e00829b1e
fix: drop useless and wrong code for zhipu embedding (#11069)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
SiliconFlow, Inc a4fc057a1c
ISSUE=11042: add tts model in siliconflow (#11043) 1 year ago
Tao Wang aae29e72ae
Fix Deepseek Function/Tool Calling (#11023) 1 year ago
cyflhn 03ba4bc760
fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
Bowen Liang 6c8e208ef3
chore: bump minimum supported Python version to 3.11 (#10386) 1 year ago
kenwoodjw 096c0ad564
feat: Add support for TEI API key authentication (#11006)
Signed-off-by: kenwoodjw <blackxin55+@gmail.com>
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
Kazuhisa Wada 16c41585e1
Fixing #11005: Incorrect max_tokens in yaml file for AWS Bedrock US Cross Region Inference version of 3.5 Sonnet v2 and 3.5 Haiku (#11013) 1 year ago
yihong 448a19bf54
fix: fish audio wrong validate credentials interface (#11019)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
非法操作 08ac36812b
feat: support LLM process document file (#10966)
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
-LAN- c5f7d650b5
feat: Allow using file variables directly in the LLM node and support more file types. (#10679)
Co-authored-by: Joel <iamjoel007@gmail.com>
1 year ago
CXwudi d9579f418d
chore: Added the new gemini exp-1121 and learnlm-1.5 models (#10963) 1 year ago
Agung Besti e8868a7fb9
feat: add gpt-4o-2024-11-20 (#10951)
Co-authored-by: akubesti <agung.besti@insignia.co.id>
1 year ago
LastHopeOfGPNU 1a6b961b5f
Resolve 8475 support rerank model from infinity (#10939)
Co-authored-by: linyanxu <linyanxu2@qq.com>
1 year ago
-LAN- 82575a7aea
fix(gpt-4o-audio-preview): Remove the vision feature (#10932) 1 year ago
yihong 80da0c5830
fix: default max_chunks set to 1 as other providers (#10937)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 0067b16d1e
fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
-LAN- 4d6b45427c
Support streaming output for OpenAI o1-preview and o1-mini (#10890) 1 year ago
ybalbert001 c3d11c8ff6
fix: aws presign url is not workable remote url (#10884)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
GeorgeCaoJ fbfc811a44
feat: support function call for ollama block chat api (#10784) 1 year ago
Ding Jiatong 3087913b74
Fix the situation where output_tokens/input_tokens may be None in response.usage (#10728) 1 year ago
Jyong bd05df5cc5
fix tongyi embedding endpoint return None output (#10857) 1 year ago
非法操作 bc1013dacf
feat: support json schema for gemini models (#10835) 1 year ago
非法操作 ba537d657f
feat: add gemini-exp-1114 (#10779) 1 year ago
Bowen Liang 51db59622c
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
Bowen Liang 365cb4b368
chore(lint): bump ruff from 0.6.9 to 0.7.3 (#10714) 1 year ago
SiliconFlow, Inc e61242a337
feat: add vlm models from siliconflow (#10704) 1 year ago
orangeclk 317ae9233e
feat: add json response format for siliconflow models (#10657) 1 year ago
xiandan-erizo 5b8f03cd9d
add abab7-chat-preview model (#10654)
Co-authored-by: xiandan-erizo <xiandan-erizo@outlook.com>
1 year ago
方程 ef8022f715
Gitee AI Qwen2.5-72B model (#10595) 1 year ago
Kevin9703 e03ec0032b
fix: Azure OpenAI o1 max_completion_token error (#10593) 1 year ago
-LAN- 867bf70f1a
fix(model_runtime): ensure compatibility with O1 models by adjusting token parameters (#10537) 1 year ago
Jyong 0c1307b083
add jina rerank http timout parameter (#10476) 1 year ago
fdb02983rhy 05d43a4074
Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) 1 year ago
larcane97 aa895cfa9b
fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417)
Co-authored-by: moon <moon@vessl.ai>
1 year ago
非法操作 033ab5490b
feat: support LLM understand video (#9828) 1 year ago
Bowen Liang 574c4a264f
chore(lint): Use logging.exception instead of logging.error (#10415) 1 year ago
Matsuda 1e8457441d
fix(model_runtime): remove vision from features for Claude 3.5 Haiku (#10360) 1 year ago
Infinitnet 5a9448245b
fix: remove unsupported vision in OpenRouter Haiku 3.5 (#10364) 1 year ago
Bowen Liang d45d90e8ae
chore: lazy import sagemaker (#10342) 1 year ago
Infinitnet bdadca1a65
feat: add support for anthropic/claude-3-5-haiku through OpenRouter (#10331) 1 year ago
非法操作 bf9349c4dc
feat: add xAI model provider (#10272) 1 year ago
Matsuda 4847548779
feat(model_runtime): add new model 'claude-3-5-haiku-20241022' (#10285) 1 year ago
Matsuda cb245b5435
fix(model_runtime): fix wrong max_tokens for Claude 3.5 Haiku on Amazon Bedrock (#10286) 1 year ago
Matsuda 9305ad2102
feat: support Claude 3.5 Haiku on Amazon Bedrock (#10265) 1 year ago
方程 2aa171c348
Using a dedicated interface to obtain the token credential for the gitee.ai provider (#10243) 1 year ago
Xiao Ley b28cf68097
chore: enable vision support for models in OpenRouter that should have supported vision (#10191) 1 year ago
Lawrence Li 76b0328eb1
feat: add gpustack model provider (#10158) 1 year ago
larcane97 8d5456b6d0
Add VESSL AI OpenAI API-compatible model provider and LLM model (#9474)
Co-authored-by: moon <moon@vessl.ai>
1 year ago
Coal Pigeon 4d5546953a
add llm: ernie-4.0-turbo-128k of wenxin (#10135)
Co-authored-by: Pigeon姚宏锋 <pigeon.yhf@galaxyoversea.com>
1 year ago
Charlie.Wei f6fecb957e
fix azure chatgpt o1 parameter error (#10067) 1 year ago
zhuhao 92a3898540
fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 1 year ago
非法操作 12adcf8925
fix: gemini model use some tools raise error (#9993) 1 year ago
方程 0ebd985672
feat: add models for gitee.ai (#9490) 1 year ago
ice yao 22776f24ab
chore: Extract common functions of the base model in Azure OpenAI Provider (#9907) 1 year ago
非法操作 1b5adf40da
fix: moonshot response_format raise error (#9847) 1 year ago
guogeer 70ddc0ce43
openai compatiable api usage and id (#9800)
Co-authored-by: jinqi.guo <jinqi.guo@ubtrobot.com>
1 year ago
-LAN- e11d5ac708
feat(model_runtime): add new model 'claude-3-5-sonnet-20241022' (#9708) 1 year ago
Pan, Wen-Ming ecc8beef3f
feat: added claude 3.5 sonnet v2 model to Google Cloud Vertex AI (#9688) 1 year ago
ybalbert001 4989d0c904
add bedrock claude 3.5 v2 support (#9685)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
ice yao 1e829ceaf3
chore: format get_customizable_model_schema return value (#9335) 1 year ago
AAEE86 9b32bfb3db
feat: Updata tongyi models (#9552) 1 year ago
-LAN- e61752bd3a
feat/enhance the multi-modal support (#8818) 1 year ago
chzphoenix 42fe208eda
refactor wenxin rerank (#9486)
Co-authored-by: cuihz <cuihz@knowbox.cn>
1 year ago
Ziyu Huang 660fc3bb34
Resolve 9508 openai compatible rerank (#9511) 1 year ago
Tao Wang b92504bebc
Added Llama 3.2 Vision Models Speech2Text Models for Groq (#9479) 1 year ago
zhuhao e0846792d2
feat: add yi custom llm intergration (#9482) 1 year ago
zhuhao b3cde9900c
feat: add parameter top-k for the llm model provided by openrouter and siliconflow (#9455) 1 year ago
zhuhao 3fc0ebdd51
feat: add yi-lightning llm model for yi (#9458) 1 year ago
chzphoenix 211f416806
feat:add wenxin rerank (#9431)
Co-authored-by: cuihz <cuihz@knowbox.cn>
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
zhuhao b90ad587c2
refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
zhuhao a45f8969a0
fix: remove the undefined variable line (#9446) 1 year ago
ybalbert001 fdcf87c70c
fix https://github.com/langgenius/dify/issues/9409 (#9433)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
ice yao dd22e78515
fix: Deprecated gemma2-9b model in Fireworks AI Provider (#9373) 1 year ago
crazywoola 423df67042
fix: use gpt-4o-mini for validating credentials (#9387) 1 year ago
非法操作 da25b91980
fix: remove the stream option of zhipu and gemini (#9319) 1 year ago
Jason Tan 9b8aa9b75d
feat: add minimax abab6.5t support (#9365) 1 year ago
非法操作 4ffaabcc04
feat: add glm-4-flashx, deprecated chatglm_turbo (#9357) 1 year ago
Warren Wong b597a0d31c
fix: Azure OpenAI o1 max_completion_token and get_num_token_from_messages error (#9326)
Co-authored-by: wwwc <wwwc@outlook.com>
1 year ago
ice yao 5908fd6552
Adapt input type parameter with MiniMax embedding model (#9342) 1 year ago
ice yao 3f9d6759d4
feat: Add qwen2.5 72B Instruct model in Fireworks AI (#9340) 1 year ago
ice yao aba70207ab
feat: Add fireworks custom llm intergration (#9333) 1 year ago
非法操作 ffc3f33670
chore: remove the copied zhipu_ai sdk (#9270) 1 year ago
AAEE86 fe41e8bc18
feat: add siliconflow custom add model interface (#8745) 1 year ago
Fei He 5c76131d3d
feat: add gte rerank for tongyi (#9153) 1 year ago
Charlie.Wei 6b6e94da08
Fix code indentation errors (#9164) 1 year ago
Ziyu Huang fc60b554a1
Fixes #9159: Modify to make it works to llama.cpp rerank API (#9160) 1 year ago
ronaksingh27 62051d5171
Corrected type annotation to "Any" from "any" all files in "model_providers" folder (#9135) 1 year ago
luckylhb90 2024a6c941
fix: vertex ai remote url error(Error: not enough values to unpack) (#9134)
Co-authored-by: hobo.l <hobo.l@binance.com>
1 year ago
呆萌闷油瓶 060897b25b
chore:add azure openai api version 2024-09-01-preview (#9141) 1 year ago