819 Commits (main)

Author SHA1 Message Date
非法操作 ba537d657f
feat: add gemini-exp-1114 (#10779) 1 year ago
Bowen Liang 51db59622c
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
Bowen Liang 365cb4b368
chore(lint): bump ruff from 0.6.9 to 0.7.3 (#10714) 1 year ago
SiliconFlow, Inc e61242a337
feat: add vlm models from siliconflow (#10704) 1 year ago
orangeclk 317ae9233e
feat: add json response format for siliconflow models (#10657) 1 year ago
xiandan-erizo 5b8f03cd9d
add abab7-chat-preview model (#10654)
Co-authored-by: xiandan-erizo <xiandan-erizo@outlook.com>
1 year ago
方程 ef8022f715
Gitee AI Qwen2.5-72B model (#10595) 1 year ago
Kevin9703 e03ec0032b
fix: Azure OpenAI o1 max_completion_token error (#10593) 1 year ago
-LAN- 867bf70f1a
fix(model_runtime): ensure compatibility with O1 models by adjusting token parameters (#10537) 1 year ago
Jyong 0c1307b083
add jina rerank http timout parameter (#10476) 1 year ago
fdb02983rhy 05d43a4074
Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) 1 year ago
larcane97 aa895cfa9b
fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417)
Co-authored-by: moon <moon@vessl.ai>
1 year ago
非法操作 033ab5490b
feat: support LLM understand video (#9828) 1 year ago
Bowen Liang 574c4a264f
chore(lint): Use logging.exception instead of logging.error (#10415) 1 year ago
Matsuda 1e8457441d
fix(model_runtime): remove vision from features for Claude 3.5 Haiku (#10360) 1 year ago
Infinitnet 5a9448245b
fix: remove unsupported vision in OpenRouter Haiku 3.5 (#10364) 1 year ago
Bowen Liang d45d90e8ae
chore: lazy import sagemaker (#10342) 1 year ago
Infinitnet bdadca1a65
feat: add support for anthropic/claude-3-5-haiku through OpenRouter (#10331) 1 year ago
非法操作 bf9349c4dc
feat: add xAI model provider (#10272) 1 year ago
Matsuda 4847548779
feat(model_runtime): add new model 'claude-3-5-haiku-20241022' (#10285) 1 year ago
Matsuda cb245b5435
fix(model_runtime): fix wrong max_tokens for Claude 3.5 Haiku on Amazon Bedrock (#10286) 1 year ago
Matsuda 9305ad2102
feat: support Claude 3.5 Haiku on Amazon Bedrock (#10265) 1 year ago
方程 2aa171c348
Using a dedicated interface to obtain the token credential for the gitee.ai provider (#10243) 1 year ago
Xiao Ley b28cf68097
chore: enable vision support for models in OpenRouter that should have supported vision (#10191) 1 year ago
Lawrence Li 76b0328eb1
feat: add gpustack model provider (#10158) 1 year ago
larcane97 8d5456b6d0
Add VESSL AI OpenAI API-compatible model provider and LLM model (#9474)
Co-authored-by: moon <moon@vessl.ai>
1 year ago
Coal Pigeon 4d5546953a
add llm: ernie-4.0-turbo-128k of wenxin (#10135)
Co-authored-by: Pigeon姚宏锋 <pigeon.yhf@galaxyoversea.com>
1 year ago
Charlie.Wei f6fecb957e
fix azure chatgpt o1 parameter error (#10067) 1 year ago
zhuhao 92a3898540
fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 1 year ago
非法操作 12adcf8925
fix: gemini model use some tools raise error (#9993) 1 year ago
方程 0ebd985672
feat: add models for gitee.ai (#9490) 1 year ago
ice yao 22776f24ab
chore: Extract common functions of the base model in Azure OpenAI Provider (#9907) 1 year ago
非法操作 1b5adf40da
fix: moonshot response_format raise error (#9847) 1 year ago
guogeer 70ddc0ce43
openai compatiable api usage and id (#9800)
Co-authored-by: jinqi.guo <jinqi.guo@ubtrobot.com>
1 year ago
-LAN- e11d5ac708
feat(model_runtime): add new model 'claude-3-5-sonnet-20241022' (#9708) 1 year ago
Pan, Wen-Ming ecc8beef3f
feat: added claude 3.5 sonnet v2 model to Google Cloud Vertex AI (#9688) 1 year ago
ybalbert001 4989d0c904
add bedrock claude 3.5 v2 support (#9685)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
ice yao 1e829ceaf3
chore: format get_customizable_model_schema return value (#9335) 1 year ago
AAEE86 9b32bfb3db
feat: Updata tongyi models (#9552) 1 year ago
-LAN- e61752bd3a
feat/enhance the multi-modal support (#8818) 1 year ago
chzphoenix 42fe208eda
refactor wenxin rerank (#9486)
Co-authored-by: cuihz <cuihz@knowbox.cn>
1 year ago
Ziyu Huang 660fc3bb34
Resolve 9508 openai compatible rerank (#9511) 1 year ago
Tao Wang b92504bebc
Added Llama 3.2 Vision Models Speech2Text Models for Groq (#9479) 1 year ago
zhuhao e0846792d2
feat: add yi custom llm intergration (#9482) 1 year ago
zhuhao b3cde9900c
feat: add parameter top-k for the llm model provided by openrouter and siliconflow (#9455) 1 year ago
zhuhao 3fc0ebdd51
feat: add yi-lightning llm model for yi (#9458) 1 year ago
chzphoenix 211f416806
feat:add wenxin rerank (#9431)
Co-authored-by: cuihz <cuihz@knowbox.cn>
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
zhuhao b90ad587c2
refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
zhuhao a45f8969a0
fix: remove the undefined variable line (#9446) 1 year ago
ybalbert001 fdcf87c70c
fix https://github.com/langgenius/dify/issues/9409 (#9433)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
ice yao dd22e78515
fix: Deprecated gemma2-9b model in Fireworks AI Provider (#9373) 1 year ago
crazywoola 423df67042
fix: use gpt-4o-mini for validating credentials (#9387) 1 year ago
非法操作 da25b91980
fix: remove the stream option of zhipu and gemini (#9319) 1 year ago
Jason Tan 9b8aa9b75d
feat: add minimax abab6.5t support (#9365) 1 year ago
非法操作 4ffaabcc04
feat: add glm-4-flashx, deprecated chatglm_turbo (#9357) 1 year ago
Warren Wong b597a0d31c
fix: Azure OpenAI o1 max_completion_token and get_num_token_from_messages error (#9326)
Co-authored-by: wwwc <wwwc@outlook.com>
1 year ago
ice yao 5908fd6552
Adapt input type parameter with MiniMax embedding model (#9342) 1 year ago
ice yao 3f9d6759d4
feat: Add qwen2.5 72B Instruct model in Fireworks AI (#9340) 1 year ago
ice yao aba70207ab
feat: Add fireworks custom llm intergration (#9333) 1 year ago
非法操作 ffc3f33670
chore: remove the copied zhipu_ai sdk (#9270) 1 year ago
AAEE86 fe41e8bc18
feat: add siliconflow custom add model interface (#8745) 1 year ago
Fei He 5c76131d3d
feat: add gte rerank for tongyi (#9153) 1 year ago
Charlie.Wei 6b6e94da08
Fix code indentation errors (#9164) 1 year ago
Ziyu Huang fc60b554a1
Fixes #9159: Modify to make it works to llama.cpp rerank API (#9160) 1 year ago
ronaksingh27 62051d5171
Corrected type annotation to "Any" from "any" all files in "model_providers" folder (#9135) 1 year ago
luckylhb90 2024a6c941
fix: vertex ai remote url error(Error: not enough values to unpack) (#9134)
Co-authored-by: hobo.l <hobo.l@binance.com>
1 year ago
呆萌闷油瓶 060897b25b
chore:add azure openai api version 2024-09-01-preview (#9141) 1 year ago
非法操作 499cc57082
fix: response_format of model_parameters will not be removed (#9148) 1 year ago
Charlie.Wei 55679b4389
azure add o1-mini、o1-preview models (#9088)
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
1 year ago
Bowen Liang 240b66d737
chore: avoid implicit optional in type annotations of method (#8727) 1 year ago
crazywoola 3a0734d94c
Feat/9081 add support for llamaguard through groq provider (#9083) 1 year ago
Infinitnet e741ee2f45
Correct max_tokens for OpenRouter Sonnet 3.5 (#9068) 1 year ago
非法操作 966e65bb66
fix: zhipu ai web_search not work (#9058) 1 year ago
zg0d233 fcfa1252a0
fix bug when adding openai or openai-compatible stt model instance (#9006) 1 year ago
Giannis Kepas dc5839b6bb
feat: Update AWS Bedrock supported regions (#8992) 1 year ago
zhuhao 824a0dd63e
feat: add qwen2.5-72b and llama3.2 for openrouter (#8956) 1 year ago
CXwudi 0d84221b2c
chore: sort Gemini models (#8951) 1 year ago
CXwudi cdd7e55a88
chore: add missing models from Voyage (#8950) 1 year ago
zhuhao 77aef9ff1d
refactor: optimize the calculation of rerank threshold and the logic for forbidden characters in model_uid (#8879) 1 year ago
zhuhao fb49413a41
feat: add voyage ai as a new model provider (#8747) 1 year ago
zhuhao 42dfde6546
docs: add english versions for the files customizable_model_scale_out and predefined_model_scale_out (#8871) 1 year ago
chenxu9741 c531b4a911
fix: #8843 event: tts_message_end always return in api streaming resp… (#8846) 1 year ago
longzhihun e4ed916baa
Add Jamba and Llama3.2 model support (#8878) 1 year ago
Bowen Liang 74f58f29f9
chore: bump ruff to 0.6.8 for fixing violation in SIM910 (#8869) 1 year ago
zhuhao f97607370a
refactor: update Callback to an abstract class (#8868) 1 year ago
zhuhao 850492dafa
feat: deprecate gte-Qwen2-7B-instruct embedding model (#8866) 1 year ago
zhuhao 61c89a9168
feat: add internlm2.5-20b and qwen2.5-coder-7b model (#8862) 1 year ago
zhuhao 6cd22f3bca
fix: update qwen2.5-coder-7b model name (#8861) 1 year ago
CXwudi 0603359e2d
fix: delete harm catalog settings for gemini (#8829) 1 year ago
HowardChan bb781764b8
Add Llama3.2 models in Groq provider (#8831) 1 year ago
zhuhao 29275c7447
feat: deprecate mistral model for siliconflow (#8828) 1 year ago
CXwudi e5efd09ebb
chore: massive update of the Gemini models based on latest documentation (#8822) 1 year ago
wenmeng zhou ecc951609d
add more detailed doc for models of qwen series (#8799)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
ice yao 063474f408
Add llama3.2 model in fireworks provider (#8809) 1 year ago
AAEE86 9a4b53a212
feat: add stream for Gemini (#8678) 1 year ago
AAEE86 03edfbe6f5
feat: add qwen to add custom model parameters (#8759) 1 year ago
cx 128a66f7fe
fix: Ollama modelfeature set vision, and an exception occurred at the… (#8783) 1 year ago
Shenghang Tsai a0b0809b1c
Add more models for SiliconFlow (#8779) 1 year ago
Aaron Ji 4c9ef6e830
fix: update usage for Jina Embeddings v3 (#8771) 1 year ago
zhuhao ac73763726
chore: add input_type param desc for the _invoke method of text_embedding (#8778) 1 year ago
Pan, Wen-Ming 02ff6cca70
feat: add support for Vertex AI Gemini 1.5 002 and experimental models (#8767) 1 year ago
cherryhuahua d0e0111f88
fix:Spark's large language model token calculation error #7911 (#8755) 1 year ago
ybalbert001 68c7e68a8a
Fix Issue: switch LLM of SageMaker endpoint doesn't take effect (#8737)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
ice yao 91f70d0bd9
Add embedding models in fireworks provider (#8728) 1 year ago
Jyong 4669eb24be
add embedding input type parameter (#8724) 1 year ago
Shota Totsuka 1c7877b048
fix: remove harm category setting from vertex ai (#8721) 1 year ago
ice yao 64baedb484
fix: update nomic model provider token calculation (#8705) 1 year ago
Benjamin 4638f99aaa
fix: change model provider name issue Ref #8691 (#8710) 1 year ago
AAEE86 aebe5fc68c
fix: Remove unsupported parameters in qwen model (#8699) 1 year ago
zhuhao 1ecf70dca0
feat: add mixedbread as a new model provider (#8523) 1 year ago
ybalbert001 7c485f8bb8
fix llm integration problem: It doesn't work on docker env (#8701)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
Sa Zhang 7f1b028840
fix: change the brand name to Jina AI (#8691)
Co-authored-by: sa zhang <sa.zhang@jina.ai>
1 year ago
Nam Vu bef83a4d2e
fix: typos and improve naming conventions: (#8687) 1 year ago
ice yao d7aada38a1
Add nomic embedding model provider (#8640) 1 year ago
AAEE86 a126d535cf
add Spark Max-32K (#8676) 1 year ago
AAEE86 3554a803e7
add zhipuai web search (#8668) 1 year ago
AAEE86 c66cecaa55
add Qwen model translate (#8674) 1 year ago
Aaron Ji 3618a97c20
feat: extend api params for Jina Embeddings V3 (#8657) 1 year ago
zhuhao e34f04380d
feat: add deepseek-v2.5 for model provider siliconflow (#8639) 1 year ago
zhuhao 6df77038a2
docs: fix predefined_model_scale_out.md redirect error (#8633) 1 year ago
zhuhao 45c0a44411
feat: add qwen2.5 for model provider siliconflow (#8630) 1 year ago
CXwudi 97895ec41a
chore: add Gemini newest experimental models (close #7121) (#8621) 1 year ago
sino 6d56d5c1f6
feat: support o1 series models for openrouter (#8358) 1 year ago
AAEE86 c9f1e18df1
Add model parameter translation (#8509)
Co-authored-by: swingchen01 <swings@126.com>
Co-authored-by: 陈长君 <chenchangjun@shuwen.com>
1 year ago
Waffle 740fad06c1
feat(tools/cogview): Updated cogview tool to support cogview-3 and the latest cogview-3-plus (#8382) 1 year ago
ice yao 0665268578
Add Fireworks AI as new model provider (#8428) 1 year ago
呆萌闷油瓶 c8b9bdebfe
feat:use xinference tts stream mode (#8616) 1 year ago
AAEE86 1a8dcae10e
add Qwen custom add model interface (#8565) 1 year ago
AAEE86 5ddb601e43
add MixtralAI Model (#8517) 1 year ago
Hongbin 5541248264
Update the PerfXCloud provider model list,Update PerfXCloudProvider validate_provider_credentials method. (#8587)
Co-authored-by: xhb <466010723@qq.com>
1 year ago
Su Yang c87f710d58
Fix: update qwen model and model config (#8584)
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
Su Yang 1568c5cae9
fix: fix qwen series model type (#8580) 1 year ago
MuYu a03919c3b3
feat: add hunyuan-vision (#8529) 1 year ago
Su Yang d6de96c4b4
feat: sync Qwen API with Aliyun Bailian (#8538) 1 year ago
Wang Bo 6f222b49f2
refactor: rename task_type to task for jina embeddings v3 (#8488) 1 year ago
-LAN- 8dfe8c773a
chore: Deprecate gpt-3.5-turbo-0613 and gpt-3.5-turbo-16k-0613 models (#8500) 1 year ago
ybalbert001 b6ad7a1e06
Fix: https://github.com/langgenius/dify/issues/8190 (Update Model nam… (#8426)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
Aaron Ji 6f7625fa47
chore: update Jina embedding model (#8376) 1 year ago
ybalbert001 b613b11422
Fix: Support Bedrock cross region inference #8190 (Update Model name to distinguish between different region groups) (#8402)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
crazywoola 71b4480c4a
fix: o1-mini 65563 -> 65536 (#8388) 1 year ago
Bowen Liang 5b98acde2f
chore: improve usage of striping prefix or suffix of string with Ruff 0.6.5 (#8392) 1 year ago
Bowen Liang a1104ab97e
chore: refurish python code by applying Pylint linter rules (#8322) 1 year ago
xiandan-erizo 1ab81b4972
support hunyuan-turbo (#8372)
Co-authored-by: sunkesi <sunkesi@hosecloud.com>
1 year ago
takatost 24af4b9313
fix: o1-series model encounters an error when the generate mode is blocking (#8363) 1 year ago
Bowen Liang 6613b8f2e0
chore: fix unnecessary string concatation in single line (#8311) 1 year ago
sino a45ac6ab98
fix: ark token usage is none (#8351) 1 year ago
takatost 4637ddaa7f
feat: add o1-series models support in Agent App (ReACT only) (#8350) 1 year ago
takatost e90d3c29ab
feat: add OpenAI o1 series models support (#8328) 1 year ago
Nam Vu 153807f243
fix: response_format label (#8326) 1 year ago
呆萌闷油瓶 02c4b1af71
chore:add Azure openai api version 2024-08-01-preview (#8291) 1 year ago
ybalbert001 d4985fb3aa
Fix: Support Bedrock cross region inference [#8190](https://github.com/langgenius/dify/issues/8190) (#8317) 1 year ago
Bowen Liang 40fb4d16ef
chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
Bowen Liang c69f5b07ba
chore: apply ruff E501 line-too-long linter rule (#8275)
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
Bowen Liang 0f14873255
chore: cleanup ruff flake8-simplify linter rules (#8286)
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
Bowen Liang 781d294f49
chore: cleanup pycodestyle E rules (#8269) 1 year ago
yalei f515af2232
let claude models in bedrock support the response_format parameter (#8220)
Co-authored-by: duyalei <>
1 year ago
crazywoola 4d2cd6703b
chore: remove useless code (#8198) 1 year ago
Bowen Liang 292220c596
chore: apply pep8-naming rules for naming convention (#8261) 1 year ago
HowardChan 53f37a6704
fix:ollama text embedding 500 error (#8252) 1 year ago
Nam Vu 342607f4a4
fix: truthy value (#8208) 1 year ago
HowardChan 82c42b9ec5
fix:error when adding the ollama embedding model (#8236)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
Bowen Liang 2cf1187b32
chore(api/core): apply ruff reformatting (#7624) 1 year ago
takatost dabfd74622
feat: Parallel Execution of Nodes in Workflows (#8192)
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
Co-authored-by: Yi <yxiaoisme@gmail.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
Jyong 2d690801d1
nvidia rerank top n missed (#8185) 1 year ago
-LAN- 4313d92e6b
feat(api/core/model_runtime/entities/defaults.py): Add TOP_K in default parameters. (#8167) 1 year ago
crazywoola 0bec6a037c
update qwen-long (#8157) 1 year ago
AAEE86 fa34b9aed6
Modify model parameters in Spark LLMs and zhipuai LLMs (#8078)
Co-authored-by: Charlie.Wei <luowei@cvte.com>
1 year ago
crazywoola a27d4d58ec
fix: ollama text embedding 500 error (#8131) 1 year ago
邹成卓 a15791e788
Fix: tongyi code wrapper works not stable (#7871)
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
ybalbert001 954580a4af
feat: support more model types and builtin tools on aws/sagemaker (#8061)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
1 year ago
crazywoola ab7d79275e
fix: Claude can not validate credientials (#8109) 1 year ago
呆萌闷油瓶 d28446301f
feat:add fishaudio in xinference (#8100) 1 year ago
Nam Vu 2d7954c7da
Fix variable typo (#8084) 1 year ago
AAEE86 0cef25ef8c
Revert "fix: parameter rule" (#8070) 1 year ago
crazywoola 900fd82a92
fix: parameter rule (#8064) 1 year ago
tmuife 89aede80cc
Add OCI(Oracle Cloud Infrastructure) Generative AI Service as a Model Provider (#7775)
Co-authored-by: Walter Jin <jinshuhaicc@gmail.com>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: walter from vm <walter.jin@oracle.com>
1 year ago
Leng Yue bd0992275c
feat: support fish audio TTS (#7982) 1 year ago
非法操作 3e7597f2bd
feat: add gpt-4o-2024-08-06 and json_schema for azure openAI service (#7648) 1 year ago
wochuideng f6b9982c23
Concurrent calls to the Wenxin model, and the exception problem when obtaining the token is fixed (#7976)
Co-authored-by: puqs1 <puqs1@lenovo.com>
1 year ago
非法操作 0f72a8e89d
chore: refactor the beichuan model (#7953) 1 year ago
呆萌闷油瓶 83494cb4f5
fix:empty voice occurs when xinference CosyVoice tts model (#7958) 1 year ago
orangeclk 3f2a806abe
fix: glm models prices and max_tokens correction (#7882) 1 year ago
sino 1f56a20b62
feat: support auth by api key for ark provider (#7845) 1 year ago
非法操作 dc015c380a
feat: add zhipu glm_4_plus and glm_4v_plus model (#7824) 1 year ago
hisir f0273f00e1
Fixed when testing the openai compatible interface model, an error is reported when no object is returned (#7808) 1 year ago
sino 7cfebffbb8
chore: update default endpoint for ark provider (#7741) 1 year ago
crazywoola da326baa5e
fix: tongyi Error: 'NoneType' object is not subscriptable (#7705) 1 year ago
sino ee7d5e7206
feat: support Moonshot and GLM models tool call for volc ark provider (#7666) 1 year ago
Hélio Lúcio 7b7576ad55
Add Azure AI Studio as provider (#7549)
Co-authored-by: Hélio Lúcio <canais.hlucio@voegol.com.br>
1 year ago
代君 7c2bb31a55
[fix] openai's tool role dose not support name parameter. (#7659) 1 year ago
Seayon 561a61e7fe
Improve MIME type detection for image URLs (#6531)
Co-authored-by: seayon <zhaoxuyang@shouqianba.com>
1 year ago
sino efc136cce5
feat: Introduce Ark SDK v3 and ensure compatibility with models of SDK v2 (#7579)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
噢哎哟喂 ad13011043
add JSON Mode support for moonshot models (#7568) 1 year ago
Fei He 6025002971
add qwen text-embedding-v3 support. (#7567) 1 year ago
orangeclk a24717765e
feat: forward zhipu finish_reason (#7560) 1 year ago
orangeclk f53454f81d
add finish_reason to the LLM node output (#7498) 1 year ago
非法操作 f7af8c7cc7
feat: gpt-4o-mini-2024-07-18 support json schema (#7489) 1 year ago
Xiyuan Chen 4e7b6aec3a
feat: support pinning, including, and excluding for model providers and tools (#7419)
Co-authored-by: GareArc <chen4851@purude.edu>
1 year ago
Nam Vu 6991a243aa
chore: correct _tts_invoke_streaming max length (#7423) 1 year ago
Chengyu Yan 1f944c6eeb
feat(api): support wenxin bge-large and tao embedding model. (#7393) 1 year ago
Xiao Ley 53cf756207
feat: OpenRouter add gpt-4o-2024-08-06 model (#7409) 1 year ago
-LAN- 0087afc2e3
fix(api/core/model_runtime/model_providers/__base/large_language_model.py): Add TEXT type checker (#7407) 1 year ago
SoaringEthan acd72e3ab2
feat: support xinference's auth system (#7369) 1 year ago
Chengyu Yan bfd905602f
feat(api): support wenxin text embedding (#7377) 1 year ago
sino a0a67873aa
chore: optimize ark model parameters (#7378) 1 year ago
噢哎哟喂 baaa3f7f42
add base url for moonshot model (#7360) 1 year ago
Weaxs 3a33062405
feat: support siliconflow rerank (#7337) 1 year ago
Xiyuan Chen c7df6783df
Revert "feat: support pinning, including, and excluding for Model Providers and Tools" (#7324) 1 year ago
噢哎哟喂 6fdbc7dbf3
fix error when use farui-plus model (#7316)
Co-authored-by: 雪风 <xuefeng@shifaedu.cn>
1 year ago
Hongbin d1a6702aa4
Update PerfXCloud Model List (#7212)
Co-authored-by: xhb <466010723@qq.com>
1 year ago
Xiyuan Chen 7619850855
feat: support pinning, including, and excluding for Model Providers and Tools (#7283) 1 year ago
非法操作 6ff7fd80a1
feat: support OPENAI json_schema (#7258) 1 year ago
非法操作 5aa373dc04
feat: add chatgpt-4o-latest (#7289) 1 year ago
Xiyuan Chen d29b32fce2
fix: typo in upstage/llm/_position.yaml (#7286) 1 year ago
噢哎哟喂 52383d0161
add support for tongyi-farui (#7248)
Co-authored-by: 雪风 <xuefeng@shifaedu.cn>
1 year ago
Onelevenvy 0f59d76997
fix: add context_size and max_chunks to Tongyi embedding to resolve issue #7189 (#7227) 2 years ago
shAlfred a12ddc47e7
feat: add support of speech2text function for OpenAI-API-compatible and Siliconflow (#7197) 2 years ago
Weaxs 67b9fdaad7
siliconflow support bge-3 && bce-v1 embedding (#7198) 2 years ago
ybalbert001 f2cb1fb09f
Fix : Workflow "start" paste url not support s3 pre-signed URL (#6855)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2 years ago
Yanyi Liu 5b32f2e0dd
Feat: Add model provider Text Embedding Inference for embedding and rerank (#7132) 2 years ago
Yanyi Liu 4cbeb6815b
Fix: Wrong cutoff length lead to empty input in openai compatible embedding model. (#7133) 2 years ago
forrestlinfeng 07511dfaf4
update stepfun model (#7118)
Co-authored-by: chenlinfeng <chenlinfeng@step.ai>
Co-authored-by: Tfsh <tianfs_fight@163.com>
2 years ago
小羽 7944ce0594
feat: wenxin add yi-34b-chat (#7117) 2 years ago
orangeclk 83acb53c08
feat: add zhipu embedding-3 (#7100) 2 years ago
shAlfred a7162240e6
feat: add text-embedding functon and LLM models to Siliconflow (#7090) 2 years ago
小羽 34a9dbe826
Feat/add 360-zhinao provider (#7069) 2 years ago
orangeclk f288d367ac
Add price info for zhipu models (#7084) 2 years ago
Waffle 5e2fa13126
feat: support glm-4-long (#7070)
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2 years ago
Joe d7bb422a5c
fix: hunyuan assistant_prompt_message pydantic error (#7062) 2 years ago
majian 99b78dd198
feat: add gpt-4o-2024-08-06 (#7046) 2 years ago
crazywoola 3516989738
fix: typos in wenxin llm (#7021) 2 years ago
Sa Zhang 26991443ed
fix: Fix incorrect context size for jina-reranker-v2 model (#7006) 2 years ago
Yefori bd3ed89516
feat: add function calling for deepseek models (#6990) 2 years ago
小羽 23ed15d19f
feat:nvidia add nemotron4-340b and microsoft/phi-3 (#6973) 2 years ago
takatost 6da14c2d48
security: fix api image security issues (#6971) 2 years ago
Pedro Gomes a34285196b
Revise the wrong pricing of certain LLM models. (#6967) 2 years ago
takatost ea30174057
chore: optimize streaming tts of xinference (#6966) 2 years ago
liuzhenghua 141e4e0276
fix: restore xinference secret field (#6941)
Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com>
2 years ago
Weaxs 5e634a59a2
compatible xinference reranker server (#6927) 2 years ago
JuHyung Son 2e941bb91c
add new provider Solar (#6884) 2 years ago
sino 8166a8caf5
feat: update llama3.1 parameters for openrouter (#6901) 2 years ago
灰灰 56af1a0adf
pref: change ollama embedded api request (#6876) 2 years ago
dufei f8617db012
fix tongyi tool calls (#6896) 2 years ago
Weaxs cc4785f094
fix: xinference reranker return_documents (#6888) 2 years ago
chenxu9741 a9cd6df97e
Remove tts (blocking call) (#6869) 2 years ago
呆萌闷油瓶 f31142e758
Azure 4o mini options (#6873) 2 years ago
crazywoola 792f908afb
Revert "feat:Azure gpt4o mini" (#6870) 2 years ago
呆萌闷油瓶 14367ddc09
feat:Azure gpt4o mini (#6866) 2 years ago
Charlie.Wei cbf7f21ade
Add azure gpt4omini (#6862)
Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2 years ago
Weaxs f6e8e120a1
support xinference tts (#6746) 2 years ago