Commit Graph

585 Commits (95dc90e6b2ba50eb9795b7879fc9e1bfe8897959)

Author SHA1 Message Date
zhuhao f97607370a
refactor: update Callback to an abstract class (#8868) 1 year ago
zhuhao 850492dafa
feat: deprecate gte-Qwen2-7B-instruct embedding model (#8866) 1 year ago
zhuhao 61c89a9168
feat: add internlm2.5-20b and qwen2.5-coder-7b model (#8862) 2 years ago
zhuhao 6cd22f3bca
fix: update qwen2.5-coder-7b model name (#8861) 2 years ago
CXwudi 0603359e2d
fix: delete harm catalog settings for gemini (#8829) 2 years ago
HowardChan bb781764b8
Add Llama3.2 models in Groq provider (#8831) 2 years ago
zhuhao 29275c7447
feat: deprecate mistral model for siliconflow (#8828) 2 years ago
CXwudi e5efd09ebb
chore: massive update of the Gemini models based on latest documentation (#8822) 2 years ago
wenmeng zhou ecc951609d
add more detailed doc for models of qwen series (#8799)
Co-authored-by: crazywoola <427733928@qq.com>
2 years ago
ice yao 063474f408
Add llama3.2 model in fireworks provider (#8809) 2 years ago
AAEE86 9a4b53a212
feat: add stream for Gemini (#8678) 2 years ago
AAEE86 03edfbe6f5
feat: add qwen to add custom model parameters (#8759) 2 years ago
cx 128a66f7fe
fix: Ollama modelfeature set vision, and an exception occurred at the… (#8783) 2 years ago
Shenghang Tsai a0b0809b1c
Add more models for SiliconFlow (#8779) 2 years ago
Aaron Ji 4c9ef6e830
fix: update usage for Jina Embeddings v3 (#8771) 2 years ago
zhuhao ac73763726
chore: add input_type param desc for the _invoke method of text_embedding (#8778) 2 years ago
Pan, Wen-Ming 02ff6cca70
feat: add support for Vertex AI Gemini 1.5 002 and experimental models (#8767) 2 years ago
cherryhuahua d0e0111f88
fix:Spark's large language model token calculation error #7911 (#8755) 2 years ago
ybalbert001 68c7e68a8a
Fix Issue: switch LLM of SageMaker endpoint doesn't take effect (#8737)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2 years ago
ice yao 91f70d0bd9
Add embedding models in fireworks provider (#8728) 2 years ago
Jyong 4669eb24be
add embedding input type parameter (#8724) 2 years ago
Shota Totsuka 1c7877b048
fix: remove harm category setting from vertex ai (#8721) 2 years ago
ice yao 64baedb484
fix: update nomic model provider token calculation (#8705) 2 years ago
Benjamin 4638f99aaa
fix: change model provider name issue Ref #8691 (#8710) 2 years ago
AAEE86 aebe5fc68c
fix: Remove unsupported parameters in qwen model (#8699) 2 years ago
zhuhao 1ecf70dca0
feat: add mixedbread as a new model provider (#8523) 2 years ago
ybalbert001 7c485f8bb8
fix llm integration problem: It doesn't work on docker env (#8701)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2 years ago
Sa Zhang 7f1b028840
fix: change the brand name to Jina AI (#8691)
Co-authored-by: sa zhang <sa.zhang@jina.ai>
2 years ago
Nam Vu bef83a4d2e
fix: typos and improve naming conventions: (#8687) 2 years ago
ice yao d7aada38a1
Add nomic embedding model provider (#8640) 2 years ago
AAEE86 a126d535cf
add Spark Max-32K (#8676) 2 years ago
AAEE86 3554a803e7
add zhipuai web search (#8668) 2 years ago
AAEE86 c66cecaa55
add Qwen model translate (#8674) 2 years ago
Aaron Ji 3618a97c20
feat: extend api params for Jina Embeddings V3 (#8657) 2 years ago
zhuhao e34f04380d
feat: add deepseek-v2.5 for model provider siliconflow (#8639) 2 years ago
zhuhao 6df77038a2
docs: fix predefined_model_scale_out.md redirect error (#8633) 2 years ago
zhuhao 45c0a44411
feat: add qwen2.5 for model provider siliconflow (#8630) 2 years ago
CXwudi 97895ec41a
chore: add Gemini newest experimental models (close #7121) (#8621) 2 years ago
sino 6d56d5c1f6
feat: support o1 series models for openrouter (#8358) 2 years ago
AAEE86 c9f1e18df1
Add model parameter translation (#8509)
Co-authored-by: swingchen01 <swings@126.com>
Co-authored-by: 陈长君 <chenchangjun@shuwen.com>
2 years ago
Waffle 740fad06c1
feat(tools/cogview): Updated cogview tool to support cogview-3 and the latest cogview-3-plus (#8382) 2 years ago
ice yao 0665268578
Add Fireworks AI as new model provider (#8428) 2 years ago
呆萌闷油瓶 c8b9bdebfe
feat:use xinference tts stream mode (#8616) 2 years ago
AAEE86 1a8dcae10e
add Qwen custom add model interface (#8565) 2 years ago
AAEE86 5ddb601e43
add MixtralAI Model (#8517) 2 years ago
Hongbin 5541248264
Update the PerfXCloud provider model list,Update PerfXCloudProvider validate_provider_credentials method. (#8587)
Co-authored-by: xhb <466010723@qq.com>
2 years ago
Su Yang c87f710d58
Fix: update qwen model and model config (#8584)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
Su Yang 1568c5cae9
fix: fix qwen series model type (#8580) 2 years ago
MuYu a03919c3b3
feat: add hunyuan-vision (#8529) 2 years ago
Su Yang d6de96c4b4
feat: sync Qwen API with Aliyun Bailian (#8538) 2 years ago
Wang Bo 6f222b49f2
refactor: rename task_type to task for jina embeddings v3 (#8488) 2 years ago
-LAN- 8dfe8c773a
chore: Deprecate gpt-3.5-turbo-0613 and gpt-3.5-turbo-16k-0613 models (#8500) 2 years ago
ybalbert001 b6ad7a1e06
Fix: https://github.com/langgenius/dify/issues/8190 (Update Model nam… (#8426)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2 years ago
Aaron Ji 6f7625fa47
chore: update Jina embedding model (#8376) 2 years ago
ybalbert001 b613b11422
Fix: Support Bedrock cross region inference #8190 (Update Model name to distinguish between different region groups) (#8402)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2 years ago
crazywoola 71b4480c4a
fix: o1-mini 65563 -> 65536 (#8388) 2 years ago
Bowen Liang 5b98acde2f
chore: improve usage of striping prefix or suffix of string with Ruff 0.6.5 (#8392) 2 years ago
Bowen Liang a1104ab97e
chore: refurish python code by applying Pylint linter rules (#8322) 2 years ago
xiandan-erizo 1ab81b4972
support hunyuan-turbo (#8372)
Co-authored-by: sunkesi <sunkesi@hosecloud.com>
2 years ago
takatost 24af4b9313
fix: o1-series model encounters an error when the generate mode is blocking (#8363) 2 years ago
Bowen Liang 6613b8f2e0
chore: fix unnecessary string concatation in single line (#8311) 2 years ago
sino a45ac6ab98
fix: ark token usage is none (#8351) 2 years ago
takatost 4637ddaa7f
feat: add o1-series models support in Agent App (ReACT only) (#8350) 2 years ago
takatost e90d3c29ab
feat: add OpenAI o1 series models support (#8328) 2 years ago
Nam Vu 153807f243
fix: response_format label (#8326) 2 years ago
呆萌闷油瓶 02c4b1af71
chore:add Azure openai api version 2024-08-01-preview (#8291) 2 years ago
ybalbert001 d4985fb3aa
Fix: Support Bedrock cross region inference [#8190](https://github.com/langgenius/dify/issues/8190) (#8317) 2 years ago
Bowen Liang 40fb4d16ef
chore: refurbish Python code by applying refurb linter rules (#8296) 2 years ago
Bowen Liang c69f5b07ba
chore: apply ruff E501 line-too-long linter rule (#8275)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
Bowen Liang 0f14873255
chore: cleanup ruff flake8-simplify linter rules (#8286)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
Bowen Liang 781d294f49
chore: cleanup pycodestyle E rules (#8269) 2 years ago
yalei f515af2232
let claude models in bedrock support the response_format parameter (#8220)
Co-authored-by: duyalei <>
2 years ago
crazywoola 4d2cd6703b
chore: remove useless code (#8198) 2 years ago
Bowen Liang 292220c596
chore: apply pep8-naming rules for naming convention (#8261) 2 years ago
HowardChan 53f37a6704
fix:ollama text embedding 500 error (#8252) 2 years ago
Nam Vu 342607f4a4
fix: truthy value (#8208) 2 years ago
HowardChan 82c42b9ec5
fix:error when adding the ollama embedding model (#8236)
Co-authored-by: crazywoola <427733928@qq.com>
2 years ago
Bowen Liang 2cf1187b32
chore(api/core): apply ruff reformatting (#7624) 2 years ago
takatost dabfd74622
feat: Parallel Execution of Nodes in Workflows (#8192)
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
Co-authored-by: Yi <yxiaoisme@gmail.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
Jyong 2d690801d1
nvidia rerank top n missed (#8185) 2 years ago
-LAN- 4313d92e6b
feat(api/core/model_runtime/entities/defaults.py): Add TOP_K in default parameters. (#8167) 2 years ago
crazywoola 0bec6a037c
update qwen-long (#8157) 2 years ago
AAEE86 fa34b9aed6
Modify model parameters in Spark LLMs and zhipuai LLMs (#8078)
Co-authored-by: Charlie.Wei <luowei@cvte.com>
2 years ago
crazywoola a27d4d58ec
fix: ollama text embedding 500 error (#8131) 2 years ago
邹成卓 a15791e788
Fix: tongyi code wrapper works not stable (#7871)
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: crazywoola <427733928@qq.com>
2 years ago
ybalbert001 954580a4af
feat: support more model types and builtin tools on aws/sagemaker (#8061)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2 years ago
crazywoola ab7d79275e
fix: Claude can not validate credientials (#8109) 2 years ago
呆萌闷油瓶 d28446301f
feat:add fishaudio in xinference (#8100) 2 years ago
Nam Vu 2d7954c7da
Fix variable typo (#8084) 2 years ago
AAEE86 0cef25ef8c
Revert "fix: parameter rule" (#8070) 2 years ago
crazywoola 900fd82a92
fix: parameter rule (#8064) 2 years ago
tmuife 89aede80cc
Add OCI(Oracle Cloud Infrastructure) Generative AI Service as a Model Provider (#7775)
Co-authored-by: Walter Jin <jinshuhaicc@gmail.com>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: walter from vm <walter.jin@oracle.com>
2 years ago
Leng Yue bd0992275c
feat: support fish audio TTS (#7982) 2 years ago
非法操作 3e7597f2bd
feat: add gpt-4o-2024-08-06 and json_schema for azure openAI service (#7648) 2 years ago
wochuideng f6b9982c23
Concurrent calls to the Wenxin model, and the exception problem when obtaining the token is fixed (#7976)
Co-authored-by: puqs1 <puqs1@lenovo.com>
2 years ago
非法操作 0f72a8e89d
chore: refactor the beichuan model (#7953) 2 years ago
呆萌闷油瓶 83494cb4f5
fix:empty voice occurs when xinference CosyVoice tts model (#7958) 2 years ago
orangeclk 3f2a806abe
fix: glm models prices and max_tokens correction (#7882) 2 years ago
sino 1f56a20b62
feat: support auth by api key for ark provider (#7845) 2 years ago
非法操作 dc015c380a
feat: add zhipu glm_4_plus and glm_4v_plus model (#7824) 2 years ago