Commit Graph

365 Commits (8eb0d0fdddfe5def6ab4efb9e53ede2ebcc2d435)

Author SHA1 Message Date
呆萌闷油瓶 68ac433218
feat: add support Spark4.0 (#5688) 2 years ago
Kevin b3d6726f65
Feature/add qwen llm (#5659) 2 years ago
liuzhenghua 2b080b5cfc
feature: Add presence_penalty and frequency_penalty parameters to the … (#5637)
Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com>
2 years ago
takatost 3ccad33194
feat: add jina new pre-defined rerankers, include: jina-reranker-v2 (#5657) 2 years ago
sunxichen bafc8a0bde
fix: tool call message role according to credentials (#5625)
Co-authored-by: sunxichen <sun.xc@digitalcnzz.com>
2 years ago
Bowen Liang dcb72e0067
chore: apply flake8-comprehensions Ruff rules to improve collection comprehensions (#5652)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
Joe 4e2de638af
feat: add ops trace (#5483)
Co-authored-by: takatost <takatost@gmail.com>
2 years ago
sino 877a2c144b
feat: support predefined models for openrouter (#5494) 2 years ago
-LAN- ba67206bb9
fix(api/model_runtime/azure/llm): Switch to tool_call. (#5541) 2 years ago
vccler 48757e581e
fix: zhipu tool calling, this PR fixes the bug described in issue #5496 (#5469)
Co-authored-by: vccler <vccler@163.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
LXM e8ad0339a3
fix: tongyi json output (#5396) 2 years ago
crazywoola 91d38a535f
fix: max_tokens of qwen-plus & qwen-plus-chat (#5480) 2 years ago
Pan, Wen-Ming 95c882934e
feat: add support for Vertex AI claude-3-5-sonnet@20240620 (#5475)
Co-authored-by: Wenming Pan <pwm@google.com>
2 years ago
Su Yang 26b6fd2236
feat: add support for bedrock claude-3-5-sonnet-20240620 (#5461) 2 years ago
takatost ff0f02d809
feat: add support for claude-3-5-sonnet-20240620 (#5452) 2 years ago
-LAN- 142dc0afd7
refactor: Remove unused code in large_language_model.py (#5433) 2 years ago
-LAN- 23fa3dedc4
fix(core): Fix incorrect type hints. (#5427) 2 years ago
Ikko Eltociear Ashimine 8266842809
chore: update llm.py (#5335) 2 years ago
Richards Tu c163521b9e
Update and fix the model param of Deepseek (#5329) 2 years ago
Justin Wu 61f4f08744
Add bedrock command r models (#4521)
Co-authored-by: Justin Wu <justin.wu@ringcentral.com>
Co-authored-by: Chenhe Gu <guchenhe@gmail.com>
2 years ago
-LAN- 5a99aeb864
fix(core): Reorder `field_validator` and `classmethod` to fit Pydantic V2. (#5257) 2 years ago
crazywoola 9a64aa76c1
fix: typo and check (#5287) 2 years ago
Pan, Wen-Ming 4b54843ed7
fix: run agent with Vertex AI Gemini models (#5260)
Co-authored-by: Wenming Pan <pwm@google.com>
2 years ago
kurokobo 2e842333b1
fix: correct typos in the icons for microsoft (#5243) 2 years ago
Masashi Tomooka d9bee03ff6
fix: embedding job fails using IAM role (#5252) 2 years ago
Jyong ba5f8afaa8
Feat/firecrawl data source (#5232)
Co-authored-by: Nicolas <nicolascamara29@gmail.com>
Co-authored-by: chenhe <guchenhe@gmail.com>
Co-authored-by: takatost <takatost@gmail.com>
2 years ago
Bin 0f35d07052
support ERNIE-4.0-8K-Latest (#5216) 2 years ago
-LAN- 7f44e88eda
fix(model_providers/ollama): Fix OllamaLargeLanguageModel to correctly set the stop option (#5217) 2 years ago
Jason b7ff765d8d
Add novita.ai as model provider (#4961) 2 years ago
Masashi Tomooka 0633aae7dc
feat: allow to use IAM Role for Bedrock (#5188) 2 years ago
takatost 415022aa14
fix: pydantic2 error (#5172) 2 years ago
rerorero b85ae146a7
fix: JSON mode with an image doesn't work for Gemini (#5169) 2 years ago
Pan, Wen-Ming f13af5a811
fix(model_providers/vertex_ai): Vertex AI Anthropic models authentication failed (#4971) 2 years ago
Bowen Liang f976740b57
improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 2 years ago
kurokobo e61f5d029a
chore(docs): fix minor small typos (#5124) 2 years ago
sino 8210637bc5
feat: support jina-clip-v1 embedding model (#5146) 2 years ago
呆萌闷油瓶 790543131a
chore:add some new api version for azure openai (#5142) 2 years ago
yanghx adc948e87c
fix(api/core/model_runtime/model_providers/baichuan,localai): Parse ToolPromptMessage. #4943 (#5138)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
orangeclk 79e8489942
feat: support siliconflow (#5129) 2 years ago
xielong ea69dc2a7e
feat: support hunyuan llm models (#5013)
Co-authored-by: takatost <takatost@users.noreply.github.com>
Co-authored-by: Bowen Liang <bowenliang@apache.org>
2 years ago
Pika ecc7f130b4
fix(typo): misspelling (#5094) 2 years ago
sino 0ce97e6315
feat: support doubao llm function calling (#5100) 2 years ago
rerorero 28997772a5
fix: remote_url doesn't work for gemini (#5090) 2 years ago
orangeclk 2050a8b8f0
feat: add glm4 new models and zhipu embedding-2 (#5089) 2 years ago
sino 5f870ac950
chore: update maas model provider description (#5056) 2 years ago
Jaxon Ley 2573b138bf
fix: update presence_penalty configuration for wenxin AI ernie-4.0-8k and ernie-3.5-8k models (#5039) 2 years ago
takatost 3929d289e0
feat: set default memory messages limit to infinite (#5002) 2 years ago
Joe 5cdb95be1f
fix: gemini timeout error (#4955) 2 years ago
Bowen Liang f32b440c4a
chore: fix indention violations by applying E111 to E117 ruff rules (#4925) 2 years ago
takatost f44d1e62d2
fix: bedrock get_num_tokens prompt_messages parameter name err (#4932) 2 years ago
takatost d1dbbc1e33
feat: backend model load balancing support (#4927) 2 years ago
Pan, Wen-Ming b98a1a3303
feat: added Anthropic Claude3 models to Google Cloud Vertex AI (#4870)
Co-authored-by: pwm <pwm@google.com>
2 years ago
takatost 696c5308a9
chore: optimize nvidia nim credential schema and info (#4898) 2 years ago
Joshua 3c8a120e51
add-nvidia-mim (#4882) 2 years ago
Pan, Wen-Ming cdbc260571
Bugfix: Vertex AI vision model not support image (#4853) 2 years ago
Yash Parmar e0da0744b5
add: ollama keep alive parameter added. issue #4024 (#4655) 2 years ago
Weaxs b189faca52
feat: update ernie model (#4756) 2 years ago
xielong e1cd9aef8f
feat: support baichuan3 turbo, baichuan3 turbo 128k, and baichuan4 (#4762) 2 years ago
crazywoola 705a6e3a8e
Fix/4742 ollama num gpu option not consistent with allowed values (#4751) 2 years ago
xielong 793f0c1dd6
fix: Corrected schema link in model_runtime's README.md (#4757) 2 years ago
xielong 88b4d69278
fix: Correct context size for banchuan2-53b and banchuan2-turbo (#4721) 2 years ago
crazywoola 27dae156db
fix: colon in file mistral.mistral-small-2402-v1:0 (#4673) 2 years ago
Giovanny Gutiérrez 2deb23e00e
fix: Show rerank in system for localai (#4652) 2 years ago
longzhihun fe9bf5fc4a
[seanguo] add support of amazon titan v2 and modify the price of amazon titan v1 (#4643)
Co-authored-by: Chenhe Gu <guchenhe@gmail.com>
2 years ago
miendinh f804adbff3
feat: Support for Vertex AI - load Default Application Configuration (#4641)
Co-authored-by: miendinh <miendinh@users.noreply.github.com>
Co-authored-by: crazywoola <427733928@qq.com>
2 years ago
Krasus.Chen f156014daa
update lite8k/speed8k/128k max_token to newest (#4636)
Co-authored-by: Your Name <chen@krasus.red>
2 years ago
Bowen Liang 3fda2245a4
improve: extract method for safe loading yaml file and avoid using PyYaml's FullLoader (#4031) 2 years ago
Patryk Garstecki 296887754f
Support for Vertex AI (#4586) 2 years ago
QuietRocket 9ae72cdcf4
feat: Add Gemini Flash (#4616) 2 years ago
takatost 11642192d1
chore: add https://api.openai.com placeholder in OpenAI api base (#4604) 2 years ago
呆萌闷油瓶 e57bdd4e58
chore:update gpt-3.5-turbo and gpt-4-turbo parameter for azure (#4596) 2 years ago
somethingwentwell 461488e9bf
Add Azure OpenAI API version for GPT4o support (#4569)
Co-authored-by: wwwc <wwwc@outlook.com>
2 years ago
Justin Wu 3ab19be9ea
Fix bedrock claude wrong pricing (#4572)
Co-authored-by: Justin Wu <justin.wu@ringcentral.com>
2 years ago
呆萌闷油瓶 d5a33a0323
feat:add gpt-4o for azure (#4568) 2 years ago
Bowen Liang e8e213ad1e
chore: apply and fix flake8-bugbear lint rules (#4496) 2 years ago
Ever 4086f5051c
feat:Provide parameter config for mask_sensitive_info of MiniMax mode… (#4294)
Co-authored-by: 老潮 <zhangyongsheng@3vjia.com>
Co-authored-by: takatost <takatost@users.noreply.github.com>
Co-authored-by: takatost <takatost@gmail.com>
2 years ago
fanghongtai 1cca100a48
fix:modify spelling errors: lanuage ->language in schema.md (#4499)
Co-authored-by: wxfanghongtai <wxfanghongtai@gf.com.cn>
2 years ago
Bowen Liang 04ad46dd31
chore: skip unnecessary key checks prior to accessing a dictionary (#4497) 2 years ago
Yeuoly 091fba74cb
enhance: claude stream tool call (#4469) 2 years ago
jiaqianjing 0ac5d621b6
add llm: ernie-character-8k of wenxin (#4448) 2 years ago
sino 6e9066ebf4
feat: support doubao llm and embeding models (#4431) 2 years ago
Yash Parmar 332baca538
FIX: fix the temperature value of ollama model (#4027) 2 years ago
Yeuoly e8311357ff
feat: gpt-4o (#4346) 2 years ago
orangeclk ece0f08a2b
add yi models (#4335)
Co-authored-by: 陈力坤 <likunchen@caixin.com>
2 years ago
Weaxs 8cc492721b
fix: minimax streaming function_call message (#4271) 2 years ago
Joshua a80fe20456
add-some-new-models-hosted-on-nvidia (#4303) 2 years ago
呆萌闷油瓶 4796f9d914
feat:add gpt-4-turbo for azure (#4287) 2 years ago
Sebastian.W a588df4371
Add rerank model type for LocalAI provider (#3952) 2 years ago
Bowen Liang 228de1f12a
fix: miss usage of os.path.join for URL assembly and add tests on yarl (#4224) 2 years ago
sino 4aa21242b6
feat: add volcengine maas model provider (#4142) 2 years ago
Yong723 8ce93faf08
Typo on deepseek.yaml and yi.yaml (#4170) 2 years ago
Su Yang 9f440c11e0
feat: DeepSeek (#4162) 2 years ago
Joshua 58bd5627bf
Add-Deepseek (#4157) 2 years ago
Moonlit 2fdd64c1b5
feat: add proxy configuration for Cohere model (#4152) 2 years ago
VoidIsVoid 543a00e597
feat: update model_provider jina to support custom url and model (#4110)
Co-authored-by: Gimling <huangjl@ruyi.ai>
Co-authored-by: takatost <takatost@gmail.com>
2 years ago
Minamiyama f361c7004d
feat: support vision models from xinference (#4094)
Co-authored-by: Yeuoly <admin@srmxy.cn>
2 years ago
Tomy bb7c62777d
Add support for local ai speech to text (#3921)
Co-authored-by: Yeuoly <admin@srmxy.cn>
2 years ago
Charlie.Wei 087b7a6607
azure_openai add gpt-4-turbo-2024-04-09 model (#4144)
Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2 years ago
Weaxs 6f1911533c
bug fix: update minimax model_apis (#4116) 2 years ago
Yeuoly d5d8b98d82
feat: support openai stream usage (#4140) 2 years ago