Commit Graph

785 Commits (903cf3334298b07efe552dbc9e0656940586f20f)

Author SHA1 Message Date
-LAN- 32588f562e
feat(model): fix and re-add gpt-4.1.
Signed-off-by: -LAN- <laipz8200@outlook.com>
10 months ago
Zixuan Cheng 4fa3d78ed8
Revert "feat : add GPT4.1 in the model providers" (#19002) 11 months ago
-LAN- 559ab46ee1
fix: Removes redundant token calculations and updates dependencies
Eliminates unnecessary pre-calculation of token limits and recalculation of max tokens
across multiple app runners, simplifying the logic for prompt handling.

Updates tiktoken library from version 0.8.0 to 0.9.0 for improved tokenization performance.

Increases default token limit in TokenBufferMemory to accommodate larger prompt messages.

These changes streamline the token management process and leverage the latest
improvements in the tiktoken library.

Fixes potential token overflow issues and prepares the system for handling larger
inputs more efficiently.

Relates to internal optimization tasks.

Signed-off-by: -LAN- <laipz8200@outlook.com>
11 months ago
Zixuan Cheng 144f9507f8
feat : add GPT4.1 in the model providers (#18912) 11 months ago
kelvintsim 2e097a1ac0
add bedrock deepseek-r1 (#18908) 11 months ago
kelvintsim 024f242251
add bedrock claude-sonnet-3.7 (#18788) 11 months ago
kautsar_masuara b26e20fe34
fix: fix vertex gemini 2.0 flash 001 schema (#18405)
Co-authored-by: achmad-kautsar <achmad.kautsar@insignia.co.id>
11 months ago
Alexi.F fe1846c437
fix: change gemini-2.0-flash to validate google api #17082 (#17115) 12 months ago
-LAN- 413dfd5628
feat: add completion mode and context size options for LLM configuration (#13325)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
-LAN- f9515901cc
fix: Azure AI Foundry model cannot be used in the workflow (#13323)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
呆萌闷油瓶 3f42fabff8
chore:improve thinking display for llm from xinference and ollama pro… (#13318) 1 year ago
-LAN- 1caa578771
chore(*): Update style of thinking (#13319)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
非法操作 3eb3db0663
chore: refactor the OpenAICompatible and improve thinking display (#13299) 1 year ago
sino 6e5c915f96
feat(model): add deepseek-r1 for openrouter (#13312) 1 year ago
Riddhimaan-Senapati 2348abe4bf
feat: added a couple of models not defined in vertex ai, that were already … (#13296) 1 year ago
呆萌闷油瓶 f7e7a399d9
feat:add think tag display for xinference deepseek r1 (#13291) 1 year ago
zhu-an 16865d43a8
feat: add deepseek models for volcengine provider (#13283)
Co-authored-by: zhaoqingyu.1075 <zhaoqingyu.1075@bytedance.com>
1 year ago
呆萌闷油瓶 0d13aee15c
feat:add deepseek r1 think display for ollama provider (#13272) 1 year ago
engchina 40dd63ecef
Upgrade oracle models (#13174)
Co-authored-by: engchina <atjapan2015@gmail.com>
1 year ago
-LAN- 6d66d6da15
feat(model_providers): Support deepseek-r1 for Nvidia Catalog (#13269)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
-LAN- 87763fc234
feat(model_providers): Support deepseek for Azure AI Foundry (#13267)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
JasonVV f6c44cae2e
feat(model): add gemini-2.0 model (#13266) 1 year ago
xhe da2ee04fce
fix: correct linewrap think display in generic openai api (#13260)
Signed-off-by: xhe <xw897002528@gmail.com>
1 year ago
JasonVV 7673c36af3
feat(model): add gemini-2.0-flash-thinking-exp-01-21 (#13230) 1 year ago
Riddhimaan-Senapati 9457b2af2f
feat: added models :gemini 2.0 flash 001 and gemini 2.0 pro exp 02-05 (#13247) 1 year ago
k-zaku 7203991032
feat: add parameter "reasoning_effort" and Openai o3-mini (#13243) 1 year ago
xhe 5a685f7156
feat: add think display for volcengine and generic openapi (#13234)
Signed-off-by: xhe <xw897002528@gmail.com>
1 year ago
Riddhimaan-Senapati a6a25030ad
fix: updated _position.yaml to include the latest model already integ… (#13245) 1 year ago
Riddhimaan-Senapati 00458a31d5
feat: added deepseek r1 and v3 to siliconflow (#13238) 1 year ago
-LAN- c6ddf6d6cc
feat(model_providers): Add Groq DeepSeek-R1-Distill-Llama-70b (#13229)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
Joshbly 34b21b3065
feat: Add o3-mini and o3-mini-2025-01-31 model variants (#13129)
Co-authored-by: crazywoola <427733928@qq.com>
1 year ago
-LAN- 59ca44f493
chore(model_runtime): Move deepseek ahead in the providers list. (#13197)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
MaFee921 1a2523fd15
feat: bedrock_endpoint_url (#12838) 1 year ago
Kei YAMAZAKI 7452032d81
add azure openai api version 2024-12-01-preview (#13135) 1 year ago
非法操作 840729afa5
feat: the think tag display of siliconflow's deepseek r1 (#13153) 1 year ago
Yingchun Lai b09c39c8dc
refactor: avoid using extra space when finding model by name (#13043) 1 year ago
heyszt b4b09ddc3c
add tongyi qwen2.5-14b/7b-instruct-1m model (#13089) 1 year ago
Yingchun Lai d44882c1b5
refactor: reduce duplicate code by inheritance (#13073) 1 year ago
Jason 560c5de1b7
Fixed Novita AI color and added DeepSeek R1 model (#13074) 1 year ago
heyszt 6c31ee36cd
fix qwen-vl blocking mode (#13052) 1 year ago
Jason d4be5ef9de
Update Novita AI predefined models (#13045) 1 year ago
非法操作 59b3e672aa
feat: add agent thinking content display of deepseek R1 (#12949) 1 year ago
IWAI, Masaharu a2f8bce8f5
chore: add Japanese translation: model_providers/bedrock (#13016) 1 year ago
IWAI, Masaharu 28067640b5
fix: wrong zh_Hans translation: Ohio (#13006) 1 year ago
lowell da67916843
feat: add glm-4-air-0111 (#12997)
Co-authored-by: lowell <lowell.hu@zkteco.in>
1 year ago
sino d167d5b1be
feat(ark): support doubao 1.5 series of models (#12935) 1 year ago
jiandanfeng e23f4b0265
feat: add gemini-2.0-flash-thinking-exp-01-21 (#12924) 1 year ago
luckylhb90 3d1ce4c53f
bug: fixed bedrock rerank bug (#12774)
Co-authored-by: hobo.l <hobo.l@binance.com>
1 year ago
k-zaku 46e95e8309
fix: OpenAI o1 Bad Request Error (#12839) 1 year ago
JasonVV a7b9375877
Update deepseek model configuration (#12899) 1 year ago