zhuhao
850492dafa
feat: deprecate gte-Qwen2-7B-instruct embedding model ( #8866 )
2024-09-28 21:40:27 +08:00
zhuhao
61c89a9168
feat: add internlm2.5-20b and qwen2.5-coder-7b model ( #8862 )
2024-09-28 16:31:02 +08:00
zhuhao
6cd22f3bca
fix: update qwen2.5-coder-7b model name ( #8861 )
2024-09-28 15:01:27 +08:00
CXwudi
0603359e2d
fix: delete harm catalog settings for gemini ( #8829 )
2024-09-27 13:49:03 +08:00
HowardChan
bb781764b8
Add Llama3.2 models in Groq provider ( #8831 )
2024-09-27 12:13:00 +08:00
zhuhao
29275c7447
feat: deprecate mistral model for siliconflow ( #8828 )
2024-09-27 12:11:56 +08:00
CXwudi
e5efd09ebb
chore: massive update of the Gemini models based on latest documentation ( #8822 )
2024-09-27 09:14:33 +08:00
wenmeng zhou
ecc951609d
add more detailed doc for models of qwen series ( #8799 )
...
Co-authored-by: crazywoola <427733928@qq.com>
2024-09-26 22:32:33 +08:00
ice yao
063474f408
Add llama3.2 model in fireworks provider ( #8809 )
2024-09-26 22:21:01 +08:00
AAEE86
9a4b53a212
feat: add stream for Gemini ( #8678 )
2024-09-26 19:08:59 +08:00
AAEE86
03edfbe6f5
feat: add qwen to add custom model parameters ( #8759 )
2024-09-26 19:04:25 +08:00
cx
128a66f7fe
fix: Ollama modelfeature set vision, and an exception occurred at the… ( #8783 )
2024-09-26 16:34:40 +08:00
Shenghang Tsai
a0b0809b1c
Add more models for SiliconFlow ( #8779 )
2024-09-26 11:29:53 +08:00
Aaron Ji
4c9ef6e830
fix: update usage for Jina Embeddings v3 ( #8771 )
2024-09-26 11:29:35 +08:00
zhuhao
ac73763726
chore: add input_type param desc for the _invoke method of text_embedding ( #8778 )
2024-09-26 11:23:09 +08:00
Pan, Wen-Ming
02ff6cca70
feat: add support for Vertex AI Gemini 1.5 002 and experimental models ( #8767 )
2024-09-25 21:27:26 +08:00
cherryhuahua
d0e0111f88
fix:Spark's large language model token calculation error #7911 ( #8755 )
2024-09-25 14:51:42 +08:00
ybalbert001
68c7e68a8a
Fix Issue: switch LLM of SageMaker endpoint doesn't take effect ( #8737 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-25 09:12:35 +08:00
ice yao
91f70d0bd9
Add embedding models in fireworks provider ( #8728 )
2024-09-25 08:47:11 +08:00
Jyong
4669eb24be
add embedding input type parameter ( #8724 )
2024-09-24 21:53:50 +08:00
Shota Totsuka
1c7877b048
fix: remove harm category setting from vertex ai ( #8721 )
2024-09-24 20:53:26 +08:00
ice yao
64baedb484
fix: update nomic model provider token calculation ( #8705 )
2024-09-24 14:04:07 +08:00
Benjamin
4638f99aaa
fix: change model provider name issue Ref #8691 ( #8710 )
2024-09-24 13:26:58 +08:00
AAEE86
aebe5fc68c
fix: Remove unsupported parameters in qwen model ( #8699 )
2024-09-24 13:06:21 +08:00
zhuhao
1ecf70dca0
feat: add mixedbread as a new model provider ( #8523 )
2024-09-24 11:20:15 +08:00
ybalbert001
7c485f8bb8
fix llm integration problem: It doesn't work on docker env ( #8701 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-24 10:33:30 +08:00
Sa Zhang
7f1b028840
fix: change the brand name to Jina AI ( #8691 )
...
Co-authored-by: sa zhang <sa.zhang@jina.ai>
2024-09-23 21:39:26 +08:00
Nam Vu
bef83a4d2e
fix: typos and improve naming conventions: ( #8687 )
2024-09-23 21:32:58 +08:00
ice yao
d7aada38a1
Add nomic embedding model provider ( #8640 )
2024-09-23 19:57:21 +08:00
AAEE86
a126d535cf
add Spark Max-32K ( #8676 )
2024-09-23 16:39:46 +08:00
AAEE86
3554a803e7
add zhipuai web search ( #8668 )
2024-09-23 16:19:42 +08:00
AAEE86
c66cecaa55
add Qwen model translate ( #8674 )
2024-09-23 16:18:55 +08:00
Aaron Ji
3618a97c20
feat: extend api params for Jina Embeddings V3 ( #8657 )
2024-09-23 13:45:09 +08:00
zhuhao
e34f04380d
feat: add deepseek-v2.5 for model provider siliconflow ( #8639 )
2024-09-22 21:44:06 +08:00
zhuhao
6df77038a2
docs: fix predefined_model_scale_out.md redirect error ( #8633 )
2024-09-22 16:45:45 +08:00
zhuhao
45c0a44411
feat: add qwen2.5 for model provider siliconflow ( #8630 )
2024-09-22 16:42:34 +08:00
CXwudi
97895ec41a
chore: add Gemini newest experimental models ( close #7121 ) ( #8621 )
2024-09-22 13:38:08 +08:00
sino
6d56d5c1f6
feat: support o1 series models for openrouter ( #8358 )
2024-09-22 10:23:50 +08:00
AAEE86
c9f1e18df1
Add model parameter translation ( #8509 )
...
Co-authored-by: swingchen01 <swings@126.com>
Co-authored-by: 陈长君 <chenchangjun@shuwen.com>
2024-09-22 10:14:33 +08:00
Waffle
740fad06c1
feat(tools/cogview): Updated cogview tool to support cogview-3 and the latest cogview-3-plus ( #8382 )
2024-09-22 10:14:14 +08:00
ice yao
0665268578
Add Fireworks AI as new model provider ( #8428 )
2024-09-22 10:13:00 +08:00
呆萌闷油瓶
c8b9bdebfe
feat:use xinference tts stream mode ( #8616 )
2024-09-22 10:08:35 +08:00
AAEE86
1a8dcae10e
add Qwen custom add model interface ( #8565 )
2024-09-21 22:52:10 +08:00
AAEE86
5ddb601e43
add MixtralAI Model ( #8517 )
2024-09-21 18:08:07 +08:00
Hongbin
5541248264
Update the PerfXCloud provider model list,Update PerfXCloudProvider validate_provider_credentials method. ( #8587 )
...
Co-authored-by: xhb <466010723@qq.com>
2024-09-21 17:33:15 +08:00
Su Yang
c87f710d58
Fix: update qwen model and model config ( #8584 )
...
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-20 17:05:57 +08:00
Su Yang
1568c5cae9
fix: fix qwen series model type ( #8580 )
2024-09-20 15:29:33 +08:00
MuYu
a03919c3b3
feat: add hunyuan-vision ( #8529 )
2024-09-19 18:08:01 +08:00
Su Yang
d6de96c4b4
feat: sync Qwen API with Aliyun Bailian ( #8538 )
2024-09-19 17:08:59 +08:00
Wang Bo
6f222b49f2
refactor: rename task_type to task for jina embeddings v3 ( #8488 )
2024-09-18 14:53:15 +08:00
-LAN-
8dfe8c773a
chore: Deprecate gpt-3.5-turbo-0613 and gpt-3.5-turbo-16k-0613 models ( #8500 )
2024-09-18 14:38:09 +08:00
ybalbert001
b6ad7a1e06
Fix: https://github.com/langgenius/dify/issues/8190 (Update Model nam… ( #8426 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-14 17:14:18 +08:00
Aaron Ji
6f7625fa47
chore: update Jina embedding model ( #8376 )
2024-09-14 16:21:17 +08:00
ybalbert001
b613b11422
Fix: Support Bedrock cross region inference #8190 (Update Model name to distinguish between different region groups) ( #8402 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-14 11:06:20 +08:00
crazywoola
71b4480c4a
fix: o1-mini 65563 -> 65536 ( #8388 )
2024-09-14 02:39:58 +08:00
Bowen Liang
5b98acde2f
chore: improve usage of stripping prefix or suffix of string with Ruff 0.6.5 ( #8392 )
2024-09-13 23:34:39 +08:00
Bowen Liang
a1104ab97e
chore: refurbish Python code by applying Pylint linter rules ( #8322 )
2024-09-13 22:42:08 +08:00
xiandan-erizo
1ab81b4972
support hunyuan-turbo ( #8372 )
...
Co-authored-by: sunkesi <sunkesi@hosecloud.com>
2024-09-13 20:21:48 +08:00
takatost
24af4b9313
fix: o1-series model encounters an error when the generate mode is blocking ( #8363 )
2024-09-13 15:37:54 +08:00
Bowen Liang
6613b8f2e0
chore: fix unnecessary string concatenation in single line ( #8311 )
2024-09-13 14:24:49 +08:00
sino
a45ac6ab98
fix: ark token usage is none ( #8351 )
2024-09-13 14:19:24 +08:00
takatost
4637ddaa7f
feat: add o1-series models support in Agent App (ReACT only) ( #8350 )
2024-09-13 13:08:27 +08:00
takatost
e90d3c29ab
feat: add OpenAI o1 series models support ( #8328 )
2024-09-13 02:15:19 +08:00
Nam Vu
153807f243
fix: response_format label ( #8326 )
2024-09-12 23:17:29 +08:00
呆萌闷油瓶
02c4b1af71
chore:add Azure openai api version 2024-08-01-preview ( #8291 )
2024-09-12 20:22:57 +08:00
ybalbert001
d4985fb3aa
Fix: Support Bedrock cross region inference [ #8190 ]( https://github.com/langgenius/dify/issues/8190 ) ( #8317 )
2024-09-12 19:15:20 +08:00
Bowen Liang
40fb4d16ef
chore: refurbish Python code by applying refurb linter rules ( #8296 )
2024-09-12 15:50:49 +08:00
Bowen Liang
c69f5b07ba
chore: apply ruff E501 line-too-long linter rule ( #8275 )
...
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-12 14:00:36 +08:00
Bowen Liang
0f14873255
chore: cleanup ruff flake8-simplify linter rules ( #8286 )
...
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-12 12:55:45 +08:00
Bowen Liang
781d294f49
chore: cleanup pycodestyle E rules ( #8269 )
2024-09-11 18:55:00 +08:00
yalei
f515af2232
let claude models in bedrock support the response_format parameter ( #8220 )
...
Co-authored-by: duyalei <>
2024-09-11 18:24:50 +08:00
crazywoola
4d2cd6703b
chore: remove useless code ( #8198 )
2024-09-11 18:19:34 +08:00
Bowen Liang
292220c596
chore: apply pep8-naming rules for naming convention ( #8261 )
2024-09-11 16:40:52 +08:00
HowardChan
53f37a6704
fix:ollama text embedding 500 error ( #8252 )
2024-09-11 16:23:19 +08:00
Nam Vu
342607f4a4
fix: truthy value ( #8208 )
2024-09-11 15:44:53 +08:00
HowardChan
82c42b9ec5
fix:error when adding the ollama embedding model ( #8236 )
...
Co-authored-by: crazywoola <427733928@qq.com>
2024-09-11 10:25:45 +08:00
Bowen Liang
2cf1187b32
chore(api/core): apply ruff reformatting ( #7624 )
2024-09-10 17:00:20 +08:00
takatost
dabfd74622
feat: Parallel Execution of Nodes in Workflows ( #8192 )
...
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
Co-authored-by: Yi <yxiaoisme@gmail.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-10 15:23:16 +08:00
Jyong
2d690801d1
nvidia rerank top n missed ( #8185 )
2024-09-10 13:17:48 +08:00
-LAN-
4313d92e6b
feat(api/core/model_runtime/entities/defaults.py): Add TOP_K in default parameters. ( #8167 )
2024-09-10 09:11:31 +08:00
crazywoola
0bec6a037c
update qwen-long ( #8157 )
2024-09-09 19:09:42 +08:00
AAEE86
fa34b9aed6
Modify model parameters in Spark LLMs and zhipuai LLMs ( #8078 )
...
Co-authored-by: Charlie.Wei <luowei@cvte.com>
2024-09-09 15:36:47 +08:00
crazywoola
a27d4d58ec
fix: ollama text embedding 500 error ( #8131 )
2024-09-09 15:27:49 +08:00
邹成卓
a15791e788
Fix: tongyi code wrapper works not stable ( #7871 )
...
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: crazywoola <427733928@qq.com>
2024-09-09 11:15:17 +08:00
ybalbert001
954580a4af
feat: support more model types and builtin tools on aws/sagemaker ( #8061 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-09 10:34:11 +08:00
crazywoola
ab7d79275e
fix: Claude cannot validate credentials ( #8109 )
2024-09-09 10:22:42 +08:00
呆萌闷油瓶
d28446301f
feat:add fishaudio in xinference ( #8100 )
2024-09-08 23:58:02 +08:00
Nam Vu
2d7954c7da
Fix variable typo ( #8084 )
2024-09-08 13:14:11 +08:00
AAEE86
0cef25ef8c
Revert "fix: parameter rule" ( #8070 )
2024-09-07 10:44:56 +08:00
crazywoola
900fd82a92
fix: parameter rule ( #8064 )
2024-09-06 19:15:24 +08:00
tmuife
89aede80cc
Add OCI(Oracle Cloud Infrastructure) Generative AI Service as a Model Provider ( #7775 )
...
Co-authored-by: Walter Jin <jinshuhaicc@gmail.com>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: walter from vm <walter.jin@oracle.com>
2024-09-06 14:15:40 +08:00
Leng Yue
bd0992275c
feat: support fish audio TTS ( #7982 )
2024-09-05 14:18:39 +08:00
非法操作
3e7597f2bd
feat: add gpt-4o-2024-08-06 and json_schema for azure openAI service ( #7648 )
2024-09-04 21:56:08 +08:00
wochuideng
f6b9982c23
Concurrent calls to the Wenxin model, and the exception problem when obtaining the token is fixed ( #7976 )
...
Co-authored-by: puqs1 <puqs1@lenovo.com>
2024-09-04 21:44:57 +08:00
非法操作
0f72a8e89d
chore: refactor the baichuan model ( #7953 )
2024-09-04 16:22:31 +08:00
呆萌闷油瓶
83494cb4f5
fix:empty voice occurs when xinference CosyVoice tts model ( #7958 )
2024-09-04 13:04:31 +08:00
orangeclk
3f2a806abe
fix: glm models prices and max_tokens correction ( #7882 )
2024-09-02 14:29:09 +08:00
sino
1f56a20b62
feat: support auth by api key for ark provider ( #7845 )
2024-08-31 10:56:32 +08:00
非法操作
dc015c380a
feat: add zhipu glm_4_plus and glm_4v_plus model ( #7824 )
2024-08-30 15:08:31 +08:00
hisir
f0273f00e1
Fixed when testing the openai compatible interface model, an error is reported when no object is returned ( #7808 )
2024-08-29 18:58:19 +08:00
sino
7cfebffbb8
chore: update default endpoint for ark provider ( #7741 )
2024-08-28 13:56:50 +08:00
crazywoola
da326baa5e
fix: tongyi Error: 'NoneType' object is not subscriptable ( #7705 )
2024-08-27 16:56:06 +08:00
sino
ee7d5e7206
feat: support Moonshot and GLM models tool call for volc ark provider ( #7666 )
2024-08-27 14:43:37 +08:00
Hélio Lúcio
7b7576ad55
Add Azure AI Studio as provider ( #7549 )
...
Co-authored-by: Hélio Lúcio <canais.hlucio@voegol.com.br>
2024-08-27 09:52:59 +08:00
代君
7c2bb31a55
[fix] openai's tool role does not support name parameter. ( #7659 )
2024-08-26 18:52:34 +08:00
Seayon
561a61e7fe
Improve MIME type detection for image URLs ( #6531 )
...
Co-authored-by: seayon <zhaoxuyang@shouqianba.com>
2024-08-25 13:36:16 +08:00
sino
efc136cce5
feat: Introduce Ark SDK v3 and ensure compatibility with models of SDK v2 ( #7579 )
...
Co-authored-by: crazywoola <427733928@qq.com>
2024-08-24 19:29:45 +08:00
噢哎哟喂
ad13011043
add JSON Mode support for moonshot models ( #7568 )
2024-08-23 16:24:45 +08:00
Fei He
6025002971
add qwen text-embedding-v3 support. ( #7567 )
2024-08-23 15:32:38 +08:00
orangeclk
a24717765e
feat: forward zhipu finish_reason ( #7560 )
2024-08-23 11:15:38 +08:00
orangeclk
f53454f81d
add finish_reason to the LLM node output ( #7498 )
2024-08-21 17:29:30 +08:00
非法操作
f7af8c7cc7
feat: gpt-4o-mini-2024-07-18 support json schema ( #7489 )
2024-08-21 15:11:29 +08:00
Xiyuan Chen
4e7b6aec3a
feat: support pinning, including, and excluding for model providers and tools ( #7419 )
...
Co-authored-by: GareArc <chen4851@purude.edu>
2024-08-21 11:16:43 +08:00
Nam Vu
6991a243aa
chore: correct _tts_invoke_streaming max length ( #7423 )
2024-08-20 10:20:04 +08:00
Chengyu Yan
1f944c6eeb
feat(api): support wenxin bge-large and tao embedding model. ( #7393 )
2024-08-19 22:25:09 +08:00
Xiao Ley
53cf756207
feat: OpenRouter add gpt-4o-2024-08-06 model ( #7409 )
2024-08-19 19:14:08 +08:00
-LAN-
0087afc2e3
fix(api/core/model_runtime/model_providers/__base/large_language_model.py): Add TEXT type checker ( #7407 )
2024-08-19 18:45:30 +08:00
SoaringEthan
acd72e3ab2
feat: support xinference's auth system ( #7369 )
2024-08-19 12:41:56 +08:00
Chengyu Yan
bfd905602f
feat(api): support wenxin text embedding ( #7377 )
2024-08-19 09:15:19 +08:00
sino
a0a67873aa
chore: optimize ark model parameters ( #7378 )
2024-08-19 08:44:19 +08:00
噢哎哟喂
baaa3f7f42
add base url for moonshot model ( #7360 )
2024-08-17 10:28:09 +08:00
Weaxs
3a33062405
feat: support siliconflow rerank ( #7337 )
2024-08-16 20:21:41 +08:00
Xiyuan Chen
c7df6783df
Revert "feat: support pinning, including, and excluding for Model Providers and Tools" ( #7324 )
2024-08-15 23:51:00 +08:00
噢哎哟喂
6fdbc7dbf3
fix error when use farui-plus model ( #7316 )
...
Co-authored-by: 雪风 <xuefeng@shifaedu.cn>
2024-08-15 20:14:13 +08:00
Hongbin
d1a6702aa4
Update PerfXCloud Model List ( #7212 )
...
Co-authored-by: xhb <466010723@qq.com>
2024-08-15 19:42:15 +08:00
Xiyuan Chen
7619850855
feat: support pinning, including, and excluding for Model Providers and Tools ( #7283 )
2024-08-15 12:58:38 +08:00
非法操作
6ff7fd80a1
feat: support OPENAI json_schema ( #7258 )
2024-08-15 11:29:19 +08:00
非法操作
5aa373dc04
feat: add chatgpt-4o-latest ( #7289 )
2024-08-15 11:19:10 +08:00
Xiyuan Chen
d29b32fce2
fix: typo in upstage/llm/_position.yaml ( #7286 )
2024-08-15 08:39:35 +08:00
噢哎哟喂
52383d0161
add support for tongyi-farui ( #7248 )
...
Co-authored-by: 雪风 <xuefeng@shifaedu.cn>
2024-08-14 14:09:13 +08:00
Onelevenvy
0f59d76997
fix: add context_size and max_chunks to Tongyi embedding to resolve issue #7189 ( #7227 )
2024-08-13 16:35:22 +08:00
shAlfred
a12ddc47e7
feat: add support of speech2text function for OpenAI-API-compatible and Siliconflow ( #7197 )
2024-08-12 21:38:59 +08:00
Weaxs
67b9fdaad7
siliconflow support bge-3 && bce-v1 embedding ( #7198 )
2024-08-12 19:14:43 +08:00
ybalbert001
f2cb1fb09f
Fix : Workflow "start" paste url not support s3 pre-signed URL ( #6855 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-08-11 16:45:15 +08:00
Yanyi Liu
5b32f2e0dd
Feat: Add model provider Text Embedding Inference for embedding and rerank ( #7132 )
2024-08-09 19:12:13 +08:00
Yanyi Liu
4cbeb6815b
Fix: Wrong cutoff length lead to empty input in openai compatible embedding model. ( #7133 )
2024-08-09 19:11:57 +08:00
forrestlinfeng
07511dfaf4
update stepfun model ( #7118 )
...
Co-authored-by: chenlinfeng <chenlinfeng@step.ai>
Co-authored-by: Tfsh <tianfs_fight@163.com>
2024-08-08 20:40:37 +08:00
小羽
7944ce0594
feat: wenxin add yi-34b-chat ( #7117 )
2024-08-08 20:01:21 +08:00
orangeclk
83acb53c08
feat: add zhipu embedding-3 ( #7100 )
2024-08-08 17:08:46 +08:00
shAlfred
a7162240e6
feat: add text-embedding function and LLM models to Siliconflow ( #7090 )
2024-08-08 17:08:28 +08:00
小羽
34a9dbe826
Feat/add 360-zhinao provider ( #7069 )
2024-08-08 14:23:08 +08:00
orangeclk
f288d367ac
Add price info for zhipu models ( #7084 )
2024-08-08 14:17:05 +08:00
Waffle
5e2fa13126
feat: support glm-4-long ( #7070 )
...
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2024-08-08 10:54:39 +08:00
Joe
d7bb422a5c
fix: hunyuan assistant_prompt_message pydantic error ( #7062 )
2024-08-07 18:31:40 +08:00
majian
99b78dd198
feat: add gpt-4o-2024-08-06 ( #7046 )
2024-08-07 15:35:57 +08:00
crazywoola
3516989738
fix: typos in wenxin llm ( #7021 )
2024-08-06 22:33:03 +08:00
Sa Zhang
26991443ed
fix: Fix incorrect context size for jina-reranker-v2 model ( #7006 )
2024-08-06 21:08:29 +08:00
Yefori
bd3ed89516
feat: add function calling for deepseek models ( #6990 )
2024-08-06 13:37:27 +08:00
小羽
23ed15d19f
feat:nvidia add nemotron4-340b and microsoft/phi-3 ( #6973 )
2024-08-06 10:16:41 +08:00
takatost
6da14c2d48
security: fix api image security issues ( #6971 )
2024-08-05 20:21:08 +08:00
Pedro Gomes
a34285196b
Revise the wrong pricing of certain LLM models. ( #6967 )
2024-08-05 18:41:44 +08:00
takatost
ea30174057
chore: optimize streaming tts of xinference ( #6966 )
2024-08-05 18:23:23 +08:00
liuzhenghua
141e4e0276
fix: restore xinference secret field ( #6941 )
...
Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com>
2024-08-04 22:32:24 +08:00
Weaxs
5e634a59a2
compatible xinference reranker server ( #6927 )
2024-08-04 13:49:38 +08:00
JuHyung Son
2e941bb91c
add new provider Solar ( #6884 )
2024-08-02 20:48:09 +08:00
sino
8166a8caf5
feat: update llama3.1 parameters for openrouter ( #6901 )
2024-08-02 13:13:34 +08:00
灰灰
56af1a0adf
perf: change ollama embedded api request ( #6876 )
2024-08-02 12:04:47 +08:00
dufei
f8617db012
fix tongyi tool calls ( #6896 )
2024-08-02 10:03:43 +08:00
Weaxs
cc4785f094
fix: xinference reranker return_documents ( #6888 )
2024-08-01 19:57:53 +08:00
chenxu9741
a9cd6df97e
Remove tts (blocking call) ( #6869 )
2024-08-01 14:50:22 +08:00
呆萌闷油瓶
f31142e758
Azure 4o mini options ( #6873 )
2024-08-01 14:04:18 +08:00
crazywoola
792f908afb
Revert "feat:Azure gpt4o mini" ( #6870 )
2024-08-01 13:32:03 +08:00
呆萌闷油瓶
14367ddc09
feat:Azure gpt4o mini ( #6866 )
2024-08-01 13:03:08 +08:00
Charlie.Wei
cbf7f21ade
Add azure gpt4omini ( #6862 )
...
Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2024-08-01 12:57:52 +08:00
Weaxs
f6e8e120a1
support xinference tts ( #6746 )
2024-08-01 11:59:15 +08:00
Joe
08f922d8c9
fix: anthropic max token NoneType error ( #6858 )
2024-08-01 11:30:00 +08:00
小羽
56b43f62d1
feat: nvidia add llama3.1 model ( #6844 )
2024-07-31 21:24:02 +08:00
Giga Group
4b410494b3
Add model parameter enable_enhance for hunyuan llm model ( #6847 )
...
Co-authored-by: sun <sun@centen.cn>
2024-07-31 20:04:43 +08:00
Joe
df9bd36cab
fix: claude-3-5-sonnet-20240620 max token error ( #6843 )
2024-07-31 18:34:44 +08:00
longzhihun
9ce5cea911
feat: bedrock invoke enhancement ( #6808 )
2024-07-30 21:57:18 +08:00
SiliconFlow, Inc
3e18d32ce5
add deepseek-coder-v2 in siliconflow ( #6149 )
2024-07-29 18:45:19 +08:00
Charles
94d68b6a08
upgrade deepseek params ( #6744 )
2024-07-29 18:31:56 +08:00
Giga Group
c9ff0e3961
Add model hunyuan-embedding ( #6657 )
...
Co-authored-by: sun <sun@centen.cn>
2024-07-29 18:30:52 +08:00
Bowen Liang
20268708cc
chore: improve position map conversion and tolerate empty position yaml file ( #6541 )
2024-07-29 10:32:11 +08:00
-LAN-
83af50368f
fix(api/core/model_runtime/model_providers/azure_openai/llm/llm.py): Try to skip if delta.delta is None. ( #6727 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-07-27 00:05:21 +08:00
Joe
e4542215cc
fix: tongyi empty tool_calls is not supported in message ( #6719 )
2024-07-26 18:10:13 +08:00
Jason
3d3677e912
Feat/model provider novita ( #6717 )
...
Co-authored-by: takatost <takatost@gmail.com>
2024-07-26 17:37:21 +08:00
chenxu9741
6b50bb0fe6
issues #6655 Open ai tts issues ( #6696 )
2024-07-26 14:55:49 +08:00
longzhihun
c5ac004f15
[seanguo] fix: unsupported filename in windows & add Mistral Large 2 ( #6679 )
2024-07-25 19:26:46 +08:00
RookieAgent
78a339a794
modify llama3-1 yaml filename to support Windows pull operations ( #6677 )
2024-07-25 18:58:55 +08:00
Giga Group
ca696fe94c
Add support of tool-call for model provider "hunyuan" ( #6656 )
...
Co-authored-by: sun <sun@centen.cn>
2024-07-25 11:27:58 +08:00
longzhihun
9815aab7a3
[seanguo] feat: add llama 3.1 support in bedrock ( #6645 )
2024-07-25 11:20:37 +08:00
zhangzhiqiangcs
d4c55748f1
doc: fix about model features ( #6619 )
2024-07-24 19:12:10 +08:00
dufei
5af2df0cd5
fix: qwen fc error ( #6620 )
...
Co-authored-by: dufei <du_fei@venusgroup.com.cn>
2024-07-24 16:56:06 +08:00
takatost
4c85393a1d
feat: add GroqCloud llama3.1 series models support ( #6596 )
2024-07-24 00:41:58 +08:00
sino
d5c2680fde
feat: support llama3.1 series models for openrouter provider ( #6595 )
2024-07-24 00:37:48 +08:00
Joe
8123a00e97
feat: update prompt generate ( #6516 )
2024-07-23 19:52:14 +08:00
Lance Mao
7c55c39085
feat: add tencent asr ( #6091 )
2024-07-23 16:38:39 +08:00
-LAN-
5e6fc58db3
Feat/environment variables in workflow ( #6515 )
...
Co-authored-by: JzoNg <jzongcode@gmail.com>
2024-07-22 15:29:39 +08:00
sino
4f9f175f25
fix: correct gpt-4o-mini max token ( #6472 )
...
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2024-07-19 18:24:58 +08:00
sino
9e168f9d1c
feat: support gpt-4o-mini for openrouter provider ( #6447 )
2024-07-19 13:09:41 +08:00
Weaxs
ea45496a74
update ernie models ( #6454 )
2024-07-19 13:08:39 +08:00
Richards Tu
8e49146a35
[EMERGENCY] Fix Anthropic header issue ( #6445 )
2024-07-19 07:38:15 +08:00
takatost
dad3fd2dc1
feat: add gpt-4o-mini ( #6442 )
2024-07-19 01:53:43 +08:00
ybalbert001
4a026fa352
Enhancement: add model provider - Amazon Sagemaker ( #6255 )
...
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
Co-authored-by: crazywoola <427733928@qq.com>
2024-07-18 19:32:31 +08:00
themanforfree
ba181197c2
feat: api_key support for xinference ( #6417 )
...
Signed-off-by: themanforfree <themanforfree@gmail.com>
2024-07-18 18:58:46 +08:00
forrestlinfeng
3b5b548af3
Add Stepfun LLM Support ( #6346 )
2024-07-18 07:47:18 +08:00
Richards Tu
4782fb50c4
Support new Claude-3.5 Sonnet max token limit ( #6335 )
2024-07-18 07:47:06 +08:00
xielong
f3f052ba36
fix: rename model from ernie-4.0-8k-Latest to ernie-4.0-8k-latest ( #6383 )
2024-07-17 19:07:47 +08:00
longzhihun
ed9e692263
feat: bedrock model runtime enhancement ( #6299 )
2024-07-16 15:54:39 +08:00
呆萌闷油瓶
d66d7146a3
chore:update azure GA version 2024-06-01 ( #6307 )
2024-07-16 10:32:18 +08:00
Onelevenvy
b47fa27a35
fix: zhipuai validate error when user's api key not support for chatglm_turbo in issue #6289 ( #6290 )
2024-07-15 19:27:18 +08:00
thibautleaux-kreactive
96c171805a
Update bedrock.yaml ( #6281 )
2024-07-15 16:53:03 +08:00
Benjamin
ec181649ae
Update model provider configuration for Triton Inference Server and X… ( #6274 )
2024-07-15 15:07:28 +08:00
Waffle
07add06c59
Feat/add zhipu CogView 3 tool ( #6210 )
2024-07-13 17:39:17 +08:00
Little 羊
7c2c949f01
Update ernie_bot.py ( #6236 )
2024-07-12 19:54:53 +08:00
耐小心
d7a6f25c63
fix: differentiate prompts fields based on function_calling_type ( #5880 )
2024-07-12 11:07:38 +08:00
crazywoola
ee3936916f
upgrade deepseek params ( #6215 )
2024-07-12 10:55:44 +08:00
Little 羊
2f064c68bc
Create ernie-4.0-turbo-8k-preview ( #6132 )
2024-07-11 20:20:07 +08:00
Su Yang
215661ef91
feat: add PerfXCloud, Qwen series #6116 ( #6117 )
2024-07-10 18:26:10 +08:00
chenxu9741
6ef401a9f0
feat:add tts-streaming config and future ( #5492 )
2024-07-09 11:33:58 +08:00
sino
85744b72e5
feat: support moonshot and glm base models for volcengine provider ( #6029 )
2024-07-07 01:17:33 +08:00
Masashi Tomooka
3b23d6764f
fix: token count includes base64 string of input images ( #5868 )
2024-07-06 16:53:32 +08:00
-LAN-
4d105d7bd7
feat(*): Switch to dify_config. ( #6025 )
2024-07-06 12:05:13 +08:00
orangeclk
f8aaa57f31
feat: add retry mechanism for zhipuai ( #5926 )
2024-07-05 10:49:18 +08:00
-LAN-
d7f75d17cc
Chore/remove-unused-code ( #5917 )
2024-07-04 18:18:26 +08:00
longzhihun
aecdfa2d5c
feat: add claude3 function calling ( #5889 )
2024-07-03 22:21:02 +08:00
longzhihun
fdfbbde10d
[seanguo] modify bedrock Claude3 invoke method to converse API ( #5768 )
...
Co-authored-by: Chenhe Gu <guchenhe@gmail.com>
2024-07-01 04:36:13 +08:00
takatost
0bf4817474
fix: _convert_prompt_message_to_dict parameters err ( #5716 )
2024-06-28 21:00:00 +08:00
呆萌闷油瓶
68ac433218
feat: add support Spark4.0 ( #5688 )
2024-06-28 17:39:11 +08:00
Kevin
b3d6726f65
Feature/add qwen llm ( #5659 )
2024-06-28 11:06:29 +08:00
liuzhenghua
2b080b5cfc
feature: Add presence_penalty and frequency_penalty parameters to the … ( #5637 )
...
Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com>
2024-06-28 00:27:20 +08:00
takatost
3ccad33194
feat: add jina new pre-defined rerankers, include: jina-reranker-v2 ( #5657 )
2024-06-27 13:45:35 +08:00
sunxichen
bafc8a0bde
fix: tool call message role according to credentials ( #5625 )
...
Co-authored-by: sunxichen <sun.xc@digitalcnzz.com>
2024-06-27 12:35:27 +08:00
Bowen Liang
dcb72e0067
chore: apply flake8-comprehensions Ruff rules to improve collection comprehensions ( #5652 )
...
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-06-27 11:21:31 +08:00
Joe
4e2de638af
feat: add ops trace ( #5483 )
...
Co-authored-by: takatost <takatost@gmail.com>
2024-06-26 17:33:29 +08:00
sino
877a2c144b
feat: support predefined models for openrouter ( #5494 )
2024-06-24 16:31:53 +08:00
-LAN-
ba67206bb9
fix(api/model_runtime/azure/llm): Switch to tool_call. ( #5541 )
2024-06-24 15:35:21 +08:00
vccler
48757e581e
fix: zhipu tool calling, this PR fixes the bug described in issue #5496 ( #5469 )
...
Co-authored-by: vccler <vccler@163.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-06-22 12:41:24 +08:00
LXM
e8ad0339a3
fix: tongyi json output ( #5396 )
2024-06-22 12:25:23 +08:00
crazywoola
91d38a535f
fix: max_tokens of qwen-plus & qwen-plus-chat ( #5480 )
2024-06-21 16:49:33 +08:00
Pan, Wen-Ming
95c882934e
feat: add support for Vertex AI claude-3-5-sonnet@20240620 ( #5475 )
...
Co-authored-by: Wenming Pan <pwm@google.com>
2024-06-21 16:45:56 +08:00
Su Yang
26b6fd2236
feat: add support for bedrock claude-3-5-sonnet-20240620 ( #5461 )
2024-06-21 10:21:35 +08:00
takatost
ff0f02d809
feat: add support for claude-3-5-sonnet-20240620 ( #5452 )
2024-06-21 00:23:15 +08:00
-LAN-
142dc0afd7
refactor: Remove unused code in large_language_model.py ( #5433 )
2024-06-20 16:20:40 +08:00
-LAN-
23fa3dedc4
fix(core): Fix incorrect type hints. ( #5427 )
2024-06-20 15:16:21 +08:00
Ikko Eltociear Ashimine
8266842809
chore: update llm.py ( #5335 )
2024-06-18 09:29:14 +08:00
Richards Tu
c163521b9e
Update and fix the model param of Deepseek ( #5329 )
2024-06-17 21:40:04 +08:00
Justin Wu
61f4f08744
Add bedrock command r models ( #4521 )
...
Co-authored-by: Justin Wu <justin.wu@ringcentral.com>
Co-authored-by: Chenhe Gu <guchenhe@gmail.com>
2024-06-17 20:37:46 +08:00
-LAN-
5a99aeb864
fix(core): Reorder field_validator and classmethod to fit Pydantic V2. ( #5257 )
2024-06-17 10:04:28 +08:00
crazywoola
9a64aa76c1
fix: typo and check ( #5287 )
2024-06-17 09:15:43 +08:00
Pan, Wen-Ming
4b54843ed7
fix: run agent with Vertex AI Gemini models ( #5260 )
...
Co-authored-by: Wenming Pan <pwm@google.com>
2024-06-16 09:36:31 +08:00
kurokobo
2e842333b1
fix: correct typos in the icons for microsoft ( #5243 )
2024-06-15 21:02:47 +08:00
Masashi Tomooka
d9bee03ff6
fix: embedding job fails using IAM role ( #5252 )
2024-06-15 18:57:54 +08:00
Jyong
ba5f8afaa8
Feat/firecrawl data source ( #5232 )
...
Co-authored-by: Nicolas <nicolascamara29@gmail.com>
Co-authored-by: chenhe <guchenhe@gmail.com>
Co-authored-by: takatost <takatost@gmail.com>
2024-06-15 02:46:02 +08:00
Bin
0f35d07052
support ERNIE-4.0-8K-Latest ( #5216 )
2024-06-14 18:45:24 +08:00
-LAN-
7f44e88eda
fix(model_providers/ollama): Fix OllamaLargeLanguageModel to correctly set the stop option ( #5217 )
2024-06-14 18:26:14 +08:00
Jason
b7ff765d8d
Add novita.ai as model provider ( #4961 )
2024-06-14 18:23:06 +08:00
Masashi Tomooka
0633aae7dc
feat: allow to use IAM Role for Bedrock ( #5188 )
2024-06-14 15:18:42 +08:00
takatost
415022aa14
fix: pydantic2 error ( #5172 )
2024-06-14 03:05:04 +08:00