Commit Graph

559 Commits

Author SHA1 Message Date
ybalbert001
7c485f8bb8
fix llm integration problem: It doesn't work on docker env (#8701)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-24 10:33:30 +08:00
Sa Zhang
7f1b028840
fix: change the brand name to Jina AI (#8691)
Co-authored-by: sa zhang <sa.zhang@jina.ai>
2024-09-23 21:39:26 +08:00
Nam Vu
bef83a4d2e
fix: typos and improve naming conventions: (#8687) 2024-09-23 21:32:58 +08:00
ice yao
d7aada38a1
Add nomic embedding model provider (#8640) 2024-09-23 19:57:21 +08:00
AAEE86
a126d535cf
add Spark Max-32K (#8676) 2024-09-23 16:39:46 +08:00
AAEE86
3554a803e7
add zhipuai web search (#8668) 2024-09-23 16:19:42 +08:00
AAEE86
c66cecaa55
add Qwen model translate (#8674) 2024-09-23 16:18:55 +08:00
Aaron Ji
3618a97c20
feat: extend api params for Jina Embeddings V3 (#8657) 2024-09-23 13:45:09 +08:00
zhuhao
e34f04380d
feat: add deepseek-v2.5 for model provider siliconflow (#8639) 2024-09-22 21:44:06 +08:00
zhuhao
6df77038a2
docs: fix predefined_model_scale_out.md redirect error (#8633) 2024-09-22 16:45:45 +08:00
zhuhao
45c0a44411
feat: add qwen2.5 for model provider siliconflow (#8630) 2024-09-22 16:42:34 +08:00
CXwudi
97895ec41a
chore: add Gemini newest experimental models (close #7121) (#8621) 2024-09-22 13:38:08 +08:00
sino
6d56d5c1f6
feat: support o1 series models for openrouter (#8358) 2024-09-22 10:23:50 +08:00
AAEE86
c9f1e18df1
Add model parameter translation (#8509)
Co-authored-by: swingchen01 <swings@126.com>
Co-authored-by: 陈长君 <chenchangjun@shuwen.com>
2024-09-22 10:14:33 +08:00
Waffle
740fad06c1
feat(tools/cogview): Updated cogview tool to support cogview-3 and the latest cogview-3-plus (#8382) 2024-09-22 10:14:14 +08:00
ice yao
0665268578
Add Fireworks AI as new model provider (#8428) 2024-09-22 10:13:00 +08:00
呆萌闷油瓶
c8b9bdebfe
feat:use xinference tts stream mode (#8616) 2024-09-22 10:08:35 +08:00
AAEE86
1a8dcae10e
add Qwen custom add model interface (#8565) 2024-09-21 22:52:10 +08:00
AAEE86
5ddb601e43
add MixtralAI Model (#8517) 2024-09-21 18:08:07 +08:00
Hongbin
5541248264
Update the PerfXCloud provider model list,Update PerfXCloudProvider validate_provider_credentials method. (#8587)
Co-authored-by: xhb <466010723@qq.com>
2024-09-21 17:33:15 +08:00
Su Yang
c87f710d58
Fix: update qwen model and model config (#8584)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-20 17:05:57 +08:00
Su Yang
1568c5cae9
fix: fix qwen series model type (#8580) 2024-09-20 15:29:33 +08:00
MuYu
a03919c3b3
feat: add hunyuan-vision (#8529) 2024-09-19 18:08:01 +08:00
Su Yang
d6de96c4b4
feat: sync Qwen API with Aliyun Bailian (#8538) 2024-09-19 17:08:59 +08:00
Wang Bo
6f222b49f2
refactor: rename task_type to task for jina embeddings v3 (#8488) 2024-09-18 14:53:15 +08:00
-LAN-
8dfe8c773a
chore: Deprecate gpt-3.5-turbo-0613 and gpt-3.5-turbo-16k-0613 models (#8500) 2024-09-18 14:38:09 +08:00
ybalbert001
b6ad7a1e06
Fix: https://github.com/langgenius/dify/issues/8190 (Update Model nam… (#8426)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-14 17:14:18 +08:00
Aaron Ji
6f7625fa47
chore: update Jina embedding model (#8376) 2024-09-14 16:21:17 +08:00
ybalbert001
b613b11422
Fix: Support Bedrock cross region inference #8190 (Update Model name to distinguish between different region groups) (#8402)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-14 11:06:20 +08:00
crazywoola
71b4480c4a
fix: o1-mini 65563 -> 65536 (#8388) 2024-09-14 02:39:58 +08:00
Bowen Liang
5b98acde2f
chore: improve usage of striping prefix or suffix of string with Ruff 0.6.5 (#8392) 2024-09-13 23:34:39 +08:00
Bowen Liang
a1104ab97e
chore: refurish python code by applying Pylint linter rules (#8322) 2024-09-13 22:42:08 +08:00
xiandan-erizo
1ab81b4972
support hunyuan-turbo (#8372)
Co-authored-by: sunkesi <sunkesi@hosecloud.com>
2024-09-13 20:21:48 +08:00
takatost
24af4b9313
fix: o1-series model encounters an error when the generate mode is blocking (#8363) 2024-09-13 15:37:54 +08:00
Bowen Liang
6613b8f2e0
chore: fix unnecessary string concatation in single line (#8311) 2024-09-13 14:24:49 +08:00
sino
a45ac6ab98
fix: ark token usage is none (#8351) 2024-09-13 14:19:24 +08:00
takatost
4637ddaa7f
feat: add o1-series models support in Agent App (ReACT only) (#8350) 2024-09-13 13:08:27 +08:00
takatost
e90d3c29ab
feat: add OpenAI o1 series models support (#8328) 2024-09-13 02:15:19 +08:00
Nam Vu
153807f243
fix: response_format label (#8326) 2024-09-12 23:17:29 +08:00
呆萌闷油瓶
02c4b1af71
chore:add Azure openai api version 2024-08-01-preview (#8291) 2024-09-12 20:22:57 +08:00
ybalbert001
d4985fb3aa
Fix: Support Bedrock cross region inference [#8190](https://github.com/langgenius/dify/issues/8190) (#8317) 2024-09-12 19:15:20 +08:00
Bowen Liang
40fb4d16ef
chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
Bowen Liang
c69f5b07ba
chore: apply ruff E501 line-too-long linter rule (#8275)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-12 14:00:36 +08:00
Bowen Liang
0f14873255
chore: cleanup ruff flake8-simplify linter rules (#8286)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-12 12:55:45 +08:00
Bowen Liang
781d294f49
chore: cleanup pycodestyle E rules (#8269) 2024-09-11 18:55:00 +08:00
yalei
f515af2232
let claude models in bedrock support the response_format parameter (#8220)
Co-authored-by: duyalei <>
2024-09-11 18:24:50 +08:00
crazywoola
4d2cd6703b
chore: remove useless code (#8198) 2024-09-11 18:19:34 +08:00
Bowen Liang
292220c596
chore: apply pep8-naming rules for naming convention (#8261) 2024-09-11 16:40:52 +08:00
HowardChan
53f37a6704
fix:ollama text embedding 500 error (#8252) 2024-09-11 16:23:19 +08:00
Nam Vu
342607f4a4
fix: truthy value (#8208) 2024-09-11 15:44:53 +08:00
HowardChan
82c42b9ec5
fix:error when adding the ollama embedding model (#8236)
Co-authored-by: crazywoola <427733928@qq.com>
2024-09-11 10:25:45 +08:00
Bowen Liang
2cf1187b32
chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
takatost
dabfd74622
feat: Parallel Execution of Nodes in Workflows (#8192)
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
Co-authored-by: Yi <yxiaoisme@gmail.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-09-10 15:23:16 +08:00
Jyong
2d690801d1
nvidia rerank top n missed (#8185) 2024-09-10 13:17:48 +08:00
-LAN-
4313d92e6b
feat(api/core/model_runtime/entities/defaults.py): Add TOP_K in default parameters. (#8167) 2024-09-10 09:11:31 +08:00
crazywoola
0bec6a037c
update qwen-long (#8157) 2024-09-09 19:09:42 +08:00
AAEE86
fa34b9aed6
Modify model parameters in Spark LLMs and zhipuai LLMs (#8078)
Co-authored-by: Charlie.Wei <luowei@cvte.com>
2024-09-09 15:36:47 +08:00
crazywoola
a27d4d58ec
fix: ollama text embedding 500 error (#8131) 2024-09-09 15:27:49 +08:00
邹成卓
a15791e788
Fix: tongyi code wrapper works not stable (#7871)
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: crazywoola <427733928@qq.com>
2024-09-09 11:15:17 +08:00
ybalbert001
954580a4af
feat: support more model types and builtin tools on aws/sagemaker (#8061)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-09-09 10:34:11 +08:00
crazywoola
ab7d79275e
fix: Claude can not validate credientials (#8109) 2024-09-09 10:22:42 +08:00
呆萌闷油瓶
d28446301f
feat:add fishaudio in xinference (#8100) 2024-09-08 23:58:02 +08:00
Nam Vu
2d7954c7da
Fix variable typo (#8084) 2024-09-08 13:14:11 +08:00
AAEE86
0cef25ef8c
Revert "fix: parameter rule" (#8070) 2024-09-07 10:44:56 +08:00
crazywoola
900fd82a92
fix: parameter rule (#8064) 2024-09-06 19:15:24 +08:00
tmuife
89aede80cc
Add OCI(Oracle Cloud Infrastructure) Generative AI Service as a Model Provider (#7775)
Co-authored-by: Walter Jin <jinshuhaicc@gmail.com>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: walter from vm <walter.jin@oracle.com>
2024-09-06 14:15:40 +08:00
Leng Yue
bd0992275c
feat: support fish audio TTS (#7982) 2024-09-05 14:18:39 +08:00
非法操作
3e7597f2bd
feat: add gpt-4o-2024-08-06 and json_schema for azure openAI service (#7648) 2024-09-04 21:56:08 +08:00
wochuideng
f6b9982c23
Concurrent calls to the Wenxin model, and the exception problem when obtaining the token is fixed (#7976)
Co-authored-by: puqs1 <puqs1@lenovo.com>
2024-09-04 21:44:57 +08:00
非法操作
0f72a8e89d
chore: refactor the beichuan model (#7953) 2024-09-04 16:22:31 +08:00
呆萌闷油瓶
83494cb4f5
fix:empty voice occurs when xinference CosyVoice tts model (#7958) 2024-09-04 13:04:31 +08:00
orangeclk
3f2a806abe
fix: glm models prices and max_tokens correction (#7882) 2024-09-02 14:29:09 +08:00
sino
1f56a20b62
feat: support auth by api key for ark provider (#7845) 2024-08-31 10:56:32 +08:00
非法操作
dc015c380a
feat: add zhipu glm_4_plus and glm_4v_plus model (#7824) 2024-08-30 15:08:31 +08:00
hisir
f0273f00e1
Fixed when testing the openai compatible interface model, an error is reported when no object is returned (#7808) 2024-08-29 18:58:19 +08:00
sino
7cfebffbb8
chore: update default endpoint for ark provider (#7741) 2024-08-28 13:56:50 +08:00
crazywoola
da326baa5e
fix: tongyi Error: 'NoneType' object is not subscriptable (#7705) 2024-08-27 16:56:06 +08:00
sino
ee7d5e7206
feat: support Moonshot and GLM models tool call for volc ark provider (#7666) 2024-08-27 14:43:37 +08:00
Hélio Lúcio
7b7576ad55
Add Azure AI Studio as provider (#7549)
Co-authored-by: Hélio Lúcio <canais.hlucio@voegol.com.br>
2024-08-27 09:52:59 +08:00
代君
7c2bb31a55
[fix] openai's tool role dose not support name parameter. (#7659) 2024-08-26 18:52:34 +08:00
Seayon
561a61e7fe
Improve MIME type detection for image URLs (#6531)
Co-authored-by: seayon <zhaoxuyang@shouqianba.com>
2024-08-25 13:36:16 +08:00
sino
efc136cce5
feat: Introduce Ark SDK v3 and ensure compatibility with models of SDK v2 (#7579)
Co-authored-by: crazywoola <427733928@qq.com>
2024-08-24 19:29:45 +08:00
噢哎哟喂
ad13011043
add JSON Mode support for moonshot models (#7568) 2024-08-23 16:24:45 +08:00
Fei He
6025002971
add qwen text-embedding-v3 support. (#7567) 2024-08-23 15:32:38 +08:00
orangeclk
a24717765e
feat: forward zhipu finish_reason (#7560) 2024-08-23 11:15:38 +08:00
orangeclk
f53454f81d
add finish_reason to the LLM node output (#7498) 2024-08-21 17:29:30 +08:00
非法操作
f7af8c7cc7
feat: gpt-4o-mini-2024-07-18 support json schema (#7489) 2024-08-21 15:11:29 +08:00
Xiyuan Chen
4e7b6aec3a
feat: support pinning, including, and excluding for model providers and tools (#7419)
Co-authored-by: GareArc <chen4851@purude.edu>
2024-08-21 11:16:43 +08:00
Nam Vu
6991a243aa
chore: correct _tts_invoke_streaming max length (#7423) 2024-08-20 10:20:04 +08:00
Chengyu Yan
1f944c6eeb
feat(api): support wenxin bge-large and tao embedding model. (#7393) 2024-08-19 22:25:09 +08:00
Xiao Ley
53cf756207
feat: OpenRouter add gpt-4o-2024-08-06 model (#7409) 2024-08-19 19:14:08 +08:00
-LAN-
0087afc2e3
fix(api/core/model_runtime/model_providers/__base/large_language_model.py): Add TEXT type checker (#7407) 2024-08-19 18:45:30 +08:00
SoaringEthan
acd72e3ab2
feat: support xinference's auth system (#7369) 2024-08-19 12:41:56 +08:00
Chengyu Yan
bfd905602f
feat(api): support wenxin text embedding (#7377) 2024-08-19 09:15:19 +08:00
sino
a0a67873aa
chore: optimize ark model parameters (#7378) 2024-08-19 08:44:19 +08:00
噢哎哟喂
baaa3f7f42
add base url for moonshot model (#7360) 2024-08-17 10:28:09 +08:00
Weaxs
3a33062405
feat: support siliconflow rerank (#7337) 2024-08-16 20:21:41 +08:00
Xiyuan Chen
c7df6783df
Revert "feat: support pinning, including, and excluding for Model Providers and Tools" (#7324) 2024-08-15 23:51:00 +08:00
噢哎哟喂
6fdbc7dbf3
fix error when use farui-plus model (#7316)
Co-authored-by: 雪风 <xuefeng@shifaedu.cn>
2024-08-15 20:14:13 +08:00
Hongbin
d1a6702aa4
Update PerfXCloud Model List (#7212)
Co-authored-by: xhb <466010723@qq.com>
2024-08-15 19:42:15 +08:00