outman/dify - dify

Author	SHA1	Message	Date
zhuhao	850492dafa	feat: deprecate gte-Qwen2-7B-instruct embedding model (#8866 )	2024-09-28 21:40:27 +08:00
zhuhao	61c89a9168	feat: add internlm2.5-20b and qwen2.5-coder-7b model (#8862 )	2024-09-28 16:31:02 +08:00
zhuhao	6cd22f3bca	fix: update qwen2.5-coder-7b model name (#8861 )	2024-09-28 15:01:27 +08:00
CXwudi	0603359e2d	fix: delete harm catalog settings for gemini (#8829 )	2024-09-27 13:49:03 +08:00
HowardChan	bb781764b8	Add Llama3.2 models in Groq provider (#8831 )	2024-09-27 12:13:00 +08:00
zhuhao	29275c7447	feat: deprecate mistral model for siliconflow (#8828 )	2024-09-27 12:11:56 +08:00
CXwudi	e5efd09ebb	chore: massive update of the Gemini models based on latest documentation (#8822 )	2024-09-27 09:14:33 +08:00
wenmeng zhou	ecc951609d	add more detailed doc for models of qwen series (#8799 ) Co-authored-by: crazywoola <427733928@qq.com>	2024-09-26 22:32:33 +08:00
ice yao	063474f408	Add llama3.2 model in fireworks provider (#8809 )	2024-09-26 22:21:01 +08:00
AAEE86	9a4b53a212	feat: add stream for Gemini (#8678 )	2024-09-26 19:08:59 +08:00
AAEE86	03edfbe6f5	feat: add qwen to add custom model parameters (#8759 )	2024-09-26 19:04:25 +08:00
cx	128a66f7fe	fix: Ollama modelfeature set vision, and an exception occurred at the… (#8783 )	2024-09-26 16:34:40 +08:00
Shenghang Tsai	a0b0809b1c	Add more models for SiliconFlow (#8779 )	2024-09-26 11:29:53 +08:00
Aaron Ji	4c9ef6e830	fix: update usage for Jina Embeddings v3 (#8771 )	2024-09-26 11:29:35 +08:00
zhuhao	ac73763726	chore: add input_type param desc for the _invoke method of text_embedding (#8778 )	2024-09-26 11:23:09 +08:00
Pan, Wen-Ming	02ff6cca70	feat: add support for Vertex AI Gemini 1.5 002 and experimental models (#8767 )	2024-09-25 21:27:26 +08:00
cherryhuahua	d0e0111f88	fix:Spark's large language model token calculation error #7911 (#8755 )	2024-09-25 14:51:42 +08:00
ybalbert001	68c7e68a8a	Fix Issue: switch LLM of SageMaker endpoint doesn't take effect (#8737 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com>	2024-09-25 09:12:35 +08:00
ice yao	91f70d0bd9	Add embedding models in fireworks provider (#8728 )	2024-09-25 08:47:11 +08:00
Jyong	4669eb24be	add embedding input type parameter (#8724 )	2024-09-24 21:53:50 +08:00
Shota Totsuka	1c7877b048	fix: remove harm category setting from vertex ai (#8721 )	2024-09-24 20:53:26 +08:00
ice yao	64baedb484	fix: update nomic model provider token calculation (#8705 )	2024-09-24 14:04:07 +08:00
Benjamin	4638f99aaa	fix: change model provider name issue Ref #8691 (#8710 )	2024-09-24 13:26:58 +08:00
AAEE86	aebe5fc68c	fix: Remove unsupported parameters in qwen model (#8699 )	2024-09-24 13:06:21 +08:00
zhuhao	1ecf70dca0	feat: add mixedbread as a new model provider (#8523 )	2024-09-24 11:20:15 +08:00
ybalbert001	7c485f8bb8	fix llm integration problem: It doesn't work on docker env (#8701 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com>	2024-09-24 10:33:30 +08:00
Sa Zhang	7f1b028840	fix: change the brand name to Jina AI (#8691 ) Co-authored-by: sa zhang <sa.zhang@jina.ai>	2024-09-23 21:39:26 +08:00
Nam Vu	bef83a4d2e	fix: typos and improve naming conventions: (#8687 )	2024-09-23 21:32:58 +08:00
ice yao	d7aada38a1	Add nomic embedding model provider (#8640 )	2024-09-23 19:57:21 +08:00
AAEE86	a126d535cf	add Spark Max-32K (#8676 )	2024-09-23 16:39:46 +08:00
AAEE86	3554a803e7	add zhipuai web search (#8668 )	2024-09-23 16:19:42 +08:00
AAEE86	c66cecaa55	add Qwen model translate (#8674 )	2024-09-23 16:18:55 +08:00
Aaron Ji	3618a97c20	feat: extend api params for Jina Embeddings V3 (#8657 )	2024-09-23 13:45:09 +08:00
zhuhao	e34f04380d	feat: add deepseek-v2.5 for model provider siliconflow (#8639 )	2024-09-22 21:44:06 +08:00
zhuhao	6df77038a2	docs: fix predefined_model_scale_out.md redirect error (#8633 )	2024-09-22 16:45:45 +08:00
zhuhao	45c0a44411	feat: add qwen2.5 for model provider siliconflow (#8630 )	2024-09-22 16:42:34 +08:00
CXwudi	97895ec41a	chore: add Gemini newest experimental models (close #7121 ) (#8621 )	2024-09-22 13:38:08 +08:00
sino	6d56d5c1f6	feat: support o1 series models for openrouter (#8358 )	2024-09-22 10:23:50 +08:00
AAEE86	c9f1e18df1	Add model parameter translation (#8509 ) Co-authored-by: swingchen01 <swings@126.com> Co-authored-by: 陈长君 <chenchangjun@shuwen.com>	2024-09-22 10:14:33 +08:00
Waffle	740fad06c1	feat(tools/cogview): Updated cogview tool to support cogview-3 and the latest cogview-3-plus (#8382 )	2024-09-22 10:14:14 +08:00
ice yao	0665268578	Add Fireworks AI as new model provider (#8428 )	2024-09-22 10:13:00 +08:00
呆萌闷油瓶	c8b9bdebfe	feat:use xinference tts stream mode (#8616 )	2024-09-22 10:08:35 +08:00
AAEE86	1a8dcae10e	add Qwen custom add model interface (#8565 )	2024-09-21 22:52:10 +08:00
AAEE86	5ddb601e43	add MixtralAI Model (#8517 )	2024-09-21 18:08:07 +08:00
Hongbin	5541248264	Update the PerfXCloud provider model list，Update PerfXCloudProvider validate_provider_credentials method. (#8587 ) Co-authored-by: xhb <466010723@qq.com>	2024-09-21 17:33:15 +08:00
Su Yang	c87f710d58	Fix: update qwen model and model config (#8584 ) Co-authored-by: -LAN- <laipz8200@outlook.com>	2024-09-20 17:05:57 +08:00
Su Yang	1568c5cae9	fix: fix qwen series model type (#8580 )	2024-09-20 15:29:33 +08:00
MuYu	a03919c3b3	feat: add hunyuan-vision (#8529 )	2024-09-19 18:08:01 +08:00
Su Yang	d6de96c4b4	feat: sync Qwen API with Aliyun Bailian (#8538 )	2024-09-19 17:08:59 +08:00
Wang Bo	6f222b49f2	refactor: rename task_type to task for jina embeddings v3 (#8488 )	2024-09-18 14:53:15 +08:00
-LAN-	8dfe8c773a	chore: Deprecate gpt-3.5-turbo-0613 and gpt-3.5-turbo-16k-0613 models (#8500 )	2024-09-18 14:38:09 +08:00
ybalbert001	b6ad7a1e06	Fix: https://github.com/langgenius/dify/issues/8190 (Update Model nam… (#8426 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com>	2024-09-14 17:14:18 +08:00
Aaron Ji	6f7625fa47	chore: update Jina embedding model (#8376 )	2024-09-14 16:21:17 +08:00
ybalbert001	b613b11422	Fix: Support Bedrock cross region inference #8190 (Update Model name to distinguish between different region groups) (#8402 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com>	2024-09-14 11:06:20 +08:00
crazywoola	71b4480c4a	fix: o1-mini 65563 -> 65536 (#8388 )	2024-09-14 02:39:58 +08:00
Bowen Liang	5b98acde2f	chore: improve usage of striping prefix or suffix of string with Ruff 0.6.5 (#8392 )	2024-09-13 23:34:39 +08:00
Bowen Liang	a1104ab97e	chore: refurish python code by applying Pylint linter rules (#8322 )	2024-09-13 22:42:08 +08:00
xiandan-erizo	1ab81b4972	support hunyuan-turbo (#8372 ) Co-authored-by: sunkesi <sunkesi@hosecloud.com>	2024-09-13 20:21:48 +08:00
takatost	24af4b9313	fix: o1-series model encounters an error when the generate mode is blocking (#8363 )	2024-09-13 15:37:54 +08:00
Bowen Liang	6613b8f2e0	chore: fix unnecessary string concatation in single line (#8311 )	2024-09-13 14:24:49 +08:00
sino	a45ac6ab98	fix: ark token usage is none (#8351 )	2024-09-13 14:19:24 +08:00
takatost	4637ddaa7f	feat: add o1-series models support in Agent App (ReACT only) (#8350 )	2024-09-13 13:08:27 +08:00
takatost	e90d3c29ab	feat: add OpenAI o1 series models support (#8328 )	2024-09-13 02:15:19 +08:00
Nam Vu	153807f243	fix: response_format label (#8326 )	2024-09-12 23:17:29 +08:00
呆萌闷油瓶	02c4b1af71	chore:add Azure openai api version 2024-08-01-preview (#8291 )	2024-09-12 20:22:57 +08:00
ybalbert001	d4985fb3aa	Fix: Support Bedrock cross region inference [#8190](https://github.com/langgenius/dify/issues/8190) (#8317 )	2024-09-12 19:15:20 +08:00
Bowen Liang	40fb4d16ef	chore: refurbish Python code by applying refurb linter rules (#8296 )	2024-09-12 15:50:49 +08:00
Bowen Liang	c69f5b07ba	chore: apply ruff E501 line-too-long linter rule (#8275 ) Co-authored-by: -LAN- <laipz8200@outlook.com>	2024-09-12 14:00:36 +08:00
Bowen Liang	0f14873255	chore: cleanup ruff flake8-simplify linter rules (#8286 ) Co-authored-by: -LAN- <laipz8200@outlook.com>	2024-09-12 12:55:45 +08:00
Bowen Liang	781d294f49	chore: cleanup pycodestyle E rules (#8269 )	2024-09-11 18:55:00 +08:00
yalei	f515af2232	let claude models in bedrock support the response_format parameter (#8220 ) Co-authored-by: duyalei <>	2024-09-11 18:24:50 +08:00
crazywoola	4d2cd6703b	chore: remove useless code (#8198 )	2024-09-11 18:19:34 +08:00
Bowen Liang	292220c596	chore: apply pep8-naming rules for naming convention (#8261 )	2024-09-11 16:40:52 +08:00
HowardChan	53f37a6704	fix:ollama text embedding 500 error (#8252 )	2024-09-11 16:23:19 +08:00
Nam Vu	342607f4a4	fix: truthy value (#8208 )	2024-09-11 15:44:53 +08:00
HowardChan	82c42b9ec5	fix:error when adding the ollama embedding model (#8236 ) Co-authored-by: crazywoola <427733928@qq.com>	2024-09-11 10:25:45 +08:00
Bowen Liang	2cf1187b32	chore(api/core): apply ruff reformatting (#7624 )	2024-09-10 17:00:20 +08:00
takatost	dabfd74622	feat: Parallel Execution of Nodes in Workflows (#8192 ) Co-authored-by: StyleZhang <jasonapring2015@outlook.com> Co-authored-by: Yi <yxiaoisme@gmail.com> Co-authored-by: -LAN- <laipz8200@outlook.com>	2024-09-10 15:23:16 +08:00
Jyong	2d690801d1	nvidia rerank top n missed (#8185 )	2024-09-10 13:17:48 +08:00
-LAN-	4313d92e6b	feat(api/core/model_runtime/entities/defaults.py): Add TOP_K in default parameters. (#8167 )	2024-09-10 09:11:31 +08:00
crazywoola	0bec6a037c	update qwen-long (#8157 )	2024-09-09 19:09:42 +08:00
AAEE86	fa34b9aed6	Modify model parameters in Spark LLMs and zhipuai LLMs (#8078 ) Co-authored-by: Charlie.Wei <luowei@cvte.com>	2024-09-09 15:36:47 +08:00
crazywoola	a27d4d58ec	fix: ollama text embedding 500 error (#8131 )	2024-09-09 15:27:49 +08:00
邹成卓	a15791e788	Fix: tongyi code wrapper works not stable (#7871 ) Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com> Co-authored-by: crazywoola <427733928@qq.com>	2024-09-09 11:15:17 +08:00
ybalbert001	954580a4af	feat: support more model types and builtin tools on aws/sagemaker (#8061 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com>	2024-09-09 10:34:11 +08:00
crazywoola	ab7d79275e	fix: Claude can not validate credientials (#8109 )	2024-09-09 10:22:42 +08:00
呆萌闷油瓶	d28446301f	feat:add fishaudio in xinference (#8100 )	2024-09-08 23:58:02 +08:00
Nam Vu	2d7954c7da	Fix variable typo (#8084 )	2024-09-08 13:14:11 +08:00
AAEE86	0cef25ef8c	Revert "fix: parameter rule" (#8070 )	2024-09-07 10:44:56 +08:00
crazywoola	900fd82a92	fix: parameter rule (#8064 )	2024-09-06 19:15:24 +08:00
tmuife	89aede80cc	Add OCI(Oracle Cloud Infrastructure) Generative AI Service as a Model Provider (#7775 ) Co-authored-by: Walter Jin <jinshuhaicc@gmail.com> Co-authored-by: crazywoola <427733928@qq.com> Co-authored-by: walter from vm <walter.jin@oracle.com>	2024-09-06 14:15:40 +08:00
Leng Yue	bd0992275c	feat: support fish audio TTS (#7982 )	2024-09-05 14:18:39 +08:00
非法操作	3e7597f2bd	feat: add gpt-4o-2024-08-06 and json_schema for azure openAI service (#7648 )	2024-09-04 21:56:08 +08:00
wochuideng	f6b9982c23	Concurrent calls to the Wenxin model, and the exception problem when obtaining the token is fixed (#7976 ) Co-authored-by: puqs1 <puqs1@lenovo.com>	2024-09-04 21:44:57 +08:00
非法操作	0f72a8e89d	chore: refactor the beichuan model (#7953 )	2024-09-04 16:22:31 +08:00
呆萌闷油瓶	83494cb4f5	fix:empty voice occurs when xinference CosyVoice tts model (#7958 )	2024-09-04 13:04:31 +08:00
orangeclk	3f2a806abe	fix: glm models prices and max_tokens correction (#7882 )	2024-09-02 14:29:09 +08:00
sino	1f56a20b62	feat: support auth by api key for ark provider (#7845 )	2024-08-31 10:56:32 +08:00
非法操作	dc015c380a	feat: add zhipu glm_4_plus and glm_4v_plus model (#7824 )	2024-08-30 15:08:31 +08:00
hisir	f0273f00e1	Fixed when testing the openai compatible interface model, an error is reported when no object is returned (#7808 )	2024-08-29 18:58:19 +08:00
sino	7cfebffbb8	chore: update default endpoint for ark provider (#7741 )	2024-08-28 13:56:50 +08:00
crazywoola	da326baa5e	fix: tongyi Error: 'NoneType' object is not subscriptable (#7705 )	2024-08-27 16:56:06 +08:00
sino	ee7d5e7206	feat: support Moonshot and GLM models tool call for volc ark provider (#7666 )	2024-08-27 14:43:37 +08:00
Hélio Lúcio	7b7576ad55	Add Azure AI Studio as provider (#7549 ) Co-authored-by: Hélio Lúcio <canais.hlucio@voegol.com.br>	2024-08-27 09:52:59 +08:00
代君	7c2bb31a55	[fix] openai's tool role dose not support name parameter. (#7659 )	2024-08-26 18:52:34 +08:00
Seayon	561a61e7fe	Improve MIME type detection for image URLs (#6531 ) Co-authored-by: seayon <zhaoxuyang@shouqianba.com>	2024-08-25 13:36:16 +08:00
sino	efc136cce5	feat: Introduce Ark SDK v3 and ensure compatibility with models of SDK v2 (#7579 ) Co-authored-by: crazywoola <427733928@qq.com>	2024-08-24 19:29:45 +08:00
噢哎哟喂	ad13011043	add JSON Mode support for moonshot models (#7568 )	2024-08-23 16:24:45 +08:00
Fei He	6025002971	add qwen text-embedding-v3 support. (#7567 )	2024-08-23 15:32:38 +08:00
orangeclk	a24717765e	feat: forward zhipu finish_reason (#7560 )	2024-08-23 11:15:38 +08:00
orangeclk	f53454f81d	add finish_reason to the LLM node output (#7498 )	2024-08-21 17:29:30 +08:00
非法操作	f7af8c7cc7	feat: gpt-4o-mini-2024-07-18 support json schema (#7489 )	2024-08-21 15:11:29 +08:00
Xiyuan Chen	4e7b6aec3a	feat: support pinning, including, and excluding for model providers and tools (#7419 ) Co-authored-by: GareArc <chen4851@purude.edu>	2024-08-21 11:16:43 +08:00
Nam Vu	6991a243aa	chore: correct _tts_invoke_streaming max length (#7423 )	2024-08-20 10:20:04 +08:00
Chengyu Yan	1f944c6eeb	feat(api): support wenxin bge-large and tao embedding model. (#7393 )	2024-08-19 22:25:09 +08:00
Xiao Ley	53cf756207	feat: OpenRouter add gpt-4o-2024-08-06 model (#7409 )	2024-08-19 19:14:08 +08:00
-LAN-	0087afc2e3	fix(api/core/model_runtime/model_providers/__base/large_language_model.py): Add TEXT type checker (#7407 )	2024-08-19 18:45:30 +08:00
SoaringEthan	acd72e3ab2	feat: support xinference's auth system (#7369 )	2024-08-19 12:41:56 +08:00
Chengyu Yan	bfd905602f	feat(api): support wenxin text embedding (#7377 )	2024-08-19 09:15:19 +08:00
sino	a0a67873aa	chore: optimize ark model parameters (#7378 )	2024-08-19 08:44:19 +08:00
噢哎哟喂	baaa3f7f42	add base url for moonshot model (#7360 )	2024-08-17 10:28:09 +08:00
Weaxs	3a33062405	feat: support siliconflow rerank (#7337 )	2024-08-16 20:21:41 +08:00
Xiyuan Chen	c7df6783df	Revert "feat: support pinning, including, and excluding for Model Providers and Tools" (#7324 )	2024-08-15 23:51:00 +08:00
噢哎哟喂	6fdbc7dbf3	fix error when use farui-plus model (#7316 ) Co-authored-by: 雪风 <xuefeng@shifaedu.cn>	2024-08-15 20:14:13 +08:00
Hongbin	d1a6702aa4	Update PerfXCloud Model List (#7212 ) Co-authored-by: xhb <466010723@qq.com>	2024-08-15 19:42:15 +08:00
Xiyuan Chen	7619850855	feat: support pinning, including, and excluding for Model Providers and Tools (#7283 )	2024-08-15 12:58:38 +08:00
非法操作	6ff7fd80a1	feat: support OPENAI json_schema (#7258 )	2024-08-15 11:29:19 +08:00
非法操作	5aa373dc04	feat: add chatgpt-4o-latest (#7289 )	2024-08-15 11:19:10 +08:00
Xiyuan Chen	d29b32fce2	fix: typo in upstage/llm/_position.yaml (#7286 )	2024-08-15 08:39:35 +08:00
噢哎哟喂	52383d0161	add support for tongyi-farui (#7248 ) Co-authored-by: 雪风 <xuefeng@shifaedu.cn>	2024-08-14 14:09:13 +08:00
Onelevenvy	0f59d76997	fix: add context_size and max_chunks to Tongyi embedding to resolve issue #7189 (#7227 )	2024-08-13 16:35:22 +08:00
shAlfred	a12ddc47e7	feat: add support of speech2text function for OpenAI-API-compatible and Siliconflow (#7197 )	2024-08-12 21:38:59 +08:00
Weaxs	67b9fdaad7	siliconflow support bge-3 && bce-v1 embedding (#7198 )	2024-08-12 19:14:43 +08:00
ybalbert001	f2cb1fb09f	Fix : Workflow "start" paste url not support s3 pre-signed URL (#6855 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com>	2024-08-11 16:45:15 +08:00
Yanyi Liu	5b32f2e0dd	Feat: Add model provider Text Embedding Inference for embedding and rerank (#7132 )	2024-08-09 19:12:13 +08:00
Yanyi Liu	4cbeb6815b	Fix: Wrong cutoff length lead to empty input in openai compatible embedding model. (#7133 )	2024-08-09 19:11:57 +08:00
forrestlinfeng	07511dfaf4	update stepfun model (#7118 ) Co-authored-by: chenlinfeng <chenlinfeng@step.ai> Co-authored-by: Tfsh <tianfs_fight@163.com>	2024-08-08 20:40:37 +08:00
小羽	7944ce0594	feat: wenxin add yi-34b-chat (#7117 )	2024-08-08 20:01:21 +08:00
orangeclk	83acb53c08	feat: add zhipu embedding-3 (#7100 )	2024-08-08 17:08:46 +08:00
shAlfred	a7162240e6	feat: add text-embedding functon and LLM models to Siliconflow (#7090 )	2024-08-08 17:08:28 +08:00
小羽	34a9dbe826	Feat/add 360-zhinao provider (#7069 )	2024-08-08 14:23:08 +08:00
orangeclk	f288d367ac	Add price info for zhipu models (#7084 )	2024-08-08 14:17:05 +08:00
Waffle	5e2fa13126	feat: support glm-4-long (#7070 ) Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>	2024-08-08 10:54:39 +08:00
Joe	d7bb422a5c	fix: hunyuan assistant_prompt_message pydantic error (#7062 )	2024-08-07 18:31:40 +08:00
majian	99b78dd198	feat: add gpt-4o-2024-08-06 (#7046 )	2024-08-07 15:35:57 +08:00
crazywoola	3516989738	fix: typos in wenxin llm (#7021 )	2024-08-06 22:33:03 +08:00
Sa Zhang	26991443ed	fix: Fix incorrect context size for jina-reranker-v2 model (#7006 )	2024-08-06 21:08:29 +08:00
Yefori	bd3ed89516	feat: add function calling for deepseek models (#6990 )	2024-08-06 13:37:27 +08:00
小羽	23ed15d19f	feat:nvidia add nemotron4-340b and microsoft/phi-3 (#6973 )	2024-08-06 10:16:41 +08:00
takatost	6da14c2d48	security: fix api image security issues (#6971 )	2024-08-05 20:21:08 +08:00
Pedro Gomes	a34285196b	Revise the wrong pricing of certain LLM models. (#6967 )	2024-08-05 18:41:44 +08:00
takatost	ea30174057	chore: optimize streaming tts of xinference (#6966 )	2024-08-05 18:23:23 +08:00
liuzhenghua	141e4e0276	fix: restore xinference secret field (#6941 ) Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com>	2024-08-04 22:32:24 +08:00
Weaxs	5e634a59a2	compatible xinference reranker server (#6927 )	2024-08-04 13:49:38 +08:00
JuHyung Son	2e941bb91c	add new provider Solar (#6884 )	2024-08-02 20:48:09 +08:00
sino	8166a8caf5	feat: update llama3.1 parameters for openrouter (#6901 )	2024-08-02 13:13:34 +08:00
灰灰	56af1a0adf	pref: change ollama embedded api request (#6876 )	2024-08-02 12:04:47 +08:00
dufei	f8617db012	fix tongyi tool calls (#6896 )	2024-08-02 10:03:43 +08:00
Weaxs	cc4785f094	fix: xinference reranker return_documents (#6888 )	2024-08-01 19:57:53 +08:00
chenxu9741	a9cd6df97e	Remove tts (blocking call) (#6869 )	2024-08-01 14:50:22 +08:00
呆萌闷油瓶	f31142e758	Azure 4o mini options (#6873 )	2024-08-01 14:04:18 +08:00
crazywoola	792f908afb	Revert "feat:Azure gpt4o mini" (#6870 )	2024-08-01 13:32:03 +08:00
呆萌闷油瓶	14367ddc09	feat:Azure gpt4o mini (#6866 )	2024-08-01 13:03:08 +08:00
Charlie.Wei	cbf7f21ade	Add azure gpt4omini (#6862 ) Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM> Co-authored-by: crazywoola <427733928@qq.com> Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>	2024-08-01 12:57:52 +08:00
Weaxs	f6e8e120a1	support xinference tts (#6746 )	2024-08-01 11:59:15 +08:00
Joe	08f922d8c9	fix: anthropic max token NoneType error (#6858 )	2024-08-01 11:30:00 +08:00
小羽	56b43f62d1	feat: nvidia add llama3.1 model (#6844 )	2024-07-31 21:24:02 +08:00
Giga Group	4b410494b3	Add model parameter enable_enhance for hunyuan llm model (#6847 ) Co-authored-by: sun <sun@centen.cn>	2024-07-31 20:04:43 +08:00
Joe	df9bd36cab	fix: claude-3-5-sonnet-20240620 max token error (#6843 )	2024-07-31 18:34:44 +08:00
longzhihun	9ce5cea911	feat: bedrock invoke enhancement (#6808 )	2024-07-30 21:57:18 +08:00
SiliconFlow, Inc	3e18d32ce5	add deepseek-coder-v2 in siliconflow (#6149 )	2024-07-29 18:45:19 +08:00
Charles	94d68b6a08	upgrade deepseek params (#6744 )	2024-07-29 18:31:56 +08:00
Giga Group	c9ff0e3961	Add model hunyuan-embedding (#6657 ) Co-authored-by: sun <sun@centen.cn>	2024-07-29 18:30:52 +08:00
Bowen Liang	20268708cc	chore: improve position map conversion and tolerate empty position yaml file (#6541 )	2024-07-29 10:32:11 +08:00
-LAN-	83af50368f	fix(api/core/model_runtime/model_providers/azure_openai/llm/llm.py): Try to skip if `delta.delta` is None. (#6727 ) Signed-off-by: -LAN- <laipz8200@outlook.com>	2024-07-27 00:05:21 +08:00
Joe	e4542215cc	fix: tongyi empty tool_calls is not supported in message (#6719 )	2024-07-26 18:10:13 +08:00
Jason	3d3677e912	Feat/model provider novita (#6717 ) Co-authored-by: takatost <takatost@gmail.com>	2024-07-26 17:37:21 +08:00
chenxu9741	6b50bb0fe6	issues #6655 Open ai tts issues (#6696 )	2024-07-26 14:55:49 +08:00
longzhihun	c5ac004f15	[seanguo] fix: unsupported filename in windows & add Mistral Large 2 (#6679 )	2024-07-25 19:26:46 +08:00
RookieAgent	78a339a794	modify llama3-1 yaml filename to support Windows pull operations (#6677 )	2024-07-25 18:58:55 +08:00
Giga Group	ca696fe94c	Add support of tool-call for model provider "hunyuan" (#6656 ) Co-authored-by: sun <sun@centen.cn>	2024-07-25 11:27:58 +08:00
longzhihun	9815aab7a3	[seanguo] feat: add llama 3.1 support in bedrock (#6645 )	2024-07-25 11:20:37 +08:00
zhangzhiqiangcs	d4c55748f1	doc: fix about model features (#6619 )	2024-07-24 19:12:10 +08:00
dufei	5af2df0cd5	fix: qwen fc error (#6620 ) Co-authored-by: dufei <du_fei@venusgroup.com.cn>	2024-07-24 16:56:06 +08:00
takatost	4c85393a1d	feat: add GroqCloud llama3.1 series models support (#6596 )	2024-07-24 00:41:58 +08:00
sino	d5c2680fde	feat: support llama3.1 series models for openrouter provider (#6595 )	2024-07-24 00:37:48 +08:00
Joe	8123a00e97	feat: update prompt generate (#6516 )	2024-07-23 19:52:14 +08:00
Lance Mao	7c55c39085	feat: add tencent asr (#6091 )	2024-07-23 16:38:39 +08:00
-LAN-	5e6fc58db3	Feat/environment variables in workflow (#6515 ) Co-authored-by: JzoNg <jzongcode@gmail.com>	2024-07-22 15:29:39 +08:00
sino	4f9f175f25	fix: correct gpt-4o-mini max token (#6472 ) Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>	2024-07-19 18:24:58 +08:00
sino	9e168f9d1c	feat: support gpt-4o-mini for openrouter provider (#6447 )	2024-07-19 13:09:41 +08:00
Weaxs	ea45496a74	update ernie models (#6454 )	2024-07-19 13:08:39 +08:00
Richards Tu	8e49146a35	[EMERGENCY] Fix Anthropic header issue (#6445 )	2024-07-19 07:38:15 +08:00
takatost	dad3fd2dc1	feat: add gpt-4o-mini (#6442 )	2024-07-19 01:53:43 +08:00
ybalbert001	4a026fa352	Enhancement: add model provider - Amazon Sagemaker (#6255 ) Co-authored-by: Yuanbo Li <ybalbert@amazon.com> Co-authored-by: crazywoola <427733928@qq.com>	2024-07-18 19:32:31 +08:00
themanforfree	ba181197c2	feat: api_key support for xinference (#6417 ) Signed-off-by: themanforfree <themanforfree@gmail.com>	2024-07-18 18:58:46 +08:00
forrestlinfeng	3b5b548af3	Add Stepfun LLM Support (#6346 )	2024-07-18 07:47:18 +08:00
Richards Tu	4782fb50c4	Support new Claude-3.5 Sonnet max token limit (#6335 )	2024-07-18 07:47:06 +08:00
xielong	f3f052ba36	fix: rename model from ernie-4.0-8k-Latest to ernie-4.0-8k-latest (#6383 )	2024-07-17 19:07:47 +08:00
longzhihun	ed9e692263	feat: bedrock model runtime enhancement (#6299 )	2024-07-16 15:54:39 +08:00
呆萌闷油瓶	d66d7146a3	chore:update azure GA version 2024-06-01 (#6307 )	2024-07-16 10:32:18 +08:00
Onelevenvy	b47fa27a35	fix: zhipuai validate error when user's api key not support for chatglm_turbo in issue #6289 (#6290 )	2024-07-15 19:27:18 +08:00
thibautleaux-kreactive	96c171805a	Update bedrock.yaml (#6281 )	2024-07-15 16:53:03 +08:00
Benjamin	ec181649ae	Update model provider configuration for Triton Inference Server and X… (#6274 )	2024-07-15 15:07:28 +08:00
Waffle	07add06c59	Feat/add zhipu CogView 3 tool (#6210 )	2024-07-13 17:39:17 +08:00
Little 羊	7c2c949f01	Update ernie_bot.py (#6236 )	2024-07-12 19:54:53 +08:00
耐小心	d7a6f25c63	fix: differentiate prompts fields based on function_calling_type (#5880 )	2024-07-12 11:07:38 +08:00
crazywoola	ee3936916f	upgrade deepseek params (#6215 )	2024-07-12 10:55:44 +08:00
Little 羊	2f064c68bc	Create ernie-4.0-turbo-8k-preview (#6132 )	2024-07-11 20:20:07 +08:00
Su Yang	215661ef91	feat: add PerfXCloud, Qwen series #6116 (#6117 )	2024-07-10 18:26:10 +08:00
chenxu9741	6ef401a9f0	feat:add tts-streaming config and future (#5492 )	2024-07-09 11:33:58 +08:00
sino	85744b72e5	feat: support moonshot and glm base models for volcengine provider (#6029 )	2024-07-07 01:17:33 +08:00
Masashi Tomooka	3b23d6764f	fix: token count includes base64 string of input images (#5868 )	2024-07-06 16:53:32 +08:00
-LAN-	4d105d7bd7	feat(*): Swtich to dify_config. (#6025 )	2024-07-06 12:05:13 +08:00
orangeclk	f8aaa57f31	feat: add retry mechanism for zhipuai (#5926 )	2024-07-05 10:49:18 +08:00
-LAN-	d7f75d17cc	Chore/remove-unused-code (#5917 )	2024-07-04 18:18:26 +08:00
longzhihun	aecdfa2d5c	feat: add claude3 function calling (#5889 )	2024-07-03 22:21:02 +08:00
longzhihun	fdfbbde10d	[seanguo] modify bedrock Claude3 invoke method to converse API (#5768 ) Co-authored-by: Chenhe Gu <guchenhe@gmail.com>	2024-07-01 04:36:13 +08:00
takatost	0bf4817474	fix: _convert_prompt_message_to_dict parameters err (#5716 )	2024-06-28 21:00:00 +08:00
呆萌闷油瓶	68ac433218	feat: add support Spark4.0 (#5688 )	2024-06-28 17:39:11 +08:00
Kevin	b3d6726f65	Feature/add qwen llm (#5659 )	2024-06-28 11:06:29 +08:00
liuzhenghua	2b080b5cfc	feature: Add presence_penalty and frequency_penalty parameters to the … (#5637 ) Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com>	2024-06-28 00:27:20 +08:00
takatost	3ccad33194	feat: add jina new pre-defined rerankers, include: jina-reranker-v2 (#5657 )	2024-06-27 13:45:35 +08:00
sunxichen	bafc8a0bde	fix: tool call message role according to credentials (#5625 ) Co-authored-by: sunxichen <sun.xc@digitalcnzz.com>	2024-06-27 12:35:27 +08:00
Bowen Liang	dcb72e0067	chore: apply flake8-comprehensions Ruff rules to improve collection comprehensions (#5652 ) Co-authored-by: -LAN- <laipz8200@outlook.com>	2024-06-27 11:21:31 +08:00
Joe	4e2de638af	feat: add ops trace (#5483 ) Co-authored-by: takatost <takatost@gmail.com>	2024-06-26 17:33:29 +08:00
sino	877a2c144b	feat: support predefined models for openrouter (#5494 )	2024-06-24 16:31:53 +08:00
-LAN-	ba67206bb9	fix(api/model_runtime/azure/llm): Switch to tool_call. (#5541 )	2024-06-24 15:35:21 +08:00
vccler	48757e581e	fix: zhipu tool calling, this PR fixes the bug described in issue #5496 (#5469 ) Co-authored-by: vccler <vccler@163.com> Co-authored-by: -LAN- <laipz8200@outlook.com>	2024-06-22 12:41:24 +08:00
LXM	e8ad0339a3	fix: tongyi json output (#5396 )	2024-06-22 12:25:23 +08:00
crazywoola	91d38a535f	fix: max_tokens of qwen-plus & qwen-plus-chat (#5480 )	2024-06-21 16:49:33 +08:00
Pan, Wen-Ming	95c882934e	feat: add support for Vertex AI claude-3-5-sonnet@20240620 (#5475 ) Co-authored-by: Wenming Pan <pwm@google.com>	2024-06-21 16:45:56 +08:00
Su Yang	26b6fd2236	feat: add support for bedrock claude-3-5-sonnet-20240620 (#5461 )	2024-06-21 10:21:35 +08:00
takatost	ff0f02d809	feat: add support for claude-3-5-sonnet-20240620 (#5452 )	2024-06-21 00:23:15 +08:00
-LAN-	142dc0afd7	refactor: Remove unused code in large_language_model.py (#5433 )	2024-06-20 16:20:40 +08:00
-LAN-	23fa3dedc4	fix(core): Fix incorrect type hints. (#5427 )	2024-06-20 15:16:21 +08:00
Ikko Eltociear Ashimine	8266842809	chore: update llm.py (#5335 )	2024-06-18 09:29:14 +08:00
Richards Tu	c163521b9e	Update and fix the model param of Deepseek (#5329 )	2024-06-17 21:40:04 +08:00
Justin Wu	61f4f08744	Add bedrock command r models (#4521 ) Co-authored-by: Justin Wu <justin.wu@ringcentral.com> Co-authored-by: Chenhe Gu <guchenhe@gmail.com>	2024-06-17 20:37:46 +08:00
-LAN-	5a99aeb864	fix(core): Reorder `field_validator` and `classmethod` to fit Pydantic V2. (#5257 )	2024-06-17 10:04:28 +08:00
crazywoola	9a64aa76c1	fix: typo and check (#5287 )	2024-06-17 09:15:43 +08:00
Pan, Wen-Ming	4b54843ed7	fix: run agent with Vertex AI Gemini models (#5260 ) Co-authored-by: Wenming Pan <pwm@google.com>	2024-06-16 09:36:31 +08:00
kurokobo	2e842333b1	fix: correct typos in the icons for microsoft (#5243 )	2024-06-15 21:02:47 +08:00
Masashi Tomooka	d9bee03ff6	fix: embedding job fails using IAM role (#5252 )	2024-06-15 18:57:54 +08:00
Jyong	ba5f8afaa8	Feat/firecrawl data source (#5232 ) Co-authored-by: Nicolas <nicolascamara29@gmail.com> Co-authored-by: chenhe <guchenhe@gmail.com> Co-authored-by: takatost <takatost@gmail.com>	2024-06-15 02:46:02 +08:00
Bin	0f35d07052	support ERNIE-4.0-8K-Latest (#5216 )	2024-06-14 18:45:24 +08:00
-LAN-	7f44e88eda	fix(model_providers/ollama): Fix OllamaLargeLanguageModel to correctly set the stop option (#5217 )	2024-06-14 18:26:14 +08:00
Jason	b7ff765d8d	Add novita.ai as model provider (#4961 )	2024-06-14 18:23:06 +08:00
Masashi Tomooka	0633aae7dc	feat: allow to use IAM Role for Bedrock (#5188 )	2024-06-14 15:18:42 +08:00
takatost	415022aa14	fix: pydantic2 error (#5172 )	2024-06-14 03:05:04 +08:00

734 Commits