dify/api/core/model_runtime/model_providers
yihong 7e154a467b
fix: better error message for stream (#11635)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-15 17:16:04 +08:00
..
__base feat: Allow using file variables directly in the LLM node and support more file types. (#10679) 2024-11-22 16:30:22 +08:00
anthropic fix: claude can not handle empty string (#11238) 2024-12-02 16:00:40 +08:00
azure_ai_studio chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 2024-11-15 15:41:40 +08:00
azure_openai fix: fix azure open-4o-08-06 when enable json schema cant process content = "" (#11204) 2024-11-29 17:26:07 +08:00
baichuan refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
bedrock [ref] use one method to get boto client for aws bedrock (#11506) 2024-12-12 13:56:52 +08:00
chatglm chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
cohere FEAT: cohere rerank 3.5 model added (#11289) 2024-12-06 09:58:55 +08:00
deepseek Fix Deepseek Function/Tool Calling (#11023) 2024-11-25 11:03:53 +08:00
fireworks refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
fishaudio fix: fish audio wrong validate credentials interface (#11019) 2024-11-23 23:39:41 +08:00
gitee_ai fix: gitee ai wrong default model, and better para (#11168) 2024-11-27 17:27:11 +08:00
google feat: add gemini-2.0-flash-exp (#11570) 2024-12-12 09:33:39 +08:00
gpustack fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
groq fix: name of llama-3.3-70b-specdec (#11596) 2024-12-12 16:33:49 +08:00
huggingface_hub refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
huggingface_tei feat: Add support for TEI API key authentication (#11006) 2024-11-23 23:55:35 +08:00
hunyuan fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 2024-10-30 15:43:29 +08:00
jina fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
leptonai chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
localai chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
minimax fix: add the missing abab6.5t-chat model of Minimax (#11484) 2024-12-09 17:59:20 +08:00
mistralai [Pixtral] Add new model ; add vision (#11231) 2024-12-11 10:14:16 +08:00
mixedbread refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
moonshot fix: use removeprefix() instead of lstrip() to remove the data: prefix (#11272) 2024-12-03 09:16:25 +08:00
nomic refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
novita chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
nvidia refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
nvidia_nim chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
oci refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
ollama feat: support json_schema for ollama models (#11449) 2024-12-08 08:36:12 +08:00
openai Add TTS to OpenAI_API_Compatible (#11071) 2024-11-26 15:14:02 +08:00
openai_api_compatible fix: better error message for stream (#11635) 2024-12-15 17:16:04 +08:00
openllm fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 2024-11-21 10:34:43 +08:00
openrouter Support streaming output for OpenAI o1-preview and o1-mini (#10890) 2024-11-20 15:10:41 +08:00
perfxcloud fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
replicate refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
sagemaker chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 2024-11-15 15:41:40 +08:00
siliconflow feat: add siliconflow qwq and llama3.3 model (#11492) 2024-12-10 08:49:45 +08:00
spark fix:Spark's large language model token calculation error #7911 (#8755) 2024-09-25 14:51:42 +08:00
stepfun fix: use removeprefix() instead of lstrip() to remove the data: prefix (#11272) 2024-12-03 09:16:25 +08:00
tencent chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
togetherai chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
tongyi fix: Remove duplicate 'response_format' parameter from model YAML files (#11531) 2024-12-11 10:10:53 +08:00
triton_inference_server chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
upstage refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
vertex_ai feat(model): add vertex_ai Gemini 2.0 Flash Exp (#11604) 2024-12-12 20:20:49 +08:00
vessl_ai fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) 2024-11-11 08:38:26 +08:00
volcengine_maas chore(lint): sort __all__ definitions (#11243) 2024-12-03 13:26:33 +08:00
voyage fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
wenxin fix: better wenxin rerank handler, close #11252 (#11283) 2024-12-03 13:57:16 +08:00
x Add grok-vision-beta to xAI + Update grok-beta Features (#11004) 2024-11-25 20:53:03 +08:00
xinference fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 2024-11-24 15:29:30 +08:00
yi feat: add yi custom llm intergration (#9482) 2024-10-18 17:23:21 +08:00
zhinao chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
zhipuai feat: add zhipu glm_4v_flash (#11440) 2024-12-07 22:27:57 +08:00
__init__.py Model Runtime (#1858) 2024-01-02 23:42:00 +08:00
_position.yaml feat: add voyage ai as a new model provider (#8747) 2024-09-29 16:55:59 +08:00
model_provider_factory.py feat: support pinning, including, and excluding for model providers and tools (#7419) 2024-08-21 11:16:43 +08:00