dify/api/core/model_runtime/model_providers
2024-11-30 17:30:55 +08:00
..
__base feat: Allow using file variables directly in the LLM node and support more file types. (#10679) 2024-11-22 16:30:22 +08:00
anthropic feat: Allow to contains files in the system prompt even model not support. (#11111) 2024-11-26 13:45:49 +08:00
azure_ai_studio chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 2024-11-15 15:41:40 +08:00
azure_openai fix: fix azure open-4o-08-06 when enable json schema cant process content = "" (#11204) 2024-11-29 17:26:07 +08:00
baichuan refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
bedrock Fixing #11005: Incorrect max_tokens in yaml file for AWS Bedrock US Cross Region Inference version of 3.5 Sonnet v2 and 3.5 Haiku (#11013) 2024-11-23 23:46:25 +08:00
chatglm chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
cohere fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 2024-11-21 10:34:43 +08:00
deepseek Fix Deepseek Function/Tool Calling (#11023) 2024-11-25 11:03:53 +08:00
fireworks refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
fishaudio fix: fish audio wrong validate credentials interface (#11019) 2024-11-23 23:39:41 +08:00
gitee_ai fix: gitee ai wrong default model, and better para (#11168) 2024-11-27 17:27:11 +08:00
google fix: use Gemini response metadata for token counting (#11226) 2024-11-30 17:30:55 +08:00
gpustack fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
groq Added Llama 3.2 Vision Models Speech2Text Models for Groq (#9479) 2024-10-18 18:10:33 +08:00
huggingface_hub refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
huggingface_tei feat: Add support for TEI API key authentication (#11006) 2024-11-23 23:55:35 +08:00
hunyuan fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 2024-10-30 15:43:29 +08:00
jina fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
leptonai chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
localai chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
minimax add abab7-chat-preview model (#10654) 2024-11-13 19:30:42 +08:00
mistralai add MixtralAI Model (#8517) 2024-09-21 18:08:07 +08:00
mixedbread refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
moonshot fix: moonshot response_format raise error (#9847) 2024-10-25 14:59:55 +08:00
nomic refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
novita chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
nvidia refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
nvidia_nim chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
oci refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
ollama fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
openai Add TTS to OpenAI_API_Compatible (#11071) 2024-11-26 15:14:02 +08:00
openai_api_compatible Add TTS to OpenAI_API_Compatible (#11071) 2024-11-26 15:14:02 +08:00
openllm fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 2024-11-21 10:34:43 +08:00
openrouter Support streaming output for OpenAI o1-preview and o1-mini (#10890) 2024-11-20 15:10:41 +08:00
perfxcloud fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
replicate refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
sagemaker chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 2024-11-15 15:41:40 +08:00
siliconflow ISSUE=11042: add tts model in siliconflow (#11043) 2024-11-25 11:04:13 +08:00
spark fix:Spark's large language model token calculation error #7911 (#8755) 2024-09-25 14:51:42 +08:00
stepfun chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
tencent chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
togetherai chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
tongyi feat: support LLM process document file (#10966) 2024-11-22 19:32:44 +08:00
triton_inference_server chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
upstage refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
vertex_ai fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
vessl_ai fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) 2024-11-11 08:38:26 +08:00
volcengine_maas fix: default max_chunks set to 1 as other providers (#10937) 2024-11-21 16:36:05 +08:00
voyage fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
wenxin add llm: ernie-4.0-turbo-128k of wenxin (#10135) 2024-10-31 21:49:04 +08:00
x Add grok-vision-beta to xAI + Update grok-beta Features (#11004) 2024-11-25 20:53:03 +08:00
xinference fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 2024-11-24 15:29:30 +08:00
yi feat: add yi custom llm intergration (#9482) 2024-10-18 17:23:21 +08:00
zhinao chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
zhipuai fix: drop useless and wrong code for zhipu embedding (#11069) 2024-11-25 19:50:23 +08:00
__init__.py Model Runtime (#1858) 2024-01-02 23:42:00 +08:00
_position.yaml feat: add voyage ai as a new model provider (#8747) 2024-09-29 16:55:59 +08:00
model_provider_factory.py feat: support pinning, including, and excluding for model providers and tools (#7419) 2024-08-21 11:16:43 +08:00