dify/api/core/model_runtime/model_providers
2024-12-17 17:42:21 +08:00
..
__base feat: Allow using file variables directly in the LLM node and support more file types. (#10679) 2024-11-22 16:30:22 +08:00
anthropic chore: the consistency of MultiModalPromptMessageContent (#11721) 2024-12-17 15:01:38 +08:00
azure_ai_studio chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 2024-11-15 15:41:40 +08:00
azure_openai chore:update azure api version (#11711) 2024-12-17 13:39:56 +08:00
baichuan fix: volcengine_maas and baichuan message error (#11625) 2024-12-16 13:05:27 +08:00
bedrock [ref] use one method to get boto client for aws bedrock (#11506) 2024-12-12 13:56:52 +08:00
chatglm chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
cohere FEAT: cohere rerank 3.5 model added (#11289) 2024-12-06 09:58:55 +08:00
deepseek fix: deepseek reports an error when using Response Format #11677 (#11678) 2024-12-16 12:58:03 +08:00
fireworks refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
fishaudio fix: fish audio wrong validate credentials interface (#11019) 2024-11-23 23:39:41 +08:00
gitee_ai feat: add gitee ai vl models (#11697) 2024-12-16 18:45:26 +08:00
google chore: improve gemini models (#11745) 2024-12-17 17:42:21 +08:00
gpustack fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
groq fix: name of llama-3.3-70b-specdec (#11596) 2024-12-12 16:33:49 +08:00
huggingface_hub refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
huggingface_tei feat: Add support for TEI API key authentication (#11006) 2024-11-23 23:55:35 +08:00
hunyuan fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 2024-10-30 15:43:29 +08:00
jina fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
leptonai chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
localai chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
minimax fix: add the missing abab6.5t-chat model of Minimax (#11484) 2024-12-09 17:59:20 +08:00
mistralai [Pixtral] Add new model ; add vision (#11231) 2024-12-11 10:14:16 +08:00
mixedbread refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
moonshot fix: use removeprefix() instead of lstrip() to remove the data: prefix (#11272) 2024-12-03 09:16:25 +08:00
nomic refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
novita chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
nvidia refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
nvidia_nim chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
oci refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
ollama feat: support json_schema for ollama models (#11449) 2024-12-08 08:36:12 +08:00
openai feat: enhance gemini models (#11497) 2024-12-17 12:05:13 +08:00
openai_api_compatible fix: better error message for stream (#11635) 2024-12-15 17:16:04 +08:00
openllm fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 2024-11-21 10:34:43 +08:00
openrouter Support streaming output for OpenAI o1-preview and o1-mini (#10890) 2024-11-20 15:10:41 +08:00
perfxcloud fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
replicate refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
sagemaker chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 2024-11-15 15:41:40 +08:00
siliconflow feat: add siliconflow qwq and llama3.3 model (#11492) 2024-12-10 08:49:45 +08:00
spark fix:Spark's large language model token calculation error #7911 (#8755) 2024-09-25 14:51:42 +08:00
stepfun fix: use removeprefix() instead of lstrip() to remove the data: prefix (#11272) 2024-12-03 09:16:25 +08:00
tencent chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
togetherai chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
tongyi chore: the consistency of MultiModalPromptMessageContent (#11721) 2024-12-17 15:01:38 +08:00
triton_inference_server chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
upstage refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
vertex_ai feat(model): add vertex_ai Gemini 2.0 Flash Exp (#11604) 2024-12-12 20:20:49 +08:00
vessl_ai fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) 2024-11-11 08:38:26 +08:00
volcengine_maas feat(ark): support doubao vision series models (#11740) 2024-12-17 15:43:11 +08:00
voyage fix: int None will cause error for context size (#11055) 2024-11-25 21:04:16 +08:00
wenxin fix: better wenxin rerank handler, close #11252 (#11283) 2024-12-03 13:57:16 +08:00
x feat: add grok-2-1212 and grok-2-vision-1212 (#11672) 2024-12-15 21:18:24 +08:00
xinference fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 2024-11-24 15:29:30 +08:00
yi feat: add yi custom llm intergration (#9482) 2024-10-18 17:23:21 +08:00
zhinao chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
zhipuai feat: add zhipu glm_4v_flash (#11440) 2024-12-07 22:27:57 +08:00
__init__.py Model Runtime (#1858) 2024-01-02 23:42:00 +08:00
_position.yaml feat: add voyage ai as a new model provider (#8747) 2024-09-29 16:55:59 +08:00
model_provider_factory.py feat: support pinning, including, and excluding for model providers and tools (#7419) 2024-08-21 11:16:43 +08:00