Commit Graph

275 Commits

Author SHA1 Message Date
Yeuoly
c03adcb154
Fix: style checks and unittests (#12603) 2025-01-10 16:40:39 +08:00
Yeuoly
17babca362
Introducing: Plugin Mechanism (#12553) 2025-01-09 19:54:17 +08:00
zxhlyh
3c014f3ae5
Feat/plugins (#12547)
Co-authored-by: AkaraChen <akarachen@outlook.com>
Co-authored-by: Yi <yxiaoisme@gmail.com>
Co-authored-by: Joel <iamjoel007@gmail.com>
Co-authored-by: JzoNg <jzongcode@gmail.com>
Co-authored-by: twwu <twwu@dify.ai>
Co-authored-by: kurokobo <kuro664@gmail.com>
Co-authored-by: Hiroshi Fujita <fujita-h@users.noreply.github.com>
2025-01-09 18:47:41 +08:00
Yeuoly
f748d6c7c4 fix: mypy issues 2025-01-09 16:53:30 +08:00
Yeuoly
fb309462ad Merge branch 'main' into fix/chore-fix 2025-01-08 20:36:22 +08:00
YoungLH
040a3b782c
FEAT: support milvus to full text search (#11430)
Signed-off-by: YoungLH <974840768@qq.com>
2025-01-08 17:39:53 +08:00
Yingchun Lai
53bb37b749
fix: fix the incorrect plaintext file key when saving (#10429) 2025-01-08 12:52:45 +08:00
Hiroshi Fujita
d2586278d6
Feat elasticsearch japanese (#12194) 2025-01-08 12:35:41 +08:00
Jyong
05bda6f38d
add tidb on qdrant redis lock (#12462) 2025-01-08 08:55:44 +08:00
huangzhuo1949
70698024f5
fix: empty delete bug (#12339)
Co-authored-by: huangzhuo <huangzhuo1@xiaomi.com>
2025-01-03 20:46:39 +08:00
Jyong
b873e6349c
add child chunk preview number limit (#12309) 2025-01-03 16:14:27 +08:00
Yeuoly
107e44c8fb Merge branch 'main' into fix/chore-fix 2024-12-31 16:47:56 +08:00
-LAN-
8d15c8cfbf
fix: improve error handling in NotionExtractor data fetching (#12182)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-12-29 11:53:09 +08:00
Yeuoly
78664c8903 Merge branch 'main' into fix/chore-fix 2024-12-27 17:33:58 +08:00
-LAN-
dae1b5a619
fix: import jieba.analyse (#12133)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-12-27 11:37:55 +08:00
Jyong
811e4bd0cf
fix unstructured setting (#12116) 2024-12-26 12:08:36 +08:00
Jyong
84ac004772
py lint (#12102)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-12-26 00:16:35 +08:00
Jyong
9231fdbf4c
Feat/support parent child chunk (#12092) 2024-12-25 19:49:07 +08:00
Yeuoly
4199998c7e Merge branch 'main' into fix/chore-fix 2024-12-24 21:28:56 +08:00
yihong
56e15d09a9
feat: mypy for all type check (#10921) 2024-12-24 18:38:51 +08:00
-LAN-
599d410d99
fix: validate reranking model attributes before processing (#11930)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-12-21 21:23:12 +08:00
-LAN-
8c559d6231
fix(retrieval_service): avoid to use exception (#11925)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-12-21 21:19:46 +08:00
yihong
7b03a0316d
fix: better memory usage from 800+ to 500+ (#11796)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-20 14:51:43 +08:00
yihong
463fbe2680
fix: better gard nan value from numpy for issue #11827 (#11864)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-20 09:28:32 +08:00
yihong
5a8a901560
fix: float values are not json for nan value close #11827 (#11840)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-19 20:50:20 +08:00
Jiang
ad17ff9a92
Lindorm vdb bug-fix (#11790)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
2024-12-18 15:19:20 +08:00
Yeuoly
75fe785d88
Merge branch 'main' into fix/chore-fix 2024-12-16 14:08:18 +08:00
Bowen Liang
924b4fe742
test: run vdb tests on TiDB Vector with docker in CI tests (#11645) 2024-12-15 17:16:40 +08:00
yihong
22258fb0bf
fix: filter bug for keywork cause code can not reach (#11666)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-15 17:12:06 +08:00
yihong
36cb25b341
fix: support mdx files close #11557 (#11565)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-12 13:37:56 +08:00
Jiang
0d04cdc323
Lindorm vdb (#11574)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
2024-12-12 09:43:27 +08:00
Yeuoly
16b49ac436
Merge branch 'main' into fix/chore-fix 2024-12-09 16:08:19 +08:00
Jyong
9b7adcd4d9
update tidb batch get endpoint to basic mode (#11426) 2024-12-06 17:06:46 +08:00
Yeuoly
337eff2b79
Merge branch 'main' into fix/chore-fix 2024-12-06 16:45:25 +08:00
Jyong
d7c1f43b49
fix tidb full-text-search vector missed (#11337) 2024-12-04 16:13:23 +08:00
Yeuoly
86291c13e4
Merge branch 'main' into fix/chore-fix 2024-12-04 15:34:39 +08:00
Jyong
c58d2fce89
roll back rerank topn setting (#11297) 2024-12-03 17:34:56 +08:00
yihong
e686f12317
fix: better handle error (#11265)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-12-03 09:15:38 +08:00
-LAN-
9601102885
fix(word_extractor): Fix type error and remove stream in ssrf_proxy (#11241)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-12-02 10:24:03 +08:00
Cling_o3
f9c2aa7689
feat: add retireval_top_n to config in env (#11132) 2024-11-30 11:14:45 +08:00
kazuya-awano
2d6865d421
Ensure consistent float type for cached embedding return values (#10185) 2024-11-29 09:18:41 +08:00
yihong
d7160ee563
fix: typo in upstashVector if id is always true, also fix some type hint (#11183)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-11-28 14:05:25 +08:00
-LAN-
9789905a1f
chore(*): Removes debugging print statements (#11145)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2024-11-26 22:03:19 +08:00
Yeuoly
db726e02a0
feat: support multi token count 2024-11-26 18:59:03 +08:00
Yeuoly
5ff9cee326
Merge branch 'main' into fix/chore-fix 2024-11-25 15:37:19 +08:00
Bowen Liang
6c8e208ef3
chore: bump minimum supported Python version to 3.11 (#10386) 2024-11-24 13:28:46 +08:00
yihong
ed55de888a
fix: rules should not be None for in (#10977)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-11-22 23:04:20 +08:00
AkisAya
cb0c55daa7
fix weight rerank of knowledge retrieval (#10931) 2024-11-21 17:53:20 +08:00
Yeuoly
24734009b9
Merge branch 'main' into fix/chore-fix 2024-11-21 13:52:28 +08:00
yihong
58a9d9eb9a
fix: better WeightRerankRunner run logic use O(1) and delete unused code (#10849)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2024-11-19 20:12:13 +08:00