240 Commits (provider-gpustack)

Author SHA1 Message Date
huangzhuo1949 70698024f5
fix: empty delete bug (#12339)
Co-authored-by: huangzhuo <huangzhuo1@xiaomi.com>
1 year ago
Jyong b873e6349c
add child chunk preview number limit (#12309)
1 year ago
-LAN- 8d15c8cfbf
fix: improve error handling in NotionExtractor data fetching (#12182)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
-LAN- dae1b5a619
fix: import jieba.analyse (#12133)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
Jyong 811e4bd0cf
fix unstructured setting (#12116)
1 year ago
Jyong 84ac004772
py lint (#12102)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
1 year ago
Jyong 9231fdbf4c
Feat/support parent child chunk (#12092)
1 year ago
yihong 56e15d09a9
feat: mypy for all type check (#10921)
1 year ago
-LAN- 599d410d99
fix: validate reranking model attributes before processing (#11930)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
-LAN- 8c559d6231
fix(retrieval_service): avoid to use exception (#11925)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
yihong 7b03a0316d
fix: better memory usage from 800+ to 500+ (#11796)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 463fbe2680
fix: better gard nan value from numpy for issue #11827 (#11864)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 5a8a901560
fix: float values are not json for nan value close #11827 (#11840)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Jiang ad17ff9a92
Lindorm vdb bug-fix (#11790)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
1 year ago
Bowen Liang 924b4fe742
test: run vdb tests on TiDB Vector with docker in CI tests (#11645)
1 year ago
yihong 22258fb0bf
fix: filter bug for keywork cause code can not reach (#11666)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
yihong 36cb25b341
fix: support mdx files close #11557 (#11565)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Jiang 0d04cdc323
Lindorm vdb (#11574)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
1 year ago
Jyong 9b7adcd4d9
update tidb batch get endpoint to basic mode (#11426)
1 year ago
Jyong d7c1f43b49
fix tidb full-text-search vector missed (#11337)
1 year ago
Jyong c58d2fce89
roll back rerank topn setting (#11297)
1 year ago
yihong e686f12317
fix: better handle error (#11265)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
-LAN- 9601102885
fix(word_extractor): Fix type error and remove stream in ssrf_proxy (#11241)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
Cling_o3 f9c2aa7689
feat: add retireval_top_n to config in env (#11132)
1 year ago
kazuya-awano 2d6865d421
Ensure consistent float type for cached embedding return values (#10185)
1 year ago
yihong d7160ee563
fix: typo in upstashVector if id is always true, also fix some type hint (#11183)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
-LAN- 9789905a1f
chore(*): Removes debugging print statements (#11145)
Signed-off-by: -LAN- <laipz8200@outlook.com>
1 year ago
Bowen Liang 6c8e208ef3
chore: bump minimum supported Python version to 3.11 (#10386)
1 year ago
yihong ed55de888a
fix: rules should not be None for in (#10977)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
AkisAya cb0c55daa7
fix weight rerank of knowledge retrieval (#10931)
1 year ago
yihong 58a9d9eb9a
fix: better WeightRerankRunner run logic use O(1) and delete unused code (#10849)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
1 year ago
Zane 14f3d44c37
refactor: improve handling of leading punctuation removal (#10761)
1 year ago
8bitpd 873e9720e9
feat: AnalyticDB vector store supports invocation via SQL. (#10802)
Co-authored-by: 璟义 <yangshangpo.ysp@alibaba-inc.com>
1 year ago
Bowen Liang 51db59622c
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425)
1 year ago
Jyong 0b2d51d859
add the index field for elasticsearch (#10592)
1 year ago
-LAN- a1543b7da0
fix(extractor): temporary file (#10543)
1 year ago
Leo.Wang c9f785e00f
Feat/tools/gitlab (#10407)
1 year ago
Bowen Liang 574c4a264f
chore(lint): Use logging.exception instead of logging.error (#10415)
1 year ago
Jyong 1024fc623e
fix the ssrf of docx file extractor external images (#10237)
1 year ago
Jiang 0c9e79cd67
Add Lindorm as a VDB choice (#10202)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
1 year ago
Shili Cao b61baa87ec
fix: avoid unexpected error when create knowledge base with baidu vector database and wenxin embedding model (#10130)
1 year ago
Jyong dad041c49f
fix issue: query is none when doing retrieval (#10129)
1 year ago
omr 11ca1bec0b
fix: optimize unique document filtering with set (#10082)
1 year ago
zhuhao 7433095240
chore: use dify_config.TIDB_SPEND_LIMIT instead of constant value (#10038)
1 year ago
Jyong 9ebd453b87
add rerank check when doing mutil-retrieval (#9998)
1 year ago
powerfool 878d13ef42
Added OceanBase as an option for the vector store in Dify (#10010)
1 year ago
Jyong 5580bcf870
add tidb spend limit config (#9999)
1 year ago
roadgoat19 c8ef9223e5
feat: couchbase integration (#6165)
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: Elliot Scribner <elliot.scribner@couchbase.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: Bowen Liang <bowenliang@apache.org>
1 year ago
Jyong f47177ecb4
add top_k for es full text search (#9963)
1 year ago
virgosoy 17cacf258e
fix: wrong element object (#9868)
1 year ago