376 Commits (20ca2033ceef98b025fa61b071230b54bebb4bb2)

Author SHA1 Message Date
uply23333 ab012fe1a2
fix: improve document filtering in full text search(elasticsearch) (#22683) 7 months ago
8bitpd 9251a66a10
fix: update analyticdb vector to do filter by metadata (#22698)
Co-authored-by: xiaozeyu <xiaozeyu.xzy@alibaba-inc.com>
7 months ago
znn ed263aed9f
fix text splitter (#22596) 7 months ago
-LAN- 460a825ef1
refactor: decouple Node and NodeData (#22581)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: QuantumGhost <obelisk.reg+git@gmail.com>
7 months ago
helojo e7d80bf7bf
Fix: the pict type picture was not processed in the docx (#19305)
Co-authored-by: zqgame <zqgame@zqgame.local>
7 months ago
yihong d2933c2bfe
fix: drop dead code phase2 unused class (#22042)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
7 months ago
wanttobeamaster bf7b2c339b
tablestore vector support more method (#22225)
Co-authored-by: xiaozhiqing.xzq <xiaozhiqing.xzq@alibaba-inc.com>
7 months ago
Jacky Wu 3e96c0c468
fix: close session before doing long latency operation (#22306) 7 months ago
luckylhb90 a371390d6c
optimize: batch embedding and qdrant write_consistency_factor parameter (#21776)
Co-authored-by: hobo.l <hobo.l@binance.com>
7 months ago
wlleiiwang 89b52471fb
Optimize the memory usage of Tencent Vector Database (#22079)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
7 months ago
baonudesifeizhai 1c7404099d
fix: prevent timeout in file encoding detection for large files (#21453)
Co-authored-by: crazywoola <427733928@qq.com>
7 months ago
efrey kong 826bf25abf
Fix: prevent SQL errors when metadata filter Constant value is None or blank (#21803) 7 months ago
Dongyu Li 00f0b569cc
Feat/kb index (#20868)
Co-authored-by: twwu <twwu@dify.ai>
7 months ago
Jin 3e7f8bad56
fix: markdown_extractor lost chunks if it starts without a header(#21308) (#21309) 8 months ago
LiuBo 17fe62cf91
feat: add support for Matrixone database (#20714) 8 months ago
NeatGuyCoding 9835730278
Translation fix (#21194) 8 months ago
NeatGuyCoding 2eae7503e1
Minor Improvements for File Validation and Configuration Handling #21179 (#21171)
Co-authored-by: tech <cto@sb>
8 months ago
Ademílson Tonato 9e73e8b9e8
feat: add search endpoint for Firecrawl Integration (#20521)
Co-authored-by: crazywoola <427733928@qq.com>
8 months ago
Rain Wang 47e0f92c0f
Fixes #20748 KnowledgeRetrievalNode return all external documents when reranker disabled even top-k configed (#20762) 8 months ago
kazuya-awano 45c89bd6de
feat: add pagenation to notion extractor (#20919) 8 months ago
kurokobo 4689e8953e
fix: shorten connection timeout to pypi.org for deprecation check for weaviate client (#21131) 8 months ago
Bowen Liang 366ddb05ae
test: run vdb test of oceanbase with docker compose in CI tests (#20945) 8 months ago
Bowen Liang 0f3d4d0b6e
chore: bump mypy to 1.16 (#20608) 8 months ago
QuantumGhost c439e82038
refactor(api): Decouple `ParameterExtractorNode` from `LLMNode` (#20843)
- Extract methods used by `ParameterExtractorNode` from `LLMNode` into a separate file.
- Convert `ParameterExtractorNode` into a subclass of `BaseNode`.
- Refactor code referencing the extracted methods to ensure functionality and clarity.
- Fixes the issue that `ParameterExtractorNode` returns error when executed.
- Fix relevant test cases.

Closes #20840.
8 months ago
yihong 65c7c01d90
fix: clean up two unreachable code (#20773)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
8 months ago
jefferyvvv 37c3283450
fix: opensearch vector search falls back to keyword search (#20723)
Co-authored-by: wenjun.gu <wenjun.gu@envision-energy.com>
8 months ago
jefferyvvv 4271602cfc
fix: opensearch metadata filtering returns empty (#20701)
Co-authored-by: wenjun.gu <wenjun.gu@envision-energy.com>
Co-authored-by: crazywoola <427733928@qq.com>
8 months ago
jefferyvvv 138ad6e8b3
fix: opensearch fulltext search with metadata filtering dsl error (#20702)
Co-authored-by: wenjun.gu <wenjun.gu@envision-energy.com>
8 months ago
kenwoodjw 01d500db14
fix: autocorrect everything in web (#20605)
Signed-off-by: kenwoodjw <blackxin55+@gmail.com>
8 months ago
zhaobingshuang 3f7aa38d77
fix: #20560 When elasticsearch is used as the vector database, the Retrieval Test fails to filter the data after setting the Score Threshold, and the score of the recalled results is empty (#20561) 8 months ago
Cheney Zhang b4b59148dc
check zilliz cloud of full-text search (#20519) 8 months ago
Dongyu Li 1ea4459d9f
update knowledge base api (#20426) 8 months ago
-LAN- a6ea15e63c
Refactor/message cycle manage and knowledge retrieval (#20460)
Signed-off-by: -LAN- <laipz8200@outlook.com>
8 months ago
yihong 5a991295e0
fix: drop some type fixme (#20344) 8 months ago
-LAN- 482e50aae9
Refactor/remove db from cycle manager (#20455)
Signed-off-by: -LAN- <laipz8200@outlook.com>
8 months ago
rouxiaomin 4c4887c5fc
feat(qdrant):add replication_factor when create collection in qdrant (#20133)
Co-authored-by: 刘敏 <min.liu@tongdun.net>
8 months ago
He Huang 6f48af2610
Refactor OpenSearch config to separate use_ssl and verify_certs flags (#20075)
Co-authored-by: he.huang <he.huang1@outlook.com>
Co-authored-by: crazywoola <427733928@qq.com>
9 months ago
wlleiiwang 7d230acf40
tencent vectordb compatible with version 1.1.3 and below (#20056)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
9 months ago
-LAN- 3196dc2d61
refactor: Use typed SQLAlchemy base model and fix type errors (#19980)
Signed-off-by: -LAN- <laipz8200@outlook.com>
9 months ago
Darlewo 8cb3b4aef2
fix: multiple retrieve reranking_enabled switch (#19958) 9 months ago
Amir Mohsen Asaran c9ee60e197
Feat(WaterCrawl error handling): add custom exceptions and error handling (#19948) 9 months ago
-LAN- 4977bb21ec
feat(workflow): domain model for workflow node execution (#19430)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
9 months ago
k-kayashima b292990075
Fix: Ensure unique index names for pgvector knowledge tables (#19672)
Co-authored-by: crazywoola <427733928@qq.com>
9 months ago
非法操作 085bd1aa93
chore: model.query change to db.session.query (#19551)
Co-authored-by: QuantumGhost <obelisk.reg+git@gmail.com>
9 months ago
非法操作 14cd71ed0a
chore: all model.query replace to db.session.query (#19521) 9 months ago
非法操作 b00f94df64
fix: replace all dataset.Model.query to db.session.query(Model) (#19509) 9 months ago
湛露先生 1119790b02
clean rag word_extractor. (#19397)
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
9 months ago
Will bfa652f2d0
fix: metadata filtering condition variable unassigned; fix External K… (#19208) 9 months ago
Hao Cheng 9bcf837f17
fix: use only supported operators in metadata filter system prompts (#19195) 9 months ago
Will a212a63e6a
fix: time type metadata filtering error (#19192) 9 months ago