Commit Graph

84 Commits (51db59622c9fac87f06fd33b0b8aaf36efba97b5)

Author SHA1 Message Date
Bowen Liang 51db59622c
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
pinsily 5f21d13572
fix: handle KeyError when accessing rules in CleanProcessor.clean (#10258) 1 year ago
Jyong 82033af097
clean un-allowed special charters when doing indexing estimate (#10153) 1 year ago
Bowen Liang 240b66d737
chore: avoid implicit optional in type annotations of method (#8727) 1 year ago
Bowen Liang a1104ab97e
chore: refurish python code by applying Pylint linter rules (#8322) 1 year ago
Bowen Liang 292220c596
chore: apply pep8-naming rules for naming convention (#8261) 1 year ago
Bowen Liang 2cf1187b32
chore(api/core): apply ruff reformatting (#7624) 1 year ago
Nam Vu 2d7954c7da
Fix variable typo (#8084) 1 year ago
KVOJJJin 14af87527f
Feat:remove estimation of embedding cost (#7950)
Co-authored-by: jyong <718720800@qq.com>
1 year ago
Jyong 35431bce0d
fix dataset_id and index_node_id idx missed in document_segments tabl… (#7681) 1 year ago
Shota Totsuka 430e100142
refactor: Add @staticmethod decorator in `api/core` (#7652) 1 year ago
wellCh4n b7778de224
fix: document error message can not be cleared (#7453) 1 year ago
-LAN- a98284b1ef
refactor(api): Switch to `dify_config` (#6750)
Signed-off-by: -LAN- <laipz8200@outlook.com>
2 years ago
sino d27e3ab99d
chore: remove unresolved reference (#6110) 2 years ago
Bowen Liang dcb72e0067
chore: apply flake8-comprehensions Ruff rules to improve collection comprehensions (#5652)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2 years ago
Jyong ba5f8afaa8
Feat/firecrawl data source (#5232)
Co-authored-by: Nicolas <nicolascamara29@gmail.com>
Co-authored-by: chenhe <guchenhe@gmail.com>
Co-authored-by: takatost <takatost@gmail.com>
2 years ago
Bowen Liang f976740b57
improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 2 years ago
takatost d1dbbc1e33
feat: backend model load balancing support (#4927) 2 years ago
Jyong b6631cd878
modify rerank and splitter code directory (#4924) 2 years ago
Jyong 233c4150d1
support images and tables extract from docx (#4619) 2 years ago
Rain Chen c255a20d7c
allow to config max segmentation tokens length for RAG document using environment variable (#4375) 2 years ago
Bowen Liang 04ad46dd31
chore: skip unnecessary key checks prior to accessing a dictionary (#4497) 2 years ago
LIU HONGWEI c227f3d985
feat: Deprecate datetime.utcnow() in favor of datetime.now(timezone.utc).replace(tzinfo=None) for better timezone handling (#3408) (#3416) 2 years ago
Jyong 33ea689861
fix detached instance error in keyword index create thread and fix question classifier node out of index error (#3219) 2 years ago
Jyong 283979fc46
fix keyword index error when storage source is S3 (#3182) 2 years ago
takatost 7753ba2d37
FEAT: NEW WORKFLOW ENGINE (#3160)
Co-authored-by: Joel <iamjoel007@gmail.com>
Co-authored-by: Yeuoly <admin@srmxy.cn>
Co-authored-by: JzoNg <jzongcode@gmail.com>
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
Co-authored-by: jyong <jyong@dify.ai>
Co-authored-by: nite-knite <nkCoding@gmail.com>
Co-authored-by: jyong <718720800@qq.com>
2 years ago
Jyong b0b0cc045f
add mutil-thread document embedding (#3016)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 6454e1d644
chunk-overlap None check (#2781)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 31070ffbca
fix qa index processor tenant id is None error (#2713)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Charlie.Wei fa7ba30ba3
Fix rebuild index&csv parsing (#2705)
Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2 years ago
Jyong 5b953c1ef2
Fix some RAG bugs (#2570)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 0620fa3094
Feat/vdb migrate command (#2562)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 4be3087642
Fix/new RAG bugs (#2547)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 91ea6fe4ee
Fix/langchain document schema (#2539)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 6c4e6bf1d6
Feat/dify rag (#2528)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 97fe817186
Fix/upload limit (#2521)
Co-authored-by: jyong <jyong@dify.ai>
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
2 years ago
Bowen Liang 063191889d
chore: apply ruff's pyupgrade linter rules to modernize Python code with targeted version (#2419) 2 years ago
crazywoola 243ca5b1e2
fix: typo in package path of core.splitter (#2411) 2 years ago
Bowen Liang 843280f82b
enhancement: introduce Ruff for Python linter for reordering and removing unused imports with automated pre-commit and sytle check (#2366) 2 years ago
takatost 9f637ead38
bump version to 0.5.3 (#2306) 2 years ago
KVOJJJin 89fcf4ea7c
Feat: chunk overlap supported (#2209)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
takatost 6cf93379b3
fix: split chunks return empty strings (#2197) 2 years ago
Jyong 869690c485
fix notion estimate (#2090)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong cb7a608d75
ascii filter Unicode U+FFFE (#2038)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong a63a9c7d45
text spliter length method use default embedding model tokenizer (#2011)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Bowen Liang cc9e74123c
improve: introduce isort for linting Python imports (#1983) 2 years ago
Jyong 24bdedf802
fix get embedding model provider in empty dataset (#1986)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 4a3d15b6de
fix customer spliter character (#1915)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
takatost a938e1f184
fix: notion_indexing_estimate embedding_model_instance NPE (#1907) 2 years ago
Yeuoly 9134849744
fix: remove tiktoken from text splitter (#1876) 2 years ago