Commit Graph

21 Commits (80a87f36eabb717ed107cb286455e8a3bda8d077)

Author SHA1 Message Date
Bowen Liang 58db719a2c
dep: bump pandas from 1.x to 2.x (#4820) 2 years ago
Oliver Lee 176d91937d
fix 'NoneType' and new ContentType supported. (#4818) 2 years ago
yalei 026175c8f7
feat: update notion extractor (#3898)
Co-authored-by: duyalei <>
2 years ago
Jyong 233c4150d1
support images and tables extract from docx (#4619) 2 years ago
majian b5204111da
Add UNSTRUCTURED_API_KEY env support (#4369) 2 years ago
Charlie.Wei 97b65f9b4b
Optimize webscraper (#4392)
Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2 years ago
Bowen Liang 7919596a21
fix: UP031 style rule violation (#3866) 2 years ago
Jyong 0737e930cb
chore: remove Langchain tools import (#3407) 2 years ago
chenxu9741 ad65c891e7
add xls file suport (#3321) 2 years ago
LiuVaayne b00466f025
feat:api Add support for extracting EPUB files in ExtractProcessor (#3254)
Co-authored-by: crazywoola <427733928@qq.com>
2 years ago
Jyong 6164604462
fix dataset retrival in dataset mode (#3334) 2 years ago
Jyong 9eba6ffdd4
Optimize csv and excel extract (#3155)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Vikey Chen e4f686deb7
fix unstructured api,remove unused parameters (#3056) 2 years ago
Jyong b0b0cc045f
add mutil-thread document embedding (#3016)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Weaxs 20bd49285b
excel: get keys from every sheet (#2796) 2 years ago
Bowen Liang b163545771
Use `python-docx` to extract docx files (#2654) 2 years ago
Charlie.Wei fa7ba30ba3
Fix rebuild index&csv parsing (#2705)
Co-authored-by: luowei <glpat-EjySCyNjWiLqAED-YmwM>
Co-authored-by: crazywoola <427733928@qq.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2 years ago
takatost a4d86496e1
fix: notion extractor raise 'NoneType' object has no attribute 'curre… (#2608) 2 years ago
Jyong 5b953c1ef2
Fix some RAG bugs (#2570)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 91ea6fe4ee
Fix/langchain document schema (#2539)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago
Jyong 6c4e6bf1d6
Feat/dify rag (#2528)
Co-authored-by: jyong <jyong@dify.ai>
2 years ago