You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
gcgj-dify-1.7.0/api/core/rag/extractor
Aurelius Huang f546db5437
fix: document truncation and loss in notion document sync (#5631)
Co-authored-by: Aurelius Huang <cm.huang@aftership.com>
2 years ago
..
blod improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 2 years ago
entity fix: ExtractSetting optional value missing None as default val (#5238) 2 years ago
firecrawl Feat/firecrawl data source (#5232) 2 years ago
unstructured Add UNSTRUCTURED_API_KEY env support (#4369) 2 years ago
csv_extractor.py dep: bump pandas from 1.x to 2.x (#4820) 2 years ago
excel_extractor.py delete the deprecated method (#5612) 2 years ago
extract_processor.py Feat/firecrawl data source (#5232) 2 years ago
extractor_base.py Feat/dify rag (#2528) 2 years ago
helpers.py Feat/dify rag (#2528) 2 years ago
html_extractor.py Fix some RAG bugs (#2570) 2 years ago
markdown_extractor.py Feat/dify rag (#2528) 2 years ago
notion_extractor.py fix: document truncation and loss in notion document sync (#5631) 2 years ago
pdf_extractor.py Feat/dify rag (#2528) 2 years ago
text_extractor.py Feat/dify rag (#2528) 2 years ago
word_extractor.py deal the external image when extract docx image (#5024) 2 years ago