You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
gcgj-dify-1.7.0/api/core/rag/extractor
Bowen Liang 39c14ec7c1
improve: unify Excel files parsing in either xls or xlsx file format by Pandas (#4965)
2 years ago
..
blod improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 2 years ago
entity fix: ExtractSetting optional value missing None as default val (#5238) 2 years ago
firecrawl Feat/firecrawl data source (#5232) 2 years ago
unstructured Add UNSTRUCTURED_API_KEY env support (#4369) 2 years ago
csv_extractor.py dep: bump pandas from 1.x to 2.x (#4820) 2 years ago
excel_extractor.py improve: unify Excel files parsing in either xls or xlsx file format by Pandas (#4965) 2 years ago
extract_processor.py Feat/firecrawl data source (#5232) 2 years ago
extractor_base.py Feat/dify rag (#2528) 2 years ago
helpers.py Feat/dify rag (#2528) 2 years ago
html_extractor.py Fix some RAG bugs (#2570) 2 years ago
markdown_extractor.py Feat/dify rag (#2528) 2 years ago
notion_extractor.py Feat/firecrawl data source (#5232) 2 years ago
pdf_extractor.py Feat/dify rag (#2528) 2 years ago
text_extractor.py Feat/dify rag (#2528) 2 years ago
word_extractor.py deal the external image when extract docx image (#5024) 2 years ago