gcgj-dify-1.7.0/api/core/rag/extractor
Jyong 3b60c28b3a deal the external image when extract docx image (#5024) 2 years ago
blod chore: remove Langchain tools import (#3407) 2 years ago
entity Fix some RAG bugs (#2570) 2 years ago
unstructured Add UNSTRUCTURED_API_KEY env support (#4369) 2 years ago
csv_extractor.py dep: bump pandas from 1.x to 2.x (#4820) 2 years ago
excel_extractor.py fixing a bug of handling header row when parsing xls file, and tune xls/xlsx parsing result to be more structured (#3600) 2 years ago
extract_processor.py fix 'NoneType' and new ContentType supported. (#4818) 2 years ago
extractor_base.py Feat/dify rag (#2528) 2 years ago
helpers.py Feat/dify rag (#2528) 2 years ago
html_extractor.py Fix some RAG bugs (#2570) 2 years ago
markdown_extractor.py Feat/dify rag (#2528) 2 years ago
notion_extractor.py feat: update notion extractor (#3898) 2 years ago
pdf_extractor.py Feat/dify rag (#2528) 2 years ago
text_extractor.py Feat/dify rag (#2528) 2 years ago
word_extractor.py deal the external image when extract docx image (#5024) 2 years ago