dify/api/core/rag/extractor
Ademílson Tonato 6024d8a42d
refactor: Update Firecrawl to use v1 API (#12574)
Co-authored-by: Ademílson Tonato <ademilson.tonato@refurbed.com>
2025-01-23 11:14:48 +08:00
..
blob chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
entity feat: mypy for all type check (#10921) 2024-12-24 18:38:51 +08:00
firecrawl refactor: Update Firecrawl to use v1 API (#12574) 2025-01-23 11:14:48 +08:00
unstructured fix unstructured setting (#12116) 2024-12-26 12:08:36 +08:00
csv_extractor.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
excel_extractor.py py lint (#12102) 2024-12-26 00:16:35 +08:00
extract_processor.py fix unstructured setting (#12116) 2024-12-26 12:08:36 +08:00
extractor_base.py chore(api/core): apply ruff reformatting (#7624) 2024-09-10 17:00:20 +08:00
helpers.py chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
html_extractor.py feat: mypy for all type check (#10921) 2024-12-24 18:38:51 +08:00
jina_reader_extractor.py feat(website-crawl): add jina reader as additional alternative for website crawling (#8761) 2024-09-30 09:57:19 +08:00
markdown_extractor.py chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
notion_extractor.py chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x (#12702) 2025-01-21 10:12:29 +08:00
pdf_extractor.py fix: fix the incorrect plaintext file key when saving (#10429) 2025-01-08 12:52:45 +08:00
text_extractor.py chore: refurbish Python code by applying refurb linter rules (#8296) 2024-09-12 15:50:49 +08:00
word_extractor.py Feat/support parent child chunk (#12092) 2024-12-25 19:49:07 +08:00