Before extraction, the pipeline now: 1. Identifies 3-5 themes from source (Haiku, ~$0.002/source) 2. Searches Qdrant for each theme + title (with author-stripped variant) 3. Injects "Prior Art" into extraction prompt showing existing KB claims 4. Requires ENRICHMENT/CHALLENGE to cite specific target_claim (hard gate) Reduces near-duplicate extractions (our #1 rejection cause) by showing the extractor what the KB already knows before it starts. Prior art also persisted to .prior-art/ sidecar files and included in PR body for reviewer verification. Design: Leo. Owner: Epimetheus. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| conftest.py | ||
| test_attribution.py | ||
| test_entity_queue.py | ||
| test_extraction_prompt.py | ||
| test_feedback.py | ||
| test_post_extract.py | ||
| test_pre_screen.py | ||
| test_search.py | ||