drop pdfs · convert to single-page tiffs + ocr text · runs locally
pdf files · local only
heuristic screener · vendor schema varies · not definitive proof