drop document set · detect non-english documents + segment for translation queue · runs locally
drop document set · local only
heuristic screener · vendor schema varies · not definitive proof