Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) | Heykuki News