MMC4: An open, billion-scale corpus of images interleaved with text | Heykuki News