Just Released: The World's Largest Open-Source Multimodal Dataset