参考资料:https://arxiv.org/pdf/2402.13217.pdfhttps://blog.research.google/2024/02/videoprism-foundational-visual-encoder.html