Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
Vid2Seq: A pretrained visual language model for describing multi-event videos
ai.googleblog.com
87 points
famouswaffles
3 years ago
16 comments
Loading...
Vid2Seq: A pretrained visual language model for describing multi-event videos | Heykuki News