LargeWorldModel

LargeWorldModel Organization

Below provides an overview of research related the LargeWorldModel Org.

Research

ElasticTok: Adaptive Tokenization for Image and Video
Wilson Yan*, Matei Zaharia, Volodynmyr Mnih, Pieter Abbeel, Aleksandra Faust, Hao Liu*

project page
tweet

An adaptive tokenization model that can encode image and long video to variable-length sequence.

World Model on Million-Length Video and Language With Blockwise RingAttention
Hao Liu*, Wilson Yan*, Matei Zaharia, Pieter Abbeel

project page
tweet

A vision-language model trained on million-length sequences of images, language, and video.

Website Source