Not Verified

Free

Phenaki

A model for generating videos from text.

Phenaki Features

Phenaki is an AI model that generates videos, potentially several minutes long, directly from text. It can also generate video from a still image and a prompt. Its video encoder-decoder outperforms all per-frame baselines currently used in the literature in spatio-temporal quality and in the number of tokens needed per video. To generate video tokens from text, it uses a bidirectional masked transformer conditioned on pre-computed text tokens. The generated video tokens are then de-tokenized to produce the actual video.
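The text-to-tokens-to-video pipeline described above can be illustrated with a toy sketch. It is not Phenaki's code: `toy_predict`, `masked_decode`, `detokenize`, the cosine unmasking schedule, and all parameter values are hypothetical stand-ins chosen for illustration. It assumes one common way of sampling from a masked bidirectional transformer, where every video token starts out masked and the most confident predictions are committed over a few steps.

```python
import math
import random

MASK = -1  # sentinel for a video-token slot that has not been filled yet


def toy_predict(tokens, text_tokens, vocab_size, rng):
    """Hypothetical stand-in for the bidirectional transformer.

    Returns a (token, confidence) guess for every position. A real model
    would attend to all video positions and to the text tokens at once.
    """
    text_bias = sum(text_tokens) % vocab_size
    return [((text_bias + i) % vocab_size, rng.random()) for i in range(len(tokens))]


def masked_decode(text_tokens, num_video_tokens, vocab_size=16, steps=4, seed=0):
    """Fill an all-masked token sequence over a fixed number of steps."""
    rng = random.Random(seed)
    tokens = [MASK] * num_video_tokens
    for step in range(1, steps + 1):
        masked = [i for i, t in enumerate(tokens) if t == MASK]
        if not masked:
            break
        guesses = toy_predict(tokens, text_tokens, vocab_size, rng)
        # Assumed cosine schedule: how many positions stay masked after this step.
        keep_masked = int(num_video_tokens * math.cos(math.pi / 2 * step / steps))
        num_to_fill = max(1, len(masked) - keep_masked)
        # Commit the most confident guesses among the still-masked positions.
        masked.sort(key=lambda i: guesses[i][1], reverse=True)
        for i in masked[:num_to_fill]:
            tokens[i] = guesses[i][0]
    return tokens


def detokenize(tokens):
    """Hypothetical decoder stub: a real decoder maps tokens back to pixels."""
    return [f"patch<{t}>" for t in tokens]


if __name__ == "__main__":
    text_tokens = [3, 1, 4]  # placeholder for pre-computed text tokens
    video_tokens = masked_decode(text_tokens, num_video_tokens=12)
    print(detokenize(video_tokens))
```

Because the predictor here is a stub, the output carries no meaning; the sketch only shows the control flow of conditioning on text tokens, filling masked video tokens in parallel, and handing the result to a decoder.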
