Wan2.1 I2v 720p 14b Fp16.safetensors Jun 2026

With 14B parameters, the cross-attention layers (which connect the text prompt to the generated video) have substantial capacity, so the model can handle complex compound prompts.

Supports multilingual text prompts (Chinese and English) via a T5 encoder. Excels at cinematic aesthetics and complex motion. (Wan-AI/Wan2.1-I2V-14B-720P, Hugging Face)

Source: Available via the official Wan-AI Hugging Face repository or repackaged versions such as those from Comfy-Org.
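
As a rough sizing check before downloading, the fp16 weights of a 14B-parameter model can be estimated from the parameter count alone. The sketch below assumes exactly 14 billion parameters at 2 bytes each; "14B" is a nominal figure, so the real file size will differ somewhat, and the helper name `weights_size_bytes` is purely illustrative.

```python
# Back-of-the-envelope size of fp16 weights.
# Assumption: exactly 14e9 parameters, each stored in 2 bytes (fp16).
# The actual checkpoint will not match this figure exactly.

BYTES_PER_FP16 = 2


def weights_size_bytes(num_params: int, bytes_per_param: int = BYTES_PER_FP16) -> int:
    """Return the raw storage needed for the weights, in bytes."""
    return num_params * bytes_per_param


size = weights_size_bytes(14_000_000_000)
print(f"{size / 1e9:.1f} GB ({size / 2**30:.1f} GiB)")  # 28.0 GB (26.1 GiB)
```

This covers the weights only. Memory needed at inference time is higher, since the text encoder, the VAE, and intermediate activations also need room.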