A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing. - View it on GitHub
Star
882
Rank
47813