New SOTA Video Generator

HAPPYHORSE 1.0 AI

A state-of-the-art AI Video Generator that jointly generates video and audio from text — blazing fast, multilingual, fully open source.

15B Parameters

2 s · 256p · 5-Sec Clip

80% Win vs Ovi 1.1

Languages

8 Denoising Steps

Highlights

One model. Text, video, and audio: unified.

HappyHorse 1.0 replaces multi-stream complexity with a single self-attention Transformer, achieving state-of-the-art results at record speeds.

⟁

Single-Stream Architecture

40-layer Transformer processes text, video, and audio via self-attention only. No cross-attention, no complexity.

◎

Human-Centric Quality

Expressive facial performance, natural speech coordination, realistic body motion, accurate sync.

⚡

Blazing Fast Inference

5-second 256p video in 2 seconds. 5-second 1080p in 38 seconds — on a single H100.

✦

State-of-the-Art Results

80.0% win rate vs Ovi 1.1 and 60.9% vs LTX 2.3 across 2,000 human evaluations.

◈

Multilingual

Chinese (Mandarin and Cantonese), English, Japanese, Korean, German, and French are natively supported.

⊡

Fully Open Source

Base model, distilled model, super-resolution model, and inference code — all released.

Performance

Benchmarks that speak for themselves.

Leads on visual quality, text alignment, and word error rate, and trails LTX 2.3 only narrowly on physical realism (4.52 vs 4.56).

Model	Visual Quality ↑	Text Align ↑	Physical ↑	WER ↓
Ovi 1.1	4.73	4.10	4.41	40.45%
LTX 2.3	4.76	4.12	4.56	19.23%
HappyHorse 1.0	4.80	4.18	4.52	14.60%
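
To put the WER column in perspective, the short sketch below computes the relative error reduction implied by the table. The numbers are copied from the table; the helper name `relative_reduction` is only illustrative and not part of any released code.

```python
# WER values copied from the benchmark table (percent, lower is better).
wer = {"Ovi 1.1": 40.45, "LTX 2.3": 19.23, "HappyHorse 1.0": 14.60}


def relative_reduction(baseline: float, ours: float) -> float:
    """Fraction of the baseline's errors that are eliminated."""
    return (baseline - ours) / baseline


for name in ("Ovi 1.1", "LTX 2.3"):
    r = relative_reduction(wer[name], wer["HappyHorse 1.0"])
    print(f"vs {name}: {r:.1%} relative WER reduction")
```

Under these figures the reduction is roughly 64% relative to Ovi 1.1 and roughly 24% relative to LTX 2.3.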

Human Eval · vs Ovi 1.1

80.0%

Win rate across 2,000 human evaluations

Human Eval · vs LTX 2.3

60.9%

Win rate across 2,000 human evaluations

Inference Speed · H100, 5-Sec Video

256p

2.0 s

total

540p

8.0 s

with super-res

1080p

38.4 s

full quality
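
As a rough way to read these timings, the sketch below converts each one into a real-time factor (seconds of video produced per second of compute). The timings are the ones listed above; the function name and the framing as a "real-time factor" are assumptions made for illustration.

```python
CLIP_SECONDS = 5.0  # every benchmark above generates a 5-second clip

# Generation time in seconds on a single H100, as listed above.
gen_seconds = {"256p": 2.0, "540p": 8.0, "1080p": 38.4}


def realtime_factor(clip_s: float, gen_s: float) -> float:
    """Seconds of video produced per second of wall-clock generation time."""
    return clip_s / gen_s


factors = {res: realtime_factor(CLIP_SECONDS, t) for res, t in gen_seconds.items()}
for res, f in factors.items():
    label = "faster than real time" if f > 1.0 else "slower than real time"
    print(f"{res}: {f:.2f}x ({label})")
```

By this measure only the 256p setting runs faster than real time; the 540p and 1080p settings trade speed for resolution.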

Languages

🇨🇳 Mandarin 🇭🇰 Cantonese 🇬🇧 English 🇯🇵 Japanese 🇰🇷 Korean 🇩🇪 German 🇫🇷 French

Community Response · X

老张来了 @laozhang2579

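HappyHorse has overtaken Seedance 2.0 in blind tests and taken the top spot 👑 A dark horse has suddenly appeared on Artificial Analysis, with no official website, no paper, and no public information at all. Lao Zhang just wants to ask the all-knowing folks on X: whose lieutenant is this fierce?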

180 LIKES 74K VIEWS

Justine Moore @venturetwins

A new video model dropped at #1 on the leaderboard 👀 It's called HappyHorse-1.0, and it's currently leading in both text-to-video and image-to-video. From my testing, it's particularly good at multi-shot videos and following detailed directions.

361 LIKES 28K VIEWS

AngryTom @AngryTomtweets

Did Google just drop Veo 4? A new anonymous video model is currently leading both text-to-video and image-to-video on the Artificial Analysis leaderboards. It's called HappyHorse-1.0, and it's very promising.

91 LIKES 8.5K VIEWS

koltregaskes @koltregaskes

Is this "Happy Horse" on AA the new Grok Imagine video model? Beating Seedance 2.0 on the leaderboard right now.

23 LIKES 4K VIEWS

berryxia @berryxia

Has HappyHorse 1.0 completely surpassed Seedance 2.0? This thing must be Wan2.7 from Alibaba's Papa Ma! Let's see whether it's really that strong!

41 LIKES 10K VIEWS

arsh_goyal @arsh_goyal

now what is Happy Horse 1.0, better than Seedance 2.0 😮

8 LIKES 1.8K VIEWS

DataLearnerAI @DataLearnerAI

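Is another Chinese-made video generation model about to debut, and one that can beat ByteDance's SeedDance 2.0? A mysterious HappyHorse model has appeared on ArtificialAnalysis with an Elo score of 1336, while second-place Dreamina Seedance 2.0 scores 1273, a gap of 63 points.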

4 LIKES 1.5K VIEWS

tangchuan_CN @tangchuan_CN

HappyHorse, a video generation model that beats Seedance 2.0 across the board, has appeared on the leaderboard. Who could be behind it? So hard to guess.

1 LIKE 1K VIEWS

VigoCreativeAI @VigoCreativeAI

HappyHorse-1.0 is almost certainly daVinci-MagiHuman; the basic parameters match!

2 LIKES 260 VIEWS

Neuralithic @Neuralithic

Happy-Horse. A new model that ranks above Seedance 2.0 on Artificial Analysis (more examples below)

5 LIKES 1.8K VIEWS

Architecture

Designed for elegance and speed.

Text tokens, a reference image latent, and noisy video and audio tokens are jointly denoised within a single unified token sequence.
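
One way to picture the unified sequence is sketched below. This is a minimal illustration, not the released implementation: the `Token` class, the ordering of modalities, and the token counts are all assumptions chosen for readability.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    modality: str  # "text", "image_ref", "video", or "audio"
    index: int     # position within its own modality
    noisy: bool    # True for tokens the model is asked to denoise


def build_sequence(n_text: int, n_ref: int, n_video: int, n_audio: int) -> list:
    """Concatenate all modalities into one sequence (ordering is an assumption)."""
    seq = [Token("text", i, False) for i in range(n_text)]
    seq += [Token("image_ref", i, False) for i in range(n_ref)]
    seq += [Token("video", i, True) for i in range(n_video)]
    seq += [Token("audio", i, True) for i in range(n_audio)]
    return seq


seq = build_sequence(n_text=4, n_ref=2, n_video=6, n_audio=3)

# With self-attention only, every token can attend to every other token,
# so the attention pattern is a full square over the combined sequence.
attends = [[True] * len(seq) for _ in seq]

print(len(seq), "tokens,", sum(t.noisy for t in seq), "of them denoised")
```

The point of the sketch is that conditioning tokens (text, reference image) and noisy tokens (video, audio) share one sequence, so no separate cross-attention path is needed to connect them.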

⟁

Sandwich Architecture

The first 4 and last 4 layers use modality-specific projections; the middle 32 layers share parameters across all modalities.
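
The layer split can be written down as a simple plan. The sketch below only encodes the 4 / 32 / 4 layout stated above; the constant names and the string labels are illustrative assumptions.

```python
NUM_LAYERS = 40   # total Transformer layers, as stated in the highlights
EDGE_LAYERS = 4   # layers at each end with modality-specific projections


def layer_plan(num_layers: int = NUM_LAYERS, edge: int = EDGE_LAYERS) -> list:
    """Label each layer as modality-specific (outer) or shared (middle)."""
    plan = []
    for i in range(num_layers):
        is_edge = i < edge or i >= num_layers - edge
        plan.append("modality-specific" if is_edge else "shared")
    return plan


plan = layer_plan()
print(plan.count("modality-specific"), plan.count("shared"))  # 8 32
```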

◎

Timestep-Free Denoising

No explicit timestep embeddings — the model infers the denoising state directly from input latents.

⚡

Per-Head Gating

Learned scalar gates with sigmoid activation on each attention head for training stability.
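
A minimal sketch of the gating idea follows, in plain Python. It assumes one learned scalar logit per head that is passed through a sigmoid and multiplied into that head's output; whether the real gates are fixed parameters or computed from the input is not stated here, and the function names are hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def gate_heads(head_outputs: list, gate_logits: list) -> list:
    """Scale each attention head's output vector by sigmoid(its gate logit)."""
    if len(head_outputs) != len(gate_logits):
        raise ValueError("need exactly one gate logit per head")
    return [
        [sigmoid(g) * v for v in head]
        for head, g in zip(head_outputs, gate_logits)
    ]


# Three toy heads with two output values each.
heads = [[1.0, -2.0], [0.5, 0.5], [3.0, 0.0]]
logits = [0.0, 4.0, -4.0]  # assumed learned values: neutral, mostly open, mostly closed

gated = gate_heads(heads, logits)
print(gated[0])  # a logit of 0 gives a gate of 0.5, so the first head is halved
```

Because the sigmoid keeps every gate between 0 and 1, a head can be smoothly attenuated without ever being amplified, which is one plausible reason such gates help training stability.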

✦

Unified Conditioning

Denoising and reference signals handled through a minimal unified interface — no dedicated conditioning branches.

◈

DMD-2 Distillation

Enables generation in only 8 denoising steps with no classifier-free guidance (CFG), without sacrificing output quality.
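
To make the cost saving concrete, here is a toy 8-step sampling loop. Everything in it is a stand-in: `toy_denoiser` is a hypothetical placeholder for the distilled model, and the update rule is a generic interpolation, not DMD-2's actual sampler. The sketch only illustrates two properties described in this section: the denoiser takes no timestep argument, and each step needs a single forward pass because no CFG is used.

```python
import random

NUM_STEPS = 8  # denoising steps for the distilled model, as stated above


def toy_denoiser(latents: list, prompt: str) -> list:
    """Hypothetical stand-in for the model: returns a 'clean' prediction.

    Note the signature: latents and conditioning only, no timestep.
    """
    target = float(len(prompt))  # placeholder value, not a real prediction
    return [target for _ in latents]


def sample(prompt: str, size: int = 4, steps: int = NUM_STEPS, seed: int = 0):
    rng = random.Random(seed)
    latents = [rng.gauss(0.0, 1.0) for _ in range(size)]  # start from noise
    calls = 0
    for k in range(steps):
        pred = toy_denoiser(latents, prompt)  # one forward pass per step
        calls += 1
        w = 1.0 / (steps - k)  # placeholder schedule; final step uses w = 1
        latents = [x + w * (p - x) for x, p in zip(latents, pred)]
    return latents, calls


latents, calls = sample("a horse")
print("forward passes without CFG:", calls)
print("forward passes if CFG were used:", 2 * NUM_STEPS)
```

With CFG, each step typically requires both a conditional and an unconditional forward pass, so dropping it halves the number of model evaluations for the same step count.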

⊡

MagiCompiler

Full-graph compilation that fuses operators across Transformer layers for ~1.2× end-to-end speedup.

Try HappyHorse 1.0 AI today.

Explore the live demo, browse the model hub, or clone the inference code. Everything is open.

▶ Live Demo · Model Hub (coming soon) · GitHub (coming soon)
