Fc2 3292343 | Exclusive

We compare against the strongest publicly available multi‑modal models:

Given , a ∈ ℝⁿ, GCMA computes:

Our contributions can be summarized as follows: fc2 3292343

The pooled vectors p₁,…,p_K are concatenated and fed to the classification head. By allowing multiple “pools,” ATP can capture both short‑term actions and long‑range context. a ∈ ℝⁿ

Key properties: