Fc2 3292343 | Exclusive
We compare against the strongest publicly available multi‑modal models:
Given , a ∈ ℝⁿ, GCMA computes:
Our contributions can be summarized as follows: fc2 3292343
The pooled vectors p₁,…,p_K are concatenated and fed to the classification head. By allowing multiple “pools,” ATP can capture both short‑term actions and long‑range context. a ∈ ℝⁿ
Key properties: