ExplorerArtificial IntelligenceAI
Research PaperResearchia:202602.13002

GENIUS: Generative Fluid Intelligence Evaluation Suite

Ruichuan An

Abstract

Unified Multimodal Models (UMMs) have shown remarkable progress in visual generation. Yet, existing benchmarks predominantly assess $\textit{Crystallized Intelligence}$, which relies on recalling accumulated knowledge and learned schemas. This focus overlooks $\textit{Generative Fluid Intelligence (GFI)}$: the capacity to induce patterns, reason through constraints, and adapt to novel scenarios on the fly. To rigorously assess this capability, we introduce $\textbf{GENIUS}$ ($\textbf{GEN}$ Fluid...

Submitted: February 13, 2026Subjects: AI; Artificial Intelligence

Description / Details

Unified Multimodal Models (UMMs) have shown remarkable progress in visual generation. Yet, existing benchmarks predominantly assess Crystallized Intelligence\textit{Crystallized Intelligence}, which relies on recalling accumulated knowledge and learned schemas. This focus overlooks Generative Fluid Intelligence (GFI)\textit{Generative Fluid Intelligence (GFI)}: the capacity to induce patterns, reason through constraints, and adapt to novel scenarios on the fly. To rigorously assess this capability, we introduce GENIUS\textbf{GENIUS} (GEN\textbf{GEN} Fluid I\textbf{I}ntelligence EvalU\textbf{U}ation S\textbf{S}uite). We formalize GFI\textit{GFI} as a synthesis of three primitives. These include Inducing Implicit Patterns\textit{Inducing Implicit Patterns} (e.g., inferring personalized visual preferences), Executing Ad-hoc Constraints\textit{Executing Ad-hoc Constraints} (e.g., visualizing abstract metaphors), and Adapting to Contextual Knowledge\textit{Adapting to Contextual Knowledge} (e.g., simulating counter-intuitive physics). Collectively, these primitives challenge models to solve problems grounded entirely in the immediate context. Our systematic evaluation of 12 representative models reveals significant performance deficits in these tasks. Crucially, our diagnostic analysis disentangles these failure modes. It demonstrates that deficits stem from limited context comprehension rather than insufficient intrinsic generative capability. To bridge this gap, we propose a training-free attention intervention strategy. Ultimately, GENIUS\textbf{GENIUS} establishes a rigorous standard for GFI\textit{GFI}, guiding the field beyond knowledge utilization toward dynamic, general-purpose reasoning. Our dataset and code will be released at: \href\href{https://github.com/arctanxarc/GENIUS}{https://github.com/arctanxarc/GENIUS}.


Source: arXiv:2602.11144v1 - http://arxiv.org/abs/2602.11144v1 PDF: https://arxiv.org/pdf/2602.11144v1 Original Link: http://arxiv.org/abs/2602.11144v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Feb 13, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark
GENIUS: Generative Fluid Intelligence Evaluation Suite | Researchia