ExplorerArtificial IntelligenceAI
Research PaperResearchia:202604.28002

Learning to Think from Multiple Thinkers

Nirmit Joshi

Abstract

We study learning with Chain-of-Thought (CoT) supervision from multiple thinkers, all of whom provide correct but possibly systematically different solutions, e.g., step-by-step solutions to math problems written by different thinkers, or step-by-step execution traces of different programs solving the same problem. We consider classes that are computationally easy to learn using CoT supervision from a single thinker, but hard to learn with only end-result supervision, i.e., without CoT (Joshi ...

Submitted: April 28, 2026Subjects: AI; Artificial Intelligence

Description / Details

We study learning with Chain-of-Thought (CoT) supervision from multiple thinkers, all of whom provide correct but possibly systematically different solutions, e.g., step-by-step solutions to math problems written by different thinkers, or step-by-step execution traces of different programs solving the same problem. We consider classes that are computationally easy to learn using CoT supervision from a single thinker, but hard to learn with only end-result supervision, i.e., without CoT (Joshi et al. 2025). We establish that, under cryptographic assumptions, learning can be hard from CoT supervision provided by two or a few different thinkers, in passive data-collection settings. On the other hand, we provide a generic computationally efficient active learning algorithm that learns with a small amount of CoT data per thinker that is completely independent of the target accuracy ε\varepsilon, a moderate number of thinkers that scales as log1εloglog1ε\log \frac{1}{\varepsilon}\log \log \frac{1}{\varepsilon}, and sufficient passive end-result data that scales as 1εpolylog1ε\frac{1}{\varepsilon}\cdot poly\log\frac{1}{\varepsilon}.


Source: arXiv:2604.24737v1 - http://arxiv.org/abs/2604.24737v1 PDF: https://arxiv.org/pdf/2604.24737v1 Original Link: http://arxiv.org/abs/2604.24737v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Apr 28, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark
Learning to Think from Multiple Thinkers | Researchia