ExplorerArtificial IntelligenceAI
Research PaperResearchia:202603.23054

Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Qi Cao

Abstract

Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the truthfulness of their outputs is not guaranteed, and their tendency toward overconfidence further limits reliability. Uncertainty quantification offers a promising way to identify potentially unreliable outputs, but most existing methods rely on repeated sampling or auxiliary models, introducing substantial computational overhead. To address these limitations, we propose Semantic Token Clust...

Submitted: March 23, 2026Subjects: AI; Artificial Intelligence

Description / Details

Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the truthfulness of their outputs is not guaranteed, and their tendency toward overconfidence further limits reliability. Uncertainty quantification offers a promising way to identify potentially unreliable outputs, but most existing methods rely on repeated sampling or auxiliary models, introducing substantial computational overhead. To address these limitations, we propose Semantic Token Clustering (STC), an efficient uncertainty quantification method that leverages the semantic information inherently encoded in LLMs. Specifically, we group tokens into semantically consistent clusters using embedding clustering and prefix matching, and quantify uncertainty based on the probability mass aggregated over the corresponding semantic cluster. Our approach requires only a single generation and does not depend on auxiliary models. Experimental results show that STC achieves performance comparable to state-of-the-art baselines while substantially reducing computational overhead.


Source: arXiv:2603.20161v1 - http://arxiv.org/abs/2603.20161v1 PDF: https://arxiv.org/pdf/2603.20161v1 Original Link: http://arxiv.org/abs/2603.20161v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Mar 23, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark