Back to Explorer
Research PaperResearchia:202602.16018[Chemical Engineering > Engineering]

Quantization-Aware Collaborative Inference for Large Embodied AI Models

Zhonghao Lyu

Abstract

Large artificial intelligence models (LAIMs) are increasingly regarded as a core intelligence engine for embodied AI applications. However, the massive parameter scale and computational demands of LAIMs pose significant challenges for resource-limited embodied agents. To address this issue, we investigate quantization-aware collaborative inference (co-inference) for embodied AI systems. First, we develop a tractable approximation for quantization-induced inference distortion. Based on this approximation, we derive lower and upper bounds on the quantization rate-inference distortion function, characterizing its dependence on LAIM statistics, including the quantization bit-width. Next, we formulate a joint quantization bit-width and computation frequency design problem under delay and energy constraints, aiming to minimize the distortion upper bound while ensuring tightness through the corresponding lower bound. Extensive evaluations validate the proposed distortion approximation, the derived rate-distortion bounds, and the effectiveness of the proposed joint design. Particularly, simulations and real-world testbed experiments demonstrate the effectiveness of the proposed joint design in balancing inference quality, latency, and energy consumption in edge embodied AI systems.


Source: arXiv:2602.13052v1 - http://arxiv.org/abs/2602.13052v1 PDF: https://arxiv.org/pdf/2602.13052v1 Original Link: http://arxiv.org/abs/2602.13052v1

Submission:2/16/2026
Comments:0 comments
Subjects:Engineering; Chemical Engineering
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Quantization-Aware Collaborative Inference for Large Embodied AI Models | Researchia | Researchia