Back to Explorer
Research PaperResearchia:202602.10033[Artificial Intelligence > AI]

Robustness Is a Function, Not a Number: A Factorized Comprehensive Study of OOD Robustness in Vision-Based Driving

Amir Mallak

Abstract

Out of distribution (OOD) robustness in autonomous driving is often reduced to a single number, hiding what breaks a policy. We decompose environments along five axes: scene (rural/urban), season, weather, time (day/night), and agent mix; and measure performance under controlled kk-factor perturbations (k{0,1,2,3}k \in \{0,1,2,3\}). Using closed loop control in VISTA, we benchmark FC, CNN, and ViT policies, train compact ViT heads on frozen foundation-model (FM) features, and vary ID support in scale, diversity, and temporal context. (1) ViT policies are markedly more OOD-robust than comparably sized CNN/FC, and FM features yield state-of-the-art success at a latency cost. (2) Naive temporal inputs (multi-frame) do not beat the best single-frame baseline. (3) The largest single factor drops are rural \rightarrow urban and day \rightarrow night (31%\sim 31\% each); actor swaps 10%\sim 10\%, moderate rain 7%\sim 7\%; season shifts can be drastic, and combining a time flip with other changes further degrades performance. (4) FM-feature policies stay above 85%85\% under three simultaneous changes; non-FM single-frame policies take a large first-shift hit, and all no-FM models fall below 50%50\% by three changes. (5) Interactions are non-additive: some pairings partially offset, whereas season-time combinations are especially harmful. (6) Training on winter/snow is most robust to single-factor shifts, while a rural+summer baseline gives the best overall OOD performance. (7) Scaling traces/views improves robustness (+11.8+11.8 points from 55 to 1414 traces), yet targeted exposure to hard conditions can substitute for scale. (8) Using multiple ID environments broadens coverage and strengthens weak cases (urban OOD 60.6%70.1%60.6\% \rightarrow 70.1\%) with a small ID drop; single-ID preserves peak performance but in a narrow domain. These results yield actionable design rules for OOD-robust driving policies.


Source: arXiv:2602.09018v1 - http://arxiv.org/abs/2602.09018v1 PDF: https://arxiv.org/pdf/2602.09018v1 Original Link: http://arxiv.org/abs/2602.09018v1

Submission:2/10/2026
Comments:0 comments
Subjects:AI; Artificial Intelligence
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Robustness Is a Function, Not a Number: A Factorized Comprehensive Study of OOD Robustness in Vision-Based Driving | Researchia | Researchia