ExplorerBio-AI InterfacesNeuroscience
Research PaperResearchia:202606.23047

B[FM]$^2$: Brain Foundation Model via Flow Matching with SplitUNet

Jaedong Hwang

Abstract

EEG foundation models can learn generalizable representations from large-scale EEG corpora to enable single-backbone transfer across diverse clinical and brain-computer interface tasks. Existing models typically discretize the continuous multi-channel EEG waveform into patches or codebook tokens and train a transformer with masked self-supervision. Recognizing that this discretization fragments continuous brain rhythms and obscures fine-grained temporal dynamics, we present B[FM]$^2$(Brain Found...

Submitted: June 23, 2026Subjects: Neuroscience; Bio-AI Interfaces

Description / Details

EEG foundation models can learn generalizable representations from large-scale EEG corpora to enable single-backbone transfer across diverse clinical and brain-computer interface tasks. Existing models typically discretize the continuous multi-channel EEG waveform into patches or codebook tokens and train a transformer with masked self-supervision. Recognizing that this discretization fragments continuous brain rhythms and obscures fine-grained temporal dynamics, we present B[FM]2^2(Brain Foundation Model via Flow Matching), whose inductive bias aligns with the data by pretraining directly on the raw signal using continuous-time flow matching without patches, tokenization, or masking. However, multi-channel EEG signals pose an architectural challenge for flow matching: time is densely sampled and highly autocorrelated (thousands of timepoints), while the electrode axis is short (tens of channels) at distinct scalp positions. To address this time-electrode asymmetry, we introduce SplitUNet, a velocity network that factorizes each block into separate 1D temporal and 1D electrode convolutions and downsamples only along time, preserving electrode topology throughout the hierarchy. B[FM]2^2 sets a new state of the art on 7 of 9 standard downstream EEG classification tasks, using a pretraining budget of only 36,895 segments (\approx 307h), 1-2 orders of magnitude (\approx 30x) less than required by existing EEG foundation models. Further, it generates synthetic EEGs that two board-certified neurologists cannot distinguish from brain data (Cohen's κ=κ= -0.096). https://jd730.github.io/projects/BFM2


Source: arXiv:2606.20812v1 - http://arxiv.org/abs/2606.20812v1 PDF: https://arxiv.org/pdf/2606.20812v1 Original Link: http://arxiv.org/abs/2606.20812v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 23, 2026
Topic:
Bio-AI Interfaces
Area:
Neuroscience
Comments:
0
Bookmark
B[FM]$^2$: Brain Foundation Model via Flow Matching with SplitUNet | Researchia