Central challenge: The minimal definition of category selectivity is underconstrained

A voxel or unit is deemed "face-selective" if it responds more to faces than to other categories — but infinitely many response patterns satisfy this minimal criterion. Without examining the full stimulus-level response profile, we cannot know whether the form of ANN selectivity matches what is observed in the human brain.

·
Response profile explorer  ·  schematic

Each point is a face-selective response profile (faces > other categories). Click any point to reveal its full response pattern across categories.

← click a point
Human FFA
ANN unit (example)
Other face-selective profiles

Research questions
Q1: Do different brains exhibit the same form of category selectivity?
Yes! Category-selective regions are strikingly consistent — establishing a human-human ceiling for model comparison.
Q2: Do ANN units exhibit the same form of category selectivity as the brain?
No! ANN units fall well below the human-human ceiling in every model tested — regardless of the selectivity threshold, functional localizer, and category-selective fROI.

Finding 1: Category-selective units emerge only in trained, but not untrained, ANNs

Using the same localizer procedure applied in human fMRI, we identified category-selective units across 55 ANN models (35 trained + 20 untrained). All pretrained models show robust face-, body-, and scene-selective units that generalize to an independent stimulus set, but randomly initialized models showed no reliable selectivity. Training is necessary for category selectivity to emerge.

·
Responses to independent stimuli (Prince et al., 2024)

Finding 2: Human category-selective responses are strikingly consistent across individuals  · 
Finding 3: ANN units fall well below the human-human ceiling — in every model tested

Category-selective voxel responses in FFA, EBA, and PPA are strikingly consistent across all 8 subjects — defining the human-human ceiling. Every ANN model tested falls substantially below this ceiling, across both univariate and multivariate measures, and across all choices of selectivity thresholds, functional localizers, and category-selective regions.

·

Finding 4: Category-selective ANN units are neither necessary nor sufficient for predicting brain responses

Voxel-wise encoding models trained on all units in a layer predict brain responses at within-subject ceiling. Removing category-selective units causes no meaningful drop in accuracy — but using only those units substantially reduces it. The information needed to predict brain responses is distributed broadly across the layer, not concentrated in the units a functional localizer deems category-selective.

·

Finding 5: The differences between category selectivity in ANNs and brains are systematic and interpretable

The divergences between ANN unit responses and human category-selective regions are not arbitrary — they reflect simple, interpretable stimulus-level features. These candidate factors generalized to held-out subjects, models, and stimuli.

Face selectivity
FFA: Human faces Animal faces
ANNs: Human faces Animal faces
Body selectivity
EBA: Exposed limbs Covered bodies
ANNs: Exposed limbs Covered bodies
Scene selectivity
PPA: Scenes without people Scenes with people
ANNs: Scenes without people Scenes with people
·

Current ANNs do not exhibit the form of category selectivity observed in the human brain

A category preference alone can arise from many different response patterns. The existence of category-selective units in ANNs does not imply that they exhibit the same form of category selectivity as the human brain.

Recommendations
  • Use the same localizer procedure as human neuroimaging and verify that selectivity generalizes to an independent stimulus set
  • Go beyond category preference — compare full stimulus-level response profiles across hundreds of natural images
  • Confirm that conclusions are stable across multiple localizer choices

ANN category-selective units differ from category-selective voxels in systematic and interpretable ways. Current ANN models do not yet share the biological invariances present in human category-selective cortex. Hence, ANN category-selective units cannot yet be equated with human category-selective voxels.

Recommendations
  • Use voxel-wise encoding models rather than direct unit comparisons; relevant features are distributed across the layer, not concentrated in selective units
  • Use our interpretable diagnostic tests (human vs. animal faces; visible vs. obscured limbs; scenes with vs. without people) as a useful starting point for evaluating brain-like selectivity
Open question 1
What developmental and evolutionary pressures give rise to the stable, invariant category representations observed in human visual cortex?
Open question 2
What determines the stability of category-selective representations? What model constraints are needed to achieve brain-like category selectivity?

Citation
Dipani, A., & Ratan Murty, N. A. (2026). Category selectivity observed in the human brain is distinct from category selectivity observed in artificial neural networks. bioRxiv. https://doi.org/10.64898/2026.05.29.728609
Show BibTeX ↓
@article{dipani2026category,
  title   = {Category selectivity observed in the human brain is distinct
             from category selectivity observed in artificial neural networks},
  author  = {Dipani, Alish and Ratan Murty, N Apurva},
  journal = {bioRxiv},
  pages   = {2026--05},
  year    = {2026},
  doi     = {10.64898/2026.05.29.728609},
  url     = {https://doi.org/10.64898/2026.05.29.728609}
}