Visual Spatial Learning

Dynamics of mesoscale brain network during visual discrimination learning revealed by chronic, large-scale single-unit recording

During the acquisition of correct rejection response, rankings of functional connection separated for cortical and subcortical regions, which is predictive of the peak timing of visual information ...

FAA encouragement of spatial disorientation training has been UND practice for decades

The Federal Aviation Administration has recently encouraged operators and training programs to incorporate a particular type of training, which the University of North Dakota's ...

Courtship is complicated, even in fruit flies

Love is in the air for the vinegar fly. Drosophila melanogaster has long been a model for understanding how brains translate sensory information into courtship behavior. Male flies perform a multitude ...

GitHub

Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Welcome to the official codebase for Franca (pronounced Fran-ka), the first fully open-source vision foundation model—including data, code, and pretrained weights. Franca matches or surpasses the ...

GitHub

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction (CVPR 2026)

VLM-3R is a unified Vision-Language Model (VLM) framework integrating 3D reconstructive instruction tuning for deep spatial understanding from monocular video. The rapid advancement of Large ...

IEEE

Complementary and Contrastive Learning for Audio-Visual Segmentation

Abstract: Audio-Visual Segmentation (AVS) aims to generate pixel-wise segmentation maps that correlate with the auditory signals of objects. This field has seen significant progress with numerous CNN ...

IEEE

Attention-Guided Reinforcement Learning for Visual Servoing Control of Multirotor UAVs

Abstract: To address the challenges of dynamic perception, real-time decision-making, and control stability in UAV visual tracking tasks, this study proposes an Attention-guided Visual Servoing ...

eLife

Spatial learning in multi-scale environments: Roles of hippocampus, orbitofrontal cortex, and retrosplenial cortex

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, and public reviews. In this paper, Qiu et al. developed a novel spatial navigation ...

Microsoft

Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials

Vision Transformers (ViTs) have become a universal backbone for both image recognition and image generation. Yet their Multi–Head Self–Attention (MHSA) layer still performs a quadratic query–key ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results