QUICK REVIEW

[논문 리뷰] The Dynamic Sensorium competition for predicting large-scale mouse visual cortex activity from videos

Polina Turishcheva, Paul G. Fahey|arXiv (Cornell University)|2023. 05. 31.

Neural dynamics and brain function인용 수 11

한 줄 요약

본 논문은 SENSORIUM 2023 벤치마크를 소개합니다—비디오로부터 단일 뉴런의 생쥐 시각 피질 활동을 예측하기 위한 대규모의 동적 자극-반응 데이터셋 및 대회로, 행동 입력을 포함한 구분 내(in-domain) 및 구분 외(out-of-domain, OOD) 평가를 포함합니다.

ABSTRACT

Understanding how biological visual systems process information is challenging due to the complex nonlinear relationship between neuronal responses and high-dimensional visual input. Artificial neural networks have already improved our understanding of this system by allowing computational neuroscientists to create predictive models and bridge biological and machine vision. During the Sensorium 2022, we introduced benchmarks for vision models with static input. However, animals operate and excel in dynamic environments, making it crucial to study and understand how the brain functions under these conditions. Moreover, many biological theories, such as predictive coding, suggest that previous input is crucial for current input processing. Currently, there is no standardized benchmark to identify state-of-the-art dynamic models of the mouse visual system. To address this gap, we propose the Sensorium 2023 Benchmark Competition with dynamic input. It includes the collection of a new large-scale dataset from the primary visual cortex of ten mice, containing responses from over 78,000 neurons to over 2 hours of dynamic stimuli per neuron. Participants in the main benchmark track will compete to identify the best predictive models of neuronal responses for dynamic input. We will also host a bonus track in which submission performance will be evaluated on out-of-domain input, using withheld neuronal responses to dynamic input stimuli whose statistics differ from the training set. Both tracks will offer behavioral data along with video stimuli. As before, we will provide code, tutorials, and strong pre-trained baseline models to encourage participation. We hope this competition will continue to strengthen the accompanying Sensorium benchmarks collection as a standard tool to measure progress in large-scale neural system identification models of the entire mouse visual hierarchy and beyond.

연구 동기 및 목표

생쥐 시각 시스템의 동적 신경 예측에 대한 표준화된 벤치마크를 동기부여하고 체계화한다.
다수의 생쥐에 걸친 동적 시각 자극, 신경 반응, 행동의 대규모 데이터셋을 제공한다.
시간적 동역학을 다루고 도메인 외 자극에 일반화하는 예측 모델의 개발을 촉진한다.
신경 반응에 대한 조절 인자를 반영하기 위해 행동 변수들을 모델 입력으로 포함한다.

제안 방법

비디오 자극 x와 행동 b를 시간에 따라 신경 반응 r로 매핑하는 동적 예측 모델링 프레임워크 f_theta(x,b)를 도입한다.
다규모 데이터셋 제공: 다섯 마리의 생쥐, 38,819 neurons, 약 600분의 동적 자극, 학습/검증/실시간 테스트/최종 테스트 파티션 포함.
기본 모델 아키텍처를 설명: 2D CNN 코어와 가우시안 리드아웃을 갖춘 GRU 베이스라인; 동적 코어와 가우시안 리드아웃을 갖춘 3D 팩토라이즈드 베이스라인; 앙상블 베이스라인.
데이터 로드 및 모델 학습을 위한 튜토리얼, 코드, API를 포함한 시작 키트를 제공한다.
평가 지표 정의: 메인 트랙의 자연스러운 비디오 최종 테스트 세트에서 단일 트라이얼 상관관계; 보너스 트랙을 위한 5개 OOD 테스트 세트 전반에 걸친 같은 지표; 또한 평균과의 상관관계도 보고한다.

실험 결과

연구 질문

RQ1비디오와 행동을 입력으로 사용할 때 생쥐의 동적 자연 영화 자극에 대한 단일 세포 신경 반응의 예측 성능 상한은 얼마인가?
RQ2시간에 따라 진화하는 신경 활동을 예측하는 데 있어 동적 모델은 정적 또는 비시간적 베이스라인과 어떻게 비교되는가?
RQ3행동 변수( 이동, 동공 크기, 눈 위치 )를 포함하면 예측 정확도가 향상되는가?
RQ4도메인 내 자연 영상에서 학습된 모델이 통계가 다른 도메인 외 자극으로 일반화되는가?
RQ5생쥐 시각 피질의 시공간 역학을 가장 잘 포착하는 아키텍처와 학습 방식은 무엇인가?

주요 결과

다섯 마리에 걸친 38,819 neurons의 대규모 데이터셋이 공개되었으며, 약 600분에 걸친 동적 자극을 포함한다.
기본 모델(GRU, 3D Factorized, 앙상블)은 성능 차이를 보이며, 보류된 테스트 데이터에서 일반적으로 앙상블 베이스라인이 단일 베이스라인보다 우수하게 나타난다.
대회는 정적 이미지가 아닌 동적 입력(비디오)을 강조하며, 도메인 외 평가를 위한 보너스 트랙을 포함한다.
행동 변수들이 예측 모델의 입력으로 포함되어 신경 활동에 대한 조절 효과를 반영한다.
성능은 자연 비디오 최종 테스트 세트에서의 단일 트라이얼 상관관계(메인 트랙) 및 여러 OOD 자극에 걸쳐 평가된다(보너스 트랙).
논문은 참가를 촉진하고 팀 간의 표준화를 돕기 위한 시작 키트, 튜토리얼, 및 사전 학습된 베이스라인을 제공한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.