*Result*: Facilitating analysis of open neurophysiology data on the DANDI Archive using large language model tools.

Title:
Facilitating analysis of open neurophysiology data on the DANDI Archive using large language model tools.
Authors:
Magland JF; Flatiron Institute, New York, NY, USA. jmagland@flatironinstitute.org., Ly R; Lawrence Berkeley National Laboratory, Berkeley, CA, USA., Rübel O; Lawrence Berkeley National Laboratory, Berkeley, CA, USA., Dichter B; CatalystNeuro, Casper, WY, USA.
Source:
Scientific data [Sci Data] 2025 Dec 16; Vol. 12 (1), pp. 1988. Date of Electronic Publication: 2025 Dec 16.
Publication Type:
Journal Article; Dataset
Language:
English
Journal Info:
Publisher: Nature Publishing Group Country of Publication: England NLM ID: 101640192 Publication Model: Electronic Cited Medium: Internet ISSN: 2052-4463 (Electronic) Linking ISSN: 20524463 NLM ISO Abbreviation: Sci Data Subsets: MEDLINE
Imprint Name(s):
Original Publication: London : Nature Publishing Group, 2014-
Comments:
Update of: bioRxiv. 2025 Jul 24:2025.07.17.663965. doi: 10.1101/2025.07.17.663965.. (PMID: 40777235)
References:
Sci Data. 2025 Dec 16;12(1):1988. (PMID: 41402353)
Nat Neurosci. 2025 Jan;28(1):201-212. (PMID: 39414974)
J Cereb Blood Flow Metab. 2025 Nov 14;:271678X251386182. (PMID: 41235725)
PLoS One. 2011;6(6):e21101. (PMID: 21738610)
Neuron. 2023 Nov 15;111(22):3531-3540.e6. (PMID: 37659408)
Neuron. 2015 Nov 18;88(4):629-34. (PMID: 26590340)
Nat Neurosci. 2025 Jul;28(7):1497-1509. (PMID: 40500314)
Nat Methods. 2024 May;21(5):914-921. (PMID: 38589517)
Nat Neurosci. 2024 Mar;27(3):547-560. (PMID: 38238431)
Elife. 2025 Aug 01;14:. (PMID: 40747785)
Nature. 2023 May;617(7960):360-368. (PMID: 37138088)
Nat Hum Behav. 2025 Feb;9(2):305-315. (PMID: 39604572)
Elife. 2023 Jul 11;12:. (PMID: 37432073)
Elife. 2022 Oct 04;11:. (PMID: 36193886)
Neuron. 2023 Dec 6;111(23):3710-3715. (PMID: 37944519)
J Neurosci. 2025 May 7;45(19):. (PMID: 40180571)
iScience. 2025 Feb 26;28(3):112104. (PMID: 40129704)
Grant Information:
R24 MH117295 United States MH NIMH NIH HHS; U24 NS120057 United States NS NINDS NIH HHS; U24NS120057 U.S. Department of Health & Human Services | National Institutes of Health (NIH); 1R24MH117295 U.S. Department of Health & Human Services | National Institutes of Health (NIH)
Entry Date(s):
Date Created: 20251216 Date Completed: 20251223 Latest Revision: 20251229
Update Code:
20260130
PubMed Central ID:
PMC12727687
DOI:
10.1038/s41597-025-06285-x
PMID:
41402353
Database:
MEDLINE

*Further Information*

*The DANDI Archive is a key resource for sharing open neurophysiology data, hosting over 400 datasets in the Neurodata Without Borders (NWB) format. While these datasets hold tremendous potential for reanalysis and discovery, many researchers face barriers to reuse, including unfamiliarity with access methods and difficulty identifying relevant content. Here we introduce an AI-powered, agentic chat assistant and a notebook generation pipeline. The chat assistant serves as an interactive tool for exploring DANDI datasets. It leverages large language models (LLMs) and integrates with agentic tools to guide users through data access, visualization, and preliminary analysis. The notebook generator analyzes dataset structure with minimal human input, executing inspection scripts and generating visualizations. It then produces an instructional Python notebook tailored to the dataset. We applied this system to 12 recent datasets. Review by neurophysiology data specialists found the generated notebooks to be generally accurate and well-structured, with most notebooks rated as "very helpful." This work demonstrates how AI can support FAIR principles by leveraging data standards and lowering barriers to data reuse and engagement.
(© 2025. The Author(s).)*

*Competing interests: Benjamin Dichter is the Founder and CEO of CatalystNeuro, a consulting company that specializes in open science in neurophysiology. The other three authors declare no competing interests.*