Open Connectome Project: reverse engineering the brain one synapse at a time
Joshua Vogelstein (Johns Hopkins University), Eric Perlman (Johns Hopkins University), Davi Bock (Janelia Farm Research Campus), Wei-Chung Lee (Harvard University), Bobby Kasthuri (Harvard University), Jeff Lichtman (Harvard University), Clay Reid (Harvard University), Albert Cardona (Institute of Neuroinformatics), Randal Burns (Johns Hopkins University), R. Jacob Vogelstein (Johns Hopkins University Applied Physics Laboratory)
Clearly, while even collecting this type of data is an enormous task, interpreting and analyzing the data is far more difficult. It is infeasible to annotate this volume of data manually, and probably impractical to assume that any one group will devise a perfect automated solution. We are therefore working to provide universal access to this type of data via web services hosted at http://openconnectomeproject.org. More specifically, we are developing tools for both human (visualization) and computer (application programming interface, or API) access to the data. Granting global access will enable the largest possible community of image processing and machine learning experts to investigate the data and develop algorithms to annotate it. Unlike standard crowdsourcing endeavors, we aim to compile efforts from a variety of machine annotators, as opposed to human annotators, an approach we have dubbed “alg-sourcing” (for algorithm outsourcing). As different groups tackle different aspects of the problem with different approaches, we intend to aggregate the results and share the collective output, building towards our long-term vision of a fully-annotated cortical volume.
Our project is being initialized with two datasets: (1) a 12 TB dataset from Bock et al. described above, and (2) a >600 GB dataset from Kasthuri and Lichtman (unpublished; spatial resolution: 3 x 3 x 29 cubic nanometers). Panning, zooming, and manual annotation are made possible via a web-based graphical user interface called CATMAID (Saalfield et al, 2009). An API for two-dimensional analysis of the data, including downloading arbitrary image planes and uploading planar annotations to the shared repository are in progress. An additional server for three-dimensional representation of the data is being built, along with an API for downloading volumes and uploading volumetric annotations. Graphics processor unit (GPU)-enabled software will allow for visualizing arbitrary rotations of the data in three dimensions, overlaid with the annotations. All of the services are designed to scale up to petabytes and beyond, and all of the code we develop will be released as open source.
In conclusion, the Open Connectome Project is gearing up for massive polyscience, i.e. science collectively conducted by a large group of individuals. This marks a radical departure from the typical scientific workflow, in which raw data are kept local until results are released, and will hopefully usher in a new era of understanding about the brain.