Frequently Asked Questions


Data Access

The MICrONS dataset is massive. The imagery and segmentation are on the order of petabytes. There are hundreds of millions of synapses, and even downloading a static table of the synaptic conenctivity is tens of gigabytes.

In general, you do not want to download the entire dataset.

Instead, we have tools that allow you to visualize, locate, and download only the data you are most interested in.

For downloading the imagery, segmentation, and meshes, you will use the the python package cloud-volume. CloudVolume is a serverless Python client for random access reading and writing of Neuroglancer volumes in “Precomputed” format, a set of representations for arbitrarily large volumetric images, meshes, and skeletons. See Download Imagery and Segmentation and Download Meshes

For downloading neuron skeletons in .swc format, see Download Skeletons.

For connectivity information, you can use the brower-based Connectivity Viewer which will: access the connectivity for cells of interest, generate neuroglancer links for interactive visualization, and generate connectivity summaries on demand. For large scale connectivity analysis (between many cells, or populations of cells), you will want to access the data programmatically. See the Python tools quickstart notebooks, specifically CAVE Query Synaptic Connectivity.

For all other annotations – such as cell types, synapse compartment types, and axon myelination – you will want to access the annotation tables programmatically through the python tool CAVEclient. See the Python tools quickstart notebooks, specifically CAVE Query Cell Types and CAVE Query Proofread Cells for practical examples.

If you really do still want to download the entire dataset:

We have bulk downloads available from our cloud service providers. See:

  • Static Repositories for a list of available data, sizes, and cloud paths. This includes flat segmentation, meshes, or skeletons at some major versions.
  • Bossdb repository for MICrONS aligned volume data
  • DANDI for functional NWB files

Directly downloading these resources may speed up processing time, and you can point cloud-volume to a local filepath just as easily as a cloud source.

Browser-based method

You can use the Connectivity Viewer to: access the connectivity for cells of interest, generate neuroglancer links for interactive visualization, and generate connectivity summaries on demand.

Programmatic access

For large scale connectivity analysis (between many cells, or populations of cells), you will want to access the data programmatically. See the Python tools quickstart notebooks, specifically CAVE Query Synaptic Connectivity.

Bulk download of connectivity grpah

Important

This connectivity table represents the original v117 segmentation. Proofreading has improved the accuracy and added millions of high-quality synaptic connections. The programmatic access above is our recommended method to perform connectivity analysis

The synaptic connectivity map at version 117 is available for bulk download from Bossdb. See Static Repositories - Synapse Graph for details.

There are many, many things worth analyzing in the dataset, and a myriad of ways to do that analysis. On this website we host Quickstart tutorials that provide minimal examples of what is possible. We are also expanding the gallery of Analysis Examples. And we outlink to relevant repositories from Publications using the MICrONS dataset.

Consider posting to the Tutorials Forum for feedback and discussion.

We are also available for individual analysis consultation and support. Access begins with submitting a VORTEX request

microns-explorer.org/requests | Scientific Request Form

For any general use of the MICrONS Cubic Millimeter data and resources, please cite the primary flagship publication:

The MICrONS Consortium, J. Alexander Bae, Mahaly Baptiste, Maya R. Baptiste, Caitlyn A. Bishop, Agnes L. Bodor, Derrick Brittain, et al. 2025. “Functional Connectomics Spanning Multiple Areas of Mouse Visual Cortex.” Nature 640 (8058): 435–47. https://doi.org/10.1038/s41586-025-08790-w.

For use of the MICrONS Layer 2/3 dataset (Phase 1), please cite the primary flagship publication

Nicholas L. Turner, Thomas Macrina, J. et al. 2022. “Reconstruction of neocortex: Organelles, compartments, cells, circuits, and activity.” Cell 185 (6): 1082-1100. https://doi.org/10.1016/j.cell.2022.01.023.

For any primary research publication, cite the publication (if released) or preprint (if unreleased). See microns-explorer.org/publications for more.

When integrating our data into a website, cite the source in a manner that is clear, accurate and easily discoverable and link to www.microns-explorer.org. Also, be sure that our Content is never displayed in the absence of such citation. For images obtained from the Sites, use the following citation: www.microns-explorer.org.


Data Curation

The MICrONS dataset is public and open access, ready for analysis, but manual edits to the segmentation continue to improve data quality. The automatic segmentation from EM imagery to 3D reconstruction was largely effective, and the only way to process data at this scale (The MICrONS Consortium et al. 2025). However, due to imaging defects and the nature of thin, branching axons, the automated methods do make mistakes that have large impacts on the biological accuracy of the reconstructions.

The data is regularly versioned; that is, a long-term copy of the dataset is made available for users. We highly recommend setting the version or timestamp in your analysis for consistency.

However, even if you do not set the version, there is a lineage graph of changes to the dataset. Meaning, you can find the past version of your cell, annotation table, mesh, skeleton etc. as long as you know the root id of the object you are interested in – or the date at which you performed an analysis.

See the Materialization and Versioning section for detailed examples of how to 1) find your missing cell, and 2) set the version of your analysis so you do not encounter this issue again.

Automated methods for cellular segmentation, synapse detection, and classification are fantastic for working with data at this scale (The MICrONS Consortium et al. 2025). However, each of these processes makes mistakes. We have methods of maunal correction or proofreading to improve the quality of the data.

Proofreading

It is useful to distinguish between two kinds of proofreading errors: splits and merges.

Split errors occur when a process incorrectly appears to stop in the segmentation, but actually could be continued. Merge errors occur when processes from two different segments are incorrectly merged together. These two kinds of errors impact analysis differently. For example for neurons: while split errors reduce the number of correct synaptic connections, merge errors add incorrect connections.

The frequency of errors is roughly related to the size of the process, and thus axons and dendrites have extremely different error profiles. Because they are thicker, dendrites are largely well-segmented, with only a few split errors and most merge errors being with small axons rather than other dendrites.

Segmentation errors are also more common in places where the image quality was lower, or the image sections interupted. Such as at the EM volume boundaries (pial surface, white matter, and lateral edges.

We provide ongoing proofreading to the dataset through the VORTEX program. We also support and train scientists who want to make their own proofreading edits to the dataset. For either object, see VORTEX below

Virtual Observatory of the Cortex VORTEX

The VORTEX program is a grant funded mechanism designed to assist you in precisely this situation. We provide the labor and experience to make corrections and annotations in the data. The results of these efforts will be integrated with an evolving version of the dataset, and shared with you and the larger scientific community.

Depending on the scale of errors in your cells of interest, we may provide:

  1. analysis support to interpret the data despite the errors
  2. limited correction of small errors
  3. large-scale proofreading support to answer scientific questions
  4. provide you direct proofreading access to the dataset and training in how to make edits

Any of these solutions begins with submitting a VORTEX request

microns-explorer.org/requests | Scientific Request Form

Automated classifiers

Automated classifiers such as the synapse detection and cell type tables can also make mistakes. In practice, it is easiest to exclude data from your analysis if it does not match expectations or quality metrics. If you have specific feedback about performance of the automated classifiers, reach out of the maintainers of the Annotation table in question. This information is included in the table metadata

client.materialize.tables.get_table_metadata(table_name)
Back to top

References

The MICrONS Consortium, J. Alexander Bae, Mahaly Baptiste, Maya R. Baptiste, Caitlyn A. Bishop, Agnes L. Bodor, Derrick Brittain, et al. 2025. “Functional Connectomics Spanning Multiple Areas of Mouse Visual Cortex.” Nature 640 (8058): 435–47. https://doi.org/10.1038/s41586-025-08790-w.