NBLAST against FlyCircuit¶

If you work with Drosophila, chances are you have heard of FlyCircuit. It’s a collection of ~16k single neuron clones published by Chiang et al. (2010).

For R, there is a package containing dotprops and meta data for these neurons: nat.flycircuit. This does not yet exist for Python. However, it’s still reasonably straight forward to run an NBLAST against the entire flycircuit dataset in Python:

First you need to download flycircuit dotprops from Zenodo (850Mb). Next, unzip the archive containing the dotprops as CSV files. Now we can load them into navis (adjust filepaths as required):

import navis
import pandas as pd

from pathlib import Path
from tqdm import tqdm

def load_dotprops_csv(fp):
    """Load dotprops from CSV files in filepath `fp`."""
    # Turn into a path object
    fp = Path(fp).expanduser()

    # Go over each CSV file
    files = list(fp.glob('*.csv'))
    dotprops = []
    for f in tqdm(files, desc='Loading dotprops', leave=False):
        # Read file
        csv = pd.read_csv(f)

        # Each row is a point with associated vector
        pts = csv[['pt_x', 'pt_y', 'pt_z']].values
        vect = csv[['vec_x', 'vec_y', 'vec_z']].values

        # Turn into a Dotprops
        dp = navis.Dotprops(points=pts, k=20, vect=vect, units='1 micron')

        # Use filename as ID/name
        dp.name = dp.id = f.name[:-4]

        # Add this dotprop to the list before moving on to the next
        dotprops.append(dp)

    return navis.NeuronList(dotprops)

fc_dps = load_dotprops_csv('~/Downloads/flycircuit_dotprops')
fc_dps

<class 'navis.core.neuronlist.NeuronList'> containing 16129 neurons (est. 1.4GiB)

	type	name	id	k	units	n_points
0	navis.Dotprops	DvGlutMARCM-F1822_seg2	DvGlutMARCM-F1822_seg2	20	1 micrometer	87
1	navis.Dotprops	DvGlutMARCM-F002216_seg001	DvGlutMARCM-F002216_seg001	20	1 micrometer	2574
...	...	...	...	...	...	...
16127	navis.Dotprops	GadMARCM-F000020_seg001	GadMARCM-F000020_seg001	20	1 micrometer	477
16128	navis.Dotprops	DvGlutMARCM-F1720_seg1	DvGlutMARCM-F1720_seg1	20	1 micrometer	941

To avoid having to re-generate these dotprops, you could consider pickling them:

In the future you can then reload the dotprops like so (much faster):

Note: The names/ids correspond to the flycircuit gene + clone names!

fc_dps[0]


type	navis.Dotprops
name	FruMARCM-M002262_seg001
id	FruMARCM-M002262_seg001
k	5
units	1 micrometer
n_points	4121

In case your query neurons are in a different brain space, you can use navis-flybrains to convert them to flycircuit’s FCWB space.

For demonstration purposes we will use the example neurons - olfactory DA1 projection neurons from the hemibrain connectome - that ship with navis.

# Load some of the example neurons
n = navis.example_neurons(3)

# Convert from hemibrain (JRCFIB2018Fraw) to FCWB space
import flybrains

n_fcwb = navis.xform_brain(n, source='JRCFIB2018Fraw', target='FCWB')

Transform path: JRCFIB2018Fraw -> JRCFIB2018F -> JRCFIB2018Fum -> JRC2018F -> FCWB

# A sanity check to make sure the transform worked
fig, ax = navis.plot2d([n_fcwb, flybrains.FCWB])

# Rotate to front view
ax.elev = -90

# Zoom in a little
ax.dist = 4

# Mirror neurons from the right to the left
# (FlyCircuit neurons are supposed to all be on the left)
n_fcwb_mirr = navis.mirror_brain(n_fcwb, template='FCWB')

# Convert to dotprops
n_dps = navis.make_dotprops(n_fcwb_mirr, resample=1, k=None)

# Run a "smart" NBLAST to get the top hits
# (ignore the warning about data not being in microns - that's a rounding error from the transform)
scores = navis.nblast_smart(query=n_dps, target=fc_dps, scores='mean', progress=False)

scores.head()

	FruMARCM-M002262_seg001	FruMARCM-M002262_seg002	FruMARCM-M002278_seg001	FruMARCM-M002287_seg001	FruMARCM-M002507_seg001	FruMARCM-M002522_seg001	FruMARCM-M002578_seg001	FruMARCM-M002578_seg002	FruMARCM-M002579_seg001	FruMARCM-M002579_seg002	...	DvGlutMARCM-F594-X2_seg1	DvGlutMARCM-F594-X2_seg2	DvGlutMARCM-F595_seg1	DvGlutMARCM-F596_seg1	DvGlutMARCM-F597_seg1	DvGlutMARCM-F598_seg1	DvGlutMARCM-F599_seg1	DvGlutMARCM-F599_seg2	DvGlutMARCM-F600-X2_seg1	DvGlutMARCM-F600-X2_seg2
1734350788	-0.260213	-0.200286	-0.645926	-0.639277	-0.663507	-0.451059	-0.229996	-0.604237	-0.705247	-0.307929	...	-0.881092	-0.648820	-0.660410	0.118524	-0.315490	-0.671891	-0.499438	-0.882007	-0.882510	-0.880825
1734350908	-0.267769	-0.191842	-0.655194	-0.623617	-0.679471	-0.459428	-0.233066	-0.607806	-0.715372	-0.310166	...	-0.881365	-0.654605	-0.669929	0.113844	-0.316001	-0.682929	-0.505106	-0.881421	-0.881361	-0.882250
722817260	-0.270642	-0.170228	-0.680244	-0.667571	-0.670850	-0.431723	-0.236943	-0.634743	-0.704928	-0.298078	...	-0.881398	-0.649228	-0.659683	0.132546	-0.317336	-0.627997	-0.503828	-0.882124	-0.881570	-0.881419

3 rows × 16129 columns

# Find the top hits for each of our query neurons
import numpy as np

for dp in n_dps:
    hit = scores.columns[np.argmax(scores.loc[dp.id].values)]
    sc = scores.loc[dp.id].values.max()
    print(f'Top hit for {dp.id}: {hit} ({sc:.3f})')

Top hit for 1734350788: FruMARCM-F001496_seg001 (0.778)
Top hit for 1734350908: FruMARCM-F001496_seg001 (0.744)
Top hit for 722817260: FruMARCM-F001496_seg001 (0.792)

Conveniently all of our query neurons have the same top match (they are all from the same cell type after all): FruMARCM-F001496_seg001

# Let's co-visualize:
# Queries in red, hit in black
fig, ax = navis.plot2d([n_fcwb_mirr, fc_dps.idx['FruMARCM-F001496_seg001']],
                       color=['r'] * len(n_fcwb_mirr) + ['k'])
ax.elev = -90

../../_images/nblast_flycircuit_16_1.png

Looking good!