Building a data pipeline

Overview


What you’ll build

A pipeline demonstrating how to:
- download data (e.g. morphologies) from a remote source (e.g. Allen Cell Types Database)
- register the downloaded data in a knowledge graph within Blue Brain Nexus
- register the dataset provenance in the knowledge graph
- build a dataset made of entities of interest (e.g. a set of morphologies from a given species, brain region) and register it in Blue Brain Nexus with an immutable identifier
- run analysis on the built dataset (e.g. topological analysis of neuronal morphologies using the Topological Morphology Descriptor)
- register the analysis results (plots, figures) in the knowledge graph and record their provenance
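
The steps above can be sketched in plain Python to show how the pieces relate. This is only an illustration under stated assumptions, not the tutorial's code and not the Blue Brain Nexus API: the function names (make_entity, build_dataset, record_analysis), the example URLs, and the content-hash identifier are all hypothetical, and the property names are borrowed from the W3C PROV and schema.org vocabularies. Records are plain dictionaries rather than resources registered in a knowledge graph.

```python
import hashlib
import json
from datetime import datetime, timezone


def make_entity(name, species, brain_region, source_url):
    """Describe one downloaded morphology as a plain metadata record."""
    return {
        "@type": "NeuronMorphology",
        "name": name,
        "species": species,
        "brainRegion": brain_region,
        # Provenance: where the file was downloaded from.
        "wasDerivedFrom": source_url,
    }


def build_dataset(entities, species, brain_region):
    """Select entities of interest and derive a content-based identifier."""
    members = [e for e in entities
               if e["species"] == species and e["brainRegion"] == brain_region]
    # Sort members so the identifier does not depend on input order.
    members.sort(key=lambda e: e["name"])
    # Assumption: a SHA-256 hash of the canonical JSON stands in for an
    # immutable identifier; the same members always yield the same id.
    canonical = json.dumps(members, sort_keys=True).encode("utf-8")
    dataset_id = "urn:sha256:" + hashlib.sha256(canonical).hexdigest()
    return {"@id": dataset_id, "@type": "Dataset", "hasPart": members}


def record_analysis(dataset, result_file, software):
    """Link an analysis result to the dataset it was computed from."""
    return {
        "@type": "Entity",
        "name": result_file,
        "wasGeneratedBy": {
            "@type": "Activity",
            "used": dataset["@id"],
            "wasAssociatedWith": software,
            "endedAtTime": datetime.now(timezone.utc).isoformat(),
        },
    }


# Hypothetical example values, for illustration only.
entities = [
    make_entity("cell-1", "Mus musculus", "VISp", "https://example.org/cell-1.swc"),
    make_entity("cell-2", "Homo sapiens", "MTG", "https://example.org/cell-2.swc"),
]
dataset = build_dataset(entities, "Mus musculus", "VISp")
result = record_analysis(dataset, "persistence_diagram.png", "hypothetical-analysis-tool")
print(json.dumps(result, indent=2))
```

Deriving the identifier from the dataset's content means any change to its members produces a different identifier, so an analysis result that points at a given identifier always refers to exactly the data it was computed from.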

What you’ll learn

How to leverage a knowledge graph as part of a data analysis pipeline to store and manage (meta)data, search for data, and record data provenance.

What you’ll need

Python 3.5 or higher with Jupyter Notebook support.

Get the tutorial code

This tutorial code is available:

on GitHub
on Google Colab
