diff --git a/Contribute-Docs.md b/Contribute-Docs.md
new file mode 100644
index 000000000..58e39bc8a
--- /dev/null
+++ b/Contribute-Docs.md
@@ -0,0 +1,27 @@
+Contributions towards documentations and examples are always welcome!
+
+Documentation is built using [Sphinx.](https://www.sphinx-doc.org/en/master/). If you've ever seen any docs over at [readthedocs](https://readthedocs.org/) then you've likely seen some examples of sphinx out in the wild.
+
+## Build the Documentation
+
+These instructions require that you have docker installed. The best way to do that is to follow the installation instructions at [Get Docker](https://docs.docker.com/get-docker/). The upside to this is that you don't need to clobber any existing conda environments in order to build your docs.
+
+You don't need to have any particular understanding of docker to run these commands. We are treating the docker image as a shell.
+
+```
+docker build -t sphinx-sgkit -f docs/Dockerfile .
+cd docs
+docker run --rm -i -v "$(pwd):/docs" sphinx-sgkit make clean html
+```
+
+## Serve the Documentation
+
+Now that we've run built the docs let's view them in their native html state.
+
+```
+# You can use any port you'd like instead of 8080
+docker run -p 8080:80 -v "$(pwd)/_build/html:/usr/share/nginx/html:ro"  nginx
+```
+
+Now open up localhost:8080 in your browser and you'll see the docs just as they appear on the docs website.
+
diff --git a/docs/Dockerfile b/docs/Dockerfile
new file mode 100644
index 000000000..75879ea91
--- /dev/null
+++ b/docs/Dockerfile
@@ -0,0 +1,19 @@
+FROM continuumio/miniconda3
+
+RUN apt-get update \
+ && apt-get install --no-install-recommends -y \
+      graphviz \
+      imagemagick \
+      make \
+      git \
+ && apt-get autoremove \
+ && apt-get clean \
+ && rm -rf /var/lib/apt/lists/*
+
+WORKDIR /docs
+ADD requirements-dev.txt /docs/
+
+RUN conda install -y -c conda-forge scikit-allel sphinx nbsphinx pip pandoc && \
+        pip3 install -r requirements-dev.txt && \
+        pip3 install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc
+
diff --git a/docs/examples.rst b/docs/examples.rst
new file mode 100644
index 000000000..1315aa546
--- /dev/null
+++ b/docs/examples.rst
@@ -0,0 +1,12 @@
+Examples
+========
+
+Understanding the Xarray Genotype Call Dataset
+**********************************************
+
+.. toctree::
+    :maxdepth: 1
+
+    examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-VCF
+    examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-SGKit-Zarr
+    examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-Minimal-Numpy-Example
diff --git a/docs/examples/notebooks/Genotype-Call-Dataset-From-SGKit-Zarr.ipynb b/docs/examples/notebooks/Genotype-Call-Dataset-From-SGKit-Zarr.ipynb
new file mode 100644
index 000000000..6e63cc5a1
--- /dev/null
+++ b/docs/examples/notebooks/Genotype-Call-Dataset-From-SGKit-Zarr.ipynb
@@ -0,0 +1,1153 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Load From Malaria Gen Zarr\n",
+    "\n",
+    "A central point to the SGkit API is the Genotype Call Dataset. This is the data structure that most of the other functions use. It uses [Xarray](http://xarray.pydata.org/en/stable/) underneath the hood to give a programmatic interface that allows for the backend to be several different data files.\n",
+    "\n",
+    "The Xarray itself is *sort of* a transposed VCF file.\n",
+    "\n",
+    "For this example we are going to from the preprocessed zarr to the sgkit Genotype Call XArray Dataset.\n",
+    "\n",
+    "This is only meant to demonstrate the datatypes that we feed into the Xarray dataset. For a more conceptual understanding please check out the `Genotype-Call-Dataset-From-VCF.ipynb`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import numpy as np\n",
+    "import zarr\n",
+    "import pandas as pd\n",
+    "import dask.array as da\n",
+    "import allel\n",
+    "from pprint import pprint\n",
+    "import matplotlib.pyplot as plt\n",
+    "%matplotlib inline"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Create a Dask Cluster\n",
+    "\n",
+    "This isn't that important for this example, but SGkit can use Dask under the hood for many of it's calculations. Divide and conquer your statistical genomics data!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "60ad30bcd7044d6fb7f8fd803c140a26",
+       "version_major": 2,
+       "version_minor": 0
+      },
+      "text/plain": [
+       "VBox(children=(HTML(value='<h2>KubeCluster</h2>'), HBox(children=(HTML(value='\\n<div>\\n  <style scoped>\\n    .…"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    }
+   ],
+   "source": [
+    "from dask_kubernetes import KubeCluster\n",
+    "cluster = KubeCluster(n_workers=30, silence_logs='error')\n",
+    "cluster"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Import sgkit"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Collecting git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc\n",
+      "  Cloning https://github.com/pystatgen/sgkit (to revision 96203d471531e7e2416d4dd9b48ca11d660a1bcc) to /tmp/pip-req-build-7iudp4iv\n",
+      "  Running command git clone -q https://github.com/pystatgen/sgkit /tmp/pip-req-build-7iudp4iv\n",
+      "  Running command git checkout -q 96203d471531e7e2416d4dd9b48ca11d660a1bcc\n",
+      "Requirement already satisfied (use --upgrade to upgrade): sgkit==0.1.dev67+g96203d4 from git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc in /opt/conda/lib/python3.7/site-packages\n",
+      "Requirement already satisfied: numpy in /opt/conda/lib/python3.7/site-packages (from sgkit==0.1.dev67+g96203d4) (1.18.4)\n",
+      "Requirement already satisfied: xarray in /opt/conda/lib/python3.7/site-packages (from sgkit==0.1.dev67+g96203d4) (0.15.1)\n",
+      "Requirement already satisfied: setuptools>=41.2 in /opt/conda/lib/python3.7/site-packages (from sgkit==0.1.dev67+g96203d4) (47.1.1.post20200529)\n",
+      "Requirement already satisfied: pandas>=0.25 in /opt/conda/lib/python3.7/site-packages (from xarray->sgkit==0.1.dev67+g96203d4) (1.0.4)\n",
+      "Requirement already satisfied: python-dateutil>=2.6.1 in /opt/conda/lib/python3.7/site-packages (from pandas>=0.25->xarray->sgkit==0.1.dev67+g96203d4) (2.8.1)\n",
+      "Requirement already satisfied: pytz>=2017.2 in /opt/conda/lib/python3.7/site-packages (from pandas>=0.25->xarray->sgkit==0.1.dev67+g96203d4) (2020.1)\n",
+      "Requirement already satisfied: six>=1.5 in /opt/conda/lib/python3.7/site-packages (from python-dateutil>=2.6.1->pandas>=0.25->xarray->sgkit==0.1.dev67+g96203d4) (1.15.0)\n",
+      "Building wheels for collected packages: sgkit\n",
+      "  Building wheel for sgkit (setup.py) ... \u001b[?25ldone\n",
+      "\u001b[?25h  Created wheel for sgkit: filename=sgkit-0.1.dev67+g96203d4-py3-none-any.whl size=19421 sha256=76ddd164160ed34beee7e6e8f6f0bde32b36b898074de2a50e0e1ce64f228d70\n",
+      "  Stored in directory: /home/jovyan/.cache/pip/wheels/6f/2b/6e/48d20c382bb6a66ea96c6dee6e6e575ea88180fef1e96a9024\n",
+      "Successfully built sgkit\n"
+     ]
+    }
+   ],
+   "source": [
+    "! pip install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Help on function create_genotype_call_dataset in module sgkit.api:\n",
+      "\n",
+      "create_genotype_call_dataset(*, variant_contig_names: List[str], variant_contig: Any, variant_position: Any, variant_alleles: Any, sample_id: Any, call_genotype: Any, call_genotype_phased: Any = None, variant_id: Any = None) -> xarray.core.dataset.Dataset\n",
+      "    Create a dataset of genotype calls.\n",
+      "    \n",
+      "    Parameters\n",
+      "    ----------\n",
+      "    variant_contig_names : list of str\n",
+      "        The contig names.\n",
+      "    variant_contig : array_like, int\n",
+      "        The (index of the) contig for each variant.\n",
+      "    variant_position : array_like, int\n",
+      "        The reference position of the variant.\n",
+      "    variant_alleles : array_like, S1\n",
+      "        The possible alleles for the variant.\n",
+      "    sample_id : array_like, str\n",
+      "        The unique identifier of the sample.\n",
+      "    call_genotype : array_like, int\n",
+      "        Genotype, encoded as allele values (0 for the reference, 1 for\n",
+      "        the first allele, 2 for the second allele), or -1 to indicate a\n",
+      "        missing value.\n",
+      "    call_genotype_phased : array_like, bool, optional\n",
+      "        A flag for each call indicating if it is phased or not. If\n",
+      "        omitted all calls are unphased.\n",
+      "    variant_id: array_like, str, optional\n",
+      "        The unique identifier of the variant.\n",
+      "    \n",
+      "    Returns\n",
+      "    -------\n",
+      "    xr.Dataset\n",
+      "        The dataset of genotype calls.\n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "import sgkit\n",
+    "help(sgkit.api.create_genotype_call_dataset)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Get the Malaria Gen Zarr Data"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "The [zarr](https://zarr.readthedocs.io/en/stable) data is hosted in a google cloud bucket, or available for download from the public FTP site."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import gcsfs\n",
+    "\n",
+    "gcs_bucket_fs = gcsfs.GCSFileSystem(project='malariagen-jupyterhub', token='anon', access='read_only')\n",
+    "\n",
+    "storage_path = 'ag1000g-release/phase2.AR1/variation/main/zarr/pass/ag1000g.phase2.ar1.pass'\n",
+    "store = gcsfs.mapping.GCSMap(storage_path, gcs=gcs_bucket_fs, check=False, create=False)\n",
+    "callset = zarr.Group(store)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "If you explore the zarr data you will see that it is mostly the VCF data, with a few fields pre calculated for convenience."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "<zarr.core.Array '/samples' (1142,) object>\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(callset['samples'])"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "3R\n",
+      " ├── calldata\n",
+      " │   └── GT (14481509, 1142, 2) int8\n",
+      " ├── samples (1142,) object\n",
+      " └── variants\n",
+      "     ├── ABHet (14481509,) float32\n",
+      "     ├── ABHom (14481509,) float32\n",
+      "     ├── AC (14481509, 3) int32\n",
+      "     ├── AF (14481509, 3) float32\n",
+      "     ├── ALT (14481509, 3) |S1\n",
+      "     ├── AN (14481509,) int32\n",
+      "     ├── Accessible (14481509,) bool\n",
+      "     ├── BaseCounts (14481509, 4) int32\n",
+      "     ├── BaseQRankSum (14481509,) float32\n",
+      "     ├── Coverage (14481509,) int32\n",
+      "     ├── CoverageMQ0 (14481509,) int32\n",
+      "     ├── DP (14481509,) int32\n",
+      "     ├── DS (14481509,) bool\n",
+      "     ├── Dels (14481509,) float32\n",
+      "     ├── FILTER_BaseQRankSum (14481509,) bool\n",
+      "     ├── FILTER_FS (14481509,) bool\n",
+      "     ├── FILTER_HRun (14481509,) bool\n",
+      "     ├── FILTER_HighCoverage (14481509,) bool\n",
+      "     ├── FILTER_HighMQ0 (14481509,) bool\n",
+      "     ├── FILTER_LowCoverage (14481509,) bool\n",
+      "     ├── FILTER_LowMQ (14481509,) bool\n",
+      "     ├── FILTER_LowQual (14481509,) bool\n",
+      "     ├── FILTER_NoCoverage (14481509,) bool\n",
+      "     ├── FILTER_PASS (14481509,) bool\n",
+      "     ├── FILTER_QD (14481509,) bool\n",
+      "     ├── FILTER_ReadPosRankSum (14481509,) bool\n",
+      "     ├── FILTER_RefN (14481509,) bool\n",
+      "     ├── FILTER_RepeatDUST (14481509,) bool\n",
+      "     ├── FS (14481509,) float32\n",
+      "     ├── HRun (14481509,) int32\n",
+      "     ├── HW (14481509,) float32\n",
+      "     ├── HaplotypeScore (14481509,) float32\n",
+      "     ├── HighCoverage (14481509,) int32\n",
+      "     ├── HighMQ0 (14481509,) int32\n",
+      "     ├── InbreedingCoeff (14481509,) float32\n",
+      "     ├── LowCoverage (14481509,) int32\n",
+      "     ├── LowMQ (14481509,) int32\n",
+      "     ├── LowPairing (14481509,) int32\n",
+      "     ├── MLEAC (14481509, 3) int32\n",
+      "     ├── MLEAF (14481509, 3) float32\n",
+      "     ├── MQ (14481509,) float32\n",
+      "     ├── MQ0 (14481509,) int32\n",
+      "     ├── MQRankSum (14481509,) float32\n",
+      "     ├── NDA (14481509,) int32\n",
+      "     ├── NoCoverage (14481509,) int32\n",
+      "     ├── OND (14481509,) float32\n",
+      "     ├── POS (14481509,) int32\n",
+      "     ├── QD (14481509,) float32\n",
+      "     ├── QUAL (14481509,) float32\n",
+      "     ├── REF (14481509,) |S1\n",
+      "     ├── RPA (14481509,) int32\n",
+      "     ├── RU (14481509,) object\n",
+      "     ├── ReadPosRankSum (14481509,) float32\n",
+      "     ├── RefMasked (14481509,) bool\n",
+      "     ├── RefN (14481509,) bool\n",
+      "     ├── RepeatDUST (14481509,) bool\n",
+      "     ├── RepeatMasker (14481509,) bool\n",
+      "     ├── RepeatTRF (14481509,) bool\n",
+      "     ├── STR (14481509,) bool\n",
+      "     ├── VariantType (14481509,) object\n",
+      "     ├── altlen (14481509, 3) int32\n",
+      "     ├── is_snp (14481509,) bool\n",
+      "     └── numalt (14481509,) int32\n"
+     ]
+    }
+   ],
+   "source": [
+    "chrom = '3R'\n",
+    "print(callset[chrom].tree())"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Get the Call Data"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div class=\"allel allel-DisplayAs2D\"><span>&lt;GenotypeChunkedArray shape=(14481509, 1142, 2) dtype=int8 chunks=(524288, 61, 2)\n",
+       "   nbytes=30.8G cbytes=-1 cratio=-33075766556.0\n",
+       "   compression=blosc compression_opts={'cname': 'zstd', 'clevel': 1, 'shuffle': -1, 'blocksize': 0}\n",
+       "   values=zarr.core.Array&gt;</span><table><thead><tr><th></th><th style=\"text-align: center\">0</th><th style=\"text-align: center\">1</th><th style=\"text-align: center\">2</th><th style=\"text-align: center\">3</th><th style=\"text-align: center\">4</th><th style=\"text-align: center\">...</th><th style=\"text-align: center\">1137</th><th style=\"text-align: center\">1138</th><th style=\"text-align: center\">1139</th><th style=\"text-align: center\">1140</th><th style=\"text-align: center\">1141</th></tr></thead><tbody><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">0</th><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">1</th><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">2</th><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">...</th><td style=\"text-align: center\" colspan=\"12\">...</td></tr><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">14481506</th><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">14481507</th><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr><tr><th style=\"text-align: center; background-color: white; border-right: 1px solid black; \">14481508</th><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr></tbody></table></div>"
+      ],
+      "text/plain": [
+       "<GenotypeChunkedArray shape=(14481509, 1142, 2) dtype=int8 chunks=(524288, 61, 2)\n",
+       "   nbytes=30.8G cbytes=-1 cratio=-33075766556.0\n",
+       "   compression=blosc compression_opts={'cname': 'zstd', 'clevel': 1, 'shuffle': -1, 'blocksize': 0}\n",
+       "   values=zarr.core.Array>"
+      ]
+     },
+     "execution_count": 8,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "chrom = '3R'\n",
+    "calldata = callset[chrom]['calldata']\n",
+    "\n",
+    "# TODO Will this be changed for SGKit?\n",
+    "genotypes = allel.GenotypeChunkedArray(calldata['GT'])\n",
+    "genotypes"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Genotype Chunked Array Data Structure\n",
+    "\n",
+    "When looking at the `allel.GenotypeChunkedArray` we see that we have: GenotypeChunkedArray shape=(14481509, 1142, 2)\n",
+    "\n",
+    "The shape corresponds to `variants`, `samples`, `alleles`.\n",
+    "\n",
+    "For every index of a variant we have the alleles of each of the samples.\n",
+    "\n",
+    "So let's get all the sample data for the first variant."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div class=\"allel allel-DisplayAs1D\"><span>&lt;GenotypeVector shape=(1142, 2) dtype=int8&gt;</span><table><thead><tr><th style=\"text-align: center\">0</th><th style=\"text-align: center\">1</th><th style=\"text-align: center\">2</th><th style=\"text-align: center\">3</th><th style=\"text-align: center\">4</th><th style=\"text-align: center\">...</th><th style=\"text-align: center\">1137</th><th style=\"text-align: center\">1138</th><th style=\"text-align: center\">1139</th><th style=\"text-align: center\">1140</th><th style=\"text-align: center\">1141</th></tr></thead><tbody><tr><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">...</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td><td style=\"text-align: center\">0/0</td></tr></tbody></table></div>"
+      ],
+      "text/plain": [
+       "<GenotypeVector shape=(1142, 2) dtype=int8>\n",
+       "0/0 0/0 0/0 0/0 0/0 ... 0/0 0/0 0/0 0/0 0/0"
+      ]
+     },
+     "execution_count": 9,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "genotypes[0]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "And now let's look at the first variant call for the first sample."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array([0, 0], dtype=int8)"
+      ]
+     },
+     "execution_count": 10,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "genotypes[0][0]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "You can see above that for sample[0] the allele is 0/0, meaning it is homozygous for the reference."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Get the Samples"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "samples = callset['samples']\n",
+    "sample_id = np.array(samples, dtype='U')"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array(['AA0040-C', 'AA0041-C', 'AA0042-C', 'AA0043-C', 'AA0044-C'],\n",
+       "      dtype='<U8')"
+      ]
+     },
+     "execution_count": 12,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "sample_id[0:5]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Grab the Variant Positions\n",
+    "\n",
+    "Get the positions of each variant"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "variant_position = callset[chrom]['variants/POS']"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Let's investigate some of the attributes of our numpy array."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "(14481509,)\n",
+      "i\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(variant_position.shape)\n",
+    "print(variant_position.dtype.kind)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Grab the Reference Alleles\n",
+    "\n",
+    "For each variant we need the reference and the alternate."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "<zarr.core.Array '/3R/variants/REF' (14481509,) |S1>"
+      ]
+     },
+     "execution_count": 15,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "variant_ref = callset[chrom]['variants/REF']\n",
+    "variant_ref"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "<zarr.core.Array '/3R/variants/ALT' (14481509, 3) |S1>"
+      ]
+     },
+     "execution_count": 16,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "variant_alt = callset[chrom]['variants/ALT']\n",
+    "variant_alt"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Now, instead of having 2 separate variant arrays, we want an np array of :\n",
+    "\n",
+    "```python\n",
+    "\n",
+    "[ \n",
+    "    # variant position index\n",
+    "    [ ref, alt ],\n",
+    "]    \n",
+    "```"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# the alternate lists all possible variants. we'll just grab the first, but really we should filter out any variants that aren't biallelic\n",
+    "variant_alleles = np.column_stack((variant_ref, variant_alt[:,0]))\n",
+    "variant_contig = np.zeros(len(variant_alleles))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 18,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])"
+      ]
+     },
+     "execution_count": 18,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "variant_contig[0:10]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 19,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array([[b'A', b'G'],\n",
+       "       [b'A', b'T'],\n",
+       "       [b'T', b'C'],\n",
+       "       [b'G', b'A'],\n",
+       "       [b'T', b'A'],\n",
+       "       [b'A', b'G'],\n",
+       "       [b'G', b'C'],\n",
+       "       [b'C', b'T'],\n",
+       "       [b'C', b'T'],\n",
+       "       [b'G', b'A']], dtype='|S1')"
+      ]
+     },
+     "execution_count": 19,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "variant_alleles[0:10]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Create the Xarray Genotype Callset"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# You can use the dataset_size to create a smaller dataset if you're just exploring\n",
+    "\n",
+    "#dataset_size = len(variant_alleles)\n",
+    "variant_contig_names = [chrom]\n",
+    "call_genotype = genotypes\n",
+    "dataset_size = 10000\n",
+    "variant_contig = np.zeros(dataset_size)\n",
+    "variant_position = variant_position[0:dataset_size]\n",
+    "variant_alleles = variant_alleles[0:dataset_size]\n",
+    "call_genotype = call_genotype[0:dataset_size]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 21,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "genotype_xarray_dataset = sgkit.api.create_genotype_call_dataset(\n",
+    "    variant_contig_names = variant_contig_names,\n",
+    "    # these are all on the 0th contig, because we only have one contig\n",
+    "    variant_contig = np.zeros(len(variant_position), dtype='int'),\n",
+    "    variant_position = variant_position,\n",
+    "    variant_alleles = variant_alleles,\n",
+    "    sample_id = sample_id,\n",
+    "    call_genotype = call_genotype,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 22,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div><svg style=\"position: absolute; width: 0; height: 0; overflow: hidden\">\n",
+       "<defs>\n",
+       "<symbol id=\"icon-database\" viewBox=\"0 0 32 32\">\n",
+       "<title>Show/Hide data repr</title>\n",
+       "<path d=\"M16 0c-8.837 0-16 2.239-16 5v4c0 2.761 7.163 5 16 5s16-2.239 16-5v-4c0-2.761-7.163-5-16-5z\"></path>\n",
+       "<path d=\"M16 17c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z\"></path>\n",
+       "<path d=\"M16 26c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z\"></path>\n",
+       "</symbol>\n",
+       "<symbol id=\"icon-file-text2\" viewBox=\"0 0 32 32\">\n",
+       "<title>Show/Hide attributes</title>\n",
+       "<path d=\"M28.681 7.159c-0.694-0.947-1.662-2.053-2.724-3.116s-2.169-2.030-3.116-2.724c-1.612-1.182-2.393-1.319-2.841-1.319h-15.5c-1.378 0-2.5 1.121-2.5 2.5v27c0 1.378 1.122 2.5 2.5 2.5h23c1.378 0 2.5-1.122 2.5-2.5v-19.5c0-0.448-0.137-1.23-1.319-2.841zM24.543 5.457c0.959 0.959 1.712 1.825 2.268 2.543h-4.811v-4.811c0.718 0.556 1.584 1.309 2.543 2.268zM28 29.5c0 0.271-0.229 0.5-0.5 0.5h-23c-0.271 0-0.5-0.229-0.5-0.5v-27c0-0.271 0.229-0.5 0.5-0.5 0 0 15.499-0 15.5 0v7c0 0.552 0.448 1 1 1h7v19.5z\"></path>\n",
+       "<path d=\"M23 26h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "<path d=\"M23 22h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "<path d=\"M23 18h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "</symbol>\n",
+       "</defs>\n",
+       "</svg>\n",
+       "<style>/* CSS stylesheet for displaying xarray objects in jupyterlab.\n",
+       " *\n",
+       " */\n",
+       "\n",
+       ":root {\n",
+       "  --xr-font-color0: var(--jp-content-font-color0, rgba(0, 0, 0, 1));\n",
+       "  --xr-font-color2: var(--jp-content-font-color2, rgba(0, 0, 0, 0.54));\n",
+       "  --xr-font-color3: var(--jp-content-font-color3, rgba(0, 0, 0, 0.38));\n",
+       "  --xr-border-color: var(--jp-border-color2, #e0e0e0);\n",
+       "  --xr-disabled-color: var(--jp-layout-color3, #bdbdbd);\n",
+       "  --xr-background-color: var(--jp-layout-color0, white);\n",
+       "  --xr-background-color-row-even: var(--jp-layout-color1, white);\n",
+       "  --xr-background-color-row-odd: var(--jp-layout-color2, #eeeeee);\n",
+       "}\n",
+       "\n",
+       ".xr-wrap {\n",
+       "  min-width: 300px;\n",
+       "  max-width: 700px;\n",
+       "}\n",
+       "\n",
+       ".xr-header {\n",
+       "  padding-top: 6px;\n",
+       "  padding-bottom: 6px;\n",
+       "  margin-bottom: 4px;\n",
+       "  border-bottom: solid 1px var(--xr-border-color);\n",
+       "}\n",
+       "\n",
+       ".xr-header > div,\n",
+       ".xr-header > ul {\n",
+       "  display: inline;\n",
+       "  margin-top: 0;\n",
+       "  margin-bottom: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-obj-type,\n",
+       ".xr-array-name {\n",
+       "  margin-left: 2px;\n",
+       "  margin-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-obj-type {\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-sections {\n",
+       "  padding-left: 0 !important;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 150px auto auto 1fr 20px 20px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input + label {\n",
+       "  color: var(--xr-disabled-color);\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input:enabled + label {\n",
+       "  cursor: pointer;\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input:enabled + label:hover {\n",
+       "  color: var(--xr-font-color0);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary {\n",
+       "  grid-column: 1;\n",
+       "  color: var(--xr-font-color2);\n",
+       "  font-weight: 500;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary > span {\n",
+       "  display: inline-block;\n",
+       "  padding-left: 0.5em;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:disabled + label {\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in + label:before {\n",
+       "  display: inline-block;\n",
+       "  content: '►';\n",
+       "  font-size: 11px;\n",
+       "  width: 15px;\n",
+       "  text-align: center;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:disabled + label:before {\n",
+       "  color: var(--xr-disabled-color);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked + label:before {\n",
+       "  content: '▼';\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked + label > span {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary,\n",
+       ".xr-section-inline-details {\n",
+       "  padding-top: 4px;\n",
+       "  padding-bottom: 4px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-inline-details {\n",
+       "  grid-column: 2 / -1;\n",
+       "}\n",
+       "\n",
+       ".xr-section-details {\n",
+       "  display: none;\n",
+       "  grid-column: 1 / -1;\n",
+       "  margin-bottom: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked ~ .xr-section-details {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-array-wrap {\n",
+       "  grid-column: 1 / -1;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 20px auto;\n",
+       "}\n",
+       "\n",
+       ".xr-array-wrap > label {\n",
+       "  grid-column: 1;\n",
+       "  vertical-align: top;\n",
+       "}\n",
+       "\n",
+       ".xr-preview {\n",
+       "  color: var(--xr-font-color3);\n",
+       "}\n",
+       "\n",
+       ".xr-array-preview,\n",
+       ".xr-array-data {\n",
+       "  padding: 0 5px !important;\n",
+       "  grid-column: 2;\n",
+       "}\n",
+       "\n",
+       ".xr-array-data,\n",
+       ".xr-array-in:checked ~ .xr-array-preview {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-array-in:checked ~ .xr-array-data,\n",
+       ".xr-array-preview {\n",
+       "  display: inline-block;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list {\n",
+       "  display: inline-block !important;\n",
+       "  list-style: none;\n",
+       "  padding: 0 !important;\n",
+       "  margin: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list li {\n",
+       "  display: inline-block;\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list:before {\n",
+       "  content: '(';\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list:after {\n",
+       "  content: ')';\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list li:not(:last-child):after {\n",
+       "  content: ',';\n",
+       "  padding-right: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-has-index {\n",
+       "  font-weight: bold;\n",
+       "}\n",
+       "\n",
+       ".xr-var-list,\n",
+       ".xr-var-item {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-var-item > div,\n",
+       ".xr-var-item label,\n",
+       ".xr-var-item > .xr-var-name span {\n",
+       "  background-color: var(--xr-background-color-row-even);\n",
+       "  margin-bottom: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-var-item > .xr-var-name:hover span {\n",
+       "  padding-right: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-var-list > li:nth-child(odd) > div,\n",
+       ".xr-var-list > li:nth-child(odd) > label,\n",
+       ".xr-var-list > li:nth-child(odd) > .xr-var-name span {\n",
+       "  background-color: var(--xr-background-color-row-odd);\n",
+       "}\n",
+       "\n",
+       ".xr-var-name {\n",
+       "  grid-column: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-var-dims {\n",
+       "  grid-column: 2;\n",
+       "}\n",
+       "\n",
+       ".xr-var-dtype {\n",
+       "  grid-column: 3;\n",
+       "  text-align: right;\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-var-preview {\n",
+       "  grid-column: 4;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name,\n",
+       ".xr-var-dims,\n",
+       ".xr-var-dtype,\n",
+       ".xr-preview,\n",
+       ".xr-attrs dt {\n",
+       "  white-space: nowrap;\n",
+       "  overflow: hidden;\n",
+       "  text-overflow: ellipsis;\n",
+       "  padding-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name:hover,\n",
+       ".xr-var-dims:hover,\n",
+       ".xr-var-dtype:hover,\n",
+       ".xr-attrs dt:hover {\n",
+       "  overflow: visible;\n",
+       "  width: auto;\n",
+       "  z-index: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-var-attrs,\n",
+       ".xr-var-data {\n",
+       "  display: none;\n",
+       "  background-color: var(--xr-background-color) !important;\n",
+       "  padding-bottom: 5px !important;\n",
+       "}\n",
+       "\n",
+       ".xr-var-attrs-in:checked ~ .xr-var-attrs,\n",
+       ".xr-var-data-in:checked ~ .xr-var-data {\n",
+       "  display: block;\n",
+       "}\n",
+       "\n",
+       ".xr-var-data > table {\n",
+       "  float: right;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name span,\n",
+       ".xr-var-data,\n",
+       ".xr-attrs {\n",
+       "  padding-left: 25px !important;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs,\n",
+       ".xr-var-attrs,\n",
+       ".xr-var-data {\n",
+       "  grid-column: 1 / -1;\n",
+       "}\n",
+       "\n",
+       "dl.xr-attrs {\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 125px auto;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt, dd {\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "  float: left;\n",
+       "  padding-right: 10px;\n",
+       "  width: auto;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt {\n",
+       "  font-weight: normal;\n",
+       "  grid-column: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt:hover span {\n",
+       "  display: inline-block;\n",
+       "  background: var(--xr-background-color);\n",
+       "  padding-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dd {\n",
+       "  grid-column: 2;\n",
+       "  white-space: pre-wrap;\n",
+       "  word-break: break-all;\n",
+       "}\n",
+       "\n",
+       ".xr-icon-database,\n",
+       ".xr-icon-file-text2 {\n",
+       "  display: inline-block;\n",
+       "  vertical-align: middle;\n",
+       "  width: 1em;\n",
+       "  height: 1.5em !important;\n",
+       "  stroke-width: 0;\n",
+       "  stroke: currentColor;\n",
+       "  fill: currentColor;\n",
+       "}\n",
+       "</style><div class='xr-wrap'><div class='xr-header'><div class='xr-obj-type'>xarray.Dataset</div></div><ul class='xr-sections'><li class='xr-section-item'><input id='section-22eec0db-b831-4dca-b52d-52654f8e4504' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-22eec0db-b831-4dca-b52d-52654f8e4504' class='xr-section-summary'  title='Expand/collapse section'>Dimensions:</label><div class='xr-section-inline-details'><ul class='xr-dim-list'><li><span>alleles</span>: 2</li><li><span>ploidy</span>: 2</li><li><span>samples</span>: 1142</li><li><span>variants</span>: 10000</li></ul></div><div class='xr-section-details'></div></li><li class='xr-section-item'><input id='section-6a87421e-9ebf-436e-9bde-ae5575a0daf3' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-6a87421e-9ebf-436e-9bde-ae5575a0daf3' class='xr-section-summary'  title='Expand/collapse section'>Coordinates: <span>(0)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'></ul></div></li><li class='xr-section-item'><input id='section-31ae6a07-816e-4a23-96ac-0ff515452c9e' class='xr-section-summary-in' type='checkbox'  checked><label for='section-31ae6a07-816e-4a23-96ac-0ff515452c9e' class='xr-section-summary' >Data variables: <span>(6)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'><li class='xr-var-item'><div class='xr-var-name'><span>variant/contig</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int64</div><div class='xr-var-preview xr-preview'>0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0</div><input id='attrs-252c5f9e-51f0-468a-946f-12d0f5d0eb25' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-252c5f9e-51f0-468a-946f-12d0f5d0eb25' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-d5353cb0-2e94-489d-91f8-517a50551886' class='xr-var-data-in' type='checkbox'><label for='data-d5353cb0-2e94-489d-91f8-517a50551886' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([0, 0, 0, ..., 0, 0, 0])</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/position</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>9526 9531 9536 ... 64416 64418</div><input id='attrs-a3f4ac32-650d-43c9-a598-23d0e8cca0c8' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-a3f4ac32-650d-43c9-a598-23d0e8cca0c8' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-009961d4-692f-4330-be05-43eb18d23d3f' class='xr-var-data-in' type='checkbox'><label for='data-009961d4-692f-4330-be05-43eb18d23d3f' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([ 9526,  9531,  9536, ..., 64411, 64416, 64418], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/alleles</span></div><div class='xr-var-dims'>(variants, alleles)</div><div class='xr-var-dtype'>|S1</div><div class='xr-var-preview xr-preview'>b&#x27;A&#x27; b&#x27;G&#x27; b&#x27;A&#x27; ... b&#x27;T&#x27; b&#x27;T&#x27; b&#x27;C&#x27;</div><input id='attrs-c261d2fc-5319-45cf-ab2a-cf8a67b8221d' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-c261d2fc-5319-45cf-ab2a-cf8a67b8221d' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-3f11c281-316b-40e5-9339-6bc398687572' class='xr-var-data-in' type='checkbox'><label for='data-3f11c281-316b-40e5-9339-6bc398687572' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[b&#x27;A&#x27;, b&#x27;G&#x27;],\n",
+       "       [b&#x27;A&#x27;, b&#x27;T&#x27;],\n",
+       "       [b&#x27;T&#x27;, b&#x27;C&#x27;],\n",
+       "       ...,\n",
+       "       [b&#x27;A&#x27;, b&#x27;T&#x27;],\n",
+       "       [b&#x27;G&#x27;, b&#x27;T&#x27;],\n",
+       "       [b&#x27;T&#x27;, b&#x27;C&#x27;]], dtype=&#x27;|S1&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>sample/id</span></div><div class='xr-var-dims'>(samples)</div><div class='xr-var-dtype'>&lt;U8</div><div class='xr-var-preview xr-preview'>&#x27;AA0040-C&#x27; ... &#x27;AY0091-C&#x27;</div><input id='attrs-f2412ad2-05de-4bdb-ad05-202b7efdb85d' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-f2412ad2-05de-4bdb-ad05-202b7efdb85d' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-0e60cdea-4f9b-4d10-9951-4b706b496983' class='xr-var-data-in' type='checkbox'><label for='data-0e60cdea-4f9b-4d10-9951-4b706b496983' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([&#x27;AA0040-C&#x27;, &#x27;AA0041-C&#x27;, &#x27;AA0042-C&#x27;, ..., &#x27;AY0089-C&#x27;, &#x27;AY0090-C&#x27;,\n",
+       "       &#x27;AY0091-C&#x27;], dtype=&#x27;&lt;U8&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>int8</div><div class='xr-var-preview xr-preview'>0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0</div><input id='attrs-b38916c5-7df9-402d-bc61-0d0a22281b38' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-b38916c5-7df9-402d-bc61-0d0a22281b38' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-66bd4645-476e-47e5-8c60-83efaf92da1f' class='xr-var-data-in' type='checkbox'><label for='data-66bd4645-476e-47e5-8c60-83efaf92da1f' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        ...,\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0]],\n",
+       "\n",
+       "       [[0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        ...,\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0]],\n",
+       "\n",
+       "       [[0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        ...,\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0]],\n",
+       "\n",
+       "       ...,\n",
+       "\n",
+       "       [[0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        ...,\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0]],\n",
+       "\n",
+       "       [[0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        ...,\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0]],\n",
+       "\n",
+       "       [[0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        ...,\n",
+       "        [0, 0],\n",
+       "        [0, 0],\n",
+       "        [0, 0]]], dtype=int8)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype_mask</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>bool</div><div class='xr-var-preview xr-preview'>False False False ... False False</div><input id='attrs-02e03dd0-1afb-43f7-a4e6-433e7aca4649' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-02e03dd0-1afb-43f7-a4e6-433e7aca4649' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-bb049653-2f77-4298-a6fa-ed7d44903daa' class='xr-var-data-in' type='checkbox'><label for='data-bb049653-2f77-4298-a6fa-ed7d44903daa' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]],\n",
+       "\n",
+       "       [[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]],\n",
+       "\n",
+       "       [[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]],\n",
+       "\n",
+       "       ...,\n",
+       "\n",
+       "       [[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]],\n",
+       "\n",
+       "       [[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]],\n",
+       "\n",
+       "       [[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]]])</pre></li></ul></div></li><li class='xr-section-item'><input id='section-a86f4a9b-9a3d-446b-9330-e56ed78f2387' class='xr-section-summary-in' type='checkbox'  checked><label for='section-a86f4a9b-9a3d-446b-9330-e56ed78f2387' class='xr-section-summary' >Attributes: <span>(1)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><dl class='xr-attrs'><dt><span>contigs :</span></dt><dd>[&#x27;3R&#x27;]</dd></dl></div></li></ul></div></div>"
+      ],
+      "text/plain": [
+       "<xarray.Dataset>\n",
+       "Dimensions:             (alleles: 2, ploidy: 2, samples: 1142, variants: 10000)\n",
+       "Dimensions without coordinates: alleles, ploidy, samples, variants\n",
+       "Data variables:\n",
+       "    variant/contig      (variants) int64 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0\n",
+       "    variant/position    (variants) int32 9526 9531 9536 ... 64411 64416 64418\n",
+       "    variant/alleles     (variants, alleles) |S1 b'A' b'G' b'A' ... b'T' b'C'\n",
+       "    sample/id           (samples) <U8 'AA0040-C' 'AA0041-C' ... 'AY0091-C'\n",
+       "    call/genotype       (variants, samples, ploidy) int8 0 0 0 0 0 ... 0 0 0 0 0\n",
+       "    call/genotype_mask  (variants, samples, ploidy) bool False False ... False\n",
+       "Attributes:\n",
+       "    contigs:  ['3R']"
+      ]
+     },
+     "execution_count": 22,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "genotype_xarray_dataset"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python [conda env:root] *",
+   "language": "python",
+   "name": "conda-root-py"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.7.6"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
diff --git a/docs/examples/notebooks/Genotype-Call-Dataset-From-VCF.ipynb b/docs/examples/notebooks/Genotype-Call-Dataset-From-VCF.ipynb
new file mode 100644
index 000000000..f8d2160fe
--- /dev/null
+++ b/docs/examples/notebooks/Genotype-Call-Dataset-From-VCF.ipynb
@@ -0,0 +1,1083 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Load From a VCF File Example\n",
+    "\n",
+    "A central point to the SGkit API is the Genotype Call Dataset. This is the data structure that most of the other functions use. It uses [Xarray](http://xarray.pydata.org/en/stable/) underneath the hood to give a programmatic interface that allows for the backend to be several different data files.\n",
+    "\n",
+    "The Xarray itself is *sort of* a transposed VCF file.\n",
+    "\n",
+    "For this particular example we are going to go from a VCF file to the Genotype Call DataSet. \n",
+    "\n",
+    "**Please note that in the real world you *should not* read in your VCF files like this, but instead use the functionality in sgkit to go from a VCF to a Zarr file.** \n",
+    "\n",
+    "We are starting from the VCF file in order to give a conceptual understanding of the data structure itself."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import numpy as np\n",
+    "import zarr\n",
+    "import pandas as pd\n",
+    "import dask.array as da\n",
+    "import allel\n",
+    "from pprint import pprint\n",
+    "import matplotlib.pyplot as plt\n",
+    "%matplotlib inline"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Prep Work - Install Packages\n",
+    "\n",
+    "SGKit is still under rapid development, so I'm installing based on a commit. "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "#! pip install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Install PyVCF\n",
+    "\n",
+    "You'll need to install PyVCF, samtools and tabix in order to run this example as is. \n",
+    "\n",
+    "PyVCF needs to be in the same kernel in order to use it, but tabix can be installed anywhere."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Or install to your existing environment\n",
+    "# ! conda install -c bioconda -c conda-forge -y pyvcf samtools tabix\n",
+    "\n",
+    "\n",
+    "# Uncomment these to create a new conda environment and install these packages\n",
+    "# If you create a new environment you will have to switch your jupyterhub kernel\n",
+    "# ! conda create -n samtools -c bioconda -c conda-forge -y samtools pyvcf samtools tabix\n",
+    "# ! conda activate samtools \n",
+    "\n",
+    "# ! tabix -h ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20100804/ALL.2of4intersection.20100804.genotypes.vcf.gz 2:39967768-39967768 > chr2.vcf\n",
+    "# ls -lah chr2.vcf"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Grab Some Data\n",
+    "\n",
+    "We're going to grab a small subset of a VCF file from the [1000 Genomes Project.](https://www.internationalgenome.org/faq/how-do-i-get-sub-section-vcf-file/). We're only going to grab 3 calls, which is fine for our purposes.\n",
+    "\n",
+    "These calls are also already biallelic. I cheat. ;-)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# I couldn't run this from jupyterhub but needed an actual terminal\n",
+    "#! conda activate samtools \n",
+    "#! tabix -h ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20100804/ALL.2of4intersection.20100804.genotypes.vcf.gz 2:39967768-39967800 > chr2.vcf\n",
+    "#! conda deactivate\n",
+    "# ls -lah chr2.vcf"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import vcf\n",
+    "import os"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Let's write up a quick, *not to be used in the real world*, parser to grab data about the variants.\n",
+    "\n",
+    "* Variant Contig Names - A unique list of all the chromosomes and contigs\n",
+    "* Variant Contig - an index of the variant_contig_names list.\n",
+    "* Variant Position - Position on the chromosome\n",
+    "* Variant Reference and Alternate\n",
+    "* Samples\n",
+    "* Genotype calls per sample - with missing encoded as -1\n",
+    "\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "vcf_reader = vcf.Reader(open('/home/jovyan/chr2.vcf', 'r'))\n",
+    "\n",
+    "# I already know these come from chr2\n",
+    "# but let's grab them anyways\n",
+    "variant_contig_names = []\n",
+    "\n",
+    "variant_chrom = []\n",
+    "variant_position = []\n",
+    "variant_alleles = []\n",
+    "variant_contig = []\n",
+    "\n",
+    "sample_id = []\n",
+    "call_genotype = []\n",
+    "\n",
+    "count = 0\n",
+    "\n",
+    "for record in vcf_reader:\n",
+    "    \n",
+    "    chrom = str(record.CHROM)\n",
+    "    if chrom not in variant_contig_names:\n",
+    "        variant_contig_names.append(chrom)\n",
+    "        \n",
+    "    # Grab the index of the contig\n",
+    "    variant_contig.append(variant_contig_names.index(chrom))\n",
+    "    \n",
+    "    # Get the variant data\n",
+    "    # I'm cheating and only getting the first alternate. In the real world you would filter for biallelic variants.\n",
+    "    variant_alleles.append([str(record.REF), str(record.ALT[0])])\n",
+    "    variant_position.append(record.POS)\n",
+    "    \n",
+    "    # the sample records is an object that has call data       \n",
+    "    samples = record.samples\n",
+    "    \n",
+    "    # Grab the sample names\n",
+    "    if count == 0:\n",
+    "        for sample in samples:\n",
+    "            sample_id.append(sample.sample)\n",
+    "    \n",
+    "    # Grab the call data for each sample for the variant\n",
+    "    variant_genotypes = []\n",
+    "    for sample in samples:\n",
+    "        # If its missing encode as -1, -1\n",
+    "        if sample['GT'] == './.':\n",
+    "            variant_genotypes.append([-1, -1])\n",
+    "        else:\n",
+    "            GT = sample['GT'].split('|')\n",
+    "            variant_genotypes.append([int(GT[0]), int(GT[1])])\n",
+    "    \n",
+    "    call_genotype.append(variant_genotypes)\n",
+    "    count = count + 1"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Convert to Numpy\n",
+    "\n",
+    "Now that we have our data, we need to prepare for our XArray dataset by converting these to Numpy arrays.\n",
+    "\n",
+    "If you're wondering how I know what these are you can check out the `sgkit.api.create_genotype_call_dataset`. The exact functions are `check_array_like` and make sure that these are numpy arrays of a particular type.\n",
+    "\n",
+    "```\n",
+    "check_array_like(variant_contig, kind=\"i\", ndim=1)\n",
+    "check_array_like(variant_position, kind=\"i\", ndim=1)\n",
+    "check_array_like(variant_alleles, kind=\"S\", ndim=2)\n",
+    "check_array_like(sample_id, kind=\"U\", ndim=1)\n",
+    "check_array_like(call_genotype, kind=\"i\", ndim=3)\n",
+    "```"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "sample_id = np.array(sample_id, dtype='U')\n",
+    "variant_position = np.array(variant_position, dtype='i')\n",
+    "variant_alleles = np.array(variant_alleles, dtype='S')\n",
+    "variant_contig_names = np.array(variant_contig_names, dtype='S')\n",
+    "variant_contig = np.array(variant_contig, dtype='i')"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Understanding Variant Contig and Variant Position\n",
+    "\n",
+    "The Genotype Call Xarray dataset is meant to be able to incorporate multiple chromosomes.\n",
+    "\n",
+    "Let's say we have variant calls from chrs 1 and 2, which we read into an array `['chr1','chr2']`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import pandas as pd"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr style=\"text-align: right;\">\n",
+       "      <th></th>\n",
+       "      <th>variant_contig_index</th>\n",
+       "      <th>variant_position</th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>0</th>\n",
+       "      <td>0</td>\n",
+       "      <td>1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>1</th>\n",
+       "      <td>0</td>\n",
+       "      <td>2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>2</th>\n",
+       "      <td>1</td>\n",
+       "      <td>1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>3</th>\n",
+       "      <td>1</td>\n",
+       "      <td>2</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "   variant_contig_index  variant_position\n",
+       "0                     0                 1\n",
+       "1                     0                 2\n",
+       "2                     1                 1\n",
+       "3                     1                 2"
+      ]
+     },
+     "execution_count": 9,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "contigs = ['chr1', 'chr2']\n",
+    "    \n",
+    "df = pd.DataFrame({\n",
+    "                    'variant_contig_index': [0, 0, 1, 1],\n",
+    "                    'variant_position': [1, 2, 1, 2],\n",
+    "                    })\n",
+    "df"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "The Xarray dataset looks like the dataframe above. \n",
+    "\n",
+    "When we initialize the Xarray dataset we will give it a list of contigs (or chromosomes). We don't need to explicitly list the contig per position because we can calculate this based on the contig index.\n",
+    "\n",
+    "**Contig**: `contigs[row['variant_contig_index']]`\n",
+    "\n",
+    "**Position**: `row['variant_position']`"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr style=\"text-align: right;\">\n",
+       "      <th></th>\n",
+       "      <th>variant_contig_index</th>\n",
+       "      <th>variant_position</th>\n",
+       "      <th>description</th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>0</th>\n",
+       "      <td>0</td>\n",
+       "      <td>1</td>\n",
+       "      <td>Chr: chr1 Pos: 1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>1</th>\n",
+       "      <td>0</td>\n",
+       "      <td>2</td>\n",
+       "      <td>Chr: chr1 Pos: 2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>2</th>\n",
+       "      <td>1</td>\n",
+       "      <td>1</td>\n",
+       "      <td>Chr: chr2 Pos: 1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>3</th>\n",
+       "      <td>1</td>\n",
+       "      <td>2</td>\n",
+       "      <td>Chr: chr2 Pos: 2</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "   variant_contig_index  variant_position       description\n",
+       "0                     0                 1  Chr: chr1 Pos: 1\n",
+       "1                     0                 2  Chr: chr1 Pos: 2\n",
+       "2                     1                 1  Chr: chr2 Pos: 1\n",
+       "3                     1                 2  Chr: chr2 Pos: 2"
+      ]
+     },
+     "execution_count": 10,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "def return_contig(row):\n",
+    "    return 'Chr: {chr} Pos: {pos}'.format(chr=contigs[row['variant_contig_index']], pos=row['variant_position'])\n",
+    "\n",
+    "df['description'] = df.apply(lambda row: return_contig(row), axis=1)\n",
+    "\n",
+    "df"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Genotype Calls\n",
+    "\n",
+    "If we've done our work right we our genotypes should have the shape: `[DIM_VARIANT, DIM_SAMPLE, DIM_PLOIDY]`, meaning the first axis is the number of variants, the second the number of samples, and the third the ploidy. In our case we are working with diploid alleles.\n",
+    "\n",
+    "Our genotype array has this structure:\n",
+    "\n",
+    "```python\n",
+    "genotypes = [\n",
+    "\n",
+    "    # Outermost array should have a length = the number of variants\n",
+    "    \n",
+    "    # variant chr 1 position 1\n",
+    "    [\n",
+    "        # Per variant we should have an array length = number of samples\n",
+    "        \n",
+    "        # sample 1 \n",
+    "        # Per sample we should have an array length = number of alleles\n",
+    "        [call, call],\n",
+    "        \n",
+    "        # sample 2\n",
+    "        [call, call]\n",
+    "    ],\n",
+    "    \n",
+    "    # variant chr 1 position 2\n",
+    "    [\n",
+    "        # sample 1 \n",
+    "        [call, call],\n",
+    "        # sample 2\n",
+    "        [call, call]\n",
+    "    ],\n",
+    "    \n",
+    "]\n",
+    "```"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(3, 629, 2)"
+      ]
+     },
+     "execution_count": 11,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "call_genotype = np.array(call_genotype, dtype='i')\n",
+    "call_genotype.shape"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "This is correct! We have 3 variants, 629 samples, and diploid alleles."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Convert to Genotype Call Dataset\n",
+    "\n",
+    "Finally! Let's convert this to the Genotype Call Dataset!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array([[b'T', b'A'],\n",
+       "       [b'G', b'C'],\n",
+       "       [b'C', b'T']], dtype='|S1')"
+      ]
+     },
+     "execution_count": 12,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "variant_alleles"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import sgkit\n",
+    "\n",
+    "genotype_xarray_dataset = sgkit.api.create_genotype_call_dataset(\n",
+    "    variant_contig_names = variant_contig_names,\n",
+    "    # Since we know these are all from the same chromosome we could just calculate this on the fly as a np array of zeros\n",
+    "    #variant_contig = np.zeros(len(variant_position)),\n",
+    "    variant_contig = variant_contig,\n",
+    "    variant_position = variant_position,\n",
+    "    variant_alleles = variant_alleles,\n",
+    "    sample_id = sample_id,\n",
+    "    call_genotype = call_genotype,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div><svg style=\"position: absolute; width: 0; height: 0; overflow: hidden\">\n",
+       "<defs>\n",
+       "<symbol id=\"icon-database\" viewBox=\"0 0 32 32\">\n",
+       "<title>Show/Hide data repr</title>\n",
+       "<path d=\"M16 0c-8.837 0-16 2.239-16 5v4c0 2.761 7.163 5 16 5s16-2.239 16-5v-4c0-2.761-7.163-5-16-5z\"></path>\n",
+       "<path d=\"M16 17c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z\"></path>\n",
+       "<path d=\"M16 26c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z\"></path>\n",
+       "</symbol>\n",
+       "<symbol id=\"icon-file-text2\" viewBox=\"0 0 32 32\">\n",
+       "<title>Show/Hide attributes</title>\n",
+       "<path d=\"M28.681 7.159c-0.694-0.947-1.662-2.053-2.724-3.116s-2.169-2.030-3.116-2.724c-1.612-1.182-2.393-1.319-2.841-1.319h-15.5c-1.378 0-2.5 1.121-2.5 2.5v27c0 1.378 1.122 2.5 2.5 2.5h23c1.378 0 2.5-1.122 2.5-2.5v-19.5c0-0.448-0.137-1.23-1.319-2.841zM24.543 5.457c0.959 0.959 1.712 1.825 2.268 2.543h-4.811v-4.811c0.718 0.556 1.584 1.309 2.543 2.268zM28 29.5c0 0.271-0.229 0.5-0.5 0.5h-23c-0.271 0-0.5-0.229-0.5-0.5v-27c0-0.271 0.229-0.5 0.5-0.5 0 0 15.499-0 15.5 0v7c0 0.552 0.448 1 1 1h7v19.5z\"></path>\n",
+       "<path d=\"M23 26h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "<path d=\"M23 22h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "<path d=\"M23 18h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "</symbol>\n",
+       "</defs>\n",
+       "</svg>\n",
+       "<style>/* CSS stylesheet for displaying xarray objects in jupyterlab.\n",
+       " *\n",
+       " */\n",
+       "\n",
+       ":root {\n",
+       "  --xr-font-color0: var(--jp-content-font-color0, rgba(0, 0, 0, 1));\n",
+       "  --xr-font-color2: var(--jp-content-font-color2, rgba(0, 0, 0, 0.54));\n",
+       "  --xr-font-color3: var(--jp-content-font-color3, rgba(0, 0, 0, 0.38));\n",
+       "  --xr-border-color: var(--jp-border-color2, #e0e0e0);\n",
+       "  --xr-disabled-color: var(--jp-layout-color3, #bdbdbd);\n",
+       "  --xr-background-color: var(--jp-layout-color0, white);\n",
+       "  --xr-background-color-row-even: var(--jp-layout-color1, white);\n",
+       "  --xr-background-color-row-odd: var(--jp-layout-color2, #eeeeee);\n",
+       "}\n",
+       "\n",
+       ".xr-wrap {\n",
+       "  min-width: 300px;\n",
+       "  max-width: 700px;\n",
+       "}\n",
+       "\n",
+       ".xr-header {\n",
+       "  padding-top: 6px;\n",
+       "  padding-bottom: 6px;\n",
+       "  margin-bottom: 4px;\n",
+       "  border-bottom: solid 1px var(--xr-border-color);\n",
+       "}\n",
+       "\n",
+       ".xr-header > div,\n",
+       ".xr-header > ul {\n",
+       "  display: inline;\n",
+       "  margin-top: 0;\n",
+       "  margin-bottom: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-obj-type,\n",
+       ".xr-array-name {\n",
+       "  margin-left: 2px;\n",
+       "  margin-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-obj-type {\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-sections {\n",
+       "  padding-left: 0 !important;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 150px auto auto 1fr 20px 20px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input + label {\n",
+       "  color: var(--xr-disabled-color);\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input:enabled + label {\n",
+       "  cursor: pointer;\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input:enabled + label:hover {\n",
+       "  color: var(--xr-font-color0);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary {\n",
+       "  grid-column: 1;\n",
+       "  color: var(--xr-font-color2);\n",
+       "  font-weight: 500;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary > span {\n",
+       "  display: inline-block;\n",
+       "  padding-left: 0.5em;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:disabled + label {\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in + label:before {\n",
+       "  display: inline-block;\n",
+       "  content: '►';\n",
+       "  font-size: 11px;\n",
+       "  width: 15px;\n",
+       "  text-align: center;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:disabled + label:before {\n",
+       "  color: var(--xr-disabled-color);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked + label:before {\n",
+       "  content: '▼';\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked + label > span {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary,\n",
+       ".xr-section-inline-details {\n",
+       "  padding-top: 4px;\n",
+       "  padding-bottom: 4px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-inline-details {\n",
+       "  grid-column: 2 / -1;\n",
+       "}\n",
+       "\n",
+       ".xr-section-details {\n",
+       "  display: none;\n",
+       "  grid-column: 1 / -1;\n",
+       "  margin-bottom: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked ~ .xr-section-details {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-array-wrap {\n",
+       "  grid-column: 1 / -1;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 20px auto;\n",
+       "}\n",
+       "\n",
+       ".xr-array-wrap > label {\n",
+       "  grid-column: 1;\n",
+       "  vertical-align: top;\n",
+       "}\n",
+       "\n",
+       ".xr-preview {\n",
+       "  color: var(--xr-font-color3);\n",
+       "}\n",
+       "\n",
+       ".xr-array-preview,\n",
+       ".xr-array-data {\n",
+       "  padding: 0 5px !important;\n",
+       "  grid-column: 2;\n",
+       "}\n",
+       "\n",
+       ".xr-array-data,\n",
+       ".xr-array-in:checked ~ .xr-array-preview {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-array-in:checked ~ .xr-array-data,\n",
+       ".xr-array-preview {\n",
+       "  display: inline-block;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list {\n",
+       "  display: inline-block !important;\n",
+       "  list-style: none;\n",
+       "  padding: 0 !important;\n",
+       "  margin: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list li {\n",
+       "  display: inline-block;\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list:before {\n",
+       "  content: '(';\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list:after {\n",
+       "  content: ')';\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list li:not(:last-child):after {\n",
+       "  content: ',';\n",
+       "  padding-right: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-has-index {\n",
+       "  font-weight: bold;\n",
+       "}\n",
+       "\n",
+       ".xr-var-list,\n",
+       ".xr-var-item {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-var-item > div,\n",
+       ".xr-var-item label,\n",
+       ".xr-var-item > .xr-var-name span {\n",
+       "  background-color: var(--xr-background-color-row-even);\n",
+       "  margin-bottom: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-var-item > .xr-var-name:hover span {\n",
+       "  padding-right: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-var-list > li:nth-child(odd) > div,\n",
+       ".xr-var-list > li:nth-child(odd) > label,\n",
+       ".xr-var-list > li:nth-child(odd) > .xr-var-name span {\n",
+       "  background-color: var(--xr-background-color-row-odd);\n",
+       "}\n",
+       "\n",
+       ".xr-var-name {\n",
+       "  grid-column: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-var-dims {\n",
+       "  grid-column: 2;\n",
+       "}\n",
+       "\n",
+       ".xr-var-dtype {\n",
+       "  grid-column: 3;\n",
+       "  text-align: right;\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-var-preview {\n",
+       "  grid-column: 4;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name,\n",
+       ".xr-var-dims,\n",
+       ".xr-var-dtype,\n",
+       ".xr-preview,\n",
+       ".xr-attrs dt {\n",
+       "  white-space: nowrap;\n",
+       "  overflow: hidden;\n",
+       "  text-overflow: ellipsis;\n",
+       "  padding-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name:hover,\n",
+       ".xr-var-dims:hover,\n",
+       ".xr-var-dtype:hover,\n",
+       ".xr-attrs dt:hover {\n",
+       "  overflow: visible;\n",
+       "  width: auto;\n",
+       "  z-index: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-var-attrs,\n",
+       ".xr-var-data {\n",
+       "  display: none;\n",
+       "  background-color: var(--xr-background-color) !important;\n",
+       "  padding-bottom: 5px !important;\n",
+       "}\n",
+       "\n",
+       ".xr-var-attrs-in:checked ~ .xr-var-attrs,\n",
+       ".xr-var-data-in:checked ~ .xr-var-data {\n",
+       "  display: block;\n",
+       "}\n",
+       "\n",
+       ".xr-var-data > table {\n",
+       "  float: right;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name span,\n",
+       ".xr-var-data,\n",
+       ".xr-attrs {\n",
+       "  padding-left: 25px !important;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs,\n",
+       ".xr-var-attrs,\n",
+       ".xr-var-data {\n",
+       "  grid-column: 1 / -1;\n",
+       "}\n",
+       "\n",
+       "dl.xr-attrs {\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 125px auto;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt, dd {\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "  float: left;\n",
+       "  padding-right: 10px;\n",
+       "  width: auto;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt {\n",
+       "  font-weight: normal;\n",
+       "  grid-column: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt:hover span {\n",
+       "  display: inline-block;\n",
+       "  background: var(--xr-background-color);\n",
+       "  padding-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dd {\n",
+       "  grid-column: 2;\n",
+       "  white-space: pre-wrap;\n",
+       "  word-break: break-all;\n",
+       "}\n",
+       "\n",
+       ".xr-icon-database,\n",
+       ".xr-icon-file-text2 {\n",
+       "  display: inline-block;\n",
+       "  vertical-align: middle;\n",
+       "  width: 1em;\n",
+       "  height: 1.5em !important;\n",
+       "  stroke-width: 0;\n",
+       "  stroke: currentColor;\n",
+       "  fill: currentColor;\n",
+       "}\n",
+       "</style><div class='xr-wrap'><div class='xr-header'><div class='xr-obj-type'>xarray.Dataset</div></div><ul class='xr-sections'><li class='xr-section-item'><input id='section-2bbbe44c-6042-4d24-99ce-4b04915ab37b' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-2bbbe44c-6042-4d24-99ce-4b04915ab37b' class='xr-section-summary'  title='Expand/collapse section'>Dimensions:</label><div class='xr-section-inline-details'><ul class='xr-dim-list'><li><span>alleles</span>: 2</li><li><span>ploidy</span>: 2</li><li><span>samples</span>: 629</li><li><span>variants</span>: 3</li></ul></div><div class='xr-section-details'></div></li><li class='xr-section-item'><input id='section-dee09919-0251-4a21-8d0e-b973e56a0913' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-dee09919-0251-4a21-8d0e-b973e56a0913' class='xr-section-summary'  title='Expand/collapse section'>Coordinates: <span>(0)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'></ul></div></li><li class='xr-section-item'><input id='section-0e965023-e326-49ff-93a7-f4d8ca5bd61a' class='xr-section-summary-in' type='checkbox'  checked><label for='section-0e965023-e326-49ff-93a7-f4d8ca5bd61a' class='xr-section-summary' >Data variables: <span>(6)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'><li class='xr-var-item'><div class='xr-var-name'><span>variant/contig</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0 0 0</div><input id='attrs-12f058df-66b9-439c-bf6c-01861f0cdc65' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-12f058df-66b9-439c-bf6c-01861f0cdc65' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-419b2e09-7c7a-40c6-9cef-21c7f5f23527' class='xr-var-data-in' type='checkbox'><label for='data-419b2e09-7c7a-40c6-9cef-21c7f5f23527' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([0, 0, 0], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/position</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>39967768 39967778 39967793</div><input id='attrs-5ed2c700-e8c8-47d0-a7a0-6c0fb18a093b' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-5ed2c700-e8c8-47d0-a7a0-6c0fb18a093b' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-424e24df-c940-4792-a686-50ce65f222ba' class='xr-var-data-in' type='checkbox'><label for='data-424e24df-c940-4792-a686-50ce65f222ba' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([39967768, 39967778, 39967793], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/alleles</span></div><div class='xr-var-dims'>(variants, alleles)</div><div class='xr-var-dtype'>|S1</div><div class='xr-var-preview xr-preview'>b&#x27;T&#x27; b&#x27;A&#x27; b&#x27;G&#x27; b&#x27;C&#x27; b&#x27;C&#x27; b&#x27;T&#x27;</div><input id='attrs-9b7f3d8a-c02b-4d17-8b18-0aecd9477eb0' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-9b7f3d8a-c02b-4d17-8b18-0aecd9477eb0' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-c9e8b4ca-a4a9-437a-809b-abb331a6e3ce' class='xr-var-data-in' type='checkbox'><label for='data-c9e8b4ca-a4a9-437a-809b-abb331a6e3ce' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[b&#x27;T&#x27;, b&#x27;A&#x27;],\n",
+       "       [b&#x27;G&#x27;, b&#x27;C&#x27;],\n",
+       "       [b&#x27;C&#x27;, b&#x27;T&#x27;]], dtype=&#x27;|S1&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>sample/id</span></div><div class='xr-var-dims'>(samples)</div><div class='xr-var-dtype'>&lt;U7</div><div class='xr-var-preview xr-preview'>&#x27;HG00098&#x27; &#x27;HG00100&#x27; ... &#x27;NA20828&#x27;</div><input id='attrs-0aecc63f-7276-4c87-a95a-492095873f76' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-0aecc63f-7276-4c87-a95a-492095873f76' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-5b980cd7-2fe4-45cc-8026-11b12d23152b' class='xr-var-data-in' type='checkbox'><label for='data-5b980cd7-2fe4-45cc-8026-11b12d23152b' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([&#x27;HG00098&#x27;, &#x27;HG00100&#x27;, &#x27;HG00106&#x27;, &#x27;HG00112&#x27;, &#x27;HG00114&#x27;, &#x27;HG00116&#x27;,\n",
+       "       &#x27;HG00117&#x27;, &#x27;HG00118&#x27;, &#x27;HG00119&#x27;, &#x27;HG00120&#x27;, &#x27;HG00122&#x27;, &#x27;HG00123&#x27;,\n",
+       "       &#x27;HG00124&#x27;, &#x27;HG00126&#x27;, &#x27;HG00131&#x27;, &#x27;HG00141&#x27;, &#x27;HG00142&#x27;, &#x27;HG00143&#x27;,\n",
+       "       &#x27;HG00144&#x27;, &#x27;HG00145&#x27;, &#x27;HG00146&#x27;, &#x27;HG00147&#x27;, &#x27;HG00148&#x27;, &#x27;HG00149&#x27;,\n",
+       "       &#x27;HG00150&#x27;, &#x27;HG00151&#x27;, &#x27;HG00152&#x27;, &#x27;HG00153&#x27;, &#x27;HG00156&#x27;, &#x27;HG00158&#x27;,\n",
+       "       &#x27;HG00159&#x27;, &#x27;HG00160&#x27;, &#x27;HG00171&#x27;, &#x27;HG00173&#x27;, &#x27;HG00174&#x27;, &#x27;HG00176&#x27;,\n",
+       "       &#x27;HG00177&#x27;, &#x27;HG00178&#x27;, &#x27;HG00179&#x27;, &#x27;HG00180&#x27;, &#x27;HG00181&#x27;, &#x27;HG00182&#x27;,\n",
+       "       &#x27;HG00183&#x27;, &#x27;HG00185&#x27;, &#x27;HG00186&#x27;, &#x27;HG00187&#x27;, &#x27;HG00188&#x27;, &#x27;HG00189&#x27;,\n",
+       "       &#x27;HG00190&#x27;, &#x27;HG00231&#x27;, &#x27;HG00239&#x27;, &#x27;HG00242&#x27;, &#x27;HG00243&#x27;, &#x27;HG00244&#x27;,\n",
+       "       &#x27;HG00245&#x27;, &#x27;HG00247&#x27;, &#x27;HG00258&#x27;, &#x27;HG00262&#x27;, &#x27;HG00264&#x27;, &#x27;HG00265&#x27;,\n",
+       "       &#x27;HG00266&#x27;, &#x27;HG00267&#x27;, &#x27;HG00269&#x27;, &#x27;HG00270&#x27;, &#x27;HG00272&#x27;, &#x27;HG00306&#x27;,\n",
+       "       &#x27;HG00308&#x27;, &#x27;HG00311&#x27;, &#x27;HG00312&#x27;, &#x27;HG00357&#x27;, &#x27;HG00361&#x27;, &#x27;HG00366&#x27;,\n",
+       "       &#x27;HG00367&#x27;, &#x27;HG00368&#x27;, &#x27;HG00369&#x27;, &#x27;HG00372&#x27;, &#x27;HG00373&#x27;, &#x27;HG00377&#x27;,\n",
+       "       &#x27;HG00380&#x27;, &#x27;HG00403&#x27;, &#x27;HG00404&#x27;, &#x27;HG00406&#x27;, &#x27;HG00407&#x27;, &#x27;HG00445&#x27;,\n",
+       "       &#x27;HG00446&#x27;, &#x27;HG00452&#x27;, &#x27;HG00457&#x27;, &#x27;HG00553&#x27;, &#x27;HG00554&#x27;, &#x27;HG00559&#x27;,\n",
+       "       &#x27;HG00560&#x27;, &#x27;HG00565&#x27;, &#x27;HG00566&#x27;, &#x27;HG00577&#x27;, &#x27;HG00578&#x27;, &#x27;HG00592&#x27;,\n",
+       "       &#x27;HG00593&#x27;, &#x27;HG00596&#x27;, &#x27;HG00610&#x27;, &#x27;HG00611&#x27;, &#x27;HG00625&#x27;, &#x27;HG00626&#x27;,\n",
+       "       &#x27;HG00628&#x27;, &#x27;HG00629&#x27;, &#x27;HG00634&#x27;, &#x27;HG00635&#x27;, &#x27;HG00637&#x27;, &#x27;HG00638&#x27;,\n",
+       "       &#x27;HG00640&#x27;, &#x27;NA06984&#x27;, &#x27;NA06985&#x27;, &#x27;NA06986&#x27;, &#x27;NA06989&#x27;, &#x27;NA06994&#x27;,\n",
+       "       &#x27;NA07000&#x27;, &#x27;NA07037&#x27;, &#x27;NA07048&#x27;, &#x27;NA07051&#x27;, &#x27;NA07056&#x27;, &#x27;NA07346&#x27;,\n",
+       "       &#x27;NA07347&#x27;, &#x27;NA07357&#x27;, &#x27;NA10847&#x27;, &#x27;NA10851&#x27;, &#x27;NA11829&#x27;, &#x27;NA11830&#x27;,\n",
+       "       &#x27;NA11831&#x27;, &#x27;NA11832&#x27;, &#x27;NA11840&#x27;, &#x27;NA11843&#x27;, &#x27;NA11881&#x27;, &#x27;NA11892&#x27;,\n",
+       "       &#x27;NA11893&#x27;, &#x27;NA11894&#x27;, &#x27;NA11918&#x27;, &#x27;NA11919&#x27;, &#x27;NA11920&#x27;, &#x27;NA11930&#x27;,\n",
+       "       &#x27;NA11931&#x27;, &#x27;NA11932&#x27;, &#x27;NA11933&#x27;, &#x27;NA11992&#x27;, &#x27;NA11993&#x27;, &#x27;NA11994&#x27;,\n",
+       "       &#x27;NA11995&#x27;, &#x27;NA12003&#x27;, &#x27;NA12004&#x27;, &#x27;NA12005&#x27;, &#x27;NA12006&#x27;, &#x27;NA12043&#x27;,\n",
+       "       &#x27;NA12044&#x27;, &#x27;NA12045&#x27;, &#x27;NA12046&#x27;, &#x27;NA12058&#x27;, &#x27;NA12144&#x27;, &#x27;NA12154&#x27;,\n",
+       "       &#x27;NA12155&#x27;, &#x27;NA12156&#x27;, &#x27;NA12249&#x27;, &#x27;NA12272&#x27;, &#x27;NA12273&#x27;, &#x27;NA12275&#x27;,\n",
+       "       &#x27;NA12287&#x27;, &#x27;NA12340&#x27;, &#x27;NA12341&#x27;, &#x27;NA12342&#x27;, &#x27;NA12347&#x27;, &#x27;NA12348&#x27;,\n",
+       "       &#x27;NA12383&#x27;, &#x27;NA12399&#x27;, &#x27;NA12400&#x27;, &#x27;NA12413&#x27;, &#x27;NA12414&#x27;, &#x27;NA12489&#x27;,\n",
+       "       &#x27;NA12546&#x27;, &#x27;NA12716&#x27;, &#x27;NA12717&#x27;, &#x27;NA12718&#x27;, &#x27;NA12749&#x27;, &#x27;NA12750&#x27;,\n",
+       "       &#x27;NA12751&#x27;, &#x27;NA12761&#x27;, &#x27;NA12762&#x27;, &#x27;NA12763&#x27;, &#x27;NA12775&#x27;, &#x27;NA12776&#x27;,\n",
+       "       &#x27;NA12777&#x27;, &#x27;NA12778&#x27;, &#x27;NA12812&#x27;, &#x27;NA12813&#x27;, &#x27;NA12814&#x27;, &#x27;NA12815&#x27;,\n",
+       "       &#x27;NA12828&#x27;, &#x27;NA12830&#x27;, &#x27;NA12872&#x27;, &#x27;NA12873&#x27;, &#x27;NA12874&#x27;, &#x27;NA12889&#x27;,\n",
+       "       &#x27;NA12890&#x27;, &#x27;NA18486&#x27;, &#x27;NA18487&#x27;, &#x27;NA18489&#x27;, &#x27;NA18498&#x27;, &#x27;NA18499&#x27;,\n",
+       "       &#x27;NA18501&#x27;, &#x27;NA18502&#x27;, &#x27;NA18504&#x27;, &#x27;NA18505&#x27;, &#x27;NA18507&#x27;, &#x27;NA18508&#x27;,\n",
+       "       &#x27;NA18510&#x27;, &#x27;NA18511&#x27;, &#x27;NA18516&#x27;, &#x27;NA18517&#x27;, &#x27;NA18519&#x27;, &#x27;NA18520&#x27;,\n",
+       "       &#x27;NA18522&#x27;, &#x27;NA18523&#x27;, &#x27;NA18525&#x27;, &#x27;NA18526&#x27;, &#x27;NA18527&#x27;, &#x27;NA18532&#x27;,\n",
+       "       &#x27;NA18535&#x27;, &#x27;NA18537&#x27;, &#x27;NA18538&#x27;, &#x27;NA18539&#x27;, &#x27;NA18541&#x27;, &#x27;NA18542&#x27;,\n",
+       "       &#x27;NA18545&#x27;, &#x27;NA18547&#x27;, &#x27;NA18550&#x27;, &#x27;NA18552&#x27;, &#x27;NA18553&#x27;, &#x27;NA18555&#x27;,\n",
+       "       &#x27;NA18558&#x27;, &#x27;NA18560&#x27;, &#x27;NA18561&#x27;, &#x27;NA18562&#x27;, &#x27;NA18563&#x27;, &#x27;NA18564&#x27;,\n",
+       "       &#x27;NA18565&#x27;, &#x27;NA18566&#x27;, &#x27;NA18567&#x27;, &#x27;NA18570&#x27;, &#x27;NA18571&#x27;, &#x27;NA18572&#x27;,\n",
+       "       &#x27;NA18573&#x27;, &#x27;NA18574&#x27;, &#x27;NA18576&#x27;, &#x27;NA18577&#x27;, &#x27;NA18579&#x27;, &#x27;NA18582&#x27;,\n",
+       "       &#x27;NA18592&#x27;, &#x27;NA18593&#x27;, &#x27;NA18603&#x27;, &#x27;NA18605&#x27;, &#x27;NA18608&#x27;, &#x27;NA18609&#x27;,\n",
+       "       &#x27;NA18611&#x27;, &#x27;NA18612&#x27;, &#x27;NA18614&#x27;, &#x27;NA18615&#x27;, &#x27;NA18616&#x27;, &#x27;NA18617&#x27;,\n",
+       "       &#x27;NA18618&#x27;, &#x27;NA18619&#x27;, &#x27;NA18620&#x27;, &#x27;NA18621&#x27;, &#x27;NA18622&#x27;, &#x27;NA18623&#x27;,\n",
+       "       &#x27;NA18624&#x27;, &#x27;NA18625&#x27;, &#x27;NA18626&#x27;, &#x27;NA18627&#x27;, &#x27;NA18628&#x27;, &#x27;NA18630&#x27;,\n",
+       "       &#x27;NA18631&#x27;, &#x27;NA18632&#x27;, &#x27;NA18633&#x27;, &#x27;NA18634&#x27;, &#x27;NA18636&#x27;, &#x27;NA18638&#x27;,\n",
+       "       &#x27;NA18640&#x27;, &#x27;NA18642&#x27;, &#x27;NA18643&#x27;, &#x27;NA18745&#x27;, &#x27;NA18853&#x27;, &#x27;NA18856&#x27;,\n",
+       "       &#x27;NA18858&#x27;, &#x27;NA18861&#x27;, &#x27;NA18867&#x27;, &#x27;NA18868&#x27;, &#x27;NA18870&#x27;, &#x27;NA18871&#x27;,\n",
+       "       &#x27;NA18873&#x27;, &#x27;NA18874&#x27;, &#x27;NA18907&#x27;, &#x27;NA18908&#x27;, &#x27;NA18909&#x27;, &#x27;NA18910&#x27;,\n",
+       "       &#x27;NA18912&#x27;, &#x27;NA18916&#x27;, &#x27;NA18940&#x27;, &#x27;NA18941&#x27;, &#x27;NA18942&#x27;, &#x27;NA18943&#x27;,\n",
+       "       &#x27;NA18944&#x27;, &#x27;NA18945&#x27;, &#x27;NA18947&#x27;, &#x27;NA18948&#x27;, &#x27;NA18949&#x27;, &#x27;NA18950&#x27;,\n",
+       "       &#x27;NA18951&#x27;, &#x27;NA18952&#x27;, &#x27;NA18953&#x27;, &#x27;NA18955&#x27;, &#x27;NA18956&#x27;, &#x27;NA18959&#x27;,\n",
+       "       &#x27;NA18960&#x27;, &#x27;NA18961&#x27;, &#x27;NA18963&#x27;, &#x27;NA18964&#x27;, &#x27;NA18965&#x27;, &#x27;NA18967&#x27;,\n",
+       "       &#x27;NA18968&#x27;, &#x27;NA18970&#x27;, &#x27;NA18971&#x27;, &#x27;NA18972&#x27;, &#x27;NA18973&#x27;, &#x27;NA18974&#x27;,\n",
+       "       &#x27;NA18975&#x27;, &#x27;NA18976&#x27;, &#x27;NA18977&#x27;, &#x27;NA18979&#x27;, &#x27;NA18980&#x27;, &#x27;NA18981&#x27;,\n",
+       "       &#x27;NA18982&#x27;, &#x27;NA18983&#x27;, &#x27;NA18984&#x27;, &#x27;NA18985&#x27;, &#x27;NA18986&#x27;, &#x27;NA18987&#x27;,\n",
+       "       &#x27;NA18988&#x27;, &#x27;NA18989&#x27;, &#x27;NA18990&#x27;, &#x27;NA18997&#x27;, &#x27;NA18999&#x27;, &#x27;NA19000&#x27;,\n",
+       "       &#x27;NA19001&#x27;, &#x27;NA19002&#x27;, &#x27;NA19003&#x27;, &#x27;NA19004&#x27;, &#x27;NA19005&#x27;, &#x27;NA19007&#x27;,\n",
+       "       &#x27;NA19009&#x27;, &#x27;NA19010&#x27;, &#x27;NA19012&#x27;, &#x27;NA19027&#x27;, &#x27;NA19044&#x27;, &#x27;NA19054&#x27;,\n",
+       "       &#x27;NA19055&#x27;, &#x27;NA19056&#x27;, &#x27;NA19057&#x27;, &#x27;NA19058&#x27;, &#x27;NA19059&#x27;, &#x27;NA19060&#x27;,\n",
+       "       &#x27;NA19062&#x27;, &#x27;NA19063&#x27;, &#x27;NA19064&#x27;, &#x27;NA19065&#x27;, &#x27;NA19066&#x27;, &#x27;NA19067&#x27;,\n",
+       "       &#x27;NA19068&#x27;, &#x27;NA19070&#x27;, &#x27;NA19072&#x27;, &#x27;NA19074&#x27;, &#x27;NA19075&#x27;, &#x27;NA19076&#x27;,\n",
+       "       &#x27;NA19077&#x27;, &#x27;NA19078&#x27;, &#x27;NA19079&#x27;, &#x27;NA19082&#x27;, &#x27;NA19083&#x27;, &#x27;NA19084&#x27;,\n",
+       "       &#x27;NA19085&#x27;, &#x27;NA19086&#x27;, &#x27;NA19087&#x27;, &#x27;NA19088&#x27;, &#x27;NA19093&#x27;, &#x27;NA19098&#x27;,\n",
+       "       &#x27;NA19099&#x27;, &#x27;NA19102&#x27;, &#x27;NA19107&#x27;, &#x27;NA19108&#x27;, &#x27;NA19113&#x27;, &#x27;NA19114&#x27;,\n",
+       "       &#x27;NA19116&#x27;, &#x27;NA19119&#x27;, &#x27;NA19129&#x27;, &#x27;NA19130&#x27;, &#x27;NA19131&#x27;, &#x27;NA19137&#x27;,\n",
+       "       &#x27;NA19138&#x27;, &#x27;NA19141&#x27;, &#x27;NA19143&#x27;, &#x27;NA19144&#x27;, &#x27;NA19147&#x27;, &#x27;NA19152&#x27;,\n",
+       "       &#x27;NA19153&#x27;, &#x27;NA19159&#x27;, &#x27;NA19160&#x27;, &#x27;NA19171&#x27;, &#x27;NA19172&#x27;, &#x27;NA19184&#x27;,\n",
+       "       &#x27;NA19189&#x27;, &#x27;NA19190&#x27;, &#x27;NA19200&#x27;, &#x27;NA19201&#x27;, &#x27;NA19204&#x27;, &#x27;NA19206&#x27;,\n",
+       "       &#x27;NA19207&#x27;, &#x27;NA19209&#x27;, &#x27;NA19210&#x27;, &#x27;NA19213&#x27;, &#x27;NA19225&#x27;, &#x27;NA19235&#x27;,\n",
+       "       &#x27;NA19236&#x27;, &#x27;NA19247&#x27;, &#x27;NA19248&#x27;, &#x27;NA19256&#x27;, &#x27;NA19257&#x27;, &#x27;NA19311&#x27;,\n",
+       "       &#x27;NA19312&#x27;, &#x27;NA19313&#x27;, &#x27;NA19314&#x27;, &#x27;NA19332&#x27;, &#x27;NA19334&#x27;, &#x27;NA19338&#x27;,\n",
+       "       &#x27;NA19346&#x27;, &#x27;NA19347&#x27;, &#x27;NA19350&#x27;, &#x27;NA19355&#x27;, &#x27;NA19359&#x27;, &#x27;NA19360&#x27;,\n",
+       "       &#x27;NA19371&#x27;, &#x27;NA19372&#x27;, &#x27;NA19375&#x27;, &#x27;NA19376&#x27;, &#x27;NA19377&#x27;, &#x27;NA19379&#x27;,\n",
+       "       &#x27;NA19381&#x27;, &#x27;NA19382&#x27;, &#x27;NA19383&#x27;, &#x27;NA19384&#x27;, &#x27;NA19385&#x27;, &#x27;NA19390&#x27;,\n",
+       "       &#x27;NA19391&#x27;, &#x27;NA19393&#x27;, &#x27;NA19394&#x27;, &#x27;NA19395&#x27;, &#x27;NA19397&#x27;, &#x27;NA19398&#x27;,\n",
+       "       &#x27;NA19399&#x27;, &#x27;NA19401&#x27;, &#x27;NA19404&#x27;, &#x27;NA19428&#x27;, &#x27;NA19429&#x27;, &#x27;NA19434&#x27;,\n",
+       "       &#x27;NA19435&#x27;, &#x27;NA19436&#x27;, &#x27;NA19437&#x27;, &#x27;NA19438&#x27;, &#x27;NA19439&#x27;, &#x27;NA19440&#x27;,\n",
+       "       &#x27;NA19443&#x27;, &#x27;NA19444&#x27;, &#x27;NA19445&#x27;, &#x27;NA19446&#x27;, &#x27;NA19448&#x27;, &#x27;NA19449&#x27;,\n",
+       "       &#x27;NA19451&#x27;, &#x27;NA19452&#x27;, &#x27;NA19453&#x27;, &#x27;NA19455&#x27;, &#x27;NA19456&#x27;, &#x27;NA19457&#x27;,\n",
+       "       &#x27;NA19461&#x27;, &#x27;NA19462&#x27;, &#x27;NA19463&#x27;, &#x27;NA19466&#x27;, &#x27;NA19467&#x27;, &#x27;NA19469&#x27;,\n",
+       "       &#x27;NA19471&#x27;, &#x27;NA19472&#x27;, &#x27;NA19473&#x27;, &#x27;NA19474&#x27;, &#x27;NA19625&#x27;, &#x27;NA19648&#x27;,\n",
+       "       &#x27;NA19649&#x27;, &#x27;NA19651&#x27;, &#x27;NA19652&#x27;, &#x27;NA19654&#x27;, &#x27;NA19655&#x27;, &#x27;NA19658&#x27;,\n",
+       "       &#x27;NA19660&#x27;, &#x27;NA19661&#x27;, &#x27;NA19678&#x27;, &#x27;NA19684&#x27;, &#x27;NA19685&#x27;, &#x27;NA19700&#x27;,\n",
+       "       &#x27;NA19701&#x27;, &#x27;NA19703&#x27;, &#x27;NA19704&#x27;, &#x27;NA19707&#x27;, &#x27;NA19712&#x27;, &#x27;NA19713&#x27;,\n",
+       "       &#x27;NA19720&#x27;, &#x27;NA19722&#x27;, &#x27;NA19723&#x27;, &#x27;NA19725&#x27;, &#x27;NA19726&#x27;, &#x27;NA19818&#x27;,\n",
+       "       &#x27;NA19819&#x27;, &#x27;NA19834&#x27;, &#x27;NA19835&#x27;, &#x27;NA19900&#x27;, &#x27;NA19901&#x27;, &#x27;NA19904&#x27;,\n",
+       "       &#x27;NA19908&#x27;, &#x27;NA19909&#x27;, &#x27;NA19914&#x27;, &#x27;NA19916&#x27;, &#x27;NA19917&#x27;, &#x27;NA19920&#x27;,\n",
+       "       &#x27;NA19921&#x27;, &#x27;NA19982&#x27;, &#x27;NA20414&#x27;, &#x27;NA20502&#x27;, &#x27;NA20505&#x27;, &#x27;NA20508&#x27;,\n",
+       "       &#x27;NA20509&#x27;, &#x27;NA20510&#x27;, &#x27;NA20512&#x27;, &#x27;NA20515&#x27;, &#x27;NA20516&#x27;, &#x27;NA20517&#x27;,\n",
+       "       &#x27;NA20518&#x27;, &#x27;NA20519&#x27;, &#x27;NA20520&#x27;, &#x27;NA20521&#x27;, &#x27;NA20522&#x27;, &#x27;NA20524&#x27;,\n",
+       "       &#x27;NA20525&#x27;, &#x27;NA20526&#x27;, &#x27;NA20527&#x27;, &#x27;NA20528&#x27;, &#x27;NA20529&#x27;, &#x27;NA20530&#x27;,\n",
+       "       &#x27;NA20531&#x27;, &#x27;NA20532&#x27;, &#x27;NA20533&#x27;, &#x27;NA20534&#x27;, &#x27;NA20535&#x27;, &#x27;NA20536&#x27;,\n",
+       "       &#x27;NA20537&#x27;, &#x27;NA20538&#x27;, &#x27;NA20539&#x27;, &#x27;NA20540&#x27;, &#x27;NA20541&#x27;, &#x27;NA20542&#x27;,\n",
+       "       &#x27;NA20543&#x27;, &#x27;NA20544&#x27;, &#x27;NA20581&#x27;, &#x27;NA20582&#x27;, &#x27;NA20585&#x27;, &#x27;NA20586&#x27;,\n",
+       "       &#x27;NA20588&#x27;, &#x27;NA20589&#x27;, &#x27;NA20752&#x27;, &#x27;NA20753&#x27;, &#x27;NA20754&#x27;, &#x27;NA20755&#x27;,\n",
+       "       &#x27;NA20756&#x27;, &#x27;NA20757&#x27;, &#x27;NA20758&#x27;, &#x27;NA20759&#x27;, &#x27;NA20760&#x27;, &#x27;NA20761&#x27;,\n",
+       "       &#x27;NA20765&#x27;, &#x27;NA20769&#x27;, &#x27;NA20770&#x27;, &#x27;NA20771&#x27;, &#x27;NA20772&#x27;, &#x27;NA20773&#x27;,\n",
+       "       &#x27;NA20774&#x27;, &#x27;NA20775&#x27;, &#x27;NA20778&#x27;, &#x27;NA20783&#x27;, &#x27;NA20785&#x27;, &#x27;NA20786&#x27;,\n",
+       "       &#x27;NA20787&#x27;, &#x27;NA20790&#x27;, &#x27;NA20792&#x27;, &#x27;NA20795&#x27;, &#x27;NA20796&#x27;, &#x27;NA20797&#x27;,\n",
+       "       &#x27;NA20798&#x27;, &#x27;NA20799&#x27;, &#x27;NA20800&#x27;, &#x27;NA20801&#x27;, &#x27;NA20802&#x27;, &#x27;NA20803&#x27;,\n",
+       "       &#x27;NA20804&#x27;, &#x27;NA20805&#x27;, &#x27;NA20806&#x27;, &#x27;NA20807&#x27;, &#x27;NA20808&#x27;, &#x27;NA20809&#x27;,\n",
+       "       &#x27;NA20810&#x27;, &#x27;NA20811&#x27;, &#x27;NA20812&#x27;, &#x27;NA20813&#x27;, &#x27;NA20814&#x27;, &#x27;NA20815&#x27;,\n",
+       "       &#x27;NA20816&#x27;, &#x27;NA20818&#x27;, &#x27;NA20819&#x27;, &#x27;NA20826&#x27;, &#x27;NA20828&#x27;], dtype=&#x27;&lt;U7&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0 0 0 0 1 1 0 1 ... 0 0 0 0 0 0 0 0</div><input id='attrs-c81f544a-e564-420e-aa45-03c0c3fcb884' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-c81f544a-e564-420e-aa45-03c0c3fcb884' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-035d86c6-70eb-4916-baeb-6ec8b2084b8f' class='xr-var-data-in' type='checkbox'><label for='data-035d86c6-70eb-4916-baeb-6ec8b2084b8f' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[ 0,  0],\n",
+       "        [ 0,  0],\n",
+       "        [ 1,  1],\n",
+       "        ...,\n",
+       "        [ 0,  1],\n",
+       "        [ 1,  1],\n",
+       "        [ 1,  0]],\n",
+       "\n",
+       "       [[-1, -1],\n",
+       "        [-1, -1],\n",
+       "        [-1, -1],\n",
+       "        ...,\n",
+       "        [-1, -1],\n",
+       "        [-1, -1],\n",
+       "        [-1, -1]],\n",
+       "\n",
+       "       [[ 0,  0],\n",
+       "        [ 0,  0],\n",
+       "        [ 0,  0],\n",
+       "        ...,\n",
+       "        [ 0,  0],\n",
+       "        [ 0,  0],\n",
+       "        [ 0,  0]]], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype_mask</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>bool</div><div class='xr-var-preview xr-preview'>False False False ... False False</div><input id='attrs-4f538aab-26a1-4465-ad21-f5b6fbaf7997' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-4f538aab-26a1-4465-ad21-f5b6fbaf7997' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-7c65ab14-edd8-4a1a-8d79-cfeb8990238b' class='xr-var-data-in' type='checkbox'><label for='data-7c65ab14-edd8-4a1a-8d79-cfeb8990238b' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]],\n",
+       "\n",
+       "       [[ True,  True],\n",
+       "        [ True,  True],\n",
+       "        [ True,  True],\n",
+       "        ...,\n",
+       "        [ True,  True],\n",
+       "        [ True,  True],\n",
+       "        [ True,  True]],\n",
+       "\n",
+       "       [[False, False],\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        ...,\n",
+       "        [False, False],\n",
+       "        [False, False],\n",
+       "        [False, False]]])</pre></li></ul></div></li><li class='xr-section-item'><input id='section-c201c05a-80ed-426e-b1ac-36c930a981f6' class='xr-section-summary-in' type='checkbox'  checked><label for='section-c201c05a-80ed-426e-b1ac-36c930a981f6' class='xr-section-summary' >Attributes: <span>(1)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><dl class='xr-attrs'><dt><span>contigs :</span></dt><dd>[b&#x27;2&#x27;]</dd></dl></div></li></ul></div></div>"
+      ],
+      "text/plain": [
+       "<xarray.Dataset>\n",
+       "Dimensions:             (alleles: 2, ploidy: 2, samples: 629, variants: 3)\n",
+       "Dimensions without coordinates: alleles, ploidy, samples, variants\n",
+       "Data variables:\n",
+       "    variant/contig      (variants) int32 0 0 0\n",
+       "    variant/position    (variants) int32 39967768 39967778 39967793\n",
+       "    variant/alleles     (variants, alleles) |S1 b'T' b'A' b'G' b'C' b'C' b'T'\n",
+       "    sample/id           (samples) <U7 'HG00098' 'HG00100' ... 'NA20828'\n",
+       "    call/genotype       (variants, samples, ploidy) int32 0 0 0 0 1 ... 0 0 0 0\n",
+       "    call/genotype_mask  (variants, samples, ploidy) bool False False ... False\n",
+       "Attributes:\n",
+       "    contigs:  [b'2']"
+      ]
+     },
+     "execution_count": 14,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "genotype_xarray_dataset"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Done!\n",
+    "\n",
+    "Now we have our Xarray dataset that we can use with the rest of Sgkit!"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python [conda env:root] *",
+   "language": "python",
+   "name": "conda-root-py"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.7.6"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
diff --git a/docs/examples/notebooks/Genotype-Call-Dataset-Minimal-Numpy-Example.ipynb b/docs/examples/notebooks/Genotype-Call-Dataset-Minimal-Numpy-Example.ipynb
new file mode 100644
index 000000000..8e88a8e93
--- /dev/null
+++ b/docs/examples/notebooks/Genotype-Call-Dataset-Minimal-Numpy-Example.ipynb
@@ -0,0 +1,547 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Minimal Numpy Example\n",
+    "\n",
+    "A central point to the SGkit API is the Genotype Call Dataset. This is the data structure that most of the other functions use. It uses [Xarray](http://xarray.pydata.org/en/stable/) underneath the hood to give a programmatic interface that allows for the backend to be several different data files.\n",
+    "\n",
+    "The Xarray itself is *sort of* a transposed VCF file.\n",
+    "\n",
+    "For this particular example we are going to use a minimal set of numpy arrays in order to create a small Genotype Call Dataset. \n",
+    "\n",
+    "This is only meant to demonstrate the datatypes that we feed into the Xarray dataset. For a more conceptual understanding please check out the `Genotype-Call-Dataset-From-VCF.ipynb`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import numpy as np\n",
+    "import zarr\n",
+    "import pandas as pd\n",
+    "import dask.array as da\n",
+    "import allel\n",
+    "from pprint import pprint\n",
+    "import matplotlib.pyplot as plt\n",
+    "%matplotlib inline"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Prep Work - Install Packages\n",
+    "\n",
+    "SGKit is still under rapid development, so I'm installing based on a commit. "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "#! pip install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Numpy Representations of the Variant Data\n",
+    "\n",
+    "We need to prepare for our XArray dataset by converting these to Numpy arrays.\n",
+    "\n",
+    "If you're wondering how I know what these are you can check out the `sgkit.api.create_genotype_call_dataset`. The exact functions are `check_array_like` and make sure that these are numpy arrays of a particular type.\n",
+    "\n",
+    "```\n",
+    "check_array_like(variant_contig, kind=\"i\", ndim=1)\n",
+    "check_array_like(variant_position, kind=\"i\", ndim=1)\n",
+    "check_array_like(variant_alleles, kind=\"S\", ndim=2)\n",
+    "check_array_like(sample_id, kind=\"U\", ndim=1)\n",
+    "check_array_like(call_genotype, kind=\"i\", ndim=3)\n",
+    "```"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "variant_contig_names = ['3R']\n",
+    "# the variant contig is the index of the chr in the variant_contig_names\n",
+    "# because we always prefer numbers over strings!\n",
+    "variant_contig = np.array([0], dtype='i')\n",
+    "variant_position = np.array([1], dtype='i')\n",
+    "variant_alleles = np.array([['A', 'T']], dtype='S')\n",
+    "\n",
+    "sample_id = np.array(['sample-1'], dtype='U')\n",
+    "call_genotype_phased = None\n",
+    "variant_id = None"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(1, 1, 2)"
+      ]
+     },
+     "execution_count": 4,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# The genotype is \n",
+    "#         \"call/genotype\": ([DIM_VARIANT, DIM_SAMPLE, DIM_PLOIDY], call_genotype),\n",
+    "# and needs to be type 'i'\n",
+    "# You can also look at the GenotypeChunkedArray\n",
+    "call_genotype = np.array([[[0, 0]]], dtype='i')\n",
+    "call_genotype.shape"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "This is correct! We have 1 variant, 1 sample, 1 biallelic call."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Convert to Genotype Call Dataset\n",
+    "\n",
+    "Finally! Let's convert this to the Genotype Call Dataset!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import sgkit\n",
+    "\n",
+    "genotype_xarray_dataset = sgkit.api.create_genotype_call_dataset(\n",
+    "    variant_contig_names = variant_contig_names,\n",
+    "    variant_contig = variant_contig,\n",
+    "    variant_position = variant_position,\n",
+    "    variant_alleles = variant_alleles,\n",
+    "    sample_id = sample_id,\n",
+    "    call_genotype = call_genotype,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div><svg style=\"position: absolute; width: 0; height: 0; overflow: hidden\">\n",
+       "<defs>\n",
+       "<symbol id=\"icon-database\" viewBox=\"0 0 32 32\">\n",
+       "<title>Show/Hide data repr</title>\n",
+       "<path d=\"M16 0c-8.837 0-16 2.239-16 5v4c0 2.761 7.163 5 16 5s16-2.239 16-5v-4c0-2.761-7.163-5-16-5z\"></path>\n",
+       "<path d=\"M16 17c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z\"></path>\n",
+       "<path d=\"M16 26c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z\"></path>\n",
+       "</symbol>\n",
+       "<symbol id=\"icon-file-text2\" viewBox=\"0 0 32 32\">\n",
+       "<title>Show/Hide attributes</title>\n",
+       "<path d=\"M28.681 7.159c-0.694-0.947-1.662-2.053-2.724-3.116s-2.169-2.030-3.116-2.724c-1.612-1.182-2.393-1.319-2.841-1.319h-15.5c-1.378 0-2.5 1.121-2.5 2.5v27c0 1.378 1.122 2.5 2.5 2.5h23c1.378 0 2.5-1.122 2.5-2.5v-19.5c0-0.448-0.137-1.23-1.319-2.841zM24.543 5.457c0.959 0.959 1.712 1.825 2.268 2.543h-4.811v-4.811c0.718 0.556 1.584 1.309 2.543 2.268zM28 29.5c0 0.271-0.229 0.5-0.5 0.5h-23c-0.271 0-0.5-0.229-0.5-0.5v-27c0-0.271 0.229-0.5 0.5-0.5 0 0 15.499-0 15.5 0v7c0 0.552 0.448 1 1 1h7v19.5z\"></path>\n",
+       "<path d=\"M23 26h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "<path d=\"M23 22h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "<path d=\"M23 18h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z\"></path>\n",
+       "</symbol>\n",
+       "</defs>\n",
+       "</svg>\n",
+       "<style>/* CSS stylesheet for displaying xarray objects in jupyterlab.\n",
+       " *\n",
+       " */\n",
+       "\n",
+       ":root {\n",
+       "  --xr-font-color0: var(--jp-content-font-color0, rgba(0, 0, 0, 1));\n",
+       "  --xr-font-color2: var(--jp-content-font-color2, rgba(0, 0, 0, 0.54));\n",
+       "  --xr-font-color3: var(--jp-content-font-color3, rgba(0, 0, 0, 0.38));\n",
+       "  --xr-border-color: var(--jp-border-color2, #e0e0e0);\n",
+       "  --xr-disabled-color: var(--jp-layout-color3, #bdbdbd);\n",
+       "  --xr-background-color: var(--jp-layout-color0, white);\n",
+       "  --xr-background-color-row-even: var(--jp-layout-color1, white);\n",
+       "  --xr-background-color-row-odd: var(--jp-layout-color2, #eeeeee);\n",
+       "}\n",
+       "\n",
+       ".xr-wrap {\n",
+       "  min-width: 300px;\n",
+       "  max-width: 700px;\n",
+       "}\n",
+       "\n",
+       ".xr-header {\n",
+       "  padding-top: 6px;\n",
+       "  padding-bottom: 6px;\n",
+       "  margin-bottom: 4px;\n",
+       "  border-bottom: solid 1px var(--xr-border-color);\n",
+       "}\n",
+       "\n",
+       ".xr-header > div,\n",
+       ".xr-header > ul {\n",
+       "  display: inline;\n",
+       "  margin-top: 0;\n",
+       "  margin-bottom: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-obj-type,\n",
+       ".xr-array-name {\n",
+       "  margin-left: 2px;\n",
+       "  margin-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-obj-type {\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-sections {\n",
+       "  padding-left: 0 !important;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 150px auto auto 1fr 20px 20px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input + label {\n",
+       "  color: var(--xr-disabled-color);\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input:enabled + label {\n",
+       "  cursor: pointer;\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-section-item input:enabled + label:hover {\n",
+       "  color: var(--xr-font-color0);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary {\n",
+       "  grid-column: 1;\n",
+       "  color: var(--xr-font-color2);\n",
+       "  font-weight: 500;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary > span {\n",
+       "  display: inline-block;\n",
+       "  padding-left: 0.5em;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:disabled + label {\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in + label:before {\n",
+       "  display: inline-block;\n",
+       "  content: '►';\n",
+       "  font-size: 11px;\n",
+       "  width: 15px;\n",
+       "  text-align: center;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:disabled + label:before {\n",
+       "  color: var(--xr-disabled-color);\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked + label:before {\n",
+       "  content: '▼';\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked + label > span {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary,\n",
+       ".xr-section-inline-details {\n",
+       "  padding-top: 4px;\n",
+       "  padding-bottom: 4px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-inline-details {\n",
+       "  grid-column: 2 / -1;\n",
+       "}\n",
+       "\n",
+       ".xr-section-details {\n",
+       "  display: none;\n",
+       "  grid-column: 1 / -1;\n",
+       "  margin-bottom: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-section-summary-in:checked ~ .xr-section-details {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-array-wrap {\n",
+       "  grid-column: 1 / -1;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 20px auto;\n",
+       "}\n",
+       "\n",
+       ".xr-array-wrap > label {\n",
+       "  grid-column: 1;\n",
+       "  vertical-align: top;\n",
+       "}\n",
+       "\n",
+       ".xr-preview {\n",
+       "  color: var(--xr-font-color3);\n",
+       "}\n",
+       "\n",
+       ".xr-array-preview,\n",
+       ".xr-array-data {\n",
+       "  padding: 0 5px !important;\n",
+       "  grid-column: 2;\n",
+       "}\n",
+       "\n",
+       ".xr-array-data,\n",
+       ".xr-array-in:checked ~ .xr-array-preview {\n",
+       "  display: none;\n",
+       "}\n",
+       "\n",
+       ".xr-array-in:checked ~ .xr-array-data,\n",
+       ".xr-array-preview {\n",
+       "  display: inline-block;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list {\n",
+       "  display: inline-block !important;\n",
+       "  list-style: none;\n",
+       "  padding: 0 !important;\n",
+       "  margin: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list li {\n",
+       "  display: inline-block;\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list:before {\n",
+       "  content: '(';\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list:after {\n",
+       "  content: ')';\n",
+       "}\n",
+       "\n",
+       ".xr-dim-list li:not(:last-child):after {\n",
+       "  content: ',';\n",
+       "  padding-right: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-has-index {\n",
+       "  font-weight: bold;\n",
+       "}\n",
+       "\n",
+       ".xr-var-list,\n",
+       ".xr-var-item {\n",
+       "  display: contents;\n",
+       "}\n",
+       "\n",
+       ".xr-var-item > div,\n",
+       ".xr-var-item label,\n",
+       ".xr-var-item > .xr-var-name span {\n",
+       "  background-color: var(--xr-background-color-row-even);\n",
+       "  margin-bottom: 0;\n",
+       "}\n",
+       "\n",
+       ".xr-var-item > .xr-var-name:hover span {\n",
+       "  padding-right: 5px;\n",
+       "}\n",
+       "\n",
+       ".xr-var-list > li:nth-child(odd) > div,\n",
+       ".xr-var-list > li:nth-child(odd) > label,\n",
+       ".xr-var-list > li:nth-child(odd) > .xr-var-name span {\n",
+       "  background-color: var(--xr-background-color-row-odd);\n",
+       "}\n",
+       "\n",
+       ".xr-var-name {\n",
+       "  grid-column: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-var-dims {\n",
+       "  grid-column: 2;\n",
+       "}\n",
+       "\n",
+       ".xr-var-dtype {\n",
+       "  grid-column: 3;\n",
+       "  text-align: right;\n",
+       "  color: var(--xr-font-color2);\n",
+       "}\n",
+       "\n",
+       ".xr-var-preview {\n",
+       "  grid-column: 4;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name,\n",
+       ".xr-var-dims,\n",
+       ".xr-var-dtype,\n",
+       ".xr-preview,\n",
+       ".xr-attrs dt {\n",
+       "  white-space: nowrap;\n",
+       "  overflow: hidden;\n",
+       "  text-overflow: ellipsis;\n",
+       "  padding-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name:hover,\n",
+       ".xr-var-dims:hover,\n",
+       ".xr-var-dtype:hover,\n",
+       ".xr-attrs dt:hover {\n",
+       "  overflow: visible;\n",
+       "  width: auto;\n",
+       "  z-index: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-var-attrs,\n",
+       ".xr-var-data {\n",
+       "  display: none;\n",
+       "  background-color: var(--xr-background-color) !important;\n",
+       "  padding-bottom: 5px !important;\n",
+       "}\n",
+       "\n",
+       ".xr-var-attrs-in:checked ~ .xr-var-attrs,\n",
+       ".xr-var-data-in:checked ~ .xr-var-data {\n",
+       "  display: block;\n",
+       "}\n",
+       "\n",
+       ".xr-var-data > table {\n",
+       "  float: right;\n",
+       "}\n",
+       "\n",
+       ".xr-var-name span,\n",
+       ".xr-var-data,\n",
+       ".xr-attrs {\n",
+       "  padding-left: 25px !important;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs,\n",
+       ".xr-var-attrs,\n",
+       ".xr-var-data {\n",
+       "  grid-column: 1 / -1;\n",
+       "}\n",
+       "\n",
+       "dl.xr-attrs {\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "  display: grid;\n",
+       "  grid-template-columns: 125px auto;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt, dd {\n",
+       "  padding: 0;\n",
+       "  margin: 0;\n",
+       "  float: left;\n",
+       "  padding-right: 10px;\n",
+       "  width: auto;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt {\n",
+       "  font-weight: normal;\n",
+       "  grid-column: 1;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dt:hover span {\n",
+       "  display: inline-block;\n",
+       "  background: var(--xr-background-color);\n",
+       "  padding-right: 10px;\n",
+       "}\n",
+       "\n",
+       ".xr-attrs dd {\n",
+       "  grid-column: 2;\n",
+       "  white-space: pre-wrap;\n",
+       "  word-break: break-all;\n",
+       "}\n",
+       "\n",
+       ".xr-icon-database,\n",
+       ".xr-icon-file-text2 {\n",
+       "  display: inline-block;\n",
+       "  vertical-align: middle;\n",
+       "  width: 1em;\n",
+       "  height: 1.5em !important;\n",
+       "  stroke-width: 0;\n",
+       "  stroke: currentColor;\n",
+       "  fill: currentColor;\n",
+       "}\n",
+       "</style><div class='xr-wrap'><div class='xr-header'><div class='xr-obj-type'>xarray.Dataset</div></div><ul class='xr-sections'><li class='xr-section-item'><input id='section-b8323804-c4f7-4b65-a6ac-1289a3840a2a' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-b8323804-c4f7-4b65-a6ac-1289a3840a2a' class='xr-section-summary'  title='Expand/collapse section'>Dimensions:</label><div class='xr-section-inline-details'><ul class='xr-dim-list'><li><span>alleles</span>: 2</li><li><span>ploidy</span>: 2</li><li><span>samples</span>: 1</li><li><span>variants</span>: 1</li></ul></div><div class='xr-section-details'></div></li><li class='xr-section-item'><input id='section-b7290721-2b6d-4afe-b858-d99f72aa2e67' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-b7290721-2b6d-4afe-b858-d99f72aa2e67' class='xr-section-summary'  title='Expand/collapse section'>Coordinates: <span>(0)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'></ul></div></li><li class='xr-section-item'><input id='section-0243c879-ecc0-4d9f-a3bc-8e1a6128e6ef' class='xr-section-summary-in' type='checkbox'  checked><label for='section-0243c879-ecc0-4d9f-a3bc-8e1a6128e6ef' class='xr-section-summary' >Data variables: <span>(6)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'><li class='xr-var-item'><div class='xr-var-name'><span>variant/contig</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0</div><input id='attrs-83b83547-5616-4a87-8272-77dde5cd1cca' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-83b83547-5616-4a87-8272-77dde5cd1cca' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-217ca109-651a-47c0-ba1e-e7352bcfc259' class='xr-var-data-in' type='checkbox'><label for='data-217ca109-651a-47c0-ba1e-e7352bcfc259' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([0], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/position</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>1</div><input id='attrs-c5cf4ac8-8a10-4a1e-a510-3666350d0845' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-c5cf4ac8-8a10-4a1e-a510-3666350d0845' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-ba64cea5-7610-4bed-af9e-ccbeb399cae3' class='xr-var-data-in' type='checkbox'><label for='data-ba64cea5-7610-4bed-af9e-ccbeb399cae3' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([1], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/alleles</span></div><div class='xr-var-dims'>(variants, alleles)</div><div class='xr-var-dtype'>|S1</div><div class='xr-var-preview xr-preview'>b&#x27;A&#x27; b&#x27;T&#x27;</div><input id='attrs-21aa5a67-377d-4d6c-b351-c9e0aec82140' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-21aa5a67-377d-4d6c-b351-c9e0aec82140' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-6f4e6459-601c-41f6-aa9a-b6fdc633b2f9' class='xr-var-data-in' type='checkbox'><label for='data-6f4e6459-601c-41f6-aa9a-b6fdc633b2f9' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[b&#x27;A&#x27;, b&#x27;T&#x27;]], dtype=&#x27;|S1&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>sample/id</span></div><div class='xr-var-dims'>(samples)</div><div class='xr-var-dtype'>&lt;U8</div><div class='xr-var-preview xr-preview'>&#x27;sample-1&#x27;</div><input id='attrs-0e0b1e0b-db93-4435-bcf8-62a5cba7e309' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-0e0b1e0b-db93-4435-bcf8-62a5cba7e309' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-d0a71236-17f1-4b2d-8088-637dfeea1e79' class='xr-var-data-in' type='checkbox'><label for='data-d0a71236-17f1-4b2d-8088-637dfeea1e79' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([&#x27;sample-1&#x27;], dtype=&#x27;&lt;U8&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0 0</div><input id='attrs-e0043ffc-9fe3-4f1e-9c43-79d066ffb555' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-e0043ffc-9fe3-4f1e-9c43-79d066ffb555' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-fbebeafa-8d8c-485f-b95b-1cf9242db2c5' class='xr-var-data-in' type='checkbox'><label for='data-fbebeafa-8d8c-485f-b95b-1cf9242db2c5' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[0, 0]]], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype_mask</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>bool</div><div class='xr-var-preview xr-preview'>False False</div><input id='attrs-9f562165-a4b3-435c-bbb2-e70f18c3a65f' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-9f562165-a4b3-435c-bbb2-e70f18c3a65f' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-59b01160-9eb5-4d9c-a30f-6ba7e1e3d5d6' class='xr-var-data-in' type='checkbox'><label for='data-59b01160-9eb5-4d9c-a30f-6ba7e1e3d5d6' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[False, False]]])</pre></li></ul></div></li><li class='xr-section-item'><input id='section-05acbe9a-d603-47fc-9ddf-f2eb952c5f30' class='xr-section-summary-in' type='checkbox'  checked><label for='section-05acbe9a-d603-47fc-9ddf-f2eb952c5f30' class='xr-section-summary' >Attributes: <span>(1)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><dl class='xr-attrs'><dt><span>contigs :</span></dt><dd>[&#x27;3R&#x27;]</dd></dl></div></li></ul></div></div>"
+      ],
+      "text/plain": [
+       "<xarray.Dataset>\n",
+       "Dimensions:             (alleles: 2, ploidy: 2, samples: 1, variants: 1)\n",
+       "Dimensions without coordinates: alleles, ploidy, samples, variants\n",
+       "Data variables:\n",
+       "    variant/contig      (variants) int32 0\n",
+       "    variant/position    (variants) int32 1\n",
+       "    variant/alleles     (variants, alleles) |S1 b'A' b'T'\n",
+       "    sample/id           (samples) <U8 'sample-1'\n",
+       "    call/genotype       (variants, samples, ploidy) int32 0 0\n",
+       "    call/genotype_mask  (variants, samples, ploidy) bool False False\n",
+       "Attributes:\n",
+       "    contigs:  ['3R']"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "genotype_xarray_dataset"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Done!\n",
+    "\n",
+    "Now we have our Xarray dataset that we can use with the rest of Sgkit!"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python [conda env:root] *",
+   "language": "python",
+   "name": "conda-root-py"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.7.6"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
diff --git a/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-SGKit-Zarr.rst b/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-SGKit-Zarr.rst
new file mode 100644
index 000000000..bd9291e68
--- /dev/null
+++ b/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-SGKit-Zarr.rst
@@ -0,0 +1,901 @@
+Load From Malaria Gen Zarr
+==========================
+
+A central point to the SGkit API is the Genotype Call Dataset. This is
+the data structure that most of the other functions use. It uses
+`Xarray <http://xarray.pydata.org/en/stable/>`__ underneath the hood to
+give a programmatic interface that allows for the backend to be several
+different data files.
+
+The Xarray itself is *sort of* a transposed VCF file.
+
+For this example we are going to from the preprocessed zarr to the sgkit
+Genotype Call XArray Dataset.
+
+This is only meant to demonstrate the datatypes that we feed into the
+Xarray dataset. For a more conceptual understanding please check out the
+``Genotype-Call-Dataset-From-VCF.ipynb``.
+
+.. code:: ipython3
+
+    import numpy as np
+    import zarr
+    import pandas as pd
+    import dask.array as da
+    import allel
+    from pprint import pprint
+    import matplotlib.pyplot as plt
+    %matplotlib inline
+
+Create a Dask Cluster
+---------------------
+
+This isn’t that important for this example, but SGkit can use Dask under
+the hood for many of it’s calculations. Divide and conquer your
+statistical genomics data!
+
+.. code:: ipython3
+
+    from dask_kubernetes import KubeCluster
+    cluster = KubeCluster(n_workers=30, silence_logs='error')
+    cluster
+
+
+
+.. parsed-literal::
+
+    VBox(children=(HTML(value='<h2>KubeCluster</h2>'), HBox(children=(HTML(value='\n<div>\n  <style scoped>\n    .…
+
+
+Import sgkit
+------------
+
+.. code:: ipython3
+
+    ! pip install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc
+
+
+.. parsed-literal::
+
+    Collecting git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc
+      Cloning https://github.com/pystatgen/sgkit (to revision 96203d471531e7e2416d4dd9b48ca11d660a1bcc) to /tmp/pip-req-build-spafo9uc
+      Running command git clone -q https://github.com/pystatgen/sgkit /tmp/pip-req-build-spafo9uc
+      Running command git checkout -q 96203d471531e7e2416d4dd9b48ca11d660a1bcc
+    Requirement already satisfied (use --upgrade to upgrade): sgkit==0.1.dev67+g96203d4 from git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc in /opt/conda/lib/python3.7/site-packages
+    Requirement already satisfied: numpy in /opt/conda/lib/python3.7/site-packages (from sgkit==0.1.dev67+g96203d4) (1.18.4)
+    Requirement already satisfied: xarray in /opt/conda/lib/python3.7/site-packages (from sgkit==0.1.dev67+g96203d4) (0.15.1)
+    Requirement already satisfied: setuptools>=41.2 in /opt/conda/lib/python3.7/site-packages (from sgkit==0.1.dev67+g96203d4) (47.1.1.post20200529)
+    Requirement already satisfied: pandas>=0.25 in /opt/conda/lib/python3.7/site-packages (from xarray->sgkit==0.1.dev67+g96203d4) (1.0.4)
+    Requirement already satisfied: pytz>=2017.2 in /opt/conda/lib/python3.7/site-packages (from pandas>=0.25->xarray->sgkit==0.1.dev67+g96203d4) (2020.1)
+    Requirement already satisfied: python-dateutil>=2.6.1 in /opt/conda/lib/python3.7/site-packages (from pandas>=0.25->xarray->sgkit==0.1.dev67+g96203d4) (2.8.1)
+    Requirement already satisfied: six>=1.5 in /opt/conda/lib/python3.7/site-packages (from python-dateutil>=2.6.1->pandas>=0.25->xarray->sgkit==0.1.dev67+g96203d4) (1.15.0)
+    Building wheels for collected packages: sgkit
+      Building wheel for sgkit (setup.py) ... [?25ldone
+    [?25h  Created wheel for sgkit: filename=sgkit-0.1.dev67+g96203d4-py3-none-any.whl size=19421 sha256=c682d510de78a11f035a936d6497f20de3c505d14b166dc23297208e2d98bda1
+      Stored in directory: /home/jovyan/.cache/pip/wheels/6f/2b/6e/48d20c382bb6a66ea96c6dee6e6e575ea88180fef1e96a9024
+    Successfully built sgkit
+
+
+.. code:: ipython3
+
+    import sgkit
+    help(sgkit.api.create_genotype_call_dataset)
+
+
+.. parsed-literal::
+
+    Help on function create_genotype_call_dataset in module sgkit.api:
+    
+    create_genotype_call_dataset(*, variant_contig_names: List[str], variant_contig: Any, variant_position: Any, variant_alleles: Any, sample_id: Any, call_genotype: Any, call_genotype_phased: Any = None, variant_id: Any = None) -> xarray.core.dataset.Dataset
+        Create a dataset of genotype calls.
+        
+        Parameters
+        ----------
+        variant_contig_names : list of str
+            The contig names.
+        variant_contig : array_like, int
+            The (index of the) contig for each variant.
+        variant_position : array_like, int
+            The reference position of the variant.
+        variant_alleles : array_like, S1
+            The possible alleles for the variant.
+        sample_id : array_like, str
+            The unique identifier of the sample.
+        call_genotype : array_like, int
+            Genotype, encoded as allele values (0 for the reference, 1 for
+            the first allele, 2 for the second allele), or -1 to indicate a
+            missing value.
+        call_genotype_phased : array_like, bool, optional
+            A flag for each call indicating if it is phased or not. If
+            omitted all calls are unphased.
+        variant_id: array_like, str, optional
+            The unique identifier of the variant.
+        
+        Returns
+        -------
+        xr.Dataset
+            The dataset of genotype calls.
+    
+
+
+Get the Malaria Gen Zarr Data
+-----------------------------
+
+The `zarr <https://zarr.readthedocs.io/en/stable>`__ data is hosted in a
+google cloud bucket, or available for download from the public FTP site.
+
+.. code:: ipython3
+
+    import gcsfs
+    
+    gcs_bucket_fs = gcsfs.GCSFileSystem(project='malariagen-jupyterhub', token='anon', access='read_only')
+    
+    storage_path = 'ag1000g-release/phase2.AR1/variation/main/zarr/pass/ag1000g.phase2.ar1.pass'
+    store = gcsfs.mapping.GCSMap(storage_path, gcs=gcs_bucket_fs, check=False, create=False)
+    callset = zarr.Group(store)
+
+If you explore the zarr data you will see that it is mostly the VCF
+data, with a few fields pre calculated for convenience.
+
+.. code:: ipython3
+
+    print(callset['samples'])
+
+
+.. parsed-literal::
+
+    <zarr.core.Array '/samples' (1142,) object>
+
+
+.. code:: ipython3
+
+    chrom = '3R'
+    print(callset[chrom].tree())
+
+
+.. parsed-literal::
+
+    3R
+     ├── calldata
+     │   └── GT (14481509, 1142, 2) int8
+     ├── samples (1142,) object
+     └── variants
+         ├── ABHet (14481509,) float32
+         ├── ABHom (14481509,) float32
+         ├── AC (14481509, 3) int32
+         ├── AF (14481509, 3) float32
+         ├── ALT (14481509, 3) |S1
+         ├── AN (14481509,) int32
+         ├── Accessible (14481509,) bool
+         ├── BaseCounts (14481509, 4) int32
+         ├── BaseQRankSum (14481509,) float32
+         ├── Coverage (14481509,) int32
+         ├── CoverageMQ0 (14481509,) int32
+         ├── DP (14481509,) int32
+         ├── DS (14481509,) bool
+         ├── Dels (14481509,) float32
+         ├── FILTER_BaseQRankSum (14481509,) bool
+         ├── FILTER_FS (14481509,) bool
+         ├── FILTER_HRun (14481509,) bool
+         ├── FILTER_HighCoverage (14481509,) bool
+         ├── FILTER_HighMQ0 (14481509,) bool
+         ├── FILTER_LowCoverage (14481509,) bool
+         ├── FILTER_LowMQ (14481509,) bool
+         ├── FILTER_LowQual (14481509,) bool
+         ├── FILTER_NoCoverage (14481509,) bool
+         ├── FILTER_PASS (14481509,) bool
+         ├── FILTER_QD (14481509,) bool
+         ├── FILTER_ReadPosRankSum (14481509,) bool
+         ├── FILTER_RefN (14481509,) bool
+         ├── FILTER_RepeatDUST (14481509,) bool
+         ├── FS (14481509,) float32
+         ├── HRun (14481509,) int32
+         ├── HW (14481509,) float32
+         ├── HaplotypeScore (14481509,) float32
+         ├── HighCoverage (14481509,) int32
+         ├── HighMQ0 (14481509,) int32
+         ├── InbreedingCoeff (14481509,) float32
+         ├── LowCoverage (14481509,) int32
+         ├── LowMQ (14481509,) int32
+         ├── LowPairing (14481509,) int32
+         ├── MLEAC (14481509, 3) int32
+         ├── MLEAF (14481509, 3) float32
+         ├── MQ (14481509,) float32
+         ├── MQ0 (14481509,) int32
+         ├── MQRankSum (14481509,) float32
+         ├── NDA (14481509,) int32
+         ├── NoCoverage (14481509,) int32
+         ├── OND (14481509,) float32
+         ├── POS (14481509,) int32
+         ├── QD (14481509,) float32
+         ├── QUAL (14481509,) float32
+         ├── REF (14481509,) |S1
+         ├── RPA (14481509,) int32
+         ├── RU (14481509,) object
+         ├── ReadPosRankSum (14481509,) float32
+         ├── RefMasked (14481509,) bool
+         ├── RefN (14481509,) bool
+         ├── RepeatDUST (14481509,) bool
+         ├── RepeatMasker (14481509,) bool
+         ├── RepeatTRF (14481509,) bool
+         ├── STR (14481509,) bool
+         ├── VariantType (14481509,) object
+         ├── altlen (14481509, 3) int32
+         ├── is_snp (14481509,) bool
+         └── numalt (14481509,) int32
+
+
+Get the Call Data
+-----------------
+
+.. code:: ipython3
+
+    chrom = '3R'
+    calldata = callset[chrom]['calldata']
+    
+    # TODO Will this be changed for SGKit?
+    genotypes = allel.GenotypeChunkedArray(calldata['GT'])
+    genotypes
+
+
+
+
+.. raw:: html
+
+    <div class="allel allel-DisplayAs2D"><span>&lt;GenotypeChunkedArray shape=(14481509, 1142, 2) dtype=int8 chunks=(524288, 61, 2)
+       nbytes=30.8G cbytes=-1 cratio=-33075766556.0
+       compression=blosc compression_opts={'cname': 'zstd', 'clevel': 1, 'shuffle': -1, 'blocksize': 0}
+       values=zarr.core.Array&gt;</span><table><thead><tr><th></th><th style="text-align: center">0</th><th style="text-align: center">1</th><th style="text-align: center">2</th><th style="text-align: center">3</th><th style="text-align: center">4</th><th style="text-align: center">...</th><th style="text-align: center">1137</th><th style="text-align: center">1138</th><th style="text-align: center">1139</th><th style="text-align: center">1140</th><th style="text-align: center">1141</th></tr></thead><tbody><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">0</th><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">1</th><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">2</th><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">...</th><td style="text-align: center" colspan="12">...</td></tr><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">14481506</th><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">14481507</th><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr><tr><th style="text-align: center; background-color: white; border-right: 1px solid black; ">14481508</th><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr></tbody></table></div>
+
+
+
+Genotype Chunked Array Data Structure
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+When looking at the ``allel.GenotypeChunkedArray`` we see that we have:
+GenotypeChunkedArray shape=(14481509, 1142, 2)
+
+The shape corresponds to ``variants``, ``samples``, ``alleles``.
+
+For every index of a variant we have the alleles of each of the samples.
+
+So let’s get all the sample data for the first variant.
+
+.. code:: ipython3
+
+    genotypes[0]
+
+
+
+
+.. raw:: html
+
+    <div class="allel allel-DisplayAs1D"><span>&lt;GenotypeVector shape=(1142, 2) dtype=int8&gt;</span><table><thead><tr><th style="text-align: center">0</th><th style="text-align: center">1</th><th style="text-align: center">2</th><th style="text-align: center">3</th><th style="text-align: center">4</th><th style="text-align: center">...</th><th style="text-align: center">1137</th><th style="text-align: center">1138</th><th style="text-align: center">1139</th><th style="text-align: center">1140</th><th style="text-align: center">1141</th></tr></thead><tbody><tr><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">...</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td><td style="text-align: center">0/0</td></tr></tbody></table></div>
+
+
+
+And now let’s look at the first variant call for the first sample.
+
+.. code:: ipython3
+
+    genotypes[0][0]
+
+
+
+
+.. parsed-literal::
+
+    array([0, 0], dtype=int8)
+
+
+
+You can see above that for sample[0] the allele is 0/0, meaning it is
+homozygous for the reference.
+
+Get the Samples
+---------------
+
+.. code:: ipython3
+
+    samples = callset['samples']
+    sample_id = np.array(samples, dtype='U')
+
+.. code:: ipython3
+
+    sample_id[0:5]
+
+
+
+
+.. parsed-literal::
+
+    array(['AA0040-C', 'AA0041-C', 'AA0042-C', 'AA0043-C', 'AA0044-C'],
+          dtype='<U8')
+
+
+
+Grab the Variant Positions
+--------------------------
+
+Get the positions of each variant
+
+.. code:: ipython3
+
+    variant_position = callset[chrom]['variants/POS']
+
+Let’s investigate some of the attributes of our numpy array.
+
+.. code:: ipython3
+
+    print(variant_position.shape)
+    print(variant_position.dtype.kind)
+
+
+.. parsed-literal::
+
+    (14481509,)
+    i
+
+
+Grab the Reference Alleles
+--------------------------
+
+For each variant we need the reference and the alternate.
+
+.. code:: ipython3
+
+    variant_ref = callset[chrom]['variants/REF']
+    variant_ref
+
+
+
+
+.. parsed-literal::
+
+    <zarr.core.Array '/3R/variants/REF' (14481509,) |S1>
+
+
+
+.. code:: ipython3
+
+    variant_alt = callset[chrom]['variants/ALT']
+    variant_alt
+
+
+
+
+.. parsed-literal::
+
+    <zarr.core.Array '/3R/variants/ALT' (14481509, 3) |S1>
+
+
+
+Now, instead of having 2 separate variant arrays, we want an np array of
+:
+
+.. code:: python
+
+
+   [ 
+       # variant position index
+       [ ref, alt ],
+   ]    
+
+.. code:: ipython3
+
+    # the alternate lists all possible variants. we'll just grab the first, but really we should filter out any variants that aren't biallelic
+    variant_alleles = np.column_stack((variant_ref, variant_alt[:,0]))
+    variant_contig = np.zeros(len(variant_alleles))
+
+.. code:: ipython3
+
+    variant_contig[0:10]
+
+
+
+
+.. parsed-literal::
+
+    array([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])
+
+
+
+.. code:: ipython3
+
+    variant_alleles[0:10]
+
+
+
+
+.. parsed-literal::
+
+    array([[b'A', b'G'],
+           [b'A', b'T'],
+           [b'T', b'C'],
+           [b'G', b'A'],
+           [b'T', b'A'],
+           [b'A', b'G'],
+           [b'G', b'C'],
+           [b'C', b'T'],
+           [b'C', b'T'],
+           [b'G', b'A']], dtype='|S1')
+
+
+
+Create the Xarray Genotype Callset
+----------------------------------
+
+.. code:: ipython3
+
+    # You can use the dataset_size to create a smaller dataset if you're just exploring
+    
+    #dataset_size = len(variant_alleles)
+    variant_contig_names = [chrom]
+    call_genotype = genotypes
+    dataset_size = 10000
+    variant_contig = np.zeros(dataset_size)
+    variant_position = variant_position[0:dataset_size]
+    variant_alleles = variant_alleles[0:dataset_size]
+    call_genotype = call_genotype[0:dataset_size]
+
+.. code:: ipython3
+
+    genotype_xarray_dataset = sgkit.api.create_genotype_call_dataset(
+        variant_contig_names = variant_contig_names,
+        # these are all on the 0th contig, because we only have one contig
+        variant_contig = np.zeros(len(variant_position), dtype='int'),
+        variant_position = variant_position,
+        variant_alleles = variant_alleles,
+        sample_id = sample_id,
+        call_genotype = call_genotype,
+    )
+
+.. code:: ipython3
+
+    genotype_xarray_dataset
+
+
+
+
+.. raw:: html
+
+    <div><svg style="position: absolute; width: 0; height: 0; overflow: hidden">
+    <defs>
+    <symbol id="icon-database" viewBox="0 0 32 32">
+    <title>Show/Hide data repr</title>
+    <path d="M16 0c-8.837 0-16 2.239-16 5v4c0 2.761 7.163 5 16 5s16-2.239 16-5v-4c0-2.761-7.163-5-16-5z"></path>
+    <path d="M16 17c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z"></path>
+    <path d="M16 26c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z"></path>
+    </symbol>
+    <symbol id="icon-file-text2" viewBox="0 0 32 32">
+    <title>Show/Hide attributes</title>
+    <path d="M28.681 7.159c-0.694-0.947-1.662-2.053-2.724-3.116s-2.169-2.030-3.116-2.724c-1.612-1.182-2.393-1.319-2.841-1.319h-15.5c-1.378 0-2.5 1.121-2.5 2.5v27c0 1.378 1.122 2.5 2.5 2.5h23c1.378 0 2.5-1.122 2.5-2.5v-19.5c0-0.448-0.137-1.23-1.319-2.841zM24.543 5.457c0.959 0.959 1.712 1.825 2.268 2.543h-4.811v-4.811c0.718 0.556 1.584 1.309 2.543 2.268zM28 29.5c0 0.271-0.229 0.5-0.5 0.5h-23c-0.271 0-0.5-0.229-0.5-0.5v-27c0-0.271 0.229-0.5 0.5-0.5 0 0 15.499-0 15.5 0v7c0 0.552 0.448 1 1 1h7v19.5z"></path>
+    <path d="M23 26h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    <path d="M23 22h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    <path d="M23 18h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    </symbol>
+    </defs>
+    </svg>
+    <style>/* CSS stylesheet for displaying xarray objects in jupyterlab.
+     *
+     */
+    
+    :root {
+      --xr-font-color0: var(--jp-content-font-color0, rgba(0, 0, 0, 1));
+      --xr-font-color2: var(--jp-content-font-color2, rgba(0, 0, 0, 0.54));
+      --xr-font-color3: var(--jp-content-font-color3, rgba(0, 0, 0, 0.38));
+      --xr-border-color: var(--jp-border-color2, #e0e0e0);
+      --xr-disabled-color: var(--jp-layout-color3, #bdbdbd);
+      --xr-background-color: var(--jp-layout-color0, white);
+      --xr-background-color-row-even: var(--jp-layout-color1, white);
+      --xr-background-color-row-odd: var(--jp-layout-color2, #eeeeee);
+    }
+    
+    .xr-wrap {
+      min-width: 300px;
+      max-width: 700px;
+    }
+    
+    .xr-header {
+      padding-top: 6px;
+      padding-bottom: 6px;
+      margin-bottom: 4px;
+      border-bottom: solid 1px var(--xr-border-color);
+    }
+    
+    .xr-header > div,
+    .xr-header > ul {
+      display: inline;
+      margin-top: 0;
+      margin-bottom: 0;
+    }
+    
+    .xr-obj-type,
+    .xr-array-name {
+      margin-left: 2px;
+      margin-right: 10px;
+    }
+    
+    .xr-obj-type {
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-sections {
+      padding-left: 0 !important;
+      display: grid;
+      grid-template-columns: 150px auto auto 1fr 20px 20px;
+    }
+    
+    .xr-section-item {
+      display: contents;
+    }
+    
+    .xr-section-item input {
+      display: none;
+    }
+    
+    .xr-section-item input + label {
+      color: var(--xr-disabled-color);
+    }
+    
+    .xr-section-item input:enabled + label {
+      cursor: pointer;
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-section-item input:enabled + label:hover {
+      color: var(--xr-font-color0);
+    }
+    
+    .xr-section-summary {
+      grid-column: 1;
+      color: var(--xr-font-color2);
+      font-weight: 500;
+    }
+    
+    .xr-section-summary > span {
+      display: inline-block;
+      padding-left: 0.5em;
+    }
+    
+    .xr-section-summary-in:disabled + label {
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-section-summary-in + label:before {
+      display: inline-block;
+      content: '►';
+      font-size: 11px;
+      width: 15px;
+      text-align: center;
+    }
+    
+    .xr-section-summary-in:disabled + label:before {
+      color: var(--xr-disabled-color);
+    }
+    
+    .xr-section-summary-in:checked + label:before {
+      content: '▼';
+    }
+    
+    .xr-section-summary-in:checked + label > span {
+      display: none;
+    }
+    
+    .xr-section-summary,
+    .xr-section-inline-details {
+      padding-top: 4px;
+      padding-bottom: 4px;
+    }
+    
+    .xr-section-inline-details {
+      grid-column: 2 / -1;
+    }
+    
+    .xr-section-details {
+      display: none;
+      grid-column: 1 / -1;
+      margin-bottom: 5px;
+    }
+    
+    .xr-section-summary-in:checked ~ .xr-section-details {
+      display: contents;
+    }
+    
+    .xr-array-wrap {
+      grid-column: 1 / -1;
+      display: grid;
+      grid-template-columns: 20px auto;
+    }
+    
+    .xr-array-wrap > label {
+      grid-column: 1;
+      vertical-align: top;
+    }
+    
+    .xr-preview {
+      color: var(--xr-font-color3);
+    }
+    
+    .xr-array-preview,
+    .xr-array-data {
+      padding: 0 5px !important;
+      grid-column: 2;
+    }
+    
+    .xr-array-data,
+    .xr-array-in:checked ~ .xr-array-preview {
+      display: none;
+    }
+    
+    .xr-array-in:checked ~ .xr-array-data,
+    .xr-array-preview {
+      display: inline-block;
+    }
+    
+    .xr-dim-list {
+      display: inline-block !important;
+      list-style: none;
+      padding: 0 !important;
+      margin: 0;
+    }
+    
+    .xr-dim-list li {
+      display: inline-block;
+      padding: 0;
+      margin: 0;
+    }
+    
+    .xr-dim-list:before {
+      content: '(';
+    }
+    
+    .xr-dim-list:after {
+      content: ')';
+    }
+    
+    .xr-dim-list li:not(:last-child):after {
+      content: ',';
+      padding-right: 5px;
+    }
+    
+    .xr-has-index {
+      font-weight: bold;
+    }
+    
+    .xr-var-list,
+    .xr-var-item {
+      display: contents;
+    }
+    
+    .xr-var-item > div,
+    .xr-var-item label,
+    .xr-var-item > .xr-var-name span {
+      background-color: var(--xr-background-color-row-even);
+      margin-bottom: 0;
+    }
+    
+    .xr-var-item > .xr-var-name:hover span {
+      padding-right: 5px;
+    }
+    
+    .xr-var-list > li:nth-child(odd) > div,
+    .xr-var-list > li:nth-child(odd) > label,
+    .xr-var-list > li:nth-child(odd) > .xr-var-name span {
+      background-color: var(--xr-background-color-row-odd);
+    }
+    
+    .xr-var-name {
+      grid-column: 1;
+    }
+    
+    .xr-var-dims {
+      grid-column: 2;
+    }
+    
+    .xr-var-dtype {
+      grid-column: 3;
+      text-align: right;
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-var-preview {
+      grid-column: 4;
+    }
+    
+    .xr-var-name,
+    .xr-var-dims,
+    .xr-var-dtype,
+    .xr-preview,
+    .xr-attrs dt {
+      white-space: nowrap;
+      overflow: hidden;
+      text-overflow: ellipsis;
+      padding-right: 10px;
+    }
+    
+    .xr-var-name:hover,
+    .xr-var-dims:hover,
+    .xr-var-dtype:hover,
+    .xr-attrs dt:hover {
+      overflow: visible;
+      width: auto;
+      z-index: 1;
+    }
+    
+    .xr-var-attrs,
+    .xr-var-data {
+      display: none;
+      background-color: var(--xr-background-color) !important;
+      padding-bottom: 5px !important;
+    }
+    
+    .xr-var-attrs-in:checked ~ .xr-var-attrs,
+    .xr-var-data-in:checked ~ .xr-var-data {
+      display: block;
+    }
+    
+    .xr-var-data > table {
+      float: right;
+    }
+    
+    .xr-var-name span,
+    .xr-var-data,
+    .xr-attrs {
+      padding-left: 25px !important;
+    }
+    
+    .xr-attrs,
+    .xr-var-attrs,
+    .xr-var-data {
+      grid-column: 1 / -1;
+    }
+    
+    dl.xr-attrs {
+      padding: 0;
+      margin: 0;
+      display: grid;
+      grid-template-columns: 125px auto;
+    }
+    
+    .xr-attrs dt, dd {
+      padding: 0;
+      margin: 0;
+      float: left;
+      padding-right: 10px;
+      width: auto;
+    }
+    
+    .xr-attrs dt {
+      font-weight: normal;
+      grid-column: 1;
+    }
+    
+    .xr-attrs dt:hover span {
+      display: inline-block;
+      background: var(--xr-background-color);
+      padding-right: 10px;
+    }
+    
+    .xr-attrs dd {
+      grid-column: 2;
+      white-space: pre-wrap;
+      word-break: break-all;
+    }
+    
+    .xr-icon-database,
+    .xr-icon-file-text2 {
+      display: inline-block;
+      vertical-align: middle;
+      width: 1em;
+      height: 1.5em !important;
+      stroke-width: 0;
+      stroke: currentColor;
+      fill: currentColor;
+    }
+    </style><div class='xr-wrap'><div class='xr-header'><div class='xr-obj-type'>xarray.Dataset</div></div><ul class='xr-sections'><li class='xr-section-item'><input id='section-85bd6079-b21e-4423-9363-f7c316d71d81' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-85bd6079-b21e-4423-9363-f7c316d71d81' class='xr-section-summary'  title='Expand/collapse section'>Dimensions:</label><div class='xr-section-inline-details'><ul class='xr-dim-list'><li><span>alleles</span>: 2</li><li><span>ploidy</span>: 2</li><li><span>samples</span>: 1142</li><li><span>variants</span>: 10000</li></ul></div><div class='xr-section-details'></div></li><li class='xr-section-item'><input id='section-73b6c7b4-350e-4024-b920-7f01a632a3b2' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-73b6c7b4-350e-4024-b920-7f01a632a3b2' class='xr-section-summary'  title='Expand/collapse section'>Coordinates: <span>(0)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'></ul></div></li><li class='xr-section-item'><input id='section-3ad84f58-790d-438f-b0f9-a39d1e46e8a0' class='xr-section-summary-in' type='checkbox'  checked><label for='section-3ad84f58-790d-438f-b0f9-a39d1e46e8a0' class='xr-section-summary' >Data variables: <span>(6)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'><li class='xr-var-item'><div class='xr-var-name'><span>variant/contig</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int64</div><div class='xr-var-preview xr-preview'>0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0</div><input id='attrs-4b27bac2-0d2b-4bad-8649-5c179b58e3d9' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-4b27bac2-0d2b-4bad-8649-5c179b58e3d9' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-e20535dd-ea7c-43cc-b65b-20b6186f8650' class='xr-var-data-in' type='checkbox'><label for='data-e20535dd-ea7c-43cc-b65b-20b6186f8650' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([0, 0, 0, ..., 0, 0, 0])</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/position</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>9526 9531 9536 ... 64416 64418</div><input id='attrs-e0163554-f274-4624-848d-3dc34ec6b8ac' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-e0163554-f274-4624-848d-3dc34ec6b8ac' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-0f951624-5df0-4fdd-9c35-70e2e6107200' class='xr-var-data-in' type='checkbox'><label for='data-0f951624-5df0-4fdd-9c35-70e2e6107200' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([ 9526,  9531,  9536, ..., 64411, 64416, 64418], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/alleles</span></div><div class='xr-var-dims'>(variants, alleles)</div><div class='xr-var-dtype'>|S1</div><div class='xr-var-preview xr-preview'>b&#x27;A&#x27; b&#x27;G&#x27; b&#x27;A&#x27; ... b&#x27;T&#x27; b&#x27;T&#x27; b&#x27;C&#x27;</div><input id='attrs-719d93aa-cb5e-48a6-96a2-34a59f173076' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-719d93aa-cb5e-48a6-96a2-34a59f173076' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-53e8c636-c715-4ac2-862f-8add9122ebf6' class='xr-var-data-in' type='checkbox'><label for='data-53e8c636-c715-4ac2-862f-8add9122ebf6' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[b&#x27;A&#x27;, b&#x27;G&#x27;],
+           [b&#x27;A&#x27;, b&#x27;T&#x27;],
+           [b&#x27;T&#x27;, b&#x27;C&#x27;],
+           ...,
+           [b&#x27;A&#x27;, b&#x27;T&#x27;],
+           [b&#x27;G&#x27;, b&#x27;T&#x27;],
+           [b&#x27;T&#x27;, b&#x27;C&#x27;]], dtype=&#x27;|S1&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>sample/id</span></div><div class='xr-var-dims'>(samples)</div><div class='xr-var-dtype'>&lt;U8</div><div class='xr-var-preview xr-preview'>&#x27;AA0040-C&#x27; ... &#x27;AY0091-C&#x27;</div><input id='attrs-dd2848f1-998d-4243-bb96-4e260abde3ee' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-dd2848f1-998d-4243-bb96-4e260abde3ee' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-906c912d-488e-4742-b6ce-e173786e9eb5' class='xr-var-data-in' type='checkbox'><label for='data-906c912d-488e-4742-b6ce-e173786e9eb5' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([&#x27;AA0040-C&#x27;, &#x27;AA0041-C&#x27;, &#x27;AA0042-C&#x27;, ..., &#x27;AY0089-C&#x27;, &#x27;AY0090-C&#x27;,
+           &#x27;AY0091-C&#x27;], dtype=&#x27;&lt;U8&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>int8</div><div class='xr-var-preview xr-preview'>0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0</div><input id='attrs-2e40a5da-eff2-4b99-9365-3e96e671f4f8' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-2e40a5da-eff2-4b99-9365-3e96e671f4f8' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-30918c2a-e2bd-4ac0-9eae-3dcf06c59fd9' class='xr-var-data-in' type='checkbox'><label for='data-30918c2a-e2bd-4ac0-9eae-3dcf06c59fd9' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[0, 0],
+            [0, 0],
+            [0, 0],
+            ...,
+            [0, 0],
+            [0, 0],
+            [0, 0]],
+    
+           [[0, 0],
+            [0, 0],
+            [0, 0],
+            ...,
+            [0, 0],
+            [0, 0],
+            [0, 0]],
+    
+           [[0, 0],
+            [0, 0],
+            [0, 0],
+            ...,
+            [0, 0],
+            [0, 0],
+            [0, 0]],
+    
+           ...,
+    
+           [[0, 0],
+            [0, 0],
+            [0, 0],
+            ...,
+            [0, 0],
+            [0, 0],
+            [0, 0]],
+    
+           [[0, 0],
+            [0, 0],
+            [0, 0],
+            ...,
+            [0, 0],
+            [0, 0],
+            [0, 0]],
+    
+           [[0, 0],
+            [0, 0],
+            [0, 0],
+            ...,
+            [0, 0],
+            [0, 0],
+            [0, 0]]], dtype=int8)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype_mask</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>bool</div><div class='xr-var-preview xr-preview'>False False False ... False False</div><input id='attrs-43e0655e-1427-4555-9fda-f83a93d51b36' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-43e0655e-1427-4555-9fda-f83a93d51b36' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-c19bd47a-ed1d-494e-a759-79f767654ce5' class='xr-var-data-in' type='checkbox'><label for='data-c19bd47a-ed1d-494e-a759-79f767654ce5' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]],
+    
+           [[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]],
+    
+           [[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]],
+    
+           ...,
+    
+           [[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]],
+    
+           [[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]],
+    
+           [[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]]])</pre></li></ul></div></li><li class='xr-section-item'><input id='section-fb447238-dd8a-4813-9881-9ad56bee98e4' class='xr-section-summary-in' type='checkbox'  checked><label for='section-fb447238-dd8a-4813-9881-9ad56bee98e4' class='xr-section-summary' >Attributes: <span>(1)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><dl class='xr-attrs'><dt><span>contigs :</span></dt><dd>[&#x27;3R&#x27;]</dd></dl></div></li></ul></div></div>
+
+
diff --git a/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-VCF.rst b/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-VCF.rst
new file mode 100644
index 000000000..c6ff0cf45
--- /dev/null
+++ b/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-From-VCF.rst
@@ -0,0 +1,923 @@
+Load From a VCF File Example
+============================
+
+A central point to the SGkit API is the Genotype Call Dataset. This is
+the data structure that most of the other functions use. It uses
+`Xarray <http://xarray.pydata.org/en/stable/>`__ underneath the hood to
+give a programmatic interface that allows for the backend to be several
+different data files.
+
+The Xarray itself is *sort of* a transposed VCF file.
+
+For this particular example we are going to go from a VCF file to the
+Genotype Call DataSet.
+
+**Please note that in the real world you should not read in your VCF
+files like this, but instead use the functionality in sgkit to go from a
+VCF to a Zarr file.**
+
+We are starting from the VCF file in order to give a conceptual
+understanding of the data structure itself.
+
+.. code:: ipython3
+
+    import numpy as np
+    import zarr
+    import pandas as pd
+    import dask.array as da
+    import allel
+    from pprint import pprint
+    import matplotlib.pyplot as plt
+    %matplotlib inline
+
+Prep Work - Install Packages
+----------------------------
+
+SGKit is still under rapid development, so I’m installing based on a
+commit.
+
+.. code:: ipython3
+
+    #! pip install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc
+
+Install PyVCF
+~~~~~~~~~~~~~
+
+You’ll need to install PyVCF, samtools and tabix in order to run this
+example as is.
+
+PyVCF needs to be in the same kernel in order to use it, but tabix can
+be installed anywhere.
+
+.. code:: ipython3
+
+    # Or install to your existing environment
+    # ! conda install -c bioconda -c conda-forge -y pyvcf samtools tabix
+    
+    
+    # Uncomment these to create a new conda environment and install these packages
+    # If you create a new environment you will have to switch your jupyterhub kernel
+    # ! conda create -n samtools -c bioconda -c conda-forge -y samtools pyvcf samtools tabix
+    # ! conda activate samtools 
+    
+    # ! tabix -h ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20100804/ALL.2of4intersection.20100804.genotypes.vcf.gz 2:39967768-39967768 > chr2.vcf
+    # ls -lah chr2.vcf
+
+Grab Some Data
+--------------
+
+We’re going to grab a small subset of a VCF file from the `1000 Genomes
+Project. <https://www.internationalgenome.org/faq/how-do-i-get-sub-section-vcf-file/>`__.
+We’re only going to grab 3 calls, which is fine for our purposes.
+
+These calls are also already biallelic. I cheat. ;-)
+
+.. code:: ipython3
+
+    # I couldn't run this from jupyterhub but needed an actual terminal
+    #! conda activate samtools 
+    #! tabix -h ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20100804/ALL.2of4intersection.20100804.genotypes.vcf.gz 2:39967768-39967800 > chr2.vcf
+    #! conda deactivate
+    # ls -lah chr2.vcf
+
+.. code:: ipython3
+
+    import vcf
+    import os
+
+Let’s write up a quick, *not to be used in the real world*, parser to
+grab data about the variants.
+
+-  Variant Contig Names - A unique list of all the chromosomes and
+   contigs
+-  Variant Contig - an index of the variant_contig_names list.
+-  Variant Position - Position on the chromosome
+-  Variant Reference and Alternate
+-  Samples
+-  Genotype calls per sample - with missing encoded as -1
+
+.. code:: ipython3
+
+    vcf_reader = vcf.Reader(open('/home/jovyan/chr2.vcf', 'r'))
+    
+    # I already know these come from chr2
+    # but let's grab them anyways
+    variant_contig_names = []
+    
+    variant_chrom = []
+    variant_position = []
+    variant_alleles = []
+    variant_contig = []
+    
+    sample_id = []
+    call_genotype = []
+    
+    count = 0
+    
+    for record in vcf_reader:
+        
+        chrom = str(record.CHROM)
+        if chrom not in variant_contig_names:
+            variant_contig_names.append(chrom)
+            
+        # Grab the index of the contig
+        variant_contig.append(variant_contig_names.index(chrom))
+        
+        # Get the variant data
+        # I'm cheating and only getting the first alternate. In the real world you would filter for biallelic variants.
+        variant_alleles.append([str(record.REF), str(record.ALT[0])])
+        variant_position.append(record.POS)
+        
+        # the sample records is an object that has call data       
+        samples = record.samples
+        
+        # Grab the sample names
+        if count == 0:
+            for sample in samples:
+                sample_id.append(sample.sample)
+        
+        # Grab the call data for each sample for the variant
+        variant_genotypes = []
+        for sample in samples:
+            # If its missing encode as -1, -1
+            if sample['GT'] == './.':
+                variant_genotypes.append([-1, -1])
+            else:
+                GT = sample['GT'].split('|')
+                variant_genotypes.append([int(GT[0]), int(GT[1])])
+        
+        call_genotype.append(variant_genotypes)
+        count = count + 1
+
+Convert to Numpy
+----------------
+
+Now that we have our data, we need to prepare for our XArray dataset by
+converting these to Numpy arrays.
+
+If you’re wondering how I know what these are you can check out the
+``sgkit.api.create_genotype_call_dataset``. The exact functions are
+``check_array_like`` and make sure that these are numpy arrays of a
+particular type.
+
+::
+
+   check_array_like(variant_contig, kind="i", ndim=1)
+   check_array_like(variant_position, kind="i", ndim=1)
+   check_array_like(variant_alleles, kind="S", ndim=2)
+   check_array_like(sample_id, kind="U", ndim=1)
+   check_array_like(call_genotype, kind="i", ndim=3)
+
+.. code:: ipython3
+
+    sample_id = np.array(sample_id, dtype='U')
+    variant_position = np.array(variant_position, dtype='i')
+    variant_alleles = np.array(variant_alleles, dtype='S')
+    variant_contig_names = np.array(variant_contig_names, dtype='S')
+    variant_contig = np.array(variant_contig, dtype='i')
+
+Understanding Variant Contig and Variant Position
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+The Genotype Call Xarray dataset is meant to be able to incorporate
+multiple chromosomes.
+
+Let’s say we have variant calls from chrs 1 and 2, which we read into an
+array ``['chr1','chr2']``.
+
+.. code:: ipython3
+
+    import pandas as pd
+
+.. code:: ipython3
+
+    contigs = ['chr1', 'chr2']
+        
+    df = pd.DataFrame({
+                        'variant_contig_index': [0, 0, 1, 1],
+                        'variant_position': [1, 2, 1, 2],
+                        })
+    df
+
+
+
+
+.. raw:: html
+
+    <div>
+    <style scoped>
+        .dataframe tbody tr th:only-of-type {
+            vertical-align: middle;
+        }
+    
+        .dataframe tbody tr th {
+            vertical-align: top;
+        }
+    
+        .dataframe thead th {
+            text-align: right;
+        }
+    </style>
+    <table border="1" class="dataframe">
+      <thead>
+        <tr style="text-align: right;">
+          <th></th>
+          <th>variant_contig_index</th>
+          <th>variant_position</th>
+        </tr>
+      </thead>
+      <tbody>
+        <tr>
+          <th>0</th>
+          <td>0</td>
+          <td>1</td>
+        </tr>
+        <tr>
+          <th>1</th>
+          <td>0</td>
+          <td>2</td>
+        </tr>
+        <tr>
+          <th>2</th>
+          <td>1</td>
+          <td>1</td>
+        </tr>
+        <tr>
+          <th>3</th>
+          <td>1</td>
+          <td>2</td>
+        </tr>
+      </tbody>
+    </table>
+    </div>
+
+
+
+The Xarray dataset looks like the dataframe above.
+
+When we initialize the Xarray dataset we will give it a list of contigs
+(or chromosomes). We don’t need to explicitly list the contig per
+position because we can calculate this based on the contig index.
+
+**Contig**: ``contigs[row['variant_contig_index']]``
+
+**Position**: ``row['variant_position']``
+
+.. code:: ipython3
+
+    def return_contig(row):
+        return 'Chr: {chr} Pos: {pos}'.format(chr=contigs[row['variant_contig_index']], pos=row['variant_position'])
+    
+    df['description'] = df.apply(lambda row: return_contig(row), axis=1)
+    
+    df
+
+
+
+
+.. raw:: html
+
+    <div>
+    <style scoped>
+        .dataframe tbody tr th:only-of-type {
+            vertical-align: middle;
+        }
+    
+        .dataframe tbody tr th {
+            vertical-align: top;
+        }
+    
+        .dataframe thead th {
+            text-align: right;
+        }
+    </style>
+    <table border="1" class="dataframe">
+      <thead>
+        <tr style="text-align: right;">
+          <th></th>
+          <th>variant_contig_index</th>
+          <th>variant_position</th>
+          <th>description</th>
+        </tr>
+      </thead>
+      <tbody>
+        <tr>
+          <th>0</th>
+          <td>0</td>
+          <td>1</td>
+          <td>Chr: chr1 Pos: 1</td>
+        </tr>
+        <tr>
+          <th>1</th>
+          <td>0</td>
+          <td>2</td>
+          <td>Chr: chr1 Pos: 2</td>
+        </tr>
+        <tr>
+          <th>2</th>
+          <td>1</td>
+          <td>1</td>
+          <td>Chr: chr2 Pos: 1</td>
+        </tr>
+        <tr>
+          <th>3</th>
+          <td>1</td>
+          <td>2</td>
+          <td>Chr: chr2 Pos: 2</td>
+        </tr>
+      </tbody>
+    </table>
+    </div>
+
+
+
+Genotype Calls
+~~~~~~~~~~~~~~
+
+If we’ve done our work right we our genotypes should have the shape:
+``[DIM_VARIANT, DIM_SAMPLE, DIM_PLOIDY]``, meaning the first axis is the
+number of variants, the second the number of samples, and the third the
+ploidy. In our case we are working with diploid alleles.
+
+Our genotype array has this structure:
+
+.. code:: python
+
+   genotypes = [
+
+       # Outermost array should have a length = the number of variants
+       
+       # variant chr 1 position 1
+       [
+           # Per variant we should have an array length = number of samples
+           
+           # sample 1 
+           # Per sample we should have an array length = number of alleles
+           [call, call],
+           
+           # sample 2
+           [call, call]
+       ],
+       
+       # variant chr 1 position 2
+       [
+           # sample 1 
+           [call, call],
+           # sample 2
+           [call, call]
+       ],
+       
+   ]
+
+.. code:: ipython3
+
+    call_genotype = np.array(call_genotype, dtype='i')
+    call_genotype.shape
+
+
+
+
+.. parsed-literal::
+
+    (3, 629, 2)
+
+
+
+This is correct! We have 3 variants, 629 samples, and diploid alleles.
+
+Convert to Genotype Call Dataset
+--------------------------------
+
+Finally! Let’s convert this to the Genotype Call Dataset!
+
+.. code:: ipython3
+
+    variant_alleles
+
+
+
+
+.. parsed-literal::
+
+    array([[b'T', b'A'],
+           [b'G', b'C'],
+           [b'C', b'T']], dtype='|S1')
+
+
+
+.. code:: ipython3
+
+    import sgkit
+    
+    genotype_xarray_dataset = sgkit.api.create_genotype_call_dataset(
+        variant_contig_names = variant_contig_names,
+        # Since we know these are all from the same chromosome we could just calculate this on the fly as a np array of zeros
+        #variant_contig = np.zeros(len(variant_position)),
+        variant_contig = variant_contig,
+        variant_position = variant_position,
+        variant_alleles = variant_alleles,
+        sample_id = sample_id,
+        call_genotype = call_genotype,
+    )
+
+.. code:: ipython3
+
+    genotype_xarray_dataset
+
+
+
+
+.. raw:: html
+
+    <div><svg style="position: absolute; width: 0; height: 0; overflow: hidden">
+    <defs>
+    <symbol id="icon-database" viewBox="0 0 32 32">
+    <title>Show/Hide data repr</title>
+    <path d="M16 0c-8.837 0-16 2.239-16 5v4c0 2.761 7.163 5 16 5s16-2.239 16-5v-4c0-2.761-7.163-5-16-5z"></path>
+    <path d="M16 17c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z"></path>
+    <path d="M16 26c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z"></path>
+    </symbol>
+    <symbol id="icon-file-text2" viewBox="0 0 32 32">
+    <title>Show/Hide attributes</title>
+    <path d="M28.681 7.159c-0.694-0.947-1.662-2.053-2.724-3.116s-2.169-2.030-3.116-2.724c-1.612-1.182-2.393-1.319-2.841-1.319h-15.5c-1.378 0-2.5 1.121-2.5 2.5v27c0 1.378 1.122 2.5 2.5 2.5h23c1.378 0 2.5-1.122 2.5-2.5v-19.5c0-0.448-0.137-1.23-1.319-2.841zM24.543 5.457c0.959 0.959 1.712 1.825 2.268 2.543h-4.811v-4.811c0.718 0.556 1.584 1.309 2.543 2.268zM28 29.5c0 0.271-0.229 0.5-0.5 0.5h-23c-0.271 0-0.5-0.229-0.5-0.5v-27c0-0.271 0.229-0.5 0.5-0.5 0 0 15.499-0 15.5 0v7c0 0.552 0.448 1 1 1h7v19.5z"></path>
+    <path d="M23 26h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    <path d="M23 22h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    <path d="M23 18h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    </symbol>
+    </defs>
+    </svg>
+    <style>/* CSS stylesheet for displaying xarray objects in jupyterlab.
+     *
+     */
+    
+    :root {
+      --xr-font-color0: var(--jp-content-font-color0, rgba(0, 0, 0, 1));
+      --xr-font-color2: var(--jp-content-font-color2, rgba(0, 0, 0, 0.54));
+      --xr-font-color3: var(--jp-content-font-color3, rgba(0, 0, 0, 0.38));
+      --xr-border-color: var(--jp-border-color2, #e0e0e0);
+      --xr-disabled-color: var(--jp-layout-color3, #bdbdbd);
+      --xr-background-color: var(--jp-layout-color0, white);
+      --xr-background-color-row-even: var(--jp-layout-color1, white);
+      --xr-background-color-row-odd: var(--jp-layout-color2, #eeeeee);
+    }
+    
+    .xr-wrap {
+      min-width: 300px;
+      max-width: 700px;
+    }
+    
+    .xr-header {
+      padding-top: 6px;
+      padding-bottom: 6px;
+      margin-bottom: 4px;
+      border-bottom: solid 1px var(--xr-border-color);
+    }
+    
+    .xr-header > div,
+    .xr-header > ul {
+      display: inline;
+      margin-top: 0;
+      margin-bottom: 0;
+    }
+    
+    .xr-obj-type,
+    .xr-array-name {
+      margin-left: 2px;
+      margin-right: 10px;
+    }
+    
+    .xr-obj-type {
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-sections {
+      padding-left: 0 !important;
+      display: grid;
+      grid-template-columns: 150px auto auto 1fr 20px 20px;
+    }
+    
+    .xr-section-item {
+      display: contents;
+    }
+    
+    .xr-section-item input {
+      display: none;
+    }
+    
+    .xr-section-item input + label {
+      color: var(--xr-disabled-color);
+    }
+    
+    .xr-section-item input:enabled + label {
+      cursor: pointer;
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-section-item input:enabled + label:hover {
+      color: var(--xr-font-color0);
+    }
+    
+    .xr-section-summary {
+      grid-column: 1;
+      color: var(--xr-font-color2);
+      font-weight: 500;
+    }
+    
+    .xr-section-summary > span {
+      display: inline-block;
+      padding-left: 0.5em;
+    }
+    
+    .xr-section-summary-in:disabled + label {
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-section-summary-in + label:before {
+      display: inline-block;
+      content: '►';
+      font-size: 11px;
+      width: 15px;
+      text-align: center;
+    }
+    
+    .xr-section-summary-in:disabled + label:before {
+      color: var(--xr-disabled-color);
+    }
+    
+    .xr-section-summary-in:checked + label:before {
+      content: '▼';
+    }
+    
+    .xr-section-summary-in:checked + label > span {
+      display: none;
+    }
+    
+    .xr-section-summary,
+    .xr-section-inline-details {
+      padding-top: 4px;
+      padding-bottom: 4px;
+    }
+    
+    .xr-section-inline-details {
+      grid-column: 2 / -1;
+    }
+    
+    .xr-section-details {
+      display: none;
+      grid-column: 1 / -1;
+      margin-bottom: 5px;
+    }
+    
+    .xr-section-summary-in:checked ~ .xr-section-details {
+      display: contents;
+    }
+    
+    .xr-array-wrap {
+      grid-column: 1 / -1;
+      display: grid;
+      grid-template-columns: 20px auto;
+    }
+    
+    .xr-array-wrap > label {
+      grid-column: 1;
+      vertical-align: top;
+    }
+    
+    .xr-preview {
+      color: var(--xr-font-color3);
+    }
+    
+    .xr-array-preview,
+    .xr-array-data {
+      padding: 0 5px !important;
+      grid-column: 2;
+    }
+    
+    .xr-array-data,
+    .xr-array-in:checked ~ .xr-array-preview {
+      display: none;
+    }
+    
+    .xr-array-in:checked ~ .xr-array-data,
+    .xr-array-preview {
+      display: inline-block;
+    }
+    
+    .xr-dim-list {
+      display: inline-block !important;
+      list-style: none;
+      padding: 0 !important;
+      margin: 0;
+    }
+    
+    .xr-dim-list li {
+      display: inline-block;
+      padding: 0;
+      margin: 0;
+    }
+    
+    .xr-dim-list:before {
+      content: '(';
+    }
+    
+    .xr-dim-list:after {
+      content: ')';
+    }
+    
+    .xr-dim-list li:not(:last-child):after {
+      content: ',';
+      padding-right: 5px;
+    }
+    
+    .xr-has-index {
+      font-weight: bold;
+    }
+    
+    .xr-var-list,
+    .xr-var-item {
+      display: contents;
+    }
+    
+    .xr-var-item > div,
+    .xr-var-item label,
+    .xr-var-item > .xr-var-name span {
+      background-color: var(--xr-background-color-row-even);
+      margin-bottom: 0;
+    }
+    
+    .xr-var-item > .xr-var-name:hover span {
+      padding-right: 5px;
+    }
+    
+    .xr-var-list > li:nth-child(odd) > div,
+    .xr-var-list > li:nth-child(odd) > label,
+    .xr-var-list > li:nth-child(odd) > .xr-var-name span {
+      background-color: var(--xr-background-color-row-odd);
+    }
+    
+    .xr-var-name {
+      grid-column: 1;
+    }
+    
+    .xr-var-dims {
+      grid-column: 2;
+    }
+    
+    .xr-var-dtype {
+      grid-column: 3;
+      text-align: right;
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-var-preview {
+      grid-column: 4;
+    }
+    
+    .xr-var-name,
+    .xr-var-dims,
+    .xr-var-dtype,
+    .xr-preview,
+    .xr-attrs dt {
+      white-space: nowrap;
+      overflow: hidden;
+      text-overflow: ellipsis;
+      padding-right: 10px;
+    }
+    
+    .xr-var-name:hover,
+    .xr-var-dims:hover,
+    .xr-var-dtype:hover,
+    .xr-attrs dt:hover {
+      overflow: visible;
+      width: auto;
+      z-index: 1;
+    }
+    
+    .xr-var-attrs,
+    .xr-var-data {
+      display: none;
+      background-color: var(--xr-background-color) !important;
+      padding-bottom: 5px !important;
+    }
+    
+    .xr-var-attrs-in:checked ~ .xr-var-attrs,
+    .xr-var-data-in:checked ~ .xr-var-data {
+      display: block;
+    }
+    
+    .xr-var-data > table {
+      float: right;
+    }
+    
+    .xr-var-name span,
+    .xr-var-data,
+    .xr-attrs {
+      padding-left: 25px !important;
+    }
+    
+    .xr-attrs,
+    .xr-var-attrs,
+    .xr-var-data {
+      grid-column: 1 / -1;
+    }
+    
+    dl.xr-attrs {
+      padding: 0;
+      margin: 0;
+      display: grid;
+      grid-template-columns: 125px auto;
+    }
+    
+    .xr-attrs dt, dd {
+      padding: 0;
+      margin: 0;
+      float: left;
+      padding-right: 10px;
+      width: auto;
+    }
+    
+    .xr-attrs dt {
+      font-weight: normal;
+      grid-column: 1;
+    }
+    
+    .xr-attrs dt:hover span {
+      display: inline-block;
+      background: var(--xr-background-color);
+      padding-right: 10px;
+    }
+    
+    .xr-attrs dd {
+      grid-column: 2;
+      white-space: pre-wrap;
+      word-break: break-all;
+    }
+    
+    .xr-icon-database,
+    .xr-icon-file-text2 {
+      display: inline-block;
+      vertical-align: middle;
+      width: 1em;
+      height: 1.5em !important;
+      stroke-width: 0;
+      stroke: currentColor;
+      fill: currentColor;
+    }
+    </style><div class='xr-wrap'><div class='xr-header'><div class='xr-obj-type'>xarray.Dataset</div></div><ul class='xr-sections'><li class='xr-section-item'><input id='section-2bbbe44c-6042-4d24-99ce-4b04915ab37b' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-2bbbe44c-6042-4d24-99ce-4b04915ab37b' class='xr-section-summary'  title='Expand/collapse section'>Dimensions:</label><div class='xr-section-inline-details'><ul class='xr-dim-list'><li><span>alleles</span>: 2</li><li><span>ploidy</span>: 2</li><li><span>samples</span>: 629</li><li><span>variants</span>: 3</li></ul></div><div class='xr-section-details'></div></li><li class='xr-section-item'><input id='section-dee09919-0251-4a21-8d0e-b973e56a0913' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-dee09919-0251-4a21-8d0e-b973e56a0913' class='xr-section-summary'  title='Expand/collapse section'>Coordinates: <span>(0)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'></ul></div></li><li class='xr-section-item'><input id='section-0e965023-e326-49ff-93a7-f4d8ca5bd61a' class='xr-section-summary-in' type='checkbox'  checked><label for='section-0e965023-e326-49ff-93a7-f4d8ca5bd61a' class='xr-section-summary' >Data variables: <span>(6)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'><li class='xr-var-item'><div class='xr-var-name'><span>variant/contig</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0 0 0</div><input id='attrs-12f058df-66b9-439c-bf6c-01861f0cdc65' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-12f058df-66b9-439c-bf6c-01861f0cdc65' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-419b2e09-7c7a-40c6-9cef-21c7f5f23527' class='xr-var-data-in' type='checkbox'><label for='data-419b2e09-7c7a-40c6-9cef-21c7f5f23527' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([0, 0, 0], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/position</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>39967768 39967778 39967793</div><input id='attrs-5ed2c700-e8c8-47d0-a7a0-6c0fb18a093b' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-5ed2c700-e8c8-47d0-a7a0-6c0fb18a093b' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-424e24df-c940-4792-a686-50ce65f222ba' class='xr-var-data-in' type='checkbox'><label for='data-424e24df-c940-4792-a686-50ce65f222ba' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([39967768, 39967778, 39967793], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/alleles</span></div><div class='xr-var-dims'>(variants, alleles)</div><div class='xr-var-dtype'>|S1</div><div class='xr-var-preview xr-preview'>b&#x27;T&#x27; b&#x27;A&#x27; b&#x27;G&#x27; b&#x27;C&#x27; b&#x27;C&#x27; b&#x27;T&#x27;</div><input id='attrs-9b7f3d8a-c02b-4d17-8b18-0aecd9477eb0' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-9b7f3d8a-c02b-4d17-8b18-0aecd9477eb0' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-c9e8b4ca-a4a9-437a-809b-abb331a6e3ce' class='xr-var-data-in' type='checkbox'><label for='data-c9e8b4ca-a4a9-437a-809b-abb331a6e3ce' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[b&#x27;T&#x27;, b&#x27;A&#x27;],
+           [b&#x27;G&#x27;, b&#x27;C&#x27;],
+           [b&#x27;C&#x27;, b&#x27;T&#x27;]], dtype=&#x27;|S1&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>sample/id</span></div><div class='xr-var-dims'>(samples)</div><div class='xr-var-dtype'>&lt;U7</div><div class='xr-var-preview xr-preview'>&#x27;HG00098&#x27; &#x27;HG00100&#x27; ... &#x27;NA20828&#x27;</div><input id='attrs-0aecc63f-7276-4c87-a95a-492095873f76' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-0aecc63f-7276-4c87-a95a-492095873f76' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-5b980cd7-2fe4-45cc-8026-11b12d23152b' class='xr-var-data-in' type='checkbox'><label for='data-5b980cd7-2fe4-45cc-8026-11b12d23152b' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([&#x27;HG00098&#x27;, &#x27;HG00100&#x27;, &#x27;HG00106&#x27;, &#x27;HG00112&#x27;, &#x27;HG00114&#x27;, &#x27;HG00116&#x27;,
+           &#x27;HG00117&#x27;, &#x27;HG00118&#x27;, &#x27;HG00119&#x27;, &#x27;HG00120&#x27;, &#x27;HG00122&#x27;, &#x27;HG00123&#x27;,
+           &#x27;HG00124&#x27;, &#x27;HG00126&#x27;, &#x27;HG00131&#x27;, &#x27;HG00141&#x27;, &#x27;HG00142&#x27;, &#x27;HG00143&#x27;,
+           &#x27;HG00144&#x27;, &#x27;HG00145&#x27;, &#x27;HG00146&#x27;, &#x27;HG00147&#x27;, &#x27;HG00148&#x27;, &#x27;HG00149&#x27;,
+           &#x27;HG00150&#x27;, &#x27;HG00151&#x27;, &#x27;HG00152&#x27;, &#x27;HG00153&#x27;, &#x27;HG00156&#x27;, &#x27;HG00158&#x27;,
+           &#x27;HG00159&#x27;, &#x27;HG00160&#x27;, &#x27;HG00171&#x27;, &#x27;HG00173&#x27;, &#x27;HG00174&#x27;, &#x27;HG00176&#x27;,
+           &#x27;HG00177&#x27;, &#x27;HG00178&#x27;, &#x27;HG00179&#x27;, &#x27;HG00180&#x27;, &#x27;HG00181&#x27;, &#x27;HG00182&#x27;,
+           &#x27;HG00183&#x27;, &#x27;HG00185&#x27;, &#x27;HG00186&#x27;, &#x27;HG00187&#x27;, &#x27;HG00188&#x27;, &#x27;HG00189&#x27;,
+           &#x27;HG00190&#x27;, &#x27;HG00231&#x27;, &#x27;HG00239&#x27;, &#x27;HG00242&#x27;, &#x27;HG00243&#x27;, &#x27;HG00244&#x27;,
+           &#x27;HG00245&#x27;, &#x27;HG00247&#x27;, &#x27;HG00258&#x27;, &#x27;HG00262&#x27;, &#x27;HG00264&#x27;, &#x27;HG00265&#x27;,
+           &#x27;HG00266&#x27;, &#x27;HG00267&#x27;, &#x27;HG00269&#x27;, &#x27;HG00270&#x27;, &#x27;HG00272&#x27;, &#x27;HG00306&#x27;,
+           &#x27;HG00308&#x27;, &#x27;HG00311&#x27;, &#x27;HG00312&#x27;, &#x27;HG00357&#x27;, &#x27;HG00361&#x27;, &#x27;HG00366&#x27;,
+           &#x27;HG00367&#x27;, &#x27;HG00368&#x27;, &#x27;HG00369&#x27;, &#x27;HG00372&#x27;, &#x27;HG00373&#x27;, &#x27;HG00377&#x27;,
+           &#x27;HG00380&#x27;, &#x27;HG00403&#x27;, &#x27;HG00404&#x27;, &#x27;HG00406&#x27;, &#x27;HG00407&#x27;, &#x27;HG00445&#x27;,
+           &#x27;HG00446&#x27;, &#x27;HG00452&#x27;, &#x27;HG00457&#x27;, &#x27;HG00553&#x27;, &#x27;HG00554&#x27;, &#x27;HG00559&#x27;,
+           &#x27;HG00560&#x27;, &#x27;HG00565&#x27;, &#x27;HG00566&#x27;, &#x27;HG00577&#x27;, &#x27;HG00578&#x27;, &#x27;HG00592&#x27;,
+           &#x27;HG00593&#x27;, &#x27;HG00596&#x27;, &#x27;HG00610&#x27;, &#x27;HG00611&#x27;, &#x27;HG00625&#x27;, &#x27;HG00626&#x27;,
+           &#x27;HG00628&#x27;, &#x27;HG00629&#x27;, &#x27;HG00634&#x27;, &#x27;HG00635&#x27;, &#x27;HG00637&#x27;, &#x27;HG00638&#x27;,
+           &#x27;HG00640&#x27;, &#x27;NA06984&#x27;, &#x27;NA06985&#x27;, &#x27;NA06986&#x27;, &#x27;NA06989&#x27;, &#x27;NA06994&#x27;,
+           &#x27;NA07000&#x27;, &#x27;NA07037&#x27;, &#x27;NA07048&#x27;, &#x27;NA07051&#x27;, &#x27;NA07056&#x27;, &#x27;NA07346&#x27;,
+           &#x27;NA07347&#x27;, &#x27;NA07357&#x27;, &#x27;NA10847&#x27;, &#x27;NA10851&#x27;, &#x27;NA11829&#x27;, &#x27;NA11830&#x27;,
+           &#x27;NA11831&#x27;, &#x27;NA11832&#x27;, &#x27;NA11840&#x27;, &#x27;NA11843&#x27;, &#x27;NA11881&#x27;, &#x27;NA11892&#x27;,
+           &#x27;NA11893&#x27;, &#x27;NA11894&#x27;, &#x27;NA11918&#x27;, &#x27;NA11919&#x27;, &#x27;NA11920&#x27;, &#x27;NA11930&#x27;,
+           &#x27;NA11931&#x27;, &#x27;NA11932&#x27;, &#x27;NA11933&#x27;, &#x27;NA11992&#x27;, &#x27;NA11993&#x27;, &#x27;NA11994&#x27;,
+           &#x27;NA11995&#x27;, &#x27;NA12003&#x27;, &#x27;NA12004&#x27;, &#x27;NA12005&#x27;, &#x27;NA12006&#x27;, &#x27;NA12043&#x27;,
+           &#x27;NA12044&#x27;, &#x27;NA12045&#x27;, &#x27;NA12046&#x27;, &#x27;NA12058&#x27;, &#x27;NA12144&#x27;, &#x27;NA12154&#x27;,
+           &#x27;NA12155&#x27;, &#x27;NA12156&#x27;, &#x27;NA12249&#x27;, &#x27;NA12272&#x27;, &#x27;NA12273&#x27;, &#x27;NA12275&#x27;,
+           &#x27;NA12287&#x27;, &#x27;NA12340&#x27;, &#x27;NA12341&#x27;, &#x27;NA12342&#x27;, &#x27;NA12347&#x27;, &#x27;NA12348&#x27;,
+           &#x27;NA12383&#x27;, &#x27;NA12399&#x27;, &#x27;NA12400&#x27;, &#x27;NA12413&#x27;, &#x27;NA12414&#x27;, &#x27;NA12489&#x27;,
+           &#x27;NA12546&#x27;, &#x27;NA12716&#x27;, &#x27;NA12717&#x27;, &#x27;NA12718&#x27;, &#x27;NA12749&#x27;, &#x27;NA12750&#x27;,
+           &#x27;NA12751&#x27;, &#x27;NA12761&#x27;, &#x27;NA12762&#x27;, &#x27;NA12763&#x27;, &#x27;NA12775&#x27;, &#x27;NA12776&#x27;,
+           &#x27;NA12777&#x27;, &#x27;NA12778&#x27;, &#x27;NA12812&#x27;, &#x27;NA12813&#x27;, &#x27;NA12814&#x27;, &#x27;NA12815&#x27;,
+           &#x27;NA12828&#x27;, &#x27;NA12830&#x27;, &#x27;NA12872&#x27;, &#x27;NA12873&#x27;, &#x27;NA12874&#x27;, &#x27;NA12889&#x27;,
+           &#x27;NA12890&#x27;, &#x27;NA18486&#x27;, &#x27;NA18487&#x27;, &#x27;NA18489&#x27;, &#x27;NA18498&#x27;, &#x27;NA18499&#x27;,
+           &#x27;NA18501&#x27;, &#x27;NA18502&#x27;, &#x27;NA18504&#x27;, &#x27;NA18505&#x27;, &#x27;NA18507&#x27;, &#x27;NA18508&#x27;,
+           &#x27;NA18510&#x27;, &#x27;NA18511&#x27;, &#x27;NA18516&#x27;, &#x27;NA18517&#x27;, &#x27;NA18519&#x27;, &#x27;NA18520&#x27;,
+           &#x27;NA18522&#x27;, &#x27;NA18523&#x27;, &#x27;NA18525&#x27;, &#x27;NA18526&#x27;, &#x27;NA18527&#x27;, &#x27;NA18532&#x27;,
+           &#x27;NA18535&#x27;, &#x27;NA18537&#x27;, &#x27;NA18538&#x27;, &#x27;NA18539&#x27;, &#x27;NA18541&#x27;, &#x27;NA18542&#x27;,
+           &#x27;NA18545&#x27;, &#x27;NA18547&#x27;, &#x27;NA18550&#x27;, &#x27;NA18552&#x27;, &#x27;NA18553&#x27;, &#x27;NA18555&#x27;,
+           &#x27;NA18558&#x27;, &#x27;NA18560&#x27;, &#x27;NA18561&#x27;, &#x27;NA18562&#x27;, &#x27;NA18563&#x27;, &#x27;NA18564&#x27;,
+           &#x27;NA18565&#x27;, &#x27;NA18566&#x27;, &#x27;NA18567&#x27;, &#x27;NA18570&#x27;, &#x27;NA18571&#x27;, &#x27;NA18572&#x27;,
+           &#x27;NA18573&#x27;, &#x27;NA18574&#x27;, &#x27;NA18576&#x27;, &#x27;NA18577&#x27;, &#x27;NA18579&#x27;, &#x27;NA18582&#x27;,
+           &#x27;NA18592&#x27;, &#x27;NA18593&#x27;, &#x27;NA18603&#x27;, &#x27;NA18605&#x27;, &#x27;NA18608&#x27;, &#x27;NA18609&#x27;,
+           &#x27;NA18611&#x27;, &#x27;NA18612&#x27;, &#x27;NA18614&#x27;, &#x27;NA18615&#x27;, &#x27;NA18616&#x27;, &#x27;NA18617&#x27;,
+           &#x27;NA18618&#x27;, &#x27;NA18619&#x27;, &#x27;NA18620&#x27;, &#x27;NA18621&#x27;, &#x27;NA18622&#x27;, &#x27;NA18623&#x27;,
+           &#x27;NA18624&#x27;, &#x27;NA18625&#x27;, &#x27;NA18626&#x27;, &#x27;NA18627&#x27;, &#x27;NA18628&#x27;, &#x27;NA18630&#x27;,
+           &#x27;NA18631&#x27;, &#x27;NA18632&#x27;, &#x27;NA18633&#x27;, &#x27;NA18634&#x27;, &#x27;NA18636&#x27;, &#x27;NA18638&#x27;,
+           &#x27;NA18640&#x27;, &#x27;NA18642&#x27;, &#x27;NA18643&#x27;, &#x27;NA18745&#x27;, &#x27;NA18853&#x27;, &#x27;NA18856&#x27;,
+           &#x27;NA18858&#x27;, &#x27;NA18861&#x27;, &#x27;NA18867&#x27;, &#x27;NA18868&#x27;, &#x27;NA18870&#x27;, &#x27;NA18871&#x27;,
+           &#x27;NA18873&#x27;, &#x27;NA18874&#x27;, &#x27;NA18907&#x27;, &#x27;NA18908&#x27;, &#x27;NA18909&#x27;, &#x27;NA18910&#x27;,
+           &#x27;NA18912&#x27;, &#x27;NA18916&#x27;, &#x27;NA18940&#x27;, &#x27;NA18941&#x27;, &#x27;NA18942&#x27;, &#x27;NA18943&#x27;,
+           &#x27;NA18944&#x27;, &#x27;NA18945&#x27;, &#x27;NA18947&#x27;, &#x27;NA18948&#x27;, &#x27;NA18949&#x27;, &#x27;NA18950&#x27;,
+           &#x27;NA18951&#x27;, &#x27;NA18952&#x27;, &#x27;NA18953&#x27;, &#x27;NA18955&#x27;, &#x27;NA18956&#x27;, &#x27;NA18959&#x27;,
+           &#x27;NA18960&#x27;, &#x27;NA18961&#x27;, &#x27;NA18963&#x27;, &#x27;NA18964&#x27;, &#x27;NA18965&#x27;, &#x27;NA18967&#x27;,
+           &#x27;NA18968&#x27;, &#x27;NA18970&#x27;, &#x27;NA18971&#x27;, &#x27;NA18972&#x27;, &#x27;NA18973&#x27;, &#x27;NA18974&#x27;,
+           &#x27;NA18975&#x27;, &#x27;NA18976&#x27;, &#x27;NA18977&#x27;, &#x27;NA18979&#x27;, &#x27;NA18980&#x27;, &#x27;NA18981&#x27;,
+           &#x27;NA18982&#x27;, &#x27;NA18983&#x27;, &#x27;NA18984&#x27;, &#x27;NA18985&#x27;, &#x27;NA18986&#x27;, &#x27;NA18987&#x27;,
+           &#x27;NA18988&#x27;, &#x27;NA18989&#x27;, &#x27;NA18990&#x27;, &#x27;NA18997&#x27;, &#x27;NA18999&#x27;, &#x27;NA19000&#x27;,
+           &#x27;NA19001&#x27;, &#x27;NA19002&#x27;, &#x27;NA19003&#x27;, &#x27;NA19004&#x27;, &#x27;NA19005&#x27;, &#x27;NA19007&#x27;,
+           &#x27;NA19009&#x27;, &#x27;NA19010&#x27;, &#x27;NA19012&#x27;, &#x27;NA19027&#x27;, &#x27;NA19044&#x27;, &#x27;NA19054&#x27;,
+           &#x27;NA19055&#x27;, &#x27;NA19056&#x27;, &#x27;NA19057&#x27;, &#x27;NA19058&#x27;, &#x27;NA19059&#x27;, &#x27;NA19060&#x27;,
+           &#x27;NA19062&#x27;, &#x27;NA19063&#x27;, &#x27;NA19064&#x27;, &#x27;NA19065&#x27;, &#x27;NA19066&#x27;, &#x27;NA19067&#x27;,
+           &#x27;NA19068&#x27;, &#x27;NA19070&#x27;, &#x27;NA19072&#x27;, &#x27;NA19074&#x27;, &#x27;NA19075&#x27;, &#x27;NA19076&#x27;,
+           &#x27;NA19077&#x27;, &#x27;NA19078&#x27;, &#x27;NA19079&#x27;, &#x27;NA19082&#x27;, &#x27;NA19083&#x27;, &#x27;NA19084&#x27;,
+           &#x27;NA19085&#x27;, &#x27;NA19086&#x27;, &#x27;NA19087&#x27;, &#x27;NA19088&#x27;, &#x27;NA19093&#x27;, &#x27;NA19098&#x27;,
+           &#x27;NA19099&#x27;, &#x27;NA19102&#x27;, &#x27;NA19107&#x27;, &#x27;NA19108&#x27;, &#x27;NA19113&#x27;, &#x27;NA19114&#x27;,
+           &#x27;NA19116&#x27;, &#x27;NA19119&#x27;, &#x27;NA19129&#x27;, &#x27;NA19130&#x27;, &#x27;NA19131&#x27;, &#x27;NA19137&#x27;,
+           &#x27;NA19138&#x27;, &#x27;NA19141&#x27;, &#x27;NA19143&#x27;, &#x27;NA19144&#x27;, &#x27;NA19147&#x27;, &#x27;NA19152&#x27;,
+           &#x27;NA19153&#x27;, &#x27;NA19159&#x27;, &#x27;NA19160&#x27;, &#x27;NA19171&#x27;, &#x27;NA19172&#x27;, &#x27;NA19184&#x27;,
+           &#x27;NA19189&#x27;, &#x27;NA19190&#x27;, &#x27;NA19200&#x27;, &#x27;NA19201&#x27;, &#x27;NA19204&#x27;, &#x27;NA19206&#x27;,
+           &#x27;NA19207&#x27;, &#x27;NA19209&#x27;, &#x27;NA19210&#x27;, &#x27;NA19213&#x27;, &#x27;NA19225&#x27;, &#x27;NA19235&#x27;,
+           &#x27;NA19236&#x27;, &#x27;NA19247&#x27;, &#x27;NA19248&#x27;, &#x27;NA19256&#x27;, &#x27;NA19257&#x27;, &#x27;NA19311&#x27;,
+           &#x27;NA19312&#x27;, &#x27;NA19313&#x27;, &#x27;NA19314&#x27;, &#x27;NA19332&#x27;, &#x27;NA19334&#x27;, &#x27;NA19338&#x27;,
+           &#x27;NA19346&#x27;, &#x27;NA19347&#x27;, &#x27;NA19350&#x27;, &#x27;NA19355&#x27;, &#x27;NA19359&#x27;, &#x27;NA19360&#x27;,
+           &#x27;NA19371&#x27;, &#x27;NA19372&#x27;, &#x27;NA19375&#x27;, &#x27;NA19376&#x27;, &#x27;NA19377&#x27;, &#x27;NA19379&#x27;,
+           &#x27;NA19381&#x27;, &#x27;NA19382&#x27;, &#x27;NA19383&#x27;, &#x27;NA19384&#x27;, &#x27;NA19385&#x27;, &#x27;NA19390&#x27;,
+           &#x27;NA19391&#x27;, &#x27;NA19393&#x27;, &#x27;NA19394&#x27;, &#x27;NA19395&#x27;, &#x27;NA19397&#x27;, &#x27;NA19398&#x27;,
+           &#x27;NA19399&#x27;, &#x27;NA19401&#x27;, &#x27;NA19404&#x27;, &#x27;NA19428&#x27;, &#x27;NA19429&#x27;, &#x27;NA19434&#x27;,
+           &#x27;NA19435&#x27;, &#x27;NA19436&#x27;, &#x27;NA19437&#x27;, &#x27;NA19438&#x27;, &#x27;NA19439&#x27;, &#x27;NA19440&#x27;,
+           &#x27;NA19443&#x27;, &#x27;NA19444&#x27;, &#x27;NA19445&#x27;, &#x27;NA19446&#x27;, &#x27;NA19448&#x27;, &#x27;NA19449&#x27;,
+           &#x27;NA19451&#x27;, &#x27;NA19452&#x27;, &#x27;NA19453&#x27;, &#x27;NA19455&#x27;, &#x27;NA19456&#x27;, &#x27;NA19457&#x27;,
+           &#x27;NA19461&#x27;, &#x27;NA19462&#x27;, &#x27;NA19463&#x27;, &#x27;NA19466&#x27;, &#x27;NA19467&#x27;, &#x27;NA19469&#x27;,
+           &#x27;NA19471&#x27;, &#x27;NA19472&#x27;, &#x27;NA19473&#x27;, &#x27;NA19474&#x27;, &#x27;NA19625&#x27;, &#x27;NA19648&#x27;,
+           &#x27;NA19649&#x27;, &#x27;NA19651&#x27;, &#x27;NA19652&#x27;, &#x27;NA19654&#x27;, &#x27;NA19655&#x27;, &#x27;NA19658&#x27;,
+           &#x27;NA19660&#x27;, &#x27;NA19661&#x27;, &#x27;NA19678&#x27;, &#x27;NA19684&#x27;, &#x27;NA19685&#x27;, &#x27;NA19700&#x27;,
+           &#x27;NA19701&#x27;, &#x27;NA19703&#x27;, &#x27;NA19704&#x27;, &#x27;NA19707&#x27;, &#x27;NA19712&#x27;, &#x27;NA19713&#x27;,
+           &#x27;NA19720&#x27;, &#x27;NA19722&#x27;, &#x27;NA19723&#x27;, &#x27;NA19725&#x27;, &#x27;NA19726&#x27;, &#x27;NA19818&#x27;,
+           &#x27;NA19819&#x27;, &#x27;NA19834&#x27;, &#x27;NA19835&#x27;, &#x27;NA19900&#x27;, &#x27;NA19901&#x27;, &#x27;NA19904&#x27;,
+           &#x27;NA19908&#x27;, &#x27;NA19909&#x27;, &#x27;NA19914&#x27;, &#x27;NA19916&#x27;, &#x27;NA19917&#x27;, &#x27;NA19920&#x27;,
+           &#x27;NA19921&#x27;, &#x27;NA19982&#x27;, &#x27;NA20414&#x27;, &#x27;NA20502&#x27;, &#x27;NA20505&#x27;, &#x27;NA20508&#x27;,
+           &#x27;NA20509&#x27;, &#x27;NA20510&#x27;, &#x27;NA20512&#x27;, &#x27;NA20515&#x27;, &#x27;NA20516&#x27;, &#x27;NA20517&#x27;,
+           &#x27;NA20518&#x27;, &#x27;NA20519&#x27;, &#x27;NA20520&#x27;, &#x27;NA20521&#x27;, &#x27;NA20522&#x27;, &#x27;NA20524&#x27;,
+           &#x27;NA20525&#x27;, &#x27;NA20526&#x27;, &#x27;NA20527&#x27;, &#x27;NA20528&#x27;, &#x27;NA20529&#x27;, &#x27;NA20530&#x27;,
+           &#x27;NA20531&#x27;, &#x27;NA20532&#x27;, &#x27;NA20533&#x27;, &#x27;NA20534&#x27;, &#x27;NA20535&#x27;, &#x27;NA20536&#x27;,
+           &#x27;NA20537&#x27;, &#x27;NA20538&#x27;, &#x27;NA20539&#x27;, &#x27;NA20540&#x27;, &#x27;NA20541&#x27;, &#x27;NA20542&#x27;,
+           &#x27;NA20543&#x27;, &#x27;NA20544&#x27;, &#x27;NA20581&#x27;, &#x27;NA20582&#x27;, &#x27;NA20585&#x27;, &#x27;NA20586&#x27;,
+           &#x27;NA20588&#x27;, &#x27;NA20589&#x27;, &#x27;NA20752&#x27;, &#x27;NA20753&#x27;, &#x27;NA20754&#x27;, &#x27;NA20755&#x27;,
+           &#x27;NA20756&#x27;, &#x27;NA20757&#x27;, &#x27;NA20758&#x27;, &#x27;NA20759&#x27;, &#x27;NA20760&#x27;, &#x27;NA20761&#x27;,
+           &#x27;NA20765&#x27;, &#x27;NA20769&#x27;, &#x27;NA20770&#x27;, &#x27;NA20771&#x27;, &#x27;NA20772&#x27;, &#x27;NA20773&#x27;,
+           &#x27;NA20774&#x27;, &#x27;NA20775&#x27;, &#x27;NA20778&#x27;, &#x27;NA20783&#x27;, &#x27;NA20785&#x27;, &#x27;NA20786&#x27;,
+           &#x27;NA20787&#x27;, &#x27;NA20790&#x27;, &#x27;NA20792&#x27;, &#x27;NA20795&#x27;, &#x27;NA20796&#x27;, &#x27;NA20797&#x27;,
+           &#x27;NA20798&#x27;, &#x27;NA20799&#x27;, &#x27;NA20800&#x27;, &#x27;NA20801&#x27;, &#x27;NA20802&#x27;, &#x27;NA20803&#x27;,
+           &#x27;NA20804&#x27;, &#x27;NA20805&#x27;, &#x27;NA20806&#x27;, &#x27;NA20807&#x27;, &#x27;NA20808&#x27;, &#x27;NA20809&#x27;,
+           &#x27;NA20810&#x27;, &#x27;NA20811&#x27;, &#x27;NA20812&#x27;, &#x27;NA20813&#x27;, &#x27;NA20814&#x27;, &#x27;NA20815&#x27;,
+           &#x27;NA20816&#x27;, &#x27;NA20818&#x27;, &#x27;NA20819&#x27;, &#x27;NA20826&#x27;, &#x27;NA20828&#x27;], dtype=&#x27;&lt;U7&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0 0 0 0 1 1 0 1 ... 0 0 0 0 0 0 0 0</div><input id='attrs-c81f544a-e564-420e-aa45-03c0c3fcb884' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-c81f544a-e564-420e-aa45-03c0c3fcb884' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-035d86c6-70eb-4916-baeb-6ec8b2084b8f' class='xr-var-data-in' type='checkbox'><label for='data-035d86c6-70eb-4916-baeb-6ec8b2084b8f' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[ 0,  0],
+            [ 0,  0],
+            [ 1,  1],
+            ...,
+            [ 0,  1],
+            [ 1,  1],
+            [ 1,  0]],
+    
+           [[-1, -1],
+            [-1, -1],
+            [-1, -1],
+            ...,
+            [-1, -1],
+            [-1, -1],
+            [-1, -1]],
+    
+           [[ 0,  0],
+            [ 0,  0],
+            [ 0,  0],
+            ...,
+            [ 0,  0],
+            [ 0,  0],
+            [ 0,  0]]], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype_mask</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>bool</div><div class='xr-var-preview xr-preview'>False False False ... False False</div><input id='attrs-4f538aab-26a1-4465-ad21-f5b6fbaf7997' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-4f538aab-26a1-4465-ad21-f5b6fbaf7997' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-7c65ab14-edd8-4a1a-8d79-cfeb8990238b' class='xr-var-data-in' type='checkbox'><label for='data-7c65ab14-edd8-4a1a-8d79-cfeb8990238b' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]],
+    
+           [[ True,  True],
+            [ True,  True],
+            [ True,  True],
+            ...,
+            [ True,  True],
+            [ True,  True],
+            [ True,  True]],
+    
+           [[False, False],
+            [False, False],
+            [False, False],
+            ...,
+            [False, False],
+            [False, False],
+            [False, False]]])</pre></li></ul></div></li><li class='xr-section-item'><input id='section-c201c05a-80ed-426e-b1ac-36c930a981f6' class='xr-section-summary-in' type='checkbox'  checked><label for='section-c201c05a-80ed-426e-b1ac-36c930a981f6' class='xr-section-summary' >Attributes: <span>(1)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><dl class='xr-attrs'><dt><span>contigs :</span></dt><dd>[b&#x27;2&#x27;]</dd></dl></div></li></ul></div></div>
+
+
+
+Done!
+-----
+
+Now we have our Xarray dataset that we can use with the rest of Sgkit!
diff --git a/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-Minimal-Numpy-Example.rst b/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-Minimal-Numpy-Example.rst
new file mode 100644
index 000000000..fbccc3ae5
--- /dev/null
+++ b/docs/examples/understanding-genotype-call-xarray-dataset/Genotype-Call-Dataset-Minimal-Numpy-Example.rst
@@ -0,0 +1,460 @@
+Minimal Numpy Example
+=====================
+
+A central point to the SGkit API is the Genotype Call Dataset. This is
+the data structure that most of the other functions use. It uses
+`Xarray <http://xarray.pydata.org/en/stable/>`__ underneath the hood to
+give a programmatic interface that allows for the backend to be several
+different data files.
+
+The Xarray itself is *sort of* a transposed VCF file.
+
+For this particular example we are going to use a minimal set of numpy
+arrays in order to create a small Genotype Call Dataset.
+
+This is only meant to demonstrate the datatypes that we feed into the
+Xarray dataset. For a more conceptual understanding please check out the
+``Genotype-Call-Dataset-From-VCF.ipynb``.
+
+.. code:: ipython3
+
+    import numpy as np
+    import zarr
+    import pandas as pd
+    import dask.array as da
+    import allel
+    from pprint import pprint
+    import matplotlib.pyplot as plt
+    %matplotlib inline
+
+Prep Work - Install Packages
+----------------------------
+
+SGKit is still under rapid development, so I’m installing based on a
+commit.
+
+.. code:: ipython3
+
+    #! pip install git+https://github.com/pystatgen/sgkit@96203d471531e7e2416d4dd9b48ca11d660a1bcc
+
+Numpy Representations of the Variant Data
+-----------------------------------------
+
+We need to prepare for our XArray dataset by converting these to Numpy
+arrays.
+
+If you’re wondering how I know what these are you can check out the
+``sgkit.api.create_genotype_call_dataset``. The exact functions are
+``check_array_like`` and make sure that these are numpy arrays of a
+particular type.
+
+::
+
+   check_array_like(variant_contig, kind="i", ndim=1)
+   check_array_like(variant_position, kind="i", ndim=1)
+   check_array_like(variant_alleles, kind="S", ndim=2)
+   check_array_like(sample_id, kind="U", ndim=1)
+   check_array_like(call_genotype, kind="i", ndim=3)
+
+.. code:: ipython3
+
+    variant_contig_names = ['3R']
+    # the variant contig is the index of the chr in the variant_contig_names
+    # because we always prefer numbers over strings!
+    variant_contig = np.array([0], dtype='i')
+    variant_position = np.array([1], dtype='i')
+    variant_alleles = np.array([['A', 'T']], dtype='S')
+    
+    sample_id = np.array(['sample-1'], dtype='U')
+    call_genotype_phased = None
+    variant_id = None
+
+.. code:: ipython3
+
+    # The genotype is 
+    #         "call/genotype": ([DIM_VARIANT, DIM_SAMPLE, DIM_PLOIDY], call_genotype),
+    # and needs to be type 'i'
+    # You can also look at the GenotypeChunkedArray
+    call_genotype = np.array([[[0, 0]]], dtype='i')
+    call_genotype.shape
+
+
+
+
+.. parsed-literal::
+
+    (1, 1, 2)
+
+
+
+This is correct! We have 1 variant, 1 sample, 1 biallelic call.
+
+Convert to Genotype Call Dataset
+--------------------------------
+
+Finally! Let’s convert this to the Genotype Call Dataset!
+
+.. code:: ipython3
+
+    import sgkit
+    
+    genotype_xarray_dataset = sgkit.api.create_genotype_call_dataset(
+        variant_contig_names = variant_contig_names,
+        variant_contig = variant_contig,
+        variant_position = variant_position,
+        variant_alleles = variant_alleles,
+        sample_id = sample_id,
+        call_genotype = call_genotype,
+    )
+
+.. code:: ipython3
+
+    genotype_xarray_dataset
+
+
+
+
+.. raw:: html
+
+    <div><svg style="position: absolute; width: 0; height: 0; overflow: hidden">
+    <defs>
+    <symbol id="icon-database" viewBox="0 0 32 32">
+    <title>Show/Hide data repr</title>
+    <path d="M16 0c-8.837 0-16 2.239-16 5v4c0 2.761 7.163 5 16 5s16-2.239 16-5v-4c0-2.761-7.163-5-16-5z"></path>
+    <path d="M16 17c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z"></path>
+    <path d="M16 26c-8.837 0-16-2.239-16-5v6c0 2.761 7.163 5 16 5s16-2.239 16-5v-6c0 2.761-7.163 5-16 5z"></path>
+    </symbol>
+    <symbol id="icon-file-text2" viewBox="0 0 32 32">
+    <title>Show/Hide attributes</title>
+    <path d="M28.681 7.159c-0.694-0.947-1.662-2.053-2.724-3.116s-2.169-2.030-3.116-2.724c-1.612-1.182-2.393-1.319-2.841-1.319h-15.5c-1.378 0-2.5 1.121-2.5 2.5v27c0 1.378 1.122 2.5 2.5 2.5h23c1.378 0 2.5-1.122 2.5-2.5v-19.5c0-0.448-0.137-1.23-1.319-2.841zM24.543 5.457c0.959 0.959 1.712 1.825 2.268 2.543h-4.811v-4.811c0.718 0.556 1.584 1.309 2.543 2.268zM28 29.5c0 0.271-0.229 0.5-0.5 0.5h-23c-0.271 0-0.5-0.229-0.5-0.5v-27c0-0.271 0.229-0.5 0.5-0.5 0 0 15.499-0 15.5 0v7c0 0.552 0.448 1 1 1h7v19.5z"></path>
+    <path d="M23 26h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    <path d="M23 22h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    <path d="M23 18h-14c-0.552 0-1-0.448-1-1s0.448-1 1-1h14c0.552 0 1 0.448 1 1s-0.448 1-1 1z"></path>
+    </symbol>
+    </defs>
+    </svg>
+    <style>/* CSS stylesheet for displaying xarray objects in jupyterlab.
+     *
+     */
+    
+    :root {
+      --xr-font-color0: var(--jp-content-font-color0, rgba(0, 0, 0, 1));
+      --xr-font-color2: var(--jp-content-font-color2, rgba(0, 0, 0, 0.54));
+      --xr-font-color3: var(--jp-content-font-color3, rgba(0, 0, 0, 0.38));
+      --xr-border-color: var(--jp-border-color2, #e0e0e0);
+      --xr-disabled-color: var(--jp-layout-color3, #bdbdbd);
+      --xr-background-color: var(--jp-layout-color0, white);
+      --xr-background-color-row-even: var(--jp-layout-color1, white);
+      --xr-background-color-row-odd: var(--jp-layout-color2, #eeeeee);
+    }
+    
+    .xr-wrap {
+      min-width: 300px;
+      max-width: 700px;
+    }
+    
+    .xr-header {
+      padding-top: 6px;
+      padding-bottom: 6px;
+      margin-bottom: 4px;
+      border-bottom: solid 1px var(--xr-border-color);
+    }
+    
+    .xr-header > div,
+    .xr-header > ul {
+      display: inline;
+      margin-top: 0;
+      margin-bottom: 0;
+    }
+    
+    .xr-obj-type,
+    .xr-array-name {
+      margin-left: 2px;
+      margin-right: 10px;
+    }
+    
+    .xr-obj-type {
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-sections {
+      padding-left: 0 !important;
+      display: grid;
+      grid-template-columns: 150px auto auto 1fr 20px 20px;
+    }
+    
+    .xr-section-item {
+      display: contents;
+    }
+    
+    .xr-section-item input {
+      display: none;
+    }
+    
+    .xr-section-item input + label {
+      color: var(--xr-disabled-color);
+    }
+    
+    .xr-section-item input:enabled + label {
+      cursor: pointer;
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-section-item input:enabled + label:hover {
+      color: var(--xr-font-color0);
+    }
+    
+    .xr-section-summary {
+      grid-column: 1;
+      color: var(--xr-font-color2);
+      font-weight: 500;
+    }
+    
+    .xr-section-summary > span {
+      display: inline-block;
+      padding-left: 0.5em;
+    }
+    
+    .xr-section-summary-in:disabled + label {
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-section-summary-in + label:before {
+      display: inline-block;
+      content: '►';
+      font-size: 11px;
+      width: 15px;
+      text-align: center;
+    }
+    
+    .xr-section-summary-in:disabled + label:before {
+      color: var(--xr-disabled-color);
+    }
+    
+    .xr-section-summary-in:checked + label:before {
+      content: '▼';
+    }
+    
+    .xr-section-summary-in:checked + label > span {
+      display: none;
+    }
+    
+    .xr-section-summary,
+    .xr-section-inline-details {
+      padding-top: 4px;
+      padding-bottom: 4px;
+    }
+    
+    .xr-section-inline-details {
+      grid-column: 2 / -1;
+    }
+    
+    .xr-section-details {
+      display: none;
+      grid-column: 1 / -1;
+      margin-bottom: 5px;
+    }
+    
+    .xr-section-summary-in:checked ~ .xr-section-details {
+      display: contents;
+    }
+    
+    .xr-array-wrap {
+      grid-column: 1 / -1;
+      display: grid;
+      grid-template-columns: 20px auto;
+    }
+    
+    .xr-array-wrap > label {
+      grid-column: 1;
+      vertical-align: top;
+    }
+    
+    .xr-preview {
+      color: var(--xr-font-color3);
+    }
+    
+    .xr-array-preview,
+    .xr-array-data {
+      padding: 0 5px !important;
+      grid-column: 2;
+    }
+    
+    .xr-array-data,
+    .xr-array-in:checked ~ .xr-array-preview {
+      display: none;
+    }
+    
+    .xr-array-in:checked ~ .xr-array-data,
+    .xr-array-preview {
+      display: inline-block;
+    }
+    
+    .xr-dim-list {
+      display: inline-block !important;
+      list-style: none;
+      padding: 0 !important;
+      margin: 0;
+    }
+    
+    .xr-dim-list li {
+      display: inline-block;
+      padding: 0;
+      margin: 0;
+    }
+    
+    .xr-dim-list:before {
+      content: '(';
+    }
+    
+    .xr-dim-list:after {
+      content: ')';
+    }
+    
+    .xr-dim-list li:not(:last-child):after {
+      content: ',';
+      padding-right: 5px;
+    }
+    
+    .xr-has-index {
+      font-weight: bold;
+    }
+    
+    .xr-var-list,
+    .xr-var-item {
+      display: contents;
+    }
+    
+    .xr-var-item > div,
+    .xr-var-item label,
+    .xr-var-item > .xr-var-name span {
+      background-color: var(--xr-background-color-row-even);
+      margin-bottom: 0;
+    }
+    
+    .xr-var-item > .xr-var-name:hover span {
+      padding-right: 5px;
+    }
+    
+    .xr-var-list > li:nth-child(odd) > div,
+    .xr-var-list > li:nth-child(odd) > label,
+    .xr-var-list > li:nth-child(odd) > .xr-var-name span {
+      background-color: var(--xr-background-color-row-odd);
+    }
+    
+    .xr-var-name {
+      grid-column: 1;
+    }
+    
+    .xr-var-dims {
+      grid-column: 2;
+    }
+    
+    .xr-var-dtype {
+      grid-column: 3;
+      text-align: right;
+      color: var(--xr-font-color2);
+    }
+    
+    .xr-var-preview {
+      grid-column: 4;
+    }
+    
+    .xr-var-name,
+    .xr-var-dims,
+    .xr-var-dtype,
+    .xr-preview,
+    .xr-attrs dt {
+      white-space: nowrap;
+      overflow: hidden;
+      text-overflow: ellipsis;
+      padding-right: 10px;
+    }
+    
+    .xr-var-name:hover,
+    .xr-var-dims:hover,
+    .xr-var-dtype:hover,
+    .xr-attrs dt:hover {
+      overflow: visible;
+      width: auto;
+      z-index: 1;
+    }
+    
+    .xr-var-attrs,
+    .xr-var-data {
+      display: none;
+      background-color: var(--xr-background-color) !important;
+      padding-bottom: 5px !important;
+    }
+    
+    .xr-var-attrs-in:checked ~ .xr-var-attrs,
+    .xr-var-data-in:checked ~ .xr-var-data {
+      display: block;
+    }
+    
+    .xr-var-data > table {
+      float: right;
+    }
+    
+    .xr-var-name span,
+    .xr-var-data,
+    .xr-attrs {
+      padding-left: 25px !important;
+    }
+    
+    .xr-attrs,
+    .xr-var-attrs,
+    .xr-var-data {
+      grid-column: 1 / -1;
+    }
+    
+    dl.xr-attrs {
+      padding: 0;
+      margin: 0;
+      display: grid;
+      grid-template-columns: 125px auto;
+    }
+    
+    .xr-attrs dt, dd {
+      padding: 0;
+      margin: 0;
+      float: left;
+      padding-right: 10px;
+      width: auto;
+    }
+    
+    .xr-attrs dt {
+      font-weight: normal;
+      grid-column: 1;
+    }
+    
+    .xr-attrs dt:hover span {
+      display: inline-block;
+      background: var(--xr-background-color);
+      padding-right: 10px;
+    }
+    
+    .xr-attrs dd {
+      grid-column: 2;
+      white-space: pre-wrap;
+      word-break: break-all;
+    }
+    
+    .xr-icon-database,
+    .xr-icon-file-text2 {
+      display: inline-block;
+      vertical-align: middle;
+      width: 1em;
+      height: 1.5em !important;
+      stroke-width: 0;
+      stroke: currentColor;
+      fill: currentColor;
+    }
+    </style><div class='xr-wrap'><div class='xr-header'><div class='xr-obj-type'>xarray.Dataset</div></div><ul class='xr-sections'><li class='xr-section-item'><input id='section-b8323804-c4f7-4b65-a6ac-1289a3840a2a' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-b8323804-c4f7-4b65-a6ac-1289a3840a2a' class='xr-section-summary'  title='Expand/collapse section'>Dimensions:</label><div class='xr-section-inline-details'><ul class='xr-dim-list'><li><span>alleles</span>: 2</li><li><span>ploidy</span>: 2</li><li><span>samples</span>: 1</li><li><span>variants</span>: 1</li></ul></div><div class='xr-section-details'></div></li><li class='xr-section-item'><input id='section-b7290721-2b6d-4afe-b858-d99f72aa2e67' class='xr-section-summary-in' type='checkbox' disabled ><label for='section-b7290721-2b6d-4afe-b858-d99f72aa2e67' class='xr-section-summary'  title='Expand/collapse section'>Coordinates: <span>(0)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'></ul></div></li><li class='xr-section-item'><input id='section-0243c879-ecc0-4d9f-a3bc-8e1a6128e6ef' class='xr-section-summary-in' type='checkbox'  checked><label for='section-0243c879-ecc0-4d9f-a3bc-8e1a6128e6ef' class='xr-section-summary' >Data variables: <span>(6)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><ul class='xr-var-list'><li class='xr-var-item'><div class='xr-var-name'><span>variant/contig</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0</div><input id='attrs-83b83547-5616-4a87-8272-77dde5cd1cca' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-83b83547-5616-4a87-8272-77dde5cd1cca' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-217ca109-651a-47c0-ba1e-e7352bcfc259' class='xr-var-data-in' type='checkbox'><label for='data-217ca109-651a-47c0-ba1e-e7352bcfc259' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([0], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/position</span></div><div class='xr-var-dims'>(variants)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>1</div><input id='attrs-c5cf4ac8-8a10-4a1e-a510-3666350d0845' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-c5cf4ac8-8a10-4a1e-a510-3666350d0845' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-ba64cea5-7610-4bed-af9e-ccbeb399cae3' class='xr-var-data-in' type='checkbox'><label for='data-ba64cea5-7610-4bed-af9e-ccbeb399cae3' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([1], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>variant/alleles</span></div><div class='xr-var-dims'>(variants, alleles)</div><div class='xr-var-dtype'>|S1</div><div class='xr-var-preview xr-preview'>b&#x27;A&#x27; b&#x27;T&#x27;</div><input id='attrs-21aa5a67-377d-4d6c-b351-c9e0aec82140' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-21aa5a67-377d-4d6c-b351-c9e0aec82140' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-6f4e6459-601c-41f6-aa9a-b6fdc633b2f9' class='xr-var-data-in' type='checkbox'><label for='data-6f4e6459-601c-41f6-aa9a-b6fdc633b2f9' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[b&#x27;A&#x27;, b&#x27;T&#x27;]], dtype=&#x27;|S1&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>sample/id</span></div><div class='xr-var-dims'>(samples)</div><div class='xr-var-dtype'>&lt;U8</div><div class='xr-var-preview xr-preview'>&#x27;sample-1&#x27;</div><input id='attrs-0e0b1e0b-db93-4435-bcf8-62a5cba7e309' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-0e0b1e0b-db93-4435-bcf8-62a5cba7e309' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-d0a71236-17f1-4b2d-8088-637dfeea1e79' class='xr-var-data-in' type='checkbox'><label for='data-d0a71236-17f1-4b2d-8088-637dfeea1e79' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([&#x27;sample-1&#x27;], dtype=&#x27;&lt;U8&#x27;)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>int32</div><div class='xr-var-preview xr-preview'>0 0</div><input id='attrs-e0043ffc-9fe3-4f1e-9c43-79d066ffb555' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-e0043ffc-9fe3-4f1e-9c43-79d066ffb555' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-fbebeafa-8d8c-485f-b95b-1cf9242db2c5' class='xr-var-data-in' type='checkbox'><label for='data-fbebeafa-8d8c-485f-b95b-1cf9242db2c5' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[0, 0]]], dtype=int32)</pre></li><li class='xr-var-item'><div class='xr-var-name'><span>call/genotype_mask</span></div><div class='xr-var-dims'>(variants, samples, ploidy)</div><div class='xr-var-dtype'>bool</div><div class='xr-var-preview xr-preview'>False False</div><input id='attrs-9f562165-a4b3-435c-bbb2-e70f18c3a65f' class='xr-var-attrs-in' type='checkbox' disabled><label for='attrs-9f562165-a4b3-435c-bbb2-e70f18c3a65f' title='Show/Hide attributes'><svg class='icon xr-icon-file-text2'><use xlink:href='#icon-file-text2'></use></svg></label><input id='data-59b01160-9eb5-4d9c-a30f-6ba7e1e3d5d6' class='xr-var-data-in' type='checkbox'><label for='data-59b01160-9eb5-4d9c-a30f-6ba7e1e3d5d6' title='Show/Hide data repr'><svg class='icon xr-icon-database'><use xlink:href='#icon-database'></use></svg></label><div class='xr-var-attrs'><dl class='xr-attrs'></dl></div><pre class='xr-var-data'>array([[[False, False]]])</pre></li></ul></div></li><li class='xr-section-item'><input id='section-05acbe9a-d603-47fc-9ddf-f2eb952c5f30' class='xr-section-summary-in' type='checkbox'  checked><label for='section-05acbe9a-d603-47fc-9ddf-f2eb952c5f30' class='xr-section-summary' >Attributes: <span>(1)</span></label><div class='xr-section-inline-details'></div><div class='xr-section-details'><dl class='xr-attrs'><dt><span>contigs :</span></dt><dd>[&#x27;3R&#x27;]</dd></dl></div></li></ul></div></div>
+
+
+
+Done!
+-----
+
+Now we have our Xarray dataset that we can use with the rest of Sgkit!
diff --git a/docs/index.rst b/docs/index.rst
index 865ce268a..95e0cf26e 100644
--- a/docs/index.rst
+++ b/docs/index.rst
@@ -3,9 +3,11 @@ sgkit: Statistical genetics toolkit in Python
 
 .. toctree::
    :maxdepth: 2
-   :caption: Contents:
+   :caption: Contents
 
    api
+   examples
+
 
 
 Indices and tables
diff --git a/requirements-dev.txt b/requirements-dev.txt
index 4a4131125..b5c807dca 100644
--- a/requirements-dev.txt
+++ b/requirements-dev.txt
@@ -5,4 +5,7 @@ pytest-cov
 hypothesis
 sphinx
 sphinx_rtd_theme
+ipython
+nbsphinx
 statsmodels
+