From 5781cd15bb4f87c79954df405df2852dcaabeb46 Mon Sep 17 00:00:00 2001
From: Dianeod <40861871+Dianeod@users.noreply.github.com>
Date: Wed, 10 Mar 2021 13:51:55 +0000
Subject: [PATCH] Refactor of ML Toolkit (#87)

* update to xval fitscore to support XGBoost models
* xval refactor
* xval refactor
* update to timeseries; Added save/load functionality
* review of xval
* review of graph
* updated param to camelcase
* review of graphing structure
* added utils function
* made naming verbose. Cleaned up code formatting for if statements
* update README
* add @private for util funcs
* new fresh format
* new fresh format
* fixes to hyperparam json file for single values
* utils refactor
* feature function refactor
* final refactoring
* formatting
* formatting review
* timeseries windows tests
* updates to fresh tests
* python utilities
* refactor of util folder
* refactor review
* refactor of optimize section
* Addition of test script for line length exceeding 80 characters, updates in line with this and minor changes to aspects of the optimization code
* Added deprecation warning. Updated namespace for xval
* review of clust code
* updated cutDict comment
* upd cutK
* Minor changes to deprecation functionality
* updated functionMapping for clustering
* fixed sigfeat tests
* updates resulting from review of clustering refactor/update
* fix to scoring tests
* Fix bugs
* review of clust update
* Fixed hierarchical comments
* removed redundant min function
* added WLS, OLS functionality, updated describe function. Updated failing tests for fresh
* updated tests and describe function
* addition of stats folder
* update order of inputs
* updated format of fit/predict inputs
* added WLS fit function. Fixed inputs to OLS fit order
* fixed travis issue for mac
* resolved comments
* changed all fit/predict functions to the same format. Updated timeseries tests
* fixed indentation
* added stats tests to bash script
* added time series tests for windows
* resolve latest comments
* update function mapping and fixed comments
* addition of README
* updated describe function, fixed errors in timeseries and graphing
* fixed filelength
* ml utilities style review
* fixed line lengths
* fixed crossEntropy
* fixed .ml.i.findKey
* added changes from comments
* review of stats
* reviewed fresh and fixed stats test
* updating clustering and replying to comments on stats and fresh
* try to fix 'branch outside repository error'
* new commit on new branch
* changes to clust/utils.q after comments
* committing with kx email address
* committing with kx email address
* review optimize library
* fixed desc from fileoverview
* changes for comments
* fixed @type comments
* review remainder of ml libraries
* changes following comments
* fileoverview changed in pipeline file
* changed init file
* fixed .ml.i.ap; added .ml.infReplace for all types
* added test for keyed table infReplace
* change predict -> transform
* fixed init file.
Clash with AutoML if not in ml namespace * resolved comments Co-authored-by: Deanna Morgan Co-authored-by: dmorgankx <44678213+dmorgankx@users.noreply.github.com> Co-authored-by: Conor McCarthy Co-authored-by: Conor McCarthy Co-authored-by: unknown Co-authored-by: Andrew Morrison Co-authored-by: Andrew Morrison --- .travis.yml | 3 +- build/test.bat | 2 +- clust/README.md | 2 +- clust/aprop.q | 230 +--- clust/dbscan.q | 205 ++-- clust/hierarchical.q | 653 +++------- clust/init.q | 10 +- clust/kdtree.q | 178 ++- clust/kmeans.q | 258 ++-- clust/score.q | 146 +-- clust/tests/clt.t | 143 ++- clust/tests/score.t | 30 +- clust/tests/util.t | 52 +- clust/util.q | 106 -- clust/utils.q | 1327 +++++++++++++++++++++ fresh/README.md | 11 +- fresh/extract.q | 173 +-- fresh/feat.q | 643 ++++++++++ fresh/hyperparam.txt | 21 - fresh/hyperparameters.json | 140 +++ fresh/init.q | 13 +- fresh/select.q | 87 +- fresh/tests/features.t | 966 +++++++-------- fresh/tests/sigtests.t | 8 +- fresh/tests/test.p | 1 - fresh/utils.q | 202 ++++ graph/README.md | 2 +- graph/graph.q | 187 ++- graph/init.q | 10 + graph/modules/loading.q | 128 +- graph/modules/saving.q | 121 +- graph/pipeline.q | 89 +- graph/tests/graph.t | 13 +- graph/utils.q | 153 +++ init.q | 23 +- ml.q | 20 + optimize/README.md | 35 + optimize/init.q | 16 +- optimize/optim.q | 659 ---------- optimize/optimize.q | 77 ++ optimize/tests/test.t | 6 +- optimize/utils.q | 679 +++++++++++ stats/README.md | 34 + stats/describe.json | 74 ++ stats/init.q | 5 + stats/stats.q | 159 +++ stats/tests/stats.t | 56 + stats/utils.q | 183 +++ tests/filelength.t | 36 + tests/testFiles.bat | 1 + timeseries/README.md | 2 +- timeseries/fit.q | 231 ++-- timeseries/init.q | 22 +- timeseries/misc.q | 125 +- timeseries/predict.q | 154 +-- timeseries/tests/data/linux/fit/AR1 | Bin 109 -> 595 bytes timeseries/tests/data/linux/fit/AR2 | Bin 973 -> 2323 bytes timeseries/tests/data/linux/fit/AR3 | Bin 949 -> 2275 bytes timeseries/tests/data/linux/fit/AR4 | Bin 981 -> 2339 bytes timeseries/tests/data/linux/fit/ARCH1 | Bin 152 -> 559 bytes timeseries/tests/data/linux/fit/ARCH2 | Bin 104 -> 463 bytes timeseries/tests/data/linux/fit/ARIMA1 | Bin 327 -> 1227 bytes timeseries/tests/data/linux/fit/ARIMA2 | Bin 1479 -> 3531 bytes timeseries/tests/data/linux/fit/ARIMA3 | Bin 1535 -> 3643 bytes timeseries/tests/data/linux/fit/ARIMA4 | Bin 1519 -> 3611 bytes timeseries/tests/data/linux/fit/ARMA1 | Bin 299 -> 912 bytes timeseries/tests/data/linux/fit/ARMA2 | Bin 1483 -> 3280 bytes timeseries/tests/data/linux/fit/ARMA3 | Bin 1451 -> 3216 bytes timeseries/tests/data/linux/fit/ARMA4 | Bin 1547 -> 3408 bytes timeseries/tests/data/linux/fit/SARIMA1 | Bin 591 -> 2115 bytes timeseries/tests/data/linux/fit/SARIMA2 | Bin 1879 -> 4691 bytes timeseries/tests/data/linux/fit/SARIMA3 | Bin 2607 -> 6147 bytes timeseries/tests/data/linux/fit/SARIMA4 | Bin 1967 -> 4867 bytes timeseries/tests/data/misc/aicScore1 | Bin 68 -> 71 bytes timeseries/tests/data/misc/aicScore2 | Bin 68 -> 71 bytes timeseries/tests/data/misc/aicScore3 | Bin 68 -> 71 bytes timeseries/tests/data/misc/aicScore4 | Bin 68 -> 71 bytes timeseries/tests/data/windows/fit/AR1 | Bin 109 -> 595 bytes timeseries/tests/data/windows/fit/AR2 | Bin 973 -> 2323 bytes timeseries/tests/data/windows/fit/AR3 | Bin 949 -> 2275 bytes timeseries/tests/data/windows/fit/AR4 | Bin 981 -> 2339 bytes timeseries/tests/data/windows/fit/ARCH1 | Bin 152 -> 559 bytes timeseries/tests/data/windows/fit/ARCH2 | Bin 104 -> 463 bytes timeseries/tests/data/windows/fit/ARIMA1 | Bin 
327 -> 1227 bytes timeseries/tests/data/windows/fit/ARIMA2 | Bin 1479 -> 3531 bytes timeseries/tests/data/windows/fit/ARIMA3 | Bin 1535 -> 3643 bytes timeseries/tests/data/windows/fit/ARIMA4 | Bin 1519 -> 3611 bytes timeseries/tests/data/windows/fit/ARMA1 | Bin 299 -> 912 bytes timeseries/tests/data/windows/fit/ARMA2 | Bin 1483 -> 3280 bytes timeseries/tests/data/windows/fit/ARMA3 | Bin 1451 -> 3216 bytes timeseries/tests/data/windows/fit/ARMA4 | Bin 1547 -> 3408 bytes timeseries/tests/data/windows/fit/SARIMA1 | Bin 591 -> 2115 bytes timeseries/tests/data/windows/fit/SARIMA2 | Bin 1879 -> 4691 bytes timeseries/tests/data/windows/fit/SARIMA3 | Bin 2607 -> 6147 bytes timeseries/tests/data/windows/fit/SARIMA4 | Bin 1967 -> 4867 bytes timeseries/tests/fit.t | 45 +- timeseries/tests/misc.t | 4 +- timeseries/tests/pred.t | 57 +- timeseries/utils.q | 1016 +++++++++------- util/README.md | 2 +- util/functionMapping.json | 310 +++++ util/init.q | 10 +- util/metrics.q | 402 ++++++- util/mproc.q | 53 +- util/mprocw.q | 12 +- util/pickle.q | 29 +- util/preproc.q | 449 +++++-- util/tests/metric.t | 153 ++- util/tests/preproctst.t | 221 ++-- util/tests/utiltst.t | 30 +- util/util.q | 72 -- util/utilities.q | 159 +++ util/utils.q | 605 ++++++++++ xval/README.md | 16 +- xval/init.q | 11 + xval/tests/xval.t | 239 ++-- xval/utils.q | 342 ++++++ xval/xval.q | 475 +++++++- 118 files changed, 9473 insertions(+), 4149 deletions(-) delete mode 100644 clust/util.q create mode 100644 clust/utils.q create mode 100644 fresh/feat.q delete mode 100644 fresh/hyperparam.txt create mode 100644 fresh/hyperparameters.json create mode 100644 fresh/utils.q create mode 100644 graph/utils.q create mode 100644 optimize/README.md delete mode 100644 optimize/optim.q create mode 100644 optimize/optimize.q create mode 100644 optimize/utils.q create mode 100644 stats/README.md create mode 100644 stats/describe.json create mode 100644 stats/init.q create mode 100644 stats/stats.q create mode 100644 stats/tests/stats.t create mode 100644 stats/utils.q create mode 100644 tests/filelength.t create mode 100644 tests/testFiles.bat create mode 100644 util/functionMapping.json delete mode 100644 util/util.q create mode 100644 util/utilities.q create mode 100644 util/utils.q create mode 100644 xval/utils.q diff --git a/.travis.yml b/.travis.yml index f252f85f..a52606e5 100644 --- a/.travis.yml +++ b/.travis.yml @@ -26,6 +26,7 @@ install: # grab latest embedpy - if [[ "x$QLIC_KC" != "x" ]]; then echo -n $QLIC_KC |base64 --decode > q/kc.lic; + pip install --upgrade pip; pip -q install -r requirements.txt; fi beforescript: @@ -40,7 +41,7 @@ script: - echo "Packaged as ml_$TRAVIS_OS_NAME-$TRAVIS_BRANCH.zip" - if [[ "x$QLIC_KC" != "x" ]]; then curl -fsSL -o test.q https://github.com/KxSystems/embedpy/raw/master/test.q; - q test.q fresh/tests/ util/tests/ xval/tests clust/tests/ graph/tests/ timeseries/tests/ optimize/tests/ -q; + bash tests/testFiles.bat; else echo No kdb+, no tests; diff --git a/build/test.bat b/build/test.bat index 661d593d..8a9d1615 100644 --- a/build/test.bat +++ b/build/test.bat @@ -2,5 +2,5 @@ if defined QLIC_KC ( pip -q install -r requirements.txt echo getting test.q from embedpy curl -fsSL -o test.q https://github.com/KxSystems/embedpy/raw/master/test.q - q test.q fresh/tests/ util/tests/ xval/tests/ clust/tests/ graph/tests/ timeseries/tests/ optimize/tests/ -q + call "tests\testFiles.bat" ) diff --git a/clust/README.md b/clust/README.md index db4e1a39..b1c2fd9e 100644 --- a/clust/README.md +++ b/clust/README.md @@ 
-43,6 +43,6 @@ Documentation is available on the [clustering](https://code.kx.com/v2/ml/toolkit ## Status -The clustering library is still in development and is available here as a beta release. Further functionality and improvements will be made to the library in the coming months. +The clustering library is still in development. Further functionality and improvements will be made to the library on an ongoing basis. If you have any issues, questions or suggestions, please write to ai@kx.com. diff --git a/clust/aprop.q b/clust/aprop.q index 1fc46dce..4b4c1f78 100644 --- a/clust/aprop.q +++ b/clust/aprop.q @@ -1,197 +1,61 @@ -\d .ml +// clust/init.q - Affinity propagation +// Copyright (c) 2021 Kx Systems Inc +// +// Clustering using affinity propagation. +// Affinity Propagation groups data based on the similarity +// between points and subsequently finds exemplars, which best +// represent the points in each cluster. The algorithm does +// not require the number of clusters be provided at run time, +// but determines the optimum solution by exchanging real-valued +// messages between points until a high-valued set of exemplars +// is produced. -// Affinity Propagation +\d .ml // @kind function // @category clust -// @fileoverview Fit affinity propagation algorithm -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param dmp {float} Damping coefficient -// @param diag {func} Function applied to the similarity matrix diagonal -// @param iter {dict} Max number of overall iterations and iterations -// without a change in clusters. (::) can be passed in which case the defaults -// of (`total`nochange!200 15) will be used -// @return {dict} Data, input variables, clusters and exemplars -// (`data`inputs`clt`exemplars) required for the predict method -clust.ap.fit:{[data;df;dmp;diag;iter] +// @desc Fit affinity propagation algorithm +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.df' +// @param damp {float} Damping coefficient +// @param diag {fn} Function applied to the similarity matrix diagonal +// @param iter {dictionary} Max number of overall iterations and iterations +// without a change in clusters. 
(::) can be passed in which case the +// defaults of (`total`noChange!200 15) will be used +// @return {dictionary} Data, input variables, clusters and exemplars +// (`data`inputs`clust`exemplars) required, along with a projection of the +// predict function +clust.ap.fit:{[data;df;damp;diag;iter] data:clust.i.floatConversion[data]; - defaultDict:`run`total`nochange!0 200 15; + defaultDict:`run`total`noChange!0 200 15; if[iter~(::);iter:()!()]; if[99h<>type iter;'"iter must be (::) or a dictionary"]; - // update iteration dictionary with user changes + // Update iteration dictionary with user changes updDict:defaultDict,iter; - // cluster data using AP algo - clust.i.runap[data;df;dmp;diag;til count data 0;updDict] + // Cluster data using AP algo + modelInfo:clust.i.runAp[data;df;damp;diag;til count data 0;updDict]; + returnInfo:enlist[`modelInfo]!enlist modelInfo; + predictFunc:clust.ap.predict returnInfo; + returnInfo,enlist[`predict]!enlist predictFunc } // @kind function // @category clust -// @fileoverview Predict clusters using AP config -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`inputs`clt`exemplars returned by clust.ap.fit -// @return {long[]} List of predicted clusters -clust.ap.predict:{[data;cfg] +// @desc Predict clusters using AP config +// @param config {dictionary} `data`inputs`clust`exemplars returned by the +// modelInfo key from the return of clust.ap.fit +// @param data {float[][]} Each column of the data is an individual datapoint +// @return {long[]} Predicted clusters +clust.ap.predict:{[config;data] + config:config`modelInfo; data:clust.i.floatConversion[data]; - if[-1~first cfg`clt; - '"'.ml.clust.ap.fit' did not converge, all clusters returned -1. Cannot predict new data."]; - // retrieve cluster centres from training data - ex:cfg[`data][;distinct cfg`exemplars]; - // predict testing data clusters - clust.i.appreddist[ex;cfg[`inputs]`df]each$[0h=type data;flip;enlist]data - } - - -// Utilities - -// @kind function -// @category private -// @fileoverview Run affinity propagation algorithm -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param dmp {float} Damping coefficient -// @param diag {func} Function applied to the similarity matrix diagonal -// @param idxs {long[]} List of indicies to find distances for -// @param iter {dict} Max number of overall iterations and iterations -// without a change in clusters. 
(::) can be passed in where the defaults -// of (`total`nochange!200 15) will be used -// @return {long[]} List of clusters -clust.i.runap:{[data;df;dmp;diag;idxs;iter] - // check negative euclidean distance has been given - if[df<>`nege2dist;clust.i.err.ap[]]; - // calculate distances, availability and responsibility - info0:clust.i.apinit[data;df;diag;idxs]; - // initialize exemplar matrix and convergence boolean - info0,:`emat`conv`iter!((count data 0;iter`nochange)#0b;0b;iter); - // run ap algo until maximum number of iterations completed or convergence - info1:clust.i.apstop clust.i.apalgo[dmp]/info0; - // return data, inputs, clusters and exemplars - inputs:`df`dmp`diag`iter!(df;dmp;diag;iter); - exemplars:info1`exemplars; - clt:$[info1`conv;clust.i.reindex exemplars;count[data 0]#-1]; - `data`inputs`clt`exemplars!(data;inputs;clt;exemplars) - } - -// @kind function -// @category private -// @fileoverview Initialize matrices -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param diag {func} Function applied to the similarity matrix diagonal -// @param idxs {long[]} List of point indices -// @return {dict} Similarity, availability and responsibility matrices -// and keys for matches and exemplars to be filled during further iterations -clust.i.apinit:{[data;df;diag;idxs] - // calculate similarity matrix values - s:clust.i.dists[data;df;data]each idxs; - // update diagonal - s:@[;;:;diag raze s]'[s;k:til n:count data 0]; - // create lists/matrices of zeros for other variables - `matches`exemplars`s`a`r!(0;0#0;s),(2;n;n)#0f - } - -// @kind function -// @category private -// @fileoverview Run affinity propagation algorithm -// @param dmp {float} Damping coefficient -// @param info {dict} Similarity, availability, responsibility, exemplars, -// matches, iter dictionary, no_conv boolean and iter dict -// @return {dict} Updated inputs -clust.i.apalgo:{[dmp;info] - // update responsibility matrix - info[`r]:clust.i.updr[dmp;info]; - // update availability matrix - info[`a]:clust.i.upda[dmp;info]; - // find new exemplars - ex:imax each sum info`a`r; - // update `info` with new exemplars/matches - info:update exemplars:ex,matches:?[exemplars~ex;matches+1;0]from info; - // update iter dictionary - .[clust.i.apconv info;(`iter;`run);+[1]] - } - -// @kind function -// @category private -// @fileoverview Check affinity propagation algorithm for convergence -// @param info {dict} Similarity, availability, responsibility, exemplars, -// matches, iter dictionary, no_conv boolean and iter dict -// @return {dict} Updated info dictionary -clust.i.apconv:{[info] - // iteration dictionary - iter:info`iter; - // exemplar matrix - emat:info`emat; - // existing exemplars - ediag:0sum(se=iter`nochange)+0=se:sum each emat; - conv:$[(iter[`total]=iter`run)|not[unconv]&sum[ediag]>0;1b;0b]]; - // return updated info - info,`emat`conv!(emat;conv) - } - -// @kind function -// @category private -// @fileoverview Retrieve diagonal from a square matrix -// @param m {any[][]} Square matrix -// @return {any[]} Matrix diagonal -clust.i.diag:{[m] - {x y}'[m;til count m] - } - -// @kind function -// @category private -// @fileoverview Update responsibility matrix -// @param dmp {float} Damping coefficient -// @param info {dict} Similarity, availability, responsibility, exemplars, -// matches, iter dictionary, no_conv boolean and iter dict -// @return {float[][]} Updated responsibility matrix 
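// Usage sketch of the refactored fit/predict interface introduced in this
// patch (hypothetical toy data; assumes the toolkit is loaded, e.g. via
// \l ml/ml.q and .ml.loadfile`:init.q). Affinity propagation requires the
// `nege2dist distance function, and the returned `predict projection signals
// an error if the fit did not converge.
data:2 20#40?10f                        / each column is a datapoint
model:.ml.clust.ap.fit[data;`nege2dist;0.7;min;::]
model[`modelInfo;`clust]                / fitted cluster labels
model[`predict]2 5#10?10f               / predict clusters for new points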
-clust.i.updr:{[dmp;info] - // create matrix with every points max responsibility - // diagonal becomes -inf, current max is becomes second max - mxresp:{[x;i]@[count[x]#mx;j;:;]max@[x;i,j:x?mx:max x;:;-0w]}; - mx:mxresp'[sum info`s`a;til count info`r]; - // calculate new responsibility - (dmp*info`r)+(1-dmp)*info[`s]-mx - } - -// @kind function -// @category private -// @fileoverview Update availability matrix -// @param dmp {float} Damping coefficient -// @param info {dict} Similarity, availability, responsibility, exemplars, -// matches, iter dictionary, no_conv boolean and iter dict -// @return {float[][]} Returns updated availability matrix -clust.i.upda:{[dmp;info] - // sum values in positive availability matrix - s:sum@[;;:;0f]'[pv:0|info`r;k:til n:count info`a]; - // create a matrix using the negative values produced by the availability sum - // + responsibility diagonal - positive availability values - a:@[;;:;]'[0&(s+info[`r]@'k)-/:pv;k;s]; - // calculate new availability - (dmp*info`a)+a*1-dmp - } - -// @kind function -// @category private -// @fileoverview Stopping condition for affinity propagation algorithm -// @param info {dict} Similarity, availability, responsibility, exemplars, -// matches, iter dictionary, no_conv boolean and iter dict -// @return {bool} Indicates whether to continue or stop running AP (1/0b) -clust.i.apstop:{[info] - (info[`iter;`total]>info[`iter]`run)¬ 1b~info`conv - } - -// @kind function -// @category private -// @fileoverview Predict clusters using AP training exemplars -// @param ex {float[][]} Training cluster centres in matrix format, -// each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param pt {float[]} Current data point -// @return {long[]} Predicted clusters -clust.i.appreddist:{[ex;df;pt] - d?max d:clust.i.dists[ex;df;pt]each til count ex 0 + if[-1~first config`clust; + '"'.ml.clust.ap.fit' did not converge, all clusters returned -1.", + " Cannot predict new data." + ]; + // Retrieve cluster centres from training data + exemp:config[`data][;distinct config`exemplars]; + // Predict testing data clusters + data:$[0h=type data;flip;enlist]data; + clust.i.apPredDist[exemp;config[`inputs]`df]each data } diff --git a/clust/dbscan.q b/clust/dbscan.q index 83fce63c..8a401e00 100644 --- a/clust/dbscan.q +++ b/clust/dbscan.q @@ -1,138 +1,107 @@ +// clust/dbscan.q - DBSCAN clustering +// Copyright (c) 2021 Kx Systems Inc +// +// DBSCAN clustering. +// The Density-Based Spatial Clustering of Applications with Noise +// (DBSCAN) algorithm groups points that are closely packed in areas +// of high density. 
Any points in low-density regions are seen as outliers + \d .ml // Density-Based Spatial Clustering of Applications with Noise (DBSCAN) // @kind function // @category clust -// @fileoverview Fit DBSCAN algorithm to data -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param minpts {long} Minimum number of points with the epsilon radius -// @param eps {float} Epsilon radius to search -// @return {dict} Data, inputs, clusters and cluster table -// (`data`inputs`clt`t) required for predict and update methods -clust.dbscan.fit:{[data;df;minpts;eps] +// @desc Fit DBSCAN algorithm to data +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.df' +// @param minPts {long} Minimum number of points with the epsilon radius +// @param eps {float} Epsilon radius to search +// @return {dictionary} A dictionary containing: +// modelInfo - Encapsulates all relevant infromation needed to fit +// the model `data`inputs`clust`tab, where data is the original data, +// inputs are the user defined minPts and eps, clust are the cluster +// assignments and tab is the neighbourhood table defining items in the +// clusters. +// predict - A projection allowing for prediction on new input data +// update - A projection allowing new data to be used to update +// cluster centers such that the model can react to new data +clust.dbscan.fit:{[data;df;minPts;eps] data:clust.i.floatConversion[data]; - // check distance function + // Check distance function if[not df in key clust.i.df;clust.i.err.df[]]; - // create neighbourhood table - t:clust.i.nbhoodtab[data;df;minpts;eps;til count data 0]; - // apply the density based clustering algorithm over the neighbourhood table - t:{[t]any t`corepoint}clust.i.dbalgo/t; - // find cluster for remaining points and return list of clusters - clt:-1^exec cluster from t; - // return config dict - `data`inputs`clt`t!(data;`df`minpts`eps!(df;minpts;eps);clt;t) + // Create neighbourhood table + tab:clust.i.nbhoodTab[data;df;minPts;eps;til count data 0]; + // Apply the density based clustering algorithm over the neighbourhood table + tab:{[t]any t`corePoint}clust.i.dbAlgo/tab; + // Find cluster for remaining points and return list of clusters + clust:-1^exec cluster from tab; + // Return config dict + inputDict:`df`minPts`eps!(df;minPts;eps); + modelInfo:`data`inputs`clust`tab!(data;inputDict;clust;tab); + returnInfo:enlist[`modelInfo]!enlist modelInfo; + predictFunc:clust.dbscan.predict returnInfo; + updFunc:clust.dbscan.update returnInfo; + returnInfo,`predict`update!(predictFunc;updFunc) } // @kind function // @category clust -// @fileoverview Predict clusters using DBSCAN config -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`df`minpts`eps`clt returned from DBSCAN -// clustered training data -// @return {long[]} List of predicted clusters -clust.dbscan.predict:{[data;cfg] +// @desc Predict clusters using DBSCAN config +// @param config {dictionary} A dictionary returned from '.ml.clust.dbscan.fit' +// containing: +// modelInfo - Encapsulates all relevant infromation needed to fit +// the model `data`inputs`clust`tab, where data is the original data, +// inputs are the user defined minPts and eps, clust are the cluster +// assignments and tab is the neighbourhood table defining items in the +// clusters. 
+// predict - A projection allowing for prediction on new input data +// update - A projection allowing new data to be used to update +// cluster centers such that the model can react to new data +// @param data {float[][]} Each column of the data is an individual datapoint +// @return {long[]} Predicted clusters +clust.dbscan.predict:{[config;data] + config:config[`modelInfo]; data:clust.i.floatConversion[data]; - // predict new clusters - -1^exec cluster from clust.i.dbscanpredict[data;cfg] + // Predict new clusters + -1^exec cluster from clust.i.dbscanPredict[data;config] } // @kind function // @category clust -// @fileoverview Update DBSCAN config including new data points -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`inputs`clt`nbh returned from DBSCAN clustered training data -// @return {dict} Updated model config -clust.dbscan.update:{[data;cfg] +// @desc Update DBSCAN config including new data points +// @param config {dictionary} A dictionary returned from '.ml.clust.dbscan.fit' +// containing: +// modelInfo - Encapsulates all relevant infromation needed to fit +// the model `data`inputs`clust`tab, where data is the original data, +// inputs are the user defined minPts and eps, clust are the cluster +// assignments and tab is the neighbourhood table defining items in the +// clusters. +// predict - A projection allowing for prediction on new input data +// update - A projection allowing new data to be used to update +// cluster centers such that the model can react to new data +// @param data {float[][]} Each column of the data is an individual datapoint +// and update functions +// @return {dictionary} Updated model configuration (config), including predict +clust.dbscan.update:{[config;data] + modelConfig:config[`modelInfo]; data:clust.i.floatConversion[data]; - // original data prior to addition of new points, with core points set - orig:update corepoint:1b from cfg[`t]where cluster<>0N; - // predict new clusters - new:clust.i.dbscanpredict[data;cfg]; - // include new data points in training neighbourhood - orig:clust.i.updnbhood/[orig;new;count[orig]+til count new]; - // fit model with new data included to update model - t:{[t]any t`corepoint}.ml.clust.i.dbalgo/orig,new; - // reindex the clusters - t:update{(d!til count d:distinct x)x}cluster from t where cluster<>0N; + // Original data prior to addition of new points, with core points set + orig:update corePoint:1b from modelConfig[`tab]where cluster<>0N; + // Predict new clusters + new:clust.i.dbscanPredict[data;modelConfig]; + // Include new data points in training neighbourhood + orig:clust.i.updNbhood/[orig;new;count[orig]+til count new]; + // Fit model with new data included to update model + tab:{[t]any t`corePoint}.ml.clust.i.dbAlgo/orig,new; + // Reindex the clusters + tab:update{(d!til count d:distinct x)x}cluster from tab where cluster<>0N; // return updated config - cfg,`data`t`clt!(cfg[`data],'data;t;-1^exec cluster from t) - } - - -// Utilities - -// @kind function -// @category private -// @fileoverview Update the neighbourhood of a previously fit original dbscan model based on new data -// @param orig {tab} Original table of data with all points set as core points -// @param new {tab} Table generated from new data with the previously generated model -// @param idx {long[]} Indices used to update the neighbourhood of the original table -// @return {tab} Table with neighbourhood updated appropriately for the newly introduced data 
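// Usage sketch of the refactored DBSCAN interface (hypothetical toy data):
// fit returns `modelInfo alongside `predict and `update projections, so the
// model can score new points and absorb them without refitting from scratch.
pts:2 30#60?5f                          / each column is a datapoint
mdl:.ml.clust.dbscan.fit[pts;`e2dist;3;0.5]
mdl[`modelInfo;`clust]                  / fitted clusters, -1 marks outliers
mdl[`predict]2 5#10?5f                  / assign new points to clusters
mdl2:mdl[`update]2 5#10?5f              / updated config including new points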
-clust.i.updnbhood:{[orig;new;idx] - update nbhood:{x,'y}[nbhood;idx]from orig where i in new`nbhood - } - -// @kind function -// @category private -// @fileoverview Predict clusters using DBSCAN config -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`inputs`clt returned from DBSCAN clustered training data -// @return {tab} Cluster table -clust.i.dbscanpredict:{[data;cfg] - idx:count[cfg[`data]0]+til count data 0; - // create neighbourhood table - t:clust.i.nbhoodtab[cfg[`data],'data;;;;idx]. cfg[`inputs;`df`minpts`eps]; - // find which existing clusters new data belongs to - update cluster:{x[`clt]first y}[cfg]each nbhood from t where corepoint - } - -// @kind function -// @category private -// @fileoverview Create neighbourhood table for points at indices provided -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param minpts {long} Minimum number of points with the epsilon radius -// @param eps {float} Epsilon radius to search -// @param idx {long[]} Data indices to find neighbourhood for -// @return {table} Neighbourhood table with columns `nbhood`cluster`corepoint -clust.i.nbhoodtab:{[data;df;minpts;eps;idx] - // calculate distances and find all points which are not outliers - nbhood:clust.i.nbhood[data;df;eps]each idx; - // update outlier cluster to null - update cluster:0N,corepoint:minpts<=1+count each nbhood from([]nbhood) - } - -// @kind function -// @category private -// @fileoverview Find all points which are not outliers -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param eps {float} Epsilon radius to search -// @param idx {long} Index of current point -// @return {long[]} Indices of points within the epsilon radius -clust.i.nbhood:{[data;df;eps;idx] - where eps>@[;idx;:;0w]clust.i.df[df]data-data[;idx] - } - -// @kind function -// @category private -// @fileoverview Run DBSCAN algorithm and update cluster of each point -// @param t {table} Cluster info table -// @return {table} Updated cluster table with old clusters merged -clust.i.dbalgo:{[t] - nbh:.ml.clust.i.nbhoodidxs[t]/[first where t`corepoint]; - update cluster:0|1+max t`cluster,corepoint:0b from t where i in nbh - } - -// @kind function -// @category private -// @fileoverview Find indices in each points neighborhood -// @param t {table} Cluster info table -// @param idxs {long[]} Indices to search the neighborhood of -// @return {long[]} Indices in neighborhood -clust.i.nbhoodidxs:{[t;idxs] - nbh:exec nbhood from t[distinct idxs,raze t[idxs]`nbhood]where corepoint; - asc distinct idxs,raze nbh + clusts:-1^exec cluster from tab; + modelConfig,:`data`tab`clust!(modelConfig[`data],'data;tab;clusts); + returnInfo:enlist[`modelInfo]!enlist modelConfig; + returnKeys:`predict`update; + returnVals:(clust.dbscan.predict returnInfo; + clust.dbscan.update returnInfo); + returnInfo,returnKeys!returnVals } diff --git a/clust/hierarchical.q b/clust/hierarchical.q index 1df3d122..303613f9 100644 --- a/clust/hierarchical.q +++ b/clust/hierarchical.q @@ -1,545 +1,210 @@ +// clust/hierarchical.q - Hierarchical and CURE clustering +// Copyright (c) 2021 Kx Systems Inc +// +// Hierarchical clustering. 
+// Agglomerative hierarchical clustering iteratively groups data, +// using a bottom-up approach that initially treats all data +// points as individual clusters. +// +// CURE clustering. +// Clustering Using REpresentatives (CURE) is a technique used to deal +// with datasets containing outliers and clusters of varying sizes and +// shapes. Each cluster is represented by a specified number of +// representative points. These points are chosen by taking the most +// scattered points in each cluster and shrinking them towards the +// cluster center using a compression ratio. + \d .ml // Clustering Using REpresentatives (CURE) and Hierarchical Clustering // @kind function // @category clust -// @fileoverview Fit CURE algorithm to data -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param n {long} Number of representative points per cluster -// @param c {float} Compression factor for representative points -// @return {dict} Data, input variables and dendrogram -// (`data`inputs`dgram) required for predict method +// @desc Fit CURE algorithm to data +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.i.df' +// @param n {long} Number of representative points per cluster +// @param c {float} Compression factor for representative points +// @return {dictionary} A dictionary containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data clust.cure.fit:{[data;df;n;c] data:clust.i.floatConversion[data]; if[not df in key clust.i.df;clust.i.err.df[]]; - dgram:clust.i.hcscc[data;df;`cure;1;n;c;1b]; - `data`inputs`dgram!(data;`df`n`c!(df;n;c);dgram) + dgram:clust.i.hcSCC[data;df;`cure;1;n;c;1b]; + modelInfo:`data`inputs`dgram!(data;`df`n`c!(df;n;c);dgram); + returnInfo:enlist[`modelInfo]!enlist modelInfo; + predictFunc:clust.cure.predict returnInfo; + returnInfo,enlist[`predict]!enlist predictFunc } // @kind function // @category clust -// @fileoverview Fit Hierarchical algorithm to data -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param lf {symbol} Linkage function name within '.ml.clust.lf' -// @return {dict} Data, input variables and dendrogram -// (`data`inputs`dgram) required for predict method +// @desc Fit Hierarchical algorithm to data +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.i.df' +// @param lf {symbol} Linkage function name within '.ml.clust.i.lf' +// @return {dictionary} A dictionary containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data clust.hc.fit:{[data;df;lf] - // check distance and linkage functions + // Check distance and linkage functions data:clust.i.floatConversion[data]; if[not df in key clust.i.df;clust.i.err.df[]]; dgram:$[lf 
in`complete`average`ward; - clust.i.hccaw[data;df;lf;2;1b]; + clust.i.hcCAW[data;df;lf;2;1b]; lf in`single`centroid; - clust.i.hcscc[data;df;lf;1;::;::;1b]; + clust.i.hcSCC[data;df;lf;1;::;::;1b]; clust.i.err.lf[] ]; - `data`inputs`dgram!(data;`df`lf!(df;lf);dgram) + modelInfo:`data`inputs`dgram!(data;`df`lf!(df;lf);dgram); + returnInfo:enlist[`modelInfo]!enlist modelInfo; + predictFunc:clust.hc.predict returnInfo; + returnInfo,enlist[`predict]!enlist predictFunc } // @kind function // @category clust -// @fileoverview Convert CURE cfg to k clusters -// @param cfg {dict} Output of .ml.clust.cure.fit -// @param k {long} Number of clusters -// @return {dict} Updated config with clusters labels added -clust.cure.cutk:{[cfg;k] - cfg,enlist[`clt]!enlist clust.i.cutdgram[cfg`dgram;k-1] +// @desc Convert CURE config to k clusters +// @param config {dictionary} A dictionary returned from '.ml.clust.cure.fit' +// containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data +// @param k {long} Number of clusters +// @return {dictionary} Updated config with clusters labels added +clust.cure.cutK:{[config;k] + clust.i.checkK[k]; + clustVal:clust.i.cutDgram[config[`modelInfo;`dgram];k-1]; + clusts:enlist[`clust]!enlist clustVal; + config,clusts } // @kind function // @category clust -// @fileoverview Convert hierarchical cfg to k clusters -// @param cfg {dict} Output of .ml.clust.hc.fit -// @param k {long} Number of clusters -// @return {dict} Updated config with clusters added -clust.hc.cutk:clust.cure.cutk +// @desc Convert hierarchical config to k clusters +// @param config {dictionary} A dictionary returned from '.ml.clust.hc.fit' +// containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data +// @param k {long} Number of clusters +// @return {dictionary} Updated config with clusters added +clust.hc.cutK:clust.cure.cutK // @kind function // @category clust -// @fileoverview Convert CURE dendrogram to clusters based on distance +// @desc Convert CURE dendrogram to clusters based on distance // threshold -// @param cfg {dict} Output of .ml.clust.cure.fit -// @param dthresh {float} Cutting distance threshold -// @return {dict} Updated config with clusters added -clust.cure.cutdist:{[cfg;dthresh] - dgram:cfg`dgram; - k:0|count[dgram]-exec first i from dgram where dist>dthresh; - cfg,enlist[`clt]!enlist clust.i.cutdgram[dgram;k] +// @param config {dictionary} A dictionary returned from '.ml.clust.cure.fit' +// containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data +// @param distThresh {float} Cutting distance threshold +// @return {dictionary} Updated config with clusters added +clust.cure.cutDist:{[config;distThresh] + clust.i.checkDist[distThresh]; + dgram:config[`modelInfo;`dgram]; + k:0|count[dgram]-exec first i from 
dgram where dist>distThresh; + config,enlist[`clust]!enlist clust.i.cutDgram[dgram;k] } // @kind function // @category clust -// @fileoverview Convert hierarchical dendrogram to clusters based on distance +// @desc Convert hierarchical dendrogram to clusters based on distance // threshold -// @param cfg {dict} Output of .ml.clust.hc.fit -// @param dthresh {float} Cutting distance threshold -// @return {dict} Updated config with clusters added -clust.hc.cutdist:clust.cure.cutdist +// @param config {dictionary} A dictionary returned from '.ml.clust.cure.fit' +// containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data +// @param distThresh {float} Cutting distance threshold +// @return {dictionary} Updated config with clusters added +clust.hc.cutDist:clust.cure.cutDist // @kind function // @category clust -// @fileoverview Predict clusters using CURE config -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`df`n`c`clt returned from .ml.clust.(cutk/cutdist) -// @return {long[]} List of predicted clusters -clust.cure.predict:{[data;cfg] - clust.i.hccpred[`cure;data;cfg] +// @desc Predict clusters using CURE config +// @param config {dictionary} A dictionary returned from '.ml.clust.cure.fit' +// containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`inputs`dgram, where data is the original data, inputs +// are the user defined linkage and distance functions while dgram +// is the generated dendrogram +// predict - A projection allowing for prediction on new input data +// @param data {float[][]} Each column of the data is an individual datapoint +// @param cutDict {dictionary} The key defines what cutting algo to use when +// splitting the data into clusters (`k/`dist) and the value defines the +// cutting threshold +// @return {long[]} Predicted clusters +clust.cure.predict:{[config;data;cutDict] + updConfig:clust.i.prepPred[config;cutDict]; + clust.i.hCCpred[`cure;data;updConfig] } // @kind function // @category clust -// @fileoverview Predict clusters using hierarchical config -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`df`lf`clt returned from .ml.clust.(cutk/cutdist) -// @return {long[]} List of predicted clusters -clust.hc.predict:{[data;cfg] - clust.i.hccpred[`hc;data;cfg] - } - - -// Utilities - -// @kind function -// @category private -// @fileoverview Complete, Average, Ward (CAW) Linkage -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param lf {symbol} Linkage function name within '.ml.clust.lf' -// @param k {long} Number of clusters -// @param dgram {bool} Generate dendrogram or not (1b/0b) -// @return {table/long[]} Dendrogram or list of clusters -clust.i.hccaw:{[data;df;lf;k;dgram] - // check distance function for ward - if[(not df~`e2dist)&lf=`ward;clust.i.err.ward[]]; - // create initial cluster table - t0:clust.i.initcaw[data;df]; - // create linkage matrix - m:([]i1:`int$();i2:`int$();dist:`float$();n:`int$()); - // merge clusters based on chosen algorithm - r:{[k;r]ki+1;cl[where[cl=cl i]except i]:1+max cl;i+:1]; - // 
update dendrogram with new indices - ![dgram;();0b;`i1`i2!n cut cl] - } - -// @kind function -// @category private -// @fileoverview Convert dendrogram table to clusters -// @param t {table} Dendrogram table -// @param k {long} Define splitting value in dendrogram table -// @return {long[]} List of clusters -clust.i.cutdgram:{[t;k] - // get index of cluster made at cutting point k - idx:(2*cntt:count t)-k-1; - // exclude any clusters made after point k - exclt:i where idx>i:raze neg[k]#'allclt:t`i1`i2; - // extract indices within clusters made until k, excluding any outliers - nout:exclt except outliers:exclt where exclt<=cntt; - clt:{last{count x 0}clust.i.extractclt[x;y]/(z;())}[allclt;cntt+1]each nout; - // update points to the cluster they belong to - @[;;:;]/[(1+cntt)#0N;clt,enlist each outliers;til k+1] - } - -// @kind function -// @category private -// @fileoverview Extract points within merged cluster -// @param clts {long[]} List of cluster indices -// @param cntt {long} Count of dend table -// @param inds {long[]} Index in list to search and indices points found within -// that cluster -// @return {long[]} Next index to search, and additional points found -// within cluster -clust.i.extractclt:{[clts;cntt;inds] - // extract the points that were merged at this point - mrgclt:raze clts[;inds[0]-cntt]; - // Store any single clts, break down clts more than single point - (mrgclt where inext;inds[1],mrgclt where not inext:mrgclt>=cntt) +// @category clust +// @desc Fit CURE algorithm to data and convert dendrogram to clusters +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.i.df' +// @param n {long} Number of representative points per cluster +// @param c {float} Compression factor for representative points +// @param cutDict {dictionary} The key defines what cutting algo to use when +// splitting the data into clusters (`k/`dist) and the value defines the +// cutting threshold +// @return {dictionary} Updated config with clusters added +clust.cure.fitPredict:{[data;df;n;c;cutDict] + fitModel:clust.cure.fit[data;df;n;c]; + clust.i.prepPred[fitModel;cutDict] } // @kind function -// @category private -// @fileoverview SCC algo -// @param data {float[][]} Data in matrix format, each column is -// an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param lf {symbol} Linkage function name within '.ml.clust.lf' -// @param params {dict} Parameters - k (no. clusts), n (no. 
reppts per clust), reppts, kdtree -// @param clusts {table} Cluster table -// @param reppts {float[][]} Representative points and associated info -// @param kdtree {table} k-dimensional tree storing points and distances -// @return {(dict;long[];float[][];table)} Parameters dict, clusters, -// representative points and kdtree tables -clust.i.algoscc:{[data;df;lf;params;clusts;reppts;kdtree;lnkmat] - // merge closest clusters - clust0:exec clust{x?min x}closestDist from clusts where valid; - newmrg:clusts clust0,clust1:clusts[clust0]`closestClust; - newmrg:update valid:10b,reppts:(raze reppts;0#0),points:(raze points;0#0)from newmrg; - // make dendrogram if required - if[lnkmat 1; - m:lnkmat 0; - m,:newmrg[`clusti],fnew[`closestDist],count(fnew:first newmrg)`points; - lnkmat[0]:m - ]; - // keep track of old reppts - oldrep:reppts newmrg[0]`reppts; - // find reps in new cluster - $[sgl:lf~`single; - // for single new reps=old reps -> no new points calculated - newrep:select reppt,clust:clust0 from oldrep; - [ - // generate new representative points table (centroid -> reps=avg; cure -> calc reps) - newrepfunc:$[lf~`centroid;clust.i.centrep;clust.i.curerep[df;params`n;params`c]]; - newrepkeys:params[`rpcols]; - newrepvals:flip newrepfunc[data[;newmrg[0]`points]]; - newrep:flip newrepkeys!newrepvals; - newrep:update clust:clust0,reppt:count[i]#newmrg[0]`reppts from newrep; - // new rep leaves - newrep[`leaf]:(clust.kd.findleaf[kdtree;;kdtree 0]each flip newrep params`rpcols)`self; - newmrg[0;`reppts]:newrep`reppt; - // delete old points from leaf and update new point to new rep leaf - kdtree:.[kdtree;(oldrep`leaf;`idxs);except;oldrep`reppt]; - kdtree:.[kdtree;(newrep`leaf;`idxs);union ;newrep`reppt] - ] - ]; - // update clusters and reppts - clusts:@[clusts;newmrg`clust;,;delete clust from newmrg]; - reppts:@[reppts;newrep`reppt;,;delete reppt from newrep]; - updrep:reppts newrep`reppt; - // nneighbour to clust - if[sgl;updrep:select from updrep where closestClust in newmrg`clust]; - // calculate and append to representative point table the nearest neighbours - // of columns containing representative points - updrepdata:flip updrep params`rpcols; - updrepdatann:clust.kd.nn[kdtree;reppts params`rpcols;df;newmrg[0]`points] each updrepdata; - updrep:updrep,'updrepdatann; - updrep:update closestClust:reppts[closestPoint;`clust]from updrep; - if[sgl; - reppts:@[reppts;updrep`reppt;,;select closestDist,closestClust from updrep]; - updrep:reppts newrep`reppt]; - // update nneighbour of new clust - updrep@:raze imin updrep`closestDist; - clusts:@[clusts;updrep`clust;,;`closestDist`closestClust#updrep]; - $[sgl; - // single - nneighbour=new clust - [clusts:update closestClust:clust0 from clusts where valid,closestClust=clust1; - reppts:update closestClust:clust0 from reppts where closestClust=clust1]; - // else do nneighbour search - if[count updcls:select from clusts where valid,closestClust in(clust0;clust1); - updcls:updcls,'{x imin x`closestDist}each clust.kd.nn[kdtree;reppts params`rpcols;df]/:' - [updcls`reppts;flip each reppts[updcls`reppts]@\:params`rpcols]; - updcls[`closestClust]:reppts[updcls`closestPoint]`clust; - clusts:@[clusts;updcls`clust;,;select closestDist,closestClust from updcls] - ] - ]; - (params;clusts;reppts;kdtree;lnkmat) +// @category clust +// @desc Fit hierarchial algorithm to data and convert dendrogram +// to clusters +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.i.df' +// 
@param lf {symbol} Linkage function name within '.ml.clust.i.lf' +// @param cutDict {dictionary} The key defines what cutting algo to use when +// splitting the data into clusters (`k/`dist) and the value defines the +// cutting threshold +// @return {dictionary} Updated config with clusters added +clust.hc.fitPredict:{[data;df;lf;cutDict] + fitModel:clust.hc.fit[data;df;lf]; + clust.i.prepPred[fitModel;cutDict] } diff --git a/clust/init.q b/clust/init.q index e8e69e36..15804bfa 100644 --- a/clust/init.q +++ b/clust/init.q @@ -1,13 +1,21 @@ +// clust/init.q - Load clustering library +// Copyright (c) 2021 Kx Systems Inc +// +// Clustering algorithms including affinity propagation, +// cure, dbscan, hierarchical, and k-means clustering + \d .ml // required for use of .ml.confmat in score.q loadfile`:util/init.q // load clustering files -loadfile`:clust/util.q +loadfile`:clust/utils.q loadfile`:clust/kdtree.q loadfile`:clust/kmeans.q loadfile`:clust/aprop.q loadfile`:clust/dbscan.q loadfile`:clust/hierarchical.q loadfile`:clust/score.q + +.ml.i.deprecWarning`clust diff --git a/clust/kdtree.q b/clust/kdtree.q index 97a50918..bead5ac7 100644 --- a/clust/kdtree.q +++ b/clust/kdtree.q @@ -1,130 +1,104 @@ +// clust/kdtree.q - K dimensional tree +// Copyright (c) 2021 Kx Systems Inc +// +// A k-dimensional tree (k-d tree) is a special case of the +// binary search tree data structure, commonly used in computer +// science to organize data points in k-dimensional space. +// Each leaf node in the tree contains a set of k-dimensional points, +// while each non-leaf node generates a splitting hyperplane +// which divides the surrounding space. + \d .ml // K-Dimensional (k-d) Tree // @kind function // @category clust -// @fileoverview Create new k-d tree -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param leafsz {long} Number of points per leaf (<2*number of reppts) -// @return {table} k-d tree -clust.kd.newtree:{[data;leafsz] +// @desc Create new k-d tree +// @param data {float[][]} Each column of the data is an individual datapoint +// @param leafSize {long} Number of points per leaf (<2*number of reppts) +// @return {table} k-d tree +clust.kd.newTree:{[data;leafSize] args:`leaf`left`parent`self`idxs!(0b;0b;0N;0;til count data 0); - clust.kd.i.tree[data;leafsz]args + clust.kd.i.tree[data;leafSize]args } // @kind function // @category clust -// @fileoverview Find nearest neighhbors in k-d tree -// @param tree {table} k-d tree -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param xidxs {long[][]} Points to exclude in search -// @param pt {long[]} Point to find nearest neighbor for -// @return {dict} Nearest neighbor dictionary with closest point, +// @desc Find nearest neighhbors in k-d tree +// @param tree {table} k-d tree +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.df' +// @param xIdxs {long[][]} Points to exclude in search +// @param pt {long[]} Point to find nearest neighbor for +// @return {dictionary} Nearest neighbor dictionary with closest point, // distance, points searched and points to search -clust.kd.q.nn:clust.kd.nn:{[tree;data;df;xidxs;pt] - nninit:(0N;0w;0#0;clust.kd.findleaf[tree;pt;tree 0]); - start:`closestPoint`closestDist`xnodes`node!nninit; - stop:{[nninfo]not null nninfo[`node;`self]}; - 2#stop 
clust.kd.i.nncheck[tree;data;df;xidxs;pt]/start +clust.kd.q.nn:clust.kd.nn:{[tree;data;df;xIdxs;pt] + nnInit:(0N;0w;0#0;clust.kd.findLeaf[tree;pt;tree 0]); + start:`closestPoint`closestDist`xNodes`node!nnInit; + stop:{[nnInfo]not null nnInfo[`node;`self]}; + 2#stop clust.kd.i.nnCheck[tree;data;df;xIdxs;pt]/start } // @kind function -// @category private -// @fileoverview Create tree table where each row represents a node -// @param data {float[][]} Points in `value flip` format -// @param leafsz {long} Points per leaf (<2*number of representatives) -// @param node {dict} Info for a given node in the tree -// @return {table} k-d tree table -clust.kd.i.tree:{[data;leafsz;node] - if[leafsz<=.5*count node`idxs; - chk:xdatatype cfg;'"cfg must be (::) or a dictionary"]; - // update iteration dictionary with user changes - updDict:defaultDict,cfg; - // fit algo to data - r:clust.i.kmeans[data;df;k;updDict]; - // return config with new clusters - r,`data`inputs!(data;`df`k`iter`kpp!(df;k;updDict`iter;updDict`init)) + if[config~(::);config:()!()]; + if[99h<>type config;'"config must be (::) or a dictionary"]; + // Update iteration dictionary with user changes + updDict:defaultDict,config; + // Fit algo to data + r:clust.i.kMeans[data;df;k;updDict]; + // Return config with new clusters + inputDict:`df`k`iter`kpp!(df;k;updDict`iter;updDict`init); + modelInfo:r,`data`inputs!(data;inputDict); + returnInfo:enlist[`modelInfo]!enlist modelInfo; + predictFunc:clust.kmeans.predict returnInfo; + updFunc:clust.kmeans.update returnInfo; + returnInfo,`predict`update!(predictFunc;updFunc) } // @kind function // @category clust -// @fileoverview Predict clusters using k-means config -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param cfg {dict} `data`df`reppts`clt returned from kmeans clustered training data -// @return {long[]} List of predicted clusters -clust.kmeans.predict:{[data;cfg] +// @desc Predict clusters using k-means config +// @param config {dictionary} A dictionary returned from '.ml.clust.kmeans.fit' +// containing: +// modelInfo - Encapsulates all relevant information needed to fit +// the model `data`df`repPts`clust, where data and df are the inputs, +// repPts are the calculated k centers and clust are clusters associated +// with each of the datapoints +// predict - A projection allowing for prediction on new input data +// update - A projection allowing new data to be used to update +// cluster centers such that the model can react to new data +// @param data {float[][]} Each column of the data is an individual datapoint +// @return {long[]} Predicted clusters +clust.kmeans.predict:{[config;data] + config:config[`modelInfo]; data:clust.i.floatConversion[data]; - // get new clusters based on latest config - clust.i.getclust[data;cfg[`inputs]`df;cfg`reppts] + // Get new clusters based on latest config + clust.i.getClust[data;config[`inputs]`df;config`repPts] } // @kind function // @category clust -// @fileoverview Update kmeans config including new data points -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param cfg {dict} `data`df`reppts`clt returned from kmeans clustered on training data -// @return {dict} Updated model config -clust.kmeans.update:{[data;cfg] +// @desc Update kmeans config including new data points +// @param config {dictionary} A dictionary returned from '.ml.clust.kmeans.fit' +// containing: +// modelInfo - 
Encapsulates all relevant information needed to fit +// the model `data`df`repPts`clust, where data and df are the inputs, +// repPts are the calculated k centers and clust are clusters associated +// with each of the datapoints +// predict - A projection allowing for prediction on new input data +// update - A projection allowing new data to be used to update +// cluster centers such that the model can react to new data +// @param data {float[][]} Each column of the data is an individual datapoint +// @return {dictionary} Updated model configuration (config), including predict +// and update functions +clust.kmeans.update:{[config;data] + modelConfig:config[`modelInfo]; data:clust.i.floatConversion[data]; - // update data to include new points - cfg[`data]:cfg[`data],'data; - // update k means - cfg[`reppts]:clust.i.updcenters[cfg`data;cfg[`inputs]`df;()!();cfg`reppts]; - // get updated clusters based on new means - cfg[`clt]:clust.i.getclust[cfg`data;cfg[`inputs]`df;cfg`reppts]; - // return updated config - cfg - } - - -// Utilities - -// @kind function -// @category private -// @fileoverview K-Means algorithm -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param k {long} Number of clusters -// @param cfg {dict} Configuration information containing the maximum iterations `iter, -// initialisation type `init and threshold for smallest distance -// to move between the previous and new run `thresh -// @return {dict} Clusters or reppts depending on rep -clust.i.kmeans:{[data;df;k;cfg] - // check distance function - if[not df in`e2dist`edist;clust.i.err.kmeans[]]; - // initialize representative points - initreppts:$[cfg`init;clust.i.initkpp df;clust.i.initrdm][data;k]; - // run algo until maximum number of iterations reached or convergence - reppts0:`idx`reppts`notconv!(0;initreppts;1b); - reppts1:clust.i.kmeansConverge[cfg] clust.i.updcenters[data;df;cfg]/reppts0; - // return representative points and clusters - `reppts`clt!(reppts1`reppts;clust.i.getclust[data;df;reppts1`reppts]) - } - -// @kind function -// @category private -// @fileoverview Check to see if cluster centers are stable or -// if the maximum number of iterations allowable have been reached -// @param cfg {dict} Configuration information containing the maximum iterations `iter, -// initialisation type `init and threshold for smallest distance -// to move between the previous and new run `thresh -// @param algorun {dict} Information about the current run of the algorithm which can have an -// impact on early or on time stopping i.e. have the maximum number of iterations been exceeded -// or have the cluster centers not moved more than the threshold i.e. 
'stationary' -// @return {bool} 0b indicates number of iterations has exceeded maximum and -clust.i.kmeansConverge:{[cfg;algorun] - check1:cfg[`iter]>algorun`idx; - check2:algorun`notconv; - check1 & check2 - } - -// @kind function -// @category private -// @fileoverview Update cluster centers -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param cfg {dict} Configuration information containing the maximum iterations `iter, -// initialisation type `init and threshold for smallest distance -// to move between the previous and new run `thresh -// @param reppts {float[][]/dict} Information relating to the representative points, in the case of -// fitting the model this is a dictionary containing the current iteration index and if the data -// has converged in addition to the representative points. In an individual update this is just -// the representative points for the k means centers. -// @return {float[][]} Updated representative points -clust.i.updcenters:{[data;df;cfg;reppts] - // projection used for calculation of representative points - repptFunc:clust.i.newreppts[data;df;]; - if[99h=type reppts; - reppts[`idx]+:1; - prevpoint:reppts`reppts; - reppts[`reppts]:repptFunc reppts`reppts; - reppts[`notconv]:cfg[`thresh]n:count true; - '`$"pred and true must have equal lengths"]; - if[not e:clust.i.entropy true;:1.]; - cm:value confmat[pred;true]; - nm:(*\:/:).((count each group@)each(pred;true))@\:til count cm; - mi:(sum/)0^cm*.[-;log(n*cm;nm)]%n; - mi%e + '"pred and true must have equal lengths" + ]; + if[not ent:clust.i.entropy true;:1.]; + confMat:value confMatrix[pred;true]; + nm:(*\:/:).((count each group@)each(pred;true))@\:til count confMat; + mi:(sum/)0^confMat*.[-;log(n*confMat;nm)]%n; + mi%ent } // Optimum number of clusters // @kind function // @category clust -// @fileoverview Elbow method -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param k {long} Max number of clusters -// @return {float[]} Score for each k value - plot to find elbow +// @desc Elbow method +// @param data {float[][]} Each column of the data is an individual datapoint +// @param df {symbol} Distance function name within '.ml.clust.i.df' +// @param k {long} Max number of clusters +// @return {float[]} Score for each k value - plot to find elbow clust.elbow:{[data;df;k] - {[data;df;k] - clt:clust.kmeans.fit[data;df;k;::]`clt; - sum raze clust.i.dists[;df;;::]'[p;a:avg@''p:{x[;y]}[data]each group clt] - }[data;df]each 2+til k-1 - } - -// Utilities - -// @kind function -// @category private -// @fileoverview Entropy -// @param d {long[]} distribution -// @return {float} Entropy for d -clust.i.entropy:{[d] - neg sum(p%n)*(-). 
log(p;n:sum p:count each group d) - } - -// @kind function -// @category private -// @fileoverview Maximum intra-cluster distance -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @return {float} Max intra-cluster distance -clust.i.maxintra:{[df;data] - max raze{[df;data;x;y] - clust.i.dists[data;df;data[;y];x except til 1+y] - }[df;data;n]each n:til count first data - } - -// @kind function -// @category private -// @fileoverview Minimum inter-cluster distance -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param idxs {long[]} Cluster indices -// @return {float} Min inter-cluster distance -clust.i.mininter:{[df;data;idxs] - {[df;data;i;j] - (min/)clust.i.dists[data[i];df;data[j]]each til count data[i]0 - }[df;data;first idxs]each 1_idxs - } - -// @kind function -// @category private -// @fileoverview Silhouette coefficient -// @param data {float[][]} Data in matrix format, each column is an individual datapoint -// @param df {symbol} Distance function name within '.ml.clust.df' -// @param idxs {dict} Point indices grouped by cluster -// @param k {float} Coefficient to multiply by -// @param clt {long} Cluster of current point -// @param pt {float} Current point -// @return {float} Silhouette coefficent for pt -clust.i.sil:{[data;df;idxs;k;clt;pt] - d:clust.i.dists[data;df;pt]each idxs; - (%).((-).;max)@\:(min avg each;k[clt]*sum@)@'d@/:(key[idxs]except clt;clt) + clust.i.elbow[data;df]each 2+til k-1 } diff --git a/clust/tests/clt.t b/clust/tests/clt.t index 37e9fdff..3741d14a 100644 --- a/clust/tests/clt.t +++ b/clust/tests/clt.t @@ -13,15 +13,18 @@ fclust:.p.import[`scipy.cluster.hierarchy]`:fcluster // q Utilities mat :{"f"$flip value flip x} -clusterIdxs:{value group(x . y)`clt} -clusterKeys:{key group(x . y)`clt} -clusterAdd1:{1+(x . y)`clt} -qDendrogram:{asc each x(y . z)`dgram} +clusterIdxs:{value group(x . y)[`modelInfo;`clust]} +clusterKeys:{key group(x . y)[`modelInfo;`clust]} +clusterIdxsDendro:{value group(x . y)`clust} +clusterIdxsUpd:{value group(x . y)[`modelInfo;`clust]} +clusterAdd1:{1+(x . y)`clust} +qDendrogram:{asc each x(y . z)[`modelInfo;`dgram]} algoOutputs:{asc key x . y} -countOutput:{count x . y} -pythonRes :{[fclust;mat;t;clt;param]value group fclust[mat t`dgram;clt;param]`}[fclust;mat] +algoOutputsFit:{asc key first x . y} +countOutput:{count x y} +pythonRes :{[fclust;mat;t;clust;param]value group fclust[mat t[`modelInfo;`dgram];clust;param]`}[fclust;mat] pythonDgram:{[lnk;d;lf;df]asc each lnk[flip d;lf;df]`}[lnk] -qDgramDists:{(x . y)[`dgram]`dist} +qDgramDists:{(x . 
y)[`modelInfo;`dgram]`dist} // Datasets d1:flip(60#"F";",")0:`:clust/tests/data/ss5.csv @@ -44,17 +47,19 @@ passingTest[clusterIdxs[.ml.clust.ap.fit];(d2;`nege2dist;0.01;{[x] -10.};(::));1 passingTest[clusterIdxs[.ml.clust.ap.fit];(d1tts 0;`nege2dist;0.3;min;`maxrun`maxmatch!100 10);1b;enlist til 45] passingTest[clusterKeys[.ml.clust.ap.fit];(d2;`nege2dist;0.95;{[x] -20000.};enlist[`maxsame]!enlist 150);1b;til 5] passingTest[clusterKeys[.ml.clust.ap.fit];(d2;`nege2dist;0.5;min;(::));1b;til 5] -passingTest[algoOutputs[.ml.clust.ap.fit];(d2;`nege2dist;0.5;min;(::));1b;`clt`data`exemplars`inputs] +passingTest[algoOutputsFit[.ml.clust.ap.fit];(d2;`nege2dist;0.5;min;(::));1b;`clust`data`exemplars`inputs] failingTest[.ml.clust.ap.fit;(d1;`e2dist;0.7;min;(::));0b;"AP must be used with nege2dist"] failingTest[.ml.clust.ap.fit;(d1;`nege2dist;0.7;min;100);0b;"iter must be (::) or a dictionary"] failingTest[.ml.clust.ap.fit;(d1;`nege2dist;0.7;min;([]total:10,();nochange:5,()));0b;"iter must be (::) or a dictionary"] failingTest[.ml.clust.ap.fit;(100?`8;`nege2dist;0.7;min;(::));0b;"Dataset not suitable for clustering. Must be convertible to floats."] + // Predict -passingTest[.ml.clust.ap.predict;(d1tts 1;.ml.clust.ap.fit[d1tts 0;`nege2dist;0.7;min;(::)]);0b;APclt] -passingTest[.ml.clust.ap.predict;(d1tts 1;.ml.clust.ap.fit[d1tts 0;`nege2dist;0.7;med;`maxrun`maxmatch!100 10]);0b;APclt] -failingTest[.ml.clust.ap.predict;(100?`7;enlist[`clt]!enlist -1);0b;"Dataset not suitable for clustering. Must be convertible to floats."] -failingTest[.ml.clust.ap.predict;(d1tts 1;enlist[`clt]!enlist -1);0b;"'.ml.clust.ap.fit' did not converge, all clusters returned -1. Cannot predict new data."] +passingTest[.ml.clust.ap.fit[d1tts 0;`nege2dist;0.7;min;(::)]`predict;d1tts 1;1b;APclt] +passingTest[.ml.clust.ap.fit[d1tts 0;`nege2dist;0.7;med;`maxrun`maxmatch!100 10]`predict;d1tts 1;1b;APclt] +failingTest[.ml.clust.ap.fit[d1tts 0;`nege2dist;0.7;min;(::)]`predict;100?`7;1b;"Dataset not suitable for clustering. Must be convertible to floats."] +failingTest[.ml.clust.ap.predict;(enlist[`modelInfo]!enlist enlist[`clust]!enlist -1;d1tts 1); + 0b;"'.ml.clust.ap.fit' did not converge, all clusters returned -1. Cannot predict new data."] // K-Means @@ -66,21 +71,22 @@ passingTest[clusterIdxs[.ml.clust.kmeans.fit];(d1;`e2dist;4;kMeansCfg,enlist[`th passingTest[clusterIdxs[.ml.clust.kmeans.fit];(d1;`edist;4;kMeansCfg);1b;d1clt] passingTest[clusterKeys[.ml.clust.kmeans.fit];(d1;`edist;4;kMeansCfg);1b;til 4] passingTest[clusterKeys[.ml.clust.kmeans.fit];(d1;`e2dist;7;kMeansCfg);1b;til 7] -passingTest[algoOutputs[.ml.clust.kmeans.fit];(d2;`edist;4;kMeansCfg);1b;`clt`data`inputs`reppts] +passingTest[algoOutputsFit[.ml.clust.kmeans.fit];(d2;`edist;4;kMeansCfg);1b;`clust`data`inputs`repPts] failingTest[.ml.clust.kmeans.fit;(d1;`mdist;4;kMeansCfg);0b;"kmeans must be used with edist/e2dist"] -failingTest[.ml.clust.kmeans.fit;(d1;`nege2dist;4;74);0b;"cfg must be (::) or a dictionary"] -failingTest[.ml.clust.kmeans.fit;(d1;`nege2dist;4;([]total:28,();nochange:100,()));0b;"cfg must be (::) or a dictionary"] +failingTest[.ml.clust.kmeans.fit;(d1;`nege2dist;4;74);0b;"config must be (::) or a dictionary"] +failingTest[.ml.clust.kmeans.fit;(d1;`nege2dist;4;([]total:28,();nochange:100,()));0b;"config must be (::) or a dictionary"] failingTest[.ml.clust.kmeans.fit;(1000?`a`b`c;`edist;4;kMeansCfg);0b;"Dataset not suitable for clustering. 
Must be convertible to floats."] // Predict -passingTest[countOutput[.ml.clust.kmeans.predict];(d1tts 1;.ml.clust.kmeans.fit[d1tts 0;`e2dist;4;kMeansCfg]);1b;15] -passingTest[countOutput[.ml.clust.kmeans.predict];(d1tts 1;.ml.clust.kmeans.fit[d1tts 0;`edist;4;kMeansCfg]);1b;15] -failingTest[.ml.clust.kmeans.predict;(100?`4;()!());0b;"Dataset not suitable for clustering. Must be convertible to floats."] +passingTest[countOutput[.ml.clust.kmeans.fit[d1tts 0;`e2dist;4;kMeansCfg]`predict];d1tts 1;1b;15] +passingTest[countOutput[.ml.clust.kmeans.fit[d1tts 0;`edist;4;kMeansCfg]`predict];d1tts 1;1b;15] +failingTest[.ml.clust.kmeans.fit[d1tts 0;`e2dist;4;kMeansCfg]`predict;100?`4;1b;"Dataset not suitable for clustering. Must be convertible to floats."] // Update -passingTest[clusterIdxs[.ml.clust.kmeans.update];(d1tts 1;.ml.clust.kmeans.fit[d1tts 0;`e2dist;4;kMeansCfg]);1b;d1clt] -passingTest[algoOutputs[.ml.clust.kmeans.update];(d1tts 1;.ml.clust.kmeans.fit[d1tts 0;`edist;4;kMeansCfg]);1b;`clt`data`inputs`reppts] -failingTest[.ml.clust.kmeans.update;(1000?`2;()!());0b;"Dataset not suitable for clustering. Must be convertible to floats."] +passingTest[algoOutputs[.ml.clust.kmeans.fit[d1tts 0;`edist;4;kMeansCfg]`update];enlist d1tts 1;1b;`modelInfo`predict`update] +passingTest[clusterIdxsUpd[.ml.clust.kmeans.fit[d1tts 0;`e2dist;4;kMeansCfg]`update];enlist d1tts 1;1b;d1clt] +failingTest[.ml.clust.kmeans.update;(()!();1000?`2);0b;"Dataset not suitable for clustering. Must be convertible to floats."] + // DBSCAN @@ -95,17 +101,17 @@ failingTest[.ml.clust.dbscan.fit;(50?`x`y;`edist;4;300);0b;"Dataset not suitable failingTest[.ml.clust.dbscan.fit;(d1;`euclidean;5;5);0b;"invalid distance metric"] // Predict -passingTest[.ml.clust.dbscan.predict;(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`e2dist;5;5]);0b;15#-1] -passingTest[.ml.clust.dbscan.predict;(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`edist;5;5]);0b;15#-1] -passingTest[.ml.clust.dbscan.predict;(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`mdist;5;5]);0b;15#-1] -failingTest[.ml.clust.dbscan.predict;(50?`x`y;());0b;"Dataset not suitable for clustering. Must be convertible to floats."] +passingTest[.ml.clust.dbscan.fit[d1tts 0;`e2dist;5;5]`predict;d1tts 1;1b;15#-1] +passingTest[.ml.clust.dbscan.fit[d1tts 0;`edist;5;5]`predict;d1tts 1;1b;15#-1] +passingTest[.ml.clust.dbscan.fit[d1tts 0;`mdist;5;5]`predict;d1tts 1;1b;15#-1] +failingTest[.ml.clust.dbscan.fit[d1tts 0;`e2dist;5;5]`predict;(50?`x`y);1b;"Dataset not suitable for clustering. Must be convertible to floats."] // Update -passingTest[clusterIdxs[.ml.clust.dbscan.update];(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`e2dist;5;5]);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.dbscan.update];(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`edist;5;5]);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.dbscan.update];(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`mdist;5;5]);1b;d1clt] -passingTest[algoOutputs[.ml.clust.dbscan.update];(d1tts 1;.ml.clust.dbscan.fit[d1tts 0;`mdist;5;5]);1b;`clt`data`inputs`t] -failingTest[.ml.clust.dbscan.update;(50?`x`y;());0b;"Dataset not suitable for clustering. 
Must be convertible to floats."] +passingTest[clusterIdxsUpd[.ml.clust.dbscan.fit[d1tts 0;`e2dist;5;5]`update];enlist d1tts 1;1b;d1clt] +passingTest[clusterIdxsUpd[.ml.clust.dbscan.fit[d1tts 0;`edist;5;5]`update];enlist d1tts 1;1b;d1clt] +passingTest[clusterIdxsUpd[.ml.clust.dbscan.fit[d1tts 0;`mdist;5;5]`update];enlist d1tts 1;1b;d1clt] +passingTest[algoOutputs[.ml.clust.dbscan.fit[d1tts 0;`mdist;5;5]`update];enlist d1tts 1;1b;`modelInfo`predict`update] +failingTest[.ml.clust.dbscan.update;(()!();50?`x`y);0b;"Dataset not suitable for clustering. Must be convertible to floats."] // CURE @@ -118,25 +124,29 @@ cured1pred2:0 3 0 0 3 3 0 0 0 0 0 3 0 3 3 cured1pred3:1 3 1 3 3 3 1 1 1 1 1 3 1 3 3 // Fit -passingTest[clusterIdxs[.ml.clust.cure.cutk];(.ml.clust.cure.fit[d1;`e2dist;5;0];4);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.cure.cutk];(.ml.clust.cure.fit[d1;`edist;10;0.2];4);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.cure.cutk];(.ml.clust.cure.fit[d1;`mdist;3;0.15];4);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.cure.cutk];(.ml.clust.cure.fit[d2;`e2dist;20;0];4);1b;cured2clt1] -passingTest[clusterIdxs[.ml.clust.cure.cutk];(.ml.clust.cure.fit[d2;`edist;20;0.2];4);1b;cured2clt2] -passingTest[clusterIdxs[.ml.clust.cure.cutk];(.ml.clust.cure.fit[d2;`mdist;10;0.1];4);1b;cured2clt3] -passingTest[clusterIdxs[.ml.clust.cure.cutdist];(.ml.clust.cure.fit[d1;`e2dist;5;0];2.);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.cure.cutdist];(.ml.clust.cure.fit[d1;`edist;10;0.2];2.);1b;d1clt] -passingTest[clusterIdxs[.ml.clust.cure.cutdist];(.ml.clust.cure.fit[d1;`mdist;3;0.15];2.);1b;d1clt] -passingTest[algoOutputs[.ml.clust.cure.fit];(d1;`e2dist;5;0);1b;`data`dgram`inputs] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutK];(.ml.clust.cure.fit[d1;`e2dist;5;0];4);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutK];(.ml.clust.cure.fit[d1;`edist;10;0.2];4);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutK];(.ml.clust.cure.fit[d1;`mdist;3;0.15];4);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutK];(.ml.clust.cure.fit[d2;`e2dist;20;0];4);1b;cured2clt1] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutK];(.ml.clust.cure.fit[d2;`edist;20;0.2];4);1b;cured2clt2] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutK];(.ml.clust.cure.fit[d2;`mdist;10;0.1];4);1b;cured2clt3] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutDist];(.ml.clust.cure.fit[d1;`e2dist;5;0];2.);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutDist];(.ml.clust.cure.fit[d1;`edist;10;0.2];2.);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.cutDist];(.ml.clust.cure.fit[d1;`mdist;3;0.15];2.);1b;d1clt] +passingTest[algoOutputsFit[.ml.clust.cure.fit];(d1;`e2dist;5;0);1b;`data`dgram`inputs] failingTest[.ml.clust.cure.fit;(821?`2;`e2dist;5;0);0b;"Dataset not suitable for clustering. 
Must be convertible to floats."] failingTest[.ml.clust.cure.fit;(d1;`newmetric;5;0);0b;"invalid distance metric"] +// FitPredict +passingTest[clusterIdxsDendro[.ml.clust.cure.fitPredict];(d1;`e2dist;5;0;enlist[`k]!enlist 4);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.fitPredict];(d1;`edist;10;0.2;enlist[`k]!enlist 4);1b;d1clt] +passingTest[clusterIdxsDendro[.ml.clust.cure.fitPredict];(d1;`mdist;3;0.15;enlist[`k]!enlist 4);1b;d1clt] + // Predict -passingTest[.ml.clust.cure.predict;(d1tts 1;.ml.clust.cure.cutk[.ml.clust.cure.fit[d1tts 0;`e2dist;5;0];4]);0b;cured1pred1] -passingTest[.ml.clust.cure.predict;(d1tts 1;.ml.clust.cure.cutk[.ml.clust.cure.fit[d1tts 0;`edist;10;0.2];4]);0b;cured1pred2] -passingTest[.ml.clust.cure.predict;(d1tts 1;.ml.clust.cure.cutk[.ml.clust.cure.fit[d1tts 0;`mdist;3;0.15];4]);0b;cured1pred3] -failingTest[.ml.clust.cure.predict;(182?`5;());0b;"Dataset not suitable for clustering. Must be convertible to floats."] -failingTest[.ml.clust.cure.predict;(2 10#20?5.;()!());0b;"Clusters must be contained within cfg - please run .ml.clust.cure.(cutk/cutdist)"] +passingTest[.ml.clust.cure.fit[d1tts 0;`e2dist;5;0]`predict;(d1tts 1;enlist[`k]!enlist 4);0b;cured1pred1] +passingTest[.ml.clust.cure.fit[d1tts 0;`edist;10;0.2]`predict;(d1tts 1;enlist[`k]!enlist 4);0b;cured1pred2] +passingTest[.ml.clust.cure.fit[d1tts 0;`mdist;3;0.15]`predict;(d1tts 1;enlist[`k]!enlist 4);0b;cured1pred3] +failingTest[.ml.clust.cure.fit[d1tts 0;`e2dist;5;0]`predict;(182?`5;enlist[`k]!enlist 3);0b;"Dataset not suitable for clustering. Must be convertible to floats."] // Hierarchical @@ -150,37 +160,42 @@ tab1:.ml.clust.hc.fit[d1;`mdist ;`single] tab2:.ml.clust.hc.fit[d1;`e2dist;`average] tab3:.ml.clust.hc.fit[d2;`e2dist;`centroid] tab4:.ml.clust.hc.fit[d2;`edist ;`complete] -hct1fit:"j"$fclust[mat tab1`dgram;4;`maxclust]` +hct1fit:"j"$fclust[mat tab1[`modelInfo;`dgram];4;`maxclust]` hcd1pred1:1 2 1 1 2 2 1 1 1 1 1 2 1 2 2 hcd1pred2:1 3 1 1 3 3 1 1 1 1 1 3 1 3 3 hcd1pred3:1 3 1 1 3 3 1 1 1 1 1 3 1 3 3 pyDgramDists:(lnk[flip d2;`single;`sqeuclidean]`)[;2] // Fit -passingTest[clusterAdd1[.ml.clust.hc.cutk ];(tab1;4);1b;hct1fit] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(.ml.clust.hc.fit[d2;`e2dist;`single];4);1b;hcResSingle] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(.ml.clust.hc.fit[d2;`e2dist;`ward];4);1b;hcResWard] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(.ml.clust.hc.fit[d2;`edist;`centroid];4);1b;hcResCentroid] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(.ml.clust.hc.fit[d2;`edist;`complete];4);1b;hcResComplete] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(.ml.clust.hc.fit[d2;`mdist;`average];4);1b;hcResAverage] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(tab2;4);1b;pythonRes[tab2;4;`maxclust]] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(tab3;4);1b;pythonRes[tab3;4;`maxclust]] -passingTest[clusterIdxs[.ml.clust.hc.cutk ];(tab4;4);1b;pythonRes[tab4;4;`maxclust]] -passingTest[clusterIdxs[.ml.clust.hc.cutdist];(tab1;.45);1b;pythonRes[tab1;.45;`distance]] -passingTest[clusterIdxs[.ml.clust.hc.cutdist];(tab2;4);1b;pythonRes[tab2;34;`distance]] -passingTest[clusterIdxs[.ml.clust.hc.cutdist];(tab3;500);1b;pythonRes[tab3;500;`distance]] -passingTest[clusterIdxs[.ml.clust.hc.cutdist];(tab4;30);1b;pythonRes[tab4;30;`distance]] +passingTest[clusterAdd1[.ml.clust.hc.cutK ];(tab1;4);1b;hct1fit] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(.ml.clust.hc.fit[d2;`e2dist;`single];4);1b;hcResSingle] 
+passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(.ml.clust.hc.fit[d2;`e2dist;`ward];4);1b;hcResWard] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(.ml.clust.hc.fit[d2;`edist;`centroid];4);1b;hcResCentroid] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(.ml.clust.hc.fit[d2;`edist;`complete];4);1b;hcResComplete] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(.ml.clust.hc.fit[d2;`mdist;`average];4);1b;hcResAverage] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(tab2;4);1b;pythonRes[tab2;4;`maxclust]] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(tab3;4);1b;pythonRes[tab3;4;`maxclust]] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutK];(tab4;4);1b;pythonRes[tab4;4;`maxclust]] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutDist];(tab1;.45);1b;pythonRes[tab1;.45;`distance]] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutDist];(tab2;4);1b;pythonRes[tab2;34;`distance]] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutDist];(tab3;500);1b;pythonRes[tab3;500;`distance]] +passingTest[clusterIdxsDendro[.ml.clust.hc.cutDist];(tab4;30);1b;pythonRes[tab4;30;`distance]] passingTest[qDendrogram[mat;.ml.clust.hc.fit];(d1;`e2dist;`single);1b;pythonDgram[d1;`single;`sqeuclidean]] passingTest[qDendrogram[mat;.ml.clust.hc.fit];(d1;`mdist;`complete);1b;pythonDgram[d1;`complete;`cityblock]] passingTest[qDendrogram[mat;.ml.clust.hc.fit];(d1;`edist;`centroid);1b;pythonDgram[d1;`centroid;`euclidean]] passingTest[qDendrogram[mat;.ml.clust.hc.fit];(d1;`mdist;`average);1b;pythonDgram[d1;`average;`cityblock]] -passingTest[qDgramDists[.ml.clust.hc.fit ];(d2;`e2dist;`single);1b;pyDgramDists] +passingTest[qDgramDists[.ml.clust.hc.fit];(d2;`e2dist;`single);1b;pyDgramDists] failingTest[.ml.clust.hc.fit;(821?`2;`e2dist;`ward);0b;"Dataset not suitable for clustering. 
Must be convertible to floats."]
 failingTest[.ml.clust.hc.fit;(d1;`mdist;`ward);0b;"ward must be used with e2dist"]
 failingTest[.ml.clust.hc.fit;(d1;`mdist;`linkage);0b;"invalid linkage"]
+// FitPredict
+passingTest[clusterIdxsDendro[.ml.clust.hc.fitPredict];(d2;`e2dist;`single;enlist[`k]!enlist 4);1b;hcResSingle]
+passingTest[clusterIdxsDendro[.ml.clust.hc.fitPredict];(d2;`e2dist;`ward;enlist[`k]!enlist 4);1b;hcResWard]
+passingTest[clusterIdxsDendro[.ml.clust.hc.fitPredict];(d2;`edist;`centroid;enlist[`k]!enlist 4);1b;hcResCentroid]
+
 // Predict
-passingTest[.ml.clust.hc.predict;(d1tts 1;.ml.clust.hc.cutk[.ml.clust.hc.fit[d1tts 0;`e2dist;`single];4]);0b;hcd1pred1]
-passingTest[.ml.clust.hc.predict;(d1tts 1;.ml.clust.hc.cutk[.ml.clust.hc.fit[d1tts 0;`e2dist;`ward];4]);0b;hcd1pred2]
-passingTest[.ml.clust.hc.predict;(d1tts 1;.ml.clust.hc.cutk[.ml.clust.hc.fit[d1tts 0;`edist;`centroid];4]);0b;hcd1pred3]
+passingTest[.ml.clust.hc.fit[d1tts 0;`e2dist;`single]`predict;(d1tts 1;enlist[`k]!enlist 4);0b;hcd1pred1]
+passingTest[.ml.clust.hc.fit[d1tts 0;`e2dist;`ward]`predict;(d1tts 1;enlist[`k]!enlist 4);0b;hcd1pred2]
+passingTest[.ml.clust.hc.fit[d1tts 0;`edist;`centroid]`predict;(d1tts 1;enlist[`k]!enlist 4);0b;hcd1pred3]
+
diff --git a/clust/tests/score.t b/clust/tests/score.t
index 500d5616..7de0bf10 100644
--- a/clust/tests/score.t
+++ b/clust/tests/score.t
@@ -20,26 +20,26 @@ d1:flip(60#"F";",")0:`:clust/tests/data/ss5.csv
 d2:@[;`AnnualIncome`SpendingScore]("SSIII";(),",")0:`:clust/tests/data/Mall_Customers.csv
 // Expected Results
-clt1:.ml.clust.hc.cutk[.ml.clust.hc.fit[d1;`edist;`single];4]
-clt2:.ml.clust.hc.cutk[.ml.clust.hc.fit[d2;`e2dist;`ward];4]
-clt3:.ml.clust.hc.cutk[.ml.clust.cure.fit[d2;`edist;20;0.2];4]
+clt1:.ml.clust.hc.cutK[.ml.clust.hc.fit[d1;`edist;`single];4]
+clt2:.ml.clust.hc.cutK[.ml.clust.hc.fit[d2;`e2dist;`ward];4]
+clt3:.ml.clust.hc.cutK[.ml.clust.cure.fit[d2;`edist;20;0.2];4]
 rnd1:count[flip d1]?4
 rnd2:count[flip d2]?4
 // Davies-Bouldin Score
-passingTest[.ml.clust.daviesbouldin;(d1;clt1`clt);0b;pydb[flip d1;clt1`clt]`]
-passingTest[.ml.clust.daviesbouldin;(d2;clt2`clt);0b;pydb[flip d2;clt2`clt]`]
-passingTest[.ml.clust.daviesbouldin;(d2;clt3`clt);0b;pydb[flip d2;clt3`clt]`]
+passingTest[.ml.clust.daviesBouldin;(d1;clt1`clust);0b;pydb[flip d1;clt1`clust]`]
+passingTest[.ml.clust.daviesBouldin;(d2;clt2`clust);0b;pydb[flip d2;clt2`clust]`]
+passingTest[.ml.clust.daviesBouldin;(d2;clt3`clust);0b;pydb[flip d2;clt3`clust]`]
 // Silhouette Score
-passingTest[.ml.clust.silhouette;(d1;`edist;clt1`clt;1b);0b;pysil[flip d1;clt1`clt]`]
-passingTest[.ml.clust.silhouette;(d2;`edist;clt2`clt;1b);0b;pysil[flip d2;clt2`clt]`]
-passingTest[.ml.clust.silhouette;(d2;`edist;clt3`clt;1b);0b;pysil[flip d2;clt3`clt]`]
+passingTest[.ml.clust.silhouette;(d1;`edist;clt1`clust;1b);0b;pysil[flip d1;clt1`clust]`]
+passingTest[.ml.clust.silhouette;(d2;`edist;clt2`clust;1b);0b;pysil[flip d2;clt2`clust]`]
+passingTest[.ml.clust.silhouette;(d2;`edist;clt3`clust;1b);0b;pysil[flip d2;clt3`clust]`]
 // Dunn Score
-passingTest[applyScoring[.ml.clust.dunn;1 ];(d1;`e2dist;clt1`clt);1b;20]
-passingTest[applyScoring[.ml.clust.dunn;100];(d2;`edist;clt2`clt);1b;13]
-passingTest[applyScoring[.ml.clust.dunn;100];(d2;`mdist;clt3`clt);1b;10]
+passingTest[applyScoring[.ml.clust.dunn;1 ];(d1;`e2dist;clt1`clust);1b;20]
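// NOTE (editorial, not part of the patch): a sketch of the renamed scoring
// entry points exercised by these tests, assuming `data` is a hypothetical
// 2 x n float matrix loaded alongside the toolkit:
//   clusts:.ml.clust.hc.cutK[.ml.clust.hc.fit[data;`edist;`single];4]`clust
//   .ml.clust.daviesBouldin[data;clusts]         / lower indicates better split
//   .ml.clust.silhouette[data;`edist;clusts;1b]  / averaged coefficient
//   .ml.clust.dunn[data;`e2dist;clusts]          / higher indicates better split
//   .ml.clust.elbow[data;`e2dist;5]              / scores for k=2..5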
+passingTest[applyScoring[.ml.clust.dunn;100];(d2;`edist;clt2`clust);1b;13] +passingTest[applyScoring[.ml.clust.dunn;100];(d2;`mdist;clt3`clust);1b;10] // Elbow Scoring passingTest[applyScoring[.ml.clust.elbow;1];(d1;`e2dist;2);1b;enlist 548] @@ -48,7 +48,7 @@ passingTest[applyScoring[.ml.clust.elbow;1];(d2;`e2dist;2);1b;enlist 186363] failingTest[.ml.clust.elbow;(d2;`mdist;3);0b;"kmeans must be used with edist/e2dist"] // Homogeneity Score -passingTest[.ml.clust.homogeneity;(clt1`clt;rnd1);0b;hscore[rnd1;clt1`clt]`] -passingTest[.ml.clust.homogeneity;(clt2`clt;rnd2);0b;hscore[rnd2;clt2`clt]`] -passingTest[.ml.clust.homogeneity;(clt3`clt;rnd2);0b;hscore[rnd2;clt3`clt]`] +passingTest[.ml.clust.homogeneity;(clt1`clust;rnd1);0b;hscore[rnd1;clt1`clust]`] +passingTest[.ml.clust.homogeneity;(clt2`clust;rnd2);0b;hscore[rnd2;clt2`clust]`] +passingTest[.ml.clust.homogeneity;(clt3`clust;rnd2);0b;hscore[rnd2;clt3`clust]`] failingTest[.ml.clust.homogeneity;(100?0b;10?0b);0b;"pred and true must have equal lengths"] diff --git a/clust/tests/util.t b/clust/tests/util.t index 0b078fe7..96515d02 100644 --- a/clust/tests/util.t +++ b/clust/tests/util.t @@ -16,24 +16,24 @@ idxs1:til count d1 0 idxs2:til count d2 0 // K-D trees -tree:.ml.clust.kd.newtree[d1;1] -tree2:.ml.clust.kd.newtree[d2;2] +tree:.ml.clust.kd.newTree[d1;1] +tree2:.ml.clust.kd.newTree[d2;2] // Configurations -iter:`run`total`nochange!0 200 15 -info:.ml.clust.i.apinit[d1;`e2dist;max;idxs1] -info,:`emat`conv`iter!((count d1 0;iter`nochange)#0b;0b;iter) +iter:`run`total`noChange!0 200 15 +info:.ml.clust.i.apInit[d1;`e2dist;max;idxs1] +info,:`exemMat`conv`iter!((count d1 0;iter`noChange)#0b;0b;iter) // q Utilities specificRes :{(x . z)y} closestPoint:specificRes[.ml.clust.i.closest;`point] -newTreeRes :specificRes[.ml.clust.kd.newtree] +newTreeRes :specificRes[.ml.clust.kd.newTree] nnRes :specificRes[.ml.clust.kd.nn] // K-D Tree using C // Expected Results -kdKey:`leaf`left`self`parent`children`axis`midval`idxs +kdKey:`leaf`left`self`parent`children`axis`midVal`idxs kdRes1:kdKey!(1b;0b;3;1;0#0;0N;0n;enlist 1) kdRes2:kdKey!(1b;1b;2;1;0#0;0N;0n;enlist 0) kdRes3:kdKey!(1b;0b;3;1;0#0;0N;0n;1 3 4) @@ -45,7 +45,7 @@ passingTest[.ml.clust.i.closest;(d1;`e2dist;1 2;til 5);0b;`point`distance!(1;0)] passingTest[closestPoint ;(d2;`e2dist;3 6;reverse til 5);1b;2] passingTest[newTreeRes`left ;(d1;2);1b;010b] passingTest[newTreeRes`leaf ;(d1;2);1b;011b] -passingTest[newTreeRes`midval;(d1;2);1b;2 0n 0n] +passingTest[newTreeRes`midVal;(d1;2);1b;2 0n 0n] passingTest[newTreeRes`parent;(d1;2);1b;0N 0 0] passingTest[newTreeRes`idxs ;(d1;2);1b;(0#0;0 1;2 3 4)] passingTest[newTreeRes`axis ;(d1;2);1b;0 0N 0N] @@ -59,10 +59,10 @@ passingTest[nnRes`closestPoint;(tree2;d2;`edist;1 2 3;d1[;1]);1b;0] passingTest[nnRes`closestPoint;(tree2;d2;`edist;1 5 2;d1[;3]);1b;3] passingTest[nnRes`closestPoint`closestDist;(tree;d1;`mdist;1;7 9f);1b;(4;8f)] passingTest[nnRes`closestPoint`closestDist;(tree2;d2;`edist;0;d2[;2]);1b;(2;0f)] -passingTest[.ml.clust.kd.findleaf;(tree;d1[;1];tree 0);0b;kdRes1] -passingTest[.ml.clust.kd.findleaf;(tree;d2[;4];tree 2);0b;kdRes2] -passingTest[.ml.clust.kd.findleaf;(tree2;d2[;1];tree2 1);0b;kdRes3] -passingTest[.ml.clust.kd.findleaf;(tree2;d1[;0];tree2 2);0b;kdRes4] +passingTest[.ml.clust.kd.findLeaf;(tree;d1[;1];tree 0);0b;kdRes1] +passingTest[.ml.clust.kd.findLeaf;(tree;d2[;4];tree 2);0b;kdRes2] +passingTest[.ml.clust.kd.findLeaf;(tree2;d2[;1];tree2 1);0b;kdRes3] +passingTest[.ml.clust.kd.findLeaf;(tree2;d1[;0];tree2 2);0b;kdRes4] // K-D Tree 
using q @@ -80,17 +80,17 @@ passingTest[nnRes`closestPoint;(tree2;d2;`edist;1 2 3;d1[;1]);1b;0] passingTest[nnRes`closestPoint;(tree2;d2;`edist;1 5 2;d1[;3]);1b;3] passingTest[nnRes`closestPoint`closestDist;(tree;d1;`mdist;1;7 9f);1b;(4;8f)] passingTest[nnRes`closestPoint`closestDist;(tree2;d2;`edist;0;d2[;2]);1b;(2;0f)] -passingTest[.ml.clust.kd.findleaf;(tree;d1[;1];tree 0);0b;kdRes5] -passingTest[.ml.clust.kd.findleaf;(tree;d2[;4];tree 2);0b;kdRes6] -passingTest[.ml.clust.kd.findleaf;(tree2;d2[;1];tree2 1);0b;kdRes7] -passingTest[.ml.clust.kd.findleaf;(tree2;d1[;0];tree2 2);0b;kdRes8] +passingTest[.ml.clust.kd.findLeaf;(tree;d1[;1];tree 0);0b;kdRes5] +passingTest[.ml.clust.kd.findLeaf;(tree;d2[;4];tree 2);0b;kdRes6] +passingTest[.ml.clust.kd.findLeaf;(tree2;d2[;1];tree2 1);0b;kdRes7] +passingTest[.ml.clust.kd.findLeaf;(tree2;d1[;0];tree2 2);0b;kdRes8] // K-Means -passingTest[.ml.clust.i.getclust;(d2;`e2dist;flip d2[;1 2]);0b;1 0 1 0 0 0 0 0 0 0] -passingTest[.ml.clust.i.getclust;(d2;`e2dist;flip d2[;1 2 3]);0b;1 0 1 2 2 2 2 2 2 2] -passingTest[.ml.clust.i.getclust;(d1;`e2dist;flip d1[;2 3]);0b;0 1 0 1 0] -passingTest[.ml.clust.i.getclust;(d1;`edist;flip d1[;3 4]);0b;0 0 1 0 1] +passingTest[.ml.clust.i.getClust;(d2;`e2dist;flip d2[;1 2]);0b;1 0 1 0 0 0 0 0 0 0] +passingTest[.ml.clust.i.getClust;(d2;`e2dist;flip d2[;1 2 3]);0b;1 0 1 2 2 2 2 2 2 2] +passingTest[.ml.clust.i.getClust;(d1;`e2dist;flip d1[;2 3]);0b;0 1 0 1 0] +passingTest[.ml.clust.i.getClust;(d1;`edist;flip d1[;3 4]);0b;0 0 1 0 1] // DBSCAN @@ -107,9 +107,9 @@ a01:"f"$(3.24 0 0 0 0;0 0 0 0 0;0 0 3.24 0 0;0 0 0 0 0;0 0 0 0 0) AP1:(0 -12 -7.2 -3.2 0;-12 3.2 -5.6 -9.6 -3.2;-7.2 -5.6 0 0 -9.6;-3.2 -9.6 3.2 0 -5.6;3.2 -3.2 -9.6 -5.6 0) AP2:(0 -13.5 -8.1 -3.6 0;-13.5 3.6 -6.3 -10.8 -3.6;-8.1 -6.3 0 0 -10.8;-3.6 -10.8 3.6 0 -6.3;3.6 -3.6 -10.8 -6.3 0) -passingTest[specificRes[.ml.clust.i.apinit;`s`a`r`matches];(d1;`e2dist;min;idxs1);1b;(d1S;5 5#0f;5 5#0f;0)] -passingTest[specificRes[.ml.clust.i.apalgo;`exemplars`s`a];(.1;info);1b;(0 1 2 2 0;s01;a01)] -passingTest[.ml.clust.i.updr;(.2;info);0b;AP1] -passingTest[.ml.clust.i.updr;(.1;info);0b;AP2] -passingTest[.ml.clust.i.upda;(.5;info);0b;5 5#0f] -passingTest[.ml.clust.i.upda;(.9;info);0b;5 5#0f] +passingTest[specificRes[.ml.clust.i.apInit;`similar`avail`r`matches];(d1;`e2dist;min;idxs1);1b;(d1S;5 5#0f;5 5#0f;0)] +passingTest[specificRes[.ml.clust.i.apAlgo;`exemplars`similar`avail];(.1;info);1b;(0 1 2 2 0;s01;a01)] +passingTest[.ml.clust.i.updR;(.2;info);0b;AP1] +passingTest[.ml.clust.i.updR;(.1;info);0b;AP2] +passingTest[.ml.clust.i.updAvail;(.5;info);0b;5 5#0f] +passingTest[.ml.clust.i.updAvail;(.9;info);0b;5 5#0f] diff --git a/clust/util.q b/clust/util.q deleted file mode 100644 index cc2cc552..00000000 --- a/clust/util.q +++ /dev/null @@ -1,106 +0,0 @@ -\d .ml - -// Clustering Utilities - -// Distance metric dictionary - -// @kind function -// @category private -// @fileoverview Euclidean distance calculation -// @param data {float[][]} Points -// @return {float[]} Euclidean distances for data -clust.i.df.edist:{[data] - sqrt data wsum data - } - -// @kind function -// @category private -// @fileoverview distance calculation -// @param data {float[][]} Points -// @return {float[]} Euclidean squared distances for data -clust.i.df.e2dist:{[data] - data wsum data - } - -// @kind function -// @category private -// @fileoverview Manhattan distance calculation -// @param data {float[][]} Points -// @return {float[]} Manhattan distances for data -clust.i.df.mdist:{[data] - sum abs data - 
} - -// @kind function -// @category private -// @fileoverview Chebyshev distance calculation -// @param data {float[][]} Points -// @return {float[]} Chebyshev distances for data -clust.i.df.cshev:{[data] - min abs data - } - -// @kind function -// @category private -// @fileoverview Negative euclidean squared distance calculation -// @param data {float[][]} Points -// @return {float[]} Negative euclidean squared distances for data -clust.i.df.nege2dist:{[data] - neg data wsum data - } - -// @kind dictionary -// @category private -// @fileoverview Linkage dictionary -clust.i.lf.single:min -clust.i.lf.complete:max -clust.i.lf.average:avg -clust.i.lf.centroid:raze -clust.i.lf.ward:{z*x*y%x+y} - -// Distance calculations - -// @kind function -// @category private -// @param data {float[][]} Points in `value flip` format -// @param df {fn} Distance function -// @param pt {float[]} Current point -// @param idxs {long[]} Indices from data -// @return {float[]} Distances for data and pt -clust.i.dists:{[data;df;pt;idxs] - clust.i.df[df]pt-data[;idxs] - } - -// @kind function -// @category private -// @param data {float[][]} Points in `value flip` format -// @param df {fn} Distance function -// @param pt {float[]} Current point -// @param idxs {long[]} Indices from data -// @return {float[]} Distances for data and pt -clust.i.closest:{[data;df;pt;idxs] - `point`distance!(idxs dists?md;md:min dists:clust.i.dists[data;df;pt;idxs]) - } - -// @kind function -// @category private -// @fileoverview Reindex exemplars -// @param data {#any[]} Data points -// @return {long[]} List of indices -clust.i.reindex:{[data] - distinct[data]?data - } - -clust.i.floatConversion:{[data] - @[{"f"$x};data;{'"Dataset not suitable for clustering. Must be convertible to floats."}] - } - -// @kind dictionary -// @category private -// @fileoverview Error dictionary -clust.i.err.df:{'`$"invalid distance metric"} -clust.i.err.lf:{'`$"invalid linkage"} -clust.i.err.ward:{'`$"ward must be used with e2dist"} -clust.i.err.centroid:{'`$"centroid must be used with edist/e2dist"} -clust.i.err.kmeans:{'`$"kmeans must be used with edist/e2dist"} -clust.i.err.ap:{'`$"AP must be used with nege2dist"} diff --git a/clust/utils.q b/clust/utils.q new file mode 100644 index 00000000..aebc6e20 --- /dev/null +++ b/clust/utils.q @@ -0,0 +1,1327 @@ +// clust/utils.q - Clustering Utilities +// Copyright (c) 2021 Kx Systems Inc +// +// Collection of utility functions for +// implementation of clustering algos + +\d .ml + + +// Distance metric dictionary + +// @private +// @kind function +// @category clustUtility +// @desc Euclidean distance calculation +// @param data {float[][]} Points +// @return {float[]} Euclidean distances for data +clust.i.df.edist:{[data] + sqrt data wsum data + } + +// @private +// @kind function +// @category clustUtility +// @desc Distance calculation +// @param data {float[][]} Points +// @return {float[]} Euclidean squared distances for data +clust.i.df.e2dist:{[data] + data wsum data + } + +// @private +// @kind function +// @category clustUtility +// @desc Manhattan distance calculation +// @param data {float[][]} Points +// @return {float[]} Manhattan distances for data +clust.i.df.mdist:{[data] + sum abs data + } + +// @private +// @kind function +// @category clustUtility +// @desc Chebyshev distance calculation +// @param data {float[][]} Points +// @return {float[]} Chebyshev distances for data +clust.i.df.cshev:{[data] + min abs data + } + +// @private +// @kind function +// @category clustUtility +// @desc 
Negative euclidean squared distance calculation
+// @param data {float[][]} Points
+// @return {float[]} Negative euclidean squared distances for data
+clust.i.df.nege2dist:{[data]
+  neg data wsum data
+  }
+
+// @private
+// @kind dictionary
+// @category clustUtility
+// @desc Linkage dictionary
+// @type dictionary
+clust.i.lf.single:min
+clust.i.lf.complete:max
+clust.i.lf.average:avg
+clust.i.lf.centroid:raze
+clust.i.lf.ward:{z*x*y%x+y}
+
+// Distance calculations
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Calculate distances between the current point and the data
+// @param data {float[][]} Points in `value flip` format
+// @param df {fn} Distance function
+// @param pt {float[]} Current point
+// @param idxs {long[]} Indices from data
+// @return {float[]} Distances for data and pt
+clust.i.dists:{[data;df;pt;idxs]
+  clust.i.df[df]pt-data[;idxs]
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Get the closest point and distance from the current point
+// @param data {float[][]} Points in `value flip` format
+// @param df {fn} Distance function
+// @param pt {float[]} Current point
+// @param idxs {long[]} Indices from data
+// @return {dictionary} Index of the closest point and the distance between
+//   that point and current point
+clust.i.closest:{[data;df;pt;idxs]
+  dists:clust.i.dists[data;df;pt;idxs];
+  minIdx:idxs dists?minDist:min dists;
+  `point`distance!(minIdx;minDist)
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Reindex exemplars
+// @param data {any[]} Data points
+// @return {long[]} List of indices
+clust.i.reIndex:{[data]
+  distinct[data]?data
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Convert data to floating-point values
+// @param data {any[]} Data points
+// @return {err|float[]} Data converted to floating point values or
+//   error if not possible
+clust.i.floatConversion:{[data]
+  @[{"f"$x};data;{'"Dataset not suitable for clustering. ",
+    "Must be convertible to floats."}]
+  }
+
+// @private
+// @kind dictionary
+// @category clustUtility
+// @desc Error dictionary
+// @type dictionary
+clust.i.err.df:{'`$"invalid distance metric"}
+clust.i.err.lf:{'`$"invalid linkage"}
+clust.i.err.ward:{'`$"ward must be used with e2dist"}
+clust.i.err.centroid:{'`$"centroid must be used with edist/e2dist"}
+clust.i.err.kMeans:{'`$"kmeans must be used with edist/e2dist"}
+clust.i.err.ap:{'`$"AP must be used with nege2dist"}
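// NOTE (editorial, not part of the patch): the metric functions above reduce
// a matrix of per-dimension differences to one distance per point, which is
// how clust.i.dists applies them via clust.i.df[df]pt-data[;idxs]. A sketch
// with hypothetical random data (2 dimensions, 3 points):
//   .ml.clust.i.df[`edist]1 2f-2 3#6?10f   / euclidean distance to each point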
+
+// Hierarchical Utilities
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Check validity of inputs when cutting dendrograms at
+//   position K; the requested number of clusters must be >1
+// @param cutK {int} The user provided number of clusters to be
+//   retrieved when cutting the dendrogram
+// @return {::|err} Returns nothing on successful invocation, will error
+//   if a user provides an unsupported value
+clust.i.checkK:{[cutK]
+  if[cutK<=1;'"Number of requested clusters must be > 1."];
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Check validity of inputs for cutting dendrograms
+//   at a distance. In order to be valid this must be > 0
+// @param cutDist {float} The user provided cutting distance for
+//   the dendrogram
+// @return {::|err} Returns nothing on successful invocation, will error
+//   if a user provides an unsupported value
+clust.i.checkDist:{[cutDist]
+  if[cutDist<=0;'"Cutting distance must be > 0."];
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Prepare the config for prediction functionality
+// @param config {dictionary} Clustering information returned from `fit`
+// @param cutDict {dictionary} The key defines what cutting algo to use when
+//   splitting the data into clusters (`k/`dist) and the value defines the
+//   cutting threshold
+// @return {dictionary} `data`df`n`c`clust returned from
+//   .ml.clust.(cutK/cutDist)
+clust.i.prepPred:{[config;cutDict]
+  cutType:first key cutDict;
+  if[not cutType in`k`dist;'"Cutting distance has to be 'k' or 'dist'"];
+  $[cutType=`k;
+    clust.cure.cutK;
+    clust.cure.cutDist
+    ][config;first value cutDict]
+  }
+
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Complete, Average, Ward (CAW) Linkage
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param lf {symbol} Linkage function name within '.ml.clust.i.lf'
+// @param k {long} Number of clusters
+// @param dgram {boolean} Generate dendrogram or not (1b/0b)
+// @return {table|long[]} Dendrogram or list of clusters
+clust.i.hcCAW:{[data;df;lf;k;dgram]
+  // Check distance function for ward
+  if[(not df~`e2dist)&lf=`ward;clust.i.err.ward[]];
+  // Create initial cluster table
+  t0:clust.i.initCAW[data;df];
+  // Create linkage matrix
+  m:([]idx1:`int$();idx2:`int$();dist:`float$();n:`int$());
+  // Merge clusters based on chosen algorithm
+  r:{[k;r]ki+1;
+    clustIdx:where[clusts=clusts i]except i;
+    clusts[clustIdx]:1+max clusts;i+:1
+    ];
+  // Update dendrogram with new indices
+  ![dgram;();0b;`idx1`idx2!n cut clusts]
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Convert dendrogram table to clusters
+// @param tab {table} Dendrogram table
+// @param k {long} Define splitting value in dendrogram table
+// @return {long[]} List of clusters
+clust.i.cutDgram:{[tab;k]
+  if[k=0;
+    '"User provided input encapsulates all datapoints, please ",
+    "increase `k or reduce `cut to an appropriate value."
+    ];
+  // Get index of cluster made at cutting point k
+  idx:(2*cntTab:count tab)-k-1;
+  // Exclude any clusters made after point k
+  i:raze neg[k]#'allClusts:tab`idx1`idx2;
+  exClust:i where idx>i;
+  // Extract indices within clusters made until k, excluding any outliers
+  outliers:exClust where exClust<=cntTab;
+  cutOff:exClust except outliers;
+  clust:{last{count x 0}clust.i.extractClust[x;y]/(z;())}
+    [allClusts;cntTab+1]each cutOff;
+  // Update points to the cluster they belong to
+  @[;;:;]/[(1+cntTab)#0N;clust,enlist each outliers;til k+1]
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Extract points within merged cluster
+// @param clusts {long[]} Cluster indices
+// @param cntTab {long} Count of dendrogram table
+// @param idxs {long[]} Index in list to search and indices points found within
+//   that cluster
+// @return {long[]} Next index to search, and additional points found
+//   within cluster
+clust.i.extractClust:{[clusts;cntTab;idxs]
+  // Extract the points that were merged at this point
+  mrgClust:raze clusts[;idxs[0]-cntTab];
+  // Store any single clusts, break down clusts more than single point
+  nextIdx:mrgClust>=cntTab;
+  otherIdxs:idxs[1],mrgClust where not nextIdx;
+  (mrgClust where nextIdx;otherIdxs)
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc SCC (single, centroid, CURE) algo
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param lf {symbol} Linkage function name within '.ml.clust.i.lf'
+// @param params {dictionary} Parameters - k (no. clusts),
+//   n (no. repPts per clust), repPts, kdTree
+// @param clustTab {table} Cluster table
+// @param repPts {float[][]} Representative points and associated info
+// @param kdTree {table} k-dimensional tree storing points and distances
+// @param linkMatrix {float[][]} Linkage matrix
+// @return {(dictionary|long[]|float[][]|table)} Parameters dict, clusters,
+//   representative points and kdTree tables
+clust.i.algoSCC:{[data;df;lf;params;clustTab;repPts;kdTree;linkMatrix]
+  // Merge closest clusters
+  clust0:exec clust{x?min x}closestDist from clustTab where valid;
+  newMerge:clustTab clust0,clust1:clustTab[clust0]`closestClust;
+  newMerge:update valid:10b,repPts:(raze repPts;0#0),points:(raze points;0#0)
+    from newMerge;
+  // Make dendrogram if required
+  if[linkMatrix 1;
+    matrix:linkMatrix 0;
+    merge0:first newMerge;
+    matrix,:newMerge[`clustIdx],merge0[`closestDist],count merge0`points;
+    linkMatrix[0]:matrix
+    ];
+  // Keep track of old repPts
+  oldRep:repPts newMerge[0]`repPts;
+  // Find reps in new cluster
+  $[single:lf~`single;
+    // For single new reps=old reps -> no new points calculated
+    newRep:select repPt,clust:clust0 from oldRep;
+    // Generate new representative points table
+    // (centroid -> reps=avg; cure -> calc reps)
+    [newRepFunc:$[lf~`centroid;
+        clust.i.centRep;
+        clust.i.cureRep[df;params`n;params`c]
+        ];
+      newRepKeys:params`repCols;
+      newRepVals:flip newRepFunc data[;newMerge[0]`points];
+      newRep:flip newRepKeys!newRepVals;
+      newRep:update clust:clust0,repPt:count[i]#newMerge[0]`repPts from newRep;
+      // New rep leaves
+      updLeaf:clust.kd.findLeaf[kdTree;;kdTree 0]each flip newRep params`repCols;
+      newRep[`leaf]:updLeaf`self;
+      newMerge[0;`repPts]:newRep`repPt;
+      // Delete old points from leaf and update new point to new rep leaf
+      kdTree:.[kdTree;(oldRep`leaf;`idxs);except;oldRep`repPt];
+      kdTree:.[kdTree;(newRep`leaf;`idxs);union ;newRep`repPt]
+    ]
+    ];
+  // Update 
clusters and repPts
+  clustTab:@[clustTab;newMerge`clust;,;delete clust from newMerge];
+  repPts:@[repPts;newRep`repPt;,;delete repPt from newRep];
+  updRep:repPts newRep`repPt;
+  // Nearest neighbour to clust
+  if[single;updRep:select from updRep where closestClust in newMerge`clust];
+  // Calculate and append to representative point table the nearest neighbours
+  // of columns containing representative points
+  updRepData:flip updRep params`repCols;
+  updRepDataNN:clust.kd.nn
+    [kdTree;repPts params`repCols;df;newMerge[0]`points] each updRepData;
+  updRep:updRep,'updRepDataNN;
+  updRep:update closestClust:repPts[closestPoint;`clust]from updRep;
+  if[single;
+    repPt:@[repPts;updRep`repPt;,;select closestDist,closestClust from updRep];
+    updRep:repPt newRep`repPt
+    ];
+  // Update nearest neighbour of new clust
+  updRep@:raze iMin updRep`closestDist;
+  clustTab:@[clustTab;updRep`clust;,;`closestDist`closestClust#updRep];
+  $[single;
+    // Single - nearest neighbour=new clust
+    [clustTab:update closestClust:clust0 from clustTab where valid,
+      closestClust=clust1;
+    repPts:update closestClust:clust0 from repPts where closestClust=clust1
+    ];
+    // Else do nearest neighbour search
+    if[count updClusts:select from clustTab where valid,closestClust in
+      (clust0;clust1);
+      nnClust:clust.kd.nn[kdTree;repPts params`repCols;df]/:'
+        [updClusts`repPts;flip each repPts[updClusts`repPts]@\:params`repCols];
+      updClusts:updClusts,'{x iMin x`closestDist}each nnClust;
+      updClusts[`closestClust]:repPts[updClusts`closestPoint]`clust;
+      clustTab:@[clustTab;updClusts`clust;,;select closestDist,closestClust
+        from updClusts]
+      ]
+    ];
+  (params;clustTab;repPts;kdTree;linkMatrix)
+  }
+
+
+// Kmeans utilities
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc K-Means algorithm
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param k {long} Number of clusters
+// @param config {dictionary} Configuration information containing the maximum
+//   iterations `iter, initialisation type `init and threshold for smallest
+//   distance to move between the previous and new run `thresh
+// @return {dictionary} Representative points and associated clusters
+clust.i.kMeans:{[data;df;k;config]
+  // Check distance function
+  if[not df in`e2dist`edist;clust.i.err.kMeans[]];
+  // Initialize representative points
+  initRepPts:$[config`init;
+    clust.i.initKpp df;
+    clust.i.initRandom
+    ][data;k];
+  // Run algo until maximum number of iterations reached or convergence
+  repPts0:`idx`repPts`notConv!(0;initRepPts;1b);
+  repPts1:clust.i.kMeansConverge[config]
+    clust.i.updCenters[data;df;config]/repPts0;
+  // Return representative points and clusters
+  clust:clust.i.getClust[data;df;repPts1`repPts];
+  `repPts`clust!(repPts1`repPts;clust)
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Check to see if cluster centers are stable or
+//   if the maximum number of iterations allowed has been reached
+// @param config {dictionary} Configuration information containing the maximum
+//   iterations `iter, initialisation type `init and threshold for smallest
+//   distance to move between the previous and new run `thresh
+// @param algoRun {dictionary} Information about the current run of the
+//   algorithm which can have an impact on early or on time stopping i.e. have
+//   the maximum number of iterations been exceeded or have the cluster centers
+//   not moved more than the threshold i.e. 'stationary'
+// @return {boolean} 0b indicates the maximum number of iterations has been
+//   exceeded or the cluster centers have converged; 1b indicates the
+//   algorithm should continue
+clust.i.kMeansConverge:{[config;algoRun]
+  check1:config[`iter]>algoRun`idx;
+  check2:algoRun`notConv;
+  check1&check2
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Update cluster centers
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param config {dictionary} Configuration information containing the maximum
+//   iterations `iter, initialisation type `init and threshold for smallest
+//   distance to move between the previous and new run `thresh
+// @param repPts {float[][]|dictionary} Information relating to the
+//   representative points, in the case of fitting the model this is a
+//   dictionary containing the current iteration index and if the data has
+//   converged in addition to the representative points. In an individual
+//   update this is just the representative points for the k means centers.
+// @return {float[][]} Updated representative points
+clust.i.updCenters:{[data;df;config;repPts]
+  // Projection used for calculation of representative points
+  repPtFunc:clust.i.newRepPts[data;df;];
+  if[99h=type repPts;
+    repPts[`idx]+:1;
+    prevPoint:repPts`repPts;
+    repPts[`repPts]:repPtFunc repPts`repPts;
+    repPts[`notConv]:config[`thresh]@[;idx;:;0w]clust.i.df[df]data-data[;idx]
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Run DBSCAN algorithm and update cluster of each point
+// @param tab {table} Cluster info table
+// @return {table} Updated cluster table with old clusters merged
+clust.i.dbAlgo:{[tab]
+  nbIdxs:.ml.clust.i.nbhoodIdxs[tab]/[first where tab`corePoint];
+  update cluster:0|1+max tab`cluster,corePoint:0b from tab where i in nbIdxs
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Find indices in each point's neighborhood
+// @param tab {table} Cluster info table
+// @param idxs {long[]} Indices to search the neighborhood of
+// @return {long[]} Indices in neighborhood
+clust.i.nbhoodIdxs:{[tab;idxs]
+  nbh:exec nbhood from tab[distinct idxs,raze tab[idxs]`nbhood]where corePoint;
+  asc distinct idxs,raze nbh
+  }
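// NOTE (editorial, not part of the patch): the fitting loops above rely on
// q's conditional iterate, cond proj/state, which keeps applying proj while
// cond returns 1b. A self-contained sketch of the idiom:
//   {100>x}{2*x}/1   / doubles 1 until the condition fails, returning 128
// In clust.i.kMeans, clust.i.kMeansConverge[config] is the condition and
// clust.i.updCenters[data;df;config] the projection applied to repPts0.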
+
+// Aprop utilities
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Run affinity propagation algorithm
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param damp {float} Damping coefficient
+// @param diag {fn} Function applied to the similarity matrix diagonal
+// @param idxs {long[]} Indices to find distances for
+// @param iter {dictionary} Max number of overall iterations and iterations
+//   without a change in clusters. (::) can be passed in where the defaults
+//   of (`total`noChange!200 15) will be used
+// @return {dictionary} Data, input variables, clusters and exemplars
+clust.i.runAp:{[data;df;damp;diag;idxs;iter]
+  // Check negative euclidean distance has been given
+  if[df<>`nege2dist;clust.i.err.ap[]];
+  // Calculate distances, availability and responsibility
+  info0:clust.i.apInit[data;df;diag;idxs];
+  // Initialize exemplar matrix and convergence boolean
+  info0,:`exemMat`conv`iter!((count data 0;iter`noChange)#0b;0b;iter);
+  // Run ap algo until maximum number of iterations completed or convergence
+  info1:clust.i.apStop clust.i.apAlgo[damp]/info0;
+  // Return data, inputs, clusters and exemplars
+  inputs:`df`damp`diag`iter!(df;damp;diag;iter);
+  exemplars:info1`exemplars;
+  clust:$[info1`conv;clust.i.reIndex exemplars;count[data 0]#-1];
+  `data`inputs`clust`exemplars!(data;inputs;clust;exemplars)
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Initialize matrices
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param diag {fn} Function applied to the similarity matrix diagonal
+// @param idxs {long[]} Point indices
+// @return {dictionary} Similarity, availability and responsibility matrices
+//   and keys for matches and exemplars to be filled during further iterations
+clust.i.apInit:{[data;df;diag;idxs]
+  // Calculate similarity matrix values
+  dists:clust.i.dists[data;df;data]each idxs;
+  // Update diagonal
+  dists:@[;;:;diag raze dists]'[dists;k:til n:count data 0];
+  // Create lists/matrices of zeros for other variables
+  `matches`exemplars`similar`avail`r!(0;0#0;dists),(2;n;n)#0f
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Run affinity propagation algorithm
+// @param damp {float} Damping coefficient
+// @param info {dictionary} Similarity, availability, responsibility,
+//   exemplars, matches, iter dictionary and no_conv boolean
+// @return {dictionary} Updated inputs
+clust.i.apAlgo:{[damp;info]
+  // Update responsibility matrix
+  info[`r]:clust.i.updR[damp;info];
+  // Update availability matrix
+  info[`avail]:clust.i.updAvail[damp;info];
+  // Find new exemplars
+  ex:iMax each sum info`avail`r;
+  // Update `info` with new exemplars/matches
+  info:update exemplars:ex,matches:?[exemplars~ex;matches+1;0]from info;
+  // Update iter dictionary
+  .[clust.i.apConv info;(`iter;`run);+[1]]
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Check affinity propagation algorithm for convergence
+// @param info {dictionary} Similarity, availability, responsibility,
+//   exemplars, matches, iter dictionary and no_conv boolean
+// @return {dictionary} Updated info dictionary
+clust.i.apConv:{[info]
+  // Iteration dictionary
+  iter:info`iter;
+  // Exemplar matrix
+  exemMat:info`exemMat;
+  // Existing exemplars
+  exemDiag:0sum(se=iter`noChange)+0=se:sum each exemMat;
+  conv:$[(iter[`total]=iter`run)|not[unConv]&sum[exemDiag]>0;1b;0b]];
+  // Return updated info
+  info,`exemMat`conv!(exemMat;conv)
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Retrieve diagonal from a square matrix
+// @param matrix {any[][]} Square matrix
+// @return {any[]} Matrix diagonal
+clust.i.diag:{[matrix]
+  {x y}'[matrix;til count matrix]
+  }
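// NOTE (editorial, not part of the patch): an affinity propagation usage
// sketch with hypothetical random data; damping 0.3 and min as the diagonal
// (preference) function mirror the tests elsewhere in this patch:
//   data:2 30#60?10f
//   model:.ml.clust.ap.fit[data;`nege2dist;.3;min;::]
//   model[`modelInfo;`clust]   / -1 for every point if the run did not converge
//   model[`predict]2 5#10?10f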
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Update responsibility matrix
+// @param damp {float} Damping coefficient
+// @param info {dictionary} Similarity, availability, responsibility,
+//   exemplars, matches, iter dictionary and no_conv boolean
+// @return {float[][]} Updated responsibility matrix
+clust.i.updR:{[damp;info]
+  mx:clust.i.maxResp'[sum info`similar`avail;til count info`r];
+  // Calculate new responsibility
+  (damp*info`r)+(1-damp)*info[`similar]-mx
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Create matrix with every point's max responsibility;
+//   diagonal becomes -inf, current max becomes second max
+// @param data {float[]} Sum of similarity and availability matrices
+// @param i {long} Index of responsibility matrix
+// @return {float[][]} Responsibility matrix
+clust.i.maxResp:{[data;i]
+  maxData:max data;
+  maxI:data?maxData;
+  @[count[data]#maxData;maxI;:;]max@[data;i,maxI;:;-0w]
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Update availability matrix
+// @param damp {float} Damping coefficient
+// @param info {dictionary} Similarity, availability, responsibility,
+//   exemplars, matches, iter dictionary and no_conv boolean
+// @return {float[][]} Returns updated availability matrix
+clust.i.updAvail:{[damp;info]
+  // Sum values in positive availability matrix
+  resp:0|info`r;
+  k:til count info`avail;
+  sumR:sum@[;;:;0f]'[resp;k];
+  // Create a matrix using the negative values produced by the availability sum
+  //   + responsibility diagonal - positive availability values
+  avail:@[;;:;]'[0&(sumR+info[`r]@'k)-/:resp;k;sumR];
+  // Calculate new availability
+  (damp*info`avail)+avail*1-damp
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Stopping condition for affinity propagation algorithm
+// @param info {dictionary} Similarity, availability, responsibility, exemplars,
+//   matches, iter dictionary and no_conv boolean
+// @return {boolean} Indicates whether to continue or stop running AP (1/0b)
+clust.i.apStop:{[info]
+  (info[`iter;`total]>info[`iter]`run)&not 1b~info`conv
+  }
+
+// @private
+// @kind function
+// @category clustUtility
+// @desc Predict clusters using AP training exemplars
+// @param centre {float[][]} Training cluster centres in matrix format,
+//   each column is an individual datapoint
+// @param df {symbol} Distance function name within '.ml.clust.i.df'
+// @param pt {float[]} Current data point
+// @return {long[]} Predicted clusters
+clust.i.apPredDist:{[centre;df;pt]
+  dists:clust.i.dists[centre;df;pt]each til count centre 0;
+  iMax dists
+  }
+
+// KD Tree utilities
+
+// @private
+// @kind function
+// @category kdtree
+// @desc Create tree table where each row represents a node
+// @param data {float[][]} Each column of the data is an individual datapoint
+// @param leafSize {long} Points per leaf (<2*number of representatives)
+// @param node {dictionary} Info for a given node in the tree
+// @return {table} k-d tree table
+clust.kd.i.tree:{[data;leafSize;node]
+  if[leafSize<=.5*count node`idxs;
+    xData:data[;node`idxs];
+    varData:var each xData;
+    split:xData1250]`];
- `mean`variance`median`dev!(avg;var;med;dev)@\:a}
-fresh.feat.agglintrend:{
- t:fresh.feat.lintrend each(max;min;var;avg)@/:\:y cut x;
- (`$"_"sv'string cols[t]cross`max`min`var`avg)!raze value flip t}
-fresh.feat.augfuller:{`teststat`pvalue`usedlag!3#"f"$@[{fresh.i.adfuller[x]`};x;0n]}
-fresh.feat.autocorr:{$[y=0;1f;(avg(x-m)*xprev[y;x]-m:avg x)%var x]}
-fresh.feat.binnedentropy:{neg sum p*log p:(count each group(y-1)&floor y*x%max x-:min x)%count x}
-/ t-series non-linearity - Schreiber, T. 
and Schmitz, A. (1997). PHYSICAL REVIEW E, VOLUME 55, NUMBER 5 -fresh.feat.c3:{avg x*/xprev\:[-1 -2*y]x} -fresh.feat.changequant:{[x;ql;qh;isabs] - k:($[isabs;abs;]1_deltas x)where 1_&':[x within fresh.feat.quantile[x]ql,qh]; - `max`min`mean`variance`median`stdev!(max;min;avg;var;med;dev)@\:k} -/ time series complexity - http://www.cs.ucr.edu/~eamonn/Complexity-Invariant%20Distance%20Measure.pdf -fresh.feat.cidce:{sqrt k$k:"f"$1_deltas$[not y;x;0=s:dev x;:0.;(x-avg x)%s]} -fresh.feat.count:{count x} -fresh.feat.countabovemean:{sum x>avg x} -fresh.feat.countbelowmean:{sum xcount distinct x} -fresh.feat.hasdupmax:{1y*max[x]-min x} -fresh.feat.lastmax:{(last where x=max x)%count x} -fresh.feat.lastmin:{(last where x=min x)%count x} -fresh.feat.lintrend:{`rval`intercept`slope!0^(xk%sqrt vk*var x;avg[x]-b*avg k;b:(xk:x cov k)%vk:var k:til count x)} -fresh.feat.longstrikegtmean:{max 0,fresh.i.getlenseqwhere x>avg x} -fresh.feat.longstrikeltmean:{max 0,fresh.i.getlenseqwhere xy} -fresh.feat.numcwtpeaks:{count fresh.i.findpeak[x;1+til y]`} -fresh.feat.numpeaks:{sum all fresh.i.peakfind[x;y;]each 1+til y} -fresh.feat.partautocorrelation:{ - (`$"lag_",/:string 1+til y)!y#$[1>mx:y&count[x]-1;();1_fresh.i.pacf[x;`nlags pykw mx;`method pykw`ld]`],y#0n} -fresh.feat.perrecurtoalldata:{sum[1=y)&xy*dev x} -fresh.feat.ratiovalnumtserieslength:{count[distinct x]%count x} -fresh.feat.skewness:{n*sum[m*m*m:x-avg x]%(s*s*s:sdev x)*(n-1)*-2+n:count x} -fresh.feat.spktwelch:{fresh.i.welch[x;`nperseg pykw 256&count x][@;1][`]y} -fresh.feat.stddev:{dev x} -fresh.feat.sumrecurringdatapoint:{sum k*g k:where 1/:xprev\:[-1 1*z]x} +// @kind function +// @category fresh +// @desc Add hyperparameter values to .ml.fresh.params +fresh.loadparams"/fresh/hyperparameters.json"; -/ params -fresh.params:update pnum:{count 1_get[fresh.feat x]1}each f,pnames:count[i]#(),pvals:count[i]#()from([]f:1_key fresh.feat) -fresh.params:1!`pnum xasc update valid:pnum=count each pnames from fresh.params -fresh.loadparams:{ - pp:{(raze value@)each(!).("S=;")0:x}each(!).("S*";"|")0:x; - fresh.params[([]f:key pp);`pvals]:value each value pp:inter[key pp;exec f from fresh.params]#pp; - fresh.params[([]f:key pp);`pnames]:key each value pp; - fresh.params:update valid:pnum=count each pnames from fresh.params where f in key pp;} -fresh.loadparams hsym`$path,"/fresh/hyperparam.txt"; / default params +// @kind function +// @category fresh +// @desc Extract features using FRESH +// @param data {table} Input data +// @param idCol {symbol[]} ID column(s) name +// @param cols2Extract {symbol[]} Columns on which extracted features will +// be calculated (these columns must be numerical) +// @param params {table} Functions/parameters to be applied to cols2Extract. +// This should be a modified version of .ml.fresh.params +// @return {table} Table keyed by ID column and containing the features +// extracted from the subset of the data identified by the ID column. 
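+// A minimal usage sketch (illustrative only, the table and column names
+// below are hypothetical and not part of the library): given a table with
+// an ID column `id and numeric value columns, features could be extracted
+// with the default parameter table as:
+//   tab:([]id:100?`a`b`c;val0:100?1f;val1:100?10f)
+//   .ml.fresh.createFeatures[tab;`id;1_cols tab;.ml.fresh.params]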
+fresh.createFeatures:{[data;idCol;cols2Extract;params]
+  param0:exec f from params where valid,pnum=0;
+  param1:exec f,pnames,pvals from params where valid,pnum>0;
+  allParams:(cross/)each param1`pvals;
+  calcs:param0,raze param1[`f]cross'param1[`pnames],'/:'allParams;
+  cols2Extract:$[n:"j"$abs system"s";
+    $[n0;
- calcs:p0,raze p1[`f]cross'p1[`pnames],'/:'(cross/)each p1`pvals;
- calcs:(cnames:$[n:"j"$abs system"s";$[nsystem"s";mproc.init[abs system"s"]enlist".ml.loadfile`:fresh/init.q"];
+if[0>system"s";multiProc.init[abs system"s"]enlist".ml.loadfile`:fresh/init.q"];
diff --git a/fresh/feat.q b/fresh/feat.q
new file mode 100644
index 00000000..367e31f6
--- /dev/null
+++ b/fresh/feat.q
@@ -0,0 +1,643 @@
+// fresh/feat.q - Features
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Features to be used in FRESH
+
+\d .ml
+
+// @kind function
+// @category freshFeat
+// @desc Calculate the absolute energy of data (sum of squares)
+// @param data {number[]} Numerical data points
+// @return {float} Sum of squares
+fresh.feat.absEnergy:{[data]
+  data wsum data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate the absolute sum of the differences between
+// successive data points
+// @param data {number[]} Numerical data points
+// @return {float} Absolute sum of differences
+fresh.feat.absSumChange:{[data]
+  sum abs 1_deltas data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate the aggregation of an auto-correlation over all
+// possible lags (1 - count[x])
+// @param data {number[]} Numerical data points
+// @return {dictionary} Aggregation (mean, median, variance
+// and standard deviation) of an auto-correlation
+fresh.feat.aggAutoCorr:{[data]
+  n:count data;
+  autoCorrFunc:$[(abs[var data]<1e-10)|1=n;
+    0;
+    1_fresh.i.acf[data;`unbiased pykw 1b;`fft pykw n>1250]`
+    ];
+  `mean`variance`median`dev!(avg;var;med;dev)@\:autoCorrFunc
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate a linear least-squares regression for aggregated
+// values
+// @param data {number[]} Numerical data points
+// @param chunkLen {long} Size of chunk to apply
+// @return {dictionary} Slope, intercept and rvalue for the series
+// over aggregated max, min, variance or average for chunks of size chunkLen
+fresh.feat.aggLinTrend:{[data;chunkLen]
+  chunkData:chunkLen cut data;
+  stats:(max;min;var;avg)@/:\:chunkData;
+  trend:fresh.feat.linTrend each stats;
+  statCols:`$"_"sv'string cols[trend]cross`max`min`var`avg;
+  statCols!raze value flip trend
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Hypothesis test to check for a unit root in series
+// (Augmented Dickey Fuller tests)
+// @param data {number[]} Numerical data points
+// @return {dictionary} Test statistic, p-value and used lag
+fresh.feat.augFuller:{[data]
+  `teststat`pvalue`usedlag!3#"f"$@[{fresh.i.adFuller[x]`};data;0n]
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Apply auto-correlation over a user-specified lag
+// @param data {number[]} Numerical data points
+// @param lag {long} Lag to apply to data
+// @return {float} Auto-correlation over specified lag
+fresh.feat.autoCorr:{[data;lag]
+  mean:avg data;
+  $[lag=0;1f;(avg(data-mean)*xprev[lag;data]-mean)%var data]
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate entropy for data binned into n equi-distant bins
+// @param data {number[]} Numerical data points
+// @param numBins {long} Number of bins to apply to data
+// @return {float} Entropy of the series binned into numBins equidistant bins
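+// As an illustrative example (hypothetical values, not part of this file),
+// .ml.fresh.feat.binnedEntropy[100?1f;10] bins 100 uniform random values
+// into 10 bins; the entropy is highest when values spread evenly across bins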
+fresh.feat.binnedEntropy:{[data;numBins]
+  n:count data;
+  data-:min data;
+  p:(count each group(numBins-1)&floor numBins*data%max data)%n;
+  neg sum p*log p
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate non-linearity of a time series with lag applied
+// @param data {number[]} Numerical data points
+// @param lag {long} Lag to apply to data
+// @return {float} Measure of the non-linearity of the series lagged by lag
+// Time series non-linearity: Schreiber, T. and Schmitz, A. (1997). PHYSICAL
+// REVIEW E, VOLUME 55, NUMBER 5
+fresh.feat.c3:{[data;lag]
+  avg data*/xprev\:[-1 -2*lag]data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate aggregate value of successive changes within
+// corridor
+// @param data {number[]} Numerical data points
+// @param lowerQuant {float} Lower quantile
+// @param upperQuant {float} Upper quantile
+// @param isAbs {boolean} Whether absolute values should be considered
+// @return {dictionary} Aggregated value of successive changes within corridor
+// specified by lower/upperQuant
+fresh.feat.changeQuant:{[data;lowerQuant;upperQuant;isAbs]
+  quants:fresh.feat.quantile[data]lowerQuant,upperQuant;
+  k:($[isAbs;abs;]1_deltas data)where 1_&':[data within quants];
+  statCols:`max`min`mean`variance`median`stdev;
+  statCols!(max;min;avg;var;med;dev)@\:k
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate the complexity of a time series based on peaks and
+// troughs in the dataset
+// @param data {number[]} Numerical data points
+// @param isAbs {boolean} Whether absolute values should be considered
+// @return {float} Measure of series complexity
+// Time series complexity:
+// http://www.cs.ucr.edu/~eamonn/Complexity-Invariant%20Distance%20Measure.pdf
+fresh.feat.cidCe:{[data;isAbs]
+  comp:$[not isAbs;
+    data;
+    0=s:dev data;
+    :0.;
+    (data-avg data)%s
+    ];
+  sqrt k$k:"f"$1_deltas comp
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Count of values in data
+// @param data {number[]} Numerical data points
+// @return {long} Number of values within the series
+fresh.feat.count:{[data]
+  count data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Values greater than the average value
+// @param data {number[]} Numerical data points
+// @return {int} Number of values in series with a value greater than the mean
+fresh.feat.countAboveMean:{[data]
+  sum data>avg data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Values less than the average value
+// @param data {number[]} Numerical data points
+// @return {int} Number of values in series with a value less than the mean
+fresh.feat.countBelowMean:{[data]
+  sum datacount distinct data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Check for duplicate of maximum value within a series
+// @param data {number[]} Numerical data points
+// @return {boolean} Does data contain a duplicate of the maximum value
+fresh.feat.hasDupMax:{[data]
+  1ratio*max[data]-min data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Find the position of the last occurrence of the maximum value
+// in the series relative to the series length
+// @param data {number[]} Numerical data points
+// @return {float} Last max relative to number of data points
+fresh.feat.lastMax:{[data]
+  (last where data=max data)%count data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Find the position of the last occurrence of the minimum value
+// in the series relative to the series length
+// @param data {number[]} Numerical data points
+// @return {float} Last min relative to number of data points
+fresh.feat.lastMin:{[data]
+  (last where data=min data)%count data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate the slope, intercept and r-value associated with a series
+// @param data {number[]} Numerical data points
+// @return {dictionary} Slope, intercept and r-value
+fresh.feat.linTrend:{[data]
+  k:til count data;
+  slope:(xk:data cov k)%vk:var k;
+  intercept:avg[data]-slope*avg k;
+  rval:xk%sqrt vk*var data;
+  `rval`intercept`slope!0^(rval;intercept;slope)
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Longest sequence of consecutive data points within the series with
+// a value greater than the mean
+// @param data {number[]} Numerical data points
+// @return {long} Length of the longest subsequence greater than the mean
+fresh.feat.longStrikeAboveMean:{[data]
+  max 0,fresh.i.getLenSeqWhere data>avg data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Longest sequence of consecutive data points within the series with
+// a value lower than the mean
+// @param data {number[]} Numerical data points
+// @return {long} Length of the longest subsequence less than the mean
+fresh.feat.longStrikeBelowMean:{[data]
+  max 0,fresh.i.getLenSeqWhere datacrossVal
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Number of peaks in a series following data smoothing via
+// application of a Ricker wavelet of defined width
+// @param data {number[]} Numerical data points
+// @param width {long} Width of wavelet
+// @return {long} Number of peaks
+fresh.feat.numCwtPeaks:{[data;width]
+  count fresh.i.findPeak[data;1+til width]`
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Number of peaks in the series with a specified support
+// @param data {number[]} Numerical data points
+// @param support {long} Support of the peak
+// @return {int} Number of peaks
+fresh.feat.numPeaks:{[data;support]
+  sum all fresh.i.peakFind[data;support]each 1+til support
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Partial auto-correlation of a series with a specified lag
+// @param data {number[]} Numerical data points
+// @param lag {long} Lag to apply to data
+// @return {dictionary} Partial auto-correlation
+fresh.feat.partAutoCorrelation:{[data;lag]
+  corrKeys:`$"lag_",/:string 1+til lag;
+  corrVals:lag#$[1>mx:lag&count[data]-1;
+    ();
+    1_fresh.i.pacf[data;`nlags pykw mx;`method pykw`ld]`
+    ],lag#0n;
+  corrKeys!corrVals
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Ratio of the number of non-distinct values to the number of
+// possible values
+// @param data {number[]} Numerical data points
+// @return {float} Calculated ratio
+fresh.feat.perRecurToAllData:{[data]
+  g:count each group data;
+  sum[1=minVal)&datar*dev data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Ratio of the number of unique values to total number of values
+// in a series
+// @param data {number[]} Numerical data points
+// @return {float} Calculated ratio
+fresh.feat.ratioValNumToSeriesLength:{[data]
+  count[distinct data]%count data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Skew of a time series indicating asymmetry within the series
+// @param data {number[]} Numerical data points
+// @return {float} Skew of data
+fresh.feat.skewness:{[data]
+  n:count data;
+  s:sdev data;
+  m:data-avg data;
+  n*sum[m*m*m]%(s*s*s)*(n-1)*-2+n
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Calculate the cross power spectral density of a time series
+// @param data {number[]} Numerical data points
+// @param coeff {int} Frequency at which calculation is performed
+// @return {float} Cross power spectral density of data at given coeff
+fresh.feat.spktWelch:{[data;coeff]
+  fresh.i.welch[data;`nperseg pykw 256&count data][@;1][`]coeff
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Standard deviation
+// @param data {number[]} Numerical data points
+// @return {float} Standard deviation of series
+fresh.feat.stdDev:{[data]
+  dev data
+  }
+
+// @kind function
+// @category freshFeat
+// @desc Sum points that appear more than once in a series
+// @param data {number[]} Numerical data points
+// @return {number} Sum of all points present more than once
+fresh.feat.sumRecurringDataPoint:{[data]
+  g:count each group data;
+  k:where 11
+// @return {boolean} Measure of symmetry
+fresh.feat.symmetricLooking:{[data;ratio]
+  abs[avg[data]-med data]= min) & (x < max)) p)def< variance_larger_than_standard_deviation(x):return np.var(x) > np.std(x)
 p)def< number_cwt_peaks(x,n):return len(find_peaks_cwt(vector=x, widths=np.array(list(range(1, n + 1))), wavelet=ricker))
 p)def< quantile_py(x, q):x = pd.Series(x);return pd.Series.quantile(x, q)
-p)def< quantile_py(x, q):x = pd.Series(x);return pd.Series.quantile(x, q)
 p)def< value_count(x, value):
     if np.isnan(value):
         return np.isnan(x)
diff --git a/fresh/utils.q b/fresh/utils.q
new file mode 100644
index 00000000..85a1f87d
--- /dev/null
+++ b/fresh/utils.q
@@ -0,0 +1,202 @@
+// fresh/utils.q - Utility functions
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Utility functions used in the implementation of FRESH
+
+\d .ml
+
+// Python imports
+sci_ver  :1.5<="F"$3#.p.import[`scipy][`:__version__]`
+numpy    :.p.import`numpy
+pyStats  :.p.import`scipy.stats
+signal   :.p.import`scipy.signal
+stattools:.p.import`statsmodels.tsa.stattools
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Compute the one-dimensional
+// discrete Fourier Transform for real input
+fresh.i.rfft:numpy`:fft.rfft
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Return the real part of the complex argument
+fresh.i.real:numpy`:real
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Return the angle of the complex argument
+fresh.i.angle:numpy`:angle
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Return the imaginary part of the complex argument
+fresh.i.imag:numpy`:imag
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Calculate the absolute value element-wise
+fresh.i.abso:numpy`:abs
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Kolmogorov-Smirnov two-sided test statistic distribution
+fresh.i.ksDistrib:pyStats[$[sci_ver;`:kstwo.sf;`:kstwobign.sf];<]
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Calculate Kendall’s tau, a correlation measure for
+// ordinal data
+fresh.i.kendallTau:pyStats`:kendalltau
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Perform a Fisher exact test on a 2x2 contingency table
+fresh.i.fisherExact:pyStats`:fisher_exact
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Estimate power spectral density using Welch’s method
+fresh.i.welch:signal`:welch
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc Find peaks in a 1-D array with wavelet transformation
+fresh.i.findPeak:signal`:find_peaks_cwt
+
+// @private
+// @kind function
+// @category freshPythonUtility
+// @desc
Calculate the autocorrelation function +fresh.i.acf:stattools`:acf + +// @private +// @kind function +// @category freshPythonUtility +// @desc Partial autocorrelation estimate +fresh.i.pacf:stattools`:pacf + +// @private +// @kind function +// @category freshPythonUtility +// @desc Augmented Dickey-Fuller unit root test +fresh.i.adFuller:stattools`:adfuller + +// Python features +fresh.i.pyFeat:`aggAutoCorr`augFuller`fftAggReg`fftCoeff`numCwtPeaks, + `partAutoCorrelation`spktWelch + +// Extract utilities + +// @private +// @kind function +// @category freshUtility +// @desc Create a mapping between the functions and columns on which +// they are to be applied +// @param map {symbol[][]} Two element list where first element is the +// columns to which functions are to be applied and the second element is +// the name of the function in the .ml.fresh.feat namespace to be applied +// @return {symbol[]} A mapping of the functions to be applied to each column +fresh.i.colMap:{[map] + updFunc:flip (` sv'`.ml.fresh.feat,'map[;1];map[;0]); + updFunc,'last@''2_'map + } + +// @private +// @kind function +// @category freshUtility +// @desc Returns the length of each sequence +// @param condition {boolean} Executed condition, e.g. data>avg data +// @return {long[]} Sequence length based on condition +fresh.i.getLenSeqWhere:{[condition] + idx:where differ condition; + (1_deltas idx,count condition)where condition idx + } + +// @private +// @kind function +// @category freshUtility +// @desc Find peaks within the data +// @param data {number[]} Numerical data points +// @param support {long} Support of the peak +// @param idx {long} Current index +// @return {boolean[]} 1 where peak exists +fresh.i.peakFind:{[data;support;idx] + neg[support]_support _min data>/:xprev\:[-1 1*idx]data + } + +// @private +// @kind function +// @category freshUtility +// @desc Expand results produced by FRESH +// @param results {table} Table of resulting features +// @param column {symbol} Column of interest +// @return {table} Expanded results table +fresh.i.expandResults:{[results;column] + t:(`$"_"sv'string column,'cols t)xcol t:results column; + ![results;();0b;enlist column],'t + } + +// Select utilities + +// @private +// @kind function +// @category freshUtility +// @desc Apply python function for Kendall’s tau +// @param target {number[]} Target vector +// @param feature {number[]} Feature table column +// @return {float} Kendall’s tau - Close to 1 shows strong agreement, close to +// -1 shows strong disagreement +fresh.i.kTau:{[target;feature] + fresh.i.kendallTau[<;target;feature]1 + } + +// @private +// @kind function +// @category freshUtility +// @desc Perform a Fisher exact test +// @param target {number[]} Target vector +// @param feature {number[]} Feature table column +// @return {float} Results of Fisher exact test +fresh.i.fisher:{[target;feature] + g:group@'target value group feature; + fresh.i.fisherExact[<;count@''@\:[g]distinct target]1 + } + +// @private +// @kind function +// @category freshUtility +// @desc Calculate the Kolmogorov-Smirnov two-sided test statistic +// distribution +// @param feature {number[]} Feature table column +// @param target {number[]} Target vector +// @return {float} Kolmogorov-Smirnov two-sided test statistic distribution +fresh.i.ks:{[feature;target] + d:asc each target group feature; + n:count each d; + k:max abs(-). 
value(1+d bin\:raze d)%n;
+  en:prd[n]%sum n;
+  fresh.i.ksDistrib .$[sci_ver;(k;ceiling en);enlist k*sqrt en]
+  }
+
+// @private
+// @kind function
+// @category freshUtility
+// @desc Pass data correctly to .ml.fresh.i.ks allowing for projection
+// in main function
+// @param target {number[]} Target vector
+// @param feature {number[]} Feature table column
+// @return {float} Kolmogorov-Smirnov two-sided test statistic distribution
+fresh.i.ksYX:{[target;feature]
+  fresh.i.ks[feature;target]
+  }
diff --git a/graph/README.md b/graph/README.md
index 16478d12..1a1362e6 100644
--- a/graph/README.md
+++ b/graph/README.md
@@ -31,6 +31,6 @@ Documentation is available on the [Graph](https://code.kx.com/q/ml/toolkit/graph
 ## Status
 
-The graph-pipeline library is still in development and is available here as a beta release. Further functionality and improvements will be made to the library in the coming months.
+The graph-pipeline library is still in development. Further functionality and improvements will be made to the library on an ongoing basis.
 
 If you have any issues, questions or suggestions, please write to ai@kx.com.
diff --git a/graph/graph.q b/graph/graph.q
index c212fb64..426df44b 100644
--- a/graph/graph.q
+++ b/graph/graph.q
@@ -1,10 +1,32 @@
+// graph/graph.q - Graph tools
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Create, update, and delete functionality for a graph.
+
 \d .ml
 
+// @kind function
+// @category graph
+// @desc Generate an empty graph
+// @return {dictionary} Structure required for the generation of a connected
+// graph. This includes a key for information on the nodes present within the
+// graph and edges outlining how the nodes within the graph are connected.
 createGraph:{[]
-  nodes:1!enlist`nodeId``function`inputs`outputs!(`;::;::;::;::);
-  edges:2!enlist`dstNode`dstName`srcNode`srcName`valid!(`;`;`;`;0b);
-  `nodes`edges!(nodes;edges)}
+  nodeKeys:`nodeId``function`inputs`outputs;
+  nodes:1!enlist nodeKeys!(`;::;::;::;::);
+  edgeKeys:`destNode`destName`sourceNode`sourceName`valid;
+  edges:2!enlist edgeKeys!(`;`;`;`;0b);
+  `nodes`edges!(nodes;edges)
+  }
 
+// @kind function
+// @category graph
+// @desc Add a functional node to a graph
+// @param graph {dictionary} Graph originally generated using .ml.createGraph
+// @param nodeId {symbol} Denotes the name associated with the functional node
+// @param node {fn} A functional node
+// @return {dictionary} The graph with the new node added to the graph
+// structure
 addNode:{[graph;nodeId;node]
   node,:(1#`)!1#(::);
   if[nodeId in exec nodeId from graph`nodes;'"invalid nodeId"];
@@ -15,72 +37,155 @@ addNode:{[graph;nodeId;node]
   if[-10h=type node`outputs;
     node[`outputs]:(1#`output)!enlist node`outputs;
     node[`function]:((1#`output)!enlist@)node[`function]::;
-  ];
+    ];
   if[99h<>type node`outputs;'"invalid outputs"];
   graph:@[graph;`nodes;,;update nodeId from node];
-  edges:flip`dstNode`dstName`srcNode`srcName`valid!(nodeId;key node`inputs;`;`;0b);
+  edgeKeys:`destNode`destName`sourceNode`sourceName`valid;
+  edges:flip edgeKeys!(nodeId;key node`inputs;`;`;0b);
  graph:@[graph;`edges;,;edges];
-  graph}
+  graph
+  }
 
+// @kind function
+// @category graph
+// @desc Update the contents of a functional node
+// @param graph {dictionary} Graph originally generated using .ml.createGraph
+// @param nodeId {symbol} Denotes the name of a functional node to be updated
+// @param node {fn} A functional node
+// @return {dictionary} The graph with the named functional node contents
+// overwritten
 updNode:{[graph;nodeId;node]
node,:(1#`)!1#(::); if[not nodeId in 1_exec nodeId from graph`nodes;'"invalid nodeId"]; if[count key[node]except``function`inputs`outputs;'"invalid node"]; - oldnode:graph[`nodes]nodeId; + oldNode:graph[`nodes]nodeId; if[`inputs in key node; if[(::)~node`inputs;node[`inputs]:(0#`)!""]; if[-10h=type node`inputs;node[`inputs]:(1#`input)!enlist node`inputs]; if[99h<>type node`inputs;'"invalid inputs"]; - inputEdges:select from graph[`edges]where dstNode=nodeId,dstName in key oldnode`inputs; + inputEdges:select from graph[`edges]where destNode=nodeId, + destName in key oldNode`inputs; graph:@[graph;`edges;key[inputEdges]_]; - inputEdges:flip[`dstNode`dstName!(nodeId;key node`inputs)]#inputEdges; + inputEdges:flip[`destNode`destName!(nodeId;key node`inputs)]#inputEdges; graph:@[graph;`edges;,;inputEdges]; - inputEdges:select from inputEdges where not null srcNode; - graph:{[graph;edge]connectEdge[graph]. edge`srcNode`srcName`dstNode`dstName}/[graph;0!inputEdges]; - ]; + inputEdges:select from inputEdges where not null sourceNode; + graph:i.connectGraph/[graph;0!inputEdges]; + ]; if[`outputs in key node; if[-10h=type node`outputs; - node[`outputs]:(1#`output)!enlist node`outputs; - ]; + node[`outputs]:(1#`output)!enlist node`outputs]; if[99h<>type node`outputs;'"invalid outputs"]; - outputEdges:select from graph[`edges]where srcNode=nodeId,srcName in key oldnode`outputs; + outputEdges:select from graph[`edges]where sourceNode=nodeId, + sourceName in key oldNode`outputs; graph:@[graph;`edges;key[outputEdges]_]; - outputEdges:select from outputEdges where srcName in key node`outputs; + outputEdges:select from outputEdges where sourceName in key node`outputs; graph:@[graph;`edges;,;outputEdges]; - outputEdges:select srcNode,srcName,dstNode,dstName from outputEdges; - graph:{[graph;edge]connectEdge[graph]. 
edge`srcNode`srcName`dstNode`dstName}/[graph;0!outputEdges];
-  ];
+    outputEdge:select sourceNode,sourceName,destNode,destName from outputEdges;
+    graph:i.connectGraph/[graph;0!outputEdge];
+    ];
   if[`function in key node;
-    if[(1#`output)~key graph[`nodes;nodeId]`outputs;node[`function]:((1#`output)!enlist@)node[`function]::];
-  ];
+    if[(1#`output)~key graph[`nodes;nodeId]`outputs;
+      node[`function]:((1#`output)!enlist@)node[`function]::];
+    ];
   graph:@[graph;`nodes;,;update nodeId from node];
-  graph}
+  graph
+  }
 
+// @kind function
+// @category graph
+// @desc Delete a named function node
+// @param graph {dictionary} Graph originally generated using .ml.createGraph
+// @param nodeId {symbol} Denotes the name of a functional node to be deleted
+// @return {dictionary} The graph with the named functional node removed
 delNode:{[graph;nodeId]
   if[not nodeId in 1_exec nodeId from graph`nodes;'"invalid nodeId"];
   graph:@[graph;`nodes;_;nodeId];
-  inputEdges:select from graph[`edges]where dstNode=nodeId;
+  inputEdges:select from graph[`edges]where destNode=nodeId;
   graph:@[graph;`edges;key[inputEdges]_];
-  outputEdges:select from graph[`edges]where srcNode=nodeId;
-  graph:@[graph;`edges;,;update srcNode:`,srcName:`,valid:0b from outputEdges];
-  graph}
+  outputEdges:select from graph[`edges]where sourceNode=nodeId;
+  graph:@[graph;`edges;,;update sourceNode:`,sourceName:`,
+    valid:0b from outputEdges];
+  graph
+  }
+
+// @kind function
+// @category graph
+// @desc Add a configuration node to a graph
+// @param graph {dictionary} Graph originally generated using .ml.createGraph
+// @param nodeId {symbol} Denotes the name associated with the configuration
+// node
+// @param config {fn} Any configuration information to be supplied to other
+// nodes in the graph
+// @return {dictionary} A graph with the new configuration added to the
+// graph structure
+addCfg:{[graph;nodeId;config]
+  nodeKeys:``function`inputs`outputs;
+  addNode[graph;nodeId]nodeKeys!(::;@[;config];::;"!")
+  }
 
-addCfg:{[graph;nodeId;cfg]addNode[graph;nodeId]``function`inputs`outputs!(::;@[;cfg];::;"!")}
-updCfg:{[graph;nodeId;cfg]updNode[graph;nodeId](1#`function)!enlist cfg}
+// @kind function
+// @category graph
+// @desc Update the contents of a configuration node
+// @param graph {dictionary} Graph originally generated using .ml.createGraph
+// @param nodeId {symbol} Denotes the name of a configuration node to be
+// updated
+// @param config {fn} Any configuration information to be supplied to other
+// nodes in the graph
+// @return {dictionary} The graph with the named configuration node contents
+// overwritten
+updCfg:{[graph;nodeId;config]
+  updNode[graph;nodeId](1#`function)!enlist config
+  }
+
+// @kind function
+// @category graph
+// @desc Delete a named configuration node
+// @param graph {dictionary} Graph originally generated using .ml.createGraph
+// @param nodeId {symbol} Denotes the name of a configuration node to be
+// deleted
+// @return {dictionary} The graph with the named configuration node removed
 delCfg:delNode
 
-connectEdge:{[graph;srcNode;srcName;dstNode;dstName]
-  if[99h<>type srcOutputs:graph[`nodes;srcNode;`outputs];'"invalid srcNode"];
-  if[99h<>type dstInputs:graph[`nodes;dstNode;`inputs];'"invalid dstNode"];
-  if[not srcName in key srcOutputs;'"invalid srcName"];
-  if[not dstName in key dstInputs;'"invalid dstName"];
-  edge:(1#`valid)!1#srcOutputs[srcName]~dstInputs[dstName];
-  graph:@[graph;`edges;,;update dstNode,dstName,srcNode,srcName from edge];
-  graph}
+// @kind function
+// @category graph
+// @desc Connect the output of one node to the input to another +// @param graph {dictionary} Graph originally generated using .ml.createGraph +// @param sourceNode {symbol} Denotes the name of a node in the graph which +// contains the relevant output +// @param sourceName {symbol} Denotes the name of the output to be connected to +// an associated input node +// @param destNode {symbol} Name of a node in the graph which contains the +// relevant input to be connected to +// @param destName {symbol} Name of the input which is connected to the output +// defined by sourceNode and sourceName +// @return {dictionary} The graph with the relevant connection made between the +// inputs and outputs of two nodes +connectEdge:{[graph;sourceNode;sourceName;destNode;destName] + srcOutputs:graph[`nodes;sourceNode;`outputs]; + dstInputs:graph[`nodes;destNode;`inputs]; + if[99h<>type srcOutputs;'"invalid sourceNode"]; + if[99h<>type dstInputs;'"invalid destNode"]; + if[not sourceName in key srcOutputs;'"invalid sourceName"]; + if[not destName in key dstInputs;'"invalid destName"]; + edge:(1#`valid)!1#srcOutputs[sourceName]~dstInputs[destName]; + graph:@[graph;`edges;,;update destNode,destName,sourceNode, + sourceName from edge]; + graph + } -disconnectEdge:{[graph;dstNode;dstName] - if[not(dstNode;dstName)in key graph`edges;'"invalid edge"]; +// @kind function +// @category graph +// @desc Disconnect an edge from the input of a node +// @param graph {dictionary} Graph originally generated using .ml.createGraph +// @param destNode {symbol} Name of the node containing the edge to be deleted +// @param destName {symbol} Name of the edge associated with a specific input +// to be disconnected +// @return {dictionary} The graph with the edge connected to the destination +// input removed from the graph. +disconnectEdge:{[graph;destNode;destName] + if[not(destNode;destName)in key graph`edges;'"invalid edge"]; edge:(1#`valid)!1#0b; - graph:@[graph;`edges;,;update dstNode,dstName,srcName:`,srcNode:` from edge]; - graph} - + graph:@[graph;`edges;,;update destNode,destName,sourceName:`, + sourceNode:` from edge]; + graph + } diff --git a/graph/init.q b/graph/init.q index 3f7568d9..e401db10 100644 --- a/graph/init.q +++ b/graph/init.q @@ -1,4 +1,14 @@ +// graph/init.q - Load graph library +// Copyright (c) 2021 Kx Systems Inc +// +// Graph and Pipeline is a structural framework for developing +// q/kdb+ solutions, based on a directed acyclic graph. 
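+// As an illustrative sketch of the construction workflow (node names here
+// are hypothetical, not part of the library), a two-node graph mirroring
+// the usage in the tests could be built as:
+//   g:.ml.createGraph[]
+//   g:.ml.addCfg[g;`cfg1]enlist[`arg]!enlist 1
+//   g:.ml.addNode[g;`node1]`function`inputs`outputs!({x};"!";"!")
+//   g:.ml.connectEdge[g;`cfg1;`output;`node1;`input]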
+
+.ml.loadfile`:graph/utils.q
 .ml.loadfile`:graph/graph.q
 .ml.loadfile`:graph/pipeline.q
 .ml.loadfile`:graph/modules/saving.q
 .ml.loadfile`:graph/modules/loading.q
+
+.ml.loadfile`:util/utils.q
+.ml.i.deprecWarning`graph
diff --git a/graph/modules/loading.q b/graph/modules/loading.q
index b1528ef9..750adaea 100644
--- a/graph/modules/loading.q
+++ b/graph/modules/loading.q
@@ -1,27 +1,113 @@
-\d .ml
-
-i.loadfname:{[cfg]
-  file:hsym`$$[(not ""~cfg`directory)&`directory in key cfg;cfg`directory;"."],"/",cfg`fileName;
+\d .ml
+
+// Utility Functions for loading data
+
+// @private
+// @kind function
+// @category loadingUtility
+// @fileoverview Construct path to a data file
+// @param config {dict} Any configuration information about the dataset being
+// loaded in
+// @return {str} Path to the data file
+i.loadFileName:{[config]
+  file:hsym`$$[(not ""~config`directory)&`directory in key config;
+    config`directory;
+    "."],"/",config`fileName;
   if[()~key file;'"file does not exist"];
-  file}
+  file
+  }
+
+// @private
+// @kind function
+// @category loadingUtility
+// @fileoverview Load splayed table or binary file
+// @param config {dict} Any configuration information about the dataset being
+// loaded in
+// @return {tab} Data obtained from splayed table or binary file
+i.loadFunc.splay:i.loadFunc.binary:{[config]
+  get i.loadFileName config
+  }
+
+// @private
+// @kind function
+// @category loadingUtility
+// @fileoverview Load data from csv file
+// @param config {dict} Any configuration information about the dataset being
+// loaded in
+// @return {tab} Data obtained from csv
+i.loadFunc.csv:{[config]
+  (config`schema;config`separator)0: i.loadFileName config
+  }
 
-i.loadfunc.splay:i.loadfunc.binary:{[cfg]get i.loadfname cfg}
-i.loadfunc.csv:{[cfg](cfg`schema;cfg`separator)0: i.loadfname cfg}
-i.loadfunc.json:{[cfg].j.k first read0 i.loadfname cfg}
-i.loadfunc.hdf5:{[cfg]
+// @private
+// @kind function
+// @category loadingUtility
+// @fileoverview Load data from json file
+// @param config {dict} Any configuration information about the dataset being
+// loaded in
+// @return {tab} Data obtained from json file
+i.loadFunc.json:{[config]
+  .j.k first read0 i.loadFileName config
+  }
+
+// @private
+// @kind function
+// @category loadingUtility
+// @fileoverview Load data from HDF5 file
+// @param config {dict} Any configuration information about the dataset being
+// loaded in
+// @return {tab} Data obtained from HDF5 file
+i.loadFunc.hdf5:{[config]
   if[not`hdf5 in key`;@[system;"l hdf5.q";{'"unable to load hdf5 lib"}]];
-  if[not .hdf5.ishdf5 fname:i.loadfname cfg;'"file is not an hdf5 file"];
-  if[not .hdf5.isObject[fpath;cfg`dname];'"hdf5 dataset does not exist"];
-  .hdf5.readData[fpath;cfg`dname]}
-i.loadfunc.ipc:{[cfg]
-  h:@[hopen;cfg`port;{'"error opening connection"}];
-  ret:@[h;cfg`select;{'"error executing query"}];
+  if[not .hdf5.ishdf5 filePath:i.loadFileName config;
+    '"file is not an hdf5 file"
+    ];
+  if[not .hdf5.isObject[filePath;config`dname];'"hdf5 dataset does not exist"];
+  .hdf5.readData[filePath;config`dname]
+  }
+
+// @private
+// @kind function
+// @category loadingUtility
+// @fileoverview Load data from ipc
+// @param config {dict} Any configuration information about the dataset being
+// loaded in
+// @return {tab} Data obtained via IPC
+i.loadFunc.ipc:{[config]
+  h:@[hopen;config`port;{'"error opening connection"}];
+  ret:@[h;config`select;{'"error executing query"}];
   @[hclose;h;{}];
-  ret}
-i.loadfunc.process:{[cfg]if[not `data in key cfg;'"Data to be used must be defined"];cfg[`data]}
defined"];cfg[`data]} + ret + } + +// @private +// @kind function +// @category loadingUtility +// @fileoverview Load data from config dictionary +// @param config {dict} Any configuration information about the dataset being +// loaded in +// @return {dict} Data obtained from config dictionary +i.loadFunc.process:{[config] + if[not `data in key config;'"Data to be used must be defined"]; + config`data + } + +// @private +// @kind function +// @category loadingUtility +// @fileoverview Load data from a defined source +// @param config {dict} Any configuration information about the dataset being +// loaded in +// @return {dict} Data obtained from a defined source +i.loadDataset:{[config] + if[null func:i.loadFunc config`typ;'"dataset type not supported"]; + func config + } -i.loaddset:{[cfg] - if[null func:i.loadfunc cfg`typ;'"dataset type not supported"]; - func cfg} +// Loading functionality -loaddset:`function`inputs`outputs!(i.loaddset;"!";"+") +// @kind function +// @category loading +// @fileoverview Node to load data from a defined source +// @return {dict} Node in graph to be used for loading data +loadDataSet:`function`inputs`outputs!(i.loadDataset;"!";"+") diff --git a/graph/modules/saving.q b/graph/modules/saving.q index 56159792..2f7605bc 100644 --- a/graph/modules/saving.q +++ b/graph/modules/saving.q @@ -1,29 +1,112 @@ \d .ml -i.savefname:{[cfg] +// Utility Functions for loading data + +// @private +// @kind function +// @category savingUtility +// @fileoverview Construct path to location where data is to be saved +// @param config {dict} Any configuration information about the dataset being +// saved +// @return {str} Path to a file location +i.saveFileName:{[cfg] file:hsym`$$[`dir in key cfg;cfg`key;"."],"/",cfg fname; if[not ()~key file;'"file exists"]; file} -i.savedset.txt:{[cfg;dset]i.savefname[cfg]0:.h.tx[cfg`typ;dset];} -i.savedset[`csv`xml`xls]:i.savedset.txt -i.savedset.binary:{[cfg;dset]i.savefname[cfg]set dset;} -i.savedset.json:{[cfg;dset] - h:hopen i.savefname cfg; - h @[.j.j;dset;{'"error converting to json"}]; - hclose h;} -i.savedset.hdf5:{[cfg;dset] +// @private +// @kind function +// @category savingUtility +// @fileoverview Save data as a text file +// @param config {dict} Any configuration information about the dataset being +// saved +// @param data {tab} Data which is to be saved +// @return {null} Data is saved as a text file +i.saveFunc.txt:{[config;data] + i.saveFileName[config]0:.h.tx[config`typ;data]; + } + +// @private +// @kind function +// @category savingUtility +// @fileoverview Save data as a text file +// @param config {dict} Any configuration information about the dataset being +// saved +// @param data {tab} Data which is to be saved +// @return {null} Data is saved as a text file +i.saveFunc[`csv`xml`xls]:i.saveFunc.txt + +// @private +// @kind function +// @category savingUtility +// @fileoverview Save data as a binary file +// @param config {dict} Any configuration information about the dataset being +// saved +// @param data {tab} Data which is to be saved +// @return {null} Data is saved as a binary file +i.saveFunc.binary:{[config;data] + i.saveFileName[config]set data; + } + +// @private +// @kind function +// @category savingUtility +// @fileoverview Save data as a json file +// @param config {dict} Any configuration information about the dataset being +// saved +// @param data {tab} Data which is to be saved +// @return {null} Data is saved as a json file +i.saveFunc.json:{[config;data] + h:hopen i.saveFileName config; + h 
+  hclose h;
+  }
+
+// @private
+// @kind function
+// @category savingUtility
+// @fileoverview Save data as a HDF5 file
+// @param config {dict} Any configuration information about the dataset being
+// saved
+// @param data {tab} Data which is to be saved
+// @return {null} Data is saved as a HDF5 file
+i.saveFunc.hdf5:{[config;data]
   if[not`hdf5 in key`;@[system;"l hdf5.q";{'"unable to load hdf5 lib"}]];
-  .hdf5.createFile fname:i.savefname cfg;
-  .hdf5.writeData[fname;cfg`dname;dset];
+  .hdf5.createFile filePath:i.saveFileName config;
+  .hdf5.writeData[filePath;config`dname;data];
+  }
+
+// @private
+// @kind function
+// @category savingUtility
+// @fileoverview Save data as a splayed table
+// @param config {dict} Any configuration information about the dataset being
+// saved
+// @param data {tab} Data which is to be saved
+// @return {null} Data is saved as a splayed table
+i.saveFunc.splay:{[config;data]
+  dataName:first` vs filePath:i.saveFileName config;
+  filePath:` sv filePath,`;
+  filePath set .Q.en[dataName]data;
+  }
+
+// @private
+// @kind function
+// @category savingUtility
+// @fileoverview Save data in a defined format
+// @param config {dict} Any configuration information about the dataset being
+// saved
+// @param data {tab} Data which is to be saved
+// @return {null} Data is saved in the defined format
+i.saveDataset:{[config;data]
+  if[null func:i.saveFunc config`typ;'"dataset type not supported"];
+  func[config;data]
+  }
 
-i.savedset.splay:{[cfg;dset]
-  dname:first` vs fname:i.savefname cfg;
-  fname:` sv fname,`;
-  fname set .Q.en[dname]dset;}
-i.savefunc:{[cfg;dset]
-  if[null func:i.savedset cfg`typ;'"dataset type not supported"];
-  func dset}
+// Saving functionality
 
-savedset:`function`inputs`outputs!(i.savefunc;`cfg`dset!"!+";" ")
+// @kind function
+// @category saving
+// @fileoverview Node to save data from a defined source
+// @return {dict} Node in graph to be used for saving data
+saveDataset:`function`inputs`outputs!(i.saveDataset;`cfg`dset!"!+";" ")
diff --git a/graph/pipeline.q b/graph/pipeline.q
index 4cea2893..70ecd458 100644
--- a/graph/pipeline.q
+++ b/graph/pipeline.q
@@ -1,51 +1,54 @@
+// graph/pipeline.q - Build and execute a pipeline
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Contains createPipeline and execPipeline for
+// the creation and execution of pipelines.
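+//
+// A hedged usage sketch (assuming `g is a fully connected graph such as
+// the one outlined in graph.q; names are illustrative only):
+//   p:.ml.createPipeline g
+//   p:.ml.execPipeline p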
+
 \d .ml
 
-// Execution of a pipeline will not default to enter q debug mode but should be possible to overwrite
+// Execution of a pipeline does not, by default, enter q debug mode,
+// but this behaviour can be overwritten
 graphDebug:0b
 
-updDebug:{[x]graphDebug::not graphDebug}
+// @kind function
+// @category pipeline
+// @desc Update debugging mode
+// @return {::} Debugging is updated
+updDebug:{[]
+  graphDebug::not graphDebug
+  }
+
+// @kind function
+// @category pipeline
+// @desc Generate an execution pipeline based on a valid graph
+// @param graph {dictionary} Graph originally generated by .ml.createGraph,
+// which has all relevant input edges connected validly
+// @return {dictionary} An optimal execution pipeline populated with all
+// information required to allow its successful execution
 createPipeline:{[graph]
   if[not all exec 1_valid from graph`edges;'"disconnected edges"];
-  outputs:ungroup select srcNode:nodeId,srcName:key each outputs from 1_graph`nodes;
-  endpoints:exec distinct srcNode from outputs except select srcNode,srcName from graph`edges;
-  optimalpath:distinct raze paths idesc count each paths:i.getOptimalPath[graph]each endpoints;
-  pipeline:([]nodeId:optimalpath)#graph`nodes;
-  nodeinputs:key each exec inputs from pipeline;
-  pipeline:update inputs:count[i]#enlist(1#`)!1#(::),outputtypes:outputs,inputorder:nodeinputs from pipeline;
-  pipeline:select nodeId,complete:0b,error:`,function,inputs,outputs:inputs,outputtypes,inputorder from pipeline;
-  pipeline:pipeline lj select outputmap:([]srcName;dstNode;dstName)by nodeId:srcNode from graph`edges;
+  outputs:ungroup select sourceNode:nodeId,sourceName:key each outputs
+    from 1_graph`nodes;
+  srcInfo:select sourceNode,sourceName from graph`edges;
+  endPoints:exec distinct sourceNode from outputs except srcInfo;
+  paths:i.getOptimalPath[graph]each endPoints;
+  optimalPath:distinct raze paths idesc count each paths;
+  pipeline:([]nodeId:optimalPath)#graph`nodes;
+  nodeInputs:key each exec inputs from pipeline;
+  pipeline:update inputs:count[i]#enlist(1#`)!1#(::),outputTypes:outputs,
+    inputOrder:nodeInputs from pipeline;
+  pipeline:select nodeId,complete:0b,error:`,function,inputs,outputs:inputs,
+    outputTypes,inputOrder from pipeline;
+  pipeline:pipeline lj select outputMap:([]sourceName;destNode;destName)by
+    nodeId:sourceNode from graph`edges;
   1!pipeline}
 
-execPipeline:{[pipeline]i.execCheck i.execNext/pipeline}
-
-
-// Pipeline creation utilities
-i.getDeps:{[graph;node]exec distinct srcNode from graph[`edges]where dstNode=node}
-i.getAllDeps:{[graph;node]$[count depNodes:i.getDeps[graph]node;distinct node,raze .z.s[graph]each depNodes;node]}
-i.getAllPaths:{[graph;node]$[count depNodes:i.getDeps[graph]node;node,/:raze .z.s[graph]each depNodes;raze node]}
-i.getLongestPath:{[graph;node]paths first idesc count each paths:reverse each i.getAllPaths[graph;node]}
-i.getOptimalPath:{[graph;node]distinct raze reverse each i.getAllDeps[graph]each i.getLongestPath[graph;node]}
-
-i.execNext:{[pipeline]
-  node:first 0!select from pipeline where not complete;
-  -1"Executing node: ",string node`nodeId;
-  if[not count inputs:node[`inputs]node[`inputorder];inputs:1#(::)];
-  res:`complete`error`outputs!$[graphDebug;
-    .[(1b;`;)node[`function]::;inputs];
-    .[(1b;`;)node[`function]::;inputs;{[err](0b;`$err;::)}]
-    ];
-  / compare outputs to outputtypes ?
- if[not null res`error;-2"Error: ",string res`error]; - if[res`complete; - res[`inputs]:(1#`)!1#(::); - outputmap:update data:res[`outputs]srcName from node`outputmap; - res[`outputs]:((1#`)!1#(::)),(exec distinct srcName from outputmap)_ res`outputs; - pipeline:{[pipeline;map]pipeline[map`dstNode;`inputs;map`dstName]:map`data;pipeline}/[pipeline;outputmap]; - ]; - pipeline,:update nodeId:node`nodeId from res; - pipeline} - -i.execCheck:{[pipeline] - if[any not null exec error from pipeline;:0b]; - if[all exec complete from pipeline;:0b]; - 1b} +// @kind function +// @category pipeline +// @desc Execute a generated pipeline +// @param pipeline {dictionary} Pipeline created by .ml.createPipeline +// @return {dictionary} The pipeline with each node executed and appropriate +// outputs populated. +execPipeline:{[pipeline] + i.execCheck i.execNext/pipeline + } diff --git a/graph/tests/graph.t b/graph/tests/graph.t index 51a03ba6..852f4398 100644 --- a/graph/tests/graph.t +++ b/graph/tests/graph.t @@ -4,6 +4,7 @@ \l p.q \l ml.q +\l graph/utils.q \l graph/graph.q \l graph/pipeline.q @@ -79,7 +80,7 @@ failingTest[.ml.updNode;(g;`node1;outputType);0b;"invalid outputs"] // Connect an invalid edge between 2 nodes and check that this is not valid g:.ml.connectEdge[g;`cfg1;`output;`node1;`input] -0b~first exec valid from g[`edges] where dstNode=`node1,dstName=`input +0b~first exec valid from g[`edges] where destNode=`node1,destName=`input g:.ml.disconnectEdge[g;`node1;`input] // Attempt to disconnect a node that doesn't exist @@ -89,16 +90,16 @@ failingTest[.ml.disconnectEdge;(g;`node;`input);0b;"invalid edge"] failingTest[.ml.disconnectEdge;(g;`node1;`test);0b;"invalid edge"] // Attempt to connect an edge with a non existent source node -failingTest[.ml.connectEdge;(g;`nocfg;`output;`node1;`input);0b;"invalid srcNode"] +failingTest[.ml.connectEdge;(g;`nocfg;`output;`node1;`input);0b;"invalid sourceNode"] // Attempt to connect an edge from an existent source node but non existent source name -failingTest[.ml.connectEdge;(g;`cfg1;`nosrcName;`node1;`input);0b;"invalid srcName"] +failingTest[.ml.connectEdge;(g;`cfg1;`nosourceName;`node1;`input);0b;"invalid sourceName"] // Attempt to connect an edge from an non existent destination node -failingTest[.ml.connectEdge;(g;`cfg1;`output;`nosrcnode;`input);0b;"invalid dstNode"] +failingTest[.ml.connectEdge;(g;`cfg1;`output;`nosrcnode;`input);0b;"invalid destNode"] // Attempt to connect an edge from an existent destination node but non existent destination name -failingTest[.ml.connectEdge;(g;`cfg1;`output;`node1;`noinput);0b;"invalid dstName"] +failingTest[.ml.connectEdge;(g;`cfg1;`output;`node1;`noinput);0b;"invalid destName"] -1"\nTesting delNode"; @@ -116,7 +117,7 @@ not `tempNode in exec nodeId from g[`nodes] // but function errors on execution (for pipeline testing) g:.ml.updNode[g;`node1]`function`inputs`outputs!({`e+1};"!";"!") g:.ml.connectEdge[g;`cfg1;`output;`node1;`input] -1b~first exec valid from g[`edges] where dstNode=`node1,dstName=`input +1b~first exec valid from g[`edges] where destNode=`node1,destName=`input -1"\nTesting failing pipeline execution without debug mode active"; diff --git a/graph/utils.q b/graph/utils.q new file mode 100644 index 00000000..ee3fc613 --- /dev/null +++ b/graph/utils.q @@ -0,0 +1,153 @@ +// graph/utils.q - Utility functions for graphs +// Copyright (c) 2021 Kx Systems Inc +// +// Utility functions for implementation of graph library + +\d .ml + +// Graphing creation utilities + +// @private +// @kind function +// 
@category pipelineUtility +// @desc Connect the output of one node to the input to another +// @param graph {dictionary} Graph originally generated by .ml.createGraph, +// which has all relevant input edges connected validly +// @param edge {dictionary} Contains information about the edge node +// @return {dictionary} The graph with the relevant connection made between the +// inputs and outputs of two nodes. +i.connectGraph:{[graph;edge] + edgeKeys:`sourceNode`sourceName`destNode`destName; + connectEdge[graph]. edge edgeKeys + } + +// Pipeline creation utilities + +// @private +// @kind function +// @category pipelineUtility +// @desc Extract the source of a specific node +// @param graph {dictionary} Graph originally generated by .ml.createGraph, +// which has all relevant input edges connected validly +// @param node {symbol} Name associated with the functional node +// @return {symbol} Source of the given node +i.getDeps:{[graph;node] + exec distinct sourceNode from graph[`edges]where destNode=node + } + +// @private +// @kind function +// @category pipelineUtility +// @desc Extract all dependent source nodes needed to run the node +// @param graph {dictionary} Graph originally generated by .ml.createGraph, +// which has all relevant input edges connected validly +// @param node {symbol} Denoting the name to be associated with the functional +// node +// @return {symbol[]} All sources required for the given node +i.getAllDeps:{[graph;node] + depNodes:i.getDeps[graph]node; + $[count depNodes; + distinct node,raze .z.s[graph]each depNodes; + node + ] + } + +// @private +// @kind function +// @category pipelineUtility +// @desc Extract all the paths needed to run the node +// @param graph {dictionary} Graph originally generated by .ml.createGraph, +// which has all relevant input edges connected validly +// @param node {symbol} Denoting the name to be associated with the functional +// node +// @return {symbol} All paths required for the given node +i.getAllPaths:{[graph;node] + depNodes:i.getDeps[graph]node; + $[count depNodes; + node,/:raze .z.s[graph]each depNodes; + raze node + ] + } + +// @private +// @kind function +// @category pipelineUtility +// @desc Get the longest path +// @param graph {dictionary} Graph originally generated by .ml.createGraph, +// which has all relevant input edges connected validly +// @param node {symbol} Denoting the name to be associated with the functional +// node +// @return {symbol} The longest path available +i.getLongestPath:{[graph;node] + paths:reverse each i.getAllPaths[graph;node]; + paths first idesc count each paths + } + +// @private +// @kind function +// @category pipelineUtility +// @desc Extract the optimal path to run the node +// @param graph {dictionary} Graph originally generated by .ml.createGraph, +// which has all relevant input edges connected validly +// @param node {symbol} Denoting the name to be associated with the functional +// node +// @return {symbol} The optimal path to run the node +i.getOptimalPath:{[graph;node] + longestPath:i.getLongestPath[graph;node]; + distinct raze reverse each i.getAllDeps[graph]each longestPath + } + +// @private +// @kind function +// @category pipelineUtility +// @desc Update input data information within the pipeline +// @param pipeline {dictionary} Pipeline created by .ml.createPipeline +// @param map {dictionary} Contains information needed to run the node +// @return {dictionary} Pipeline updated with input information +i.updateInputData:{[pipeline;map] + 
pipeline[map`destNode;`inputs;map`destName]:map`data;
+  pipeline
+  }
+
+// @private
+// @kind function
+// @category pipelineUtility
+// @desc Execute the first non completed node in the pipeline
+// @param pipeline {dictionary} Pipeline created by .ml.createPipeline
+// @return {dictionary} Pipeline with executed node marked as complete
+i.execNext:{[pipeline]
+  node:first 0!select from pipeline where not complete;
+  -1"Executing node: ",string node`nodeId;
+  inputs:node[`inputs]node`inputOrder;
+  if[not count inputs;inputs:1#(::)];
+  resKeys:`complete`error`outputs;
+  resVals:$[graphDebug;
+    .[(1b;`;)node[`function]::;inputs];
+    .[(1b;`;)node[`function]::;inputs;{[err](0b;`$err;::)}]
+    ];
+  res:resKeys!resVals;
+  if[not null res`error;-2"Error: ",string res`error];
+  if[res`complete;
+    res[`inputs]:(1#`)!1#(::);
+    outputMap:update data:res[`outputs]sourceName from node`outputMap;
+    uniqueSource:(exec distinct sourceName from outputMap)_ res`outputs;
+    res[`outputs]:((1#`)!1#(::)),uniqueSource;
+    pipeline:i.updateInputData/[pipeline;outputMap];
+    ];
+  pipeline,:update nodeId:node`nodeId from res;
+  pipeline
+  }
+
+// @private
+// @kind function
+// @category pipelineUtility
+// @desc Check if any nodes are left to be executed or if any
+// errors have occurred
+// @param pipeline {dictionary} Pipeline created by .ml.createPipeline
+// @return {boolean} 0b if all nodes have been completed or if any errors
+// have occurred, otherwise 1b
+i.execCheck:{[pipeline]
+  if[any not null exec error from pipeline;:0b];
+  if[all exec complete from pipeline;:0b];
+  1b
+  }
diff --git a/init.q b/init.q
index 15341349..c17e0f22 100644
--- a/init.q
+++ b/init.q
@@ -1,7 +1,16 @@
-.ml.loadfile`:util/init.q
-.ml.loadfile`:fresh/init.q
-.ml.loadfile`:clust/init.q
-.ml.loadfile`:xval/init.q
-.ml.loadfile`:graph/init.q
-.ml.loadfile`:optimize/init.q
-.ml.loadfile`:timeseries/init.q
+// init.q - Load ml libraries
+// Copyright (c) 2021 Kx Systems Inc
+
+\d .ml
+
+path:{string`ml^`$@[{"/"sv -1_"/"vs ssr[;"\\";"/"](-3#get .z.s)0};`;""]}`
+system"l ",path,"/","ml.q"
+
+loadfile`:util/init.q
+loadfile`:stats/init.q
+loadfile`:fresh/init.q
+loadfile`:clust/init.q
+loadfile`:xval/init.q
+loadfile`:graph/init.q
+loadfile`:optimize/init.q
+loadfile`:timeseries/init.q
diff --git a/ml.q b/ml.q
index ac6e6816..a5272582 100644
--- a/ml.q
+++ b/ml.q
@@ -1,5 +1,25 @@
+// ml.q - Setup for ml namespace
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Define version, path, and loadfile
+
+
 \l p.q /embedPy
 
 \d .ml
 
 version:@[{TOOLKITVERSION};`;`development]
 path:{string`ml^`$@[{"/"sv -1_"/"vs ssr[;"\\";"/"](-3#get .z.s)0};`;""]}`
 loadfile:{$[.z.q;;-1]"Loading ",x:_[":"=x 0]x:$[10=type x;;string]x;system"l ",path,"/",x;}
+
+// The following functionality should be available for all initialized
+// sections of the library
+
+// @private
+// @kind function
+// @category utility
+// @fileoverview If set to `1b` deprecation warnings are ignored
+i.ignoreWarning:0b
+
+// @private
+// @kind function
+// @category utility
+// @fileoverview Toggle the value of i.ignoreWarning
+updateIgnoreWarning:{[]i.ignoreWarning::not i.ignoreWarning}
diff --git a/optimize/README.md b/optimize/README.md
new file mode 100644
index 00000000..1d2c79a5
--- /dev/null
+++ b/optimize/README.md
@@ -0,0 +1,35 @@
+# Numerical optimization
+
+The functionality contained within this folder provides a number of implementations of numerical optimization techniques. Such techniques are used to find the local or global minima of user-provided objective functions and are central to many statistical models.
+
+## Functionality
+
+At present, the optimization folder contains an implementation of the Broyden-Fletcher-Goldfarb-Shanno algorithm.
+
+The Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm is a quasi-Newton iterative method for solving unconstrained non-linear optimization problems. This is a class of hill-climbing optimization that seeks a stationary, preferably twice-differentiable, solution to the objective function.
+
+## Requirements
+
+- kdb+ > 3.5
+
+## Installation
+
+Place the `ml` library in `$QHOME` and load into a q instance using `ml/ml.q`
+
+### Load
+
+The following will load the optimization functionality into the `.ml` namespace
+```q
+q)\l ml/ml.q
+q).ml.loadfile`:optimize/init.q
+```
+
+## Documentation
+
+Documentation is available on the [Optimization](https://code.kx.com/q/ml/toolkit/optimize/) homepage.
+
+## Status
+
+The optimization library is still in development. Further functionality and improvements will be made to the library on an ongoing basis.
+
+If you have any issues, questions or suggestions, please write to ai@kx.com.
diff --git a/optimize/init.q b/optimize/init.q
index bfca8622..8ca0a58c 100644
--- a/optimize/init.q
+++ b/optimize/init.q
@@ -1,3 +1,15 @@
+// optimize/init.q - Load optimize library
+// Copyright (c) 2021 Kx Systems Inc
+//
+// The .ml.optimize namespace contains functions that relate to
+// the application of numerical optimization techniques. Such
+// techniques are used to find local or global minima of user-provided
+// objective functions and are central to many statistical models.
+
 \d .ml
-loadfile`:util/util.q
-loadfile`:optimize/optim.q
+loadfile`:util/utils.q
+loadfile`:util/utilities.q
+loadfile`:optimize/utils.q
+loadfile`:optimize/optimize.q
+
+.ml.i.deprecWarning`optimize
diff --git a/optimize/optim.q b/optimize/optim.q
deleted file mode 100644
index d832e790..00000000
--- a/optimize/optim.q
+++ /dev/null
@@ -1,659 +0,0 @@
-// Namespace appropriately
-\d .ml
-
-// @kind function
-// @category optimization
-// @fileoverview Optimize a function using the
-// Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm. This implementation
-// is based on https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/optimize.py#L1058
-// and is a quasi-Newton hill-climbing optimization technique used to find
-// a preferebly twice continuously differentiable stationary point of a function.
-// An outline of the algorithm mathematically is provided here:
-// https://en.wikipedia.org/wiki/Broyden-Fletcher-Goldfarb-Shanno_algorithm#Algorithm
-// @param func {lambda} the function to be optimized. This function should take
-// as its arguments a list/dictionary of parameters to be optimized and a list/dictionary
-// of additional unchanging arguments
-// @param x0 {num[]/dict} the first guess at the parameters to be optimized as
-// a list or dictionary of numeric values
-// @param args {list/dict/(::)} any unchanging parameters to required for evaluation
-// of the function, these should be in the order that they are to be applied
-// to the function
-// @param params {dict} any modifications to be applied to the optimization procedure e.g.
-//   - display {bool} are the results at each optimization iteration to be printed
-//   - optimIter {integer} maximum number of iterations in optimization procedure
-//   - zoomIter {integer} maximum number of iterations when finding optimal zoom
-//   - wolfeIter {integer} maximum number of iterations in the Wolfe condition search
-//   - norm {integer} order of norm (0W = max; -0W = min), otherwise calculated via
-//     sum[abs[vec]xexp norm]xexp 1%norm
-//   - gtol {float} gradient norm must be less than gtol before successful termination
-//   - geps {float} the absolute step size used for numerical approximation
-//     of the jacobian via forward differences.
-//   - stepSize {float} maximum allowable 'alpha' step size between calculations
-//   - c1 {float} armijo rule condition
-//   - c2 {integer} curvature conditions rule
-// @returns {dict} a dictionary containing the estimated optimal parameters, number of iterations
-//   and the evaluated return of the function being optimized.
-optimize.BFGS:{[func;x0;args;params]
-  // update the default behaviour of the parameters
-  params:i.updDefault[params];
-  // format x0 based on input type
-  x0:i.dataFormat[x0];
-  // Evaluate the function at the starting point
-  f0:i.funcEval[func;x0;args];
-  // Calculate the starting gradient
-  gk:i.grad[func;x0;args;params`geps];
-  // Initialize Hessian matrix as identity matrix
-  hess:.ml.eye count x0;
-  // set initial step guess i.e. the step before f0
-  prev_fk:f0+sqrt[sum gk*gk]%2;
-  gradNorm:i.vecNorm[gk;params`norm];
-  optimKeys:`xk`fk`prev_fk`gk`prev_xk`hess`gnorm`I`idx;
-  optimVals:(x0;f0;prev_fk;gk;0n;hess;gradNorm;hess;0);
-  optimDict:optimKeys!optimVals;
-  // Run optimization until one of the stopping conditions is met
-  optimDict:i.stopOptimize[;params]i.BFGSFunction[func;;args;params]/optimDict;
-  returnKeys:`xVals`funcRet`numIter;
-  // if function returned due to a null xVal or the new value being worse than the previous
-  //   value then return the k-1 value
-  returnVals:$[(optimDict[`fk]<optimDict`prev_fk)&not any null optimDict`xk;
-    optimDict`xk`fk`idx;
-    optimDict`prev_xk`prev_fk`idx];
-  returnKeys!returnVals
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview apply a single iteration of the BFGS algorithm, updating the
-//   parameter estimates, function return, gradient and Hessian matrix
-// @param func {lambda} the function to be optimized
-// @param optimDict {dict} variables to be updated at each iteration of optimization
-// @param args {dict/num[]} function arguments that do not change per iteration
-// @param params {dict} parameters controlling non default optimization behaviour
-// @returns {dict} updated optimization values
-i.BFGSFunction:{[func;optimDict;args;params]
-  // calculate the search direction pk
-  pk:neg mmu[optimDict`hess;optimDict`gk];
-  // complete a line search to find an alpha satisfying the Wolfe conditions
-  wolfe:i.wolfeSearch[;;;pk;func;;args;params]. optimDict`fk`prev_fk`gk`xk;
-  // update the previous and current function returns and get the new gradient
-  alpha:wolfe 0;
-  optimDict[`prev_fk]:optimDict`fk;
-  optimDict[`fk]:wolfe 1;
-  gnew:wolfe 2;
-  // update the previous value of x and calculate the step taken at x(k)
-  optimDict[`prev_xk]:optimDict`xk;
-  sk:alpha*pk;
-  // update values of x at the new position k
-  optimDict[`xk]:optimDict[`prev_xk]+sk;
-  // if null gnew, then get gradient of new x value
-  if[any null gnew;gnew:i.grad[func;optimDict`xk;args;params`geps]];
-  // subtract new gradients
-  yk:gnew-optimDict`gk;
-  optimDict[`gk]:gnew;
-  // get new norm of gradient
-  optimDict[`gnorm]:i.vecNorm[optimDict`gk;params`norm];
-  // calculate new hessian matrix for next iteration
-  rhok:1%mmu[yk;sk];
-  if[0w=rhok;
-    rhok:1000f;
-    -1"Division by zero in calculation of rhok, assuming rhok large";];
-  A1:optimDict[`I] - sk*\:yk*rhok;
-  A2:optimDict[`I] - yk*\:sk*rhok;
-  optimDict[`hess]:mmu[A1;mmu[optimDict`hess;A2]]+rhok*(sk*/:sk);
-  // if x(k) returns infinite value update gnorm and fk
-  if[0w in abs optimDict`xk;optimDict[`gnorm`fk]:(0n;0w)];
-  optimDict[`idx]+:1;
-  if[params`display;show optimDict;-1"";];
-  optimDict
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview complete a line search across an unconstrained minimization problem making
-//   use of wolfe conditions to constrain the search the naming convention for dictionary keys
-//   in this implementation is based on the python implementation of the same functionality here
-//   https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L193
-// @param fk {float} function return evaluated at position k
-// @param prev_fk {float} function return evaluated at position k-1
-// @param gk {float} gradient at position k
-// @param pk {float} search direction
-// @param func {lambda} function being
optimized -// @param xk {num[]} parameter values at position k -// @param args {dict/num[]} function arguments that do not change per iteration -// @param params {dict} parameters controlling non default optimization behaviour -// @return {num[]} new alpha, fk and derivative values -i.wolfeSearch:{[fk;prev_fk;gk;pk;func;xk;args;params] - phiFunc :i.phi[func;pk;;xk;args]; - derphiFunc:i.derphi[func;params`geps;pk;;xk;args]; - // initial Wolfe conditions - wolfeDict:`idx`alpha0`phi0`phi_a0!(0;0;fk;fk); - // calculate the derivative at that phi0 - derphi0:gk mmu pk; - wolfeDict[`derphi_a0`derphi0]:2#derphi0; - // calculate step size this should be 0 < x < 1 - // with min(x;maxstepsize) or 1f otherwise - alpha:1.01*2*(fk - prev_fk)%derphi0; - alphaVal:$[alpha within 0 1f;min(alpha;params`stepSize);1f]; - wolfeDict[`alpha1]:alphaVal; - // function value at alpha1 - wolfeDict[`phi_a1]:phiFunc wolfeDict`alpha1; - // repeat until wolfe criteria is reached or max iterations have been done - // to get new alpha, phi and derphi values - wolfeDict:i.stopWolfe[;params]i.scalarWolfe[derphiFunc;phiFunc;pk;params]/wolfeDict; - // if the line search did not converge, use last alpha , phi and derphi - $[not any null raze wolfeDict`alpha_star`phi_star`derphi_star; - wolfeDict`alpha_star`phi_star`derphi_star; - wolfeDict`alpha1`phi_a1`derphi_a0_fin - ] - } - -// @private -// @kind function -// @category optimization -// @fileoverview apply a scalar search to find an alpha value that satisfies -// strong Wolfe conditions, a python implementation of this is outlined here -// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L338 -// This functions defines the bounds between which the step function can be found. -// When the optimal bound is found, the area is zoomed in on and optimal value find -// @param derphiFunc {proj} function to calculate the value of the objective function -// derivative at alpha -// @param phiFunc {proj} function to calculate the value of the objective function at alpha -// @param pk {float} search direction -// @param params {dict} parameters controlling non default optimization behaviour -// @param wolfeDict {dict} all data relevant to the calculation of the optimal -// alpha values -// @returns {dict} new alpha, fk and derivative values -i.scalarWolfe:{[derphiFunc;phiFunc;pk;params;wolfeDict] - // set up zoom function constant params - zoomSetup:i.zoomFunc[derphiFunc;phiFunc;;;params]. 
wolfeDict`phi0`derphi0; - // if criteria 1, zoom and break loop - if[i.wolfeCriteria1[wolfeDict;params]; - wolfeDict[`idx]:0w; - wolfeDict[i.zoomReturn]:zoomSetup wolfeDict`alpha0`alpha1`phi_a0`phi_a1`derphi_a0; - :wolfeDict - ]; - // calculate the derivative of the function at the new position - derphiCalc:derphiFunc wolfeDict`alpha1; - // update the new derivative fnc - wolfeDict[`derphi_a1]:derphiCalc`derval; - $[i.wolfeCriteria2[wolfeDict;params]; - [wolfeDict[`alpha_star] :wolfeDict`alpha1; - wolfeDict[`phi_star] :wolfeDict`phi_a1; - wolfeDict[`derphi_star]:derphiCalc`grad; - wolfeDict[`idx]:0w; - wolfeDict - ]; - 0<=wolfeDict`derphi_a1; - [wolfeDict[`idx]:0w; - wolfeDict[i.zoomReturn]:zoomSetup wolfeDict`alpha1`alpha0`phi_a1`phi_a0`derphi_a1 - ]; - // update dictionary and repeat process until criteria is met - [wolfeDict[`alpha0]:wolfeDict`alpha1; - wolfeDict[`alpha1]:2*wolfeDict`alpha1; - wolfeDict[`phi_a0]:wolfeDict`phi_a1; - wolfeDict[`phi_a1]:phiFunc wolfeDict`alpha1; - wolfeDict[`derphi_a0]:wolfeDict`derphi_a1; - wolfeDict[`derphi_a0_fin]:derphiCalc`grad; - wolfeDict[`idx]+:1 - ] - ]; - wolfeDict - } - -// @private -// @kind function -// @category optimize -// @fileoverview function to apply 'zoom' iteratively during linesearch to find optimal alpha -// value satisfying strong Wolfe conditions -// @param derphiFunc {proj} function to calculate the value of the objective function -// derivative at alpha -// @param phiFunc {proj} function to calculate the value of the objective function at alpha -// @param phi0 {float} value of function evaluation at x(k-1) -// @param derphi0 {float} value of objective function derivative at x(k-1) -// @param params {dict} parameters controlling non default optimization behaviour -// @param lst {num[]} bounding conditions for alpha, phi and derphi used in zoom algorithm -// @returns {num[]} new alpha, fk and derivative values -i.zoomFunc:{[derphiFunc;phiFunc;phi0;derphi0;params;lst] - zoomDict:i.zoomKeys!lst,phi0; - zoomDict[`idx`a_rec]:2#0f; - zoomDict:i.stopZoom[;params]i.zoom[derphiFunc;phiFunc;phi0;derphi0;params]/zoomDict; - // if zoom did not converge, set to null - $[count star:zoomDict[i.zoomReturn];star;3#0N] - } - -// @private -// @kind function -// @category optimize -// @fileoverview function to apply an individual step in 'zoom' during linesearch -// to find optimal alpha value satisfying strong Wolfe conditions. An outline of -// the python implementation of this section of the algorithm can be found here -// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L556 -// @param derphiFunc {proj} function to calculate the value of the objective function -// derivative at alpha -// @param phiFunc {proj} function to calculate the value of the objective function at alpha -// @param phi0 {float} value of function evaluation at x(k-1) -// @param derphi0 {float} value of objective function derivative at x(k-1) -// @param params {dict} parameters controlling non default optimization behaviour -// @param zoomDict {dict} parameters to be updated as 'zoom' procedure is applied to find -// the optimal value of alpha -// @returns {dict} parameters calculated for an individual step in line search procedure -// to find optimal alpha value satisfying strong Wolfe conditions -i.zoom:{[derphiFunc;phiFunc;phi0;derphi0;params;zoomDict] - // define high and low values - dalpha:zoomDict[`a_hi]-zoomDict`a_lo; - // These should probably be named a and b since mapping doesn't work properly? 
- highLow:`high`low!$[dalpha>0;zoomDict`a_hi`a_lo;zoomDict`a_lo`a_hi]; - if["i"$zoomDict`idx; - cubicCheck:dalpha*0.2; - findMin:i.cubicMin . zoomDict`a_lo`phi_lo`derphi_lo`a_hi`phi_hi`a_rec`phi_rec - ]; - if[i.quadCriteria[findMin;highLow;cubicCheck;zoomDict]; - quadCheck:0.1*dalpha; - findMin:i.quadMin . zoomDict`a_lo`phi_lo`derphi_lo`a_hi`phi_hi; - if[(findMin > highLow[`low]-quadCheck) | findMin < highLow[`high]+quadCheck; - findMin:zoomDict[`a_lo]+0.5*dalpha - ] - ]; - // update new values depending on fnd_min - phiMin:phiFunc[findMin]; - //first condition, update and continue loop - if[i.zoomCriteria1[phi0;derphi0;phiMin;findMin;zoomDict;params]; - zoomDict[`idx]+:1; - zoomDict[i.zoomKeys1]:zoomDict[`phi_hi`a_hi],findMin,phiMin; - :zoomDict - ]; - // calculate the derivative at the cubic minimum - derphiMin:derphiFunc findMin; - // second scenario, create new features and end the loop - $[i.zoomCriteria2[derphi0;derphiMin;params]; - [zoomDict[`idx]:0w; - zoomDict:zoomDict,i.zoomReturn!findMin,phiMin,enlist derphiMin`grad]; - i.zoomCriteria3[derphiMin;dalpha]; - [zoomDict[`idx]+:1; - zoomDict[i.zoomKeys1,i.zoomKeys2]:zoomDict[`phi_hi`a_hi`a_lo`phi_lo], - findMin,phiMin,derphiMin`derval]; - [zoomDict[`idx]+:1; - zoomDict[i.zoomKeys3,i.zoomKeys2]:zoomDict[`phi_lo`a_lo], - findMin,phiMin,derphiMin`derval] - ]; - zoomDict - } - - -// Vector norm calculation - -// @private -// @kind function -// @category optimization -// @fileoverview calculate the vector norm, used in calculation of the gradient norm at position k. -// Default behaviour is to use the maximum value of the gradient, this can be overwritten by -// a user, this is in line with the default python implementation. -// @param vec {num[]} calculated gradient values -// @param ord {long} order of norm (0W = max; -0W = min) -// @return the gradient norm based on the input gradient -i.vecNorm:{[vec;ord] - if[-7h<>type ord;'"ord must be +/- infinity or a long atom"]; - $[ 0W~ord;max abs vec; - -0W~ord;min abs vec; - sum[abs[vec]xexp ord]xexp 1%ord - ] - } - - -// Stopping conditions - -// @private -// @kind function -// @category optimization -// @fileoverview evaluate if the optimization function has reached a condition which is -// should result in the optimization algorithm being stopped. -// @param dict {dict} optimization function returns -// @param params {dict} parameters controlling non default optimization behaviour -// @return {bool} indication as to if the optimization has met one of it's stopping conditions -i.stopOptimize:{[dict;params] - // is the function evaluation at k an improvement on k-1? - check1:dict[`fk] < dict`prev_fk; - // has x[k] returned a non valid return? - check2:not any null dict`xk; - // have the maximum number of iterations been met? - check3:params[`optimIter] > dict`idx; - // is the gradient at position k below the accepted tolerance - check4:params[`gtol] < dict`gnorm; - check1 & check2 & check3 & check4 - } - -// @private -// @kind function -// @category optimization -// @fileoverview evaluate if the wolfe condition search has reached a condition which is -// should result in the optimization algorithm being stopped. 
-// @param dict {dict} optimization function returns -// @param params {dict} parameters controlling non default optimization behaviour -// @return {bool} indication as to if the optimization has met one of it's stopping conditions -i.stopWolfe:{[dict;params] - dict[`idx] < params`wolfeIter - } - -// @private -// @kind function -// @category optimization -// @fileoverview evaluate if the alpha condition 'zoom' has reached a condition which is -// should result in the optimization algorithm being stopped. -// @param dict {dict} optimization function returns -// @param params {dict} parameters controlling non default optimization behaviour -// @return {bool} indication as to if the optimization has met one of it's stopping conditions -i.stopZoom:{[dict;params] - dict[`idx] < params`zoomIter - } - - -// Function + derivative evaluation at x[k]+ p[k]*alpha[k] - -// @private -// @kind function -// @category optimization -// @fileoverview evaluate the objective function at the position x[k] + step size -// @param func {lambda} the objective function to be minimized -// @param pk {float} step direction -// @param alpha {float} size of the step to be applied -// @param xk {num[]} parameter values at position k -// @param args {dict/num[]} function arguments that do not change per iteration -// @param xk {num[]} -// @returns {float} function evaluated at at the position x[k] + step size -i.phi:{[func;pk;alpha;xk;args] - xk+:alpha*pk; - i.funcEval[func;xk;args] - } - -// @private -// @kind function -// @category optimization -// @fileoverview evaluate the derivative of the objective function at -// the position x[k] + step size -// @param func {lambda} the objective function to be minimized -// @param eps {float} the absolute step size used for numerical approximation -// of the jacobian via forward differences. -// @param pk {float} step direction -// @param alpha {float} size of the step to be applied -// @param xk {num[]} parameter values at position k -// @param args {dict/num[]} function arguments that do not change per iteration -// @returns {dict} gradient and value of scalar derivative -i.derphi:{[func;eps;pk;alpha;xk;args] - // increment xk by a small step size - xk+:alpha*pk; - // get gradient at the new position - gval:i.grad[func;xk;args;eps]; - derval:gval mmu pk; - `grad`derval!(gval;derval) - } - - -// Minimization functions - -// @private -// @kind function -// @category optimization -// @fileoverview find the minimizing solution for a cubic polynomial which -// passes through the points (a,fa), (b,fb) and (c,fc) with a derivative of the -// objective function calculated as fpa. 
This follows the python implementation -// outlined here https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L482 -// @param a {float} position a -// @param b {float} position b -// @param c {float} position c -// @param fa {float} objective function evaluated at a -// @param fb {float} objective function evaluated at b -// @param fc {float} objective function evaluated at c -// @param fpa {float} derivative of the objective function evaluated at a -// @returns {num[]} minimized parameter set as a solution for the cubic polynomial -i.cubicMin:{[a;fa;fpa;b;fb;c;fc] - db:b-a; - dc:c-a; - denom:(db*dc)xexp 2*(db-dc); - d1:2 2#0f; - d1[0]:(1 -1)*xexp[;2]each(db;dc); - d1[1]:(-1 1)*xexp[;3]each(dc;db); - AB:d1 mmu(fb-fa-fpa*db;fc-fa-fpa*dc); - AB%:denom; - radical:AB[1]*AB[1]-3*AB[0]*fpa; - a+(neg[AB[1]]+sqrt(radical))%(3*AB[0]) - } - -// @private -// @kind function -// @category optimization -// @fileoverview find the minimizing solution for a quadratic polynomial which -// passes through the points (a,fa) and (b,fb) with a derivative of the objective function -// calculated as fpa. This follows the python implementation outlined here -// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L516 -// @param a {float} position a -// @param b {float} position b -// @param fa {float} objective function evaluated at a -// @param fb {float} objective function evaluated at b -// @param fpa {float} derivative of the objective function evaluated at a -// @returns {num[]} minimized parameter set as a solution for the quadratic polynomial -i.quadMin:{[a;fa;fpa;b;fb] - db:b-a; - B:(fb-fa-fpa*db)%(db*db); - a-fpa%(2*B) - } - - -// Gradient + function evaluation - -// @private -// @kind function -// @category optimization -// @fileoverview calculation of the gradient of the objective function for all parameters of x -// incremented individually by epsilon -// @param func {lambda} the objective function to be minimized -// @param xk {num[]} parameter values at position k -// @param args {dict/num[]} function arguments that do not change per iteration -// @param eps {float} the absolute step size used for numerical approximation -// of the jacobian via forward differences. -// @returns {dict} gradient of function at position k -i.grad:{[func;xk;args;eps] - fk:i.funcEval[func;xk;args]; - i.gradEval[fk;func;xk;args;eps]each til count xk - } - -// @private -// @kind function -// @category optimization -// @fileoverview calculation of the gradient of the objective function for a single -// parameter set x where one of the indices has been incremented by epsilon -// @param func {lambda} the objective function to be minimized -// @param xk {num[]} parameter values at position k -// @param args {dict/num[]} function arguments that do not change per iteration -// @param eps {float} the absolute step size used for numerical approximation -// of the jacobian via forward differences. 
-// @returns {dict} gradient of function at position k with an individual -// variable x incremented by epsilon -i.gradEval:{[fk;func;xk;args;eps;idx] - if[(::)~fk;fk:i.funcEval[func;xk;args]]; - // increment function optimisation values by epsilon - xk[idx]+:eps; - // Evaluate the gradient - (i.funcEval[func;xk;args]-fk)%eps - } - -// @private -// @kind function -// @category optimization -// @fileoverview evaluate the objective function at position x[k] with relevant -// additional arguments accounted for -// @param {lambda} the objective function to be minimized -// @param xk {num[]} parameter values at position k -// @param args {dict/num[]} function arguments that do not change per iteration -// @returns {float} the objective function evaluated at the appropriate location -i.funcEval:{[func;xk;args] - $[any args~/:((::);());func xk;func[xk;args]] - } - - -// Paramter dictionary - -// @private -// @kind function -// @category -// @fileoverview update the default behaviour of the model optimization procedure -// to account for increased sensitivity to tolerance, the number of iterations, -// how the gradient norm is calculated and various numerical updates including changes -// to the Armijo rule and curvature for calculation of the strong Wolfe conditions. -// @param dict {dict/(::)/()} if a dictionary update the default dictionary to include -// the user defined updates, otherwise use the default dictionary -// @returns {dict} updated or default parameter set depending on user input -i.updDefault:{[dict] - returnKeys:`norm`optimIter`gtol`geps`stepSize`c1`c2`wolfeIter`zoomIter`display; - returnVals:(0W;0W;1e-4;1.49e-8;0w;1e-4;0.9;10;10;0b); - returnDict:returnKeys!returnVals; - if[99h<>type dict;dict:()!()]; - i.wolfeParamCheck[returnDict,dict] - } - -// @private -// @kind function -// @category optimization -// @fileoverview Ensure that the armijo and curvature parameters are consistent -// with the expected values for calculation of the strong Wolfe conditions. -// Return an error on unsuitable conditions otherwise return the input dictionary -// @param dict {dict} updated parameter dictionary containing default information and -// any updated parameter information -// @returns {dict/err} the original input dictionary or an error suggesting that the -// Armijo and curvature parameters are unsuitable -i.wolfeParamCheck:{[dict] - check1:dict[`c1]>dict`c2; - check2:any not dict[`c1`c2]within 0 1; - $[check1 or check2; - '"When evaluating Wolfe conditions the following must hold 0 < c1 < c2 < 1"; - dict - ] - } - - -// Data Formatting - -// @private -// @kind function -// @category optimization -// @fileoverview Ensure that the input parameter x at position 0 which will -// be updated is in a format that is suitable for use with this optimization -// procedure i.e. the data is a list of values. 
-// @param x0 {dict/num/num[]} initial values of x to be optimized
-// @returns {num[]} the initial values of x converted into a suitable numerical list format
-i.dataFormat:{[x0]
-  "f"$$[99h=type x0;raze value x0;0h >type x0;enlist x0; x0]
-  }
-
-
-// Conditional checks for Wolfe, zoom and quadratic condition evaluation
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview ensure new values lead to improvements over the older values
-// @param wolfeDict {dict} the current iterations values for the objective function and the
-//   derivative of the objective function evaluated
-// @param params {dict} parameter dictionary containing the updated/default information
-//   used to modify the behaviour of the system as a whole
-// @returns {bool} indication as to if a further zoom is required
-i.wolfeCriteria1:{[wolfeDict;params]
-  check1:wolfeDict[`phi_a1]>wolfeDict[`phi0]+params[`c1]*prd wolfeDict`alpha1`derphi0;
-  check2:(wolfeDict[`phi_a1]>=wolfeDict`phi_a0) and (1<wolfeDict`idx);
-  check1 or check2
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview check that the strong Wolfe conditions have been met
-// @param wolfeDict {dict} the current iterations values for the objective function and the
-//   derivative of the objective function evaluated
-// @param params {dict} parameter dictionary containing the updated/default information
-//   used to modify the behaviour of the system as a whole
-// @returns {bool} indication as to if the line search can be terminated
-i.wolfeCriteria2:{[wolfeDict;params]
-  neg[params[`c2]*wolfeDict`derphi0]>=abs wolfeDict`derphi_a1
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview check if there is need to apply quadratic minimum calculation
-// @param findMin {num[]} the currently calculated minimum values
-// @param highLow {dict} upper and lower bounds of the search space
-// @param cubicCheck {float} interpolation check parameter
-// @param zoomDict {dict} parameters to be updated as 'zoom' procedure is applied to find
-//   the optimal value of alpha
-// @returns {bool} indication as to if the value of findMin needs to be updated
-i.quadCriteria:{[findMin;highLow;cubicCheck;zoomDict]
-  // On initial iteration the minimum has not been calculated
-  //   as such criteria should exit early to complete the quadratic calculation
-  if[findMin~();:1b];
-  check1:0=zoomDict`idx;
-  check2:findMin>highLow[`low] -cubicCheck;
-  check3:findMin<highLow[`high]+cubicCheck;
-  any check1,check2,check3
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview check if the zoom conditions are sufficient
-// @param phi0 {float} value of function evaluation at x(k-1)
-// @param derphi0 {float} value of objective function derivative at x(k-1)
-// @param phiMin {float} function evaluated at the current minimum
-// @param findMin {num[]} the currently calculated minimum values
-// @param zoomDict {dict} parameters to be updated as 'zoom' procedure is applied
-// @param params {dict} parameter dictionary containing the updated/default information
-// @returns {bool} indication as to if further zooming is required
-i.zoomCriteria1:{[phi0;derphi0;phiMin;findMin;zoomDict;params]
-  check1:phiMin>phi0+findMin*derphi0*params`c1;
-  check2:phiMin>=zoomDict`phi_lo;
-  check1 or check2
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview check if the zoom conditions are sufficient
-// @param derphi0 {float} derivative of the objective function evaluated at index 0
-// @param derphiMin {float} derivative of the objective function evaluated at the current minimum
-// @param params {dict} parameter dictionary containing the updated/default information
-//   used to modify the behaviour of the system as a whole
-// @returns {bool} indication as to if further zooming is required
-i.zoomCriteria2:{[derphi0;derphiMin;params]
-  abs[derphiMin`derval]<=neg derphi0*params`c2
-  }
-
-// @private
-// @kind function
-// @category optimization
-// @fileoverview check if the zoom conditions are sufficient
-// @param derphiMin {float} derivative of the objective function evaluated at the current minimum
-// @param dalpha {float} difference between the upper and lower bound of the zoom bracket
-// @returns {bool} indication as to if further zooming is required
-i.zoomCriteria3:{[derphiMin;dalpha]
-  0<=derphiMin[`derval]*dalpha
-  }
-
-
-// Zoom dictionary
-
-//input keys of zoom dictionary
-i.zoomKeys:`a_lo`a_hi`phi_lo`phi_hi`derphi_lo`phi_rec;
-// keys to be updated in zoom each iteration
-i.zoomKeys1:`phi_rec`a_rec`a_hi`phi_hi;
-// extra keys that have to be updated in some scenarios
-i.zoomKeys2:`a_lo`phi_lo`derphi_lo;
-i.zoomKeys3:`phi_rec`a_rec
-// final updated keys to be used
-i.zoomReturn:`alpha_star`phi_star`derphi_star;
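Before the refactored replacement below, a brief usage sketch of the public entry point. The call signature follows from the code shown in this patch; the returned figures are illustrative only and depend on the tolerance defaults:

```q
// Minimize f(x;y) = (x-1)^2 + (y-2)^2 from a starting guess of (0;0);
// args and params are passed as (::) so that the defaults apply
q)f:{sum(x-1 2f)xexp 2}
q).ml.optimize.BFGS[f;0 0f;(::);(::)]
xVals  | 1 2f      / converges towards the analytic minimum (1;2)
funcRet| 1.2e-17   / illustrative; exact value depends on tolerances
numIter| 2         / illustrative
```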
diff --git a/optimize/optimize.q b/optimize/optimize.q
new file mode 100644
index 00000000..26dad8f5
--- /dev/null
+++ b/optimize/optimize.q
@@ -0,0 +1,77 @@
+// optimize/optimize.q - Optimization algorithms
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Contains an implementation of the BFGS algorithm.
+
+// Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm. This implementation
+// is based on
+// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/optimize.py#L1058
+// and is a quasi-Newton hill-climbing optimization technique used to find a
+// preferably twice continuously differentiable stationary point of a
+// function.
+
+// An outline of the algorithm mathematically is provided here:
+// https://en.wikipedia.org/wiki/Broyden-Fletcher-Goldfarb-Shanno_algorithm
+
+\d .ml
+
+// @kind function
+// @category optimization
+// @desc Optimize a function using the Broyden-Fletcher-Goldfarb-Shanno
+//   (BFGS) algorithm
+// @param func {fn} Function to be optimized. This function should take
+//   as its arguments a list/dictionary of parameters to be optimized and
+//   a list/dictionary of additional unchanging arguments
+// @param x0 {number[]|dictionary} The first guess at the parameters to be
+//   optimized as a list or dictionary of numeric values
+// @param args {list|dictionary|(::)} Any unchanging parameters required for
+//   evaluation of the function, these should be in the order that they are to
+//   be applied to the function
+// @param params {dictionary} Any modifications to be applied to the
+//   optimization procedure e.g.
+//   - display {boolean} Results at each optimization iteration to be printed
+//   - optimIter {int} Maximum number of iterations in optimization procedure
+//   - zoomIter {int} Maximum number of iterations when finding optimal zoom
+//   - wolfeIter {int} Maximum number of iterations in the Wolfe condition
+//     search
+//   - norm {int} Order of norm (0W = max; -0W = min) otherwise calculated via
+//     sum[abs[vec]xexp norm]xexp 1%norm
+//   - gtol {float} Gradient norm must be less than gtol before successful
+//     termination
+//   - geps {float} The absolute step size used for numerical approximation of
+//     the jacobian via forward differences
+//   - stepSize {float} Maximum allowable 'alpha' step size between
+//     calculations
+//   - c1 {float} Armijo rule condition
+//   - c2 {int} Curvature conditions rule
+// @returns {dictionary} Contains the estimated optimal parameters, number of
+//   iterations and the evaluated return of the function being optimized
+optimize.BFGS:{[func;x0;args;params]
+  // Update the default behaviour of the parameters
+  params:i.updDefault[params];
+  // Format x0 based on input type
+  x0:i.dataFormat[x0];
+  // Evaluate the function at the starting point
+  f0:i.funcEval[func;x0;args];
+  // Calculate the starting gradient
+  gk:i.grad[func;x0;args;params`geps];
+  // Initialize Hessian matrix as identity matrix
+  hess:.ml.eye count x0;
+  // Set initial step guess i.e. the step before f0
+  fkPrev:f0+sqrt[sum gk*gk]%2;
+  gradNorm:i.vecNorm[gk;params`norm];
+  optimKeys:`xk`fk`fkPrev`gk`xkPrev`hess`gnorm`I`idx;
+  optimVals:(x0;f0;fkPrev;gk;0n;hess;gradNorm;hess;0);
+  optimDict:optimKeys!optimVals;
+  // Run optimization until one of the stopping conditions is met
+  optimDict:i.stopOptimize[;params]i.BFGSFunction[func;;args;params]/optimDict;
+  returnKeys:`xVals`funcRet`numIter;
+  // If function returned due to a null xVal or the new value being worse than
+  // the previous value then return the k-1 value
+  nullOptim:not any null optimDict`xk;
+  fkCompare:optimDict[`fk]<optimDict`fkPrev;
+  returnVals:$[nullOptim and fkCompare;
+    optimDict`xk`fk`idx;
+    optimDict`xkPrev`fkPrev`idx
+    ];
+  returnKeys!returnVals
+  }
diff --git a/optimize/utils.q b/optimize/utils.q
new file mode 100644
--- /dev/null
+++ b/optimize/utils.q
@@ -0,0 +1,679 @@
+// optimize/utils.q - Utilities for the optimize library
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Utilities for the application of the BFGS optimization algorithm
+
+\d .ml
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Apply a single iteration of the BFGS algorithm, updating the
+// parameter estimates, function return, gradient and Hessian matrix
+// @param func {fn} Function to be optimized
+// @param optimDict {dictionary} Variables to be updated at each iteration
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @param params {dictionary} Parameters controlling non default optimization
+// behaviour
+// @returns {dictionary} Updated optimization values
+i.BFGSFunction:{[func;optimDict;args;params]
+  // Calculate the search direction pk
+  pk:neg mmu[optimDict`hess;optimDict`gk];
+  // Complete a line search to find an alpha satisfying Wolfe conditions
+  wolfe:i.wolfeSearch[;;;pk;func;;args;params]. optimDict`fk`fkPrev`gk`xk;
+  // Update the previous and current function returns and get the new gradient
+  alpha:wolfe 0;
+  optimDict[`fkPrev]:optimDict`fk;
+  optimDict[`fk]:wolfe 1;
+  gNew:wolfe 2;
+  // Update the previous value of x and calculate the step taken at x(k)
+  optimDict[`xkPrev]:optimDict`xk;
+  sk:alpha*pk;
+  // Update values of x at the new position k
+  optimDict[`xk]:optimDict[`xkPrev]+sk;
+  // If null gNew, then get gradient of new x value
+  if[any null gNew;gNew:i.grad[func;optimDict`xk;args;params`geps]];
+  // Subtract new gradients
+  yk:gNew-optimDict`gk;
+  optimDict[`gk]:gNew;
+  // Get new norm of gradient
+  optimDict[`gnorm]:i.vecNorm[optimDict`gk;params`norm];
+  // Calculate new hessian matrix for next iteration
+  rhok:1%mmu[yk;sk];
+  if[0w=rhok;
+    rhok:1000f;
+    -1"Division by zero in calculation of rhok, assuming rhok large";
+    ];
+  A1:optimDict[`I]-sk*\:yk*rhok;
+  A2:optimDict[`I]-yk*\:sk*rhok;
+  hessMul:mmu[A1;mmu[optimDict`hess;A2]];
+  optimDict[`hess]:hessMul+rhok*(sk*/:sk);
+  // if x(k) returns infinite value update gnorm and fk
+  if[0w in abs optimDict`xk;optimDict[`gnorm`fk]:(0n;0w)];
+  optimDict[`idx]+:1;
+  if[params`display;show optimDict;-1"";];
+  optimDict
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Complete a line search across an unconstrained minimization
+// problem making use of wolfe conditions to constrain the search. The naming
+// convention for dictionary keys in this implementation is based on the
+// python implementation of the same functionality here
+// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L193
+// @param fk {float} Function return evaluated at position k
+// @param fkPrev {float} Function return evaluated at position k-1
+// @param gk {float} Gradient at position k
+// @param pk {float} Search direction
+// @param func {fn} Function being optimized
+// @param xk {number[]} Parameter values at position k
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @param params {dictionary} Parameters controlling non default optimization
+// behaviour
+// @return {number[]} New alpha, fk and derivative values
+i.wolfeSearch:{[fk;fkPrev;gk;pk;func;xk;args;params]
+  phiFunc   :i.phi[func;pk;;xk;args];
+  derPhiFunc:i.derPhi[func;params`geps;pk;;xk;args];
+  // Initial Wolfe conditions
+  wolfeKeys:`idx`alpha0`phi0`phia0;
+  wolfeVals:(0;0;fk;fk);
+  wolfeDict:wolfeKeys!wolfeVals;
+  // Calculate the derivative at that phi0
+  derPhi0:gk mmu pk;
+  wolfeDict[`derPhia0`derPhi0]:2#derPhi0;
+  // Calculate step size this should be 0 < x < 1
+  // with min(x;maxstepsize) or 1f otherwise
+  alpha:1.01*2*(fk-fkPrev)%derPhi0;
+  alphaVal:$[alpha within 0 1f;min(alpha;params`stepSize);1f];
+  wolfeDict[`alpha1]:alphaVal;
+  // Function value at alpha1
+  wolfeDict[`phia1]:phiFunc wolfeDict`alpha1;
+  // Repeat until wolfe criteria is reached or max iterations have been done
+  // to get new alpha, phi and derPhi values
+  wolfeDict:i.stopWolfe[;params]
+    i.scalarWolfe[derPhiFunc;phiFunc;pk;params]/wolfeDict;
+  // If the line search did not converge, use last alpha, phi and derPhi
+  $[not any null raze wolfeDict`alphaStar`phiStar`derPhiStar;
wolfeDict`alphaStar`phiStar`derPhiStar; + wolfeDict`alpha1`phia1`derPhia0Fin + ] + } + +// @private +// @kind function +// @category optimizationUtility +// @desc Apply a scalar search to find an alpha value that satisfies +// strong Wolfe conditions, a python implementation of this is outlined here +// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L338 +// This functions defines the bounds between which the step function can +// be found. When the optimal bound is found, the area is zoomed recursively +// until the optimal value is found +// @param derPhiFunc {fn} Function to calculate the value of the objective +// function derivative at alpha +// @param phiFunc {fn} Function to calculate the value of the objective +// function at alpha +// @param pk {float} Search direction +// @param params {dictionary} Parameters controlling non default optimization +// behaviour +// @param wolfeDict {dictionary} All data relevant to the calculation of the +// optimal alpha values +// @returns {dictionary} New alpha, fk and derivative values +i.scalarWolfe:{[derPhiFunc;phiFunc;pk;params;wolfeDict] + // Set up zoom function constant params + zoomSetup:i.zoomFunc[derPhiFunc;phiFunc;;;params]. wolfeDict`phi0`derPhi0; + // If criteria 1 is met, zoom and break loop + if[i.wolfeCriteria1[wolfeDict;params]; + wolfeDict[`idx]:0w; + wolfeVals:wolfeDict`alpha0`alpha1`phia0`phia1`derPhia0; + updZoom:zoomSetup wolfeVals; + wolfeDict[i.zoomReturn]:updZoom; + :wolfeDict + ]; + // Calculate the derivative of the function at the new position + derPhiCalc:derPhiFunc wolfeDict`alpha1; + // Update the new derivative function + wolfeDict[`derPhia1]:derPhiCalc`derval; + $[i.wolfeCriteria2[wolfeDict;params]; + [wolfeDict[`alphaStar]:wolfeDict`alpha1; + wolfeDict[`phiStar]:wolfeDict`phia1; + wolfeDict[`derPhiStar]:derPhiCalc`grad; + wolfeDict[`idx]:0w; + wolfeDict + ]; + 0<=wolfeDict`derPhia1; + [wolfeDict[`idx]:0w; + updZoom:zoomSetup wolfeDict`alpha1`alpha0`phia1`phia0`derPhia1; + wolfeDict[i.zoomReturn]:updZoom + ]; + // Update dictionary and repeat process until criteria is met + [wolfeDict[`alpha0]:wolfeDict`alpha1; + wolfeDict[`alpha1]:2*wolfeDict`alpha1; + wolfeDict[`phia0]:wolfeDict`phia1; + wolfeDict[`phia1]:phiFunc wolfeDict`alpha1; + wolfeDict[`derPhia0]:wolfeDict`derPhia1; + wolfeDict[`derPhia0Fin]:derPhiCalc`grad; + wolfeDict[`idx]+:1 + ] + ]; + wolfeDict + } + +// @private +// @kind function +// @category optimizeUtility +// @desc Function to apply 'zoom' iteratively during linesearch to find +// optimal alpha value satisfying strong Wolfe conditions +// @param derPhiFunc {fn} Function to calculate the value of the objective +// function derivative at alpha +// @param phiFunc {fn} Function to calculate the value of the objective +// function at alpha +// @param phi0 {float} Value of function evaluation at x(k-1) +// @param derPhi0 {float} Value of objective function derivative at x(k-1) +// @param params {dictionary} Parameters controlling non default optimization +// behaviour +// @param cond {number[]} Bounding conditions for alpha, phi and derPhi used in +// zoom algorithm +// @returns {number[]} New alpha, fk and derivative values +i.zoomFunc:{[derPhiFunc;phiFunc;phi0;derPhi0;params;cond] + zoomDict:i.zoomKeys!cond,phi0; + zoomDict[`idx`aRec]:2#0f; + zoomDict:i.stopZoom[;params] + i.zoom[derPhiFunc;phiFunc;phi0;derPhi0;params]/zoomDict; + // If zoom did not converge, set to null + $[count star:zoomDict[i.zoomReturn];star;3#0N] + } + +// @private +// @kind function +// @category 
optimizeUtility
+// @desc Function to apply an individual step in 'zoom' during
+// linesearch to find optimal alpha value satisfying strong Wolfe conditions.
+// An outline of the python implementation of this section of the algorithm
+// can be found here
+// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L556
+// @param derPhiFunc {fn} Function to calculate the value of the objective
+// function derivative at alpha
+// @param phiFunc {fn} Function to calculate the value of the objective
+// function at alpha
+// @param phi0 {float} Value of function evaluation at x(k-1)
+// @param derPhi0 {float} Value of objective function derivative at x(k-1)
+// @param params {dictionary} Parameters controlling non default optimization
+// behaviour
+// @param zoomDict {dictionary} Parameters to be updated as 'zoom' procedure is
+// applied to find the optimal value of alpha
+// @returns {dictionary} Parameters calculated for an individual step in line
+// search procedure to find optimal alpha value satisfying strong Wolfe
+// conditions
+i.zoom:{[derPhiFunc;phiFunc;phi0;derPhi0;params;zoomDict]
+  alphaDiff:zoomDict[`aHi]-zoomDict`aLo;
+  // Define high and low values
+  highLowVal:$[alphaDiff>0;zoomDict`aHi`aLo;zoomDict`aLo`aHi];
+  highLow:`high`low!highLowVal;
+  if["i"$zoomDict`idx;
+    cubicCheck:alphaDiff*0.2;
+    findMin:i.cubicMin . zoomDict`aLo`phiLo`derPhiLo`aHi`phiHi`aRec`phiRec
+    ];
+  if[i.quadCriteria[findMin;highLow;cubicCheck;zoomDict];
+    quadCheck:0.1*alphaDiff;
+    findMin:i.quadMin . zoomDict`aLo`phiLo`derPhiLo`aHi`phiHi;
+    lowerCheck:findMin<highLow[`high]+quadCheck;
+    upperCheck:findMin>highLow[`low]-quadCheck;
+    if[upperCheck|lowerCheck;
+      findMin:zoomDict[`aLo]+0.5*alphaDiff
+      ]
+    ];
+  // Update new values depending on findMin
+  phiMin:phiFunc[findMin];
+  // First condition, update and continue loop
+  if[i.zoomCriteria1[phi0;derPhi0;phiMin;findMin;zoomDict;params];
+    zoomDict[`idx]+:1;
+    zoomDict[i.zoomKeys1]:zoomDict[`phiHi`aHi],findMin,phiMin;
+    :zoomDict
+    ];
+  // Calculate the derivative at the cubic minimum
+  derPhiMin:derPhiFunc findMin;
+  // Second scenario, create new features and end the loop
+  $[i.zoomCriteria2[derPhi0;derPhiMin;params];
+    [zoomDict[`idx]:0w;
+      zoomDict:zoomDict,i.zoomReturn!findMin,phiMin,enlist derPhiMin`grad
+      ];
+    i.zoomCriteria3[derPhiMin;alphaDiff];
+    [zoomDict[`idx]+:1;
+      zoomDict[i.zoomKeys1,i.zoomKeys2]:zoomDict[`phiHi`aHi`aLo`phiLo],
+        findMin,phiMin,derPhiMin`derval
+      ];
+    [zoomDict[`idx]+:1;
+      zoomDict[i.zoomKeys3,i.zoomKeys2]:zoomDict[`phiLo`aLo],
+        findMin,phiMin,derPhiMin`derval
+      ]
+    ];
+  zoomDict
+  }
+
+// Vector norm calculation
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Calculate the vector norm, used in calculation of the gradient
+// norm at position k. Default behaviour is to use the maximum value of the
+// gradient, this can be overwritten by a user, this is in line with the
+// default python implementation.
+// @param gradVals {number[]} Vector of calculated gradient values
+// @param ord {long} Order of norm (0W = max; -0W = min)
+// @return {float} Gradient norm based on the input gradient
+i.vecNorm:{[gradVals;ord]
+  if[-7h<>type ord;'"ord must be +/- infinity or a long atom"];
+  $[0W~ord;max abs gradVals;
+    -0W~ord;min abs gradVals;
+    sum[abs[gradVals]xexp ord]xexp 1%ord
+    ]
+  }
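+
+// Example (illustrative; the values follow directly from the definition
+// above, assuming the library is loaded into the .ml namespace):
+// q).ml.i.vecNorm[3 -4f;0W]  / 4f - maximum absolute value
+// q).ml.i.vecNorm[3 -4f;-0W] / 3f - minimum absolute value
+// q).ml.i.vecNorm[3 -4f;2]   / 5f - sum[abs[vec]xexp 2]xexp 1%2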
+
+// Stopping conditions
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Evaluate if the optimization function has reached a condition
+// which should result in the optimization algorithm being stopped
+// @param dict {dictionary} Optimization function returns
+// @param params {dictionary} Parameters controlling non default optimization
+// behaviour
+// @return {boolean} Indication as to if the optimization has met one of its
+// stopping conditions
+i.stopOptimize:{[dict;params]
+  // Is the function evaluation at k an improvement on k-1?
+  check1:dict[`fk]<dict`fkPrev;
+  // Has x(k) returned a non valid return?
+  check2:not any null dict`xk;
+  // Have the maximum number of iterations been met?
+  check3:params[`optimIter]>dict`idx;
+  // Is the gradient at position k below the accepted tolerance
+  check4:params[`gtol]<dict`gnorm;
+  check1&check2&check3&check4
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Evaluate if the Wolfe condition search has reached a condition
+// which should result in the search being stopped
+// @param dict {dictionary} Optimization function returns
+// @param params {dictionary} Parameters controlling non default optimization
+// behaviour
+// @return {boolean} Indication as to if the search has met one of its
+// stopping conditions
+i.stopWolfe:{[dict;params]
+  dict[`idx]<params`wolfeIter
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Evaluate if the alpha condition 'zoom' has reached a condition
+// which should result in the zoom being stopped
+// @param dict {dictionary} Optimization function returns
+// @param params {dictionary} Parameters controlling non default optimization
+// behaviour
+// @return {boolean} Indication as to if the zoom has met one of its
+// stopping conditions
+i.stopZoom:{[dict;params]
+  dict[`idx]<params`zoomIter
+  }
+
+// Function + derivative evaluation at x(k) + p(k)*alpha(k)
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Evaluate the objective function at the position x(k) + step size
+// @param func {fn} The objective function to be minimized
+// @param pk {float} Step direction
+// @param alpha {float} Size of the step to be applied
+// @param xk {number[]} Parameter values at position k
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @returns {float} Function evaluated at the position x(k) + step size
+i.phi:{[func;pk;alpha;xk;args]
+  xk+:alpha*pk;
+  i.funcEval[func;xk;args]
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Evaluate the derivative of the objective function at
+// the position x(k) + step size
+// @param func {fn} The objective function to be minimized
+// @param eps {float} The absolute step size used for numerical approximation
+// of the jacobian via forward differences
+// @param pk {float} Step direction
+// @param alpha {float} Size of the step to be applied
+// @param xk {number[]} Parameter values at position k
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @returns {dictionary} Gradient and value of scalar derivative
+i.derPhi:{[func;eps;pk;alpha;xk;args]
+  // Increment xk by a small step size
+  xk+:alpha*pk;
+  // Get gradient at the new position
+  gval:i.grad[func;xk;args;eps];
+  derval:gval mmu pk;
+  `grad`derval!(gval;derval)
+  }
+
+// Minimization functions
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Find the minimizing solution for a cubic polynomial which
+// passes through the points (a,fa), (b,fb) and (c,fc) with a derivative of
+// the objective function calculated as fpa. This follows the python
+// implementation outlined here
+// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L482
+// @param a {float} Position a
+// @param fa {float} Objective function evaluated at a
+// @param fpa {float} Derivative of the objective function evaluated at a
+// @param b {float} Position b
+// @param fb {float} Objective function evaluated at b
+// @param c {float} Position c
+// @param fc {float} Objective function evaluated at c
+// @returns {number[]} Minimized parameter set as a solution for the cubic
+// polynomial
+i.cubicMin:{[a;fa;fpa;b;fb;c;fc]
+  db:b-a;
+  dc:c-a;
+  denom:(db*dc)xexp 2*(db-dc);
+  d1:2 2#0f;
+  d1[0]:(1 -1)*xexp[;2]each(db;dc);
+  d1[1]:(-1 1)*xexp[;3]each(dc;db);
+  AB:d1 mmu(fb-fa-fpa*db;fc-fa-fpa*dc);
+  AB%:denom;
+  radical:AB[1]*AB[1]-3*AB[0]*fpa;
+  a+(neg[AB[1]]+sqrt(radical))%(3*AB[0])
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Find the minimizing solution for a quadratic polynomial which
+// passes through the points (a,fa) and (b,fb) with a derivative of the
+// objective function calculated as fpa. This follows the python
+// implementation outlined here
+// https://github.com/scipy/scipy/blob/v1.5.0/scipy/optimize/linesearch.py#L516
+// @param a {float} Position a
+// @param fa {float} Objective function evaluated at a
+// @param fpa {float} Derivative of the objective function evaluated at a
+// @param b {float} Position b
+// @param fb {float} Objective function evaluated at b
+// @returns {number[]} Minimized parameter set as a solution for the quadratic
+// polynomial
+i.quadMin:{[a;fa;fpa;b;fb]
+  db:b-a;
+  B:(fb-fa-fpa*db)%(db*db);
+  a-fpa%(2*B)
+  }
+
+// Gradient + function evaluation
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Calculation of the gradient of the objective function for all
+// parameters of x incremented individually by epsilon
+// @param func {fn} The objective function to be minimized
+// @param xk {number[]} Parameter values at position k
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @param eps {float} The absolute step size used for numerical approximation
+// of the jacobian via forward differences
+// @returns {dictionary} Gradient of function at position k
+i.grad:{[func;xk;args;eps]
+  fk:i.funcEval[func;xk;args];
+  i.gradEval[fk;func;xk;args;eps]each til count xk
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Calculation of the gradient of the objective function for a
+// single parameter set x where one of the indices has been incremented by
+// epsilon
+// @param fk {float} Function evaluated at position k
+// @param func {fn} The objective function to be minimized
+// @param xk {number[]} Parameter values at position k
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @param eps {float} The absolute step size used for numerical approximation
+// of the jacobian via forward differences
+// @param idx {long} Index of the variable x to be incremented
+// @returns {dictionary} Gradient of function at position k with an individual
+// variable x incremented by epsilon
+i.gradEval:{[fk;func;xk;args;eps;idx]
+  if[(::)~fk;fk:i.funcEval[func;xk;args]];
+  // Increment function optimization values by epsilon
+  xk[idx]+:eps;
+  // Evaluate the gradient
+  (i.funcEval[func;xk;args]-fk)%eps
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Evaluate the objective function at position x(k) with relevant
+// additional arguments accounted for
+// @param func {fn} The objective function to be minimized
+// @param xk {number[]} Parameter values at position k
+// @param args {dictionary|number[]} Function arguments that do not change per
+// iteration
+// @returns {float} The objective function evaluated at the appropriate
+// location
+i.funcEval:{[func;xk;args]
+  $[any args~/:((::);());func xk;func[xk;args]]
+  }
+
+// Parameter dictionary
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Update the default behaviour of the model optimization procedure
+// to account for increased sensitivity to tolerance, the number of
+// iterations, how the gradient norm is calculated and various numerical
+// updates including changes to the Armijo rule and curvature for calculation
+// of the strong Wolfe conditions
+// @param dict {dictionary|(::)|()} If a dictionary, update the default
+// dictionary to include the user defined updates, otherwise use the default
+// dictionary
+// @returns {dictionary} Updated or default parameter set depending on user
+// input
+i.updDefault:{[dict]
+  returnKeys:`norm`optimIter`gtol`geps`stepSize`c1`c2`wolfeIter`zoomIter`display;
+  returnVals:(0W;0W;1e-4;1.49e-8;0w;1e-4;0.9;10;10;0b);
+  returnDict:returnKeys!returnVals;
+  if[99h<>type dict;dict:()!()];
+  i.wolfeParamCheck[returnDict,dict]
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Ensure that the Armijo and curvature parameters are consistent
+// with the expected values for calculation of the strong Wolfe conditions
+// @param dict {dictionary} Updated parameter dictionary containing default
+// information and any updated parameter information
+// @returns {dictionary|err} The original input dictionary or an error
+// suggesting that the Armijo and curvature parameters are unsuitable
+i.wolfeParamCheck:{[dict]
+  check1:dict[`c1]>dict`c2;
+  check2:any not dict[`c1`c2]within 0 1;
+  $[check1 or check2;
+    '"When evaluating Wolfe conditions the following must hold 0 < c1 < c2 < 1";
+    dict
+    ]
+  }
+
+// Data Formatting
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Ensure that the input parameter x at position 0 which
+// will be updated is in a format that is suitable for use with this
+// optimization procedure i.e. the data is a list of values.
+// @param x0 {dictionary|number|number[]} Initial values of x to be optimized
+// @returns {number[]} The initial values of x converted into a suitable
+// numerical list format
+i.dataFormat:{[x0]
+  "f"$$[99h=type x0;raze value x0;0h>type x0;enlist x0;x0]
+  }
+
+// Conditional checks for Wolfe, zoom and quadratic condition evaluation
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Ensure new values lead to improvements over the older values
+// @param wolfeDict {dictionary} The current iterations values for the
+// objective function and the derivative of the objective function evaluated
+// @param params {dictionary} Parameter dictionary containing the updated/
+// default information used to modify the behaviour of the system as a whole
+// @returns {boolean} Indication as to if a further zoom is required
+i.wolfeCriteria1:{[wolfeDict;params]
+  prdVal:prd wolfeDict`alpha1`derPhi0;
+  check1:wolfeDict[`phia1]>wolfeDict[`phi0]+params[`c1]*prdVal;
+  prevPhi:wolfeDict[`phia1]>=wolfeDict`phia0;
+  wolfeIdx:1<wolfeDict`idx;
+  check2:prevPhi and wolfeIdx;
+  check1 or check2
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Check that the strong Wolfe conditions have been met
+// @param wolfeDict {dictionary} The current iterations values for the
+// objective function and the derivative of the objective function evaluated
+// @param params {dictionary} Parameter dictionary containing the updated/
+// default information used to modify the behaviour of the system as a whole
+// @returns {boolean} Indication as to if the line search can be terminated
+i.wolfeCriteria2:{[wolfeDict;params]
+  neg[params[`c2]*wolfeDict`derPhi0]>=abs wolfeDict`derPhia1
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Check if there is need to apply quadratic minimum calculation
+// @param findMin {number[]} The currently calculated minimum values
+// @param highLow {dictionary} Upper and lower bounds of the search space
+// @param cubicCheck {float} Interpolation check parameter
+// @param zoomDict {dictionary} Parameters to be updated as 'zoom' procedure is
+// applied to find the optimal value of alpha
+// @returns {boolean} Indication as to if the value of findMin needs to be
+// updated
+i.quadCriteria:{[findMin;highLow;cubicCheck;zoomDict]
+  // On first iteration the initial minimum has not been calculated
+  // as such criteria should exit early to complete the quadratic calculation
+  if[findMin~();:1b];
+  check1:0=zoomDict`idx;
+  check2:findMin>highLow[`low]-cubicCheck;
+  check3:findMin<highLow[`high]+cubicCheck;
+  any check1,check2,check3
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Check if the zoom conditions are sufficient
+// @param phi0 {float} Value of function evaluation at x(k-1)
+// @param derPhi0 {float} Value of objective function derivative at x(k-1)
+// @param phiMin {float} Function evaluated at the current minimum
+// @param findMin {number[]} The currently calculated minimum values
+// @param zoomDict {dictionary} Parameters to be updated as 'zoom' procedure
+// is applied to find the optimal value of alpha
+// @param params {dictionary} Parameter dictionary containing the
+// updated/default information used to modify the behaviour of the system
+// as a whole
+// @returns {boolean} Indication as to if further zooming is required
+i.zoomCriteria1:{[phi0;derPhi0;phiMin;findMin;zoomDict;params]
+  check1:phiMin>phi0+findMin*derPhi0*params`c1;
+  check2:phiMin>=zoomDict`phiLo;
+  check1 or check2
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Check if the zoom conditions are sufficient
+// @param derPhi0 {float} Derivative of the objective function evaluated at
+// index 0
+// @param derPhiMin {float} Derivative of the objective function evaluated at
+// the current minimum
+// @param params {dictionary} Parameter dictionary containing the
+// updated/default information used to modify the behaviour of the system
+// as a whole
+// @returns {boolean} Indication as to if further zooming is required
+i.zoomCriteria2:{[derPhi0;derPhiMin;params]
+  abs[derPhiMin`derval]<=neg derPhi0*params`c2
+  }
+
+// @private
+// @kind function
+// @category optimizationUtility
+// @desc Check if the zoom conditions are sufficient
+// @param derPhiMin {float} Derivative of the objective function evaluated at
+// the current minimum
+// @param alphaDiff {float} Difference between the upper and lower bound of the
+// zoom bracket
+// @returns {boolean} Indication as to if further zooming is required
+i.zoomCriteria3:{[derPhiMin;alphaDiff]
+  0<=derPhiMin[`derval]*alphaDiff
+  }
+
+// Zoom dictionary
+
+// @private
+// @kind symbol
+// @category optimizationUtility
+// @desc Input keys of zoom dictionary
+// @type symbol[]
+i.zoomKeys:`aLo`aHi`phiLo`phiHi`derPhiLo`phiRec;
+
+// @private
+// @kind symbol
+// @category optimizationUtility
+// @desc Keys to be updated in zoom each iteration
+// @type symbol[]
+i.zoomKeys1:`phiRec`aRec`aHi`phiHi;
+
+// @private
+// @kind symbol
+// @category optimizationUtility
+// @desc Extra keys that have to be updated in some scenarios
+// @type symbol[]
+i.zoomKeys2:`aLo`phiLo`derPhiLo;
+
+// @private
+// @kind symbol
+// @category optimizationUtility
+// @desc Extra keys that have to be updated in some scenarios
+// @type symbol[]
+i.zoomKeys3:`phiRec`aRec;
+
+// @private
+// @kind symbol
+// @category optimizationUtility
+// @desc Final updated keys to be used
+// @type symbol[]
+i.zoomReturn:`alphaStar`phiStar`derPhiStar;
diff --git a/stats/README.md b/stats/README.md
new file mode 100644
index 00000000..046b4bc6
--- /dev/null
+++ b/stats/README.md
@@ -0,0 +1,34 @@
+# Statistical Analysis
+
+This folder contains implementations of statistical methods for data exploration and the estimation of model parameters.
+
+## Functionality
+
+The functionality contained within this section ranges from descriptive statistical methods, which give more insight into data, to linear regression estimation methods used to investigate unknown parameters in a model. The linear regression implementations include `Ordinary Least Squares` and `Weighted Least Squares`.
+
+## Requirements
+
+- kdb+ > 3.5
+
+## Installation
+
+Place the `ml` library in `$QHOME` and load it into a q instance using `ml/ml.q`.
+
+### Load
+
+The following will load the statistics functionality into the `.ml` namespace:
+```q
+q)\l ml/ml.q
+q).ml.loadfile`:stats/init.q
+```
+
+## Documentation
+
+Documentation is available on the [Statistics](https://code.kx.com/q/ml/toolkit/statistics/) homepage.
+
+## Status
+
+The statistics library is still in development. Further functionality and improvements will be made to the library on an ongoing basis.
+
+If you have any issues, questions or suggestions, please write to ai@kx.com.
diff --git a/stats/describe.json b/stats/describe.json
new file mode 100644
index 00000000..805c8255
--- /dev/null
+++ b/stats/describe.json
@@ -0,0 +1,74 @@
+{
+  "count":{
+    "func":"count",
+    "type":["num","temporal","other"]
+  },
+  "type":{
+    "func":"{.ml.stats.i.metaTypes .Q.ty x}",
+    "type":["num","temporal","other"]
+  },
+  "mean":{
+    "func":"avg",
+    "type":["num"]
+  },
+  "std":{
+    "func":"sdev",
+    "type":["num"]
+  },
+  "min":{
+    "func":"min",
+    "type":["num","temporal"]
+  },
+  "max":{
+    "func":"max",
+    "type":["num","temporal"]
+  },
+  "q1":{
+    "func":"{.ml.stats.percentile[x;0.25]}",
+    "type":["num"]
+  },
+  "q2":{
+    "func":"{.ml.stats.percentile[x;0.5]}",
+    "type":["num"]
+  },
+  "q3":{
+    "func":"{.ml.stats.percentile[x;0.75]}",
+    "type":["num"]
+  },
+  "nulls":{
+    "func":"{sum null x}",
+    "type":["num","temporal","other"]
+  },
+  "inf":{
+    "func":"{sum x=.ml.stats.i.infinity .ml.stats.i.metaTypes[.Q.ty x]}",
+    "type":["num"]
+  },
+  "range":{
+    "func":".ml.range",
+    "type":["num","temporal"]
+  },
+  "skew":{
+    "func":".ml.fresh.feat.skewness",
+    "type":["num"]
+  },
+  "countDistinct":{
+    "func":"{count distinct x}",
+    "type":["num","temporal","other"]
+  },
+  "mode":{
+    "func":"{first key desc count each group x}",
+    "type":["num","temporal","other"]
+  },
+  "freq":{
+    "func":"{first value asc count each group x}",
+    "type":["num","temporal","other"]
+  },
+  "sampleDev":{
+    "func":"sdev",
+    "type":["num"]
+  },
+  "standardError":{
+    "func":"{dev[x]%sqrt count x}",
+    "type":["num"]
+  }
+}
diff --git a/stats/init.q b/stats/init.q
new file mode 100644
index 00000000..63053519
--- /dev/null
+++ b/stats/init.q
@@ -0,0 +1,5 @@
+// stats/init.q - Load stats library
+// Copyright (c) 2021 Kx Systems Inc
+
+.ml.loadfile`:stats/utils.q
+.ml.loadfile`:stats/stats.q
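Before the stats.q definitions below, a short sketch of how these JSON entries become callable q functions, mirroring the `stats.describeFuncs` load in stats.q (the relative path here is illustrative):

```q
q)funcs:.j.k raze read0`:stats/describe.json  / dictionary keyed by statistic
q)funcs[`mean;`func]
"avg"
q)(get funcs[`mean;`func])til 10              / string evaluated to a function
4.5
```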
diff --git a/stats/stats.q b/stats/stats.q
new file mode 100644
index 00000000..e5eab3d3
--- /dev/null
+++ b/stats/stats.q
@@ -0,0 +1,159 @@
+// stats/stats.q - Statistical tools
+// Copyright (c) 2021 Kx Systems Inc
+//
+// This statistical library contains functionality ranging from
+// descriptive statistical methods to gain more insight into a
+// user's data, to linear regression estimation methods to investigate
+// unknown parameters in a model. Includes OLS, WLS, describe,
+// and percentile
+
+\d .ml
+
+// @kind function
+// @category stats
+// @desc Train an ordinary least squares model on data
+// @param endog {number[][]|number[]} The endogenous variable
+// @param exog {number[][]|number[]} The variables that predict the
+//   endog variable
+// @param trend {boolean} Whether a trend is added to the model
+// @returns {dictionary} Contains the following information:
+//   modelInfo - Coefficients and statistical values calculated during the
+//     fitting process
+//   predict - A projection allowing for prediction on new input data
+stats.OLS.fit:{[endog;exog;trend]
+  stats.i.checkLen[endog;exog;"exog"];
+  endog:"f"$endog;
+  exog:"f"$$[trend;1f,'exog;exog];
+  if[1=count exog[0];exog:flip enlist exog];
+  coef:first enlist[endog]lsq flip exog;
+  modelInfo:stats.i.OLSstats[coef;endog;exog;trend];
+  returnInfo:enlist[`modelInfo]!enlist modelInfo;
+  predict:stats.OLS.predict returnInfo;
+  returnInfo,enlist[`predict]!enlist predict
+  }
+
+// @kind function
+// @category stats
+// @desc Predict values using coefficients calculated via OLS
+// @param config {dictionary} Information returned from `OLS.fit`
+//   including:
+//   modelInfo - Coefficients and statistical values calculated during the
+//     fitting process
+//   predict - A projection allowing for prediction on new input data
+// @param exog {table|number[][]|number[]} The exogenous variables
+// @returns {number[]} The predicted values
+stats.OLS.predict:{[config;exog]
+  modelInfo:config`modelInfo;
+  trend:`yIntercept in key modelInfo`variables;
+  exog:"f"$$[trend;1f,'exog;exog];
+  coef:modelInfo`coef;
+  if[1=count exog[0];exog:flip enlist exog];
+  sum coef*flip exog
+  }
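+
+// Example (illustrative): fit y = 2x + 1 on noiseless toy data and predict
+// out-of-sample points; the outputs shown are exact for this example
+// q)x:til 10
+// q)y:1+2*x
+// q)model:.ml.stats.OLS.fit[y;x;1b]   / trend:1b fits the intercept
+// q)model[`predict]10 11
+// 21 23f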
+
+// @kind function
+// @category stats
+// @desc Train a weighted least squares model on data
+// @param endog {number[][]|number[]} The endogenous variable
+// @param exog {number[][]|number[]} The variables that predict the
+//   endog variable
+// @param weights {float[]} The weights to be applied to the endog variable
+// @param trend {boolean} Whether a trend is added to the model
+// @returns {dictionary} Contains the following information:
+//   modelInfo - Coefficients and statistical values calculated during the
+//     fitting process
+//   predict - A projection allowing for prediction on new input data
+stats.WLS.fit:{[endog;exog;weights;trend]
+  stats.i.checkLen[endog;exog;"exog"];
+  if[weights~(::);weights:()];
+  if[count weights;stats.i.checkLen[endog;weights;"weights"]];
+  endog:"f"$endog;
+  // Calculate the weights if not given
+  // Must be inversely proportional to the error variance
+  if[not count weights;
+    trained:stats.OLS.fit[endog;exog;0b];
+    residuals:endog-trained[`predict]exog;
+    trained:stats.OLS.fit[abs residuals;exog;0b];
+    weights:1%{x*x}trained[`predict]exog
+    ];
+  exog:"f"$$[trend;1f,'exog;exog];
+  if[1=count exog[0];exog:flip enlist exog];
+  updDependent:flip[exog]mmu weights*'endog;
+  updPredictor:flip[exog]mmu weights*'exog;
+  coef:raze inv[updPredictor]mmu updDependent;
+  modelInfo:stats.i.OLSstats[coef;endog;exog;trend];
+  modelInfo,:enlist[`weights]!enlist weights;
+  returnInfo:enlist[`modelInfo]!enlist modelInfo;
+  predict:stats.WLS.predict returnInfo;
+  returnInfo,enlist[`predict]!enlist predict
+  }
+
+// @kind function
+// @category stats
+// @desc Predict values using coefficients calculated via WLS
+// @param config {dictionary} Information returned from `WLS.fit`
+//   including:
+//   modelInfo - Coefficients and statistical values calculated during the
+//     fitting process
+//   predict - A projection allowing for prediction on new input data
+// @param exog {table|number[][]|number[]} The exogenous variables
+// @returns {number[]} The predicted values
+stats.WLS.predict:stats.OLS.predict
+
+// @kind data
+// @category stats
+// @desc Load in functions defined within `describe.json`
+// @type dictionary
+stats.describeFuncs:.j.k raze read0`$path,"/stats/describe.json"
+
+// @kind function
+// @category stats
+// @desc Generates descriptive statistics of a table
+// @param tab {table} A simple table
+// @returns {dictionary} A tabular description of aggregate information
+//   of each column
+stats.describe:{[tab]
+  funcTab:stats.describeFuncs;
+  if[not all`func`type in cols value funcTab;
+    '"Keyed table must contain a func and type attribute"];
+  typeKeys:`num`temporal`other;
+  typeFunc:distinct raze value[funcTab][`type];
+  typCheck:raze not enlist[typeFunc]in string each typeKeys;
+  if[any typCheck;
+    '"Invalid type given: ",raze typeFunc where typCheck
+    ];
+  descKeys:key funcTab;
+  funcs:get each value[funcTab]`func;
+  // Get indices of where each type of function is in the function list
+  typeDict:typeKeys!where@'(string each typeKeys)in/:\:value[funcTab]`type;
+  numTypes:"hijef";
+  temporalTypes:"pmdznuvt";
+  numCols:exec c from meta[tab]where t in numTypes;
+  temporalCols:exec c from meta[tab]where t in temporalTypes;
+  otherCols:cols[tab]except numCols,temporalCols;
+  colDict:typeKeys!(numCols;temporalCols;otherCols);
+  applyInd:where 0<
diff --git a/timeseries/predict.q b/timeseries/predict.q
--- a/timeseries/predict.q
+++ b/timeseries/predict.q
 // @kind function
 // @category modelPredict
-// @fileoverview Predictions based on an AutoRegressive Moving Average model (ARMA)
-// @param mdl {dict} model parameters returned from fitting of an appropriate model
-// @param exog {tab/num[][]/(::)} Exogenous variables, are additional variables which
-//   required for application of model prediction
-// @param len {integer} number of values to be predicted
-// @return {float[]} list of predicted values
-ts.ARMA.predict:{[mdl;exog;len]
-  ts.i.dictCheck[mdl;ts.i.ARMA.keyList;"mdl"];
-  exog:ts.i.predDataCheck[mdl;exog];
-  ts.i.predictFunction[mdl;exog;len;ts.i.ARMA.singlePredict]
+// @desc Predictions based on an AutoRegressive Moving Average model
+//   (ARMA)
+// @param config {dictionary} Information returned from `ml.ts.ARMA.fit`
+//   including:
+//   modelInfo - Model coefficients and data needed for future predictions
+//   predict - A projection allowing for prediction of future values
+// @param exog {table|float[]|(::)} Exogenous variables are additional
+//   variables which may be accounted for to improve the model
+// @param len {long} Number of future values to be predicted
+// @return {float[]} Predicted values
+ts.ARMA.predict:{[config;exog;len]
+  model:config`modelInfo;
+  exog:ts.i.predDataCheck[model;exog];
+  ts.i.predictFunction[model;exog;len;ts.i.ARMA.singlePredict]
   }
 
 // @kind function
 // @category modelPredict
-// @fileoverview Predictions based on an AutoRegressive Integrated Moving Average
-//   model (ARIMA)
-// @param mdl {dict} model parameters returned from fitting of an appropriate model
-// @param exog {tab/num[][]/(::)} Exogenous variables, are additional variables which
-//   required for application of model prediction
-// @param len {integer} number of values to be predicted
-// @return {float[]} list of predicted values
-ts.ARIMA.predict:{[mdl;exog;len]
-  ts.i.dictCheck[mdl;ts.i.ARIMA.keyList;"mdl"];
-  exog:ts.i.predDataCheck[mdl;exog];
+// @desc Predictions based on an AutoRegressive Integrated Moving
+//   Average model (ARIMA)
+// @param config {dictionary} Information returned from `ml.ts.ARIMA.fit`
+//   including:
+//   modelInfo - Model coefficients and data needed for future predictions
+//   predict - A projection allowing for prediction of future values
+// @param exog {table|float[]|(::)} Exogenous variables are additional
+//   variables which may be accounted for to improve the model
+// @param len {long} Number of future values to be predicted
+// @return {float[]} Predicted values
+ts.ARIMA.predict:{[config;exog;len]
+  model:config`modelInfo;
+  exog:ts.i.predDataCheck[model;exog];
   // Calculate predictions not accounting for differencing
-  pred:ts.i.predictFunction[mdl;exog;len;ts.i.ARMA.singlePredict];
-  dval:count mdl`origd;
+  preds:ts.i.predictFunction[model;exog;len;ts.i.ARMA.singlePredict];
+  dVal:count model`originalData;
   // Revert data to correct scale (remove differencing if previously applied)
-  $[dval;dval _dval{sums x}/mdl[`origd],pred;pred]
+  $[dVal;dVal _dVal{sums x}/model[`originalData],preds;preds]
   }
 
 // @kind function
 // @category modelPredict
-// @fileoverview Predictions based on a Seasonal AutoRegressive Integrated Moving
-//   Average model (SARIMA)
-// @param mdl {dict} model parameters returned from fitting of an appropriate model
-// @param exog {tab/num[][]/(::)} Exogenous variables, are additional variables which
-//   required for application of model prediction
-// @param len {integer} number of values to be predicted
-// @return {float[]} list of predicted values
-ts.SARIMA.predict:{[mdl;exog;len]
-  ts.i.dictCheck[mdl;ts.i.SARIMA.keyList;"mdl"];
-  exog:ts.i.predDataCheck[mdl;exog];
+// @desc Predictions based on a Seasonal AutoRegressive Integrated
+//   Moving Average model (SARIMA)
+// @param config {dictionary} Information returned from `ml.ts.SARIMA.fit`
+//   including:
+//   modelInfo - Model coefficients and data needed for future predictions
+//   predict - A projection allowing for prediction of future values
+// @param exog {table|float[]|(::)} Exogenous variables are additional
+//   variables which may be accounted for to improve the model
+// @param len {long} Number of future values to be predicted
+// @return {float[]} Predicted values
+ts.SARIMA.predict:{[config;exog;len]
+  model:config`modelInfo;
+  exog:ts.i.predDataCheck[model;exog];
   // Calculate predictions not accounting for differencing
-  preds:$[count raze mdl[`pred_dict];
-    ts.i.predictFunction[mdl;exog;len;ts.i.SARMA.singlePredict];
-    ts.i.AR.predict[mdl;exog;len]
+  preds:$[count raze model`paramDict;
+    ts.i.predictFunction[model;exog;len;ts.i.SARMA.singlePredict];
+    ts.i.AR.predict[model;exog;len]
     ];
   // Order of seasonal differencing originally applied
-  sval:count mdl`origs;
-  // if seasonal differenced, revert to original
-  if[sval;preds:ts.i.reverseSeasonDiff[mdl[`origs];preds]];
+  dSeasVal:count model`seasonData;
+  // If seasonal differenced, revert to original
+  if[dSeasVal;preds:ts.i.reverseSeasonDiff[model`seasonData;preds]];
   // Order of differencing originally applied
-  dval:count mdl`origd;
+  dVal:count model`originalData;
   // Revert data to correct scale (remove differencing if previously applied)
-  $[dval;dval _dval{sums x}/mdl[`origd],preds;preds]
+  $[dVal;dVal _dVal{sums x}/model[`originalData],preds;preds]
   }
-
 // @kind function
 // @category modelPredict
-// @fileoverview Predictions based on an AutoRegressive Conditional Heteroskedasticity
-//   model (ARCH)
-// @param mdl {dict} model parameters returned from fitting of an appropriate model
-// @param len {integer} number of values to be predicted
-// @return {float[]} list of predicted values
-
 // @kind function
 // @category modelPredict
-// @fileoverview Predictions based on an AutoRegressive Conditional Heteroskedasticity
-//   model (ARCH)
-// @param mdl {dict} model parameters returned from fitting of an appropriate model
-// @param len {integer} number of values to be predicted
-// @return {float[]} list of predicted values
-// Predict future volatility using an ARCH model
-/. r > list of predicted values
-ts.ARCH.predict:{[mdl;len]
-  ts.i.dictCheck[mdl;ts.i.ARCH.keyList;"mdl"];
-  // predict and return future values
-  last{x>count y 1}[len;]ts.i.ARCH.singlePredict[mdl`params]/(mdl`resid;())
+// @desc Predictions based on an AutoRegressive Conditional
+//   Heteroskedasticity model (ARCH)
+// @param config {dictionary} Information returned from `ml.ts.ARCH.fit`
+//   including:
+//   modelInfo - Model coefficients and data needed for future predictions
+//   predict - A projection allowing for prediction of future values
+// @param len {long} Number of future values to be predicted
+// @return {float[]} Predicted values
+ts.ARCH.predict:{[config;len]
+  model:config`modelInfo;
+  last{x>count y 1}[len;]ts.i.ARCH.singlePredict
+    [model`coefficients]/(model`residualVals;())
 }
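The same config/projection pattern applies to ARCH. A hedged sketch — the fit
function is not shown in this hunk, so a signature of the form
.ml.ts.ARCH.fit[residuals;lags], analogous to the other models, is assumed:

  q)resid:100?0.1                              / illustrative residual series
  q)config:.ml.ts.ARCH.fit[resid;1]            / assumed signature
  q)config[`predict][10]                       / predict 10 future values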
[GIT binary patches omitted: updated serialized test fixtures under
timeseries/tests/data/ - linux/fit and windows/fit AR1-AR4, ARCH1-ARCH2,
ARIMA1-ARIMA4, ARMA1-ARMA4, SARIMA1-SARIMA4, and misc/aicScore1-aicScore4.]
diff --git a/timeseries/utils.q b/timeseries/utils.q
--- a/timeseries/utils.q
+++ b/timeseries/utils.q
-  last{x>count y 2}[len;]predfn[mdl`params;exog;mdl`pred_dict;;mdl`estresid]/vals
+// @desc Predict a set number of future values based on a fit model
+//   AR/ARMA/SARMA
+// @param model {dictionary} All information regarding model coefficients and
+//   required residual information
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @param len {int} The number of future data points to be predicted
+// @param predFunc {fn} The function to be used for prediction
+// @return {number[]} Predicted values based on fit model
+ts.i.predictFunction:{[model;exog;len;predFunc]
+  vals:(model`lagVals;model`residualVals;());
+  last{x>count y 2}[len;]predFunc
+    [model`coefficients;exog;model`paramDict;;model`residualCoeffs]/vals
 }
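The conditional-iterate line above drives all of the single-step predictors:
the state is a triple of (lag values;residual values;predictions), and
iteration continues until the prediction list reaches the requested length.
A stripped-down illustration of the idiom (toy step function, not from the
library):

  q)last{x>count y 2}[3;]{(x 0;x 1;x[2],1+count x 2)}/(();();())
  1 2 3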
-
 // ARMA/AR model prediction functionality

 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview prediction function for ARMA model
-// @param mdl {dict} contains all information regarding model parameters and required
-//   residual information
-// @param exog {tab} exogenous variables, are additional variables which
-//   may be accounted for to improve the model
-// @param len {integer} the number of data points to be predicted
-// @return {num[]} predicted values based on fit ARMA model
-ts.i.ARMA.predictFunction:{[mdl;exog;len]
-  exog:ts.i.predDataCheck[mdl;exog];
-  ts.i.predictFunction[mdl;exog;len;ts.i.ARMA.singlePredict]
+// @desc Prediction function for ARMA model
+// @param model {dictionary} All information regarding model coefficients and
+//   required residual information
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @param len {int} The number of future data points to be predicted
+// @return {number[]} Predicted values based on fit ARMA model
+ts.i.ARMA.predictFunction:{[model;exog;len]
+  exog:ts.i.predDataCheck[model;exog];
+  ts.i.predictFunction[model;exog;len;ts.i.ARMA.singlePredict]
 }

 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview predict a single ARMA value
-// @param params {num[]} model parameters retrieved from initial fit model
-// @param exog {tab} exogenous variables, are additional variables which
-//   may be accounted for to improve the model
-// @param dict {dict} additional information which can dictate the behaviour
-//   when making a prediction
-// @param pvals {num[]} previously predicted values
-// @param estresid {num[]} estimates of the residual errors
-// @return {num[]} information required for the prediction of a set of ARMA values
-ts.i.ARMA.singlePredict:{[params;exog;dict;pvals;estresid]
-  exog:exog count pvals 2;
-  normmat:exog,raze#[neg[dict`p];pvals[0]],pvals[1];
-  pred:$[dict`tr;
-    params[0]+normmat mmu 1_params;
-    params mmu normmat
+// @desc Predict a single ARMA value
+// @param coeffs {number[]} Model coefficients retrieved from initial fit model
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @param dict {dictionary} Additional information which can dictate the
+//   behaviour when making a prediction
+// @param pastPreds {number[]} Previously predicted values
+// @param residualCoeffs {number[]} Coefficients to estimate the residuals
+// @return {number[]} Information required for the prediction of a set of ARMA
+//   values
+ts.i.ARMA.singlePredict:{[coeffs;exog;dict;pastPreds;residualCoeffs]
+  exog:exog count pastPreds 2;
+  matrix:exog,raze#[neg dict`p;pastPreds 0],pastPreds 1;
+  preds:$[dict`trend;
+    coeffs[0]+matrix mmu 1_coeffs;
+    coeffs mmu matrix
   ];
-  if[count pvals 1;
-    estvals:exog,pvals[0];
-    pvals[1]:(1_pvals[1]),pred-mmu[estresid;estvals]
+  if[count pastPreds 1;
+    estVals:exog,pastPreds 0;
+    pastPreds[1]:(1_pastPreds 1),preds-mmu[residualCoeffs;estVals]
   ];
-  ((1_pvals[0]),pred;pvals[1];pvals[2],pred)
+  ((1_pastPreds 0),preds;pastPreds 1;pastPreds[2],preds)
 }

 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview prediction function for AR model
-// @param mdl {dict} contains all information regarding model parameters and required
-//   residual information
-// @param exog {tab} Exogenous variables, are additional variables which
-//   may be accounted for to improve the model
-// @param len {integer} the number of data points to be predicted
-// @return {num[]} predicted values based on fit AR model
-ts.i.AR.predictFunction:{[mdl;exog;len]
-  exog:ts.i.predDataCheck[mdl;exog];
-  mdl[`pred_dict]:enlist[`p]!enlist count mdl`p_param;
-  mdl[`estresid]:();
-  mdl[`resid]:();
-  ts.i.predictFunction[mdl;exog;len;ts.i.AR.singlePredict]
+// @desc Prediction function for AR model
+// @param model {dictionary} All information regarding model coefficients and
+//   required residual information
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @param len {int} The number of future data points to be predicted
+// @return {number[]} Predicted values based on fit AR model
+ts.i.AR.predictFunction:{[model;exog;len]
+  exog:ts.i.predDataCheck[model;exog];
+  model[`paramDict]:enlist[`p]!enlist count model`pCoeff;
+  model[`residualCoeffs]:();
+  model[`residualVals]:();
+  ts.i.predictFunction[model;exog;len;ts.i.AR.singlePredict]
 }

 // Predict a single AR value
 ts.i.AR.singlePredict:ts.i.ARMA.singlePredict
-
 // SARIMA model calculation functionality

 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview prediction function for SARMA model
-// @param mdl {dict} contains all information regarding model parameters and required
-//   residual information
-// @param exog {tab} Exogenous variables, are additional variables which
-//   may be accounted for to improve the model
-// @param len {integer} the number of data points to be predicted
-// @return {num[]} predicted values based on fit SARMA model
-ts.i.SARMA.predictFunction:{[mdl;exog;len]
-  exog:ts.i.predDataCheck[mdl;exog];
-  $[count raze mdl[`pred_dict];
-    ts.i.predictFunction[mdl;exog;len;ts.i.SARMA.singlePredict];
-    ts.i.AR.predictFunction[mdl;exog;len]
+// @desc Prediction function for SARMA model
+// @param model {dictionary} All information regarding model coefficients and
+//   required residual information
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @param len {int} The number of future data points to be predicted
+// @return {number[]} Predicted values based on fit SARMA model
+ts.i.SARMA.predictFunction:{[model;exog;len]
+  exog:ts.i.predDataCheck[model;exog];
+  $[count raze model`paramDict;
+    ts.i.predictFunction[model;exog;len;ts.i.SARMA.singlePredict];
+    ts.i.AR.predictFunction[model;exog;len]
   ]
 }

 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview predict a single SARMA value
-// @param params {num[]} model parameters retrieved from initial fit model
-// @param exog {tab} exogenous variables, are additional variables which
-//   may be accounted for to improve the model
-// @param dict {dict} additional information which can dictate the behaviour
-//   when making a prediction
-// @param pvals {num[]} previously predicted values
-// @param estresid {num[]} estimates of the residual errors
-// @return {num[]} information required for the prediction of SARMA values
-ts.i.SARMA.singlePredict:{[params;exog;dict;pvals;estresid];
-  exog:exog count pvals 2;
-  dict,:ts.i.SARMA.preproc[params;dict];
-  pred:ts.i.SARMA.predictValue[params;pvals;exog;dict];
-  if[count pvals 1;
-    estvals:exog,neg[dict`n]#pvals 0;
-    pvals[1]:(1_pvals[1]),pred-mmu[estresid;estvals]
+// @desc Predict a single SARMA value
+// @param coeffs {dictionary} Model coefficients retrieved from initial fit
+//   model
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @param dict {dictionary} Additional information which can dictate the
+//   behaviour when making a prediction
+// @param pastPreds {number[]} Previously predicted values
+// @param residualCoeffs {number[]} Coefficients to calculate the residual
+//   errors
+// @return {number[]} Information required for the prediction of SARMA values
+ts.i.SARMA.singlePredict:{[coeffs;exog;dict;pastPreds;residualCoeffs];
+  exog:exog count pastPreds 2;
+  dict,:ts.i.SARMA.preproc[coeffs;dict];
+  preds:ts.i.SARMA.predictVal[coeffs;pastPreds;exog;dict];
+  if[count pastPreds 1;
+    estVals:exog,neg[dict`n]#pastPreds 0;
+    pastPreds[1]:(1_pastPreds 1),preds-mmu[residualCoeffs;estVals]
   ];
-  // append new lag values, for next step calculations
-  ((1_pvals[0]),pred;pvals[1];pvals[2],pred)
+  // Append new lag values, for next step calculations
+  ((1_pastPreds 0),preds;pastPreds 1;pastPreds[2],preds)
 }

 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview Calculate new required lags for SARMA prediction surrounding
-//   seasonal components
-// @param params {dict} model parameters retrieved from initial fit model
-// @param dict {dict} additional information which can dictate the behaviour
-//   in different situations where predictions are being made
-// @return {dict} seasonal parameters for prediction in SARMA models
-ts.i.SARMA.preproc:{[params;dict]
-  // 1. Calculate or retrieve all necessary seasonal lagged values for SARMA prediction
-  // split up the coefficients to their respective p,q,P,Q parts
-  lagp:(dict[`tr] _params)[til dict`p];
-  lagq:((dict[`tr]+dict`p)_params)[til dict`q];
-  lagSeasp:((dict[`tr]+sum dict`q`p)_params)[til count[dict`P]];
-  lagSeasq:neg[count dict`Q]#params;
-  // Function to extract additional seasonal multiplied coefficients
-  // These coefficients multiply p x P vals and q x Q vals
-  seas_multi:{[x;y;z;d]$[d[x]&min count d upper x;(*/)flip y cross z;2#0f]};
-  // append new lags to original dictionary
-  dictKeys:`add_lag_param`add_resid_param;
-  dictVals:(seas_multi[`p;lagp;lagSeasp;dict];seas_multi[`q;lagq;lagSeasq;dict]);
+// @desc Calculate additional coefficients for SARMA prediction
+//   surrounding seasonal components
+// @param coeffs {dictionary} Model coefficients retrieved from initial fit
+//   model
+// @param dict {dictionary} Additional information which can dictate the
+//   behaviour in different situations where predictions are being made
+// @return {dictionary} Seasonal parameters for prediction in SARMA models
+ts.i.SARMA.preproc:{[coeffs;dict]
+  // Calculate or retrieve all necessary seasonal lagged values for SARMA
+  // prediction and split up the coefficients to their respective p,q,P,Q parts
+  pVals:(dict[`trend] _coeffs)til dict`p;
+  qVals:((dict[`trend]+dict`p)_coeffs)til dict`q;
+  pSeasonVals:((dict[`trend]+sum dict`q`p)_coeffs)til count dict`P;
+  qSeasonVals:neg[count dict`Q]#coeffs;
+  // Append new lags to original dictionary
+  dictKeys:`additionalpCoeff`additionalqCoeff;
+  dictVals:(ts.i.SARMA.multiplySeason[`p;pVals;pSeasonVals;dict];
+    ts.i.SARMA.multiplySeason[`q;qVals;qSeasonVals;dict]);
   dictKeys!dictVals
 }
which -// may be accounted for to improve the model -// @param dict {dict} additional information which can dictate the behaviour -// when making a prediction -// @return {num[]} information required for the prediction of a set of SARMA values -ts.i.SARMA.predictValue:{[params;pvals;exog;dict] - dict[`seas_resid_add]:$[dict[`q]&min count dict`Q; - pvals[1]dict[`seas_add_Q]; +// @desc Function to extract additional seasonal multiplied +// coefficients. These coefficients multiply p x P vals and q x Q vals +// @param dictKeys {symbol} Key of dictionary to extract info from +// @param normVals {number[]} Non seasonal coefficients +// @param seasonVals {number[]} Seasonal coefficients +// @param dict {dictionary} Model parameters retrieved from initial fit model +// @return {dictionary} Seasonal coefficients multiplied by non seasonal +// coefficients +ts.i.SARMA.multiplySeason:{[dictKey;normVals;seasonVals;dict] + $[dict[dictKey]&min count dict upper dictKey; + (*/)flip normVals cross seasonVals; + 2#0f + ] + } + +// @private +// @kind function +// @category predictUtility +// @desc Predict a single SARMA value +// @param coeffs {number[]} Model coefficiants retrieved from initial fit model +// @param pastPreds {number[]} Previously predicted values +// @param exog {float[]|(::)} Exogenous variables are additional variables +// which may be accounted for to improve the model, if (::)/() +// this will be ignored +// @param dict {dictionary} Additional information which can dictate the +// behaviour when making a prediction +// @return {number[]} information required for the prediction of a set of SARMA +// values +ts.i.SARMA.predictVal:{[coeffs;pastPreds;exog;dict] + dict[`additionalResiduals]:$[dict[`q]&min count dict`Q; + pastPreds[1]dict`additionalQ; 2#0f ]; - dict[`seas_lag_add]:$[dict[`p]&min count dict`P; - pvals[0]dict[`seas_add_P]; + dict[`additionalLags]:$[dict[`p]&min count dict`P; + pastPreds[0]dict`additionalP; 2#0f ]; - sarmavals:raze#[neg dict`p;pvals 0],#[neg dict`q;pvals 1],pvals[0][dict`P],pvals[1][dict`Q]; - dict[`norm_mat]:exog,sarmavals; - ts.i.SARMA.eval[params;dict] + SARMAvals:raze#[neg dict`p;pastPreds 0],#[neg dict`q;pastPreds 1], + pastPreds[0][dict`P],pastPreds[1]dict`Q; + dict[`matrix]:exog,SARMAvals; + ts.i.SARMA.eval[coeffs;dict] } // @private // @kind function // @category predictUtility -// @fileoverview calculate the value of a SARMA prediction based on -// provided params/dictionary -// @param params {num[]} model parameters retrieved from initial fit model -// @param dict {dict} additional information which can dictate the behaviour -// when making a prediction -// @return {num[]} the SARMA prediction values -ts.i.SARMA.eval:{[params;dict] - normVal :mmu[dict`norm_mat;dict[`tr] _params]; - seasResid:mmu[dict`seas_resid_add;dict`add_resid_param]; - seasLag :mmu[dict`seas_lag_add;dict`add_lag_param]; - $[dict`tr;params[0]+;]normVal+seasResid+seasLag +// @desc Calculate the value of a SARMA prediction based on +// provided coeffs/dictionary +// @param coeffs {number[]} Model coefficients retrieved from initial fit model +// @param dict {dictionary} Additional information which can dictate the +// behaviour when making a prediction +// @return {number[]} The SARMA prediction values +ts.i.SARMA.eval:{[coeffs;dict] + normVals :mmu[dict`matrix;dict[`trend] _coeffs]; + seasResids:mmu[dict`additionalResiduals;dict`additionalqCoeff]; + seasLags :mmu[dict`additionalLags;dict`additionalpCoeff]; + $[dict`trend;coeffs[0]+;]normVals+seasResids+seasLags } - // @private // 
 // @private
 // @kind function
 // @category predictUtility
-// @fileoverview calculate a single ARCH value,
-// @param params {dict} model parameters retrieved from initial fit model
-// @param pvals {num[]} list of values over which predictions are composed
-// @return {num[]} list containing residuals and predicted values
-ts.i.ARCH.singlePredict:{[params;pvals]
-  predict:params[0]+pvals[0] mmu 1_params;
-  ((1_pvals 0),predict;pvals[1],predict)
+// @desc Calculate a single ARCH value
+// @param coeffs {dictionary} Model coefficients retrieved from
+//   initial fit model
+// @param pastPreds {number[]} Previously predicted values
+// @return {number[]} Residuals and predicted values
+ts.i.ARCH.singlePredict:{[coeffs;pastPreds]
+  predict:coeffs[0]+pastPreds[0] mmu 1_coeffs;
+  ((1_pastPreds 0),predict;pastPreds[1],predict)
 }

 // Akaike Information Criterion
@@ -468,50 +535,52 @@ ts.i.ARCH.singlePredict
 // @private
 // @kind function
 // @category aicUtility
-// @fileoverview calculate the Akaike Information Criterion
-// @param true {num[]} true values
-// @param pred {num[]} predicted values
-// @param params {num[]} list of the lag/residual parameters
+// @desc Calculate the Akaike Information Criterion
+// @param true {number[]} True values
+// @param pred {number[]} Predicted values
+// @param params {number[]} The lag/residual parameters
 // @return {float} Akaike Information Criterion score
 ts.i.aicScore:{[true;pred;params]
   // Calculate residual sum of squares, normalised for number of values
-  rss:{wsum[x;x]%y}[true-pred;n:count pred];
+  sumSquares:{wsum[x;x]%y}[true-pred;n:count pred];
   // Number of parameters
   k:sum params;
-  aic:(2*k)+n*log rss;
-  // if k<40 use the altered aic score
+  aic:(2*k)+n*log sumSquares;
+  // If k<40 use the altered aic score
   $[k<40;aic+(2*k*k+1)%n-k-1;aic]
 }
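The score above is AIC = 2k + n*log(RSS%n), with the corrected small-sample
form applied when k<40. A quick check with made-up values (the private utility
is called directly here purely for illustration):

  q)true:1 2 3 4f
  q)pred:1.1 1.9 3.2 3.9
  q).ml.ts.i.aicScore[true;pred;1 1 1]         / k=3 parameters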

 // @private
 // @kind function
 // @category aicUtility
-// @fileoverview Fit a model, predict the test, return AIC score
-//   for a single set of input params
-// @param train {dict} training data as a dictionary with endog and exog data
-// @param test {dict} testing data as a dictionary with endog and exog data
-// @param len {integer} number of steps in the future to be predicted
-// @param params {dict} parameters used in prediction
+// @desc Fit a model, predict the test, return AIC score
+//   for a single set of input params
+// @param train {dictionary} Training data as a dictionary with
+//   endog and exog data
+// @param test {dictionary} Testing data as a dictionary with
+//   endog and exog data
+// @param len {integer} Number of steps in the future to be predicted
+// @param params {dictionary} Parameters used in prediction
 // @return {float} Akaike Information Criterion score
 ts.i.aicFitScore:{[train;test;len;params]
   // Fit a model using the specified parameters
-  mdl  :ts.ARIMA.fit[train`endog;train`exog;;;;]. params`p`d`q`tr;
+  model:ts.ARIMA.fit[train`endog;train`exog]. params`p`d`q`trend;
   // Predict using the fitted model
-  pred:ts.ARIMA.predict[mdl;test`exog;len];
+  preds:model[`predict][test`exog;len];
   // Score the predictions
-  ts.i.aicScore[len#test`endog;pred;params]
+  ts.i.aicScore[len#test`endog;preds;params]
 }
-
 // Autocorrelation functionality

 // @private
 // @kind function
 // @category autocorrelationUtility
-// @fileoverview Lagged covariance between a dataset at time t and time t-lag
-// @param data {num[]} vector on which to calculate the lagged covariance
-// @param lag {integer} size of the lag to use when calculating covariance
-// @return {float} covariance between a time series and lagged version of itself
+// @desc Lagged covariance between a dataset at time t and time t-lag
+// @param data {number[]} Vector on which to calculate the lagged covariance
+// @param lag {int} Size of the lag to use when calculating covariance
+// @return {float} Covariance between a time series and lagged version of
+//   itself
 ts.i.lagCovariance:{[data;lag]
   cov[neg[lag] _ data;lag _ data]
 }

 // @private
 // @kind function
 // @category autocorrelationUtility
-// @fileoverview Calculate the autocorrelation between a series
+// @desc Calculate the autocorrelation between a time series
 //   and lagged version of itself
-// @param data {num[]} vector on which to calculate the lagged covariance
-// @param lag {integer} size of the lag to use when calculating covariance
-// @return {float} autocorrelation between a time series and lagged version of itself
+// @param data {number[]} Vector on which to calculate the lagged covariance
+// @param lag {int} Size of the lag to use when calculating covariance
+// @return {float} Autocorrelation between a time series and lagged version of
+//   itself
 ts.i.autoCorrFunction:{[data;lag]
   ts.i.lagCovariance[data;lag]%var data
 }
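As a quick sanity check of the autocorrelation helper (illustrative data, so
exact values will vary with the random seed):

  q)x:sums 1000?1f                             / trending series
  q).ml.ts.i.autoCorrFunction[x;1]             / near 1 for a trending series
  q).ml.ts.i.autoCorrFunction[1000?1f;1]       / near 0 for white noise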
-
 // Matrix creation/manipulation functionality

 // @private
 // @kind function
 // @category matrixUtilities
-// @fileoverview create a lagged matrix with each row containing the original
+// @desc Create a lagged matrix with each row containing the original
 //   data as its first element and the remaining 'lag' values as additional row
 //   elements
-// @param data {num[]} vector from which to create the lagged matrix
-// @param lag {integer} size of the lag to use when creating lagged matrix
-// @return {num[][]} a numeric matrix containing original data augmented with
-//   lagged versions of the original dataset.
+// @param data {number[]} Vector from which to create the lagged matrix
+// @param lag {int} Size of the lag to use when creating lagged matrix
+// @return {number[][]} A numeric matrix containing original data augmented
+//   with lagged versions of the original dataset.
 ts.i.lagMatrix:{[data;lag]
   data til[count[data]-lag]+\:til lag
 }

 // @private
 // @kind function
 // @category matrixUtilities
-// @fileoverview convert a simple table into a matrix
-// @param data {tab} simple table to be converted to a matrix representation
-// @return {num[][]} matrix representation of the input table in the same 'configuration'
+// @desc Convert a simple table into a matrix
+// @param data {table} Simple table to be converted to a matrix representation
+// @return {number[][]} Matrix representation of the input table in the same
+//   'configuration'
 ts.i.tabToMatrix:{[data]
   flip value flip data
 }
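Both matrix utilities are deterministic, so their behaviour can be shown
directly:

  q).ml.ts.i.lagMatrix[til 6;3]
  0 1 2
  1 2 3
  2 3 4
  q).ml.ts.i.tabToMatrix ([]a:1 2;b:3 4)
  1 3
  2 4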
-
-// Stationarity functionality used to test if datasets are suitable for application of the ARIMA
-// and to facilitate transformation of the data to a more suitable form if relevant
+// Stationarity functionality used to test if datasets are suitable for
+// application of the ARIMA and to facilitate transformation of the data to a
+// more suitable form if relevant

 // @private
 // @kind function
 // @category stationaryUtilities
-// @fileoverview calculate relevant augmented dickey fuller statistics using python
-// @param data {dict/tab/num[]} dataset to be testing for stationarity
-// @param dtype {short} type of the dataset that's being passed to the function
-// @return {num[]/num[][]} all relevant scores from an augmented dickey fuller test
+// @desc Calculate relevant augmented dickey fuller statistics using
+//   python
+// @param data {dictionary|table|number[]} Dataset to be tested for
+//   stationarity
+// @param dtype {short} Type of the dataset that's being passed to the function
+// @return {number[]} All relevant scores from an augmented dickey fuller test
 ts.i.stationaryScores:{[data;dtype]
   // Calculate the augmented dickey-fuller scores for a dict/tab/vector input
-  scores:{.ml.fresh.i.adfuller[x]`}@'
-    $[98h=dtype;flip data;
-      99h=dtype;data;
-      dtype in(6h;7h;8h;9h);enlist data;
-      '"Inappropriate type provided"];
+  scores:{.ml.fresh.i.adFuller[x]`}@'
+    $[98h=dtype;
+        flip data;
+      99h=dtype;
+        data;
+      dtype in(6h;7h;8h;9h);
+        enlist data;
+      '"Inappropriate type provided"
+    ];
   flip{x[0 1],(0.05>x 1),value x 4}each$[dtype in 98 99h;value::;]scores
 }

 // @private
 // @kind function
 // @category stationaryUtilities
-// @fileoverview Are all of the series provided by a user stationary,
+// @desc Are all of the series provided by a user stationary,
 //   determined using augmented dickey fuller?
-// @param data {dict/tab/num[]} dataset to be testing for stationarity
-// @return {bool} indicate if all time series are stationary or not
+// @param data {dictionary|table|number[]} Dataset to be tested for
+//   stationarity
+// @return {boolean} Indicate if all time series are stationary or not
 ts.i.stationary:{[data]
   (all/)ts.i.stationaryScores[data;type data][2]
 }
-
 // Differencing utilities

 // @private
 // @kind function
 // @category differUtility
-// @fileoverview apply time-series differencing and remove first diff elements
-// @param data {num[]/num[][]} dataset to apply differencing to
-// @param diff {integer} order of time series differencing
-// @return {num[]/num[][]} differenced time series
-ts.i.diff:{[data;diff]
-  diffData:diff{deltas x}/data;
-  diff _ diffData
+// @desc Apply time-series differencing and remove first d elements
+// @param data {number[]} Dataset to apply differencing to
+// @param d {int} Order of time series differencing
+// @return {number[]} Differenced time series
+ts.i.diff:{[data;d]
+  diffData:d{deltas x}/data;
+  d _ diffData
 }

 // @private
 // @kind function
 // @category differUtility
-// @fileoverview apply seasonal differencing and remove first diff elements
-// @param diff {integer} how many points in the past does data need to be
-//   differenced with respect to
-// @param data {num[]/num[][]} dataset to apply differencing to
-// @return {num[]/num[][]} differenced time series
-ts.i.seasonDiff:{[diff;data]
-  diffData:data - xprev[diff;data];
-  diff _ diffData
+// @desc Apply seasonal differencing and remove first d elements
+// @param d {int} How many points in the past does data need to be
+//   differenced with respect to
+// @param data {number[]} Dataset to apply differencing to
+// @return {number[]} Differenced time series
+ts.i.seasonDiff:{[d;data]
+  diffData:data - xprev[d;data];
+  d _ diffData
 }
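Both differencing helpers drop the leading elements they can no longer
difference. For example:

  q)x:1 3 6 10 15f
  q).ml.ts.i.diff[x;1]                         / first differences
  2 3 4 5f
  q).ml.ts.i.seasonDiff[2;x]                   / seasonal differences, period 2
  5 7 9f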

 // @private
 // @kind function
 // @category differUtility
-// @fileoverview revert seasonally differenced data to correct representation
-// @param origd {num[]} set of original dataset saved before being differenced
-// @param dfdata {num[]} differenced dataset
-// @return {num[]} the data reverted back to its original format before differencing
-ts.i.reverseSeasonDiff:{[origd;dfdata]
-  seasd:origd,dfdata;
-  n:count origd;
-  [n]_first{x[1]count exog;ts.i.err.len[]];
+  if[not[()~exog]&count[endog]>count exog;ts.i.err.len[]];
   // convert exog table to matrix
   $[98h~type exog;:"f"$ts.i.tabToMatrix exog;()~exog;:exog;:"f"$exog];
 }
@@ -665,109 +758,130 @@ ts.i.fitDataCheck:{[endog;exog]
 // @private
 // @kind function
 // @category dataCheckUtility
-// @fileoverview ensure that all required keys are present for the application of
-//   the various prediction functions
-// @param dict {dict} the dictionary parameter to be validated
-// @param keyvals {sym[]} list of the keys which should be present in order to
-//   fully execute the logic of the function
-// @param input {string} name of the input dictionary which issue is
-//   highlighted in
-// @return {err/(::)} will error on incorrect inputs otherwise run silently
-ts.i.dictCheck:{[dict;keyvals;input]
+// @desc Ensure that all required keys are present for the application
+//   of the various prediction functions
+// @param dict {dictionary} Dictionary parameter to be validated
+// @param keyVals {symbol[]} Keys which should be present in order to fully
+//   execute the logic of the function
+// @param input {string} Name of input dictionary which issue is highlighted in
+// @return {err|::} Will error on incorrect inputs otherwise run silently
+ts.i.dictCheck:{[dict;keyVals;input]
   if[99h<>type dict;'input," must be a dictionary input"];
-  validKeys:keyvals in key dict;
+  validKeys:keyVals in key dict;
   if[not all validKeys;
-    invalid:sv[", ";string[keyvals]where not validKeys];
-    '"The following required dictionary keys for '",input,"' are not provided: ",invalid
+    invalid:sv[", ";string[keyVals]where not validKeys];
+    '"The following required dictionary keys for '",input,
+      "' are not provided: ",invalid
   ];
 }

 // @private
 // @kind function
 // @category dataCheckUtility
-// @fileoverview check that the exogenous data match the expected input when
-//   predicting data using a the model are consistent, in the case they are not,
-//   flag an error ensure that the exogenous data is returned as a matrix
-// @param mdl {dict} dictionary containing required information to predict
-//   future values
-// @param exog {tab/num[][]} exogenous dataset
-// @return {num[][]} exogenous data as a matrix
-ts.i.predDataCheck:{[mdl;exog]
-  // allow null to be provided as exogenous variable
+// @desc Check that the exogenous data used when predicting are
+//   consistent with the input the model was fit on; in the case they are
+//   not, flag an error, and ensure that the exogenous data is returned as a
+//   matrix
+// @param model {dictionary} Dictionary containing required information to
+//   predict future values
+// @param exog {float[]|(::)} Exogenous variables are additional variables
+//   which may be accounted for to improve the model, if (::)/()
+//   this will be ignored
+// @return {number[]} Exogenous data as a matrix
+ts.i.predDataCheck:{[model;exog]
+  // Allow null to be provided as exogenous variable
   if[exog~(::);exog:()];
-  // check that the fit and new params are equivalent
-  if[not count[mdl`exog_param]~count exog[0];ts.i.err.exog[]];
-  // convert exogenous variable to a matrix if required
+  // Check that the fit and new params are equivalent
+  if[not count[model`exogCoeff]~count exog 0;ts.i.err.exog[]];
+  // Convert exogenous variable to a matrix if required
   $[98h~type exog;"f"$ts.i.tabToMatrix exog;()~exog;:exog;"f"$exog]
 }
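ts.i.dictCheck raises a descriptive error naming any missing keys. A small
sketch of the failure mode, with a hypothetical dictionary and name:

  q).ml.ts.i.dictCheck[`a`b!1 2;`a`b`c;"config"]
  'The following required dictionary keys for 'config' are not provided: c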

 // @private
 // @kind function
 // @category dataCheckUtility
-// @fileoverview Apply seasonal and non-seasonal time-series differencing,
-//   error checking stationarity of the dataset following application of differencing
-// @param endog {num[]} endogenous dataset
-// @param diff {integer} non seasonal differencing component (integer)
-// @param sdict {dict} dictionary containing relevant seasonal differencing components
-// @return {dict} Seasonal and non-seasonally differenced stationary time-series
-ts.i.differ:{[endog;d;s]
+// @desc Apply seasonal and non-seasonal time-series differencing, error
+//   checking stationarity of the dataset following application of differencing
+// @param endog {number[]} Endogenous variable (time-series) from which to
+//   build a model. This is the target variable from which a value is to be
+//   predicted
+// @param d {int} Non seasonal differencing component
+// @param seasonDict {dictionary} Dictionary containing relevant seasonal
+//   differencing components
+// @return {dictionary} Seasonal and non-seasonally differenced stationary
+//   time-series
+ts.i.differ:{[endog;d;seasonDict]
   // Apply non seasonal differencing if appropriate (handling of AR/ARMA)
-  if[s~()!();s[`D]:0b];
+  if[seasonDict~()!();seasonDict[`D]:0b];
   initDiff:ts.i.diff[endog;d];
   // Apply seasonal differencing if appropriate
-  finalDiff:$[s[`D];s[`D]ts.i.seasonDiff[s`m]/initDiff;initDiff];
+  finalDiff:$[seasonDict[`D];
+    seasonDict[`D]ts.i.seasonDiff[seasonDict`m]/initDiff;
+    initDiff];
   // Check stationarity
   if[not ts.i.stationary[finalDiff];ts.i.err.stat[]];
   // Return integrated data
   `final`init!(finalDiff;initDiff)
 }
-
 // Feature extraction utilities

 // @private
 // @kind function
 // @category featureExtractUtilities
-// @fileoverview Apply a user defined unary function across a dataset
+// @desc Apply a user defined unary function across a dataset
 //   using a sliding window of specified length
 //   Note: this is a modified version of a function provided in qidioms
-//   using floating point windows instead
-//   of long windows to increase the diversity of functions that can be applied
-// @param func {lambda} unary function to be applied with the data in the sliding window
-// @param win {integer} size of the sliding window
-// @param data {num[]} data on which the sliding window and associated function
-//   are to be applied
-// @return {num[]} result of the application of the function on each of the sliding window
-//   components over the data vector
-ts.i.slidingWindowFunction:{[func;win;data]
-  0f,-1_func each{ 1_x,y }\[win#0f;data]
+//   using floating point windows instead of long windows to increase the
+//   diversity of functions that can be applied
+// @param func {fn} Unary function to be applied with the data in the sliding
+//   window
+// @param winSize {int} Size of the sliding window
+// @param data {number[]} Data on which the sliding window and associated
+//   function are to be applied
+// @return {number[]} Result of the application of the function on each of the
+//   sliding window components over the data vector
+ts.i.slidingWindowFunction:{[func;winSize;data]
+  0f,-1_func each{1_x,y}\[winSize#0f;data]
 }

+// @private
+// @kind function
+// @category featureExtractUtilities
+// @desc Set up the order for the inputs of the sliding window function
+// @param tab {table} Dataset onto which to apply the windowed functions
+// @param uniCombs {number[]} Unique combinations of columns/windows and
+//   functions to be applied to the dataset
+// @return {number[]} Result of the application of the function on each of the
+//   sliding window components over the data vector
+ts.i.setupWindow:{[tab;uniCombs]
+  ts.i.slidingWindowFunction[get string uniCombs 0;uniCombs 1;tab uniCombs 2]
+  }
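Because the window is seeded with zeros and the first result is replaced by
0f, the leading entries are only partially meaningful. For example, a 3-point
moving average:

  q).ml.ts.i.slidingWindowFunction[avg;3;1 2 3 4 5f]
  0 0.3333333 1 2 3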

 // Plotting utilities

 // @private
 // @kind function
 // @category plottingUtility
-// @fileoverview Plotting function used in the creation of plots
+// @desc Plotting function used in the creation of plots
 //   for both full and partial autocorrelation graphics
-// @param data {num[]} x-axis original dataset
-// @param vals {num[]} calculated values
-// @param m {num[]} bar plot indices
-// @param title {string} title to be given to the plot
-// @return {graph} presents a plot to screen associated with relevant analysis
+// @param data {number[]} x-axis original dataset
+// @param vals {number[]} Calculated values
+// @param m {number[]} Bar plot indices
+// @param width {number} Width of the bars within the bar plot
+// @param title {string} Title to be given to the plot
+// @return {graph} Presents a plot to screen associated with relevant analysis
 ts.i.plotFunction:{[data;vals;m;width;title]
-  plt:.p.import[`matplotlib.pyplot];
+  plt:.p.import`matplotlib.pyplot;
   conf:count[m]#1.95%sqrt count data;
   plt[`:bar][m;vals;`width pykw width%2];
-  cfgkeys:`linewidth`linestyle`color`label;
-  cfgvals:3,`dashed`red`conf_interval;
-  plt[`:plot][m;conf;pykwargs cfgkeys!cfgvals];
+  configKeys:`linewidth`linestyle`color`label;
+  configVals:3,`dashed`red`conf_interval;
+  plt[`:plot][m;conf;pykwargs configKeys!configVals];
   if[0>min vals;
-    plt[`:plot][m;neg conf;pykwargs -1_cfgkeys!cfgvals]
+    plt[`:plot][m;neg conf;pykwargs -1_configKeys!configVals]
   ];
   plt[`:legend][];
-  plt[`:xlabel][`lags];
-  plt[`:ylabel][`acf];
-  plt[`:title][title];
-  plt[`:show][];}
+  plt[`:xlabel]`lags;
+  plt[`:ylabel]`acf;
+  plt[`:title]title;
+  plt[`:show][];
+  }
diff --git a/util/README.md b/util/README.md
index 1a252ce2..d54a4356 100644
--- a/util/README.md
+++ b/util/README.md
@@ -36,6 +36,6 @@ Documentation is available on the [Utilities](https://code.kx.com/v2/ml/toolkit/
 
 ## Status
 
-The machine-learning utilities library is still in development and is available here as a beta release. Further functionality and improvements will be made to the library in the coming months.
+The machine-learning utilities library is still in development. Further functionality and improvements will be made to the library on an ongoing basis.
 
 If you have any issues, questions or suggestions, please write to ai@kx.com.
diff --git a/util/functionMapping.json b/util/functionMapping.json
new file mode 100644
index 00000000..515d35a5
--- /dev/null
+++ b/util/functionMapping.json
@@ -0,0 +1,310 @@
+{
+  "util":{
+    ".ml.imin":{
+      "function":".ml.iMin",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.imax":{
+      "function":".ml.iMax",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.df2tab_tz":{
+      "function":".ml.df2tabTimezone",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.linspace":{
+      "function":".ml.linearSpace",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.traintestsplit":{
+      "function":".ml.trainTestSplit",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.classreport":{
+      "function":".ml.classReport",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.confdict":{
+      "function":".ml.confDict",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.confmat":{
+      "function":".ml.confMatrix",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.corrmat":{
+      "function":".ml.corrMatrix",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.cvm":{
+      "function":".ml.covMatrix",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.f1score":{
+      "function":".ml.f1Score",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.fbscore":{
+      "function":".ml.fBetaScore",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.logloss":{
+      "function":".ml.logLoss",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.r2score":{
+      "function":".ml.r2Score",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.rocaucscore":{
+      "function":".ml.rocAucScore",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.tscore":{
+      "function":".ml.tScore",
+      "warning":"futureWarning",
+      "version":"3.0"
+    },
+    ".ml.tscoreeq":{
+      "function":".ml.tScoreEqual",
+      "warning":"futureWarning",
+      "version":"3.0"
+ }, + ".ml.applylabelencode":{ + "function":".ml.applyLabelEncode", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.dropconstant":{ + "function":".ml.dropConstant", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.filltab":{ + "function":".ml.fillTab", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.infreplace":{ + "function":".ml.infReplace", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.labelencode":{ + "function":".ml.labelEncode.fitTransform", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.lexiencode":{ + "function":".ml.lexiEncode.fitTransform", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.minmaxscaler":{ + "function":".ml.minMaxScaler.fitTransform", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.onehot":{ + "function":".ml.oneHot.fitTransform", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.polytab":{ + "function":".ml.polyTab", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.stdscaler":{ + "function":".ml.stdScaler.fitTransform", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.timesplit":{ + "function":".ml.timeSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.describe":{ + "function":".ml.stats.describe", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.percentile":{ + "function":".ml.stats.percentile", + "warning":"futureWarning", + "version":"3.0" + } + }, + "clust":{ + ".ml.clust.cure.cutk":{ + "function":".ml.clust.cure.cutK", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.clust.cure.cutdist":{ + "function":".ml.clust.cure.cutDist", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.clust.hc.cutk":{ + "function":".ml.clust.hc.cutK", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.clust.hc.cutdist":{ + "function":".ml.clust.hc.cutDist", + "warning":"futureWarning", + "version":"3.0" + } + }, + "fresh":{ + ".ml.fresh.createfeatures":{ + "function":".ml.fresh.createFeatures", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.fresh.sigfeat":{ + "function":".ml.fresh.sigFeat", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.fresh.ksigfeat":{ + "function":".ml.fresh.kSigFeat", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.fresh.significantfeatures":{ + "function":".ml.fresh.significantFeatures", + "warning":"futureWarning", + "version":"3.0" + } + }, + "xval":{ + ".ml.gs.kfshuff":{ + "function":".ml.gs.kfShuff", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.gs.kfsplit":{ + "function":".ml.gs.kfSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.gs.kfstrat":{ + "function":".ml.gs.kfStrat", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.gs.mcsplit":{ + "function":".ml.gs.mcSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.gs.pcsplit":{ + "function":".ml.gs.pcSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.gs.tschain":{ + "function":".ml.gs.tsChain", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.gs.tsrolls":{ + "function":".ml.gs.tsRolls", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.rs.kfshuff":{ + "function":".ml.rs.kfShuff", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.rs.kfsplit":{ + "function":".ml.rs.kfSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.rs.kfstrat":{ + "function":".ml.rs.kfStrat", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.rs.mcsplit":{ + "function":".ml.rs.mcSplit", + "warning":"futureWarning", + "version":"3.0" 
+ }, + ".ml.rs.pcsplit":{ + "function":".ml.rs.pcSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.rs.tschain":{ + "function":".ml.rs.tsChain", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.rs.tsrolls":{ + "function":".ml.rs.tsRolls", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.kfshuff":{ + "function":".ml.xv.kfShuff", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.kfsplit":{ + "function":".ml.xv.kfSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.kfstrat":{ + "function":".ml.xv.kfStrat", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.mcsplit":{ + "function":".ml.xv.mcSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.pcsplit":{ + "function":".ml.xv.pcSplit", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.tschain":{ + "function":".ml.xv.tsChain", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.tsrolls":{ + "function":".ml.xv.tsRolls", + "warning":"futureWarning", + "version":"3.0" + }, + ".ml.xv.fitscore":{ + "function":".ml.xv.fitScore", + "warning":"futureWarning", + "version":"3.0" + } + } +} diff --git a/util/init.q b/util/init.q index 44bb6867..3284ce43 100644 --- a/util/init.q +++ b/util/init.q @@ -1,3 +1,11 @@ -.ml.loadfile`:util/util.q +// util/init.q - Load utilities library +// Copyright (c) 2021 Kx Systems Inc + +.ml.loadfile`:util/utils.q +.ml.loadfile`:util/utilities.q .ml.loadfile`:util/metrics.q .ml.loadfile`:util/preproc.q +.ml.loadfile`:fresh/utils.q +.ml.loadfile`:stats/init.q + +.ml.i.deprecWarning`util diff --git a/util/metrics.q b/util/metrics.q index 1d956425..08a6d040 100644 --- a/util/metrics.q +++ b/util/metrics.q @@ -1,60 +1,346 @@ +// util/metrics.q - Metrics +// Copyright (c) 2021 Kx Systems Inc +// +// Metrics for scoring ml models + \d .ml -/ descriptive statistics -range:{max[x]-min x} -/ percentile y of list x -percentile:{r[0]+(p-i 0)*last r:0^deltas asc[x]i:0 1+\:floor p:y*-1+count x} -describe:{`count`mean`std`min`q1`q2`q3`max!flip(count;avg;sdev;min;percentile[;.25];percentile[;.5];percentile[;.75];max)@\:/:flip(exec c from meta[x]where t in"hijefpmdznuvt")#x} - -/ classification scores (x predictions, y labels, z positive label) -accuracy:{avg x=y} -precision: {sum[u&y =z]%sum u:x =z} -sensitivity:{sum[u&x =z]%sum u:y =z} -specificity:{sum[u&x<>z]%sum u:y<>z} -/ f1&fbeta scores -fbscore:{[x;y;z;b](sum[ap&pp]*1+b*b)%sum[pp:x=z]+b*b*sum ap:y=z} -f1score:fbscore[;;;1] -/ matthews correlation coefficient -matcorr:{.[-;prd raze[m](0 1;3 2)]%sqrt prd sum[m],sum each m:value confmat[x;y]} -/ confusion matrix -confmat:{(k!(2#count k)#0),0^((count each group@)each x group y)@\:k:$[1=type k:asc distinct x,y;01b;k]} -/ confusion dictionary -confdict:{`tn`fp`fn`tp!raze value confmat .(x;y)=z} -/ class report -classreport:{[x;y]k:asc distinct y; - t:`precision`recall`f1_score`support!((precision;sensitivity;f1score;{sum y=z}).\:(x;y))@/:\:k; - ([]class:`$string[k],enlist"avg/total")!flip[t],(avg;avg;avg;sum)@'t} - -/ x list of class labels (0,1,...,n-1), y list of lists of (n) probabilities (one per class) -i.EPS:1e-15 -crossentropy:logloss:{neg avg log i.EPS|y@'x} - -/ regression scores (x predictions, y values) -mse:{avg d*d:x-y} -sse:{sum d*d:x-y} -rmse:{sqrt mse[x;y]} -rmsle:{rmse . 
log(x;y)+1} -mae:{avg abs x-y} -mape:{100*avg abs 1-x%y} -smape:{100*avg abs[y-x]%abs[x]+abs y} -r2score:{1-sse[y;x]%sse[y]avg y} - -/ t-score for a test (one sample) -tscore:{[x;mu](avg[x]-mu)%sdev[x]%sqrt count x} -/ t-score for t-test (two independent samples, not equal variances) -tscoreeq:{abs[avg[x]-avg y]%sqrt(svar[x]%count x)+svar[y]%count y} - -/ covariance/correlation calculate upper triangle only -cvm:{(x+flip(not n=\:n)*x:(n#'0.0),'(x$/:'(n:til count x)_\:x)%count first x)-a*\:a:avg each x:"f"$x} -crm:{cvm[x]%u*/:u:dev each x} -/ correlation matrix, in dictionary format if input is a table -corrmat:{$[t;{x!x!/:y}cols x;]crm$[t:98=type x;value flip@;]x} - -/ exclude colinear point -i.curvepts:{(x;y)@\:where(1b,2_differ deltas[y]%deltas x),1b} -/ area under curve (x,y) -i.auc:{sum 1_deltas[x]*y-.5*deltas y} -/ ROC curve: y the actual class, p the positive probability -roc:{[y;p]{0.,x%last x}each value exec 1+i-y,y from(update sums y from`p xdesc([]y;p))where p<>next p} -/ area under ROC curve -rocaucscore:{[y;p]i.auc . i.curvepts . roc[y;p]} +// @kind function +// @category metric +// @desc Accuracy of classification results +// @param pred {int[]|boolean[]|string[]} A vector/matrix of predicted labels +// @param true {int[]|boolean[]|string[]} A vector/matrix of true labels +// @returns {float} The accuracy of predictions made +accuracy:{[pred;true] + avg pred=true + } + +// @kind function +// @category metric +// @desc Precision of a binary classifier +// @param pred {boolean[]} A vector of predicted labels +// @param true {boolean[]} A vector of true labels +// @param posClass {boolean} The positive class +// @returns {float} A measure of the precision +precision:{[pred;true;posClass] + predPos:pred=posClass; + truePos:predPos&true=posClass; + sum[truePos]%sum predPos + } + +// @kind function +// @category metric +// @desc Sensitivity of a binary classifier +// @param pred {boolean[]} A vector of predicted labels +// @param true {boolean[]} A vector of true labels +// @param posClass {boolean} The positive class +// @returns {float} A measure of the sensitivity +sensitivity:{[pred;true;posClass] + realPos:true=posClass; + truePos:realPos&pred=posClass; + sum[truePos]%sum realPos + } + +// @kind function +// @category metric +// @desc Specificity of a binary classifier +// @param pred {boolean[]} A vector of predicted labels +// @param true {boolean[]} A vector of true labels +// @param posClass {boolean} The positive class +// @returns {float} A measure of the specificity +specificity:{[pred;true;posClass] + allNeg:true<>posClass; + trueNeg:allNeg&pred<>posClass; + sum[trueNeg]%sum allNeg + } + +// @kind function +// @category metric +// @desc F-beta score for classification results +// @param pred {number[]|boolean[]} A vector of predicted labels +// @param true {number[]|boolean[]} A vector of true labels +// @param posClass {number|boolean} The positive class +// @param beta {float} The value of beta +// @returns {float} The F-beta score between predicted and true labels +fBetaScore:{[pred;true;posClass;beta] + realPos:true=posClass; + predPos:pred=posClass; + minPos:realPos&predPos; + (sum[minPos]*1+beta*beta)%sum[predPos]+beta*beta*sum realPos + } + +// @kind function +// @category metric +// @desc F-1 score for classification results +// @param pred {int[]|boolean[]|string[]} A vector of predicted labels +// @param true {int[]|boolean[]|string[]} A vector of true labels +// @param posClass {number|boolean} The positive class +// @returns {float} The F-1 score between 
predicted and true labels
+f1Score:fBetaScore[;;;1]
+
+// @kind function
+// @category metric
+// @desc Matthews-correlation coefficient
+// @param true {int[]|boolean[]|string[]} A vector of true labels
+// @param pred {int[]|boolean[]|string[]} A vector of predicted labels
+// @returns {float} The Matthews-correlation coefficient between predicted
+// and true values
+matthewCorr:{[true;pred]
+  confMat:value confMatrix[true;pred];
+  sqrtConfMat:sqrt prd sum[confMat],sum each confMat;
+  .[-;prd raze[confMat](0 1;3 2)]%sqrtConfMat
+  }
+
+// @kind function
+// @category metric
+// @desc Confusion matrix
+// @param pred {int[]|boolean[]|string[]} A vector of predicted labels
+// @param true {int[]|boolean[]|string[]} A vector of true labels
+// @returns {dictionary} A confusion matrix
+confMatrix:{[pred;true]
+  classes:asc distinct pred,true;
+  if[1=type classes;classes:01b];
+  classDict:classes!(2#count classes)#0;
+  groupClass:0^((count each group@)each pred group true)@\:classes;
+  classDict,groupClass
+  }
+
+// @kind function
+// @category metric
+// @desc True/false positives and true/false negatives
+// @param pred {int[]|boolean[]|string[]} A vector of predicted labels
+// @param true {int[]|boolean[]|string[]} A vector of true labels
+// @param posClass {number|boolean} The positive class
+// @returns {dictionary} The count of true positives (tp), true negatives (tn),
+// false positives (fp) and false negatives (fn)
+confDict:{[pred;true;posClass]
+  confKeys:`tn`fp`fn`tp;
+  confVals:raze value confMatrix .(pred;true)=posClass;
+  confKeys!confVals
+  }
+
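+// A minimal usage sketch (values illustrative): with predictions 1010b,
+// true labels 1100b and positive class 1b, each outcome occurs exactly once
+// q).ml.confDict[1010b;1100b;1b]
+// tn| 1
+// fp| 1
+// fn| 1
+// tp| 1
+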
+// @kind function
+// @category metric
+// @desc Statistical information about classification results
+// @param pred {int[]|boolean[]|string[]} A vector of predicted labels
+// @param true {int[]|boolean[]|string[]} A vector of true labels
+// @returns {table} The precision, recall, f1 scores and the support
+// (number of occurrences) of each class
+classReport:{[pred;true]
+  trueClass:asc distinct true;
+  dictCols:`precision`recall`f1_score`support;
+  funcs:(precision;sensitivity;f1Score;{sum y=z});
+  dictVals:(funcs .\:(pred;true))@/:\:trueClass;
+  dict:dictCols!dictVals;
+  classTab:([]class:`$string[trueClass],enlist"avg/total");
+  classTab!flip[dict],(avg;avg;avg;sum)@'dict
+  }
+
+// @kind function
+// @category metric
+// @desc Logarithmic loss
+// @param class {boolean[]} Class labels
+// @param prob {float[]} The probability of belonging to each class
+// @returns {float} Total logarithmic loss
+crossEntropy:logLoss:{[class;prob]
+  EPS:1e-15;
+  neg avg log EPS|prob@'class
+  }
+
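+// A minimal usage sketch (values illustrative): each label indexes its row
+// of class probabilities, so confident correct predictions score low
+// q).ml.crossEntropy[0 1;(0.9 0.1;0.2 0.8)]
+// 0.164252
+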
+// @kind function
+// @category metric
+// @desc Mean squared error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The mean squared error between predicted values and
+// the true values
+mse:{[pred;true]
+  avg diff*diff:pred-true
+  }
+
+// @kind function
+// @category metric
+// @desc Sum squared error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The sum squared error between predicted values and
+// the true values
+sse:{[pred;true]
+  sum diff*diff:pred-true
+  }
+
+// @kind function
+// @category metric
+// @desc Root mean squared error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The root mean squared error between predicted values
+// and the true values
+rmse:{[pred;true]
+  sqrt mse[pred;true]
+  }
+
+// @kind function
+// @category metric
+// @desc Root mean squared log error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The root mean squared log error between predicted values
+// and the true values
+rmsle:{[pred;true]
+  rmse . log(pred;true)+1
+  }
+
+// @kind function
+// @category metric
+// @desc Residual squared error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @param n {long} The degrees of freedom of the residual
+// @returns {float} The residual squared error between predicted values
+// and the true values
+rse:{[pred;true;n]
+  sqrt sse[pred;true]%n
+  }
+
+// @kind function
+// @category metric
+// @desc Mean absolute error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The mean absolute error between predicted values
+// and the true values
+mae:{[pred;true]
+  avg abs pred-true
+  }
+
+// @kind function
+// @category metric
+// @desc Mean absolute percentage error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The mean absolute percentage error between predicted
+// values and the true values
+mape:{[pred;true]
+  100*avg abs 1-pred%true
+  }
+
+// @kind function
+// @category metric
+// @desc Symmetric mean absolute percentage error
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The symmetric mean absolute percentage error between
+// predicted and true values
+smape:{[pred;true]
+  sumAbsVals:abs[pred]+abs true;
+  100*avg abs[true-pred]%sumAbsVals
+  }
+
+// @kind function
+// @category metric
+// @desc R2-score for regression model validation
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @returns {float} The R2-score between the true and predicted values.
+// Values close to 1 indicate good prediction, while negative values
+// indicate poor predictors of the system behavior
+r2Score:{[pred;true]
+  1-sse[true;pred]%sse[true]avg true
+  }
+
+// @kind function
+// @category metric
+// @desc R2 adjusted score for regression model validation
+// @param pred {float[]} A vector of predicted labels
+// @param true {float[]} A vector of true labels
+// @param p {long} Number of independent regressors, i.e. the number of
+// variables in your model, excluding the constant
+// @returns {float} The R2 adjusted score between the true and predicted
+// values. Values close to 1 indicate good prediction, while negative values
+// indicate poor predictors of the system behavior
+r2AdjScore:{[pred;true;p]
+  n:count pred;
+  r2:r2Score[pred;true];
+  1-(1-r2)*(n-1)%(n-p)-1
+  }
+
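+// A minimal usage sketch (values illustrative): predicting the mean of the
+// target yields an R2 of 0, while mse reports the mean squared residual
+// q).ml.mse[1 2 3f;1 2 5f]
+// 1.333333
+// q).ml.r2Score[2 2 2f;1 2 3f]
+// 0f
+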
+// @kind function
+// @category metric
+// @desc One-sample t-test score
+// @param sample {number[]} A set of samples from a distribution
+// @param mu {float} The population mean
+// @returns {float} The one sample t-score for a distribution with fewer
+// than 30 samples
+tScore:{[sample;mu]
+  (avg[sample]-mu)%sdev[sample]%sqrt count sample
+  }
+
+// @kind function
+// @category metric
+// @desc T-test for independent samples with equal variances
+// and equal sample size
+// @param sample1 {number[]} A sample from a distribution
+// @param sample2 {number[]} A sample from a distribution
+// sample1 and sample2 are independent with equal variance and sample size
+// @returns {float} Their t-test score
+tScoreEqual:{[sample1;sample2]
+  count1:count sample1;
+  count2:count sample2;
+  absAvg:abs avg[sample1]-avg sample2;
+  absAvg%sqrt(svar[sample1]%count1)+svar[sample2]%count2
+  }
+
+// @kind function
+// @category metric
+// @desc Calculate the covariance of a matrix
+// @param matrix {number[]} A matrix of samples from a distribution
+// @returns {number[]} The covariance matrix
+covMatrix:{[matrix]
+  matrix:"f"$matrix;
+  n:til count matrix;
+  avgMat:avg each matrix;
+  upperTri:matrix$/:'n _\:matrix;
+  diag:not n=\:n;
+  matrix:(n#'0.0),'upperTri%count first matrix;
+  multiplyMat:matrix+flip diag*matrix;
+  multiplyMat-avgMat*\:avgMat
+  }
+
+// @kind function
+// @category metric
+// @desc Calculate the correlation of a matrix or table
+// @param data {table|number[]} A table or matrix of samples from
+// a distribution
+// @returns {dictionary|number[]} The correlation matrix of the data
+corrMatrix:{[data]
+  dataTab:98=type data;
+  matrix:$[dataTab;value flip@;]data;
+  corrMat:i.corrMatrix matrix;
+  $[dataTab;{x!x!/:y}cols data;]corrMat
+  }
+
+// @kind function
+// @category metric
+// @desc X- and Y-axis values for an ROC curve
+// @param label {number[]|boolean[]} Label associated with a prediction
+// @param prob {float[]} Probability that each prediction belongs to
+// the positive class
+// @returns {number[]} The coordinates of the true-positive and false-positive
+// values associated with the ROC curve
+roc:{[label;prob]
+  tab:(update sums label from`prob xdesc([]label;prob));
+  probDict:exec 1+i-label,label from tab where prob<>next prob;
+  {0.,x%last x}each value probDict
+  }
+
+// @kind function
+// @category metric
+// @desc Area under an ROC curve
+// @param label {number[]|boolean[]} Label associated with a prediction
+// @param prob {float[]} Probability that each prediction belongs to
+// the positive class
+// @returns {float} The area under the ROC curve
+rocAucScore:{[label;prob]
+  i.auc . i.curvePts . roc[label;prob]
+  }
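+
+// A minimal usage sketch (values illustrative): probabilities that rank
+// every positive label above every negative label give an AUC of 1
+// q).ml.rocAucScore[0011b;0.1 0.2 0.8 0.9]
+// 1f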
diff --git a/util/mproc.q b/util/mproc.q
index 8fb5021a..e9de8518 100644
--- a/util/mproc.q
+++ b/util/mproc.q
@@ -1,11 +1,48 @@
+// util/mproc.q - Utilities for multiprocessing
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Distributes functions to worker processes
+
 \d .ml
-if[not `mproc in key .ml;.z.pd:`u#0#0i;mproc.N:0]
-.z.pc:{[f;x].z.pd:`u#.z.pd except x;f[x]}@[value;`.z.pc;{{}}]
-mproc.reg:{.z.pd,:.z.w;neg[.z.w]@/:mproc.cmds}
-mproc.init:{[n;x]
+// @kind function
+// @category multiProcess
+// @desc If the multiProc key is not already defined in `.ml`, set `.z.pd`
+// and multiProc.N to 0
+// @return {::} `.z.pd` and multiProc.N are set to 0
+if[not`multiProc in key .ml;.z.pd:`u#0#0i;multiProc.N:0]
+
+// @kind function
+// @category multiProcess
+// @desc Define what happens when the connection is closed
+// @param func {fn} Value of `.z.pc` function
+// @param proc {int} Handle to the worker process
+// @return {::} Appropriate handles are closed
+.z.pc:{[func;proc]
+  .z.pd:`u#.z.pd except proc;
+  func proc
+  }@[value;`.z.pc;{{}}]
+
+// @kind function
+// @category multiProcess
+// @desc Register the handle and pass any functions required to the
+// worker processes
+// @return {::} The handle is registered and function is passed to process
+multiProc.reg:{
+  .z.pd,:.z.w;
+  neg[.z.w]@/:multiProc.cmds
+  }
+
+// @kind function
+// @category multiProcess
+// @desc Distributes functions to worker processes
+// @param n {int} Number of processes open
+// @param func {string} Function to be passed to the process
+// @return {::} Each of the `n` worker processes evaluates `func`
+multiProc.init:{[n;func]
 if[not p:system"p";'"set port to multiprocess"];
- neg[.z.pd]@\:/:x;
- mproc.cmds,:x;
- do[0|n-mproc.N;system"q ",path,"/util/mprocw.q -pp ",string p];
- mproc.N|:n;}
+  neg[.z.pd]@\:/:func;
+  multiProc.cmds,:func;
+  do[0|n-multiProc.N;system"q ",path,"/util/mprocw.q -pp ",string p];
+  multiProc.N|:n;
+  }
diff --git a/util/mprocw.q b/util/mprocw.q
index 9930d3d2..ef1a7062 100644
--- a/util/mprocw.q
+++ b/util/mprocw.q
@@ -1,5 +1,15 @@
+// util/mprocw.q - Multiprocessing
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Multiprocessing based on command line input
+
+// Exit if `pp isn't passed as a command parameter
 if[not`pp in key .Q.opt .z.x;exit 1];
+// Exit if no values were passed with pp
 if[not count .Q.opt[.z.x]`pp;exit 2];
+// Exit if cannot open port
 if[not h:@[hopen;"J"$first .Q.opt[.z.x]`pp;0];exit 3];
+// Exit if cannot load ml.q
 @[system;"l ml/ml.q";{exit 4}]
-neg[h]`.ml.mproc.reg`
+// Register the handle and run appropriate functions
+neg[h]`.ml.multiProc.reg`
diff --git a/util/pickle.q b/util/pickle.q
index 39e30cbb..e6d4d534 100644
--- a/util/pickle.q
+++ b/util/pickle.q
@@ -1,5 +1,28 @@
+// util/pickle.q - Pickle file utilities
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Save and load python objects to and from pickle files
+
 \d .ml
-pickledump:.p.import[`pickle;`:dumps;<]
-pickleload:.p.import[`pickle;`:loads]
-picklewrap:{[b;x]$[b;{.ml.pickleload y}[;pickledump x];{y}[;x]]}
+// @kind function
+// @category pickle
+// @desc Generate python pickle dump module to save a python object
+pickleDump:.p.import[`pickle;`:dumps;<]
+
+// @kind function
+// @category pickle
+// @desc Generate python pickle loads module to load a python object
+pickleLoad:.p.import[`pickle;`:loads]
+
+// @kind function
+// @category pickle
+// @desc A wrapper function to load and save python
+// objects using pickle
+// @param module {boolean} Whether the pickle load module (1b) or
+// dump module (0b) is to 
be invoked +// @param obj {<} Python object to be saved/loaded +// @return {::;<} Object is saved/loaded +pickleWrap:{[module;obj] + $[module;{.ml.pickleLoad y}[;pickleDump obj];{y}[;obj]] + } diff --git a/util/preproc.q b/util/preproc.q index 16129bd6..1e41d93b 100644 --- a/util/preproc.q +++ b/util/preproc.q @@ -1,88 +1,365 @@ +// util/preproc.q - Preprocessing functions +// Copyright (c) 2021 Kx Systems Inc +// +// Preprocessing of data prior to training + \d .ml -/ data preprocessing - -/* x = simple table/dictionary -dropconstant:{ - if[not(typ:type x)in 98 99h;'"Data must be simple table or dictionary"]; - if[99h=typ;if[98h~type value x;'"Data cannot be a keyed table"]]; - // find keys/cols that contain non-numeric data - fc:$[typ=99h;i.fndkey;i.fndcols].(x;"csg ",upper .Q.t); - // store instructions to flip table and execute this - dt:(fdata:$[99=typ;;flip])x; - // drop constant numeric and non numeric cols/keys - fdata i.dropconst.num[fc _ dt],i.dropconst.other fc#dt - } - -// logic to find numeric and drop constant columns -i.dropconst.num:{(where 0=0^var each x)_x} -i.dropconst.other:{(where{all 1_(~':)x}each x)_x} -// Find keys relating to a specific type -i.fndkey:{where({.Q.t abs type x}each x)in y} - - -minmaxscaler:i.ap{(x-mnx)%max[x]-mnx:min x} -stdscaler :i.ap{(x-avg x)%dev x} -/ replace +/- 0w with max/min vals -infreplace :i.ap{@[x;i;:;z@[x;i:where x=y;:;0n]]}/[;-0w 0w;min,max] - -/ produce features which are combinations of n features from table x -polytab:{[x;n]flip(`$"_"sv'string c)!prd each x c@:combs[count c:cols x;n]} - -filltab:{[t;gc;tc;d] - d:$[0=count d;:t;(::)~d;c!(count c:i.fndcols[t;"ghijefcspmdznuvt"]except gc,tc)#`forward;d]; - t:flip flip[t],(`$string[k],\:"_null")!null t k:key d; - ![t;();$[count gc,:();gc!gc;0b];@[i.fillmap;`linear;,';tc][d],'k]} - -/ fill methods -i.fillmap.zero:{0^x} -i.fillmap.median:{med[x]^x} -i.fillmap.mean:{avg[x]^x} -i.fillmap.forward:{"f"$(x first where not null x)^fills x} -i.fillmap.linear:{[t;v] - if[2>count i:where not n:null v;:v]; - g:1_deltas[v i]%deltas t i; - "f"$@[v;n;:;v[i][u]+g[u]*t[n]-t[i]u:0|(i:-1_i)bin n:where n]} - -/ encode categorical features using one-hot encoding -i.onehot1:{d!"f"$x=/:d:asc distinct x} -onehot:{[x;c] - if[(::)~c;c:i.fndcols[x;"s"]]; - flip(c _ flip x),raze{[x;c](`$"_"sv'string c,'key r)!value r:i.onehot1 x c}/:[x]c,:()} - -/ encode categorical features with frequency of category occurrence -freqencode:{[x;c] - if[(::)~c;c:i.fndcols[x;"s"]]; - flip(c _ flip x),(`$string[c],\:"_freq")!{(g%sum g:count each group x)x}each x c,:()} - -/ encode categorical features with lexigraphical order -lexiencode:{[x;c] - if[(::)~c;c:i.fndcols[x;"s"]]; - flip(c _ flip x),(`$string[c],\:"_lexi")!{(asc distinct x)?x}each x c,:()} - -// Encode the a dataset to a list of integers, and provide a mapping allowing a user to -// revert new integer lists to the original version -/* x = data to be encoded and mapped -labelencode:{[x] - adx:asc distinct x; - `mapping`encoding!(adx!til count adx;adx?x) - } - -// Map a list of integers to their true representation based on a label encoding schema -/* x = data to be revert to true representation based on -/* y = label encoding map either labelencode[x]`mapping or labelencode[x] -applylabelencode:{[x;y] - if[99h<>type y;'"Input must be a dictionary"]; - $[`mapping`encoding~key y;y[`mapping]?;y?]x - } - -/ split temporal types into constituents -i.timesplit.d:{update wd:1type map;'"Input must be a dictionary"]; + $[`modelInfo`transform~key map;map[`modelInfo]?;map?]data + } 
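+
+// A minimal usage sketch (values illustrative): fit a label encoding,
+// encode the data and revert the encoded values
+// q)enc:.ml.labelEncode.fit`b`a`d`c
+// q)enc.transform[`b`a`d`c]
+// 1 0 3 2
+// q).ml.applyLabelEncode[1 0 3 2;enc]
+// `b`a`d`c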
+
+// @kind function
+// @category preprocessing
+// @desc Break specified time columns into constituent components
+// @param tab {table} Contains time columns
+// @param timeCols {symbol[]} Columns to apply encoding to, if set to ::
+// all columns with date/time types will be encoded
+// @return {table} All time or date types broken into labeled versions
+// of their constituent components
+timeSplit:{[tab;timeCols]
+  if[(::)~timeCols;timeCols:i.findCols[tab;"dmntvupz"]];
+  timeDict:i.timeDict/:[tab]timeCols,:();
+  flip(timeCols _ flip tab),raze timeDict
+  }
diff --git a/util/tests/metric.t b/util/tests/metric.t
index d18f56bb..8620fd6c 100644
--- a/util/tests/metric.t
+++ b/util/tests/metric.t
@@ -28,17 +28,6 @@ ymb:100 10#yb
 plaintab:([]4 5 6.;1 2 3.;-1 -2 -3.;0.4 0.5 0.6)
 plaintabn:plaintab,'([]x4:1 3 0n)
 
-.ml.range[til 63] ~ 62
-.ml.range[5] ~ 0
-.ml.range[0 1 3 2f]~3f
-.ml.range[0 1 0n 2]~2f
-.ml.percentile[x;0.75]~np[`:percentile][x;75]`
-.ml.percentile[x;0.02]~np[`:percentile][x;2]`
-.ml.percentile[xf;0.5]~np[`:percentile][xf;50]`
-.ml.percentile[3 0n 4 4 0n 4 4 3 3 4;0.5]~3.5
-("f"$flip value .ml.describe[plaintab])~flip .ml.df2tab .p.import[`pandas][`:DataFrame.describe][.ml.tab2df[plaintab]]
-("f"$flip value .ml.describe[plaintabn])~flip (.ml.df2tab .p.import[`pandas][`:DataFrame.describe][.ml.tab2df[plaintab]]),'"f"$([]x4:3 2,sdev[1 3 0n],1 0 1 2 3)
-
 .ml.accuracy[x;y] ~ skmetric[`:accuracy_score][x;y]`
 .ml.accuracy[xb;yb] ~ 0.5
 .ml.accuracy[3 2 2 0n 4;0n 4 3 2 4]~0.2
@@ -65,45 +54,45 @@ plaintabn:plaintab,'([]x4:1 3 0n)
 .ml.specificity[10#1b;10#0b;1b]~0f
 .ml.specificity[10#1b;10#1b;0b]~1f
 
-.ml.fbscore[xb;yb;1b;0.02] ~ fbscore[yb;xb;`beta pykw 0.02]`
-.ml.fbscore[xb;yb;1b;0.5] ~ fbscore[yb;xb;`beta pykw 0.5]`
-.ml.fbscore[xb;yb;1b;1.5] ~ fbscore[yb;xb;`beta pykw 1.5]`
-.ml.fbscore[xb;yb;0b;1.5] ~ 0.493670886075949
-.ml.fbscore[1000#1b;yb;0b;.5]~0f
-.ml.fbscore[xb;1000#1b;0b;.5]~0f
-.ml.fbscore[1000#0b;1000#1b;1b;.2]~0f
-
-.ml.f1score[xb;yb;0b] ~ f1[xb;yb;`pos_label pykw 0]`
-.ml.f1score[xb;yb;1b] ~ f1[xb;yb;`pos_label pykw 1]`
-.ml.f1score[xb;1000#0b;1b]~0f
-.ml.f1score[1000#1b;yb;1b]~f1[1000#1b;yb;`pos_label pykw 1]`
-.ml.f1score[10#1b;10#0b;1b]~f1[10#1b;10#0b;`pos_label pykw 1]`
-
-.ml.matcorr[xb;yb]~mcoeff[xb;yb]`
-.ml.matcorr[110010b;111111b]~0n
-.ml.matcorr[111111b;110010b]~0n
-
-(value .ml.confmat[xb;yb])~(300 400;100 200)
-(value .ml.confmat[2 3# 0 0 1 1 0 0;2 3# 1 0 1 0 0 1]) ~ (0 1 0;0 0 0;1 0 0)
-(value .ml.confmat[1 2 3;3 2 1])~(0 0 1;0 1 0;1 0 0)
-(value .ml.confmat[1 2 3f;3 2 1f])~(0 0 1;0 1 0;1 0 0)
-(value .ml.confmat[3#1b;3#0b])~(0 3;0 0)
-
-.ml.confdict[xb;yb;1b] ~ `tn`fp`fn`tp!300 400 100 200
-.ml.confdict[3#0b;3#1b;0b] ~`tn`fp`fn`tp!0 3 0 0
-.ml.confdict[3#1b;3#0b;0b]~`tn`fp`fn`tp!0 0 3 0
-
-.ml.classreport[110b;101b]~1!flip`class`precision`recall`f1_score`support!((`$string each 0 1),`$"avg/total";0 0.5 0.25; 0 0.5 0.25;0.0 0.5 0.25;1 2 3i)
-.ml.classreport[3 3 5 2 5 1;3 5 2 3 5 1]~1!flip`class`precision`recall`f1_score`support!((`$string each 1 2 3 5),`$"avg/total";1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 1 2 2 6i)
-.ml.classreport[3 3 5 2 5 1f;3 5 2 3 5 1f]~1!flip`class`precision`recall`f1_score`support!((`$string each 1 2 3 5),`$"avg/total";1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 1 2 2 6i)
-.ml.classreport[3 3 5 0n 5 1;3 5 2 3 5 0n]~1!flip`class`precision`recall`f1_score`support!((`$string each 0n 2 3 5),`$"avg/total";0 0n 0.5 0.5 0.33333333333333;0 0 0.5 0.5 0.25;0 0 0.5 0.5 0.25;1 1 2 2 6i)
-
-{.ml.logloss[x;y]~logloss[x;y]`}[1000?0b;(1-p),'p:1000?1f] -{.ml.logloss[x;y]~logloss[x;y]`}[1000?0b;(1-p),'p:1000?1i] -.ml.logloss[10#0b;(1-p),'p:10?1i]~-0f -(floor .ml.logloss[10110b;(2 0n;1 1; 3 1;0n 2; 3 3)])~floor 6 -(floor .ml.logloss[1000?0b;(1-p),'p:1000#0n])~34 -{.ml.crossentropy[x;y]~logloss[x;y]`}[(first idesc@)each p;p%:sum each p:1000 5#5000?1f] +.ml.fBetaScore[xb;yb;1b;0.02] ~ fbscore[yb;xb;`beta pykw 0.02]` +.ml.fBetaScore[xb;yb;1b;0.5] ~ fbscore[yb;xb;`beta pykw 0.5]` +.ml.fBetaScore[xb;yb;1b;1.5] ~ fbscore[yb;xb;`beta pykw 1.5]` +.ml.fBetaScore[xb;yb;0b;1.5] ~ 0.493670886075949 +.ml.fBetaScore[1000#1b;yb;0b;.5]~0f +.ml.fBetaScore[xb;1000#1b;0b;.5]~0f +.ml.fBetaScore[1000#0b;1000#1b;1b;.2]~0f + +.ml.f1Score[xb;yb;0b] ~ f1[xb;yb;`pos_label pykw 0]` +.ml.f1Score[xb;yb;1b] ~ f1[xb;yb;`pos_label pykw 1]` +.ml.f1Score[xb;1000#0b;1b]~0f +.ml.f1Score[1000#1b;yb;1b]~f1[1000#1b;yb;`pos_label pykw 1]` +.ml.f1Score[10#1b;10#0b;1b]~f1[10#1b;10#0b;`pos_label pykw 1]` + +.ml.matthewCorr[xb;yb]~mcoeff[xb;yb]` +.ml.matthewCorr[110010b;111111b]~0n +.ml.matthewCorr[111111b;110010b]~0n + +(value .ml.confMatrix[xb;yb])~(300 400;100 200) +(value .ml.confMatrix[2 3# 0 0 1 1 0 0;2 3# 1 0 1 0 0 1]) ~ (0 1 0;0 0 0;1 0 0) +(value .ml.confMatrix[1 2 3;3 2 1])~(0 0 1;0 1 0;1 0 0) +(value .ml.confMatrix[1 2 3f;3 2 1f])~(0 0 1;0 1 0;1 0 0) +(value .ml.confMatrix[3#1b;3#0b])~(0 3;0 0) + +.ml.confDict[xb;yb;1b] ~ `tn`fp`fn`tp!300 400 100 200 +.ml.confDict[3#0b;3#1b;0b] ~`tn`fp`fn`tp!0 3 0 0 +.ml.confDict[3#1b;3#0b;0b]~`tn`fp`fn`tp!0 0 3 0 + +.ml.classReport[110b;101b]~1!flip`class`precision`recall`f1_score`support!((`$string each 0 1),`$"avg/total";0 0.5 0.25; 0 0.5 0.25;0.0 0.5 0.25;1 2 3i) +.ml.classReport[3 3 5 2 5 1;3 5 2 3 5 1]~1!flip`class`precision`recall`f1_score`support!((`$string each 1 2 3 5),`$"avg/total";1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 1 2 2 6i) +.ml.classReport[3 3 5 2 5 1f;3 5 2 3 5 1f]~1!flip`class`precision`recall`f1_score`support!((`$string each 1 2 3 5),`$"avg/total";1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 0 0.5 0.5 0.5;1 1 2 2 6i) +.ml.classReport[3 3 5 0n 5 1;3 5 2 3 5 0n]~1!flip`class`precision`recall`f1_score`support!((`$string each 0n 2 3 5),`$"avg/total";0 0n 0.5 0.5 0.33333333333333;0 0 0.5 0.5 0.25;0 0 0.5 0.5 0.25;1 1 2 2 6i) + +{.ml.logLoss[x;y]~logloss[x;y]`}[1000?0b;(1-p),'p:1000?1f] +{.ml.logLoss[x;y]~logloss[x;y]`}[1000?0b;(1-p),'p:1000?1i] +.ml.logLoss[10#0b;(1-p),'p:10?1i]~-0f +(floor .ml.logLoss[10110b;(2 0n;1 1; 3 1;0n 2; 3 3)])~floor 6 +(floor .ml.logLoss[1000?0b;(1-p),'p:1000#0n])~34 +{.ml.crossEntropy[x;y]~logloss[x;y]`}[(first idesc@)each p;p%:sum each p:1000 5#5000?1f] .ml.mse[x;y] ~ skmetric[`:mean_squared_error][x;y]` .ml.mse[xf;yf] ~ skmetric[`:mean_squared_error][xf;yf]` .ml.mse[x;x]~0f @@ -138,35 +127,35 @@ plaintabn:plaintab,'([]x4:1 3 0n) .ml.smape[xm;ym]~{smape[x;y]}'[flip xm;flip ym] .ml.smape[x;x]~0f .ml.smape[1 0n 4 2 0n;1 2 4 3 1]~6.666666666666666667 -.ml.r2score[xf;yf] ~ r2[yf;xf]` -.ml.r2score[xf;xf] ~ r2[xf;xf]` -.ml.r2score[2 2 2;1 2 3] ~ r2[1 2 3;2 2 2]` -.ml.r2score[x;x]~1f -.ml.r2score[1 0n 4 2 0n;1 2 4 2 1]~1f -.ml.tscore[x;y] ~first stats[`:ttest_1samp][x;y]` -.ml.tscore[xf;yf]~first stats[`:ttest_1samp][xf;yf]` -.ml.tscore[xb;yb]~first stats[`:ttest_1samp][xb;yb]` -.ml.tscore[x;x]~first stats[`:ttest_1samp][x;x]` -.ml.tscoreeq[x;y]~abs first stats[`:ttest_ind][x;y]` -.ml.tscoreeq[xf;yf]~abs first stats[`:ttest_ind][xf;yf]` -.ml.tscoreeq[xb;yb]~abs first stats[`:ttest_ind][xb;yb]` -.ml.tscoreeq[x;x]~abs first 
stats[`:ttest_ind][x;x]` -.ml.cvm[flip value flip plaintab]~np[`:cov][flip value flip plaintab;`bias pykw 1b]` -.ml.cvm[(10110b;01110b)]~(0.24 0.04;0.04 0.24) -.ml.cvm[(10110b;11111b)]~(0.24 0f;0 0f) -.ml.cvm[(11111b;11111b)]~(0 0f;0 0f) -.ml.cvm[(10110b;1101b,0n)]~(0.24 0n;2#0n) -.ml.crm[(1 2;2 1)]~(2 2#1 -1 -1 1f) -.ml.crm[(011b;001b)]~(1 0.5;0.5 1) -.ml.crm[(1111b;1111b)]~(2 2#4#0n) -.ml.crm[(1 1 2;1 2 0n)]~(1 0n;2#0n) -(value .ml.corrmat[plaintab]) ~ "f"$([]1 1 -1 1;1 1 -1 1;-1 -1 1 -1;1 1 -1 1) -.ml.corrmat[(0011b;1010b)]~(1 0f;0 1f) -.ml.corrmat[(0011b;1111b)]~(1 0n;2#0n) -.ml.corrmat[(1111b;1111b)]~(2 2#2#0n) -.ml.corrmat[(1 1 2;1 2 0n)]~(1 0n;2#0n) -{.ml.rocaucscore[x;y]~rocau[x;y]`}[10?0b;10?1f] -.ml.rocaucscore[10#01b;10#1f]~0.5 -.ml.rocaucscore[10#0b;10?1f]~0f -.ml.rocaucscore[10#1b;10#0f]~0f -.ml.rocaucscore[1011000110b;0n 0.1 0.2 0.1 0.3 0.4 0.2 0.4 0.3 0.2]~0.525 +.ml.r2Score[xf;yf] ~ r2[yf;xf]` +.ml.r2Score[xf;xf] ~ r2[xf;xf]` +.ml.r2Score[2 2 2;1 2 3] ~ r2[1 2 3;2 2 2]` +.ml.r2Score[x;x]~1f +.ml.r2Score[1 0n 4 2 0n;1 2 4 2 1]~1f +.ml.tScore[x;y] ~first stats[`:ttest_1samp][x;y]` +.ml.tScore[xf;yf]~first stats[`:ttest_1samp][xf;yf]` +.ml.tScore[xb;yb]~first stats[`:ttest_1samp][xb;yb]` +.ml.tScore[x;x]~first stats[`:ttest_1samp][x;x]` +.ml.tScoreEqual[x;y]~abs first stats[`:ttest_ind][x;y]` +.ml.tScoreEqual[xf;yf]~abs first stats[`:ttest_ind][xf;yf]` +.ml.tScoreEqual[xb;yb]~abs first stats[`:ttest_ind][xb;yb]` +.ml.tScoreEqual[x;x]~abs first stats[`:ttest_ind][x;x]` +.ml.covMatrix[flip value flip plaintab]~np[`:cov][flip value flip plaintab;`bias pykw 1b]` +.ml.covMatrix[(10110b;01110b)]~(0.24 0.04;0.04 0.24) +.ml.covMatrix[(10110b;11111b)]~(0.24 0f;0 0f) +.ml.covMatrix[(11111b;11111b)]~(0 0f;0 0f) +.ml.covMatrix[(10110b;1101b,0n)]~(0.24 0n;2#0n) +.ml.corrMatrix[(1 2;2 1)]~(2 2#1 -1 -1 1f) +.ml.corrMatrix[(011b;001b)]~(1 0.5;0.5 1) +.ml.corrMatrix[(1111b;1111b)]~(2 2#4#0n) +.ml.corrMatrix[(1 1 2;1 2 0n)]~(1 0n;2#0n) +(value .ml.corrMatrix[plaintab]) ~ "f"$([]1 1 -1 1;1 1 -1 1;-1 -1 1 -1;1 1 -1 1) +.ml.corrMatrix[(0011b;1010b)]~(1 0f;0 1f) +.ml.corrMatrix[(0011b;1111b)]~(1 0n;2#0n) +.ml.corrMatrix[(1111b;1111b)]~(2 2#2#0n) +.ml.corrMatrix[(1 1 2;1 2 0n)]~(1 0n;2#0n) +{.ml.rocAucScore[x;y]~rocau[x;y]`}[10?0b;10?1f] +.ml.rocAucScore[10#01b;10#1f]~0.5 +.ml.rocAucScore[10#0b;10?1f]~0f +.ml.rocAucScore[10#1b;10#0f]~0f +.ml.rocAucScore[1011000110b;0n 0.1 0.2 0.1 0.3 0.4 0.2 0.4 0.3 0.2]~0.525 diff --git a/util/tests/preproctst.t b/util/tests/preproctst.t index c3fc2581..09c97824 100644 --- a/util/tests/preproctst.t +++ b/util/tests/preproctst.t @@ -17,12 +17,18 @@ tab:([]sym:`a`a`a`b`b;time:`time$til 5;@[5#0n;2 4;:;1f];@["f"$til 5;4;:;0n]) timetab:([]`timestamp$(2000.01.01+til 3);1 3 2;2 1 3) timetabn:([]`timestamp$(2000.01.01+til 3),0n;1 3 3 2;2 1 3 3) +\S 42 + x:1000?40 y:1000?40 xf:1000?100f yf:1000?100f xb:1000#0101101011b yb:1000#0000111000b +scale1:(2 3f;4 2f;5 3f) +scale2:3 2 5 4 1f +scale3:0011b +scale4:3 2#3 5 1 0n 4 0n onehotx:`a`p`l`h`j symtf:([]`a`b`b`a`a;"f"$til 5) symti:([]`a`b`b`a`a;til 5) @@ -36,93 +42,144 @@ tf:([]1000?500f;1000#30f;1000?1000f;1000?100f) tb:([]1000?0b;1000#1b;1000?0b;1000?0b) infdict:`x`x1`x2!(0 1 2 0w;0 1 2 -0w;1 2 3 0w) nt:([]101b;000b;1 2 0n) +keyedinfs:([k:1 2]x:0 0W) -.ml.dropconstant[ti]~flip `x`x2`x3!ti`x`x2`x3 -.ml.dropconstant[tf]~flip `x`x2`x3!tf`x`x2`x3 -.ml.dropconstant[tb]~flip `x`x2`x3!tb`x`x2`x3 -.ml.dropconstant[nt]~([]101b;x2:1 2 0n) -.ml.dropconstant[nulltab]~select x,x1,x2,x3 from nulltab +.ml.dropConstant[ti]~flip 
`x`x2`x3!ti`x`x2`x3 +.ml.dropConstant[tf]~flip `x`x2`x3!tf`x`x2`x3 +.ml.dropConstant[tb]~flip `x`x2`x3!tb`x`x2`x3 +.ml.dropConstant[flip ti]~`x`x2`x3!ti`x`x2`x3 +.ml.dropConstant[flip tf]~`x`x2`x3!tf`x`x2`x3 +.ml.dropConstant[flip tb]~`x`x2`x3!tb`x`x2`x3 +.ml.dropConstant[nt]~([]101b;x2:1 2 0n) +.ml.dropConstant[nulltab]~select x,x1,x2,x3 from nulltab MinMaxScaler[`:fit][flip plainmat]; +minMaxKeys:`minData`maxData +minMax1:.ml.minMaxScaler.fit[plainmat] +minMax2:.ml.minMaxScaler.fit[scale1] +minMax3:.ml.minMaxScaler.fit[scale2] +minMax4:.ml.minMaxScaler.fit[scale3] +minMax5:.ml.minMaxScaler.fit[scale4] + +minMax1[`modelInfo]~minMaxKeys!(4 1 -3 0.4f;6 3 -1 0.6f) +minMax2[`modelInfo]~minMaxKeys!(2 2 3f;3 4 5f) +minMax3[`modelInfo]~minMaxKeys!1 5f +minMax4[`modelInfo]~minMaxKeys!01b +minMax5[`modelInfo]~minMaxKeys!(3 1 4f;5 1 4f) + +.ml.minMaxScaler.fitTransform[plainmat]~flip"f"$MinMaxScaler[`:transform][flip plainmat]` +.ml.minMaxScaler.fitTransform[scale1]~(0 1f;1 0f;1 0f) +.ml.minMaxScaler.fitTransform[scale2]~0.5 0.25 1 0.75 0f +.ml.minMaxScaler.fitTransform[scale3]~0 0 1 1f +.ml.minMaxScaler.fitTransform[scale4]~(0 1f;2#0n;2#0n) +minMax2.transform[scale4]~(1 3f;-0.5 0n;0.5 0n) +minMax3.transform[5#y]~5.75 1.75 9.5 5.5 4.25 + StdScaler[`:fit][flip plainmat]; -.ml.minmaxscaler[plainmat] ~ flip"f"$MinMaxScaler[`:transform][flip plainmat]` -.ml.minmaxscaler[(2 3f;4 2f;5 3f)]~(0 1f;1 0f;1 0f) -.ml.minmaxscaler[3 2 5 4 1f]~0.5 0.25 1 0.75 0f -.ml.minmaxscaler[0011b]~0 0 1 1f -.ml.minmaxscaler[3 2#3 5 1 0n 4 0n]~(0 1f;2#0n;2#0n) - -.ml.stdscaler[plainmat] ~ flip"f"$StdScaler[`:transform][flip plainmat]` -.ml.stdscaler[(2 3f;4 2f;5 3f)]~(-1 1f;1 -1f;1 -1f) -.ml.stdscaler[xf]~scale[xf]` -.ml.stdscaler[y]~scale[y]` -.ml.stdscaler[yb]~scale[yb]` -.ml.stdscaler[3 2#2 4 1 0n 2 0n]~(-1 1f;2#0n;2#0n) - -.ml.infreplace[infdict]~`x`x1`x2!"f"$(0 1 2 2;0 1 2 0;1 2 3 3) -.ml.infreplace[flip infdict]~flip `x`x1`x2!"f"$(0 1 2 2;0 1 2 0;1 2 3 3) -.ml.infreplace[infdict`x]~0 1 2 2f - -.ml.polytab[([] 2 4 1f;3 4 1f;3 2 3f);2]~([]x_x1:6 16 1f;x_x2:6 8 3f;x1_x2:9 8 3f) -.ml.polytab[([] 2 4 1;3 4 1;3 2 3);2]~([]x_x1:6 16 1;x_x2:6 8 3;x1_x2:9 8 3) -.ml.polytab[([]101b;110b;100b);2]~([]x_x1:1 0 0i;x_x2:1 0 0i;x1_x2:1 0 0i) -.ml.polytab[nt;2]~([]x_x1:0 0 0i;x_x2:1 0 0n;x1_x2:0 0 0n) -.ml.polytab[([] 0n 0n;2 3;1 2);2]~([]x_x1:2#0n;x_x2:2#0n;x1_x2:2 6) - -.ml.filltab[tab;0#();`time;`x1`x!`linear`mean]~flip`sym`time`x`x1`x1_null`x_null!(`a`a`a`b`b;00:00:00.000 00:00:00.001 00:00:00.002 00:00:00.003 00:00:00.004;1 1 1 1 1f;0 1 2 3 4f;00001b;11010b) -.ml.filltab[tab;`sym;`time;()!()]~tab -.ml.filltab[tab;`sym;`time;::]~flip`sym`time`x`x1`x_null`x1_null!(`a`a`a`b`b;00:00:00.000 00:00:00.001 00:00:00.002 00:00:00.003 00:00:00.004;1 1 1 1 1f;0 1 2 3 3f;11010b;00001b) -(select x4,x5,x1_null,x3_null from .ml.filltab[nulltab;`x2;x;`x1`x3!`median`mean])~([]x4:5#0;x5:5#0n;x1_null:00100b;x3_null:11000b) -.ml.filltab[tab,'flip (enlist `x2)!enlist 5#0n;`sym;`time;`x1`x`x2!`median`mean`max]~flip`sym`time`x`x1`x2`x1_null`x_null`x2_null!(`a`a`a`b`b;00:00:00.000 00:00:00.001 00:00:00.002 00:00:00.003 00:00:00.004;1 1 1 1 1f;0 1 2 3 3f;5#0n;00001b;11010b;11111b) - -.ml.onehot[symtf;`x] ~"f"$([] x1:til 5;x_a:1 0 0 1 1;x_b: 0 1 1 0 0) -.ml.onehot[symtf;::] ~"f"$([] x1:til 5;x_a:1 0 0 1 1;x_b: 0 1 1 0 0) -.ml.onehot[symti;`x] ~([] x1:til 5;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) -.ml.onehot[symti;::] ~([] x1:til 5;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) -.ml.onehot[symtb;`x]~([] x1:11001b;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) -.ml.onehot[symtb;::]~([] 
x1:11001b;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) -.ml.onehot[symtn;`x]~([]x1:til 5;x_:0 0 0 1 0f;x_a:1 0 0 0 1f;x_b:0 1 1 0 0f) -.ml.onehot[symtn;::]~([]x1:til 5;x_:0 0 0 1 0f;x_a:1 0 0 0 1f;x_b:0 1 1 0 0f) -.ml.onehot[symm;::]~([]x1:til 5;x_a:1 0 0 1 1f;x_b:0 1 1 0 0f;x2_q:1 0 1 1 0f;x2_w:0 1 0 0 1f) - -.ml.freqencode[symtf;`x]~(delete x from symtf),'([]x_freq:0.6 0.4 0.4 0.6 0.6) -.ml.freqencode[symtf;::]~(delete x from symtf),'([]x_freq:0.6 0.4 0.4 0.6 0.6) -.ml.freqencode[symti;`x]~(delete x from symti),'([]x_freq:0.6 0.4 0.4 0.6 0.6) -.ml.freqencode[symti;::]~(delete x from symti),'([]x_freq:0.6 0.4 0.4 0.6 0.6) -.ml.freqencode[symtb;`x]~(delete x from symtb),'([]x_freq:0.6 0.4 0.4 0.6 0.6) -.ml.freqencode[symtb;::]~(delete x from symtb),'([]x_freq:0.6 0.4 0.4 0.6 0.6) -.ml.freqencode[symtn;`x]~([] x1:til 5;x_freq:0.4 0.4 0.4 0.2 0.4) -.ml.freqencode[symtn;::]~([] x1:til 5;x_freq:0.4 0.4 0.4 0.2 0.4) -.ml.freqencode[symm;::]~([]x1:til 5;x_freq:0.6 0.4 0.4 0.6 0.6;x2_freq:0.6 0.4 0.6 0.6 0.4) - -.ml.lexiencode[symtf;`x]~(delete x from symtf),'([]x_lexi:0 1 1 0 0) -.ml.lexiencode[symtf;::]~(delete x from symtf),'([]x_lexi:0 1 1 0 0) -.ml.lexiencode[symti;`x]~(delete x from symti),'([]x_lexi:0 1 1 0 0) -.ml.lexiencode[symti;::]~(delete x from symti),'([]x_lexi:0 1 1 0 0) -.ml.lexiencode[symtb;`x]~(delete x from symtb),'([]x_lexi:0 1 1 0 0) -.ml.lexiencode[symtb;::]~(delete x from symtb),'([]x_lexi:0 1 1 0 0) -.ml.lexiencode[symtn;`x]~([] x1:til 5;x_lexi:1 2 2 0 1) -.ml.lexiencode[symtn;::]~([] x1:til 5;x_lexi:1 2 2 0 1) -.ml.lexiencode[symm;::]~([]x1:til 5;x_lexi: 0 1 1 0 0;x2_lexi:0 1 0 0 1) +stdScaleKeys:`avgData`devData +stdScale1:.ml.stdScaler.fit[plainmat] +stdScale2:.ml.stdScaler.fit[scale1] +stdScale3:.ml.stdScaler.fit[xf] +stdScale4:.ml.stdScaler.fit[y] +stdScale5:.ml.stdScaler.fit[yb] +stdScale6:.ml.stdScaler.fit[scale4] + +key[stdScale1[`modelInfo]]~stdScaleKeys +key[stdScale2[`modelInfo]]~stdScaleKeys +key[stdScale3[`modelInfo]]~stdScaleKeys +key[stdScale4[`modelInfo]]~stdScaleKeys +key[stdScale5[`modelInfo]]~stdScaleKeys +key[stdScale6[`modelInfo]]~stdScaleKeys + +stdScale1.transform[plainmat]~flip"f"$StdScaler[`:transform][flip plainmat]` +stdScale2.transform[scale1]~(-1 1f;1 -1f;1 -1f) +stdScale3.transform[xf]~scale[xf]` +stdScale4.transform[y]~scale[y]` +stdScale5.transform[yb]~scale[yb]` +stdScale6.transform[scale4]~(-1 1f;2#0n;2#0n) +stdScale2.transform[scale4]~(1 5f;-2 0n;0 0n) + +.ml.infReplace[infdict]~`x`x1`x2!"f"$(0 1 2 2;0 1 2 0;1 2 3 3) +.ml.infReplace[flip infdict]~flip `x`x1`x2!"f"$(0 1 2 2;0 1 2 0;1 2 3 3) +.ml.infReplace[infdict`x]~0 1 2 2f +.ml.infReplace[keyedinfs]~([k:1 2]x:0 0) + +.ml.polyTab[([] 2 4 1f;3 4 1f;3 2 3f);2]~([]x_x1:6 16 1f;x_x2:6 8 3f;x1_x2:9 8 3f) +.ml.polyTab[([] 2 4 1;3 4 1;3 2 3);2]~([]x_x1:6 16 1;x_x2:6 8 3;x1_x2:9 8 3) +.ml.polyTab[([]101b;110b;100b);2]~([]x_x1:1 0 0i;x_x2:1 0 0i;x1_x2:1 0 0i) +.ml.polyTab[nt;2]~([]x_x1:0 0 0i;x_x2:1 0 0n;x1_x2:0 0 0n) +.ml.polyTab[([] 0n 0n;2 3;1 2);2]~([]x_x1:2#0n;x_x2:2#0n;x1_x2:2 6) + +.ml.fillTab[tab;0#();`time;`x1`x!`linear`mean]~flip`sym`time`x`x1`x1_null`x_null!(`a`a`a`b`b;00:00:00.000 00:00:00.001 00:00:00.002 00:00:00.003 00:00:00.004;1 1 1 1 1f;0 1 2 3 4f;00001b;11010b) +.ml.fillTab[tab;`sym;`time;()!()]~tab +.ml.fillTab[tab;`sym;`time;::]~flip`sym`time`x`x1`x_null`x1_null!(`a`a`a`b`b;00:00:00.000 00:00:00.001 00:00:00.002 00:00:00.003 00:00:00.004;1 1 1 1 1f;0 1 2 3 3f;11010b;00001b) +(select x4,x5,x1_null,x3_null from 
.ml.fillTab[nulltab;`x2;x;`x1`x3!`median`mean])~([]x4:5#0;x5:5#0n;x1_null:00100b;x3_null:11000b) +.ml.fillTab[tab,'flip (enlist `x2)!enlist 5#0n;`sym;`time;`x1`x`x2!`median`mean`max]~flip`sym`time`x`x1`x2`x1_null`x_null`x2_null!(`a`a`a`b`b;00:00:00.000 00:00:00.001 00:00:00.002 00:00:00.003 00:00:00.004;1 1 1 1 1f;0 1 2 3 3f;5#0n;00001b;11010b;11111b) + +.ml.oneHot.fitTransform[symtf;`x] ~"f"$([] x1:til 5;x_a:1 0 0 1 1;x_b: 0 1 1 0 0) +.ml.oneHot.fitTransform[symtf;::] ~"f"$([] x1:til 5;x_a:1 0 0 1 1;x_b: 0 1 1 0 0) +.ml.oneHot.fitTransform[symti;`x] ~([] x1:til 5;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) +.ml.oneHot.fitTransform[symti;::] ~([] x1:til 5;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) +.ml.oneHot.fitTransform[symtb;`x]~([] x1:11001b;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) +.ml.oneHot.fitTransform[symtb;::]~([] x1:11001b;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) +.ml.oneHot.fitTransform[symtn;`x]~([]x1:til 5;x_:0 0 0 1 0f;x_a:1 0 0 0 1f;x_b:0 1 1 0 0f) +.ml.oneHot.fitTransform[symtn;::]~([]x1:til 5;x_:0 0 0 1 0f;x_a:1 0 0 0 1f;x_b:0 1 1 0 0f) +.ml.oneHot.fitTransform[symm;::]~([]x1:til 5;x_a:1 0 0 1 1f;x_b:0 1 1 0 0f;x2_q:1 0 1 1 0f;x2_w:0 1 0 0 1f) + +oneHot1:.ml.oneHot.fit[symtf;::] +oneHot1.transform[symtb;::]~([] x1:11001b;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) +oneHot1.transform[symti;::]~([] x1:til 5;x_a:1 0 0 1 1f;x_b: 0 1 1 0 0f) +oneHot1.transform[symm;`x`x2!`x`x]~([]x1:til 5;x_a:1 0 0 1 1f;x_b:0 1 1 0 0f;x2_a:5#0f;x2_b:5#0f) + +.ml.freqEncode[symtf;`x]~(delete x from symtf),'([]x_freq:0.6 0.4 0.4 0.6 0.6) +.ml.freqEncode[symtf;::]~(delete x from symtf),'([]x_freq:0.6 0.4 0.4 0.6 0.6) +.ml.freqEncode[symti;`x]~(delete x from symti),'([]x_freq:0.6 0.4 0.4 0.6 0.6) +.ml.freqEncode[symti;::]~(delete x from symti),'([]x_freq:0.6 0.4 0.4 0.6 0.6) +.ml.freqEncode[symtb;`x]~(delete x from symtb),'([]x_freq:0.6 0.4 0.4 0.6 0.6) +.ml.freqEncode[symtb;::]~(delete x from symtb),'([]x_freq:0.6 0.4 0.4 0.6 0.6) +.ml.freqEncode[symtn;`x]~([] x1:til 5;x_freq:0.4 0.4 0.4 0.2 0.4) +.ml.freqEncode[symtn;::]~([] x1:til 5;x_freq:0.4 0.4 0.4 0.2 0.4) +.ml.freqEncode[symm;::]~([]x1:til 5;x_freq:0.6 0.4 0.4 0.6 0.6;x2_freq:0.6 0.4 0.6 0.6 0.4) + +.ml.lexiEncode.fitTransform[symtf;`x]~(delete x from symtf),'([]x_lexi:0 1 1 0 0) +.ml.lexiEncode.fitTransform[symtf;::]~(delete x from symtf),'([]x_lexi:0 1 1 0 0) +.ml.lexiEncode.fitTransform[symti;`x]~(delete x from symti),'([]x_lexi:0 1 1 0 0) +.ml.lexiEncode.fitTransform[symti;::]~(delete x from symti),'([]x_lexi:0 1 1 0 0) +.ml.lexiEncode.fitTransform[symtb;`x]~(delete x from symtb),'([]x_lexi:0 1 1 0 0) +.ml.lexiEncode.fitTransform[symtb;::]~(delete x from symtb),'([]x_lexi:0 1 1 0 0) +.ml.lexiEncode.fitTransform[symtn;`x]~([] x1:til 5;x_lexi:1 2 2 0 1) +.ml.lexiEncode.fitTransform[symtn;::]~([] x1:til 5;x_lexi:1 2 2 0 1) +.ml.lexiEncode.fitTransform[symm;::]~([]x1:til 5;x_lexi: 0 1 1 0 0;x2_lexi:0 1 0 0 1) + +lexi1:.ml.lexiEncode.fit[symtf;::] +lexi1.transform[symtb;::]~(delete x from symtb),'([]x_lexi:0 1 1 0 0) +lexi1.transform[symti;::]~(delete x from symti),'([]x_lexi:0 1 1 0 0) +lexi1.transform[symm;`x`x2!`x`x]~([]x1:til 5;x_lexi: 0 1 1 0 0;x2_lexi:5#-1) guidList :asc 5?0Ng -symList :`b`a`d`c +symList1 :`b`a`d`c +symList2 :`e`a`d`d floatList:1.2 2 2.5 0.1 -guidReturn:`mapping`encoding!(((asc distinct guidList)!til count distinct guidList);til 5) -.ml.labelencode[guidList] ~guidReturn -.ml.labelencode[symList] ~`mapping`encoding!((`a`b`c`d!til 4);1 0 3 2) -.ml.labelencode[floatList]~`mapping`encoding!((0.1 1.2 2 2.5!til 4);1 2 3 0) - -.ml.applylabelencode[0 0 2 3 4 ;.ml.labelencode 
floatList]~(0.1;0.1;2f;2.5;0n) -.ml.applylabelencode[1 1 2 5 3 0;.ml.labelencode symList ]~`b`b`c``d`a -.ml.applylabelencode[0 0 0 1 6 ;.ml.labelencode guidList ]~(3#guidList 0),(guidList 1),`guid$0Ng -.ml.applylabelencode[0 0 2 3 4 ;.ml.labelencode[floatList]`mapping]~(0.1;0.1;2f;2.5;0n) -.ml.applylabelencode[1 1 2 5 3 0;.ml.labelencode[symList]`mapping]~`b`b`c``d`a -.ml.applylabelencode[0 0 0 1 6 ;.ml.labelencode[guidList]`mapping]~(3#guidList 0),(guidList 1),`guid$0Ng - -.ml.timesplit[timetab;::]~(delete x from timetab),'flip`x_dow`x_year`x_mm`x_dd`x_qtr`x_wd`x_hh`x_uu`x_ss!(0 1 2i;2000 2000 2000i;1 1 1i;1 2 3i;1 1 1j;001b;0 0 0i;0 0 0i;0 0 0i) -.ml.timesplit[timetab;`x]~(delete x from timetab),'flip`x_dow`x_year`x_mm`x_dd`x_qtr`x_wd`x_hh`x_uu`x_ss!(0 1 2i;2000 2000 2000i;1 1 1i;1 2 3i;1 1 1j;001b;0 0 0i;0 0 0i;0 0 0i) -.ml.timesplit[timetabn;::]~(delete x from timetabn),'flip`x_dow`x_year`x_mm`x_dd`x_qtr`x_wd`x_hh`x_uu`x_ss!(`int$(0 1 2 0n);`int$(2000 2000 2000 0n);`int$(1 1 1 0n);`int$(1 2 3 0n);"j"$(1 1 1 0n);0010b;`int$(0 0 0 0n);`int$(0 0 0 0n);`int$(0 0 0 0n)) -.ml.timesplit[symtf;::]~symtf -.ml.timesplit[symti;::]~symti -.ml.timesplit[symtb;::]~symtb +.ml.labelEncode.fit[guidList][`modelInfo] ~(asc distinct guidList)!til count distinct guidList +.ml.labelEncode.fit[symList1][`modelInfo] ~`a`b`c`d!til 4 +.ml.labelEncode.fit[floatList][`modelInfo]~0.1 1.2 2 2.5!til 4 + +label1:.ml.labelEncode.fit[symList1] +label1.transform[symList1]~1 0 3 2 +label1.transform[symList2]~-1 0 3 3 + +.ml.applyLabelEncode[0 0 2 3 4 ;.ml.labelEncode.fit floatList]~(0.1;0.1;2f;2.5;0n) +.ml.applyLabelEncode[1 1 2 5 3 0;.ml.labelEncode.fit symList1 ]~`b`b`c``d`a +.ml.applyLabelEncode[0 0 0 1 6 ;.ml.labelEncode.fit guidList ]~(3#guidList 0),(guidList 1),`guid$0Ng +.ml.applyLabelEncode[0 0 2 3 4 ;.ml.labelEncode.fit [floatList]`modelInfo]~(0.1;0.1;2f;2.5;0n) +.ml.applyLabelEncode[1 1 2 5 3 0;.ml.labelEncode.fit [symList1]`modelInfo]~`b`b`c``d`a +.ml.applyLabelEncode[0 0 0 1 6 ;.ml.labelEncode.fit [guidList]`modelInfo]~(3#guidList 0),(guidList 1),`guid$0Ng + +timesplitKeys:`x_dayOfWeek`x_year`x_month`x_day`x_quarter`x_weekday`x_hour`x_minute`x_second +.ml.timeSplit[timetab;::]~(delete x from timetab),'flip timesplitKeys!(0 1 2i;2000 2000 2000i;1 1 1i;1 2 3i;1 1 1j;001b;0 0 0i;0 0 0i;0 0 0i) +.ml.timeSplit[timetab;`x]~(delete x from timetab),'flip timesplitKeys!(0 1 2i;2000 2000 2000i;1 1 1i;1 2 3i;1 1 1j;001b;0 0 0i;0 0 0i;0 0 0i) +.ml.timeSplit[timetabn;::]~(delete x from timetabn),'flip timesplitKeys!(`int$(0 1 2 0n);`int$(2000 2000 2000 0n);`int$(1 1 1 0n);`int$(1 2 3 0n);"j"$(1 1 1 0n);0010b;`int$(0 0 0 0n);`int$(0 0 0 0n);`int$(0 0 0 0n)) +.ml.timeSplit[symtf;::]~symtf +.ml.timeSplit[symti;::]~symti +.ml.timeSplit[symtb;::]~symtb diff --git a/util/tests/utiltst.t b/util/tests/utiltst.t index a64a44d6..4482721b 100644 --- a/util/tests/utiltst.t +++ b/util/tests/utiltst.t @@ -2,7 +2,6 @@ \l util/init.q np:.p.import[`numpy] - p)import pandas as pd p)import numpy as np p)import datetime @@ -19,6 +18,13 @@ dt1:2019.01.01D01:30:00.000000000 2019.01.02D01:30:00.000000000 plaintab:([]4 5 6.;1 2 3.;-1 -2 -3.;0.4 0.5 0.6) xm:100 10#1000?100f +x:1000?1000 +xf:1000?100f + +.ml.range[til 63] ~ 62 +.ml.range[5] ~ 0 +.ml.range[0 1 3 2f]~3f +.ml.range[0 1 0n 2]~2f df :.ml.tab2df tt:([]fcol:12?1.;jcol:12?100;scol:12?`aaa`bbb`ccc) dfj:.ml.tab2df tj:select by jcol from tt @@ -45,9 +51,9 @@ tt2:([]date:2005.07.14 2005.07.15;timesp:("N"$"12:10:30.000500000";"N"$"12:13:30 .ml.arange[2.5;50.2;0.2] ~ np[`:arange][2.5;50.2;0.2]` 
.ml.arange[2f;10f;1f]~2 3 4 5 6 7 8 9f -.ml.linspace[1;10;9] ~ np[`:linspace][1;10;9]` -.ml.linspace[-0.2;109;62] ~ np[`:linspace][-0.2;109;62]` -.ml.linspace[-0.2;10.4;20] ~ np[`:linspace][-0.2;10.4;20]` +.ml.linearSpace[1;10;9] ~ np[`:linspace][1;10;9]` +.ml.linearSpace[-0.2;109;62] ~ np[`:linspace][-0.2;109;62]` +.ml.linearSpace[-0.2;10.4;20] ~ np[`:linspace][-0.2;10.4;20]` .ml.eye[3] ~ "f"$(1 0 0;0 1 0;0 0 1) first[.ml.eye[1]] ~ enlist 1f @@ -57,10 +63,10 @@ first[.ml.eye[1]] ~ enlist 1f .ml.df2tab[t]~([]fcol:0.1*1+til 5;jcol:10*1+til 5) .ml.df2tab[t2]~([]fcol:5#(::);jcol:10101b) -.ml.df2tab_tz[t3;0b;1b]~([]date:2005.07.14 2005.07.15;time:("N"$"12:10:30.000500000";"N"$"12:13:30.000200000");str:enlist each ("h";"i");ind:1.3 2.5;bool:10b) -.ml.df2tab_tz[t4;0b;1b]~([]bool:10b;date:"p"$(2005.02.25;2015.12.22);timed:(neg "N"$"05:00:00";"N"$"00:16:40")) -.ml.df2tab_tz[t5;1b;0b]~([]dt:dt1;dt_with_tz:dt1) -.ml.df2tab_tz[t5;0b;0b]~([]dt:dt1;dt_with_tz:dt1-"T"$"01:00:00") +.ml.df2tabTimezone[t3;0b;1b]~([]date:2005.07.14 2005.07.15;time:("N"$"12:10:30.000500000";"N"$"12:13:30.000200000");str:enlist each ("h";"i");ind:1.3 2.5;bool:10b) +.ml.df2tabTimezone[t4;0b;1b]~([]bool:10b;date:"p"$(2005.02.25;2015.12.22);timed:(neg "N"$"05:00:00";"N"$"00:16:40")) +.ml.df2tabTimezone[t5;1b;0b]~([]dt:dt1;dt_with_tz:dt1) +.ml.df2tabTimezone[t5;0b;0b]~([]dt:dt1;dt_with_tz:dt1-"T"$"01:00:00") tt~update`$scol from .ml.df2tab df tj~update`$scol from .ml.df2tab dfj @@ -71,10 +77,8 @@ tx~update`$scol from`scol`jcol xcol .ml.df2tab dfxj tx~update`$scol from`scol`jcol xcol .ml.df2tab dfxx \S 43 -.ml.traintestsplit[til 10;1+til 10;0.2]~`xtrain`ytrain`xtest`ytest!(2 3 7 1 6 4 9 5;3 4 8 2 7 5 10 6;0 8;1 9) +.ml.trainTestSplit[til 10;1+til 10;0.2]~`xtrain`ytrain`xtest`ytest!(2 3 7 1 6 4 9 5;3 4 8 2 7 5 10 6;0 8;1 9) \S 43 -.ml.traintestsplit["f"$til 10;1+"f"$til 10;0.2]~`xtrain`ytrain`xtest`ytest!(2 3 7 1 6 4 9 5f;3 4 8 2 7 5 10 6f;0 8f;1 9f) +.ml.trainTestSplit["f"$til 10;1+"f"$til 10;0.2]~`xtrain`ytrain`xtest`ytest!(2 3 7 1 6 4 9 5f;3 4 8 2 7 5 10 6f;0 8f;1 9f) \S 22 -.ml.traintestsplit[1010110011b;1001100011b;0.33]~`xtrain`ytrain`xtest`ytest!(110100b;111100b;1011b;0001b) - - +.ml.trainTestSplit[1010110011b;1001100011b;0.33]~`xtrain`ytrain`xtest`ytest!(110100b;111100b;1011b;0001b) diff --git a/util/util.q b/util/util.q deleted file mode 100644 index 12e98b7f..00000000 --- a/util/util.q +++ /dev/null @@ -1,72 +0,0 @@ -\d .ml - -/ values between x and y in steps of length z -arange:{x+z*til 0|ceiling(y-x)%z} -/ combinations of k elements from 0,1,...,n-1 -combs:{[n;k]flip(k-1){[n;x]j@:i:where 0<>k:n-j:1+last x;(x@\:where k),enlist -1_sums@[(1+sum k i)#1;0,sums k i;:;(j,0)-0,-1+j+k i]}[n]/enlist til n} -/ identity matrix -eye:{@[x#0.;;:;1.]each til x} -/ indexing functions -imax:{x?max x} -imin:{x?min x} -/ z evenly spaced values between x and y -linspace:{x+til[z]*(y-x)%z-1} -/ shape of matrix/table -shape:{-1_count each first scan x} -/ split into train/test sets with sz% in test -traintestsplit:{[x;y;sz]`xtrain`ytrain`xtest`ytest!raze(x;y)@\:/:(0,floor n*1-sz)_neg[n]?n:count x} - -/ q vector to numpy datetime -i.q2npdt:{.p.import[`numpy;`:array;("p"$@[4#+["d"$0];-16+type x]x)-"p"$1970.01m;"datetime64[ns]"]`.} -/ q tab to pandas dataframe -tab2df:{ - updx:@[flip 0!x;i.fndcols[x;"c"];enlist each]; - r:.p.import[`pandas;`:DataFrame;@[updx;i.fndcols[x]"pmdznuvt";i.q2npdt]][@;cols x]; - $[count k:keys x;r[`:set_index]k;r]} -/ pandas dataframe to q tab -df2tab_tz:{ - n:$[enlist[::]~x[`:index.names]`;0;x[`:index.nlevels]`]; - 
c:`$(x:$[n;x[`:reset_index][];x])[`:columns.to_numpy][]`;
- d:x[`:select_dtypes][pykwargs enlist[`exclude]!enlist`float32`datetime`datetimetz`timedelta][`:to_dict;`list]`;
- d,:dt_convert x[`:select_dtypes][`include pykw`datetime];
- d,:dt_dict[x[`:select_dtypes][`include pykw`timedelta]]+"n"$0;
- d,:tz_convert[;y]x[`:select_dtypes][`include pykw`datetimetz];
- d,:float32_convert[;y]x[`:select_dtypes][`include pykw`float32][`:to_dict;`list]`;
- / check if the first value in columns are foreign
- if[0x;"F"$string x;0.000001*floor 0.5+x*1000000]]}
-/ Convert time zone data (0b -> UTC time; 1b -> local time)
-tz_convert:{$[y~0b;dt_convert;{"P"$neg[6]_/:'x[`:astype;`str][`:to_dict;<;`list]}]x}
-/ Convert datetime/datetimetz to timestamp
-dt_convert:{
- $[count nulCols:where any each x[`:isnull;::][`:to_dict;<;`list];
- [c:`$x[`:columns.to_numpy][]`;
- null_data:"P"$x[`:drop;c except nulCols;`axis pykw 1][`:astype;`str][`:to_dict;<;`list];
- non_null_data:dt_dict x[`:drop;nulCols;`axis pykw 1];
- null_data,non_null_data+1970.01.01D0];
- dt_dict[x]+1970.01.01D0]}
-/ Convert data to integer representation and return as a dict
-dt_dict:{x[`:astype;`int64][`:to_dict;<;`list]}
-/ Convert datetime.date/time types to kdb+ date/time
-date_time_convert:{
- $[y~0b;x;
- [ fval:.p.wrap first x;
- / convert datetime.time/date to iso string format and convert to kdb+
- / otherwise return foreign
- $[i.isinstance[fval;i.dt`:time];{"N"$.p.wrap[x][`:isoformat][]`}each x;
- i.isinstance[fval;i.dt`:date];{"D"$.p.wrap[x][`:isoformat][]`}each x;
- x]]]}
-/ function defaults to return UTC timezone(y) and non converted date/times(z)
-df2tab:df2tab_tz[;0b;0b]
-
-/ apply to list, mixed list, dictionary, table, keyed table
-i.ap:{$[0=type y;x each y;98=type y;flip x each flip y;99<>type y;x y;98=type key y;key[y]!.z.s value y;x each y]}
-/ find columns of x with type in y
-i.fndcols:{m[`c]where(m:0!meta x)[`t]in y}
-/ required python utilities for df2tab
-i.isinstance:.p.import[`builtins][`:isinstance;<]
-i.dt :.p.import[`datetime]
diff --git a/util/utilities.q b/util/utilities.q
new file mode 100644
index 00000000..281ba7c2
--- /dev/null
+++ b/util/utilities.q
@@ -0,0 +1,159 @@
+// util/utilities.q - Utilities library
+// Copyright (c) 2021 Kx Systems Inc
+//
+// Includes range, arange, combs, eye, iMax, iMin,
+// linearSpace, shape, trainTestSplit, tab2df,
+// df2tabTimezone, df2tab
+
+\d .ml
+
+// @kind function
+// @category utilities
+// @desc Range of values
+// @param array {number[]} A numerical array
+// @returns {float} Range of its values
+range:{[array]
+  max[array]-min array
+  }
+
+// @kind function
+// @category utilities
+// @desc Evenly-spaced values
+// @param start {number} Start of the interval (inclusive)
+// @param end {number} End of the interval (non-inclusive)
+// @param step {number} Spacing between values
+// @return {number[]} A vector of evenly-spaced values between start and end
+// in steps of length `step`
+arange:{[start;end;step]
+  start+step*til 0|ceiling(end-start)%step
+  }
+
+// @kind function
+// @category utilities
+// @desc Unique combinations of a vector or matrix
+// @param n {int} Number of elements to draw combinations from, i.e. til n
+// @param degree {int} Number of elements in each combination
+// @return {int[]} Unique combinations of values from the data
+combs:{[n;degree]
+  flip(degree-1)i.combFunc[n]/enlist til n
+  }
+
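+// A minimal usage sketch (values illustrative): arange steps through a
+// half-open interval, while combs enumerates index combinations
+// q).ml.arange[1;10;3]
+// 1 4 7
+// q).ml.combs[3;2]
+// 0 1
+// 0 2
+// 1 2
+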
+// @kind function
+// @category utilities
+// @desc Create identity matrix
+// @param n {int} Width/height of identity matrix
+// @return {float[]} Identity matrix of height/width n
+eye:{[n]
+  @[n#0.;;:;1.]each til n
+  }
+
+// @kind function
+// @category utilities
+// @desc Index of the first occurrence of the maximum value in a list
+// @param array {number[]} Array of values
+// @return {number} The index of the maximum element of the array
+iMax:{[array]
+  array?max array
+  }
+
+// @kind function
+// @category utilities
+// @desc Index of the first occurrence of the minimum value in a list
+// @param array {number[]} Array of values
+// @return {number} The index of the minimum element of the array
+iMin:{[array]
+  array?min array
+  }
+
+// @kind function
+// @category utilities
+// @desc Create an array of evenly-spaced values
+// @param start {number} Start of the interval (inclusive)
+// @param end {number} End of the interval (inclusive)
+// @param n {int} How many spaces are to be created
+// @return {number[]} A vector of `n` evenly-spaced values between
+// start and end
+linearSpace:{[start;end;n]
+  start+til[n]*(end-start)%n-1
+  }
+
+// @kind function
+// @category utilities
+// @desc Shape of a matrix
+// @param matrix {number[]} Matrix of values
+// @return {number[]} Its shape as a list of dimensions
+shape:{[matrix]
+  -1_count each first scan matrix
+  }
+
+// @kind function
+// @category utilities
+// @desc Split data into training and test sets
+// @param data {any[]} Matrix of input values
+// @param target {any[]} A vector of target values the same count as data
+// @param size {float} Percentage size of the testing set
+// @return {dictionary} Contains the data matrix and target split into a
+// training and testing set
+trainTestSplit:{[data;target;size]
+  dictKeys:`xtrain`ytrain`xtest`ytest;
+  n:count data;
+  split:(0,floor n*1-size)_neg[n]?n;
+  dictVals:raze(data;target)@\:/:split;
+  dictKeys!dictVals
+  }
+
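+// A minimal usage sketch (values illustrative): an 80/20 split of ten
+// observations, returned as a dictionary of train and test sets
+// q)`xtrain`ytrain`xtest`ytest~key .ml.trainTestSplit[til 10;1+til 10;0.2]
+// 1b
+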
+// @kind function
+// @category utilities
+// @desc Convert a pandas dataframe containing datetime timezones and
+//   datetime objects (datetime.datetime, datetime.time) to a q table
+// @param tab {<} An embedPy representation of a Pandas dataframe
+// @param local {boolean} Indicates if timezone objects are to be converted
+//   to local time (1b) or UTC (0b)
+// @param qObj {boolean} Indicates if python datetime.date/datetime.time
+//   objects are returned as q (1b) or foreign objects (0b)
+// @returns {<} A q table
+df2tabTimezone:{[tab;local;qObj]
+  index:$[enlist[::]~tab[`:index.names]`;0;tab[`:index.nlevels]`];
+  tab:$[index;tab[`:reset_index][];tab];
+  numpyCols:`$tab[`:columns.to_numpy][]`;
+  dataArgs:enlist[`exclude]!enlist`float32`datetime`datetimetz`timedelta;
+  dict:tab[`:select_dtypes][pykwargs dataArgs][`:to_dict;`list]`;
+  dateTimeData:tab[`:select_dtypes][`include pykw`datetime];
+  dict,:i.dateConvert dateTimeData;
+  timeDeltaData:tab[`:select_dtypes][`include pykw`timedelta];
+  dict,:i.dateDict[timeDeltaData]+"n"$0;
+  timezoneData:tab[`:select_dtypes][`include pykw`datetimetz];
+  dict,:i.timezoneConvert[timezoneData;local];
+  float32Data:tab[`:select_dtypes][`include pykw`float32][`:to_dict;`list]`;
+  dict,:i.float32Convert[float32Data;local];
+  // Check if the first value in each column is foreign
+  foreign:where 112h=type each first each value dict;
+  if[0<count foreign;
+    // Convert foreign objects to an appropriate kdb+ representation
+    dict[foreign]:i.dateTimeConvert[;qObj]each dict foreign
+    ];
+  index!flip numpyCols#dict
+  }
+
+// @kind function
+// @category utilities
+// @desc Convert a pandas dataframe to a q table, defaulting to UTC
+//   timezones and non-converted datetime.date/datetime.time objects
+// @param tab {<} An embedPy representation of a Pandas dataframe
+// @returns {<} A q table
+df2tab:df2tabTimezone[;0b;0b]

+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Generate the next degree of combinations for .ml.combs
+// @param n {int} Number of values required for combinations
+// @param vals {int[]} Indices of the combinations of the previous degree
+// @returns {int[]} Combination indices extended by one degree
+i.combFunc:{[n;vals]
+  j@:i:where 0<>k:n-j:1+last vals;
+  sumVals:-1_sums@[(1+sum k i)#1;0,sums k i;:;(j,0)-0,-1+j+k i];
+  (vals@\:where k),enlist sumVals
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Transform q object to numpy date
+// @param date {date} A q date/time value
+// @returns {<} Numpy datetime object
+i.q2npDate:{[date]
+  dateConvert:("p"$@[4#+["d"$0];-16+type date]date)-"p"$1970.01m;
+  .p.import[`numpy;`:array;dateConvert;"datetime64[ns]"]`.
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Convert python float32 values to the correct precision in kdb+.
+//   Note: the check for data~()!() is required in cases where the
+//   underlying representation is float32 for dates/times
+// @param data {float[]} Floating point data from the dataframe
+// @param local {boolean} Indicates if timezone objects are to be converted
+//   to local time (1b) or UTC (0b)
+// @returns {float[]} Python float32 objects converted to correct precision
+//   in kdb+
+i.float32Convert:{[data;local]
+  $[(local~0b)|data~()!();
+    data;
+    ?[0.000001>data;"F"$string data;0.000001*floor 0.5+data*1000000]
+  ]
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Convert datetime.timezone types to kdb+ date/time
+// @param tab {<} Contains columns with datetime timezone objects
+// @param local {boolean} Indicates if timezone objects are to be converted
+//   to local time (1b) or UTC (0b)
+// @returns {dictionary} Datetime objects converted to kdb+ date/time
+//   objects
+i.timezoneConvert:{[tab;local]
+  $[local~0b;
+    i.dateConvert tab;
+    "P"$neg[6]_/:'tab[`:astype;`str][`:to_dict;<;`list]
+  ]
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Convert datetime/datetimetz objects to kdb+ timestamps
+// @param dataFrame {<} Pandas dataframe containing datetime data
+// @returns {dictionary} Datetime objects converted to kdb+ timestamps
+i.dateConvert:{[dataFrame]
+  nullCols:where any each dataFrame[`:isnull;::][`:to_dict;<;`list];
+  $[count nullCols;
+    [npCols:`$dataFrame[`:columns.to_numpy][]`;
+     dropCols:dataFrame[`:drop;npCols except nullCols;`axis pykw 1];
+     nullData:"P"$dropCols[`:astype;`str][`:to_dict;<;`list];
+     nonNullData:i.dateDict dataFrame[`:drop;nullCols;`axis pykw 1];
+     nullData,nonNullData+1970.01.01D0
+    ];
+    i.dateDict[dataFrame]+1970.01.01D0
+  ]
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Convert datetime data to integer representation
+// @param data {<} Pandas dataframe object containing datetime or
+//   timedelta objects
+// @returns {dictionary} Datetime objects converted to integer values
+i.dateDict:{[data]
+  data[`:astype;`int64][`:to_dict;<;`list]
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Convert datetime.date/time objects to kdb+ date/time
+// @param dateTime {<} Python datetime object
+// @param qObj {boolean} Indicates if python datetime.date/datetime.time
+//   objects are returned as q (1b) or foreign objects (0b)
+// @returns {datetime|<} kdb+ date/time values or an embedPy object
+i.dateTimeConvert:{[dateTime;qObj]
+  $[qObj~0b;
+    dateTime;
+    [firstVal:.p.wrap first dateTime;
+     // Convert datetime.time/date to ISO string format and cast to kdb+,
+     // otherwise return foreign
+     $[i.isInstance[firstVal;i.dateTime`:time];
+       i.isoFormat["N"]each dateTime;
+       i.isInstance[firstVal;i.dateTime`:date];
+       i.isoFormat["D"]each dateTime;
+       dateTime
+      ]
+    ]
+  ]
+  }
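The private converters above are what tab2df and df2tab lean on. A round-trip sketch, assuming embedPy and pandas are available and the toolkit is loaded:

q)tab:([]c1:`a`b`c;c2:1.1 2.2 3.3;c3:.z.p+til 3)
q)df:.ml.tab2df tab      / q table to pandas dataframe
q).ml.df2tab df          / back again; timestamps survive the trip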
+// @desc Cast a python datetime object to a kdb+ datatype
+// @param cast {char} kdb+ type character the object is cast to
+// @param dateTime {<} Python datetime object
+// @returns {any} Python datetime object cast to a kdb+ datatype
+i.isoFormat:{[cast;dateTime]
+  cast$.p.wrap[dateTime][`:isoformat][]`
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Apply a function to data of various types
+// @param func {fn} Function to apply to the data
+// @param data {any} Data of various types
+// @returns {any} The data with the function applied
+i.ap:{[func;data]
+  $[0=type data;
+    func each data;
+    98=type data;
+    flip func each flip data;
+    99<>type data;
+    func data;
+    98=type key data;
+    key[data]!.z.s[func]value data;
+    func each data
+  ]
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Apply a function to data of various types, applying it to whole
+//   objects or rows rather than to individual elements
+// @param func {fn} Function to apply to the data
+// @param data {any} Data of various types
+// @returns {any} The data with the function applied
+i.apUpd:{[func;data]
+  $[0=type data;
+    func data;
+    98=type data;
+    func each data;
+    99<>type data;
+    func data;
+    98=type key data;
+    key[data]!.z.s[func]value data;
+    func data
+  ]
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Find columns of certain types
+// @param tab {table} Data in tabular format
+// @param char {char[]} Type of column to find
+// @returns {symbol[]} Columns containing the type being searched
+i.findCols:{[tab;char]
+  metaTab:0!meta tab;
+  metaTab[`c]where metaTab[`t]in char
+  }
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Checks if object is of a specified type
+i.isInstance:.p.import[`builtins][`:isinstance;<]
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Python datetime module
+i.dateTime:.p.import`datetime
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Python pandas dataframe module
+i.pandasDF:.p.import[`pandas]`:DataFrame
+
+// @private
+// @kind function
+// @category utilitiesUtility
+// @desc Check that the length of the endog and another parameter
+//   are equal
+// @param endog {float[]} The endogenous variable
+// @param param {number[][]|number[]} A parameter to compare the length of
+// @param paramName {string} The name of the parameter
+// @returns {::|err} Returns an error if they aren't equal
+i.checkLen:{[endog;param;paramName]
+  if[not count[endog]=count param;
+    '"The length of the endog variable and ",paramName," must be equal"
+  ]
+  }
+
+// Metric utility functions
+
+// @private
+// @kind function
+// @category metricUtility
+// @desc Exclude collinear points
+// @param x {number[]} X coordinate of true positives and false negatives
+// @param y {number[]} Y coordinate of true positives and false negatives
+// @returns {number[]} The points with collinear points excluded
+i.curvePts:{[x;y]
+  (x;y)@\:where(1b,2_differ deltas[y]%deltas x),1b
+  }
+
+// @private
+// @kind function
+// @category metricUtility
+// @desc Calculate the area under an ROC curve
+// @param x {number[]} X coordinate of true positives and false negatives
+// @param y {number[]} Y coordinate of true positives and false negatives
+// @returns {float} Area under the curve
+i.auc:{[x;y]
+  sum 1_deltas[x]*y-.5*deltas y
+  }
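i.auc is the trapezoidal rule in disguise: each term is a strip width deltas[x] times the right-hand height less half the rise. For example (a sketch; i.auc is private and called directly here only for illustration):

q)x:0 0.5 1f
q)y:0 0.75 1f
q).ml.i.auc[x;y]         / 0.5*0.375 + 0.5*0.875
0.625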
+// @private
+// @kind function
+// @category metricUtility
+// @desc Calculate the correlation matrix of a matrix
+// @param matrix {number[][]} A sample from a distribution
+// @returns {number[][]} The correlation matrix
+i.corrMatrix:{[matrix]
+  devMatrix:dev each matrix;
+  covMatrix[matrix]%devMatrix*/:devMatrix
+  }
+
+// Preproc utility functions
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Drop any constant numeric values
+// @param num {dictionary} Numerical data
+// @returns {dictionary} All keys with zero variance are removed
+i.dropConstant.num:{[num]
+  (where 0=0^var each num)_num
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Drop any constant non-numeric values
+// @param data {dictionary} Non-numerical data
+// @returns {dictionary} All keys with zero variance are removed
+i.dropConstant.other:{[data]
+  (where{all 1_(~':)x}each data)_data
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Find keys of certain types
+// @param dict {dictionary} Data stored as a dictionary
+// @param char {char[]} Type of key to find
+// @returns {symbol[]} Keys containing the type being searched
+i.findKey:{[dict;char]
+  where({.Q.t abs type x}each dict)in char
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Fill nulls with 0
+// @param data {table|number[]} Numerical data
+// @returns {table|number[]} Nulls filled with 0
+i.fillMap.zero:{[data]
+  0^data
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Fill nulls with the median value
+// @param data {table|number[]} Numerical data
+// @returns {table|number[]} Nulls filled with the median value
+i.fillMap.median:{[data]
+  med[data]^data
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Fill nulls with the average value
+// @param data {table|number[]} Numerical data
+// @returns {table|number[]} Nulls filled with the average value
+i.fillMap.mean:{[data]
+  avg[data]^data
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Fill nulls forward
+// @param data {table|number[]} Numerical data
+// @returns {table|number[]} Nulls filled forward
+i.fillMap.forward:{[data]
+  "f"$(data first where not null data)^fills data
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Fill nulls linearly with respect to a time component
+// @param time {time[]} Data containing a time component
+// @param vals {any[]} Data containing null values to be filled
+// @returns {table|number[]} Nulls filled in respect to the time component
+i.fillMap.linear:{[time;vals]
+  nullVal:null vals;
+  i:where not nullVal;
+  if[2>count i;:vals];
+  diffs:1_deltas[vals i]%deltas time i;
+  nullVal:where nullVal;
+  iBin:0|(i:-1_i)bin nullVal;
+  "f"$@[vals;nullVal;:;vals[i][iBin]+diffs[iBin]*time[nullVal]-time[i]iBin]
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Encode categorical features using one-hot encoding
+// @param data {symbol[]} Data to encode
+// @returns {dictionary} One-hot encoded representation
+i.oneHot:{[data]
+  vals:asc distinct data;
+  vals!"f"$data=/:vals
+  }
+
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Encode categorical features with the frequency of
+//   category occurrence
+// @param data {symbol[]} Data to encode
+// @returns {number[]} Frequency of occurrence of individual symbols within
+//   a column
+i.freqEncode:{[data]
+  (groupVals%sum groupVals:count each group data)data
+  }
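The two encoders return different shapes: oneHot a dictionary of indicator columns, freqEncode a single numeric vector. For example (private helpers, called directly only for illustration):

q).ml.i.oneHot`a`b`a`c      / `a`b`c!(1 0 1 0f;0 1 0 0f;0 0 0 1f)
q).ml.i.freqEncode`a`b`a`c  / `a appears twice in four entries
0.5 0.25 0.5 0.25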
+// @private
+// @kind function
+// @category preprocessingUtility
+// @desc Break date column into constituent components
+// @param date {date} Data containing a date component
+// @returns {dictionary} A date broken into its constituent components
+i.timeSplit.d:{[date]
+  dateDict:`dayOfWeek`year`month`day!`date`year`mm`dd$/:\:date;
+  update weekday:1<dayOfWeek mod 7 from dateDict
+  }

+// @private
+// @kind function
+// @category hyperparameterUtility
+// @desc Random/sobol hyperparameter set generation for .ml.rs
+// @param n {long} Number of hyperparameter sets to generate
+// @param params {dictionary} Parameters
+// @returns {table} Distinct sets of hyperparameters
+hp.i.rsGen:{[n;params]
+  // Check the trial count for sobol searches
+  if[k<>floor k:xlog[2]n;
+    '"trials must equal 2^n for sobol search"
+  ];
+  // Find numerical hyperparameter spaces
+  num:where any`uniform`loguniform=\:first each p:params`p;
+  // Set random seed
+  system"S ",string$[(::)~params`randomState;42;params`randomState];
+  // Import sobol sequence generator and check requirements
+  pySobol:.p.import[`sobol_seq;`:i4_sobol_generate;<];
+  genPts:$[`sobol~typ:params`typ;
+    enlist each flip pySobol[count num;n];
+    `random~typ;
+    n;
+    '"hyperparam type not supported"
+  ];
+  // Generate hyperparameters and take the distinct sets
+  hyperparams:distinct flip hp.i.hpGen[typ;n]each p,:num!p[num],'genPts;
+  // Warn if fewer distinct sets than requested were produced
+  if[n>dst:count hyperparams;
+    -1"Distinct hp sets less than n - returning ",string[dst]," sets."
+  ];
+  hyperparams
+  }
+
+// @private
+// @kind function
+// @category hyperparameterUtility
+// @desc Random/sobol hyperparameter generation for .ml.rs
+// @param randomType {symbol} Type of random search, denoting the namespace
+//   to use
+// @param n {long} Number of hyperparameter sets
+// @param params {dictionary} Parameters
+// @returns {any} Hyperparameters
+hp.i.hpGen:{[randomType;n;params]
+  // Split parameters
+  params:@[;0;first](0;1)_params,();
+  // Respective parameter generation
+  $[(typ:params 0)~`boolean;
+    n?0b;
+    typ in`rand`symbol;
+    n?(),params[1]0;
+    typ~`uniform;
+    hp.i.uniform[randomType]. params 1;
+    typ~`loguniform;
+    hp.i.logUniform[randomType]. params 1;
+    '"please enter a valid type"
+  ]
+  }
+
+// @private
+// @kind function
+// @category hyperparameterUtility
+// @desc Uniform number generator
+// @param randomType {symbol} Type of random search, denoting the namespace
+//   to use
+// @param low {long} Lower bound
+// @param high {long} Higher bound
+// @param paramType {char} Type of parameter, e.g. "i", "f", etc
+// @param params {number[]} Parameters
+// @returns {number[]} Uniform numbers
+hp.i.uniform:{[randomType;low;high;paramType;params]
+  if[high<low;'"lower bound must be less than higher bound"];
+  hp.i[randomType][low;high;paramType;params]
+  }
+
+// @private
+// @kind function
+// @category hyperparameterUtility
+// @desc Log-uniform number generator
+// @param randomType {symbol} Type of random search, denoting the namespace
+//   to use
+// @param low {long} Lower bound
+// @param high {long} Higher bound
+// @param paramType {char} Type of parameter, e.g. "i", "f", etc
+// @param params {number[]} Parameters
+// @returns {number[]} Log-uniform numbers
+hp.i.logUniform:xexp[10]hp.i.uniform::

- if[k<>floor k:xlog[2]n;'"trials must equal 2^n for sobol search"];
- num:where any`uniform`loguniform=\:first each p:x`p;
- system"S ",string$[(::)~x`random_state;42;x`random_state];
- pysobol:.p.import[`sobol_seq;`:i4_sobol_generate;<];
- genpts:$[`sobol~typ:x`typ;enlist each flip pysobol[count num;n];`random~typ;n;'"hyperparam type not supported"];
- prms:distinct flip hp.i.hpgen[typ;n]each p,:num!p[num],'genpts;
- if[n>dst:count prms;-1"Number of distinct hp sets less than n, returning ",string[dst]," sets."];
- prms}
-hp.i.hpgen:{[ns;n;p]
- p:@[;0;first](0;1)_p,();
- $[(typ:p 0)~`boolean;n?0b;
-  typ in`rand`symbol;n?(),p[1]0;
-  typ~`uniform;hp.i.uniform[ns]. p 1;
-  typ~`loguniform;hp.i.loguniform[ns]. p 1;
-  '"please enter a valid type"]}
-hp.i.uniform:{[ns;lo;hi;typ;p]if[hi<lo;'"lower bound must be less than higher bound"];hp.i[ns][lo;hi;typ;p]}
-hp.i.loguniform:xexp[10]hp.i.uniform::

-if[0>system"s";mproc.init[abs system"s"]enlist".ml.loadfile`:util/pickle.q"];
-xv.picklewrap:{picklewrap[(0>system"s")&.p.i.isw x]x}
\ No newline at end of file
+xv.picklewrap:{picklewrap[(0>system"s")&.p.i.isw x]x}
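A brief usage sketch of the generator above: the `boolean branch needs no numeric space, which makes it a handy smoke test, and the power-of-two restriction on sobol trials can be checked the same way the code does (outputs illustrative, assuming the toolkit is loaded):

q).ml.hp.i.hpGen[`random;8]`boolean  / 8 random flags, e.g. 01101100b
q){x=floor x}xlog[2]512              / 512 trials are valid for sobol
1b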