SparseMatrixCSC support for fixed effects #309

behinger · 2020-04-28T11:26:56Z

The only thing that does not work with sparse design matrices is the rank check of the fixed-effects matrix, so I used multiple dispatch to circumvent it.
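As a rough illustration of the dispatch idea described above (not the code in this PR): `column_rank` is a hypothetical helper name, and the sparse method is assumed to skip the check and report full column rank.

```julia
using LinearAlgebra, SparseArrays

# Hypothetical helper, not part of MixedModels.jl.
# Dense matrices: compute the numerical rank.
column_rank(X::Matrix{T}) where {T<:AbstractFloat} = rank(X)

# Sparse matrices: skip the rank check and assume full column rank.
column_rank(X::SparseMatrixCSC{T}) where {T<:AbstractFloat} = size(X, 2)

# The third column is the sum of the first two, so the true rank is 2.
Xdense = [1.0 1.0 2.0; 1.0 2.0 3.0; 1.0 3.0 4.0]
Xsparse = sparse(Xdense)

rank_dense = column_rank(Xdense)    # the check runs and sees the deficiency
rank_sparse = column_rank(Xsparse)  # no check: just the number of columns
```

The two results differ for this matrix, which is the trade-off of the workaround: with a sparse design the caller is responsible for supplying a matrix of full column rank.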

It might be worthwhile to also use rank(X) for sparse matrices, but I don't really understand the underlying motivation for using a QR decomposition for the rank check (because I don't know what pivoting does ;)).

This is probably something @palday wants to look at.

PS: Doug asked me at some point to push to MixedModels.jl what I changed for unfold.jl to get it running with sparse designs. I think this is the only thing.

palday · 2020-04-28T11:51:24Z

The Cholesky decomposition + pivoting is what allows us to work with rank-deficient fixef matrices -- basically the extra columns are moved out of the way for all the numerics and assigned a set coefficient estimate of zero and standard error of NaN.
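A small sketch of what pivoting buys here, using a column-pivoted QR from the standard library rather than the pivoted Cholesky mentioned above (the idea is the same, but this is not the MixedModels.jl implementation). The tolerance and the variable names are illustrative assumptions, and `ColumnNorm()` assumes Julia 1.7 or later.

```julia
using LinearAlgebra

# Design matrix whose third column is the sum of the first two (rank 2).
X = [1.0 1.0 2.0;
     1.0 2.0 3.0;
     1.0 3.0 4.0;
     1.0 4.0 5.0]
y = [1.0, 2.0, 2.5, 4.0]

F = qr(X, ColumnNorm())             # column-pivoted QR
d = abs.(diag(F.R))                 # pivoting orders these from large to small
tol = sqrt(eps(Float64)) * d[1]     # illustrative tolerance (an assumption)
r = count(>(tol), d)                # numerical rank

keep = sort(F.p[1:r])               # columns retained for the numerics
dropped = sort(F.p[r+1:end])        # redundant columns, pivoted to the end

beta = zeros(size(X, 2))            # dropped coefficients stay at zero
beta[keep] = X[:, keep] \ y         # least squares on the retained columns
```

The permutation `F.p` is what "moves the extra columns out of the way": the trailing entries identify the columns that add no new information, and only the leading `r` columns take part in the fit.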

palday · 2020-04-28T11:52:49Z

Can you write some tests? There are a few tweaks I'll do to the method (using parametrics instead of eltype and adding in the pivoting), but having tests to check things would be nice. :)

behinger · 2020-04-28T11:54:28Z

Sure, I will have a look at the fixef tests and write some for sparse matrices. I'm not sure yet when I will find time for it, though!

dmbates · 2020-04-29T21:31:11Z

Thanks for the PR, @behinger. It is probably not necessary to worry about the rank-deficient case for the time being.

Are you intending that the products of the form Z'X and X'X will be dense? If so, it will be straightforward to do the adjustments for rank-deficient sparse X. Even a sparse Z'X is okay if we need it. The tricky bit is working with a sparse X'X.
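To make the question concrete, here is a toy sketch of how one might inspect these products. The matrices and the `density` helper are invented for illustration and are not taken from unfold.jl or MixedModels.jl.

```julia
using LinearAlgebra, SparseArrays

# Toy sparse fixed-effects design: each row has a single nonzero entry.
X = sparse([1, 2, 3, 4], [1, 1, 2, 2], ones(4), 4, 2)
# Toy indicator matrix for two groups, standing in for Z.
Z = sparse([1, 2, 3, 4], [1, 2, 1, 2], ones(4), 4, 2)

XtX = X' * X    # the product of two sparse matrices is stored as sparse
ZtX = Z' * X

# Hypothetical helper: fraction of stored entries in a sparse matrix.
density(A) = nnz(A) / length(A)

# Explicit conversion when dense storage is wanted.
XtX_dense = Matrix(XtX)
ZtX_dense = Matrix(ZtX)
```

In this toy case `Z' * X` has every entry filled even though it is stored as a `SparseMatrixCSC`, so converting it with `Matrix` loses nothing, while `X' * X` is diagonal and genuinely sparse.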

In any case it will help to have an example to work on.

BTW, I liked your 5-part tweet on why you are enjoying using Julia.

behinger · 2020-05-01T14:46:12Z

I started coding up a (comparatively simple) example of a single-subject sparse design matrix as used in unfold.jl.

I didn't finish it for a full MixedModels example, partly due to time and partly because I did not know where best to put it in MixedModels.jl (keep it in the unit tests?). Or do you prefer a pregenerated CSV example?

Regarding your questions: in my brief tests X'X was sometimes sparse (not in this example, though). I have to check Z'X, but Julia CSV stopped working for now, so I will comment later.

behinger · 2020-05-01T16:03:41Z

Z'X seems to be dense in the cases I tested (currently it is only possible to have one random grouping variable due to limitations in MixedModels.jl).

test/matrixterm.jl

palday

I changed things to use StableRNGs because the MersenneTwister stream can (and does!) change between Julia versions.

The test looks like something straight from unfold. :) I suspect we could make it more minimal for actually testing functionality here. I would do something like:

1. Construct a standard model from the test datasets.
2. Swap out its model matrix with a sparse version of it.
3. Fit it to make sure everything works.

I'll investigate how well this actually works later. 😄

palday · 2020-09-28T22:33:51Z

@dmbates How do you feel about this? I think the missing rank-deficiency checks are fine for the sparse FeMat because if you're constructing your own design matrix instead of using the formula interface, then you should be able to make sure it has full column rank.

Update femat.jl

f9f949e

added very simple unittest. includes an example sparse array as used …

7378dc8

…in unfold.jl

palday reviewed Sep 14, 2020


palday and others added 6 commits September 28, 2020 23:37

Merge branch 'master' of github.com:JuliaStats/MixedModels.jl into pa…

303339c

…tch-1

Update test/matrixterm.jl

d837e31

Update test/matrixterm.jl

d222b6f

Update test/matrixterm.jl

4964fad

Merge branch 'patch-1' of https://github.com/behinger/MixedModels.jl …

c72af90

…into patch-1

tests

ab4dc02

palday approved these changes Sep 28, 2020

docstrings

1cb406a

dmbates merged commit b8defc7 into JuliaStats:master Sep 30, 2020

palday deleted the patch-1 branch September 30, 2020 15:44
