Investigate whether dispersion should be fixed or estimated from the data #255

jr-leary7 · 2024-10-17T16:31:11Z

related to Speed up GEE mode #241, as not estimating scale probably increases speed by reducing the number of scoring iterations
in extreme cases, very high dispersion values lead to wildly inflated standard errors and thus deflated test statistics / weird plots
dispersion is currently estimated via:

$$ \hat{\phi} = \left(-p + \sum_{i=1}^n n_i\right)^{-1} \sum_{i=1}^n \sum_{t=1}^{n_i} \hat{r}_{it}^2 $$

where $\hat{r}_{it}$ is the estimated residual for subject $i$ at timepoint $t$

… to #255

jr-leary7 · 2024-10-21T17:04:29Z

commit 990bee8 removed the gee.scale.fix argument in favor of strictly fixing dispersion to be equal to 1 throughout. this decision was based on benchmarking performed on simulated data, in which we saw that fixing the scale lead to slightly better dynamic gene classification as well as lower runtimes

jr-leary7 added enhancement New feature or request GEE related to the GEE model backend labels Oct 17, 2024

jr-leary7 added a commit that referenced this issue Oct 17, 2024

added gee.scale.fix argument to testDynamic() and marge2() -- related…

8d7b7be

… to #255

jr-leary7 closed this as completed Oct 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Investigate whether dispersion should be fixed or estimated from the data #255

Investigate whether dispersion should be fixed or estimated from the data #255

jr-leary7 commented Oct 17, 2024

jr-leary7 commented Oct 21, 2024

Uh oh!

Investigate whether dispersion should be fixed or estimated from the data #255

Investigate whether dispersion should be fixed or estimated from the data #255

Comments

jr-leary7 commented Oct 17, 2024

jr-leary7 commented Oct 21, 2024

Uh oh!