Describing our approach to modeling an arbitrary infectiousness distribution #19

Merged: 13 commits merged into main from the ckk_docs_arbitrary_infectiousness branch on Dec 10, 2024.

Conversation

ChiragKumar9 (Collaborator):
This PR adds just a README that describes an approach for modeling an arbitrary infectious period using order statistics. The focus of this document is explaining the need for this particular approach and briefly describing the math, not code.

Any and all feedback is welcome. I would particularly welcome feedback on the explanations of rejection sampling. Please also let me know which sections you think need to be fleshed out further.

This PR will not be merged until all relevant parties have had time to review.

Cargo build/test will fail because the version of our code on main does not compile against the latest updates to ixa, in particular the need for `IxaError` in `define_global_properties!` now that it accepts a validator. However, there is a PR in place to fix that, and I can make a dummy commit to rerun the tests once that PR is merged.

Looking forward to hearing thoughts!

ChiragKumar9 changed the title from "Describing our approach to modeling an arbitrary infectious period" to "Describing our approach to modeling an arbitrary infectiousness distribution" on Nov 29, 2024.

> If we had pre-scheduled all infections, we would have had to store the plan IDs for all the infections in
> `HashMap<PersonId, Vec<PlanId>>`. Then, each time one of these plans executed, we would have had to remove
> it from the vector, so that the entry for each `PersonId` tells us the plans we have _left_ for a given
> person. Then, when the agent dies, we could iterate through the remaining plans and cancel them.

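For concreteness, a minimal sketch (in Rust) of the bookkeeping the quoted passage describes. The types and method names here are hypothetical stand-ins, not the model's actual API:

```rust
use std::collections::HashMap;

// Hypothetical stand-ins for the simulation's identifier types.
#[derive(Clone, Copy, PartialEq, Eq, Hash)]
struct PersonId(usize);
#[derive(Clone, Copy, PartialEq)]
struct PlanId(usize);

/// Per-person record of every pre-scheduled infection attempt, kept so the
/// remaining plans can be cancelled if the person dies mid-course.
#[derive(Default)]
struct ScheduledInfections {
    plans: HashMap<PersonId, Vec<PlanId>>,
}

impl ScheduledInfections {
    /// Remember a newly scheduled infection attempt for this person.
    fn record(&mut self, person: PersonId, plan: PlanId) {
        self.plans.entry(person).or_default().push(plan);
    }

    /// Called when a plan executes: drop it so only the plans *left* remain.
    fn mark_executed(&mut self, person: PersonId, plan: PlanId) {
        if let Some(remaining) = self.plans.get_mut(&person) {
            remaining.retain(|p| *p != plan);
        }
    }

    /// Called when the person dies: hand back the plans still left to cancel.
    fn take_remaining(&mut self, person: PersonId) -> Vec<PlanId> {
        self.plans.remove(&person).unwrap_or_default()
    }
}
```
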
Collaborator:

I don't think that this changes your basic argument, but this isn't the only implementation choice.

Instead, you can store the times of the upcoming infection attempts, rather than holding them as plans, and just plan the next one. This removes the need to cancel, and if you keep the times in a sorted list, you also don't need to iterate over them to find the next one.

As for the storage, that depends very much on the data structure. For instance, Person Properties are stored as a `HashMap` of `Vec`s of `PersonProperty`, so you already need an item for each person whether they have scheduled plans or not; the only real additional cost is the `Vec` of times itself.
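
A sketch of the alternative this comment describes, again with hypothetical types: keep only the future attempt times, sorted ascending, and schedule just the earliest one, so nothing ever needs cancelling.

```rust
use std::collections::HashMap;

#[derive(Clone, Copy, PartialEq, Eq, Hash)]
struct PersonId(usize);

/// Future infection-attempt times per person, kept sorted ascending.
/// Only the earliest time is ever turned into a plan; when it fires, the
/// next time is scheduled, and a death simply drops the person's entry.
#[derive(Default)]
struct PendingAttemptTimes {
    times: HashMap<PersonId, Vec<f64>>,
}

impl PendingAttemptTimes {
    /// Insert a new attempt time, keeping the per-person vector sorted.
    fn insert(&mut self, person: PersonId, t: f64) {
        let v = self.times.entry(person).or_default();
        let idx = v.partition_point(|&x| x < t);
        v.insert(idx, t);
    }

    /// The next attempt time to schedule, if any.
    fn next_time(&self, person: PersonId) -> Option<f64> {
        self.times.get(&person).and_then(|v| v.first().copied())
    }

    /// Remove the attempt that just ran so the following one can be scheduled.
    fn pop_next(&mut self, person: PersonId) -> Option<f64> {
        let v = self.times.get_mut(&person)?;
        if v.is_empty() { None } else { Some(v.remove(0)) }
    }
}
```

Dropping a person's entry from `times` when they die discards all of their remaining attempts in one step, with no plans to cancel.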

Collaborator (author):

Thanks! I had not thought of this approach and appreciate the insight; I am now revising this section to drive home the main points concisely.

ChiragKumar9 force-pushed the ckk_docs_arbitrary_infectiousness branch from 7b035d7 to 1e3d5ea on December 4, 2024.
@confunguido (Collaborator) left a comment:

It looks great! I think we can improve conciseness a bit more if we focus more on what's implemented in our model.

> distribution, $\mathcal{U}(x_{(1)}, 1)$, from which we need to draw an infection attempt. Because this is a new distribution,
> we want the first of $n - 1$ infection attempt times on this distribution. We can do that by drawing the
> minimum of $n - 1$ infection attempts from $\mathcal{U}(0, 1)$, and scaling that value to be on $(x_{(1)}, 1)$.
> In other words, we are using a trick where we shrink the available uniform distribution with each infection

Collaborator:

I would delete this "we are using a trick where", and just say what we are doing.
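
A minimal sketch of the sampling step the quoted passage describes, assuming the `rand` crate; the function names are illustrative, not from the model:

```rust
use rand::Rng;

/// Draw the minimum of `remaining` i.i.d. Uniform(0, 1) samples by inverse
/// transform: the minimum has CDF F(x) = 1 - (1 - x)^remaining.
fn draw_min_of_uniforms<R: Rng>(rng: &mut R, remaining: u32) -> f64 {
    debug_assert!(remaining >= 1, "at least one attempt must remain");
    let u: f64 = rng.gen();
    1.0 - (1.0 - u).powf(1.0 / remaining as f64)
}

/// Next attempt time on (0, 1): draw the minimum of the remaining attempts
/// and rescale it onto the interval (previous_attempt, 1).
fn next_attempt_time<R: Rng>(rng: &mut R, previous_attempt: f64, remaining: u32) -> f64 {
    let m = draw_min_of_uniforms(rng, remaining);
    previous_attempt + m * (1.0 - previous_attempt)
}
```

Calling `next_attempt_time` repeatedly, decrementing `remaining` by one each time, yields the ordered attempt times on $(0, 1)$.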

> This is the CDF for a Beta distribution with $\alpha = 1$ and $\beta = n$. More generally, the distribution
> of the $k$th infection attempt from $n$ total infection attempts is $\mathrm{Beta}(k, n - k + 1)$.
>
> However, we cannot just independently sample from these Beta distributions. Instead, we must update the distributions

Collaborator:

This paragraph is a bit confusing. Could you try to make it a bit more concise?
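
For reference, the standard order-statistic results the quoted passage relies on, with $X_{(k)}$ the $k$th smallest of $n$ i.i.d. $\mathcal{U}(0, 1)$ draws (these are textbook facts, not text from the README):

$$
P\bigl(X_{(1)} \le x\bigr) = 1 - (1 - x)^{n}, \qquad \text{so } X_{(1)} \sim \mathrm{Beta}(1,\ n),
$$

$$
X_{(k)} \sim \mathrm{Beta}(k,\ n - k + 1) \quad \text{in general.}
$$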


> The result of passing the uniform time through the GI's inverse CDF is the time _since_ the agent first became
> infectious at which the given $n$th infection attempt occurs. To determine the amount of time _elapsed_ until the next
> infection attempt, given that the agent is currently at their $n-1$th infection attempt, schedule the next infection

Collaborator:

should this be $(n-1)$th for the rendering?
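
A compact sketch of the elapsed-time calculation described in the quoted passage; `gi_inverse_cdf` is a hypothetical placeholder for the generation-interval distribution's inverse CDF, shown here as an exponential with a mean of 5 days purely for illustration:

```rust
/// Hypothetical inverse CDF of the generation-interval distribution:
/// maps a uniform attempt time on (0, 1) to an absolute time (in days)
/// since the agent first became infectious.
fn gi_inverse_cdf(u: f64) -> f64 {
    // Placeholder only: an exponential GI with a mean of 5 days.
    -5.0 * (1.0 - u).ln()
}

/// Delay to wait before the next infection attempt, given the absolute
/// time (since infectiousness onset) of the attempt that just occurred.
fn delay_until_next_attempt(u_next: f64, time_of_current_attempt: f64) -> f64 {
    let absolute_time_of_next = gi_inverse_cdf(u_next);
    absolute_time_of_next - time_of_current_attempt
}
```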


> ### Changes in the number of infection attempts in the middle of an agent's infectious course
>
> Imagine an agent dies while they are still infectious. Clearly, they cannot be infecting others. (Or, if the

Collaborator:

Clearly they cannot infect others.

> part way through an infection course. Sequentially scheduling the attempts makes it possible to accommodate
> changes to the number of infection attempts that may happen in the middle of an infection course.
>
> Why not just check whether the agent is alive or not at the beginning of the infection attempt? If they are

Collaborator:

I honestly think we could remove this entire paragraph for conciseness.

> distribution. Note that $a(t)$ and $g(t)$ must be on an absolute scale in this example and not scaled to have
> a unit integral. In the case where they are scaled, $g(t)$ can be rescaled to be $Mg(t)$ where $M = \max a(t)$.
>
> This general idea of rejection sampling is useful for other applications. Consider the case

Collaborator:

Not quite sure what the purpose of this paragraph is. I think the previous paragraph drives the message home just fine.
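
A generic sketch of the rejection-sampling idea discussed above, assuming the `rand` crate. For simplicity it uses a flat (uniform) proposal with a constant bound `max_target` in place of the $Mg(t)$ envelope the README describes:

```rust
use rand::Rng;

/// Rejection sampling on [0, t_max] with a flat proposal: draw a candidate
/// time t uniformly, draw a uniform height under the constant bound
/// `max_target` (which must satisfy max_target >= target(t) for all t),
/// and keep t only if the height falls under the target curve. Accepted
/// draws have density proportional to `target`.
fn rejection_sample<R: Rng>(
    rng: &mut R,
    t_max: f64,
    max_target: f64,
    target: impl Fn(f64) -> f64,
) -> f64 {
    loop {
        let t = rng.gen::<f64>() * t_max;          // proposed time
        let height = rng.gen::<f64>() * max_target; // uniform height under the bound
        if height <= target(t) {
            return t; // accepted
        }
        // otherwise reject and propose again
    }
}
```

With a flat proposal, many candidates are rejected when the target is sharply peaked, which is the inefficiency the next quoted passage addresses by fitting the proposal more closely to the target.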

> particularly inefficient because we are rejecting the majority of samples. Instead, we may try making our proposal
> distribution better fit our underlying distribution. We may make $s(t)$ a similar linear approximation for $g(t)$.
>
> However, this approximation is only possible if we sequentially sample infection attempts. If we sample all

Collaborator:

I think "However," isn't necessary here.

@confunguido (Collaborator) left a comment:

lgtm

ChiragKumar9 merged commit b804259 into main on Dec 10, 2024 (3 checks passed).
ChiragKumar9 deleted the ckk_docs_arbitrary_infectiousness branch on December 10, 2024.