[Question] How to sample an event time delta range within [0, 10,000] #55

SiriusHou · 2025-02-03T00:42:30Z

In your example data, time_since_last_event is always within the range [0, 10]. If my sampled time_since_last_event can range from [0, 10,000], can you guide me on how to sample it?

The text was updated successfully, but these errors were encountered:

iLampard · 2025-02-03T03:11:09Z

Hi， this is a 'dtime_max' in the thinning algo params that determine the range of the

model_config:
.....

    thinning:

      .....

      dtime_max: 5.    <-------------------- HERE
....

SiriusHou · 2025-02-03T03:46:17Z

Thank you for your answer. In fact, I adjusted this dtime_max but it didn't help.

SiriusHou · 2025-02-03T03:49:35Z

Even after adjusting dtime_max in reproducing retweet results #49, the event type prediction accuracy remains lower than when normalizing the data delta time to the range [0, 10].

iLampard · 2025-02-03T04:37:34Z

let me have a look

SiriusHou · 2025-02-03T05:13:03Z

I'm not sure I understand your code correctly. Here I found you used pad_token_id to pad time_delta_sequence. Suppose we have 10 event types, but the delta time can be as large as 100. Should we use 100 to pad the time_delta_sequence?

iLampard · 2025-02-05T05:09:01Z

I'm not sure I understand your code correctly. Here I found you used pad_token_id to pad time_delta_sequence. Suppose we have 10 event types, but the delta time can be as large as 100. Should we use 100 to pad the time_delta_sequence?

Hi,

The perfect case is indeed to use a different pad token for time_delta_sequence.

The current implementation of using type pad token is a simple workaround. When computing loss, we use masks from type_sequence to eliminate padded events, and therefore, the pad tokens for time_delta_sequence are not used.

see https://github.com/ant-research/EasyTemporalPointProcess/blob/main/easy_tpp/model/torch_model/torch_basemodel.py#L110

Another reason is there is not a straightforward way to determine the pad token id for time sequences. One way is to compute the statistics of the time delta sequences and then choose a large number. But this causes computations and not very friendly for users.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] How to sample an event time delta range within [0, 10,000] #55

[Question] How to sample an event time delta range within [0, 10,000] #55

SiriusHou commented Feb 3, 2025

iLampard commented Feb 3, 2025

SiriusHou commented Feb 3, 2025

SiriusHou commented Feb 3, 2025

iLampard commented Feb 3, 2025

SiriusHou commented Feb 3, 2025 •

edited

Loading

iLampard commented Feb 5, 2025

[Question] How to sample an event time delta range within [0, 10,000] #55

[Question] How to sample an event time delta range within [0, 10,000] #55

Comments

SiriusHou commented Feb 3, 2025

iLampard commented Feb 3, 2025

SiriusHou commented Feb 3, 2025

SiriusHou commented Feb 3, 2025

iLampard commented Feb 3, 2025

SiriusHou commented Feb 3, 2025 • edited Loading

iLampard commented Feb 5, 2025

SiriusHou commented Feb 3, 2025 •

edited

Loading