-
-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create a new, deduplicated mailbox with unique emails too (Documentation: What is "discarded"?) #599
Comments
One other thing that isn't clear from the documentation: If two items tie, e.g. have the same datestamp, is a tiebreak made. This would be logical, but a strict reading of the documentation would be that BOTH emails are selected. Meaning, if 1A and 1B have identical timestamps, are BOTH selected and acted upon? Or just one, for actions that typically select one message. |
Just to followup, I still could not determine the behavior. I used GPT4 and plugged in each file, trying to see if I could determine code that would answer my question. However, I was unable to determine which code directly addresses the handling of unique emails in the deduplication process or the resolution of ties in duplicate selection. |
So I am constucting a toy mbox to understand the behavior, but now I am more confused than ever:
Giving:
This suggests:
Anyway, what is clear is that all emails are selected, and
|
Is your feature request related to a problem? Please describe.
I have several mailboxes, with many duplicates.
I want to create a new mailbox, with all de-duplicated mail from the old mailboxes, including non-duplicates.
Documentation confusion
I'm puzzling over the documentation, because it is not really clear what "selected" and "discarded" mean.
Let's say there are emails 1A, 1B, and 2. (1A and 1B are duplicates in different mailboxes.)
Whatever strategy I choose, 1A and 1B are compared and one is selected and the other is discarded.
But what happens to 2?
a) Has no hash matches so it is never compared, or selected, and isn't copied to my new mailbox. Then I am stuck on how to solve my problem.
b) There is always a "selected" mail, even if it is unique and has no hash matches.
Can you please clarify? (I also think a documentation update would help. I read over the main docs and didn't understand, which is why I post.)
The text was updated successfully, but these errors were encountered: