On 12/5/24 2:02 PM, Mark Sapiro wrote:
Message-ID is not one of the fields indexed. These are defined at https://gitlab.com/mailman/hyperkitty/-/blob/master/hyperkitty/search_indexe...
Looks like it would trivial to add there. Think anyone would be opposed to adding support for this?
A bit obscure for laypersons, but one can compute the message_id_hash from the Message-ID
from base64 import b32encode from hashlib import sha1 def get_message_id_hash(msg_id): return b32encode(sha1(msg_id).digest()).decode()
This is great to know. I assume the thread ID is generated the same way?
I can't help but wonder though how one has the Message-ID without having the message. I'm wondering in general how often if ever the layperson would be searching the archive for a Message-ID.
Well I've been working with an import of 34 years of mail archives (200k messages or so), and we have several people spot checking what we have and fixing threading based on their archive. Since they have the message already, it would be easy to search for the message ID for them. The older archives predate RFC2822 by a decade or so and in-reply-to is missing or just wrong on quite a few of them. From what I can see it was migrated from LISTSERV to majordomo in 1993, then to mailman2 in 2008, and what I have for an archive was from the mail to NNTP spool, at least for the pre-1993 list.
Thanks,
Bryan Fields
727-409-1194 - Voice http://bryanfields.net