Paul Tomblin writes:
On Sun, Mar 1, 2026, at 5:43 AM, Stephen J. Turnbull wrote:
Paul Tomblin via Mailman-users writes:
Full-text indexing is a pretty slow operation, and it's I/O bound. If you've migrated a substantial archive, that will be a full-archive
Is 431,000+ messages a substantial archive?
I would think that would be measured in terms of hours, not days, unless there are a substantial proportion of messages that are like all of Congress's annual appropriations bill.
Of course in terms of how long it would be pinning I/O the important unit is "terabytes", not "messages".
after the current message gets indexed). But if you didn't manually trigger a full archive index at the initial migration, the first
I did a "update_index_one_list" on each archive after I brought it into hyperkitty. I was impressed with the full text search.
Yes, Xapian is quite good for my purposes. Especially compared to "Whoosh" for performance. I can't say about the other backends supported by Django Haystack, I'd like to try them as well.
-- GNU Mailman consultant (installation, migration, customization) Sirius Open Source https://www.siriusopensource.com/ Software systems consulting in Europe, North America, and Japan