I see the following log entries in mailmansuite.log:
INFO 2020-11-28 00:00:52,242 29142 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-11-28 00:01:00,600 29142 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000 INFO 2020-11-29 00:00:52,135 15513 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-11-29 00:01:00,192 15513 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000 INFO 2020-11-30 00:00:52,484 22622 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-11-30 00:01:00,758 22622 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000 INFO 2020-12-01 00:00:50,246 11946 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-12-01 00:00:59,688 11946 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000
I assume this is a result of importing archives from a MM2 mbox file. This happen months ago however. Why are these entries still showing up?
-- Brian Carpenter Harmonylists.com Emwd.com
On 12/1/20 5:16 PM, Brian Carpenter wrote:
I see the following log entries in mailmansuite.log:
INFO 2020-11-28 00:00:52,242 29142 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-11-28 00:01:00,600 29142 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000 INFO 2020-11-29 00:00:52,135 15513 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-11-29 00:01:00,192 15513 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000 INFO 2020-11-30 00:00:52,484 22622 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-11-30 00:01:00,758 22622 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000 INFO 2020-12-01 00:00:50,246 11946 hyperkitty.lib.mailman 2117 emails left to refresh, checked 1000 INFO 2020-12-01 00:00:59,688 11946 hyperkitty.lib.mailman 2117 emails left to refresh, checked 2000
I assume this is a result of importing archives from a MM2 mbox file. This happen months ago however. Why are these entries still showing up?
Those come from the HyperKitty sync_mailman job which is run by hyperkitty_import if you don't specify --no-sync-mailman, but is also one of the Django daily jobs.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan
On 12/1/20 8:22 PM, Mark Sapiro wrote:
Those come from the HyperKitty sync_mailman job which is run by hyperkitty_import if you don't specify --no-sync-mailman, but is also one of the Django daily jobs.
Ok but those number are the same for thousands of entries and apparently are being logged for weeks. That can't be good. How does one fix this?
-- Brian Carpenter Harmonylists.com Emwd.com
On 12/1/20 5:31 PM, Brian Carpenter wrote:
On 12/1/20 8:22 PM, Mark Sapiro wrote:
Those come from the HyperKitty sync_mailman job which is run by hyperkitty_import if you don't specify --no-sync-mailman, but is also one of the Django daily jobs.
Ok but those number are the same for thousands of entries and apparently are being logged for weeks. That can't be good. How does one fix this?
The sync_mailman job does two things. The first thing it does is for all lists known to HyperKitty, it gets the list attributes "display_name", "description", "subject_prefix", "archive_policy", "created_at" and "list_id" from Mailman core and updates the list properties in Hyperkitty.
Then it gets every message sender known to Hyperkitty for which HK's sender.mailman_id is null, and, 1000 senders at a time, it tries to get and set the core user_id for the address to the sender's mailman_id.
What happens when you import old archives, you archive a lot of posts with senders that are no longer list members and are not known to core, and every time sync_mailman runs, it tries again to get those core user_ids.
You definitely want the first part to run to pick up changes like a change in archive_policy or a change in subject_prefix (to be able to strip it from Subject: headers.
The second step could also find a newly registered core user, but is less important. Whether it is bad is a matter of opinion. At worst, it is a waste of some processing time, and at best it will pick up new users if there are any. "Fixing" it requires modification of the sync_with_mailman function in hyperkitty/libs/mailman.py.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan
participants (2)
-
Brian Carpenter
-
Mark Sapiro