Importing mbox file into MM3
Hi All
I'm trying to import a large mbox file from a MM2 list into a new list on MM3.
The new MM3 list has 3 test emails in its archive.
# cd /usr/share/mailman3-web/
# python3 manage.py hyperkitty_import -l LISTNAME@DOMAIN /home/mark/tmp/LISTNAME.mbox Importing from mbox file /home/mark/tmp/LISTNAME.mbox to LISTNAME@DOMAIN Computing thread structure Synchronizing properties with Mailman Warming up cache The full-text search index is not updated for this list. It will not be updated by the 'minutely' incremental update job. To update the index for this list, run the Django admin command with arguments 'update_index_one_list LISTNAME@DOMAIN'.
That all seems pretty good. Then I attempt to rebuild the archive.
# python3 manage.py update_index_one_list LISTNAME@DOMAIN Indexing 3 emails
The import or rebuild has failed as it's just rebuilding the existing archive of the 3 test messages.
What am I doing wrong?
Thanks Mark
On 6/22/23 7:00 PM, Mark wrote:
Hi All
I'm trying to import a large mbox file from a MM2 list into a new list on MM3.
The new MM3 list has 3 test emails in its archive. ... The import or rebuild has failed as it's just rebuilding the existing archive of the 3 test messages.
What am I doing wrong?
Presumably, the 3 test emails are newer than the messages in the archive
.mbox. The default for hyperkitty_import is to not import messages older
than the newest message in the existing archive.
You need to specify something like --since='1970-01-01'
on the
hyperkitty_import command to import these older messages.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan
On 2023-06-23 12:19, Mark Sapiro wrote:
On 6/22/23 7:00 PM, Mark wrote:
Hi All
I'm trying to import a large mbox file from a MM2 list into a new list on MM3.
The new MM3 list has 3 test emails in its archive. ... The import or rebuild has failed as it's just rebuilding the existing archive of the 3 test messages.
What am I doing wrong?
Presumably, the 3 test emails are newer than the messages in the archive .mbox. The default for hyperkitty_import is to not import messages older than the newest message in the existing archive. You need to specify something like
--since='1970-01-01'
on the hyperkitty_import command to import these older messages.
Thanks Mark. That looks more promising. It's been chugging away now for 3 hours with only 4 errors reported so far.
# cd /usr/share/mailman3-web/ # python3 manage.py hyperkitty_import --since='1970-01-01' -l LISTNAME@DOMAIN /home/mark/tmp/LISTNAME.mbox
"Failed adding message ......@mail.yahoo.com>: A string literal cannot contain NUL (0x00) characters".
The mbox file is about 3 GB. The errors look like they're for messages that aren't kosher (maybe misplaced "From"s in replies).
I can live with a few missing messages.
Back to watching the spinning pipe. Chug chug.
Cheers Mark
participants (2)
-
Mark
-
Mark Sapiro