From: Chris Nehren on
On 2010-06-16, Ben Bacarisse scribbled these curious markings:
> Tuxedo <tuxedo(a)mailinator.com> writes:
>
>> John Kelly wrote:
>>
>> [...]
>>
>>> It is. You could read the Mozilla source and try to hack out your own
>>> solution. But as for shell related solutions, I think we have reached
>>> the point of diminishing returns.
>>
>> Yes, I agree, it's now off-topic here. I will search for relevant Mozilla
>> groups/forums/docs.
>
> Just in case you missed the suggestion a while back... have you tried
> the file command? It may be able to tell you the format if it is some
> standard compressed file.

It's very likely not, as jwz is too pleased to rant^Wtell you:
http://www.jwz.org/doc/mailsum.html (or at least it wasn't a standard
format in the iterations he discusses there)

Though perhaps searching the Mozilla community sites will yield an
extension or addon that can massage the data into something resembling a
sane (inasmuch as email can be sane) format.


--
Thanks and best regards,
Chris Nehren
Unless noted, all content I post is CC-BY-SA.
From: Tuxedo on
Chris Nehren wrote:

[...]

> It's very likely not, as jwz is too pleased to rant^Wtell you:
> http://www.jwz.org/doc/mailsum.html (or at least it wasn't a standard
> format in the iterations he discusses there)
>
> Though perhaps searching the Mozilla community sites will yield an
> extension or addon that can massage the data into something resembling a
> sane (inasmuch as email can be sane) format.

Thanks for the jwz link. I searched a couple of Mozilla related forums but
for some reason links from for example:
http://kb.mozillazine.org/Thunderbird_:_Tips_:_Compacting_Folders to:
http://kb.mozillazine.org/Compacting_folders#Compacting_does_not_seem_to_work
do not load.

I think I'll put this whole affair down to bad software design by Mozilla
developers for not clearly communicating the growing mailbox risk to
end-users as part of the actual user interface, combined with user-error of
the person having lost years of data that is now probably irretrievable on
anything less than a small super-computer and/or by custom programming
solutions. That said, I've learned some interesting stuff in this thread :-)

Tuxedo


From: Ben Bacarisse on
Tuxedo <tuxedo(a)mailinator.com> writes:
<snip>
> I didn't miss it, there always seems to exist a new and interesting command
> to try. Unfortunately the output is a bit vague on this particular file:
>
> $ file MyBigCrapBox
> MyBigCrapBox: data

OK, that just means file does not know this format. It was worth a try.

<snip>
--
Ben.
From: Christian Brabandt on
On 2010-06-15, Tuxedo <tuxedo(a)mailinator.com> wrote:
> I thereafter tested to copy 100 bytes of the beginning of the huge file:
> dd count=1 bs=100 if=myBigCrapBox of=myBigCrapBox.1
>
> But thereby I realise there must be something wrong with the huge mbox
> file. The resulting file, myBigCrapBox.1, should be the first 100 bytes
> ASCII but it all appears to be binary data, or nothing at all; one editor
> (Nedit) just shows the file with a long line of <nul><nul><nul>, while the
> filesize is exactly 100 bytes.

Could it be, by any chance, that the file was opened for writing by
mozilla and the system crashed. So that all opened files that have not
been flushed to disk have been nullified?

In any case, it look at the other end of the file and check whether
there is somewhere a valid mail file. You could also try mutt -f
big_brocken_file as well to see if mutt can read it. In my experience,
mutt's mbox parser works quite well and I have been able to restore
broken mbox files using mutt (that contained junk in between, because it
was a original deleted file).

Any way, to come back to your original question, to split a huge mbox
file, I'd try archivemail.


regards,
Christian
From: Tuxedo on
Christian Brabandt wrote:

[...]

> Could it be, by any chance, that the file was opened for writing by
> mozilla and the system crashed. So that all opened files that have not
> been flushed to disk have been nullified?

That may very well be the case as the system probably crashed with the mail
application open. But I don't know for sure what happened and the user
probably doesn't either.

> In any case, it look at the other end of the file and check whether
> there is somewhere a valid mail file. You could also try mutt -f
> big_brocken_file as well to see if mutt can read it. In my experience,
> mutt's mbox parser works quite well and I have been able to restore
> broken mbox files using mutt (that contained junk in between, because it
> was a original deleted file).

$ mutt -f MyBigCrapBox
MyBigCrapBox: Value too large for defined data type (errno = 75)

> Any way, to come back to your original question, to split a huge mbox
> file, I'd try archivemail.

I don't have archivemail on my system but maybe I'll give it a try,
although dd appars to be the ideal solution for dealing with extra large
files.

Thanks,
Tuxedo