MLUG: Re: [MLUG - DISCUSSION] headers gone wild!
Re: [MLUG - DISCUSSION] headers gone wild!
Email address obfuscation in effect -- please click here to turn it off.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
On Tue, 1 Feb 2005, Mike Miller wrote:

> See this message...
>
> http://taxa.epi.umn.edu/bgnews/2004/msg00032.html
>
> ...which has been translated using MHonArc.  You will see a few '=20's 
> in there and all of them look like they should be spaces.  You will also 
> see some '=EA', '=E5' and '=E6' which should be ê, å and æ, 
> respectively. That shows that the '=XX' correspondes to ascii character 
> XX where XX is the hexadecimal value.


More data:

perl -pe 's/=\n// ; s/=20/ /g ; s/=EA/\352/g ; s/=E5/\345/g ; s/=E6/\346/g' msg00032.html > ~/www/test_junk.html

I used perl to translate some of the '=' junk and this is what I got:

http://taxa.epi.umn.edu/~mbmiller/test_junk.html

Which seems to be properly repaired.

Note that when I had an '=' at the end of a line, I removed both the '=' 
and the newline.  I converted '=20' to a single space, and I converted the 
'=XX' to the appropriate octal from the extended ascii set.

Another interesting fact that you should note, Michael, is that MHonArc 
(written in perl and GPL'd) seems usually to handle some of this weirdness 
correctly.  There are many other messages in my BGnews archive that are 
full of '=' and '=20' in the e-mail text (seen if I 'less' the mbox), but 
they are translated perfectly in the MHonArc archive so that it is 
perfectly readable.  So there must be some good documentation out there 
for this craziness.

Mike
_______________________________________________
discussion mailing list
EMAIL:PROTECTED
http://mlug.missouri.edu/mailman/listinfo/discussion