PHP Classes

Not able to read utf-8 message

Recommend this page to a friend!

      PHP MIME Email Message Parser  >  All threads  >  Not able to read utf-8 message  >  (Un) Subscribe thread alerts  
Subject:Not able to read utf-8 message
Summary:Not able to read utf-8 message
Messages:6
Author:humble.techy
Date:2008-10-09 01:51:51
Update:2008-10-10 10:51:48
 

  1. Not able to read utf-8 message   Reply   Report abuse  
Picture of humble.techy humble.techy - 2008-10-09 01:51:51
Hi,

Thanks for the script and actively maintaining it with good support.

I tried your class with a utf-8 message, but it is failing. Please find the attached message.

=======================================
Fromx: [email protected] Oct 07 19:08:48 2008
Return-Path: <useremail@...>
X-Sender: useremail@...
X-Apparently-To: [email protected]
X-Received: (qmail 55170 invoked from network); 8 Oct 2008 02:08:46 -0000
X-Received: from unknown (66.218.67.96)
by m57.grp.scd.yahoo.com with QMQP; 8 Oct 2008 02:08:46 -0000
X-Received: from unknown (HELO ti-out-0910.google.com) (209.85.142.188)
by mta17.grp.scd.yahoo.com with SMTP; 8 Oct 2008 02:08:46 -0000
X-Received: by ti-out-0910.google.com with SMTP id j3so1985354tid.8
for <[email protected]>; Tue, 07 Oct 2008 19:08:45 -0700 (PDT)
X-Received: by 10.110.49.2 with SMTP id w2mr1212417tiw.56.1223431725635;
Tue, 07 Oct 2008 19:08:45 -0700 (PDT)
X-Received: by 10.110.105.1 with HTTP; Tue, 7 Oct 2008 19:08:45 -0700 (PDT)
Message-ID: <96ab1a490810071908r2f544e6fh2661d0969e5f06a1@...>
Date: Wed, 8 Oct 2008 10:08:45 +0800
To: [email protected]
In-Reply-To: <96ef192be15ad1dd961b8b8e05141603@...>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: base64
Content-Disposition: inline
References: <96ef192be15ad1dd961b8b8e05141603@...>
X-Google-Sender-Auth: 04ed20463299c011
X-Originating-IP: 209.85.142.188
X-eGroups-Msg-Info: 1:12:0:0:0
X-eGroups-From: "user" <user@...>
From: "user" <user@...>
Subject: =?UTF-8?Q?Will_anybody_throw_a_beautiful_harp_o?= =?UTF-8?Q?ut_on_the_street_|=E0=AE=A8?= =?UTF-8?Q?=E0=AE=B2=E0=AF=8D=E0=AE=B2=E0=AE=A4?= =?UTF-8?Q?=E0=AF=8B=E0=AE=B0=E0=AF=8D_=E0=AE=B5?= =?UTF-8?Q?=E0=AF=80=E0=AE=A3=E0=AF=88_=E0=AE=9A?= =?UTF-8?Q?=E0=AF=86=E0=AE=AF=E0=AF=8D=E0=AE=A4=E0=AF=81?=
X-Yahoo-Group-Post: member; u=332924112; y=_ECl9v2I_cpOwB1cJdWrJr2GFuWxBQTdrppK3e-cF9GXXcQRptw
X-Yahoo-Profile: user

V2lsbCBhbnlib2R5IHRocm93IGEgYmVhdXRpZnVsIGhhcnAgb3V0IG9uIHRoZSBzdHJlZXQg
fOCuqOCusuCvjeCusuCupOCvi+CusOCvjSAg4K614K+A4K6j4K+IIOCumuCvhuCur+CvjeCu
pOCvgSAnCgpUaGUgcHJldmlvdXMgcG9zdCBvbiB0aGUgY2hvbGEgYnJvbnplIGluIHRoZSBT
aW5nYXBvcmUgbXVzZXVtLCBldmluY2VkIGxvdCBvZgppbnRlcmVzdC4gTG90cyBvZiBmcmVp
bmRzIHdlcmUgZWFnZXIgdG8ga25vdyBhYm91dCBob3cgdGhlIHN0YXR1ZSBnb3QgdGhlcmUg
ZXRjLgpJdHMgdHJ1ZSB0aGF0IG1vc3QgTXVzZXVtcyBhcm91bmQgdGhlIGdsb2JlIGhhdmUg
Y2hvbGEgYnJvbnplcyBpbiB0aGVpcgpjb2xsZWN0aW9ucy4gSXRzIGEgdHJpYnV0ZSB0byB0
aGUgbWFzdGVyIGNyYWZ0c21lbiBmb3IgdGhlaXIgZW5kdXJhbmNlIGFuZCBhbHNvClsuLi5d
CgpZb3UgbWF5IHZpZXcgdGhlIGxhdGVzdCBwb3N0IGF0Cmh0dHA6Ly93d3cucG9ldHJ5aW5z
dG9uZS5pbi8yMDA4LzEwLzA4L3dpbGwtYW55Ym9keS10aHJvdy1hLWJlYXV0aWZ1bC1oYXJw
LW91dC1vbi10aGUtc3RyZWV0Lmh0bWwKCgrgrprgrr/grpngr43grpXgr4gg4K6F4K6w4K+B
4K6Z4K+N4K6V4K6+4K6f4K+N4K6a4K6/4K6v4K6V4K6k4K+N4K6k4K6/4K6y4K+NIOCuh+Cu
sOCvgeCuqOCvjeCupCDgrprgr4vgrrQg4K614K+G4K6j4K+N4K6V4K6yIOCumuCuv+CusuCv
iOCur+CviCDgrqrgrrHgr43grrHgrr/grq8g4K6u4K6f4K6y4K+B4K6V4K+N4K6V4K+BCuCu
quCusuCusOCvgeCuruCvjSDgrq7grrHgr4Hgrq7gr4rgrrTgrr8g4K6V4K+C4K6x4K+B4K6V
4K+I4K6v4K6/4K6y4K+NLCDgrofgrqjgr43grqQg4K6a4K6/4K6y4K+IIOCuheCumeCvjeCu
leCvhyDgro7grqrgr43grqrgrp/grr8g4K614K6o4K+N4K6k4K6k4K+BIOCujuCuqeCvjeCu
sQrgrpXgr4fgrp/gr43grp/grr7grrDgr43grpXgrrPgr40uIOCuquCusiDgroXgrq7gr4bg
rrDgrr/grpXgr43grpUsIOCukOCusOCvi+CuquCvjeCuquCuv+CuryDgroXgrrDgr4Hgrpng
r43grpXgrr7grp/gr43grprgrr/grq/grpXgrpngr43grpXgrrPgrr/grrLgr40g4K6a4K+L
4K604K6w4K+NIOCuleCuvuCusgrgrrXgr4bgrqPgr43grpXgrrIg4K6a4K6/4K6y4K+I4K6V
4K6z4K+NIOCuieCus+CvjeCus+CuqS4KCmh0dHA6Ly93d3cucG9ldHJ5aW5zdG9uZS5pbi9s
YW5nL3RhLzIwMDgvMTAvMDgvd2lsbC1hbnlib2R5LXRocm93LWEtYmVhdXRpZnVsLWhhcnAt
b3V0LW9uLXRoZS1zdHJlZXQuaHRtbAoKQmVzdCByZWdhcmRzLApWaWpheQp2ai5lcGlzdGVt
ZUBwb2V0cnlpbnN0b25lLmluCi0tIApodHRwOi8vd3d3LnBvZXRyeWluc3RvbmUuaW4KSGVy
ZSB0aGUgbGFuZ3VhZ2Ugb2Ygc3RvbmUgc3VycGFzc2VzIHRoZSBsYW5ndWFnZSBvZiBtYW4K


==============================

  2. Re: Not able to read utf-8 message   Reply   Report abuse  
Picture of Manuel Lemos Manuel Lemos - 2008-10-09 02:07:15 - In reply to message 1 from humble.techy
I was able to parse the message without problems. What is failing for you?

  3. Re: Not able to read utf-8 message   Reply   Report abuse  
Picture of humble.techy humble.techy - 2008-10-09 17:07:51 - In reply to message 2 from Manuel Lemos
sorry, I should have pasted the output.

MIME message decoding successful.
1 message was found.
Message 1:
array(6) {
["Headers"]=>
array(27) {
["from "]=>
string(46) "[email protected] Tue Oct 07 19:08:48 2008"
["return-path:"]=>
string(23) "<[email protected]>"
["x-sender:"]=>
string(21) "[email protected]"
["x-apparently-to:"]=>
string(30) "[email protected]"
["x-received:"]=>
array(4) {
[0]=>
string(55) "(qmail 55170 invoked from network); 8 Oct 2008 02:08:46"
[1]=>
string(59) "from unknown (HELO ti-out-0910.google.com) (209.85.142.188)"
[2]=>
string(55) "by ti-out-0910.google.com with SMTP id j3so1985354tid.8"
[3]=>
string(57) "by 10.110.105.1 with HTTP; Tue, 7 Oct 2008 19:08:45 -0700"
}
["-0000
x-received:"]=>
string(27) "from unknown (66.218.67.96)"
["by "]=>
array(2) {
[0]=>
string(58) "m57.grp.scd.yahoo.com with QMQP; 8 Oct 2008 02:08:46 -0000"
[1]=>
string(60) "mta17.grp.scd.yahoo.com with SMTP; 8 Oct 2008 02:08:46 -0000"
}
["for "]=>
string(65) "<[email protected]>; Tue, 07 Oct 2008 19:08:45 -0700"
["(pdt)
x-received:"]=>
string(60) "by 10.110.49.2 with SMTP id w2mr1212417tiw.56.1223431725635;"
["tue, "]=>
string(32) "07 Oct 2008 19:08:45 -0700 (PDT)"
["(pdt)
message-id:"]=>
string(0) ""
["<[email protected]>
date:"]=>
string(30) "Wed, 8 Oct 2008 10:08:45 +0800"
["to:"]=>
string(30) "[email protected]"
["in-reply-to:"]=>
string(55) "<[email protected]>"
["mime-version:"]=>
string(3) "1.0"
["content-type:"]=>
string(25) "text/plain; charset=UTF-8"
["content-transfer-encoding:"]=>
string(6) "base64"
["content-disposition:"]=>
string(6) "inline"
["references:"]=>
string(55) "<[email protected]>"
["x-google-sender-auth:"]=>
string(16) "04ed20463299c011"
["x-originating-ip:"]=>
string(14) "209.85.142.188"
["x-egroups-msg-info:"]=>
string(10) "1:12:0:0:0"
["x-egroups-from:"]=>
string(41) ""vj kumar" <[email protected]>"
["from:"]=>
string(34) ""vj kumar" <[email protected]>"
["subject:"]=>
string(49) "=?UTF-8?Q?Will_anybody_throw_a_beautiful_harp_o?="
["=?utf-8?q?ut_on_the_street_|=e0=ae=a8?=
=?utf-8?q?=e0=ae=b2=e0=af=8d=e0=ae=b2=e0=ae=a4?=
=?utf-8?q?=e0=af=8b=e0=ae=b0=e0=af=8d_=e0=ae=b5?=
=?utf-8?q?=e0=af=80=e0=ae=a3=e0=af=88_=e0=ae=9a?=
=?utf-8?q?=e0=af=86=e0=ae=af=e0=af=8d=e0=ae=a4=e0=af=81?=
x-yahoo-group-post:"]=>
string(20) "member; u=332924112;"
["y=_ecl9v2i_cpowb1cjdwrjr2gfuwxbqtdrppk3e-cf9gxxcqrptw
x-yahoo-profile:"]=>
string(11) "vj_episteme"
}
["Parts"]=>
array(0) {
}
["DecodedHeaders"]=>
array(1) {
["subject:"]=>
array(1) {
[0]=>
array(1) {
[0]=>
array(2) {
["Value"]=>
string(37) "Will anybody throw a beautiful harp o"
["Encoding"]=>
string(5) "UTF-8"
}
}
}
}
["BodyPart"]=>
int(1)
["BodyLength"]=>
int(1404)
["ExtractedAddresses"]=>
array(3) {
["return-path:"]=>
array(1) {
[0]=>
array(1) {
["address"]=>
string(21) "[email protected]"
}
}
["to:"]=>
array(1) {
[0]=>
array(1) {
["address"]=>
string(30) "[email protected]"
}
}
["from:"]=>
array(1) {
[0]=>
array(2) {
["address"]=>
string(21) "[email protected]"
["name"]=>
string(8) "vj kumar"
}
}
}
}
array(9) {
["Type"]=>
string(4) "text"
["Description"]=>
string(12) "Text message"
["Encoding"]=>
string(5) "utf-8"
["Subject"]=>
string(37) "Will anybody throw a beautiful harp o"
["SubjectEncoding"]=>
string(5) "utf-8"
["From"]=>
array(1) {
[0]=>
array(2) {
["address"]=>
string(21) "[email protected]"
["name"]=>
string(8) "vj kumar"
}
}
["To"]=>
array(1) {
[0]=>
array(1) {
["address"]=>
string(30) "[email protected]"
}
}
["Return-path"]=>
array(1) {
[0]=>
array(1) {
["address"]=>
string(21) "[email protected]"
}
}
["DataLength"]=>
int(1404)
}
Warning: invalid header name line at position 283
Warning: invalid header name line at position 417
Warning: invalid header name line at position 549
Warning: invalid header name line at position 698

If you see, the subject is not getting the utf-8 characters

  4. Re: Not able to read utf-8 message   Reply   Report abuse  
Picture of humble.techy humble.techy - 2008-10-09 17:20:11 - In reply to message 3 from humble.techy
In other words, I want to retireve the subject with utf-8 encoded. expected result is "Will anybody throw a beautiful harp out on the street |&#2984;&#2994;&#3021;&#2994;&#2980;&#3019;&#2992;&#3021;
&#2997;&#3008;&#2979;&#3016; &#2970;&#3014;&#2991;&#3021;&#2980;&#3009;" which I am not getting. Hope this is wrong place to ask this question.

  5. Re: Not able to read utf-8 message   Reply   Report abuse  
Picture of Manuel Lemos Manuel Lemos - 2008-10-10 07:11:48 - In reply to message 4 from humble.techy
You just need to pick the value of the Subject entry of the array returned by the Analyze function.

What happens is that you are displaying the results of the var_dump() function in an HTML page. The browser processes the HTML entities that you have there like &#2984; and they are not rendered as visible characters, but they are there in the page. So everything is working as expected.

  6. Re: Not able to read utf-8 message   Reply   Report abuse  
Picture of humble.techy humble.techy - 2008-10-10 10:51:48 - In reply to message 5 from Manuel Lemos
cool. Thanks a lot, appreciate it