<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:series="http://unfoldingneurons.com/"
		>
<channel>
	<title>Comments on: GSM-7 Encoding with the GNU iconv library</title>
	<atom:link href="http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/feed/" rel="self" type="application/rss+xml" />
	<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/</link>
	<description>A blog about Internet protocols, mobile technologies and things remotely related</description>
	<lastBuildDate>Thu, 25 Feb 2010 10:43:22 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Jeroen</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1373</link>
		<dc:creator>Jeroen</dc:creator>
		<pubDate>Thu, 10 Sep 2009 03:02:13 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1373</guid>
		<description>Geir,

I had sent an email to that mailing list right after I published this article. I got a response saying that libiconv is more for streaming applications and SMS messages are short and a simpler API is more appropriate. It was also noted that some later standards on GSM-7 encodings include language specific escape codes for diffent western alphabets (like Turkish), in this case my algorithm would not work.... I&#039;ll continue to use my version of libiconv in SMS software I write and will probably address the language specifica escape codes and update the article for this. If anything; though this article I complied with the LGPL requirements to publish any enhancements.

Regards,
Jeroen</description>
		<content:encoded><![CDATA[<p>Geir,</p>
<p>I had sent an email to that mailing list right after I published this article. I got a response saying that libiconv is more for streaming applications and SMS messages are short and a simpler API is more appropriate. It was also noted that some later standards on GSM-7 encodings include language specific escape codes for diffent western alphabets (like Turkish), in this case my algorithm would not work&#8230;. I&#8217;ll continue to use my version of libiconv in SMS software I write and will probably address the language specifica escape codes and update the article for this. If anything; though this article I complied with the LGPL requirements to publish any enhancements.</p>
<p>Regards,<br />
Jeroen</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Geir</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1368</link>
		<dc:creator>Geir</dc:creator>
		<pubDate>Tue, 08 Sep 2009 14:56:18 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1368</guid>
		<description>if you haven&#039;t already done so; would you please consider submiting your changes to the libiconv project? Their mailing list:

bug-gnu-libiconv @ gnu.org -- for discussing bugs and features of GNU libiconv. An archive for this list can be found at http://lists.gnu.org/archive/html/bug-gnu-libiconv/</description>
		<content:encoded><![CDATA[<p>if you haven&#8217;t already done so; would you please consider submiting your changes to the libiconv project? Their mailing list:</p>
<p>bug-gnu-libiconv @ gnu.org &#8212; for discussing bugs and features of GNU libiconv. An archive for this list can be found at <a href="http://lists.gnu.org/archive/html/bug-gnu-libiconv/" rel="nofollow">http://lists.gnu.org/archive/html/bug-gnu-libiconv/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Franco</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1291</link>
		<dc:creator>Franco</dc:creator>
		<pubDate>Mon, 20 Jul 2009 23:58:36 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1291</guid>
		<description>Jeroen:
Thanks again.
You wrote 22 lines full of characters and of a huge amount of helpful knowledge (unmeasurable).
Regards
Franco</description>
		<content:encoded><![CDATA[<p>Jeroen:<br />
Thanks again.<br />
You wrote 22 lines full of characters and of a huge amount of helpful knowledge (unmeasurable).<br />
Regards<br />
Franco</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeroen</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1290</link>
		<dc:creator>Jeroen</dc:creator>
		<pubDate>Mon, 20 Jul 2009 16:50:46 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1290</guid>
		<description>There are standards for compressing and decompressing SMS messages, but a.f.a.i.k. nobody has implemented them. There are plenty of SMS applications for various smart-phones that will compress SMS messages, but they all require the same app to be running on the sender and the receiver&#039;s phone.

I don&#039;t get your question of &#039;which is best&#039;... Do you mean for use by SMS applications?
- UCS-4 uses 4 octets per Unicode character. This is the most wasteful. All well-known character sets around the world can be coded with 2 octets (UCS-2).
- UCS-2 uses 2 octets per Unicode character. This is wasteful for messages that consist largely of ASCII characters (as used in US and Western European languages). For these messages UTF-8 is more efficient.
- UTF-8 is an encoding of UCS-2 and UCS-4. If used with UCS-4 it will use 1-5 octets per character, if used with UCS-2 it will use 1-3 octets per character. If the you&#039;re dealing with text in Chinese or Japanese, most characters would require 3 octets if you encode them using UTF-8. For these languages UCS-2 is the most efficient.

Since phones only support GSM-7 and/or UCS-2 you should use GSM-7 for English and other Western European languages and use UCS-2 for Chines, Japanese and the likes. Which is just how it was intended.

Though all phones will display UCS-2 messages, doesn&#039;t mean it will correctly display all the characters in the world. What it manages to display depends on which fonts it has installed, which is highly localized (Chinese phones will support Chinese but US phones won&#039;t).

Cheers,
Jeroen</description>
		<content:encoded><![CDATA[<p>There are standards for compressing and decompressing SMS messages, but a.f.a.i.k. nobody has implemented them. There are plenty of SMS applications for various smart-phones that will compress SMS messages, but they all require the same app to be running on the sender and the receiver&#8217;s phone.</p>
<p>I don&#8217;t get your question of &#8216;which is best&#8217;&#8230; Do you mean for use by SMS applications?<br />
- UCS-4 uses 4 octets per Unicode character. This is the most wasteful. All well-known character sets around the world can be coded with 2 octets (UCS-2).<br />
- UCS-2 uses 2 octets per Unicode character. This is wasteful for messages that consist largely of ASCII characters (as used in US and Western European languages). For these messages UTF-8 is more efficient.<br />
- UTF-8 is an encoding of UCS-2 and UCS-4. If used with UCS-4 it will use 1-5 octets per character, if used with UCS-2 it will use 1-3 octets per character. If the you&#8217;re dealing with text in Chinese or Japanese, most characters would require 3 octets if you encode them using UTF-8. For these languages UCS-2 is the most efficient.</p>
<p>Since phones only support GSM-7 and/or UCS-2 you should use GSM-7 for English and other Western European languages and use UCS-2 for Chines, Japanese and the likes. Which is just how it was intended.</p>
<p>Though all phones will display UCS-2 messages, doesn&#8217;t mean it will correctly display all the characters in the world. What it manages to display depends on which fonts it has installed, which is highly localized (Chinese phones will support Chinese but US phones won&#8217;t).</p>
<p>Cheers,<br />
Jeroen</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Franco</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1286</link>
		<dc:creator>Franco</dc:creator>
		<pubDate>Sun, 19 Jul 2009 23:17:04 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1286</guid>
		<description>Jeroen:
I understand that I get better coverage with UCS-2; however, since it is 16 bits coded, reduces the capacity to send characters in half.
Is there a way to send it compressed and be decompressed
by the device?
Which one do you consider best UCS-2, UCS-4 or UTF-8?
Regards
Franco</description>
		<content:encoded><![CDATA[<p>Jeroen:<br />
I understand that I get better coverage with UCS-2; however, since it is 16 bits coded, reduces the capacity to send characters in half.<br />
Is there a way to send it compressed and be decompressed<br />
by the device?<br />
Which one do you consider best UCS-2, UCS-4 or UTF-8?<br />
Regards<br />
Franco</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeroen</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1236</link>
		<dc:creator>Jeroen</dc:creator>
		<pubDate>Thu, 09 Jul 2009 02:19:28 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1236</guid>
		<description>Franco,

Thanks for your offer, but I have everything I need in this area :-).
I used the GNU iconv library because there are many characters in the GSM-7 alphabet that are not in the ASCII character set. In fact GSM-7 contains characters (from languages like Greek, Danish, Spanish) that can&#039;t be mapped to any single byte character set (like iso8859-1), so to get access to the full range of characters offered by GSM-7 you&#039;ll have to use UCS-4, UCS-2 or UTF-8.

Cheers,
Jeroen</description>
		<content:encoded><![CDATA[<p>Franco,</p>
<p>Thanks for your offer, but I have everything I need in this area <img src='http://mobiletidings.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .<br />
I used the GNU iconv library because there are many characters in the GSM-7 alphabet that are not in the ASCII character set. In fact GSM-7 contains characters (from languages like Greek, Danish, Spanish) that can&#8217;t be mapped to any single byte character set (like iso8859-1), so to get access to the full range of characters offered by GSM-7 you&#8217;ll have to use UCS-4, UCS-2 or UTF-8.</p>
<p>Cheers,<br />
Jeroen</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Franco</title>
		<link>http://mobiletidings.com/2009/07/06/gsm-7-encoding-gnu-libiconv/comment-page-1/#comment-1235</link>
		<dc:creator>Franco</dc:creator>
		<pubDate>Thu, 09 Jul 2009 01:54:51 +0000</pubDate>
		<guid isPermaLink="false">http://mobiletidings.com/?p=279#comment-1235</guid>
		<description>Jeroen:
I have a php function that receives an ascii message
and returns GSM-7 hex value, ready to be included in the pdu.
Have another one for concatenated messages.
Have a final one for phone numbers.
If you give me an e-mail I will send a copy to you for your blog.
Regards
Franco</description>
		<content:encoded><![CDATA[<p>Jeroen:<br />
I have a php function that receives an ascii message<br />
and returns GSM-7 hex value, ready to be included in the pdu.<br />
Have another one for concatenated messages.<br />
Have a final one for phone numbers.<br />
If you give me an e-mail I will send a copy to you for your blog.<br />
Regards<br />
Franco</p>
]]></content:encoded>
	</item>
</channel>
</rss>
