锘??xml version="1.0" encoding="utf-8" standalone="yes"?>亚洲大胆av,国产麻豆精品视频,亚洲一二三区精品http://www.shnenglu.com/kb/category/55.html鍐?鏄矇鐫$潃鐨勬按......zh-cnTue, 20 May 2008 05:11:46 GMTTue, 20 May 2008 05:11:46 GMT60璇勪環涓涓婾TF-8涓嶶NICODE鐩鎬簰杞崲鐨勪唬鐮?/title><link>http://www.shnenglu.com/kb/archive/2005/09/29/491.html</link><dc:creator>鍙啺</dc:creator><author>鍙啺</author><pubDate>Thu, 29 Sep 2005 12:34:00 GMT</pubDate><guid>http://www.shnenglu.com/kb/archive/2005/09/29/491.html</guid><wfw:comment>http://www.shnenglu.com/kb/comments/491.html</wfw:comment><comments>http://www.shnenglu.com/kb/archive/2005/09/29/491.html#Feedback</comments><slash:comments>8</slash:comments><wfw:commentRss>http://www.shnenglu.com/kb/comments/commentRss/491.html</wfw:commentRss><trackback:ping>http://www.shnenglu.com/kb/services/trackbacks/491.html</trackback:ping><description><![CDATA[<font color="#000000" face="Verdana" size="2">涓婂懆,鎴戣姳浜嗗緢澶氬績鎬濅嬌鐢ㄦā鏉垮啓浜嗕竴涓猆TF-8涓嶶NICODE鐩鎬簰杞崲鐨勫姛鑳?瑙佹枃浠?/font><a ><font color="#000080" face="Verdana" size="2">code.rar</font></a><font color="#000000" face="Verdana" size="2">),鍒氬紑濮嬫劅瑙夎繕鍙互,浣嗚繖鍑犲ぉ鎱㈡參鐨勮寰?涓轟粈涔堜笉鐩存帴鎻愪緵涓や釜鍑芥暟鍛?榪欐牱涓嶆槸綆鍗曟柟渚垮悧?鎴戣繖鏍風殑璁捐鍙堣兘甯︽潵棰濆鐨勪粈涔堝ソ澶勫憿?鍒氬紑濮嬫垜鏄兂鎻愪緵姣旇緝鏂逛究濂界敤浠ュ強瀹規槗鎵╁睍涓庣淮鎶ょ殑浠g爜,浣嗙幇鍦ㄦ劅瑙夊埌涓庣洿鎺ユ彁渚汣寮忕殑鍑芥暟騫舵病鏈夊灝戦澶栫殑濂藉.鎴栬榪欐牱鐨勭畝鍗曞姛鑳芥牴鏈氨鐢ㄤ笉鐫榪欐牱澶嶆潅鐨勪唬鐮佸惂.姝eEric Raymond瀵笴++鐨勮瘎浠蜂竴鏍?瀹?浣跨▼搴忓憳鍊懼悜浜庡啓澶嶆潅鐨勪唬鐮?.<br>鎴戞兂澶у鐪嬬湅鎴戠殑浠g爜,緇欐垜涓鐐規剰瑙佸拰寤鴻.</font><img src ="http://www.shnenglu.com/kb/aggbug/491.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.shnenglu.com/kb/" target="_blank">鍙啺</a> 2005-09-29 20:34 <a href="http://www.shnenglu.com/kb/archive/2005/09/29/491.html#Feedback" target="_blank" style="text-decoration:none;">鍙戣〃璇勮</a></div>]]></description></item><item><title>鏋勬漊TF-8瑙g爜妯″潡http://www.shnenglu.com/kb/archive/2005/09/22/399.html鍙啺鍙啺Thu, 22 Sep 2005 15:24:00 GMThttp://www.shnenglu.com/kb/archive/2005/09/22/399.htmlhttp://www.shnenglu.com/kb/comments/399.htmlhttp://www.shnenglu.com/kb/archive/2005/09/22/399.html#Feedback1http://www.shnenglu.com/kb/comments/commentRss/399.htmlhttp://www.shnenglu.com/kb/services/trackbacks/399.html 鎯沖疄鐜頒竴涓В鐮乁TF-8鏍煎紡鏂囨。涓篣nicode鏍煎紡浠g爜鐨?寮曟搸",瑕佺敤璧鋒潵鏂逛究欏烘墜.
浣嗘兂浜嗗嚑澶╀簡,閮芥病鏈変竴涓悎閫傜殑鏂規鏉ュ疄鐜?
鍞?.....
浠婂ぉ鍏堣瘯鐫鍐欎簡鍐?鎵炬壘鎰熻,鎺ョ潃鍐嶆兂鍚?..



鍙啺 2005-09-22 23:24 鍙戣〃璇勮
]]>
std::wfstream鏄庝箞鏀寔瀹藉瓧絎︾殑?http://www.shnenglu.com/kb/archive/2005/09/22/396.html鍙啺鍙啺Thu, 22 Sep 2005 14:47:00 GMThttp://www.shnenglu.com/kb/archive/2005/09/22/396.htmlhttp://www.shnenglu.com/kb/comments/396.htmlhttp://www.shnenglu.com/kb/archive/2005/09/22/396.html#Feedback4http://www.shnenglu.com/kb/comments/commentRss/396.htmlhttp://www.shnenglu.com/kb/services/trackbacks/396.html
std::wfstream鐨勫畾涔変負:
typedef basic_fstream<wchar_t, char_traits<wchar_t> > wfstream;
鍦ㄨ鍙栧瓧絎︽椂:
wfstream wfile( "wcharfile.txt" );
wchar_t wch = wfile.get();
鎸夎涔夎搴旇鏄鍏ヤ袱涓瓧鑺傚唴瀹圭殑.浣嗙粡杈撳嚭媯嫻?瀹冨嵈鍙鍏ヤ竴涓瓧鑺?榪欐牱鍜宖stream榪樻湁浠涔堝垎鍒?
鍒板簳鍦ㄥ鐞哢nicode緙栫爜鐨勬枃浠舵椂,搴旇濡備綍浣跨敤瀹藉瓧絎︽祦?


鍙啺 2005-09-22 22:47 鍙戣〃璇勮
]]>
"榪欐槸涓涓猆TF-8鏍煎紡鐨勬枃妗?"鐨勫嚑縐嶄笉鍚岀紪鐮佽〃紺?/title><link>http://www.shnenglu.com/kb/archive/2005/09/20/343.html</link><dc:creator>鍙啺</dc:creator><author>鍙啺</author><pubDate>Tue, 20 Sep 2005 12:39:00 GMT</pubDate><guid>http://www.shnenglu.com/kb/archive/2005/09/20/343.html</guid><wfw:comment>http://www.shnenglu.com/kb/comments/343.html</wfw:comment><comments>http://www.shnenglu.com/kb/archive/2005/09/20/343.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.shnenglu.com/kb/comments/commentRss/343.html</wfw:commentRss><trackback:ping>http://www.shnenglu.com/kb/services/trackbacks/343.html</trackback:ping><description><![CDATA[<p class="box"><img src="http://www.shnenglu.com/images/cppblog_com/kb/58/r_charcode.gif"> </p><img src ="http://www.shnenglu.com/kb/aggbug/343.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.shnenglu.com/kb/" target="_blank">鍙啺</a> 2005-09-20 20:39 <a href="http://www.shnenglu.com/kb/archive/2005/09/20/343.html#Feedback" target="_blank" style="text-decoration:none;">鍙戣〃璇勮</a></div>]]></description></item><item><title>UTF-8 緙栫爜鏍煎紡鎬葷粨http://www.shnenglu.com/kb/archive/2005/09/19/320.html鍙啺鍙啺Mon, 19 Sep 2005 12:03:00 GMThttp://www.shnenglu.com/kb/archive/2005/09/19/320.htmlhttp://www.shnenglu.com/kb/comments/320.htmlhttp://www.shnenglu.com/kb/archive/2005/09/19/320.html#Feedback3http://www.shnenglu.com/kb/comments/commentRss/320.htmlhttp://www.shnenglu.com/kb/services/trackbacks/320.html[浠ヤ笅鍙槸涓漢鐨勬葷粨,濡傝嫢鏈夎,鎭寵鎸囨,璋㈣阿!]
涓嬪垪瀛楄妭涓茬敤鏉ヨ〃紺轟竴涓瓧絎? 鐢ㄥ埌鍝釜涓插彇鍐充簬璇ュ瓧絎﹀湪 Unicode 涓殑搴忓彿.
U+00000000 - U+0000007F: 0 xxxxxxx 0x - 7x  
U+00000080 - U+000007FF: 110 xxxxx 10 xxxxxx Cx 8x - Dx Bx  
U+00000800 - U+0000FFFF: 1110 xxxx 10 xxxxxx 10 xxxxxx Ex 8x 8x - Ex Bx Bx  
U+00010000 - U+001FFFFF: 11110 xxx 10 xxxxxx 10 xxxxxx 10 xxxxxx F0 8x 8x 8x - F7 Bx Bx Bx 寰堝皯鐢?/td>
U+00200000 - U+03FFFFFF: 111110 xx 10 xxxxxx 10 xxxxxx 10 xxxxxx 10 xxxxxx F8 8x 8x 8x 8x - FB Bx Bx Bx Bx
U+04000000 - U+7FFFFFFF: 1111110 x 10 xxxxxx 10 xxxxxx 10 xxxxxx 10 xxxxxx 10 xxxxxx FC 8x 8x 8x 8x 8x - FD Bx Bx Bx Bx Bx


* FE FF浠庢湭鍦ㄧ紪鐮佷腑鍑虹幇榪?
* 闄ょ涓涓瓧鑺傚,鍏朵綑瀛楄妭閮藉湪 0x80 鍒?0xBF鑼冨洿鍐?姣忎釜瀛楃鐨勮搗濮嬩綅緗敤0xC0-0xD0,0xE0,0xF0絳夊彲浠ョ‘瀹?楠岃瘉鍓嶅洓浣嶆垨鍏綅),涓嶅湪榪欎竴鑼冨洿鐨勫嵆涓哄崟瀛楄妭瀛楃.鍑℃槸浠?span style="color: rgb(153, 0, 0); font-weight: bold;">0x80 鍒?0xBF寮澶寸殑閮芥槸鍚庣戶瀛楄妭,璁℃暟鏃墮兘瑕佽煩榪?
* Unicode鏄竴縐嶇紪鐮佽〃,鍙皢瀛楃鎸囧畾緇欐煇涓鏁板瓧(Unicode鍋氬緱榪樿鏇村涓浜?姣斿鎻愪緵姣旇緝鍙婃樉紺虹瓑寰堝綆楁硶絳夌瓑);
鑰孶TF-8鏄紪鐮佹柟寮?鏄畾涔夊浣曡〃紺哄茍瀛樺偍鎸囧畾緙栫爜鐨勬牸寮?
* UTF-8緙栫爜杞崲涓篣nicode緙栫爜: 灝嗘墍鏈夋爣蹇椾綅鍘婚櫎,鍓╀綑浣嶆暟鑻ヤ笉瓚沖垯鍦ㄩ珮浣嶈ˉ闆?鍑戣凍32浣嶅嵆鍙?
* Unicode緙栫爜杞崲涓篣TF-8緙栫爜: 浠庝綆浣嶅紑濮?姣忓彇6浣嶈ˉ涓や釜浣?0,涓嶈凍6浣?涓嶇畻楂樹綅鐨?)鍒欐寜瀛楄妭闀垮害琛ョ浉搴旂殑瀛楃鏍囧織浣?銆?10銆?110絳?/font>



鍙啺 2005-09-19 20:03 鍙戣〃璇勮
]]>
UTF typeshttp://www.shnenglu.com/kb/archive/2005/09/19/312.html鍙啺鍙啺Mon, 19 Sep 2005 07:38:00 GMThttp://www.shnenglu.com/kb/archive/2005/09/19/312.htmlhttp://www.shnenglu.com/kb/comments/312.htmlhttp://www.shnenglu.com/kb/archive/2005/09/19/312.html#Feedback0http://www.shnenglu.com/kb/comments/commentRss/312.htmlhttp://www.shnenglu.com/kb/services/trackbacks/312.html UTF Formats Estimated average storage required per page (3000 characters) UTF-8




3 KB
(1999)
5 KB
(2003) On average, English takes slightly over one unit per code point. Most Latin-script languages take about 1.1 bytes. Greek, Russian, Arabic and Hebrew take about 1.7 bytes, and most others (including Japanese, Chinese, Korean and Hindi) take about 3 bytes. Characters in surrogate space take 4 bytes, but as a proportion of all world text they will always be very rare. UTF-16


6 KB All of the most common characters in use for all modern writing systems are already represented with 2 bytes. Characters in surrogate space take 4 bytes, but as a proportion of all world text they will always be very rare. UTF-32

12 KB All take 4 bytes

[鏉ユ簮: http://icu.sourceforge.net/docs/papers/forms_of_unicode/]


UTF-8(ISO 10646-1) 鏈変互涓嬬壒鎬?

  • UCS 瀛楃 U+0000 鍒?U+007F (ASCII) 琚紪鐮佷負瀛楄妭 0x00 鍒?0x7F (ASCII 鍏煎). 榪欐剰鍛崇潃鍙寘鍚?7 浣?ASCII 瀛楃鐨勬枃浠跺湪 ASCII 鍜?UTF-8 涓ょ緙栫爜鏂瑰紡涓嬫槸涓鏍風殑.
  • 鎵鏈?span style="color: red;"> > U+007F 鐨?UCS 瀛楃琚紪鐮佷負涓涓垨澶氫釜瀛楄妭鐨勪覆, 姣忎釜瀛楄妭閮芥湁鏍囪浣嶉泦. 鍥犳, ASCII 瀛楄妭 (0x00-0x7F) 涓嶅彲鑳戒綔涓轟換浣曞叾浠栧瓧絎︾殑涓閮ㄥ垎.
  • 琛ㄧず闈?ASCII 瀛楃鐨勫瀛楄妭涓茬殑絎竴涓瓧鑺?/span>鎬繪槸鍦?0xC0 鍒?0xFD 鐨勮寖鍥撮噷, 騫舵寚鍑鴻繖涓瓧絎﹀寘鍚灝戜釜瀛楄妭. 澶氬瓧鑺備覆鐨?span style="color: red;">鍏朵綑瀛楄妭閮藉湪 0x80 鍒?0xBF 鑼冨洿閲? 榪欎嬌寰楅噸鏂板悓姝ラ潪甯稿鏄? 騫朵嬌緙栫爜鏃犲浗鐣? 涓斿緢灝戝彈涓㈠け瀛楄妭鐨勫獎鍝?
  • 鍙互緙栧叆鎵鏈夊彲鑳界殑 231涓?UCS 浠g爜
  • UTF-8 緙栫爜瀛楃鐞嗚涓婂彲浠ユ渶澶氬埌 6 涓瓧鑺傞暱, 鐒惰?16 浣?BMP 瀛楃鏈澶氬彧鐢ㄥ埌 3 瀛楄妭闀?
  • Bigendian UCS-4 瀛楄妭涓茬殑鎺掑垪欏哄簭鏄瀹氱殑.
  • 瀛楄妭 0xFE 鍜?0xFF 鍦?UTF-8 緙栫爜涓粠鏈敤鍒?

涓嬪垪瀛楄妭涓茬敤鏉ヨ〃紺轟竴涓瓧絎? 鐢ㄥ埌鍝釜涓插彇鍐充簬璇ュ瓧絎﹀湪 Unicode 涓殑搴忓彿.

U-00000000 - U-0000007F: 0xxxxxxx
U-00000080 - U-000007FF: 110xxxxx 10xxxxxx
U-00000800 - U-0000FFFF: 1110xxxx 10xxxxxx 10xxxxxx
U-00010000 - U-001FFFFF: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
U-00200000 - U-03FFFFFF: 111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx
U-04000000 - U-7FFFFFFF: 1111110x 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx

xxx 鐨勪綅緗敱瀛楃緙栫爜鏁扮殑浜岃繘鍒惰〃紺虹殑浣嶅~鍏? 瓚婇潬鍙崇殑 x 鍏鋒湁瓚婂皯鐨勭壒孌婃剰涔? 鍙敤鏈鐭殑閭d釜瓚沖琛ㄨ揪涓涓瓧絎︾紪鐮佹暟鐨勫瀛楄妭涓? 娉ㄦ剰鍦ㄥ瀛楄妭涓蹭腑, 絎竴涓瓧鑺傜殑寮澶?1"鐨勬暟鐩氨鏄暣涓覆涓瓧鑺傜殑鏁扮洰.

渚嬪: Unicode 瀛楃 U+00A9 = 1010 1001 (鐗堟潈絎﹀彿) 鍦?UTF-8 閲岀殑緙栫爜涓?

11000010 10101001 = 0xC2 0xA9

鑰屽瓧絎?U+2260 = 0010 0010 0110 0000 (涓嶇瓑浜? 緙栫爜涓?

11100010 10001001 10100000 = 0xE2 0x89 0xA0

榪欑緙栫爜鐨勫畼鏂瑰悕瀛楁嫾鍐欎負 UTF-8, 鍏朵腑 UTF 浠h〃 UCS Transformation Format. 璇峰嬁鍦ㄤ換浣曟枃妗d腑鐢ㄥ叾浠栧悕瀛?(姣斿 utf8 鎴?UTF_8) 鏉ヨ〃紺?UTF-8, 褰撶劧闄ら潪浣犳寚鐨勬槸涓涓彉閲忓悕鑰屼笉鏄繖縐嶇紪鐮佹湰韜?

浠涔堢紪紼嬭璦鏀寔 Unicode?

鍦ㄥぇ綰?1993 騫翠箣鍚庡紑鍙戠殑澶у鏁扮幇浠g紪紼嬭璦閮芥湁涓涓壒鍒殑鏁版嵁綾誨瀷, 鍙仛 Unicode/ISO 10646-1 瀛楃. 鍦?Ada95 涓彨 Wide_Character, 鍦?Java 涓彨 char.

ISO C 涔熻緇嗚鏄庝簡澶勭悊澶氬瓧鑺傜紪鐮佸拰瀹藉瓧絎?(wide characters) 鐨勬満鍒? 1994 騫?9 鏈?Amendment 1 to ISO C 鍙戣〃鏃跺張鍔犲叆浜嗘洿澶? 榪欎簺鏈哄埗涓昏鏄負鍚勭被涓滀簹緙栫爜鑰岃璁$殑, 瀹冧滑姣斿鐞?UCS 鎵闇鐨勮鍋ュ.寰楀. UTF-8 鏄?ISO C 鏍囧噯璋冪敤澶氬瓧鑺傚瓧絎︿覆鐨勭紪鐮佺殑涓涓緥瀛? wchar_t 綾誨瀷鍙互鐢ㄦ潵瀛樻斁 Unicode 瀛楃.
[鏉ユ簮: http://www.linuxforum.net/books/UTF-8-Unicode.html]



鍙啺 2005-09-19 15:38 鍙戣〃璇勮
]]>
UTF serializationshttp://www.shnenglu.com/kb/archive/2005/09/19/310.html鍙啺鍙啺Mon, 19 Sep 2005 07:23:00 GMThttp://www.shnenglu.com/kb/archive/2005/09/19/310.htmlhttp://www.shnenglu.com/kb/comments/310.htmlhttp://www.shnenglu.com/kb/archive/2005/09/19/310.html#Feedback0http://www.shnenglu.com/kb/comments/commentRss/310.htmlhttp://www.shnenglu.com/kb/services/trackbacks/310.html
UTF-8
  • Inital EF BB BF is a signature, indicating that the rest of the file is UTF-8.
  • Any EF BF BE is an error.
  • A real ZWNBSP at the start of a file requires a signature first.
UTF-8N
  • All of the text is normal UTF-8; there is no signature.
  • Inital EF BB BF is a ZWNBSP.
  • Any EF BF BE is an error.
UTF-16
  • Initial FE FF is a signature indicating the rest of the text is big endian UTF-16.
  • Initial FF FE is a signature indicating the rest of the text is little endian UTF-16.
  • If neither of these are present, all of the text is big endian.
  • A real ZWNBSP at the start of a file requires a signature first.
UTF-16BE
  • All of the text is big endian: there is no signature.
  • Initial FE FF is a ZWNBSP.
  • Any FF FE is an error.
UTF-16LE
  • All of the text is little endian: there is no signature.
  • Initial FF FE is a ZWNBSP.
  • Any FE FF is an error.
UTF-32
  • Initial 00 00 FE FF is a signature indicating the rest of the text is big endian UTF-32.
  • Initial FF FE 00 00 is a signature indicating the rest of the text is little endian UTF-32.
  • If neither of these are present, all of the text is big endian.
  • A real ZWNBSP at the start of a file requires a signature first.
UTF-32BE
  • All of the text is big endian: there is no signature.
  • Initial 00 00 FE FF is a ZWNBSP.
  • Any FF FE 00 00 is an error.
UTF-32LE
  • All of the text is little endian: there is no signature.
  • Initial FF FE 00 00 is a ZWNBSP.
  • Initial 00 00 FE FF is an error.
Note: The italicized names are not yet registered, but are useful for reference.
[from: http://icu.sourceforge.net/docs/papers/forms_of_unicode/]


鍙啺 2005-09-19 15:23 鍙戣〃璇勮
]]>
久久免费看黄a级毛片| 久久久这里有精品中文字幕| 精品无码久久久久久午夜| 久久er热视频在这里精品| 色婷婷狠狠久久综合五月| 久久精品毛片免费观看| 少妇人妻综合久久中文字幕| 久久国产成人精品麻豆| 久久免费的精品国产V∧| 久久久精品人妻无码专区不卡| 亚洲精品高清国产一线久久| 久久精品国产精品亜洲毛片| 国产情侣久久久久aⅴ免费| 婷婷久久综合九色综合九七| 国产精品18久久久久久vr| 色偷偷久久一区二区三区| 亚洲伊人久久综合中文成人网| 88久久精品无码一区二区毛片| 国产亚洲综合久久系列| 无码AV中文字幕久久专区| 久久亚洲日韩看片无码| 一个色综合久久| 老司机午夜网站国内精品久久久久久久久 | 一级A毛片免费观看久久精品| 国产精品久久久久影院嫩草 | 久久久久国色AV免费观看| 国内精品久久久久影院免费| 久久久久亚洲AV无码专区体验| 国产精品乱码久久久久久软件| 久久久久久一区国产精品| 精品无码久久久久久国产| 亚洲午夜精品久久久久久人妖| 久久er国产精品免费观看2| 97久久综合精品久久久综合| 国产精品一区二区久久| 国产精品久久网| 久久精品成人影院| 99久久香蕉国产线看观香| 一本一本久久aa综合精品| 青草国产精品久久久久久| 97久久久精品综合88久久|