色欲综合久久躁天天躁,欧洲性大片xxxxx久久久,国内精品久久久久

windows核心編程--字符集

字符unicode與windows

1 軟件的本地化要解決的真正問題，實際上就是如何來處理不同的字符集。以前我們習慣與用單字節字符集來編程.
2 單字符集:將文本串作為一系列單字節字符來進行編碼，并在結尾處放上一個零。(每個字符用一個字節來表示)
3 雙字節字符集（D B C S ):在雙字節字符集中，字符串中的每個字符可以包含一個字節或包含兩個字節。
4 unicode字符集:U n i c o d e 提供了一種簡單而又一致的表示字符串的方法。U n i c o d e 字符串中的所有字符都是1 6 位的（兩個字節）。
5 當M i c r o s o f t 公司將C O M 從1 6 位Wi n d o w s 轉換成Wi n 3 2 時，公司作出了一個決定，即需要字符串的所有C O M 接口方法都只能接受U n i c o d e 字符串。
6 c運行期庫支持unicode,即使是windows98也支持.
7 Windows 2000 的N o t e p a d (記事本)應用程序允許你既能打開U n i c o d e 文件，也能打開A N S I 文件，并且可以創建這些文件。
8 I s Te x t U n i c o d e 函數能夠幫助進行區分ANSIC字符和unicode：
DWORD IsTextUnicode(CONST PVOID pvBuffer, int cb,PINT pResult);

第一個參數p v B u ff e r 用于標識要測試的緩存的地址。該數據是個無效指針，因為你不知道你擁有的是A N S I 字符數組還是U n i c o d e
字符數組。

第二個參數c b 用于設定p v B u ff e r 指向的字節數。同樣，由于你不知道緩存中放的是什么，因此c b 是個字節數，而不是字符數。請注意，不必設定緩存的整個長度。當然，I s Te x t U n i c o d e能夠測試的字節越多，得到的結果越準確。

第三個參數p R e s u l t 是個整數的地址，必須在調用I s Te x t U n i c o d e 之前對它進行初始化。對該整數進行初始化后，就可以指明你要I s Te x t U n i c o d e 執行哪些測試。也可以為該參數傳遞N U L L ，在這種情況下，I s Te x t U n i c o d e 將執行它能夠進行的所有測試（詳細說明請參見Platform SDK 文檔）。

9 對D B C S 字符串進行操作的幫助函數

函數	描述
PTSTR CharNext(PCTSTR pszCurrentChar);	返回字符串中的下一個字符的地址
PTSTR CharPrev (PCTSTR pszStart,PCTSTR p s z C u r r e n t C h a r);	返回字符串中的上一個字符的地址
BOOL IsDBCSLeadByteTRUE(BYTE bTestChar);	如果該字節是DBCS 字符的第一個字節，則返回

10 “M i c r o s o f t 公司對U n i c o d e 支持的情況”：

• Windows 2000 既支持U n i c o d e ，也支持A N S I ，因此可以為任意一種開發應用程序。

• Windows 98 只支持A N S I ，只能為A N S I 開發應用程序。

• Windows CE 只支持U n i c o d e ，只能為U n i c o d e 開發應用程序。

11 Wi n d o w s 頭文件定義de Uincode 數據類型

數據類型	說明
W C H A R	U n i c o d e 字符
P W S T R	指向U n i c o d e 字符串的指針
P C W S T R	指向一個恒定的U n i c o d e 字符串的指針

使用實例如下:

#ifdef UNICODE
#define CreateWindowEx CreateWindowExW
#else
#define CreateWindowEx CreateWindowExA
#endif //!UNICODE

在Unicode與ANSI之間轉換字符串

Wi n d o w s 函數M u l t i B y t e To Wi d e C h a r 用于將多字節字符串轉換成寬字符串。下面顯示了M u l t i B y t e To Wi d e C h a r 函數。

int MultiByteToWideChar(
UINT CodePage,          //code page
DWORD dwFlags,          //character-type options
LPCSTR lpMultiByteStr,  //address of string to map
int cchMultiByte,       //number of bytes in string
LPWSTR lpWideCharStr,   //address of wide-character buffer
int cchWideChar         //size of buffer
);

u C o d e P a g e 參數用于標識一個與多字節字符串相關的代碼頁號。d w F l a g s 參數用于設定另一個控件，它可以用重音符號之類的區分標記來影響字符。這些標志通常并不使用，在d w F l a g s參數中傳遞0 。p M u l t i B y t e S t r 參數用于設定要轉換的字符串，c c h M u l t i B y t e 參數用于指明該字符串的長度（按字節計算）。如果為c c h M u l t i B y t e 參數傳遞- 1 ，那么該函數用于確定源字符串的長度。

轉換后產生的U n i c o d e 版本字符串將被寫入內存中的緩存，其地址由p Wi d e C h a r S t r 參數指定。必須在c c h Wi d e C h a r 參數中設定該緩存的最大值（以字符為計量單位）。如果調用M u l t i B y t e To Wi d e C h a r ，給c c h Wi d e C h a r 參數傳遞0 ，那么該參數將不執行字符串的轉換，而是返回為使轉換取得成功所需要的緩存的值。一般來說，可以通過下列步驟將多字節字符串轉換成U n i c o d e 等價字符串：

1) 調用M u l t i B y t e To Wi d e C h a r 函數，為p Wi d e C h a r S t r 參數傳遞N U L L ，為c c h Wi d e C h a r 參數傳遞0 。
2) 分配足夠的內存塊，用于存放轉換后的U n i c o d e 字符串。該內存塊的大小由前面對M u l t B y t e To Wi d e C h a r 的調用返回。
3) 再次調用M u l t i B y t e To Wi d e C h a r ，這次將緩存的地址作為p Wi d e C h a r S t r 參數來傳遞，并傳遞第一次調用M u l t i B y t e To Wi d e C h a r 時返回的緩存大小，作為c c h Wi d e c h a r 參數。
4. 使用轉換后的字符串。
5) 釋放U n i c o d e 字符串占用的內存塊。
函數Wi d e C h a r To M u l t i B y t e 將寬字符串轉換成等價的多字節字符串，如下所示：

int WideCharToMultiByte(
UINT CodePage,         // code page
DWORD dwFlags,         // performance and mapping flags
LPCWSTR lpWideCharStr, // address of wide-character string
int cchWideChar,       // number of characters in string
LPSTR lpMultiByteStr,  // address of buffer for new string
int cchMultiByte,      // size of buffer
LPCSTR lpDefaultChar,  // address of default for unmappable
// characters
LPBOOL lpUsedDefaultChar   // address of flag set when default
// char. used
);

該函數與M u l t i B i t e To Wi d e C h a r 函數相似。同樣，u C o d e P a g e 參數用于標識與新轉換的字符串相關的代碼頁。d w F l a g s 則設定用于轉換的其他控件。這些標志能夠作用于帶有區分符號的字符和系統不能轉換的字符。通常不需要為字符串的轉換而擁有這種程度的控制手段，你將為d w F l a g s 參數傳遞0 。

p Wi d e C h a r S t r 參數用于設定要轉換的字符串的內存地址，c c h Wi d e C h a r 參數用于指明該字符串的長度（用字符數來計量）。如果你為c c h Wi d e C h a r 參數傳遞- 1 ，那么該函數用于確定源字符串的長度。

轉換產生的多字節版本的字符串被寫入由p M u l t i B y t e S t r 參數指明的緩存。必須在c c h M u l t i B y t e參數中設定該緩存的最大值（用字節來計量）。如果傳遞0 作為Wi d e C h a r To M u l t i B y t e 函數的c c h M u l t i B y t e 參數，那么該函數將返回目標緩存需要的大小值。通?？梢允褂脤⒍嘧止澴址D換成寬字節字符串時介紹的一系列類似的事件，將寬字節字符串轉換成多字節字符串。

你會發現，Wi d e C h a r To M u l t i B y t e 函數接受的參數比M u l t i B y t e To Wi d e C h a r 函數要多2 個，即p D e f a u l t C h a r 和p f U s e d D e f a u l t C h a r 。只有當Wi d e C h a r To M u l t i B y t e 函數遇到一個寬字節字符，而該字符在u C o d e P a g e 參數標識的代碼頁中并沒有它的表示法時，Wi d e C h a r To M u l t i B y t e 函數才使用這兩個參數。如果寬字節字符不能被轉換，該函數便使用p D e f a u l t C h a r 參數指向的字符。如果該參數是N U L L （這是大多數情況下的參數值），那么該函數使用系統的默認字符。該默認字符通常是個問號。這對于文件名來說是危險的，因為問號是個通配符。

p f U s e d D e f a u l t C h a r 參數指向一個布爾變量，如果寬字符串中至少有一個字符不能轉換成等價多字節字符，那么函數就將該變量置為T R U E 。如果所有字符均被成功地轉換，那么該函數就將該變量置為FA L S E 。當函數返回以便檢查寬字節字符串是否被成功地轉換后，可以測試該變量。同樣，通常為該測試傳遞N U L L 。

關于如何使用這些函數的詳細說明，請參見Platform SDK 文檔。

如果使用這兩個函數，就可以很容易創建這些函數的U n i c o d e 版本和A N S I 版本。例如，你可能有一個動態鏈接庫，它包含一個函數，能夠轉換字符串中的所有字符?？梢韵裣旅孢@樣編寫該函數的U n i c o d e 版本：

BOOL StringReverseW(PWSTR pWideCharStr)
{
//Get a pointer to the last character in the string.
PWSTR pEndOfStr=pWideCharStr+wcslen(pWideCharStr)-1;
wchar_t cCharT;
//Repeat until we reach the center character
//in the string.
while (pWideCharStr < pEndOfStr)
{
//Save a character in a temporary variable.
cCharT=*pWideCharStr;
//Put the last character in the first character.
*pWideCharStr =*pEndOfStr;
//Put the temporary character in the last character.
*pEndOfStr=cCharT;
//Move in one character from the left.
pWideCharStr++;
//Move in one character from the right.
pEndOfStr--;
}
//The string is reversed; return success.
return(TRUE);
}

你可以編寫該函數的A N S I 版本以便該函數根本不執行轉換字符串的實際操作。你也可以編寫該函數的A N S I 版本，以便該函數它將A N S I 字符串轉換成U n i c o d e 字符串，將U n i c o d e 字符串傳遞給S t r i n g R e v e r s e W 函數，然后將轉換后的字符串重新轉換成A N S I 字符串。該函數類似下面的樣子：

BOOL StringReverseA(PSTR pMultiByteStr)
{
PWSTR pWideCharStr;
int nLenOfWideCharStr;
BOOL fOk = FALSE;
//Calculate the number of characters needed to hold
//the wide_character version of string.
nLenOfWideCharStr = MultiRyteToWideChar(CP_ACP, 0,
pMultiByteStr, -1, NULL, 0);
//Allocate memory from the process's default heap to
//accommodate the size of the wide-character string.
//Don't forget that MultiByteToWideChar returns the
//number of characters,not the number of bytes,so
//you must multiply by the size of wide character.
pWideCharStr = HeapAlloc(GetProcessHeap(), 0,
nLenOfWideCharStr * sizeof(WCHAR));
if (pWideCharStr == NULL)
return(fOk);
//Convert the multibyte string to a wide_character string.
MultiByteToWideChar(CP_ACP, 0, pMulti8yteStr, -1,
pWideCharStr, nLenOfWideCharStr);
//Call the wide-character version of this
//function to do the actual work
fOk = StnngReverseW(pWideCharStr);
if (fOk)
{
//Convert the wide-character string back
//to a multibyte string.
WideCharToMultiByte(CP_ACP, 0, pWideCharStr, -1,
pMultiByteStr, strlen(pMultiByteStr), NULL, NULL);
}
//Free the momory containing the wide-character string.
HeapFree(GetProcessHeap(), 0, pWideCharStr);
return(fOk),
}

最后，在用動態鏈接庫分配的頭文件中，可以像下面這樣建立這兩個函數的原型：

BOOL StringReverseW (PWSTR pWideCharStr);
BOOL StringReverseA (PSTR pMultiByteStr);
#ifdef UNICODE
#define StnngReverse StringReverseW
#else
#define StringRevcrsc StringReverseA
#endif // UNICODE

posted on 2006-09-11 16:57 夢在天涯閱讀(1867) 評論(2) 編輯收藏引用所屬分類: Windows API

只有注冊用戶登錄后才能發表評論。
【推薦】100%開源！大型工業跨平臺軟件C++源碼提供，建模，組態！

相關文章: windows消息機制 windows消息大全《windows核心編程》之精華 windows核心編程－－SEH（結構異常處理） windows核心編程－－DLL高級 windows核心編程－－DLL基本 windows核心編程－－內存堆棧 windows核心編程－－內存映射文件 windows核心編程－－虛擬內存 windows核心編程－－內存結構

網站導航: 博客園 IT新聞 BlogJava 博問 Chat2DB 管理

# re: windows核心編程--字符集 2006-09-12 08:50 笨笨

# re: windows核心編程--字符集 2006-09-12 09:18 夢在天涯

C++ Programmer's Cookbook

windows核心編程--字符集

評論

公告

導航

統計

常用鏈接

隨筆分類

隨筆檔案

收藏夾

Blogs

c#(csharp)

C++(cpp)

Enlish

Forums(bbs)

My self

Often go

Useful Webs

Xml/Uml/html

搜索

積分與排名

最新評論

閱讀排行榜