作者:Jeff Bogan
原文:
http://www.codeproject.com/vcpp/stl/PracticalGuideStl.asp翻譯:
WinterWinter注: 這是一篇非常不錯(cuò)的文章,以前周翔已經(jīng)翻譯過了。只是感覺翻譯得有些欠妥之處,特別是一些術(shù)語的翻譯,因此這里重新翻譯。
1 介紹
對(duì)于當(dāng)今所有C++程序員來說,STL(標(biāo)準(zhǔn)模板庫的縮寫)都是非常不錯(cuò)的技術(shù)。但我必須要提醒的是要想習(xí)慣使用有一定難度,例如,會(huì)有很陡峭的學(xué)習(xí)曲線,其使用許多名字也不是憑直覺就可以知道其意思(或許是因?yàn)樗泻糜浀拿侄急挥霉饬耍?。但一旦你學(xué)會(huì)了STL,你將會(huì)因此而受益匪淺。和MFC的容器相比,STL更加靈活且功能強(qiáng)大。
其優(yōu)勢如下:
- 能方便的排序和搜索。
- 更安全且更容易調(diào)試。
- 你能讀懂Unix程序員的代碼注1。
- 將為你的簡歷上增加技能。
2 背景
寫本文檔的目的在于讓讀者可以在這富有挑戰(zhàn)性的計(jì)算機(jī)科學(xué)領(lǐng)域有個(gè)良好的開端,不必費(fèi)力地了解那無窮無盡的行話術(shù)語和沉悶的規(guī)則,那些行話和規(guī)則只是STLer們用于自娛的創(chuàng)造品。
3 使用代碼
本文檔中的代碼對(duì)讀者在使用STL實(shí)踐之路上有很強(qiáng)的指導(dǎo)作用。
4 定義
- 模板(template)-- 類(以及結(jié)構(gòu)、數(shù)據(jù)類型、和函數(shù))的宏。有時(shí)也叫cookie cutter. 同時(shí)和已知范型(generic)形式一樣--一個(gè)類模板叫范型類,同樣,一個(gè)函數(shù)模板叫范型函數(shù)。
- STL -- 標(biāo)準(zhǔn)模板庫,由一群聰明人寫的模板,現(xiàn)在作為標(biāo)準(zhǔn)C++語言的一部分被所有人使用。
- 容器(container) -- 可容納一定數(shù)據(jù)的類。在STL中有vector, set, map, multimap, deque等容器。
- vector -- 一個(gè)基礎(chǔ)的數(shù)據(jù)模板,是一種容器。
- 迭代器(Iterator) -- 一個(gè)非常有意思的詞,其實(shí)是STL容器內(nèi)部元素的指針。它同時(shí)完成其他許多功能。
5 Hello Word 程序
I always wanted to write one and here is my golden 24 karet opportunity: a hello world program. 這個(gè)程序把一個(gè)字符串轉(zhuǎn)換為一個(gè)字符vector,然后以逐個(gè)字符顯示整個(gè)字符串。vector就像是盛放變長數(shù)組的花園,在STL所有容器中,大約有一半是基于vector的,故可以這么說,尚若你掌握了這個(gè)程序,那么你就理解了整個(gè)STL的一半了
//?Program:?Vector?Demo?1
//?Purpose:?用于演示STL?vector

//?#include?"stdafx.h"?-?如果你使用預(yù)編譯需要包含此文件[[#ExplainIn2][注2]]
#include?<vector>??//?STL?vector?頭文件.?注意,并沒有".h"
#include?<iostream>??//?需要用到?cout
using?namespace?std;??//?確保命名空間是?std

char*?szHW?=?"Hello?World";??
//?眾所周知,這是個(gè)以NULL結(jié)尾的字符數(shù)組?

int?main(int?argc,?char*?argv[])


{
??vector?<char>?vec;??//?一個(gè)字符類型的vector(相當(dāng)于STL中的數(shù)組)

??//?為字符vector定義迭代器
??vector?<char>::iterator?vi;

??//?初始化字符vector,循環(huán)整個(gè)字符串,把每個(gè)字符放入vector中,直至字符串末尾的NULL字符
??char*?cptr?=?szHW;??//??Hello?World?字符串的首地址
??while?(*cptr?!=?'\0')

??
{??vec.push_back(*cptr);??cptr++;??}
??//?push_back?函數(shù)把數(shù)據(jù)插入vector的最后?

??//?把存在STL數(shù)組中的每個(gè)字符打印到屏幕上
??for?(vi=vec.begin();?vi!=vec.end();?vi++)??
??//?這就是在STL中循環(huán)的標(biāo)準(zhǔn)判斷方式-?經(jīng)常使用?"!="?而不是?"<"?
??//?某些容器可能并沒有重載操作符?"<"?。?
??//begin()和end()會(huì)得到vector的開頭和結(jié)尾兩個(gè)元素的迭代器(指針)?

??
{??cout?<<?*vi;??}??//?使用間接操作符(*)從迭代器中取得數(shù)據(jù)
??cout?<<?endl;??//?輸出完畢,打印?"\n"

??return?0;
}

push_back 是用來向vector或deque容器中插入數(shù)據(jù)的標(biāo)準(zhǔn)函數(shù)。insert是類似功能的函數(shù),適用于所有容器,但用法更復(fù)雜。end()實(shí)際上表示在最后的位置再加一,以便循環(huán)可以正常執(zhí)行 - 它返回的指針指向最靠近數(shù)組界限的數(shù)據(jù)。就像普通循環(huán)中的數(shù)組,比如for (i=0; i<6; i++) {ar[i] = i;} ——ar[6]是不存在的,在循環(huán)中不會(huì)達(dá)到這個(gè)元素,所以在循環(huán)中不會(huì)出現(xiàn)問題。
6 STL的煩惱之一:
STL令人煩惱的地方是在它初始化的時(shí)候。STL中容器的初始化比C/C++數(shù)組初始化要麻煩的多。你只能一個(gè)元素一個(gè)元素地來,或者先初始化一個(gè)普通數(shù)組再通過轉(zhuǎn)化填放到容器中。我認(rèn)為人們通常可以這樣做:
//?Program:?Initialization?Demo
//?Purpose:?To?demonstrate?initialization?of?STL?vectors

#include?<cstring>??//?same?as?<string.h>
#include?<vector>
using?namespace?std;


int?ar[10]?=?
{??12,?45,?234,?64,?12,?35,?63,?23,?12,?55??};
char*?str?=?"Hello?World";

int?main(int?argc,?char*?argv[])


{
??vector?<int>?vec1(ar,?ar+10);
??vector?<char>?vec2(str,?str+strlen(str));
??return?0;
}

在編程中,有很多種方法來完成同樣的工作。另一種填充向量的方法是用更加熟悉的方括號(hào),例如:
//?Program:?Vector?Demo?2
//?Purpose:?To?demonstrate?STL?vectors?with
//?counters?and?square?brackets

#include?<cstring>
#include?<vector>
#include?<iostream>
using?namespace?std;

char*?szHW?=?"Hello?World";
int?main(int?argc,?char*?argv[])


{
??vector?<char>?vec(strlen(sHW));?
??//?The?argument?initializes?the?memory?footprint
??int?i,?k?=?0;
??char*?cptr?=?szHW;
??while?(*cptr?!=?'\0')

??
{??vec[k]?=?*cptr;??cptr++;??k++;??}
??for?(i=0;?i<vec.size();?i++)

??
{??cout?<<?vec[i];??}
??cout?<<?endl;
??return?0;
}

這個(gè)例子更加清晰,但沒有使用迭代器(iterator)操作,并且定義了額外的整數(shù)作為下標(biāo),而且,你必須清楚地在程序中說明為vector分配多少內(nèi)存空間。
7 命名空間(namespace)
與STL相關(guān)的概念是命名空間(namespace)。STL定義在std命名空間中。有3種方法聲明使用的命名空間:
- 用using關(guān)鍵字使用這個(gè)命名空間,在文件的頂部,但在聲明的頭文件下面加入:
using namespace std;
最于簡單工程來說,這是最簡單也是最佳方式。直接把你的代碼定位到std命名空間,
This is the simplest and best for simple projects, limits you to the std namespace, anything you add is improperly put in the std namespace (I think you go to heck for doing this).
- Specify each and every template before use (like prototyping)
using std::cout; using std::endl; using std::flush; using std::set; using std::inserter;
This is slightly more tedious, although a good mnemonic for the functions that will be used, and you can interlace other namespaces easily.
- EVERY time you use a template from the std namespace, use the std scope specifier.
typedef std::vector VEC_STR;
This is tedious but the best way if you are mixing and matching lots of namespaces. Some STL zealots will always use this and call anyone evil who does not. Some people will create macros to simplify matters.
In addition, you can put using namespace std within any scope, for example, at the top of a function or within a control loop. Some Tips
To avoid an annoying error code in debug mode, use the following compiler pragma:
#pragma warning(disable: 4786)
Another gotcha is: you must make sure that the spaces are placed between your angle brackets and the name. This is because >> is the bit shift operator, so:
vector <list<int>> veclis;
will give an error. Instead, write it:
vector > veclis;
to avoid compilation errors. Another Container - The set
This is the explanation lifted from the MS help file of the set: "The template class describes an object that controls a varying-length sequence of elements of type const Key. Each element serves as both a sort key and a value. The sequence is represented in a way that permits lookup, insertion, and removal of an arbitrary element with a number of operations proportional to the logarithm of the number of elements in the sequence (logarithmic time). Moreover, inserting an element invalidates no iterators, and removing an element invalidates only those iterators that point at the removed element."
An alternate, more practical, definition is: A set is a container that contains all unique values. This is useful for cases in which you are required to collect the occurrence of value. It is sorted in an order that is specified at the instantiation of the set. If you need to store data with a key/value pair, then a map is a better choice. A set is organized as a linked list, is faster than a vector on insertion and removal, but slightly slower on search and addition to end.
An example program would be:
// Program: Set Demo // Purpose: To demonstrate STL sets
#include #include #include using namespace std;
int main(int argc, char* argv[]) { set strset; set ::iterator si; strset.insert("cantaloupes"); strset.insert("apple"); strset.insert("orange"); strset.insert("banana"); strset.insert("grapes"); strset.insert("grapes"); // This one overwrites the previous occurrence for (si=strset.begin(); si!=strset.end(); si++) { cout << *si << " "; } cout << endl; return 0; }
// Output: apple banana cantaloupes grapes orange
If you want to become an STL fanatic, you can also replace the output loop in the program with the following lines.
copy(strset.begin(), strset.end(), ostream_iterator(cout, " "));
While instructive, I find this personally less clear and prone to error. If you see it, now you know what it does. All the STL Containers
Containers pre-date templates and are computer science concepts that have been incorporated into STL. The following are the seven containers implemented in STL.
* vector - Your standard safe array. It is expanded in the "front" direction only. * deque - Functionally the same as a vector. Internally, it is different. It can be expanded in both the front and back. * list - Can only be traversed one step at time. If you are already familiar with the concept of a list, an STL list is doubly linked (contains pointer to both the previous and next value). * set - contains unique values that are sorted. * map - sorted set of paired values, one of which is the key on which sorts and searches occur, and the value which is retrieved from the container. E.g. instead of ar[43] = "overripe", a map lets you do this ar["banana"] = "overripe". So if you wanted to draw up a bit of information keyed on full name is easily done. * multiset - same as a set, but does not necessarily have unique values. * multimap - same as a map, but does not necessarily have unique keys.
Note: If you are reading the MFC help then you will also come across the efficiency statement of each container. I.E. (log n * n) insertion time. Unless you are dealing with very large number of values, you should ignore this. If you start to get a noticeable lag or are dealing with time critical stuff then you should learn more about the proper efficiency of various containers. How to Use a Map with some Class
The map is a template that uses a key to obtain a value.
Another issue is that you will want to use your own classes instead of data types, like int that has been used up to now. To create a class that is "template-ready", you must be ensure that the class contains certain member functions and operators. The basics are:
* default constructor (empty, usually) * copy constructor * overload "="
You would overload more operators as required in a specific template, for example, if you plan to have a class that is a key in a map you would have to overload relational operators. But that is another story.
// Program: Map Own Class // Purpose: To demonstrate a map of classes
#include #include #include #include