四虎久久影院,亚洲国产另类久久久精品,久久人妻AV中文字幕

Trie樹就是字典樹，其核心思想就是空間換時間。

舉個簡單的例子。

給你100000個長度不超過10的單詞。對于每一個單詞，我們要判斷他出沒出現過，如果出現了，第一次出現第幾個位置。
這題當然可以用hash來，但是我要介紹的是trie樹。在某些方面它的用途更大。比如說對于某一個單詞，我要詢問它的前綴是否出現過。這樣hash就不好搞了，而用trie還是很簡單。
現在回到例子中，如果我們用最傻的方法，對于每一個單詞，我們都要去查找它前面的單詞中是否有它。那么這個算法的復雜度就是O(n^2)。顯然對于100000的范圍難以接受。現在我們換個思路想。假設我要查詢的單詞是abcd，那么在他前面的單詞中，以b，c，d，f之類開頭的我顯然不必考慮。而只要找以a開頭的中是否存在abcd就可以了。同樣的，在以a開頭中的單詞中，我們只要考慮以b作為第二個字母的……這樣一個樹的模型就漸漸清晰了……
假設有b，abc，abd，bcd，abcd，efg，hii這6個單詞，我們構建的樹就是這樣的。

對于每一個節點，從根遍歷到他的過程就是一個單詞，如果這個節點被標記為紅色，就表示這個單詞存在，否則不存在。
那么，對于一個單詞，我只要順著他從跟走到對應的節點，再看這個節點是否被標記為紅色就可以知道它是否出現過了。把這個節點標記為紅色，就相當于插入了這個單詞。
這樣一來我們詢問和插入可以一起完成，所用時間僅僅為單詞長度，在這一個樣例，便是10。
我們可以看到，trie樹每一層的節點數是26^i級別的。所以為了節省空間。我們用動態鏈表，或者用數組來模擬動態。空間的花費，不會超過單詞數×單詞長度。

給出一個用類封裝的字典樹代碼，厄。。。做ACM的模板用另一個。。應該放在了“ACM模板”文件夾下了。。。

#include <cstdio>

#include <iostream>

#include <cstring>

using namespace std;

const int num_chars = 26;

class Trie {

public:

Trie():root(NULL){};

Trie(Trie& tr);

int search(const char* word, char* entry ) const;

int insert(const char* word, const char* entry);

int remove(const char* word, char* entry);

private:

struct Trie_node

{

char* data;

Trie_node* branch[num_chars];

Trie_node();

}* root;

};

Trie::Trie_node::Trie_node()

{

data = NULL;

for (int i=0; i<num_chars; ++i)

branch[i] = NULL;

}

int Trie::search(const char* word, char* entry ) const

{

int position = 0;

char char_code;

Trie_node *location = root;

while( location!=NULL && *word!=0 )

{

if (*word>='A' && *word<='Z')

char_code = *word-'A';

else if (*word>='a' && *word<='z')

char_code = *word-'a';

else return 0;

location = location->branch[char_code];

position++;

word++;

}

if ( location != NULL && location->data != NULL )

{

strcpy(entry,location->data);

return 1;

}

else return 0;

}

int Trie::insert(const char* word, const char* entry)

{

int result = 1, position = 0;

if ( root == NULL ) root = new Trie_node;

char char_code;

Trie_node *location = root;

while( location!=NULL && *word!=0 )

{

if (*word>='A' && *word<='Z')

char_code = *word-'A';

else if (*word>='a' && *word<='z')

char_code = *word-'a';

else return 0;

if( location->branch[char_code] == NULL )

location->branch[char_code] = new Trie_node;

location = location->branch[char_code];

position++;

word++;

}

if (location->data != NULL)

result = 0;

else {

location->data = new char[strlen(entry)+1];

strcpy(location->data, entry);

}

return result;

}

int main()

{

Trie t;

char entry[100];

t.insert("aa", "DET");

t.insert("abacus","NOUN");

t.insert("abalone","NOUN");

t.insert("abandon","VERB");

t.insert("abandoned","ADJ");

t.insert("abashed","ADJ");

t.insert("abate","VERB");

t.insert("this", "PRON");

if (t.search("this", entry))

cout<<"'this' was found. pos: "<<entry<<endl;

if (t.search("abate", entry))

cout<<"'abate' is found. pos: "<<entry<<endl;

if (t.search("baby", entry))

cout<<"'baby' is found. pos: "<<entry<<endl;

else

cout<<"'baby' does not exist at all!"<<endl;

if (t.search("aa", entry))

cout<<"'aa was found. pos: "<<entry<<endl;

}

PS:實現方法http://met.fzu.edu.cn/eduonline/web/web/resources/articleContent.asp?id=346

發表于 2008-11-16 11:42 hunter 閱讀(7998) 評論(7) 編輯收藏引用所屬分類: algorithm

# re: 字典樹原理（轉）回復更多評論

您的類里邊沒有定義析構函數，這樣會不會造成內存泄露啊。。。

chasefornone 評論于 2010-08-31 16:14

# re: 字典樹原理（轉）[未登錄] 回復更多評論

不會的

yy 評論于 2010-10-15 19:58

不會才怪菜鳥寫的

TonyWang 評論于 2011-09-30 10:28

只有new，沒有delete，必然內存泄露

123 評論于 2012-09-03 13:55

代碼風格不是很好

ygqwna 評論于 2013-08-31 23:23

一看就是c++外行寫的代碼，

ddd 評論于 2013-10-25 08:48

常用鏈接

留言簿(4)

隨筆分類

隨筆檔案

搜索

最新評論

閱讀排行榜

評論排行榜

字典樹原理（轉）