[Translation] Berkeley DB Documentation - Getting Started with C++ - Section 1.3 - Access Methods
Access Methods
While this manual will focus primarily on the BTree access method, it is still useful to briefly describe all of the access methods that DB makes available.
Note that an access method can be selected only when the database is created. Once selected, actual API usage is generally identical across all access methods. That is, while some exceptions exist, mechanically you interact with the library in the same way regardless of which access method you have selected.
The access method you should choose is determined first by what you want to use as a key, and second by the performance you observe for a given access method.
The following are the available access methods:
BTree
Data is stored in a sorted, balanced tree structure. Both the key and the data for BTree records can be arbitrarily complex. That is, they can contain single values such as an integer or a string, or complex types such as a structure. Also, although not the default behavior, it is possible for two records to use keys that compare as equal. When this occurs, the records are considered to be duplicates of one another.
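As a rough in-memory analogy only (standard library containers, not the DB API), a sorted structure that permits duplicate keys behaves much like std::multimap. The Inventory struct, the SortedStore alias, and the helper functions below are hypothetical names made up for this sketch.

```cpp
#include <map>
#include <string>
#include <vector>

// Hypothetical "complex" data value stored under a string key.
struct Inventory {
    int quantity;
    double price;
};

// Sorted store that allows duplicate keys, loosely analogous to a
// BTree database with duplicates enabled. This is NOT Berkeley DB code.
using SortedStore = std::multimap<std::string, Inventory>;

// Return the keys in storage order, which is ascending key order.
std::vector<std::string> keys_in_order(const SortedStore& store) {
    std::vector<std::string> keys;
    for (const auto& entry : store) {
        keys.push_back(entry.first);
    }
    return keys;
}

// Build a small store; insertion order does not affect storage order.
SortedStore make_sample_store() {
    SortedStore store;
    store.emplace("pear", Inventory{10, 0.50});
    store.emplace("apple", Inventory{3, 0.25});
    store.emplace("apple", Inventory{7, 0.30});  // same key: a duplicate record
    return store;
}
```

Swapping std::multimap for std::map in this sketch would mimic the default behavior, where a second record with an equal key is not stored alongside the first.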
Hash
Data is stored in an extended linear hash table. Like BTree, the key and the data used for Hash records can be arbitrarily complex. Also, like BTree, duplicate records are optionally supported.
Queue
Data is stored in a queue as fixed-length records. Each record uses a logical record number as its key. This access method is designed for fast inserts at the tail of the queue, and it has a special operation that deletes and returns a record from the head of the queue.
This access method is unusual in that it provides record level locking. This can provide beneficial performance improvements in applications requiring concurrent access to the queue.
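The queue behavior described above (fixed-length records, record-number keys, append at the tail, remove-and-return at the head) can be modeled with a toy class. This is a sketch built on std::deque, not the DB API; the class name, the 16-byte record length, and the space-padding rule are assumptions made for illustration, and it does not model locking at all.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <deque>
#include <string>

// Hypothetical fixed record length for this sketch.
constexpr std::size_t kRecordLength = 16;

// Toy model of a queue of fixed-length records keyed by record number.
class FixedRecordQueue {
public:
    using Record = std::array<char, kRecordLength>;

    // Append at the tail and return the record number assigned.
    // Short payloads are padded with spaces and long ones are truncated;
    // that policy is a choice made for this sketch only.
    std::uint32_t append(const std::string& payload) {
        Record rec;
        rec.fill(' ');
        const std::size_t n =
            payload.size() < kRecordLength ? payload.size() : kRecordLength;
        std::memcpy(rec.data(), payload.data(), n);
        records_.push_back(rec);
        return next_recno_++;
    }

    // Remove the head record and copy it into `out`.
    // Returns false if the queue is empty.
    bool consume(std::string& out) {
        if (records_.empty()) {
            return false;
        }
        const Record& rec = records_.front();
        out.assign(rec.data(), rec.size());
        records_.pop_front();
        ++head_recno_;
        return true;
    }

    // Record number of the current head (numbering starts at 1 here).
    std::uint32_t head_recno() const { return head_recno_; }

private:
    std::deque<Record> records_;
    std::uint32_t head_recno_ = 1;
    std::uint32_t next_recno_ = 1;
};
```

Note that consuming the head does not renumber the remaining records: the second record appended is still record 2 after record 1 is removed.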
Recno
Data is stored in either fixed or variable-length records. Like Queue, Recno records use logical record numbers as keys.
Selecting Access Methods
To select an access method, you should first consider what you want to use as a key for your database records. If you want to use arbitrary data (even strings), then you should use either BTree or Hash. If you want to use logical record numbers (essentially integers) then you should use Queue or Recno.
Once you have made this decision, you must choose between BTree and Hash, or between Queue and Recno. This decision is described next.
Choosing between BTree and Hash
For small working datasets that fit entirely in memory, there is no significant performance difference between BTree and Hash; both perform equally well. In this situation, you might just as well use BTree, if for no other reason than that the majority of DB applications use BTree.
Note that the main concern here is your working dataset, not your entire dataset. Many applications maintain large amounts of information but only need to access some small portion of that data with any frequency. So what you want to consider is the data that you will routinely use, not the sum total of all the data managed by your application.
However, as your working dataset grows to the point where you cannot fit it all into memory, then you need to take more care when choosing your access method. Specifically, choose:
* BTree if your keys have some locality of reference. That is, if they sort well and you can expect that a query for a given key will likely be followed by a query for one of its neighbors.
* Hash if your dataset is extremely large. For any given access method, DB must maintain a certain amount of internal information. However, the amount of information that DB must maintain for BTree is much greater than for Hash. The result is that as your dataset grows, this internal information can dominate the cache to the point where there is relatively little space left for application data. As a result, BTree can be forced to perform disk I/O much more frequently than would Hash given the same amount of data.
Moreover, if your dataset becomes so large that DB will almost certainly have to perform disk I/O to satisfy a random request, then Hash will definitely outperform BTree because it has fewer internal records to search through than does BTree.
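To make "locality of reference" concrete: in a sorted structure, the neighbors of a key sit next to it, so a lookup followed by requests for nearby keys can simply walk forward. The sketch below uses std::map as a stand-in for a sorted tree (it is not the DB API), and scan_from is a hypothetical helper. A hash table has no comparable operation, because keys that sort next to each other are scattered across buckets.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Collect up to `limit` values whose keys are >= `start`, in key order.
// With sorted storage this is one positioned lookup plus a short walk
// over adjacent entries.
std::vector<std::string> scan_from(
        const std::map<std::string, std::string>& tree,
        const std::string& start,
        std::size_t limit) {
    std::vector<std::string> values;
    for (auto it = tree.lower_bound(start);
         it != tree.end() && values.size() < limit;
         ++it) {
        values.push_back(it->second);
    }
    return values;
}
```

If an application's queries look like this, with one key followed by its neighbors, that is the access pattern the BTree bullet above describes.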
Choosing between Queue and Recno
Queue or Recno are used when the application wants to use logical record numbers for the primary database key. Logical record numbers are essentially integers that uniquely identify the database record. They can be either mutable or fixed, where a mutable record number is one that might change as database records are stored or deleted. Fixed logical record numbers never change regardless of what database operations are performed.
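The difference between mutable and fixed record numbers can be illustrated with two small containers. This is an analogy built on the standard library, not DB code; both struct names are hypothetical, and numbering from 1 is an assumption of the sketch.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Mutable numbering: the record number is the 1-based position, so
// deleting a record shifts every later record down by one.
struct MutableNumbered {
    std::vector<std::string> records;

    // Caller must pass a record number between 1 and records.size().
    void erase(std::uint32_t recno) {
        records.erase(records.begin() + (recno - 1));
    }
    const std::string& at(std::uint32_t recno) const {
        return records.at(recno - 1);
    }
};

// Fixed numbering: each record keeps the number it was assigned, and a
// deletion leaves a gap instead of renumbering the records after it.
struct FixedNumbered {
    std::map<std::uint32_t, std::string> records;
    std::uint32_t next = 1;

    std::uint32_t append(const std::string& data) {
        records[next] = data;
        return next++;
    }
    void erase(std::uint32_t recno) { records.erase(recno); }
    bool contains(std::uint32_t recno) const {
        return records.count(recno) != 0;
    }
};
```

After deleting record 2 from three records "a", "b", "c", the mutable version reports "c" as record 2, while the fixed version still reports "c" as record 3 and has no record 2.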
When deciding between Queue and Recno, choose:
* Queue if your application requires high degrees of concurrency. Queue provides record-level locking (as opposed to the page-level locking that the other access methods use), and this can result in significantly faster throughput for highly concurrent applications.
Note, however, that Queue provides support only for fixed length records. So if the size of the data that you want to store varies widely from record to record, you should probably choose an access method other than Queue.
* Recno if you want mutable record numbers. Queue is only capable of providing fixed record numbers. Also, Recno provides support for databases whose permanent storage is a flat text file. This is useful for applications looking for fast, temporary storage while the data is being read or modified.
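The selection guidance in this section can be summarized as a small decision function. This is only a sketch: the Requirements struct and suggest_access_method are hypothetical names, the enum here is not the library's own type, and the tie-breaks (BTree whenever the working set fits in memory, Recno when Queue is not clearly indicated) are simplifications chosen for this example. Real decisions should rest on measured performance, as noted earlier.

```cpp
// Access methods named in the text (a local enum, not DB's own type).
enum class AccessMethod { BTree, Hash, Queue, Recno };

// Hypothetical description of an application's needs.
struct Requirements {
    bool keys_are_record_numbers;       // false: arbitrary keys such as strings
    bool working_set_fits_in_memory;
    bool keys_have_locality;            // queries tend to hit neighboring keys
    bool needs_high_concurrency;
    bool records_are_fixed_length;
    bool needs_mutable_record_numbers;
};

AccessMethod suggest_access_method(const Requirements& r) {
    if (!r.keys_are_record_numbers) {
        // Arbitrary keys: BTree or Hash.
        if (r.working_set_fits_in_memory || r.keys_have_locality) {
            return AccessMethod::BTree;
        }
        // Large working set with no locality to exploit.
        return AccessMethod::Hash;
    }
    // Record-number keys: Queue or Recno.
    // Queue needs fixed-length records and cannot renumber them.
    const bool queue_possible =
        r.records_are_fixed_length && !r.needs_mutable_record_numbers;
    if (queue_possible && r.needs_high_concurrency) {
        return AccessMethod::Queue;
    }
    return AccessMethod::Recno;
}
```

For instance, an application with string keys, a working set larger than memory, and no key locality would get Hash from this function, while one with record-number keys, fixed-length records, and many concurrent writers would get Queue.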