Image Classification and Annotation、圖像標注(image annotation)、圖像檢索(image retrieval)

Caltech101是屬于image categorization數據庫，見Linear Spatial Pyramid Matching using Sparse Coding for Image Classification(CVPR09-ScSPM)摘要寫了： In a number of image categorization experiments, 其標題是Image Classification。Mingming Gong講image/object Classification/categorization 四個是一樣的概念.

我自己到谷歌上搜索：object categorization COIL,發現A novel color-context descriptor and its applications (ICME 2009)摘要最后一句： Experiments validate the discriminant power of the proposed descriptor in object categorization on COIL- 100 database and pedestrian identification in surveillance videos.故Deng Cai主頁的COIL是屬于object categorization的

63486.htm
Learning a Maximum Margin Subspace for Image Retrieval
Dong Xu's Phd thesis Section 7
Cooperative Sparse Representation Semi-supervised Image Annotation

---------------------------------------------------------------------------------------------------------------------------------------------------

圖像分類是測試樣本的預測label和實際的label比較得準確率。圖像標注本質也是分類，是測試樣本預測出來的tag和實際的tag做比較得準確率，看voc 07的訓練tag，是5011*804的矩陣，也就是總共804個tag，如果該樣本有這個tag，則該位置的tag是1否則是0.一個圖像可以有多個tag，image annotation本質也是multi-label的問題。是不是所有樣本的tag確實只有804個，其實有更多，將一些頻率比較低的去掉了。Yong Luo講在測試時有這樣的情況，測試樣本沒有tag，這時沒法算準確率，就將這個測試樣本去掉。

---------------------------------------------------------------------------------------------------------------------------------------------------------
image annotation就是image classification. 見Cooperative Sparse Representation Semi-supervised Image Annotation. VI節A節第三段：with the annotations from a set of total 20 keywords.This is discussing with Weifeng Liu and Yong Luo. Sparse Unsupervised Dimensionality Reduction for Multiple View Data該文IV節E節 2)就用的Image Classification and Annotation.該文用三個數據集，MIML僅有類別沒有tag，后兩個僅有tag沒有類別，論文說了分別81和100個tag，故MIML是測試樣本的預測label和實際的label比較得準確率，后兩個庫是是測試樣本的預測tag和實際的tag比較得準確率。Yong Luo講沒有人在voc上做image annotation，因為其tag沒有太多的語義信息，比如pascal07_dictionary的804個tag中還有2003，voc的tag只能作為特征。

Yong Luo：另外還有三個做annotation的數據庫Corel 5K，IAPR TC-12，ESP GAME. Annotation是給圖像加標注，阿秋TIP fig 3. Tian Xia MSE實驗第三節是Video annotation, Tian Xia寫了，和image retrival做法一樣。這三個數據庫Yong Luo 建議不要按照ML-KNN的五個指標來進行比較。Image annotation就按照annotation的指標，TagProp Discriminative Metric Learning (ICCV 2009)Table 3的P、R和N+。這三個指標的定義該文講了，該文參考文獻17 A. Makadia, V. Pavlovic, and S. Kumar. A new baseline for image annotation. In ECCV, 2008也講了。

整個領域基本都是分類或者回歸問題，少部分是回歸，比如是年齡回歸(age regression)。mingming gong said that介于兩者之間的還有一個是ordinal regression = ranking(值是離散的，還有大小)

發表于 2012-08-05 11:36 杰哥閱讀(1444) 評論(0) 編輯收藏引用所屬分類: 學術

常用鏈接

留言簿(57)

隨筆分類

隨筆檔案

相冊

Other

Paper submission

福彩

留學相關

論壇

搜索

學者

郵箱

中科大和中科院

搜索

最新評論

閱讀排行榜

評論排行榜

Image Classification and Annotation、圖像標注(image annotation)、圖像檢索(image retrieval)