锘??xml version="1.0" encoding="utf-8" standalone="yes"?>要久久爱在线免费观看,国产亚洲色婷婷久久99精品,麻豆一区二区99久久久久http://www.shnenglu.com/beautykingdom/category/13101.htmlzh-cnThu, 18 Feb 2010 13:55:42 GMTThu, 18 Feb 2010 13:55:42 GMT60濡備綍鍐欎竴涓綉緇滆湗铔?/title><link>http://www.shnenglu.com/beautykingdom/archive/2010/02/18/108046.html</link><dc:creator>chatler</dc:creator><author>chatler</author><pubDate>Thu, 18 Feb 2010 13:54:00 GMT</pubDate><guid>http://www.shnenglu.com/beautykingdom/archive/2010/02/18/108046.html</guid><wfw:comment>http://www.shnenglu.com/beautykingdom/comments/108046.html</wfw:comment><comments>http://www.shnenglu.com/beautykingdom/archive/2010/02/18/108046.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.shnenglu.com/beautykingdom/comments/commentRss/108046.html</wfw:commentRss><trackback:ping>http://www.shnenglu.com/beautykingdom/services/trackbacks/108046.html</trackback:ping><description><![CDATA[<p><a onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Web_spider?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>榪欓噷</font></u></a>鏄淮鍩虹櫨縐戝緗戠粶鐖櫕鐨勮瘝鏉¢〉闈€傜綉緇滅埇铏互鍙綉緇滆湗铔涳紝緗戠粶鏈哄櫒浜猴紝榪欐槸涓涓▼搴忥紝鍏朵細鑷姩鐨勯氳繃緗戠粶鎶撳彇浜掕仈緗戜笂鐨勭綉欏碉紝榪欑鎶鏈竴鑸彲鑳界敤鏉ユ鏌ヤ綘鐨勭珯鐐逛笂鎵鏈夌殑閾炬帴鏄惁鏄兘鏄湁鏁堢殑銆傚綋鐒訛紝鏇翠負楂樼駭鐨勬妧鏈槸鎶婄綉欏典腑鐨勭浉鍏蟲暟鎹繚瀛樹笅鏉ワ紝鍙互鎴愪負鎼滅儲寮曟搸銆?/p> <p>浠庢妧鐩告潵璇達紝瀹炵幇鎶撳彇緗戦〉鍙兘騫朵笉鏄竴浠跺緢鍥伴毦鐨勪簨鎯咃紝鍥伴毦鐨勪簨鎯呮槸瀵圭綉欏電殑鍒嗘瀽鍜屾暣鐞嗭紝閭f槸涓浠墮渶瑕佹湁杞婚噺鏅鴻兘錛岄渶瑕佸ぇ閲忔暟瀛﹁綆楃殑紼嬪簭鎵嶈兘鍋氱殑浜嬫儏銆備笅闈竴涓畝鍗曠殑嫻佺▼錛?/p> <p><span id=more-27></span></p> <p> </p> <p>鍦ㄨ繖閲岋紝鎴戜滑鍙槸璇翠竴涓嬪浣曞啓涓涓綉欏墊姄鍙栫▼搴忋?/p> <p>棣栧厛鎴戜滑鍏堢湅涓涓嬶紝濡備綍浣跨敤鍛戒護琛岀殑鏂瑰紡鏉ユ壘寮緗戦〉銆?/p> <p style="TEXT-ALIGN: left; PADDING-LEFT: 30px">telnet somesite.com 80<br>GET /index.html HTTP/1.0<br>鎸夊洖杞︿袱嬈?/p> <p style="TEXT-ALIGN: left">浣跨敤telnet灝辨槸鍛婅瘔浣犲叾瀹炶繖鏄竴涓猻ocket鐨勬妧鏈紝騫朵笖浣跨敤HTTP鐨勫崗璁紝濡?GET鏂規硶鏉ヨ幏寰楃綉欏碉紝褰撶劧錛屾帴涓嬫潵鐨勪簨浣犲氨闇瑕佽В鏋怘TML鏂囨硶錛岀敋鑷寵繕闇瑕佽В鏋怞avascript錛屽洜涓虹幇鍦ㄧ殑緗戦〉浣跨敤Ajax鐨勮秺鏉ヨ秺澶氫簡錛岃屽緢澶氱綉欏靛唴瀹歸兘鏄氳繃Ajax鎶鏈姞杞界殑錛屽洜涓猴紝鍙槸綆鍗曞湴瑙f瀽HTML鏂囦歡鍦ㄦ湭鏉ヤ細榪滆繙涓嶅銆傚綋鐒訛紝鍦ㄨ繖閲岋紝鍙槸灞曠ず涓涓潪甯哥畝鍗曠殑鎶撳彇錛岀畝鍗曞埌鍙兘鍋氫負涓涓緥瀛愶紝涓嬮潰榪欎釜紺轟緥鐨勪吉浠g爜錛?/p> <pre>鍙栫綉欏? for each 閾炬帴 in 褰撳墠緗戦〉鎵鏈夌殑閾炬帴 { if(濡傛灉鏈摼鎺ユ槸鎴戜滑鎯寵鐨?|| 榪欎釜閾炬帴浠庢湭璁塊棶榪? { 澶勭悊瀵規湰閾炬帴 鎶婃湰閾炬帴璁劇疆涓哄凡璁塊棶 } }</pre> <pre class=ruby>require “rubygems” require “mechanize” class Crawler < WWW::Mechanize attr_accessor :callback INDEX = 0 DOWNLOAD = 1 PASS = 2 def initialize super init @first = true self.user_agent_alias = “Windows IE 6″ end def init @visited = [] end def remember(link) @visited << link end def perform_index(link) self.get(link) if(self.page.class.to_s == “WWW::Mechanize::Page”) links = self.page.links.map {|link| link.href } - @visited links.each do |alink| start(alink) end end end def start(link) return if link.nil? if(!@visited.include?(link)) action = @callback.call(link) if(@first) @first = false perform_index(link) end case action when INDEX perform_index(link) when DOWNLOAD self.get(link).save_as(File.basename(link)) when PASS puts “passing on #{link}” end end end def get(site) begin puts “getting #{site}” @visited << site super(site) rescue puts “error getting #{site}” end end end</pre> <p>涓婇潰鐨勪唬鐮佸氨涓嶅繀澶氳浜嗭紝澶у鍙互鍘昏瘯璇曘備笅闈㈡槸濡備綍浣跨敤涓婇潰鐨勪唬鐮侊細</p> <pre class=ruby>require “crawler” x = Crawler.new callback = lambda do |link| if(link =~/\\.(zip|rar|gz|pdf|doc) x.remember(link) return Crawler::PASS elsif(link =~/\\.(jpg|jpeg)/) return Crawler::DOWNLOAD end return Crawler::INDEX; end x.callback = callback x.start(”http://somesite.com”)</pre> <p>涓嬮潰鏄竴浜涘拰緗戠粶鐖櫕鐩稿叧鐨勫紑婧愮綉緇滈」鐩?/p> <ul> <li><a class="external text" title=http://arachnode.net onclick="pageTracker._trackPageview('/outgoing/arachnode.net/?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" rel=nofollow target=_blank><strong><u><font color=#0000ff>arachnode.net</font></u></strong></a> is a .NET crawler written in C# using SQL 2005 and <a title=Lucene onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Lucene?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>Lucene</font></u></a> and is released under the <a title="GNU General Public License" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/GNU_General_Public_License?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>GNU General Public License</font></u></a>. <li><strong><a title=DataparkSearch onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/DataparkSearch?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>DataparkSearch</font></u></a></strong> is a crawler and search engine released under the <a title="GNU General Public License" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/GNU_General_Public_License?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>GNU General Public License</font></u></a>. <li><strong><a title=Wget onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Wget?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>GNU Wget</font></u></a></strong> is a <a class=mw-redirect title="Command line interface" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Command_line_interface?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>command-line</font></u></a>-operated crawler written in <a title="C (programming language)" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/C_28programming_language_29?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>C</font></u></a> and released under the <a title="GNU General Public License" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/GNU_General_Public_License?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>GPL</font></u></a>. It is typically used to mirror Web and FTP sites. <li><strong><a title="Grub (search engine)" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Grub_28search_engine_29?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>GRUB</font></u></a></strong> is an open source distributed search crawler that Wikia Search ( <a class="external free" title=http://wikiasearch.com onclick="pageTracker._trackPageview('/outgoing/wikiasearch.com/?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" rel=nofollow target=_blank><u><font color=#0000ff>http://wikiasearch.com</font></u></a> ) uses to crawl the web. <li><strong><a title=Heritrix onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Heritrix?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>Heritrix</font></u></a></strong> is the <a title="Internet Archive" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Internet_Archive?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>Internet Archive</font></u></a>’s archival-quality crawler, designed for archiving periodic snapshots of a large portion of the Web. It was written in <a title="Java (programming language)" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Java_28programming_language_29?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>Java</font></u></a>. <li><strong><a class=mw-redirect title=Ht-//dig onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Ht-//dig?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>ht://Dig</font></u></a></strong> includes a Web crawler in its indexing engine. <li><strong><a title=HTTrack onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/HTTrack?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>HTTrack</font></u></a></strong> uses a Web crawler to create a mirror of a web site for off-line viewing. It is written in <a title="C (programming language)" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/C_28programming_language_29?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>C</font></u></a> and released under the <a title="GNU General Public License" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/GNU_General_Public_License?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>GPL</font></u></a>. <li><strong><a title="ICDL crawling" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/ICDL_crawling?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>ICDL Crawler</font></u></a></strong> is a <a title=Cross-platform onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Cross-platform?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>cross-platform</font></u></a> web crawler written in <a title=C++ onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/C_2B_2B?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><u><font color=#0000ff>C++</font></u></a> and intended to crawl Web sites based on <a title="Website Parse Template" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Website_Parse_Template?referer=http%3A%2F%2Fcoolshell.cn%2F%3Fp%3D1695');" target=_blank><br></a></li> </ul> <p>from:<br><a >http://coolshell.cn/?p=27</a></p> <img src ="http://www.shnenglu.com/beautykingdom/aggbug/108046.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.shnenglu.com/beautykingdom/" target="_blank">chatler</a> 2010-02-18 21:54 <a href="http://www.shnenglu.com/beautykingdom/archive/2010/02/18/108046.html#Feedback" target="_blank" style="text-decoration:none;">鍙戣〃璇勮</a></div>]]></description></item></channel></rss> <footer> <div class="friendship-link"> <p>感谢您访问我们的网站,您可能还对以下资源感兴趣:</p> <a href="http://www.shnenglu.com/" title="精品视频久久久久">精品视频久久久久</a> <div class="friend-links"> </div> </div> </footer> <a href="http://www.schzjy.cn" target="_blank">久久久久99精品成人片三人毛片</a>| <a href="http://www.f259.cn" target="_blank">久久久久久亚洲精品不卡</a>| <a href="http://www.elecline.com.cn" target="_blank">久久精品免费网站网</a>| <a href="http://www.122797929.cn" target="_blank">亚洲伊人久久成综合人影院</a>| <a href="http://www.ai385.cn" target="_blank">久久久女人与动物群交毛片</a>| <a href="http://www.fj023.cn" target="_blank">国产成人久久精品一区二区三区</a>| <a href="http://www.hotdee.com.cn" target="_blank">欧美日韩中文字幕久久伊人</a>| <a href="http://www.021cp.cn" target="_blank">少妇被又大又粗又爽毛片久久黑人 </a>| <a href="http://www.peopleim.cn" target="_blank">99久久免费只有精品国产</a>| <a href="http://www.banburi.cn" target="_blank">亚洲国产精品狼友中文久久久</a>| <a href="http://www.jsxtcmss.cn" target="_blank">国产成人久久精品一区二区三区 </a>| <a href="http://www.gven.cn" target="_blank">久久精品中文无码资源站</a>| <a href="http://www.840ww.cn" target="_blank">亚洲国产精品一区二区久久</a>| <a href="http://www.0553fc.cn" target="_blank">性做久久久久久免费观看</a>| <a href="http://www.huameizc.cn" target="_blank">久久国产亚洲高清观看</a>| <a href="http://www.gongcheng100.cn" target="_blank">久久久久亚洲爆乳少妇无</a>| <a href="http://www.qcbijj.cn" target="_blank">国产精品欧美久久久久天天影视 </a>| <a href="http://www.hyzjlib.cn" target="_blank">久久亚洲AV成人无码电影</a>| <a href="http://www.yhkim.cn" target="_blank">亚洲国产成人精品久久久国产成人一区二区三区综 </a>| <a href="http://www.lvyoubuy.cn" target="_blank">久久99免费视频</a>| <a href="http://www.kybdt.cn" target="_blank">丰满少妇高潮惨叫久久久</a>| <a href="http://www.vod1314.cn" target="_blank">久久久久国产视频电影</a>| <a href="http://www.by8d5c.cn" target="_blank">国产亚洲美女精品久久久久狼</a>| <a href="http://www.kuhaoma.cn" target="_blank">久久综合亚洲色一区二区三区</a>| <a href="http://www.kongqueyuhn.cn" target="_blank">久久久久九九精品影院</a>| <a href="http://www.dykh-tech.cn" target="_blank">国产∨亚洲V天堂无码久久久</a>| <a href="http://www.gotovision.com.cn" target="_blank">亚洲国产成人精品91久久久</a>| <a href="http://www.ningxue520.cn" target="_blank">国内精品久久久久久久亚洲 </a>| <a href="http://www.67yule.cn" target="_blank">久久无码中文字幕东京热</a>| <a href="http://www.68002.com.cn" target="_blank">品成人欧美大片久久国产欧美... 品成人欧美大片久久国产欧美 </a>| <a href="http://www.ddmir.cn" target="_blank">狠狠色婷婷久久一区二区 </a>| <a href="http://www.eastmark.cn" target="_blank">99久久精品日本一区二区免费</a>| <a href="http://www.uucity.com.cn" target="_blank">中文字幕久久亚洲一区</a>| <a href="http://www.rojie.cn" target="_blank">久久久久亚洲爆乳少妇无</a>| <a href="http://www.it0557.cn" target="_blank">狠狠精品干练久久久无码中文字幕</a>| <a href="http://www.efd-inc.com.cn" target="_blank">国产精品久久国产精品99盘 </a>| <a href="http://www.hgrnoko.cn" target="_blank">久久无码精品一区二区三区</a>| <a href="http://www.spiralstar.com.cn" target="_blank">久久精品国产99国产精品澳门</a>| <a href="http://www.b2721.cn" target="_blank">久久狠狠高潮亚洲精品</a>| <a href="http://www.egpk.cn" target="_blank">国产精品久久久久9999</a>| <a href="http://www.591happy.cn" target="_blank">国产精品久久精品</a>| <script> (function(){ var bp = document.createElement('script'); var curProtocol = window.location.protocol.split(':')[0]; if (curProtocol === 'https') { bp.src = 'https://zz.bdstatic.com/linksubmit/push.js'; } else { bp.src = 'http://push.zhanzhang.baidu.com/push.js'; } var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(bp, s); })(); </script> </body>