國外php程序_php搜索引擎蜘蛛程序

⑴ php搜索引擎蜘蛛程序

推薦一個國外知名度頗高的搜索引擎，含有網頁蜘蛛程序，以前好象有人想要這方面的資料，現在有了，大家可以研究下源碼。

官方網站：
http://phpdig.toiletoine.net/

演示：
http://phpdig.toiletoine.net/sea ... te=100&option=start

中文版本和演示，我以前提供過(1.62版本的漢化)，2003年11月換空間的時候沒備份，沒了。找下載了的人看看有沒有。

下載：
這是最近(2003年12月)更新的版本的下載(1.65 En)：
http://www.phpdig.net/navigation.php?action=download

演示：
http://www.phpdig.net/navigation.php?action=demo

主要功能：
類似google、網路的搜索引擎，php+mysql。

PhpDig is a http spider/search engine written in Php with a MySql database in backend.

HTTP Spidering : PhpDig follows links as it was any web browser within a web server, to build the pages list to index. Links can be in AreaMap, or frames. PhpDig supports relocations. Any syntax of HREF attribute is followed by Phpdig.
PhpDig don't go out the root site you define for the indexing. Spidering depth is choosen by user.
All html content is listed, both static and dynamic pages. PhpDig searches the Mime-Type of the document, or tests existence of an tag at the beginning of it.

支持全文搜索
Full Text indexing : PhpDig indexes all words of a document, excepting small words (less than 3 letters) an common words, those are definded in a text file.
Lone numbers are not inded, but those included in words. Underscores make part of a word.
Occurences of a word in a document is saved. Words in the title can have a more important weight in ranking results.

支持多種格式文件的索引，如pdf
File types wich can be indexed : PhpDig indexes HTML and text files by itself.
PhpDig could index PDF, MS-Word and MS-Excel files if you install external binaries on the spidering machines to this purpose.
To demonstrate the feature, you can search into Hamlet (tragedy, William Shakespeare) in MS-Word format, and L'Avare (comedy, Molière) in Pdf format.

支持robots
Other features : PhpDig Tries to read a robots.txt file at the server root. It searches meta robots tags too.
The Last-Modified header value is stored in the database to avoid rendant indexing. Also the meta revisit-after tag.

可針對特定網站進行全文索引，蜘蛛可1-9個層自動獲取全部url

其中的蜘蛛程序寫得十分好，有興趣的朋友推薦研究下。

希望對你有用！

熱點內容

pr編譯出錯渲染存在偏移發布：2025-04-29 20:01:56 瀏覽：260

如何製作自家的app 發布：2025-04-29 20:01:49 瀏覽：197

推薦一個解壓軟體rar解壓幫手發布：2025-04-29 20:01:48 瀏覽：207

wd文檔加密器發布：2025-04-29 20:01:10 瀏覽：745

伺服器上傳壓縮包一般是什麼格式發布：2025-04-29 20:00:59 瀏覽：331

發送加密文件密碼幾位數發布：2025-04-29 20:00:58 瀏覽：158

樹洞app怎麼樣發布：2025-04-29 20:00:57 瀏覽：173

vivo編譯時間可以改么發布：2025-04-29 19:54:01 瀏覽：147

編譯和編輯怎麼區分發布：2025-04-29 19:52:15 瀏覽：979

iar編譯文件順序發布：2025-04-29 19:40:35 瀏覽：898

java二叉搜索樹發布：2025-04-29 18:59:46 瀏覽：633

王者怎麼看好友的伺服器發布：2025-04-29 18:59:45 瀏覽：733

無線編碼單片機發布：2025-04-29 18:50:26 瀏覽：464

天聯高級版域名伺服器地址發布：2025-04-29 18:44:37 瀏覽：206

鴻蒙用什麼編譯發布：2025-04-29 18:43:59 瀏覽：730

伺服器如何迅速擴容發布：2025-04-29 18:33:05 瀏覽：792

伺服器無固定ip地址不發布：2025-04-29 18:32:05 瀏覽：643

安卓手機如何折扣充值發布：2025-04-29 18:31:20 瀏覽：996

編譯器詞法分析演算法發布：2025-04-29 18:19:28 瀏覽：325

加密狗行業版怎麼樣發布：2025-04-29 18:12:52 瀏覽：331

導航:首頁 > 編程語言 > 國外php程序

國外php程序

與國外php程序相關的資料