一種可推測網路行為者路徑之模型-以教育部技職傳播網為例
No Thumbnail Available
Date
2007
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
預測網站使用者對網頁知識的蒐尋路徑行為,是網站管理與設計的重要方向。有鑑於此,本研究首先從教育部技職傳播網入口網站後端資料倉儲(Data Warehouse)中,擷取出使用者路徑之歷史資料作為本研究模式之系統訓練資料集,再以派翠西網路(Petri Nets)建立其動態行動描述模型並分析使用者之可能行為路徑,最後以資料探勘(Data Mining)貝氏機率法則結合關連分析法則(Association Rule Analysis)之模型,推算在訓練資料集中網路使用者動作行為路徑及行為屬性之機率值,並建構行為屬性之文件機率表,作為預測下一位使用者是否找尋到文件之依據。藉由文件之發現率及使用者行為路徑之關聯性,可成功的對技職教育網頁系統或架構提出相關修正案及問題解決機制。研究中發現以下結果:
一. 若資料維度過少而使用貝氏分類法來做預測分析,將使誤差變大,因為各屬性間為互相獨立的,會因各屬性連乘積的影響,不管其他條件機率為何,其結果值都為 0。為了克服此一問題,本研究提出採用M-estimate 修正機制,經驗證確實可改善此問題。
二. 本研究隨機選取技職傳播網 150 使用者行為資料筆資料,應用貝氏機
率分類法把各屬性機率求出,並建置 Data Mining 引擎,再隨機選取100 筆使用者行為資料做為測試資料,經驗證發現,使用貝氏機率分類法,找到文件預測成功率都達 91.2%以上。
三. 結合貝氏網路及關聯規則模型,可比單一貝氏網路所建構出之預測模
型更精確,並能圖型化的看出網站路徑之效益。
To predict user’s behavior route in website is very important for Website's administrator. The first, the research extract out historical data from data warehouse in the website of the technological and vocational education. Then we use behavior route data as training data. And use Petri nets to build motion model and analyze possible of behavior route. The last we combine naive bayesian classification and association rule analysis of model in data mining to inference behavior route attribute. Then we can build probability table and we can use this table to predict user’s behavior route. We can follow this model propose some comment to the website's administrator. The research finds some result as the following shows. 1. If there are too few attribute then it can cause value equal zero. And the research proposes M-estimate to improve this problem. 2. We use 150 behavior routes as the training data. Then we verify and find our success rate up to 91.2%. 3. Combination bayesian classification and association rule analysis of model can more accurate than bayesian classification.
To predict user’s behavior route in website is very important for Website's administrator. The first, the research extract out historical data from data warehouse in the website of the technological and vocational education. Then we use behavior route data as training data. And use Petri nets to build motion model and analyze possible of behavior route. The last we combine naive bayesian classification and association rule analysis of model in data mining to inference behavior route attribute. Then we can build probability table and we can use this table to predict user’s behavior route. We can follow this model propose some comment to the website's administrator. The research finds some result as the following shows. 1. If there are too few attribute then it can cause value equal zero. And the research proposes M-estimate to improve this problem. 2. We use 150 behavior routes as the training data. Then we verify and find our success rate up to 91.2%. 3. Combination bayesian classification and association rule analysis of model can more accurate than bayesian classification.
Description
Keywords
資料倉儲, 關聯分析法則, 派翠西網路, 資料探勘