
A COLLABORATIVE FILTERING APPROACH TO PREDICT WEB PAGES OF INTEREST FROM NAVIGATION PATTERNS OF PAST USERS WITHIN AN ACADEMIC WEBSITE

Nkweteyim, Denis Lemongew (2005) A COLLABORATIVE FILTERING APPROACH TO PREDICT WEB PAGES OF INTEREST FROM NAVIGATION PATTERNS OF PAST USERS WITHIN AN ACADEMIC WEBSITE. Doctoral Dissertation, University of Pittsburgh. (Unpublished)


Abstract

This dissertation is a simulation study of factors and techniques involved in designing hyperlink recommender systems that recommend to users web pages that past users with similar navigation behaviors found interesting. The methodology involves identification of pertinent factors or techniques, and for each one, addresses the following questions: (a) room for improvement; (b) better approach, if any; and (c) performance characteristics of the technique in environments that hyperlink recommender systems operate in. The following four problems are addressed:

Web Page Classification. A new metric (PageRank × Inverse Links-to-Word count ratio) is proposed for classifying web pages as content or navigation, to help in the discovery of user navigation behaviors from web user access logs. Results of a small user study suggest that this metric leads to desirable results.

Data Mining. A new apriori algorithm for mining association rules from large databases is proposed. The new algorithm addresses the problem of scaling of the classical apriori algorithm by eliminating an expensive join step, and applying the apriori property to every row of the database. In this study, association rules show the correlation relationships between user navigation behaviors and web pages they find interesting. The new algorithm has better space complexity than the classical one, and better time efficiency under some conditions and comparable time efficiency under other conditions.

Prediction Models for User Interests. We demonstrate that association rules that show the correlation relationships between user navigation patterns and web pages they find interesting can be transformed into collaborative filtering data. We investigate collaborative filtering prediction models based on two approaches for computing prediction scores: using simple averages and weighted averages. Our findings suggest that the weighted averages scheme more accurately computes predictions of user interests than the simple averages scheme does.

Clustering. Clustering techniques are frequently applied in the design of personalization systems. We studied the performance of the CLARANS clustering algorithm in high dimensional space in relation to the PAM and CLARA clustering algorithms. While CLARA had the best time performance, CLARANS resulted in clusters with the lowest intra-cluster dissimilarities, and so was most effective in this regard.
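
As an illustration of the page classification metric named in the abstract, the following is a minimal sketch, not the dissertation's implementation. It reads "PageRank × Inverse Links-to-Word count ratio" as PageRank multiplied by (word count / link count), so that text-heavy pages with few links score high (content-like) and link-heavy pages with little text score low (navigation-like). The damping factor of 0.85, the use of outgoing links as the link count, the handling of link-free pages, and the four-page site with its word counts are all assumptions made for the example; the abstract does not specify them, nor does it give a decision threshold, so the sketch only ranks pages.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}.

    Assumes every link target also appears as a key of `links`.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, targets in links.items():
            if targets:
                # Split this page's rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


def content_score(rank, word_count, link_count):
    """PageRank x inverse links-to-word-count ratio = rank * words / links."""
    # Assumption: a page with no links is scored as if it had one link,
    # which avoids division by zero.
    return rank * word_count / max(link_count, 1)


# Hypothetical four-page site: outgoing links and visible word counts.
links = {
    "home": ["courses", "faculty"],
    "courses": ["syllabus", "home"],
    "faculty": ["home"],
    "syllabus": ["courses"],
}
word_counts = {"home": 40, "courses": 60, "faculty": 300, "syllabus": 1200}

ranks = pagerank(links)
scores = {
    page: content_score(ranks[page], word_counts[page], len(links[page]))
    for page in links
}
# Pages ordered from most content-like to most navigation-like.
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked)
```

On this toy data the long, sparsely linked "syllabus" page ranks as the most content-like and the short, link-oriented "home" page as the most navigation-like. Turning the score into a content/navigation label would require a cutoff, which is left open here.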


Details

Item Type: University of Pittsburgh ETD
Status: Unpublished
Creators/Authors:
Creator: Nkweteyim, Denis Lemongew (Email: nkweteyim@gmail.com)
ETD Committee:
Committee Chair: Hirtle, Stephen C (Email: shirtle@mail.sis.pitt.edu; Pitt Username: HIRTLE)
Committee Member: May, Jerrold H (Email: jerrymay@katz.pitt.edu; Pitt Username: JERRYMAY)
Committee Member: Spring, Michael B (Email: spring@sis.pitt.edu; Pitt Username: SPRING)
Committee Member: Munro, Paul (Email: pmunro@mail.sis.pitt.edu; Pitt Username: PWM)
Committee Member: Brusilovsky, Peter (Email: peterb@mail.sis.pitt.edu; Pitt Username: PETERB)
Date: 30 September 2005
Date Type: Completion
Defense Date: 16 May 2005
Approval Date: 30 September 2005
Submission Date: 21 July 2005
Access Restriction: No restriction; Release the ETD for access worldwide immediately.
Institution: University of Pittsburgh
Schools and Programs: School of Information Sciences > Information Science
Degree: PhD - Doctor of Philosophy
Thesis Type: Doctoral Dissertation
Refereed: Yes
Uncontrolled Keywords: association rule mining; classification; clustering; collaborative filtering; data mining; prediction; recommender system
Other ID: http://etd.library.pitt.edu/ETD/available/etd-07212005-054925/, etd-07212005-054925
Date Deposited: 10 Nov 2011 19:52
Last Modified: 15 Nov 2016 13:46
URI: http://d-scholarship.pitt.edu/id/eprint/8479
