Link to the University of Pittsburgh Homepage
Link to the University Library System Homepage Link to the Contact Us Form

A CRITICAL EXPLORATION OF THE POTENTIAL UTILITY OF RULE INDUCTION DATA MINING METHODS TO “ORTHODOX” EDUCATION RESEARCH

Iwatani, Emi (2018) A CRITICAL EXPLORATION OF THE POTENTIAL UTILITY OF RULE INDUCTION DATA MINING METHODS TO “ORTHODOX” EDUCATION RESEARCH. Doctoral Dissertation, University of Pittsburgh. (Unpublished)

[img]
Preview
PDF
Accepted Version

Download (3MB) | Preview

Abstract

Despite some theoretical promise, it is unclear whether rule induction data mining approaches (e.g., classification trees and association rules) add methodological value to "orthodox" education research, i.e., research unrelated to computer-based education. To better understand whether and how rule induction methods could be useful to education researchers, I explored whether they, relative to regression approaches, (1) improve classification accuracy, and/or (2) offer new avenues of explanation. Additionally, I aimed to illustrate a practical and principled way to use the various rule induction approaches so researchers can more easily choose to use it. To these ends, I conducted an extended literature review on rule induction methods, and re-analyzed two regression studies (Byrnes & Miller, 2007; Thomas, 2006) on the National Educational Longitudinal Study of 1988 using ten rule induction approaches. Data mining happened in two rounds for each study: first, by using only the predictors used in the original study, and second by using all reasonable and available predictors. I compared results across methods and rounds to better understand whether, how, and why the rule induction may provide additional insights.
I found that while rule induction approaches can be labor intensive and not necessarily more predictive than regression, they can provide unique descriptions of the sample that shows at-a-glance, how key predictors relate to each other and to the outcome. They can also help identify relationships between variables that held for some subgroups but not others. For example: (i) rulesets induced from Byrnes and Miller's dataset suggested that Algebra 2 and math self-concept were positively related to 12th grade math scores, but only for those who were higher achieving in 8th grade math; (ii) association rules mined from Thomas' dataset suggested that factors such as school safety and honors program participation were more strongly associated with 12th grade achievement for lower income and students with lower parental education. Thus, when relationships between the predictors and outcome may not be uniform across the population, rule induction can provide more information than regression in exploring those relationships. Lessons learned and recommendations on how to apply rule induction approaches are also discussed.


Share

Citation/Export:
Social Networking:
Share |

Details

Item Type: University of Pittsburgh ETD
Status: Unpublished
Creators/Authors:
CreatorsEmailPitt UsernameORCID
Iwatani, Emiemi8@pitt.eduemi80000-0001-5948-858X
ETD Committee:
TitleMemberEmail AddressPitt UsernameORCID
Committee ChairStone, Clementcas@pitt.educas
Committee MemberLane, Suzannesl@pitt.edusl
Committee MemberIriti, Jennifeririti@pitt.eduiriti
Committee MemberMay, Jerroldjerrymay@katz.pitt.edujerrymay
Date: 30 January 2018
Date Type: Publication
Defense Date: 30 November 2017
Approval Date: 30 January 2018
Submission Date: 30 January 2018
Access Restriction: No restriction; Release the ETD for access worldwide immediately.
Number of Pages: 393
Institution: University of Pittsburgh
Schools and Programs: School of Education > Psychology in Education
Degree: PhD - Doctor of Philosophy
Thesis Type: Doctoral Dissertation
Refereed: Yes
Uncontrolled Keywords: Data mining Quantitative research methodology Student achievement Decision tree Sequential covering Association rules
Date Deposited: 30 Jan 2018 17:32
Last Modified: 30 Jan 2018 17:32
URI: http://d-scholarship.pitt.edu/id/eprint/33748

Metrics

Monthly Views for the past 3 years

Plum Analytics


Actions (login required)

View Item View Item