Iwatani, Emi
(2018)
A CRITICAL EXPLORATION OF THE POTENTIAL UTILITY OF RULE INDUCTION DATA MINING METHODS TO “ORTHODOX” EDUCATION RESEARCH.
Doctoral Dissertation, University of Pittsburgh.
(Unpublished)
Abstract
Despite some theoretical promise, it is unclear whether rule induction data mining approaches (e.g., classification trees and association rules) add methodological value to "orthodox" education research, i.e., research unrelated to computer-based education. To better understand whether and how rule induction methods could be useful to education researchers, I explored whether they, relative to regression approaches, (1) improve classification accuracy, and/or (2) offer new avenues of explanation. Additionally, I aimed to illustrate a practical and principled way to use the various rule induction approaches so researchers can more easily choose to use it. To these ends, I conducted an extended literature review on rule induction methods, and re-analyzed two regression studies (Byrnes & Miller, 2007; Thomas, 2006) on the National Educational Longitudinal Study of 1988 using ten rule induction approaches. Data mining happened in two rounds for each study: first, by using only the predictors used in the original study, and second by using all reasonable and available predictors. I compared results across methods and rounds to better understand whether, how, and why the rule induction may provide additional insights.
I found that while rule induction approaches can be labor intensive and not necessarily more predictive than regression, they can provide unique descriptions of the sample that shows at-a-glance, how key predictors relate to each other and to the outcome. They can also help identify relationships between variables that held for some subgroups but not others. For example: (i) rulesets induced from Byrnes and Miller's dataset suggested that Algebra 2 and math self-concept were positively related to 12th grade math scores, but only for those who were higher achieving in 8th grade math; (ii) association rules mined from Thomas' dataset suggested that factors such as school safety and honors program participation were more strongly associated with 12th grade achievement for lower income and students with lower parental education. Thus, when relationships between the predictors and outcome may not be uniform across the population, rule induction can provide more information than regression in exploring those relationships. Lessons learned and recommendations on how to apply rule induction approaches are also discussed.
Share
Citation/Export: |
|
Social Networking: |
|
Details
Item Type: |
University of Pittsburgh ETD
|
Status: |
Unpublished |
Creators/Authors: |
|
ETD Committee: |
|
Date: |
30 January 2018 |
Date Type: |
Publication |
Defense Date: |
30 November 2017 |
Approval Date: |
30 January 2018 |
Submission Date: |
30 January 2018 |
Access Restriction: |
No restriction; Release the ETD for access worldwide immediately. |
Number of Pages: |
393 |
Institution: |
University of Pittsburgh |
Schools and Programs: |
School of Education > Psychology in Education |
Degree: |
PhD - Doctor of Philosophy |
Thesis Type: |
Doctoral Dissertation |
Refereed: |
Yes |
Uncontrolled Keywords: |
Data mining
Quantitative research methodology
Student achievement
Decision tree
Sequential covering
Association rules |
Date Deposited: |
30 Jan 2018 17:32 |
Last Modified: |
30 Jan 2018 17:32 |
URI: |
http://d-scholarship.pitt.edu/id/eprint/33748 |
Metrics
Monthly Views for the past 3 years
Plum Analytics
Actions (login required)
|
View Item |