Faculty Mentor Information
John Edwards, Ken Aho
Improving Computational Efficiency in Identifying Parsimonious Statistical Models

Abstract
Many authors have argued that identifying parsimonious statistical models (those that are neither overfit nor underfit) while considering curvature and/or interaction terms among predictors is inadvisable because of the huge number of potential models. For example, the complete second-order model set contains 2^(k(k+3)/2) models for consideration, where k is the number of predictors. To address this difficulty, we present a stepwise algorithm, developed for the R statistical environment, in which the number of considered models is quadratic in k. This contrasts with conventional stepwise model selection functions (e.g., stepAIC and step), which consider a model set cubic in k. Our new approach, termed Greedy, uses one of three measures of statistical parsimony to evaluate its model set: the Akaike information criterion (AIC), the Bayesian information criterion (BIC), or the predicted residual error sum of squares (PRESS) statistic. We found that, for large and/or high-dimensional datasets, the Greedy algorithm identified the same optimal (minimum AIC) model as conventional stepwise approaches, or one with essentially equal parsimony, while requiring dramatically shorter computational run times.
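To make the greedy idea concrete, the sketch below implements a naive forward-stepwise search by AIC over the full second-order term set in base R. It is an illustration only, not the Greedy implementation described above: the function name greedy_aic and its interface are hypothetical, and unlike Greedy, whose total model set is quadratic in k, this naive version refits one model per remaining candidate term on every pass. As a sense of scale for the exhaustive search space being avoided, with k = 10 predictors the full second-order model has k(k+3)/2 = 65 terms, so all-subsets selection would entail 2^65 (roughly 3.7 x 10^19) candidate models.

greedy_aic <- function(response, data) {
  predictors <- setdiff(names(data), response)
  ## Candidate terms of the complete second-order model: main effects,
  ## squared terms, and pairwise interactions (k(k+3)/2 terms in all).
  interactions <- if (length(predictors) > 1) {
    combn(predictors, 2, FUN = function(p) paste(p, collapse = ":"))
  } else character(0)
  terms <- c(predictors, paste0("I(", predictors, "^2)"), interactions)
  selected <- character(0)
  best_aic <- AIC(lm(reformulate("1", response), data = data))
  repeat {
    remaining <- setdiff(terms, selected)
    if (length(remaining) == 0) break
    ## Score every one-term addition to the current model.
    aics <- vapply(remaining, function(tm) {
      AIC(lm(reformulate(c(selected, tm), response), data = data))
    }, numeric(1))
    if (min(aics) >= best_aic) break  # no single addition lowers AIC: stop
    selected <- c(selected, remaining[which.min(aics)])
    best_aic <- min(aics)
  }
  lm(reformulate(if (length(selected)) selected else "1", response),
     data = data)
}

## Toy usage on simulated data with curvature and an interaction.
set.seed(1)
d <- data.frame(x1 = rnorm(100), x2 = rnorm(100), x3 = rnorm(100))
d$y <- 1 + 2 * d$x1 + 0.5 * d$x1^2 - d$x1 * d$x2 + rnorm(100)
formula(greedy_aic("y", d))

Note that this sketch does not enforce model hierarchy (an interaction may enter before its main effects) and scores candidates with AIC only; the BIC and PRESS variants mentioned above would swap in a different criterion at the same scoring step.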