Remote Sensing Time-Series Analysis, Machine Learning, and K-Means Clustering Improves Dryland Vegetation and Biological Soil Crust Classification
Date of Final Oral Examination (Defense)
Type of Culminating Activity
Master of Science in Geoscience
Nancy Glenn, Ph.D.
Jodi Brandt, Ph.D.
Jennifer Forbey, Ph.D.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Dryland and semi-arid vegetation communities, although appearing to the casual observer as relatively simplistic and homogeneous, are in fact the opposite. Upon further inspection, semi-arid vegetation is highly complex and heterogeneous at almost any scale. The same holds true for biological soil crust. Growing concern about global changes in climate, nutrient cycles, and land use have required increasing scrutiny of our understanding of these communities and all of their constituents, as we seek to improve forecasting models and inform land management decisions. This thesis aims to provide insight to the paradigm of how we create and interpret vegetation classifications in a semi-arid ecosystem.
In the first chapter, I examine the potential of new remote sensing imaging platforms in combination with machine learning algorithms and cloud computing as they apply to time-series analyses for vegetation classification. The results of this indicate that sinusoidal approximations (“Harmonic Models”) of vegetation indices are able to predict vegetation cover with nearly the same accuracy as monthly composites, and that a combination of both perform no better than either. Additionally, I examine how assigning classes to training data (e.g. species-level, plant functional type) influence the classification accuracy, interpretability, and potential uses. Stricter class membership requirements at increasingly aggregated scales (e.g. PFT) lead to greater accuracies.
Finding the implication of this conclusion unsatisfactory – at the extreme, everything was either cheatgrass or bare, the shrub class succumbing almost entirely to errors of omission – I investigated approaching other methods to assign classes to field data that captured more of the realities of semi-arid vegetation within our study area. To this extent, k-means clustering was used to determine what community classes were present in the field data. The outcome of this approach is a class where each potential constituent cover has a known distribution. Overall accuracies were found to be lower for this approach. However, the classification outcomes quantify overlapping distributions of cover types (e.g. ‘sagebrush’ or ‘shrub’) between classes. These accuracies are assessed using ‘fuzzy’ confusion matrices. This enables more information to be preserved through the remote sensing classification process, and reserves more interpretation for the map user than a typical ‘hard’ classification. Importantly the distributions of cover types are likely most representative of field conditions and thus more useful to land managers making holistic decisions about restoration or fuel management.
For the second chapter, I delved deeper into the potentials of new remote sensing and computing platforms to predict biological soil crust cover. The growing field of research on biological soil crust points to potentially significant implications for nutrient and water cycling, in addition to positive effects on native vascular vegetation. However, spatial data are lacking due to remote sensing limitations. Using time-series of multispectral imagery (from Chapter 1) and data fusion of radar and geophysical parameters, I developed a map of biocrust cover for the study area with high accuracy. This outcome allows us to examine important predictor variables (e.g. particular vegetation indices, soil type) and their relationship to plot-scale processes related to biological soil crust while also providing the spatial data needed for biological soil crust to be included in studies at the landscape scale.
Enterkine, Joshua, "Remote Sensing Time-Series Analysis, Machine Learning, and K-Means Clustering Improves Dryland Vegetation and Biological Soil Crust Classification" (2019). Boise State University Theses and Dissertations. 1523.