Boise State University Theses and Dissertations

Remote Sensing Time-Series Analysis, Machine Learning, and K-Means Clustering Improves Dryland Vegetation and Biological Soil Crust Classification

Joshua Enterkine, Boise State University

Publication Date

5-2019

Date of Final Oral Examination (Defense)

11-9-2018

Type of Culminating Activity

Thesis

Degree Title

Master of Science in Geoscience

Department

Geosciences

Major Advisor

Nancy Glenn, Ph.D.

Advisor

Jodi Brandt, Ph.D.

Advisor

Jennifer Forbey, Ph.D.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Dryland and semi-arid vegetation communities, although they appear relatively simple and homogeneous to the casual observer, are in fact the opposite. Upon further inspection, semi-arid vegetation is highly complex and heterogeneous at almost any scale. The same holds true for biological soil crust. Growing concern about global changes in climate, nutrient cycles, and land use has required increasing scrutiny of our understanding of these communities and all of their constituents, as we seek to improve forecasting models and inform land management decisions. This thesis aims to provide insight into how we create and interpret vegetation classifications in a semi-arid ecosystem.

In the first chapter, I examine the potential of new remote sensing imaging platforms in combination with machine learning algorithms and cloud computing as they apply to time-series analyses for vegetation classification. The results indicate that sinusoidal approximations (“Harmonic Models”) of vegetation indices are able to predict vegetation cover with nearly the same accuracy as monthly composites, and that a combination of both performs no better than either alone. Additionally, I examine how the way classes are assigned to training data (e.g. species-level, plant functional type) influences the classification accuracy, interpretability, and potential uses. Stricter class membership requirements at increasingly aggregated scales (e.g. PFT) lead to greater accuracies.
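To make the idea of a sinusoidal approximation concrete, the following is a minimal sketch of a first-order harmonic fit to a vegetation index time series. It is not the thesis implementation: the model form (one annual cosine/sine pair), the function names `solve_linear` and `fit_harmonic`, and the synthetic NDVI values are all assumptions chosen for illustration.

```python
import math

def solve_linear(a, b):
    """Solve a small dense system a.x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_harmonic(times, values):
    """Least-squares fit of v(t) = b0 + b1*cos(2*pi*t) + b2*sin(2*pi*t), with t in years.

    Hypothetical helper: returns the mean, amplitude, and time of the annual peak,
    which could then serve as per-pixel predictors for a classifier.
    """
    rows = [[1.0, math.cos(2 * math.pi * t), math.sin(2 * math.pi * t)] for t in times]
    # Normal equations: (A^T A) b = A^T v
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    atv = [sum(r[i] * v for r, v in zip(rows, values)) for i in range(3)]
    b0, b1, b2 = solve_linear(ata, atv)
    phase = math.atan2(b2, b1)
    return {
        "mean": b0,
        "amplitude": math.hypot(b1, b2),
        "peak_time": (phase / (2 * math.pi)) % 1.0,  # fraction of the year
    }

# Synthetic (made-up) NDVI series: two years, 24 observations per year,
# mean 0.3, amplitude 0.2, peaking 40% of the way through each year.
times = [i / 24 for i in range(48)]
ndvi = [0.3 + 0.2 * math.cos(2 * math.pi * (t - 0.4)) for t in times]
fit = fit_harmonic(times, ndvi)
```

Under these assumptions, the three fitted terms summarize a whole season of observations in a few numbers per pixel, which is one way a harmonic model can stand in for a stack of monthly composites.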

Finding the implication of this conclusion unsatisfactory – at the extreme, everything was either cheatgrass or bare, the shrub class succumbing almost entirely to errors of omission – I investigated other methods of assigning classes to field data that captured more of the realities of semi-arid vegetation within our study area. To this end, k-means clustering was used to determine what community classes were present in the field data. The outcome of this approach is a set of classes in which each potential constituent cover has a known distribution. Overall accuracies were found to be lower for this approach. However, the classification outcomes quantify overlapping distributions of cover types (e.g. ‘sagebrush’ or ‘shrub’) between classes. These accuracies are assessed using ‘fuzzy’ confusion matrices. This enables more information to be preserved through the remote sensing classification process, and reserves more interpretation for the map user than a typical ‘hard’ classification. Importantly, the distributions of cover types are likely more representative of field conditions and thus more useful to land managers making holistic decisions about restoration or fuel management.
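The clustering step described above can be sketched as follows. This is an illustrative example only, not the thesis workflow: the plot cover fractions are invented, the cover types (shrub, grass, bare) and the choice of three clusters are assumptions, and the deterministic farthest-point initialization is a simplification chosen so the example is reproducible.

```python
def dist2(p, q):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def assign(points, centers):
    """Label each point with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda j: dist2(p, centers[j])) for p in points]

def kmeans(points, k, iterations=50):
    """Plain Lloyd's algorithm with farthest-point initialization (an assumption)."""
    centers = [list(points[0])]
    while len(centers) < k:
        far = max(points, key=lambda p: min(dist2(p, c) for c in centers))
        centers.append(list(far))
    for _ in range(iterations):
        labels = assign(points, centers)
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return assign(points, centers), centers

# Hypothetical field plots: fractional cover of (shrub, grass, bare ground).
cover_types = ("shrub", "grass", "bare")
plots = [
    (0.60, 0.20, 0.20), (0.55, 0.25, 0.20), (0.65, 0.15, 0.20),
    (0.05, 0.80, 0.15), (0.10, 0.75, 0.15), (0.00, 0.85, 0.15),
    (0.05, 0.10, 0.85), (0.10, 0.05, 0.85), (0.00, 0.15, 0.85),
]
labels, centers = kmeans(plots, k=3)

# Per-class distribution of each cover type, summarized as (min, mean, max).
summary = {}
for j in set(labels):
    members = [p for p, lab in zip(plots, labels) if lab == j]
    summary[j] = {
        name: (min(col), sum(col) / len(col), max(col))
        for name, col in zip(cover_types, zip(*members))
    }
```

Each resulting class carries a range of cover for every constituent rather than a single dominant label, which is the property the abstract highlights: a map user can see, for example, how much shrub cover a class typically contains instead of only whether it is called ‘shrub’.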

For the second chapter, I delved deeper into the potential of new remote sensing and computing platforms to predict biological soil crust cover. The growing field of research on biological soil crust points to potentially significant implications for nutrient and water cycling, in addition to positive effects on native vascular vegetation. However, spatial data are lacking due to remote sensing limitations. Using time series of multispectral imagery (from Chapter 1) and data fusion of radar and geophysical parameters, I developed a map of biocrust cover for the study area with high accuracy. This outcome allows us to examine important predictor variables (e.g. particular vegetation indices, soil type) and their relationship to plot-scale processes related to biological soil crust, while also providing the spatial data needed for biological soil crust to be included in studies at the landscape scale.

Comments

ORCID: https://orcid.org/0000-0001-6956-3619

DOI

10.18122/td/1523/boisestate

Recommended Citation

Enterkine, Joshua, "Remote Sensing Time-Series Analysis, Machine Learning, and K-Means Clustering Improves Dryland Vegetation and Biological Soil Crust Classification" (2019). Boise State University Theses and Dissertations. 1523.
10.18122/td/1523/boisestate

Included in

Other Earth Sciences Commons
