Modeling Shallow Landslide Potential for Watershed Management
By Laura M. Vaugeois and Susan C. Shaw
The Forest Practices Division of the Department of Natural Resources is mandated to
regulate timber harvest operations in Washington State. It was recognized that the
existing screen for shallow landslides was not sufficient and that several viable
GIS-based models for slope stability were currently in use by timberland managers and
scientists. However, no information existed that compared the effectiveness of the
various models. Prior to creating a new screen of modeled slope stability, a
comparative analysis of several quantitative GIS models was undertaken to determine
which model was most effective in predicting shallow landslide potential.
Introduction
Over the past several years, a variety of entities have developed GIS-based
models for shallow-rapid slope stability. These models, however, have not been
rigorously compared or adapted for statewide application to management and
regulation of forest lands. This report briefly describes the methods, results, and
conclusions of our comparative analysis. This test was conducted under contract to the
Washington Timber/Fish/Wildlife (T/F/W) Program (i.e., a cooperative group of
regulatory, tribal, environmental, and industrial sponsors who collectively makes
recommendations to the Washington Forest Practices Board (WFPB) on matters
related to forest management; T/F/W, 1992) and Washington Forest Protection
Association (WFPA), as a precursor to developing the statewide slope-stability screen
required by the WFPB.
During the course of this study, our focus expanded from evaluating models for
use in regulatory watershed analyses and routine forest management, to include an
assessment of their potential as statewide landslide-screening tools. This shift was
driven primarily by the T/F/W negotiations and the resulting commitments of the
legislature to promote the development of a statewide screen. We are developing a
similar test for watersheds in each of the distinct geomorphic provinces in eastern
Washington, as groundwork for creating a statewide screen of shallow landsliding.
This test should help determine whether any of these GIS-based models can
accommodate the geology and climatic regimes east of the Cascades Range.
People interested in land management in the Pacific Northwest historically have possessed limited means for evaluating landslide
potential where activities are proposed. Existing information on site characteristics and failure potential typically has been confined to small
geographic areas (e.g., 20 km2 or less) in which landslide inventories, geomorphic research, or semi-empirical stability analyses have been
conducted. More recently, private landowners and natural-resource agencies in Washington State have initiated a regulatory form of watershed analysis
(WFPB, 1995) for specific landscape units (i.e., Watershed Administrative Units (WAUs), usually less than 200 km2 or 78 mi2 in
size), in which landslide inventories are developed largely with the aid of aerial photographs and limited field reconnaissance. Landslide assessments
in only about 60 of the 764 Watershed Administrative Units, however, have been finalized and approved by the state during the last seven years
(Washington Department of Natural Resources (WDNR), 1999). Furthermore, incomplete and often imprecisely mapped state soil surveys and their
slope-failure ratings still constitute the main source of information used by state regulatory foresters to evaluate management proposals in areas
outside of those where reliable landslide assessments have been performed.
GIS-based slope stability models can be useful to managers for screening
potential landslide areas and determining where land-use or habitat-restoration
activities should be concentrated, to regulators for determining whether environmental
checklists or impact statements are required, and to analysts for developing preliminary
hazard-zonation maps. Isolated tests of GIS-based models in the Pacific Northwest
have suggested that preliminary landslide-failure or hazard-zonations maps can
provide more accurate slope-stability information than customarily can be interpreted
from topographic, geologic, or soil maps alone (e.g., Shaw and Johnson, 1995;
Montgomery et al., 1998).
Description of Test Models
Three GIS-driven models have been selected for this evaluation, based on their
current availability, potential for adaptation to management decision-making, and/or
use by T/F/W cooperators in field applications or previous tests of model performance.
They are the current statewide soil-stability screen, maintained by the WDNR and
herein labeled SOILS; the shallow landslide model of Montgomery and Dietrich (1994),
nicknamed SHALSTAB by its authors; and the shallow landslide model of Shaw and
Johnson (1995), herein referred to as SMORPH.
The three selected models have a number of elements in common. They use
geographic information systems (GIS) to couple DEM data with assumptions regarding
topographic attributes that influence slope destabilization and with algorithms for
calculating slope stability. Whereas the SHALSTAB and SMORPH models assume
that topographic relief (i.e., hillslope gradient) and form (i.e., slope curvature) are the
principal driving factors in promoting shallow landslides, the SOILS screen assumes
that only gradient is a critical variable. These assumptions derive from previous
studies suggesting that shallow landslides occur most often above a threshold gradient
and in topographic convergences where shallow subsurface flow concentrates, such as
hollows and channelized depressions, with consequent effects on soil moisture and
strength (e.g., Dietrich and Dunne, 1978; Swanson et al., 1981; Swanson and
Fredriksen, 1982; Sidle et al., 1985; Montgomery and Dietrich, 1994). This simplifying
assumption permits a number of key slope-stability factors to be treated implicitly,
including substrate type, bedrock structure, rainfall duration and intensity, soil depth,
soil conductivity and strength, plant transpiration, root strength, and subsurface
drainage properties.
In addition, each model is limited similarly by the accuracy of the DEM data; that
is, these models are only as good as the DEMs on which they are based. Much of
western Washington is mapped with DEMs at a 10-meter resolution. For regions in
which DEMs are available only on a 30-meter grid, however, all models suffer
correspondingly in their precision and accuracy.
The three model differ primarily in the sophistication with which independent
physical parameters affecting slope stability are addressed. The SOILS screen relies
on hillslope gradient and soil type to rate slope-stability potential (Table 1)( WDNR,
1988). The SMORPH model explicitly treats gradient and slope curvature, while the
SHALSTAB model treats these topographic attributes as well as several key soil
physical and hydrological properties. From the standpoint of practical application,
there are advantages and disadvantages to each approach. Simpler models in which
key influencing factors are treated implicitly can be employed readily (i.e., with little to
no data collection) and for larger geographic areas. The level of site-specific accuracy,
however, might be reduced by assuming static or invariant hydrologic and geomorphic
conditions, and by extrapolating local data on soil and hydrologic properties to the
basin or regional scale. The advantage of explicitly treating parameters such as
rainfall, subsurface hydrology, and soil properties is that the model might identify
patterns of potentially unstable ground at a higher resolution. Consequently, such
models are useful for predicting site conditions in the local area for which the input data
apply. Conversely, employing local data might limit the ability of the model to predict
accurately the spatial distribution of unstable slopes at a landscape scale. This
approach also requires considerably more data collection in the field. Some factors, for
example subsurface hydrologic and soil strength properties, might be very difficult to
analyze and measure due to their spatial and temporal variations and their complex
physical interactions.
A number of other models were considered but not chosen for this comparative
test because of availability and software-development issues. (Wu and Sidle, 1995, Wu
and Abdel-Latif (1995, 1997), Pack et al. (1998), and Earth Systems Institute, (pers.
comm.). Other methods were too site-specific to be applied over large geographic
areas, as required of a watershed analysis or statewide landslide screen (e.g., LISA
and DLISA; Hammond et al., 1992). For a general review of analytical methods other
than GIS-based modeling, see literature reviews in papers by Montgomery and Dietrich
(1994) and Wu and Sidle (1995).
Methods
Study areas and landslide data
We chose eight areas in western Washington (Figure 1) for this comparative
test. The test basins range in size from 81 km2 to 331 km2 (Table 4). Existing
Watershed Administrative Units (WAUs) were used as the test-basin boundaries,
wherever possible. WAUs, defined for the purposes of regulatory watershed analysis,
typically follow major drainage divides; the larger-order river systems, however, may be
divided into several WAUs to limit the watershed analyses to a maximum acreage that
reasonably could be assessed in the limited time period permitted by law (WFPB,
1995). Hence, some of our test basins comprise only the upper or mid- sections of a
major river system (e.g., Chehalis Headwaters WAU, Middle Hoh WAU). Preference
was given to those WAUs with recently completed watershed analyses, to utilize
existing databases and to take advantage of the standardized format of data collecting
used in this regulatory process.
We attempted to include at least one test watershed in each of the major
geologic provinces in western Washington (Table 4; Thorsen, 1978). Parent materials
range from glacial till/outwash and lightly metamorphosed sediments to volcanics and
igneous intrusives. Test basins also vary in topographic relief (i.e., lowest to highest
elevation points) from 818m., in the Chehalis Headwaters basin, to 1941m. in the
Jordan-Boulder basin. Our intent was to examine model performance in areas with
different combinations of relief and parent materials, as a means for exploring model
versatility and the feasibility of using each model as a management tool in diverse
topographic and geologic settings. The eight test basins contain a total of 2524 known
landslides (Table 4), including shallow and deep-seated landslides (i.e., earthflows).
We retained data on deep-seated landslides (e.g., earthflows) in the test database to
evaluate the ability of each model to predict shallow landslide features that often are
superimposed on more areally extensive earthflows.
Existing digital landslide inventories were acquired from the appropriate
landowners in the test basins where watershed analyses had been performed (Table
5). Where inventories were not current or were spatially incomplete (i.e., original
inventories covered only portions of the test area), we conducted aerial-photograph
and field surveys to fill in data gaps. Aerial-photo series extended from the mid-1940's
through 1996, in most instances. All inventories were updated chronologically to
include, at a minimum, the most recent storm event known to have triggered
widespread landsliding throughout Washington State (i.e., the high-intensity, long-duration storm of February, 1996; Gerstel, 1996). In addition, most inventories were
checked in the field to verify database accuracy (e.g., landslide type, location, size).
Road-related failures were retained in the test database, to evaluate the theory (e.g.,
Montgomery et al., 1998) that their locations are governed largely by hillslope gradient
and topographic convergence. Standardized field data-forms were designed similar to
the those used in the mass wasting assessment of the regulatory watershed analysis
(WFPB, 1997, Appendix A). Newly identified landslides were mapped on to 1:24,000
scale topographic maps and then digitized into the GIS (Arc/Infotm, version 7.2, for
UNIX on a Solaris platform), coded, and edit-checked for positional and tabular
accuracy.
In some cases, we updated the landslide inventories to include small landslides
(i.e., less than 100m2) that might have been omitted due to time and mapping-resolution limitations that customarily constrain the regulatory watershed-analysis
process. We increased the number of recorded landslides on these inventories by an
average 12%, during our field and aerial-photo verifications of the databases. In the
Upper East Fork Lewis River watershed, for example, our reanalysis of the GIS
landslide-inventory cover maintained by the USFS resulted in a 70% increase in the
number of recorded landslides. Hence, the watershed-analysis-derived landslide
inventories really only provide a lower limit on the number of landslides present during
the time period evaluated by the analyst (i.e., typically coinciding with the aerial-photo
record). Consequently, landslide inventories were used here only as a common basis
for comparing model abilities to predict known contemporary landslides, recognizing
that other shallow landslides have been overlooked or perhaps no longer can be
discerned in the field and photo records due to such obscuring factors as vegetation
regrowth.
All inventory data were projected into Washington State Plane, south zone,
North American Datum 1927. Having all data in the same projection allowed us to
easily incorporate other existing data (e.g. hydrography, transportation), as well as
provide a uniform projection from which to work.
We encountered a number of problems with existing landslide data while
updating and verifying mass-wasting inventories from the completed, regulatory
watershed analyses. These included incorrect basemaps on which landslides were
recorded, as well as incorrectly mapped landslides. Discrepancies between USDI
Geological Survey (USGS) topographic maps and basemaps created from GIS for use
in watershed analysis typically included differences in topographic-contour delineations
and stream-channel positions. Keying landslide locations to these features on USGS
topographic maps, for example, apparently cause a positional offset when data are
transferred to GIS DEM-based topography. A number of mapping errors also appeared
to be related to inaccurate transfer of field data onto basemaps or incorrect digitizing
from basemaps. In the Sol Duc watershed, for example, we determined from a
reassessment of aerial photographs that several landslides were mapped in tributaries
adjacent to the ones in which they actually exist. Hence, we remapped and redigitized
landslides wherever we encountered such discrepancies during field or aerial-photo
verification.
Another common mapping problem is related to landslide size. Mapping
techniques used by analysts ranged from representing landslides as a point or symbol
(e.g., circle) to delineating slides as polygons of finite area. The latter technique also
included a range of mapping styles, from mapping the failure scarp separately to
delineating the entire portion of slope involved in landsliding (e.g., some combination of
the contributing area, initiation point, transport zone, debris-flow runout track, and
depositional area), generally accompanied by little or no explanation of mapping style.
In addition, landslide mapping is prone to some amount of inaccuracy, given that data
are transferred between a number of different media (e.g., photos, maps, digital
databases) with varying levels of resolution and precision, and often between different
workers (e.g., field technicians, analysts, cartographers).
To address problems of mapped landslide location and size, we created a buffer
around landslides mapped as points or symbols, or polygons smaller than 100m2. The
buffer, mapped as a polygon of radius 15m. (50 ft.) around the presumed center of the
landslide feature, assured that landslides registered in a 100m2 DEM grid cell when
inventory data were compared with GIS model output. In many cases, landslide scarps
and bodies were remapped, during aerial-photo and field verification of the existing
databases, to exclude associated features (e.g., contributing areas and debris-flow
runout tracks). The landslide polygons then were joined with the buffered landslide
points to create a single coverage of mapped landslides. The polygon and buffer
method also served to extend the mapped landslide area by an amount slightly larger
than a DEM 10-m. grid cell, to account in part for imperfectly aligned digital landslide-inventory data and DEM topography.
Landslide hazard-zonation maps, created as a product of regulatory watershed
analyses, were employed in this study to evaluate the ability of GIS models to predict
areas considered by field analysts to have a potential for instability. Hazard-zonation
maps produced via the regulatory watershed-analysis process (i.e., Mass-Wasting Map
Unit maps; WFPB, 1997, Appendix A) typically delineate areas of presumed low,
moderate, and high potential for landsliding and delivery of debris to downstream (or
downslope) areas with sensitive public resources. Digital hazard-zonation maps were
available in only four of the eight test basins (Table 5).
The principal dilemma faced with hazard-zonation maps is mapping resolution.
Watershed analysts appear to use two styles of mapping: fine-scale and broad-brush.
Fine-scale mappers delineate map units in detail, attempting to include in a high-hazard polygon only those slopes a high probability of shallow landsliding and to
exclude any stable ground (e.g., the ridge lines between hollows in steep, dissected
terrain). Given that such resolution can be intractable on 1:24,000 scale maps, another
mapping option is to include the entire area in a generic mapping unit and explain in
the report text how to differentiate high and low hazard zones on the ground. These
broad-brush techniques promote Type II mapping errors, in which more area is
included in a high-hazard unit than likely would fail.
GIS model calibration and database development
The SOILS screen required no adjustments to be employed in this study, and in
fact cannot be adjusted to accommodate any new information, including altered soil
classifications or gradient classes, without significant revamping of the GIS cover. The
digital soils database for federal lands, maintained by the USDA Forest Service on the
Internet, was merged with that maintained for state and private lands by the WDNR
(1988). Nonetheless, six of eight test basins had incomplete digital soil covers (Table
5), due largely to gaps in soils-layer coverage on federal property (Figure 2). For
statistical analysis of comparisons between the digital landslide inventories and soils
slope-stability cover in these test basins, an existing landslide was given a "no data"
value where the soils cover was lacking.
The SMORPH model was calibrated in each test basin with its respective
landslide-inventory data to adjust the critical slope classes and their hazard-rating
designations in the gradient-curvature matrix (Table 2). A slope map derived from the
DEMs was intersected with the landslide inventory to determine the maximum gradient
found in each landslide polygon. A curve of maximum gradient versus cumulative
frequency percent was created with the lowest gradient at which a landslide occurred
being used to determine the lower class limit of the moderate hazard rating. The lower
class limit of the high hazard rating was established at a value for which 15% of the
landslides occurred (Table 6), to guarantee a model-prediction rate of at least 85% of
observed landslides.
For consistency with other published tests of the SHALSTAB model (e.g.,
Montgomery et al., 1998), we used the following soil-property values: soil depth (z) =
1.0m; soil bulk density (s) = 2000 kg/m3; internal friction angle () = 33; effective
cohesion (C') = 2 kN/m2; and transmissivity (T) = 65 m2/day. These values were
selected by Montgomery et al. (1998) based on extensive field measurements in a
small catchment in coastal Oregon (Montgomery et al., 1997), and the authors felt that
they gave reasonable results for their test watersheds in western Washington,
including the Chehalis Headwaters WAU that we also use as a test basin. We then
compared predictions of unstable-slope potential for the range of angles and
effective cohesions set internally in the model to yield a standard range of outputs (i.e.,
default parameters; = 33and 45, and c'= 0, 2, 5, 8, 15 kN/m2), to evaluate the effect
of modifying these parameters. In section 4.2 of this paper, we discuss the sensitivity
of model output to variations in input values.
Comparing SHALSTAB with the other GIS models required that we reduce all
model outputs to a common denominator. SMORPH and the SOILS screen yield output
in terms of management hazard ratings (e.g., low, moderate, high), in which the more
subjective determination of what constitutes "hazard" and "risk" previously has been
made in the policy arena. For example, the SMORPH slope matrix is calibrated with
landslide-inventory and hazard-zonation databases created during regulatory
watershed analyses for which definitions of hazard and risk have been set by T/F/W
policy and WFPB regulations (WFPB, 1995, Chapter 222-22 WAC). Likewise, the
SOILS screen hazard designations are derived from unstable-slope ratings in the state
soil surveys. In the absence of another mechanism for converting all model outputs to
the same units of measure, we therefore elected to assign hazard ratings to the
SHALSTAB model output values of predicted critical rainfall, by using rainfall intensity
and duration as the diagnostic criteria.
Given that SHALSTAB model output is expressed as rainfall in mm/day, we
created "precipitation rules" for each test basin by clipping the two-year, 24-hour storm
isohyte data (WDNR-GIS; Miller et al., 1973) and computing the minimum, maximum,
and mean precipitation values for each basin. A high hazard rating was given to each
DEM grid cell in which the predicted critical-rainfall value fell in the model-defined Qc -stability class occupied by the mean precipitation value calculated for that basin (Table
7). A high rating was also given to any predicted Qc less than the minimum two-year,
24-hour calculated precipitation. A moderate hazard rating was assigned to a DEM cell
in which the critical rainfall value occupied the Qc -stability class corresponding to the
maximum calculated precipitation. A low hazard rating was assigned to all other Qc
stability classes. See Table 7 for the precipitation rules and slope-stability hazards
created for each test basin.
The two-year, 24-hour recurrence interval was chosen as the precipitation
regime for which data were readily available and which yielded the most conservative
estimate of failure potential. The SHALSTAB model is configured such that the less
frequent rainfall event yields a greater percentage of the basin area predicted to fail
(Montgomery and Dietrich, 1994). Theoretically, then, a higher-intensity storm event
characteristic of a longer recurrence interval, and/or a longer-duration rainfall, would
result in greater spatial distribution of potential shallow landslides.
This method of assigning management criteria to SHALSTAB output was chosen
in the absence of established techniques or direction provided by the authors (e.g., see
discussion of management applications in Montgomery et al., 1998). A preferred
approach might be to adjust the model in each test basin by using measured values of
input parameters (e.g., soil transmissivity, bulk density, cohesion, internal friction
angle), and calibrating predicted distributions of slope stability with observed landslide
inventories and/or associated hazard-zonation maps in which management criteria
have been assigned (i.e., similar to the approach used by SMORPH). Adjusting input
parameters in the current version of the SHALSTAB model is problematic, given the
relative paucity of soil-property data and the current lack of published algorithms for
modelling stochastic elements or calibrating them from landslide inventories. Obtaining
sufficient soil-parameter samples to adequately describe their spatial variability also
could be intractable or prohibitively expensive for creating a landscape or regional GIS
cover of predicted slope stability.
Calibrating model output with landslide-potential ratings from hazard-zonation
maps is problematic. We found, for example, that hazard map units with different
management designations (e.g., low and high) might contain DEM grid cells with the
same range of Qc - slope stability class values (e.g., 2 through 7; see Table 3), making
it difficult to segregate the eight model-output classes into discrete management
categories of low, moderate, and high. Calibrating model outputs solely on the basis of
landslide inventories also can be misleading because, as discussed previously, they
typically represent only contemporary rates of shallow landsliding, thus conceivably
underestimating the density of potential landslide sites. Landslide density commonly
has been a key factor in assigning management criteria to hazard-potential map
polygons created from inventories (e.g., WFPB, 1997).
The precipitation rules imposed by this study make a number of assumptions,
not the least of which is steady-state throughflow of subsurface water. The SHALSTAB
model, however, is founded on the assumption of steady-state rainfall, constant
transmissivity, and spatially uniform soil saturation (Montgomery and Dietrich, 1994).
Hence, the steady-state precipitation rules are consistent with these assumptions.
Results and Discussion
We evaluated the performance of each model by using the GIS to intersect the
updated, digital landslide inventories and hazard-zonation maps with model predictions
of slope stability. For each model, output was expressed in terms of management
criteria (i.e., low, moderate, high "hazard"), as described in the report section 3.0, so
that model performances could be compared directly. We statistically analyzed the
following, as a measure of the performance of each model:
(1) intersection of the digital landslide inventory with model predictions of hazard
potential, expressed as the number of incorrectly identified landslides per total number
of landslides in each test basin (i.e., Type I model errors);
(2) intersection of the hazard-zonation maps with model predictions of hazard
potential, given as the percent probability that the model predicts a low landslide
potential where it is likely that landslides have occurred or will occur (i.e., Type I model
errors); and,
(3) intersection as in (2) but expressed as the percent probability that the model
predicts the potential for landslides where they are not likely to occur (i.e., Type II
model errors).
Inventories of known existing landslides and maps of hazard potential often are
used in different management contexts. For that reason, we calculated Type I errors
first by intersecting model outputs with the landslide inventories, to evaluate the ability
of each model to predict the spatial distribution of existing landslides. We then
computed Type I errors associated with comparing model outputs and hazard-zonation
maps, to assess model abilities to predict the spatial distribution of existing and
potential slope instability. Given that landslide inventories typically provide only a
minimum estimate of contemporary landslide rates, the hazard-zonation maps
theoretically yield a more complete view of the spatial distribution of past, present, and
potential future landslide occurrences.
Table 8 lists, for each model, the number of incorrectly identified landslides per
total number of landslides in each test basin (i.e., Type I errors). We assumed that an
existing landslide was identified incorrectly if all DEM grid cells overlapping the
landslide polygon or its 15m. (50 ft.) buffer (e.g., see report section 3.1) were coded by
the model as having a low potential (hazard) for shallow landsliding. Conversely, an
existing landslide was assumed to be identified correctly if any overlapping DEM grid
cell was predicted to have a moderate or high potential (hazard) for landsliding. DEM
cells with no data entry in the SOILS screen (i.e., missing soil-survey data) were coded
as an incorrect identification, to account statistically for the incomplete nature of the
data coverage. For this test, the SHALSTAB model was run using default parameters
= 33 and C = 2 kN/m2 and assuming that the two-year 24-hour storm recurrence
interval is a reasonable criterion for assigning hazard-potential ratings to the model
output (i.e., see report section 3.3).
A principal assumption of the model comparative tests is that predictions of
landslide probability densities can be compared even though the GIS covers contain
known mapping artifacts (e.g., elevation banding), as described in section 3.2. Given
that model predictions of slope stability are evaluated using the same DEMs and
landslide databases, the model outputs could be evaluated relative to one another.
However, computed statistics (e.g., average number of landslides incorrectly identified
by each model) should be viewed as estimates rather than absolute values, because
the errors in model predictions associated with database noise (e.g., DEM elevation
banding, field-mapping accuracy and resolution).
Table 8 indicates that the SOILS screen did not identify 32% of the total known
landslides in all eight test basins, whereas the SMORPH and SHALSTAB models
misidentifed 3% and 8%, respectively. The significantly higher percentage of
landslides missed by the SOILS screen can be attributed to the lack or near lack of
soil-survey data for two of the test basins (i.e., the North Fork Stilliguamish and Upper
East Fork Lewis watersheds; see Table 5), given that missing data were coded as
undetected landslides for the purposes of comparing model performances (see report
section 3.1). Where the SOILS screen was complete (e.g., Morton and Chehalis
Headwaters watersheds), however, it misidentified a significantly higher percentage of
landslides than the other two models (e.g., for the Chehalis Headwaters watershed,
32% versus 2% each for the SMORPH and SHALSTAB models).
In the Olympic Peninsula test basins, the SOILS screen misidentified more
landslides than SMORPH but fewer than SHALSTAB (e.g., in the Hoh watershed, 67
versus 53 and 84, respectively). The fact that these were the only basins for which 30-m. DEMs were used was ruled out as a likely cause. In other test basins for which
model results were compared using both 10-m. and 30-m. DEMs, there was no change
in the ordering of models based on their predictive accuracy, although the relative
magnitudes of predicted landslide occurrence (i.e., number of correctly identified
existing landslides) differed between 10-m. and 30-m. DEM test results for each model.
Hence, the seemingly better performance of the SOILS screen might be explained by at
least two compounding factors. One is that, for the portions of the test basins in which
soils data exist, the SOILS screen classes 68% of the Sol Duc and 84% of the Hoh
basin terrain as potentially unstable or very unstable, so that the majority of the
landscape and its associated landslides fall within the high-hazard-potential category.
Although this result lends the appearance that the SOILS screen more closely reflects
the spatial distribution of known landslides than does SHALSTAB, it also tends to over-predict significantly the percent of watershed area predicted by field-derived, hazard-zonation maps to be potentially unstable (see further discussion of the SOILS screen in
this paper section).
Another compounding factor is that the SOILS screen and SMORPH model
consider hillslopes as being potentially unstable at gradients somewhat lower than the
threshold gradient defined in the SHALSTAB model. In the latter model, slopes are
considered unconditionally stable when tan tan [1 - (w/s)] which, for = 33 and
s = 2000 kg/m3, means any slopes less than 18 (32.5%). Field evidence suggests
that non-road-related shallow landslides have occurred in this region on slopes closer
to 25% (e.g., Shaw and Johnson, 1995; D. Parks, WDNR, pers. comm.), particularly in
gently sloped, groundwater-seepage areas whose downslope margins coincide with the
top of steep, inner-gorge slopes, which are quite common in this terrain. Hence, the
SHALSTAB model has the potential for under-predicting the spatial distribution of
unstable ground on hillslopes with gradients less than the threshold value set internally
by the model.
The SMORPH model predicted an average of 22 times fewer Type I errors than
the SOILS screen and five times fewer than the SHALSTAB model. The greatest
discrepancy in SMORPH and SHALSTAB model predictions occurred in the Hazel
watershed (1% versus 32% incorrectly identified; Table 8). Given that the Hazel
watershed is dominated by deep-seated landslides in thick glacial deposits (Table 4),
we expected the predictive capability of both models to diminish correspondingly, with
respect to locating earthflow-influenced topography. It appeared, however, that
SMORPH was better able to distinguish the local slope and curvature of numerous
shallow-landslide headscarps superimposed on the larger earthflows. Hence, the
polygons representing deep-seated failures effectively were identified by SMORPH
predictions of high hazard potential on the basis of these smaller secondary features.
This variation in results might be explained by the manner in which the two
models identify "hazard" potential in adjoining DEM grid cells. The SMORPH model
analyzes variations in topographic relief between adjacent cells based on their relative
steepness and curvature, then assigns a value according to the slope matrix (Table 2);
hence, the model can discern topographic changes between a flatter upslope cell and a
steeper downslope cell (i.e., a landslide headwall). On the other hand, the SHALSTAB
model can smooth (i.e., not detect) subtle variations in topographic relief at the DEM-cell scale, by assigning a given flow tube a Qc value depending on the flow across its
upper boundary (i.e., variable "a" in Equation 2) from upslope contributing areas,
which, in turn, is governed by the way in which flow is dispersed from that contributing
area to any one of a number of downslope grid cells. Hence, if the upslope contributing
area has a lower gradient and requires a relatively higher water flux to create "wet"
soils, then a relatively steeper cell downslope (e.g., a landslide headwall) might not be
predicted to fail until the same "wetness" is achieved. Hence, the grid cell downslope
of the contributing area is given a lower slope-stability rating, whereas SMORPH
assigns a higher value based solely on topographic factors.
Although Table 8 indicates that SMORPH yielded 43% fewer Type I errors in
predicting known landslide occurrences than SHALSTAB (Table 8), we wanted to
evaluate whether these differences in model performance, based on a comparison in
eight watersheds, were significant statistically. We used a non-parametric test for non-normally distributed, small, independent samples to evaluate the hypothesis that there
is no difference in the average performance of the SMORPH (SM) and SHALSTAB
(SH) models, in terms of their ability to predict the spatial distribution of known
landslides. The null hypothesis is that the means (µ) of the population of Type I errors
for each model are equal when only eight independent samples (i.e., test basins) exist;
H0: µSM = µSH. Equality of means was tested with the Wilcoxon rank-sum statistic for
two populations (Walpole, 1974; MathSoft 1998), in which the null hypothesis was true
if:
Pr [W w = (a - n(n+1)/2)] > ,
where Pr is the probability distribution, W is the test statistic, a is the smaller of the
summed ranks for each population, n is the number of observations corresponding to a,
and = 0.01, 0.05 is the level of significance. Table 9 indicates that the test statistic is
significant at a confidence level of 95%, permitting rejection of the null hypothesis,
which suggests that the models differ somewhat in their ability to predict known
landslide distributions; that is, µSM < µSH. However, the test statistic proved insignificant
at the 99% confidence level (Table 9), allowing acceptance of the null hypothesis and
implying that the difference in model predictive capability is relatively small. A similar
statistical comparison of SHALSTAB and the SOILS screen indicated that the test
statistic was significant at the 99% confidence level, implying that the screen and model
are considerably different in their ability to predict existing landslide distributions.
Tables 10 and 11, respectively, give the estimated Type I and Type II model
errors for the SMORPH and SHALSTAB model based on a comparison of model output
with hazard-zonation maps. Error distributions were not computed for the SOILS
screen, given that soils-survey data were complete in only two of the test basins,
neither of which had usable hazard-zonation maps. Type I errors were calculated, for
each model in each test basin, by intersecting the low-hazard DEM cells predicted by
the model with the moderate- and/or high- hazard map units produced via watershed
analysis (i.e., incorporating all map units intersecting with known landslides in the GIS
inventory layer). This database intersection was expressed numerically as a
percentage of model-predicted, low-hazard areas (in km2) overlapping field-mapped
hazard areas. Type II errors similarly were analyzed by intersecting the high-hazard
cells predicted by the model with the low-hazard map units and computing respective
areas. These estimates were made for the four basins in which we had access to
complete, digitized, hazard-zonation maps. To facilitate comparison (see Table 10 and
11), the percent error for each model (A/M) in each basin was normalized by the basin
area in a given hazard class (A) divided by the total A for all four basins (T), that is: E =
(A/M)(A/T).
Analysis of Type I error estimates with respect to hazard-zonation maps
indicates that the SMORPH and SHALSTAB models similarly under-predict the percent
area of hazard map units determined to be of moderate and/or high failure potential, by
an average 6% and 5%, respectively. Using the Wilcoxon rank-sum statistic for two
populations, as described previously, the computed test statistic proved insignificant at
the 95% confidence level (Table 9), implying that the models perform similarly in
predicting areas of relatively low hazard potential inside mapped landslide-hazard
areas.
Whether the observed discrepancies between model predictions and hazard-zonation map units represent true " Type 1 errors" in the statistical sense is debatable,
given that three of the four hazard-zonation maps (i.e., Jordan-Boulder, Hazel, and Sol
Duc River) were drawn using broad map polygons that incorporated both unstable
slopes and intervening stable ground. In the Jordan-Boulder basin, for example,
hazard-zonation units intentionally were drawn to include potential landslide sites (e.g.,
hollows, groundwater seeps, inner gorges) and intervening divergent topography (e.g.,
ridge lines) because it was not possible to delineate them on 1:24,000 scale maps
(Coho, 1997). Hence, the GIS-based models might discriminate, more accurately than
the hazard-zonation maps, the topographic features potentially influencing shallow
landslide initiation in finely dissected terrain.
As a test of the influence of mapping resolution on hazard zonation maps, we
intentionally created the hazard-zonation map units in the East Fork Lewis test basin
with as fine a resolution as possible on 1:24,000 scale maps. This allowed us to
compare model predictions with two different scales of hazard-map resolution. Type I
"errors" generated by SMORPH and SHALSTAB decreased substantially, from 14%
and 9% for the Jordan-Boulder basin, respectively, to 1% and 2% for the East Fork
Lewis basin (Table 10; values normalized as described previously). One implication of
this result is that GIS-based model predictions of slope-stability potential could be used
advantageously by analysts in drawing hazard-zonation maps with higher resolution
than demonstrated.
Table 11 shows the distribution of Type II errors generated by the SMORPH and
SHALSTAB models, based on comparisons with hazard-zonation maps. As in Table
10, error values are given as normalized relative percent areas. Calculated error
estimates for each of the test basins suggest that SMORPH over-predicts the percent
area of hazard-zonation map units designated as high landslide potential, by an
average amount slightly less than predicted by SHALSTAB (i.e., 3% versus 7%,
respectively). In all four test basins, SHALSTAB tended to over-predict, by a factor of
two greater than SMORPH, the spatial distribution of high-hazard areas observed on
hazard-zonation maps. With respect to the East Fork Lewis basin, which we believe
was mapped fairly carefully for the purposes of this study, some amount of model over-prediction (i.e., 16% for SMORPH and 43% for SHALSTAB) might be true Type II
errors. That is, the models likely do over-predict observed spatial patterns of slope-stability potential, as can be discerned from observed spatial patterns of existing and
potential landslides. Particularly in the case of SHALSTAB, however, some portion of
this over-prediction might be an artifact of the manner in which hazard-potential criteria
were derived (i.e., the Qc - slope stability classes assigned by precipitation rules to be
included in the high-hazard management designation), as discussed previously.
To evaluate the potential for model use in a management context, we developed
a ranking scheme to quantify model performance and a number of other comparative
criteria. We employed a statistical method for ranking models in terms of their ability to
correctly and incorrectly identify known, existing shallow landslides. A numeric value
was assigned to each of the possible database-intersect outcomes:
|
Type of database intersection |
Assigned value
(p) |
| Landslide overlaps with DEM cell coded by model as high
hazard |
0 |
| Landslide overlaps with DEM cell coded by model as moderate
hazard |
1 |
| Landslide overlaps with DEM cell coded by model as low hazard |
2 |
For example, an existing landslide was considered to be identified by a particular
model if any superimposed DEM grid cell was coded "high hazard" (p = 0) or "moderate
hazard" (p = 1). The assigned values for all correctly and incorrectly identified
landslides in each of the eight test basins were added to yield a cumulative score for
each model, which then was normalized by the total number of landslides in each
basin. Where landslides occurred in areas for which the soils survey data were
missing, the SOILS screen grid cells were given a score of p = 2. These normalized
scores then were added to a score sheet including results of other tested criteria, as
will be described in report section 5.0.
Table 12 shows the results of this ranked test. SHALSTAB gained
approximately twice as many points as SMORPH, reflected in the normalized
cumulative scores (i.e., 1.9 versus 0.8, respectively). The SOILS screen received a
significantly higher score (i.e., 6.7) than the other two models, due in part to the partial
or total absence of soils-survey data in most test basins. SHALSTAB received a
greater cumulative score than SMORPH, largely due to more frequent intersections of
identified landslide polygons with model-predicted low and moderate hazards. Some of
the discrepancy theoretically could be attributed to our assignment of management
criteria via the precipitation rules.
At the outset of this study, we posed the following questions with regard to
model performance: (1) How do model predictions of shallow landsliding compare with
existing landslide inventories and hazard-zonation maps?; and, (2) How do model
predictions compare with respect to each other? In summary, test statistics imply that
the SMORPH and SHALSTAB models predict fairly well the spatial distribution of
known existing landslides in the eight test basins (i.e., error frequency of 3% and 8%,
respectively). These models, in general, also compare favorably with maps of shallow-landslide potential produced via watershed analyses (i.e., 6% and 5% Type I errors,
respectively; and 3% and 7% Type II errors, respectively). The SOILS screen
performed least well, missing 32% of the known existing landslides (i.e., Type I errors)
and providing an incomplete cover of a substantial percentage of western Washington
terrain (e.g., full data coverage existed in only two of the eight test basins). Test
statistics also indicated that the mean differences in predictive model capability
between the SOILS screen and either model were statistically significant, whereas the
mean differences between SMORPH and SHALSTAB were marginally significant
statistically. Hence, we conclude that the SOILS screen is comparatively less accurate
and certainly less complete than the two tested models. While the average differences
in predictive capability of SMORPH and SHALSTAB were not great, the former model
tended to produce slightly fewer Type I and II errors. Contingent on the
appropriateness of the precipitation-rule algorithm used to calibrate the SHALSTAB
model, we conclude that SMORPH is slightly more accurate than SHALSTAB in
predicting existing and potential landslides as represented in our updated landslide-inventory and hazard-zonation-map databases (Figure 3).
Acknowledgments
This study was funded by the Washington Department of Natural Resources, the
Washington Forest Protection Association, and the Cooperative Research and
Effectiveness Monitoring Committee (CMER) of the Washington Timber/Fish/Wildlife
program. We thank Weyerhaeuser Company, the Tulalip Tribe, Murray-Pacific
Corporation, and WDNR for providing data and local knowledge of test watersheds.
Landslide and soils data for the Lewis watershed were obtained, in part, from the Web
site maintained by the U.S.D.A. Forest Service, Gifford Pinchot National Forest. We
thank David Montgomery, Tien Wu, and David Johnson for valuable input and
assistance in compiling and running their models. We also thank Matthew Brunengo,
Kate Sullivan, Daniel Miller, Tien Wu, George Pess, and Paul Kennard for valuable
input in the early phases of this project. In addition, we thank Harvey Greenberg for
assistance in making the SHALSTAB model operable on the WDNR GIS system;
Daniel Miller for sharing his model and many critical discussions; Andria Villines and
Elizabeth Freeman for assistance with statistical analyses; and Venice Goetz for her
field assistance in verifying landslide data.
Table 1. Criteria for determining slope stability from the SOILS data.
Soils criteria for slope stability ratings |
Mass Wasting
Potential |
| Very Unstable |
map units with slopes greater than 30% |
very high |
| map units with slopes up to 30% |
high |
| Unstable |
map units with slopes greater than 65% |
very high |
| map units with slopes up to 30% |
medium |
| map units with slopes from 30-65% |
high |
| Stable |
map units with slopes up to 30%, where the soil phase is rated as unstable |
medium |
| map units with slopes up to 30% |
medium |
| map units with slopes up to 30%, where the soil phase at 30-65% is also rated stable |
low |
Table 2. Matrix relating slope curvature and gradient to shallow landslide potential, as used in the
SMORPH model. The number and distribution of slope gradient classes (i.e., A - E) are
set for a specific geomorphic unit with the aid of landslide inventories or slope stability
analyses.
| Slope
curvature |
Slope gradient (percent) |
| A |
B |
C |
D |
E |
| Convex |
low |
low |
low |
low |
moderate |
| Planar |
low |
low |
low |
moderate |
high |
| Concave |
low |
moderate |
high |
high |
high |
Table 3. Critical rainfall classes (Qc) designated by the SHALSTAB model.
| Qc class |
Rainfall amount needed
to induce failure |
Qc class |
Rainfall amount needed
to induce failure |
| 1 |
Unconditionally unstable at this
cohesion |
5 |
200-400 millimeters per day |
| 6 |
greater than 400 millimeters per day |
| 2 |
0-50 millimeters per day |
7 |
Unconditionally stable |
| 3 |
50-100 millimeters per day |
8 |
Stable at this cohesion |
| 4 |
100-200 millimeters per day |
|
|
Table 4. Physical and geologic characteristics of test basins.
| Test Basin |
Physiographic
Area |
Geologic Province |
Area
(km2 and
acres) |
Topographic Relief
(m) |
Number of
Known
Landslides1 |
| Jordan-Boulder |
North Cascades
Range |
Northwest Cascades
Metamorphic Suite; includes
meta-quartz diorite, low-grade
schists and phyllites, and
plutonics |
133 km2
32,987 ac. |
1941 |
155 |
| North Fork
Stillaguamish
River |
North Cascades
Range |
Low-grade metamorphosed
sediments, including phyllite and
greenschist |
130 km2
32,144 ac. |
1504 |
215 |
| Hazel |
western flank of
Cascades Range -
Puget Lowlands |
Continental glacial deposits
overlying low-grade
metamorphosed sediments |
98 km2
24,209 ac. |
1528 |
117 |
| Sol Duc River |
northern Olympic
Peninsula |
Crescent Basalt and Olympic
Lithic Assemblage
(metamorphosed marine
sediments) |
185 km2
45,674 ac. |
915 |
101 |
| Middle Hoh
River |
western Olympic
Peninsula |
Western Olympic Assemblage;
extensively sheared and
metamorphosed marine
sediments |
331 km2
81,879 ac. |
1575 |
733 |
| Morton |
Central Cascades
Range |
Eocene to Recent andesitic
volcanics |
88 km2
21,686 ac. |
1127 |
980 |
| Chehalis
Headwaters |
Coast Range
(Willapa Hills) |
Eocene to Miocene mafic
volcanic assemblage |
182 km2
45,000 ac. |
818 |
134 |
| Upper East
Fork Lewis
River |
Central Cascades
Range |
Eocene to Recent andesitic
volcanics with igneous intrusions |
81 km2
20,016 ac. |
1022 |
89 |
1 Includes identified shallow and deep-seated landslides.
Table 5. DEM resolution and sources of data for the eight test basins.
| Test Basin |
Source of
Landslide
Inventory
Data |
Hazard-Zonation Map
Available |
DEM
Resolution |
Percent Basin
with Soils
Layer |
| Jordan-Boulder |
WDNR, 1997 |
Yes |
10m |
63% |
| North Fork
Stillaguamish
River |
Perkins and
Collins (1997);
inventories
created for this
study |
No |
10m |
22% |
| Hazel |
WDNR, 1998 |
Yes |
10m |
65% |
Sol Duc River
(4 WAUs) |
WDNR and
USDA Forest
Service (1996) |
Yes |
30m |
95% |
| Middle Hoh
River |
WDNR (in
preparation) |
No
(not yet
digitized) |
30m |
64% |
Morton
(Portions of 2
WAUs) |
Murray Pacific
Timber Corp.
(1998) |
No
(not available
in digital
format) |
10m |
100% |
| Chehalis
Headwaters |
Weyerhaeuser
Co. (1994);
updated for
this study |
No
(errors in
digital
database) |
10m |
100% |
| Upper East
Fork Lewis
River |
USDA Forest
Service (1997)
and
inventories
created for this
study |
Yes |
10m |
2% |
Table 6. Gradient threshold values (in percent) calculated from landslide databases for input to the
SMORPH slope matrix (Table 2) for each test basin. See text for discussion.
Test Basins |
Gradient threshold corresponding to "hazard" designations for each curvature
class |
| Low for
convex and
planar,
moderate
for concave |
Low for
convex and
planar, high
for concave |
Low for
convex,
moderate for
planar, high
for concave |
Moderate for
convex, high
for planar,
concave |
High for all
slope forms |
| Jordan-Boulder |
15 |
45 |
50 |
70 |
|
| N.F.
Stillaguamish
River |
15 |
40 |
47 |
70 |
|
| Hazel |
15 |
24 |
47 |
70 |
|
| Sol Duc River |
15 |
24 |
47 |
70 |
|
| Middle Hoh
River |
15 |
24 |
47 |
70 |
|
| Morton |
25 |
55 |
65 |
70 |
|
| Chehalis
Headwaters |
15 |
65 |
70 |
80 |
|
| Upper E.F.
Lewis River |
40 |
50 |
60 |
70 |
|
Table 7. Precipitation "rules" used to create management criteria for the SHALSTAB model. See
text for discussion.
| Test Basin |
Management Criteria |
Area-Weighted
Mean
Precipitation |
Area-Weighted
Maximum
Precipitation |
| Low
"Hazard" |
Moderate
"Hazard" |
High
"Hazard" |
| Jordan-Boulder |
6,7,8 |
5 |
1,2,3,4 |
108 |
127 |
| Upper North Fork
Stillaguamish |
5,6,7,8 |
4 |
1,2,3 |
83 |
102 |
| Hazel |
5,6,7,8 |
4 |
1,2,3 |
80 |
102 |
| Sol Duc |
6,7,8 |
5 |
1,2,3,4 |
129 |
152 |
| Middle Hoh |
6,7,8 |
5 |
1,2,3,4 |
185 |
229 |
| Morton |
5,6,7,8 |
4 |
1,2,3 |
100 |
114 |
| Chehalis Headwaters |
6,7,8 |
5 |
1,2,3,4 |
116 |
140 |
| E.F. Lewis |
6,7,8 |
5 |
1,2,3,4 |
123 |
140 |
Table 8. Predictions of known, existing shallow landslides using the three models
(SOILS screen, SMORPH, and SHALSTAB), given as the number of
incorrectly identified landslides (no. missed) per total number of
landslides in each basin (see text).
| Test Basin |
Number
of
Identified Land-slides
(T) |
SOILS |
SMORPH |
SHALSTAB
( = 33, C = 2kN/m2) |
| no.
missed
(N) |
N/T |
no.
missed
(N) |
N/T |
no.
missed
(N) |
(N/T) |
| Jordan-Boulder |
155 |
40 |
0.26 |
0 |
0.00 |
5 |
0.03 |
| North Fork
Stillaguamish River |
215 |
202 |
0.94 |
1 |
0.00 |
20 |
0.09 |
| Hazel |
117 |
34 |
0.29 |
1 |
0.01 |
37 |
0.32 |
| Sol Duc
River |
101 |
6 |
0.06 |
1 |
0.01 |
12 |
0.12 |
| Middle Hoh
River |
733 |
67 |
0.09 |
53 |
0.07 |
84 |
0.11 |
| Morton |
134 |
64 |
0.48 |
5 |
0.04 |
14 |
0.10 |
| Chehalis
Headwaters |
980 |
309 |
0.32 |
20 |
0.02 |
18 |
0.02 |
| Upper East
Fork Lewis
River |
89 |
89 |
1.00 |
2 |
0.02 |
1 |
0.01 |
| Mean
(Std. Dev.): |
315.5 |
101.4 |
0.43
(± 0.36) |
10.4 |
0.02
(± 0.02) |
23.9 |
0.10
(± 0.10) |
| Total: |
2524 |
811 |
0.32 |
83 |
0.03 |
191 |
0.08 |
Table 9. Wilcoxon rank-sum test for two populations, comparing means (µ) of error
distributions generated by the SMORPH and SHALSTAB models (see
Type I error estimates in Table 8 and 10).
| Test Criterion |
Test
Variable |
Comparison of
SMORPH (1)
and SHALSTAB
(2) |
Comparison
of SOILS (1)
and
SHALSTAB (2) |
| Type I errors:
Existing
landslides |
n1, n2 |
8, 8 |
8, 8 |
|
a1, a2 |
15.5, 48.5 |
53.0, 11.0 |
|
W test
statistic |
0.04 |
0.01 |
|
significant
at =
0.05? |
Yes;
µ1 < µ2 |
Yes;
µ1 > µ2 |
|
significant
at =
0.01? |
No;
µ1 = µ2 |
Yes;
µ1 > µ2 |
| Type I errors:
Hazard-zonation
map units |
n1, n2 |
4, 4 |
N/A
(see text) |
|
a1, a2 |
7.0, 9.0 |
|
|
W test
statistic |
0.44 |
|
|
significant
at =
0.05? |
No;
µ1 = µ2 |
|
|
significant
at =
0.01? |
No;
µ1 = µ2 |
|
Table 10. Type I model errors, in which each model predicts that shallow landslides likely do not occur, whereas field-derived maps of hazard zonation indicate that there is a moderate to high likelihood of landsliding.
| Test Basin |
Mass-Wasting Map
Unit Data |
SMORPH Model |
SHALSTAB Model |
| Basin Area
with
Moderate to
High Hazard
Rating (km2)
(A) |
Total
Basin
Acres
(%) |
Map
Unit
No.1 |
Basin Area
Predicted
with Low
Hazard
Rating (km2)
(M) |
(A/M)
= P |
E =
P(A/T)
|
Map
Unit
No.2 |
Basin Area
Predicted
with Low
Hazard
Rating (km2)
(M) |
(A/M)
= P |
E =
P(A/T) |
| Jordan-Boulder |
73.9 |
0.55 |
1 |
22.8 |
0.31 |
0.14 |
6, 7,
8 |
14.3 |
0.19 |
0.09 |
| Hazel |
78.9 |
0.81 |
1 |
12.3 |
0.16 |
0.08 |
5, 6,
7, 8 |
12.5 |
0.16 |
0.08 |
| Sol Duc River |
2.7 |
0.01 |
1 |
1.0 |
0.39 |
0.01 |
6, 7,
8 |
1.1 |
0.43 |
0.01 |
| Upper East
Fork Lewis
River |
9.0 |
0.11 |
1 |
1.7 |
0.19 |
0.01 |
6, 7,
8 |
2.3 |
0.26 |
0.02 |
| Total: |
164.5 (T) |
|
|
37.8 |
|
|
|
30.2 |
|
|
| Mean: |
41.1 |
0.37 |
|
9.5 |
0.26 |
0.06 |
|
7.6 |
0.26 |
0.05 |
1 Map unit corresponds to "high" hazard potential as defined by gradient-curvature class (see Table 2).
2 Map unit corresponds to "high" hazard potential as defined by precipitation rules (see Table 7).
Table 11. Type II model errors, in which each model predicts that shallow landslides likely have a high probability of
occurring, whereas field-derived maps of hazard zonation indicate that there is a low likelihood of
landsliding.
| Test Basin |
Mass-Wasting Map Unit
Data |
SMORPH Model |
|
SHALSTAB Model |
|
| Basin Area
with Low
Hazard Rating
(km2)
(A) |
Total
Basin
Acres
(%) |
Map
Unit
No.1 |
Basin Area
Predicted
with High
Hazard
Rating (km2)
(M) |
(A/M)
= P |
E =
P(A/T) |
Map
Unit
No.2 |
Basin Area
Predicted
with High
Hazard
Rating (km2)
(M) |
(A/M)
= P |
E =
P(A/T) |
| Jordan-Boulder |
59.6 |
0.45 |
3 |
9.1 |
0.15 |
0.03 |
1, 2,
3, 4 |
17.8 |
0.30 |
0.05 |
| Hazel |
18.4 |
0.19 |
3 |
6.3 |
0.34 |
0.02 |
1, 2,
3 |
14.3 |
0.78 |
0.04 |
| Sol Duc River |
182.1 |
0.99 |
3 |
19.0 |
0.10 |
0.05 |
1, 2,
3, 4 |
26.3 |
0.14 |
0.08 |
| Upper East
Fork Lewis
River |
72.0 |
0.89 |
3 |
11.7 |
0.16 |
0.03 |
1, 2,
3, 4 |
30.8 |
0.43 |
0.09 |
| Total: |
332.1 (T) |
|
|
46.1 |
|
|
|
89.2 |
|
|
| Mean: |
83.0 |
0.63 |
|
11.5 |
0.19 |
0.03 |
|
22.3 |
0.41 |
0.07 |
1 Map unit corresponds to "low" hazard potential as defined by gradient-curvature class (see Table 2).
2 Map unit corresponds to "low" hazard potential as defined by precipitation rules (see Table 7).
Table 12. Comparison of model performance in correctly and incorrectly predicting
landslide potential. For each model, slope-stability ratings of each DEM
grid cell were compared with the landslide-inventory database. A
numerical value was assigned to each of three possible database-intersection outcomes, as described in the text.
SMORPH Model |
SHALSTAB Model |
SOILS Screen |
| Test Basins |
Number
of slides
|
Calibrated
model
value |
Normalized
calibrated
value |
c=2 kN/m2
= 33
|
Normalized
value
|
Modeled
value
|
Normalized
Value |
| Jordan-Boulder |
155 |
5 |
0.03 |
11 |
0.07 |
80 |
0.52 |
| Upper N. F.
Stillaguamish |
215 |
15 |
0.07 |
50 |
0.23 |
404 |
1.88 |
| Hazel |
117 |
3 |
0.03 |
84 |
0.72 |
68 |
0.58 |
| Sol Duc |
101 |
11 |
0.11 |
26 |
0.26 |
12 |
0.12 |
| Middle Hoh |
733 |
155 |
0.21 |
177 |
0.24 |
134 |
0.18 |
| Morton |
134 |
28 |
0.21 |
44 |
0.33 |
128 |
0.96 |
| Chehalis
Headwaters |
980 |
49 |
0.05 |
40 |
0.04 |
618 |
0.63 |
| Lewis |
89 |
9 |
0.10 |
2 |
0.02 |
178 |
2.00 |
| Total:
|
2524
|
275
|
0.81
|
434
|
1.91
|
1622
|
6.87
|
References
Benda, L.E. and T.W. Cundy, 1990, Predicting deposition of debris flows in mountain channels. Canadian Geotechnical
Journal 23: 409-417.
Coho, C., 1997, Mass-Wasting Assessment. IN Jordan-Boulder Watershed Analysis, Appendix A, Wash. Dept. Natural
Resources, Olympia, WA. Appendix with maps.
Dietrich, W.E. and T. Dunne, 1978, Sediment budget for a small catchment in mountainous terrain. Zietschrift fur
geomorphologie, suppl. bd. 29: 191-206.
Dragovich, J.D. and M.J. Brunengo, 1995, Landslide map and inventory, Tilton River - Mineral Creek area, Lewis County,
Washington. Wash. Dept. Natural Resources, Div. of Geology and Earth Resources, Open-File Report 95-1, 165 pp.
Dietrich, W.E., Wilson, C.J., and S.L. Reneau, 1986, Hollows, colluvium, and landslides in soil-mantled landscapes. IN
Hillslope Processes, A.D. Abrahams (ed.), Allen and Unwin, Winchester, MA., p. 361-388.
Dietrich, W.E., Wilson, C.J., Montgomery, D.R., and J. McKean, 1993, Analysis of erosion thresholds, channel networks,
and landscape morphology using a digital terrain model. Journal of Geology 101: 259-278.
Environmental Systems Research Institute, 1992, Arc/Info Users Guide. Chap. 6.1, Grid command references, 1 vol.
Gerstel, W.J., 1996, The upside of the landslides of February 1996 - Validating a stability analysis of the Capitol Bluffs,
Olympia, Washington. Washington Geology 24(3): 3-16.
Hammond, C., Hall, D., Miller, S., and Swetik, P. 1992, Level 1 stability analysis (LISA). Documentation for version 2.0.
USDA Forest Service, Gen. Techn. Report INT-285, 190 pp.
Hoh Tribe and Washington Department of Natural Resources (WDNR), 1993, Forest agreement related to the Hoh River,
Kalaloch Creek, and Nolan Creek drainages: Memorandum of understanding, and supporting documents. Wash. Dept.
Natural Resources, Forest Resources Div., Olympia, WA.
MathSoft, Inc., 1998, S-Plus 5: Guide to Statistics. Chap. 3, Statistical inference for one and two sample problems,
MathSoft Inc., Data Analysis Products Div., 702 pp.
Miller, J.F. et al., 1973. Precipitation-Frequency Atlas of the Western United States. U.S. Department of Commerce,
National Oceanic and Atmospheric Admin. (NOAA), vol. IX.
Montgomery, D.R. and W.E. Dietrich, 1994, A physically based model for the topographic control on shallow landsliding.
Water Resources Research 30(4): 1153-1171.
Montgomery, D.R, Dietrich, W.E., Torres, R., Anderson, S.P., Heffner, J.T., and Loague, K., 1997, Hydrologic response
of a steep unchanneled valley to natural and applied rainfall. Water Resources Research 33, 215-230.
Montgomery, D.R., Sullivan, K., and H.M. Greenberg, 1998, Regional test of a model for shallow landsliding.
Hydrological Processes 12: 943-955.
Murray Pacific Corporation, 1998, West Fork Tilton Watershed Analysis. Murray Pacific Corporation, Tacoma, WA, 1
volume with appendices.
Murray Pacific Corporation, 1998, Nineteen Creek Watershed Analysis. Murray Pacific Corporation, Tacoma, WA, 1
volume with appendices.
O'Connor, M. and T.W. Cundy, 1993, North Fork Calawah River watershed condition survey: Landslide inventory and
geomorphic analysis of mainstem alluvial system. Part I: Landslide inventory and geomorphic analysis of mass erosion.
Report prepared for USDA Forest Service, Olympic National Forest, Olympia, WA., 17 pp.
O'Loughlin, E.M., 1986, Prediction of surface saturation zones in natural catchments by topographic analysis. Water
Resources Research 22(5): 794-804.
Pack, R.T., Tarboton, D.G., and C.N. Goodwin, 1998, Terrain stability mapping with SINMAP. Technical description and
users guide for version 1.00. Terratech Consulting Ltd., Report No. 4114-0, Salmon Arm, B.C., Canada.
Perkins, S. And B.D. Collins, 1997, Landslide and channel response inventory for the Stilliguamish watershed,
Snohomish and Skagit Counties, Washington. Report prepared for the Stilliguamish Tribe of Indians, Washington Dept.
of Ecology, Snohomish County Dept. of Public Works, and the Tulalip Tribes Nat. Res. Dept. Tulalip Tribes, Natural
Resources Dept., Marysville, WA., August 1997, 23 pp. with appendices.
Reddi, L.N. and T.H. Wu, 1991, Probabilistic analysis of ground-water levels in hillside slopes. Journal of Geotechnical
Engineering 117(6): 872-890.
Schuster, J.E., compiler, 1992, Geologic Map of Washington. Wash. Dept. Natural Resources, Div. Geology and Earth
Resources, Olympia, WA.
Shaw, S.C. and D.H. Johnson, 1995, Slope morphology model derived from digital
elevation data. Proceedings, Northwest Arc/Info Users Conference, Coeur d'Alene, ID.,
Oct. 23-25, 1995, 12 pp.
Sidle, R.C., Pearce, A.J., and C.L. O'Loughlin, 1985, Hillslope Stability and Land Use.
Chapter 4, Natural factors affecting slope stability, pp. 31-72. Water Resources
Monograph Series 11, American Geophysical Union, Washington, D.C., 140 pp.
Swanson, F.J. and R.L. Fredricksen, 1982, Sediment routing and budgets: Implications
for judging impacts of forest practices. IN Sediment Budgets and Routing in Forested
Drainage Basins, USDA Forest Service, Gen. Techn. Report PNW-141, Portland, OR.,
p. 129-137.
Swanson, F.J., Swanson, M.M., and C. |