Automated Water Extraction Index: A new technique for surface water
mapping using Landsat imagery
Gudina L. Feyisa a,⁎, Henrik Meilby a, Rasmus Fensholt b, Simon R. Proud b
a Department of Food and Resource Economics, University of Copenhagen, Rolighedsvej 23, DK-1958 Frederiksberg C, Denmark
b Department of Geosciences and Natural Resource Management, University of Copenhagen, Oester Voldgade 10, DK-1350 Copenhagen K, Denmark
Article info
Article history:
Received 16 April 2013
Received in revised form 3 August 2013
Accepted 21 August 2013
Available online 17 September 2013
Keywords:
Classification accuracy
Threshold stability
Subpixel
Mixed pixel
Abstract
Classifying surface cover types and analyzing changes are among the most common applications of remote
sensing. One of the most basic classiﬁcation tasks is to distinguish water bodies from dry land surfaces. Landsat
imagery is among the most widely used sources of data in remote sensing of water resources; and although
several techniques of surface water extraction using Landsat data are described in the literature, their application
is constrained by low accuracy in various situations. Besides, with the use of techniques such as single band
thresholding and two-band indices, identifying an appropriate threshold yielding the highest possible accuracy
is a challenging and time consuming task, as threshold values vary with location and time of image acquisition.
The purpose of this study was therefore to devise an index that consistently improves water extraction accuracy
in the presence of various sorts of environmental noise and at the same time offers a stable threshold value. Thus
we introduced a new Automated Water Extraction Index (AWEI) improving classiﬁcation accuracy in areas that
include shadow and dark surfaces that other classiﬁcation methods often fail to classify correctly. We tested the
accuracy and robustness of the new method using Landsat 5 TM images of several water bodies in Denmark,
Switzerland, Ethiopia, South Africa and New Zealand. Kappa coefﬁcient, omission and commission errors were
calculated to evaluate accuracies. The performance of the classiﬁer was compared with that of the Modiﬁed Normalized
Difference Water Index (MNDWI) and Maximum Likelihood (ML) classiﬁers. In four out of ﬁve test sites,
classification accuracy of AWEI was significantly higher than that of MNDWI and ML (P-value < 0.01). AWEI improved
accuracy by lessening commission and omission errors by 50% compared to those resulting from MNDWI
and about 25% compared to ML classiﬁers. Besides, the new method was shown to have a fairly stable optimal
threshold value. Therefore, AWEI can be used for extracting water with high accuracy, especially in mountainous
areas where deep shadow caused by the terrain is an important source of classiﬁcation error.
© 2013 Elsevier Inc. All rights reserved.
1. Introduction
Environmental changes and their impacts on natural systems and
human societies are topics of research in a wide range of scientiﬁc ﬁelds.
Surface water is among the most vital earth resources undergoing
changes in time and space as a consequence of land use/cover (LULC)
changes, climate change and other forms of environmental changes in
many parts of the world. The ecological, social, health and economic
effects of surface water changes have been the subject of academic
study for many years (Alderman, Turner, & Tong, 2012; Bond, Lake, &
Arthington, 2008; Charron et al., 2004; Kondo et al., 2002; Lake, 2003;
Li, Wu, Dai, & Xu, 2012; Sun, Sun, Chen, & Gong, 2012). Changes in
surface water may result in disasters such as ﬂooding, outbreaks of
waterborne disease and water shortage in dry tropical areas, which
may involve loss of lives. Timely monitoring and delivery of data on
the dynamics of surface water are, therefore, essential for policy and
decision making processes (Giardino, Bresciani, Villa, & Martinelli,
2010; Morss, Wilhelmi, Downton, & Gruntfest, 2005).
Remote sensing has become an important source of information in
analyzing and delivering data on changes in different earth resources,
and surface water in particular. Examples of studies applying remote
sensing and GIS techniques for various applications in relation to water
resources include ﬂood hazard/damage assessment and management
(Dewan, Islam, Kumamoto, & Nishigaki, 2007; Ji, Zhang, & Wylie, 2009;
Proud, Fensholt, Rasmussen, & Sandholt, 2011), change in surface
water resources (Gardelle, Hiernaux, Kergoat, & Grippa, 2009; Haas,
Bartholomé, & Combal, 2009; Prigent et al., 2012), water quality assessment
and monitoring (Guttler, Niculescu, & Gohin, 2013; He et al.,
2012; Novoa et al., 2012), and water-related disease epidemiology
(Charoenpanyanet & Chen, 2008; Dambach et al., 2012; Lacaux, Tourre,
Vignolles, Ndione, & Lafaye, 2007).
Satellite sensors of varying spatial, temporal and spectral resolution
have been used to extract and analyze information regarding surface
water. Landsat satellites are among the most widely used optical
Remote Sensing of Environment 140 (2014) 23–35
⁎ Corresponding author. Tel.: +45 91414185; fax: +45 353 31508.
E-mail addresses: fgudina@gmail.com, fgudina@ifro.ku.dk (G.L. Feyisa),
heme@ifro.ku.dk (H. Meilby), rf@geo.ku.dk (R. Fensholt), srp@geo.ku.dk (S.R. Proud).
0034-4257/$ – see front matter © 2013 Elsevier Inc. All rights reserved.
http://dx.doi.org/10.1016/j.rse.2013.08.029
sensors in surface water and other environmental research. The use of
these remotely sensed data commonly starts with classiﬁcation of
land use/cover types. Common water classiﬁcation methods for optical
imagery could be categorized into four basic types (Ji et al., 2009):
(a) thematic classiﬁcation (Lira, 2006), (b) linear unmixing (Sethre,
Rundquist, & Todhunter, 2005), (c) single-band thresholding (Jain,
Singh, Jain, & Lohani, 2005) and (d) two-band spectral water indices
(Jain, Saraf, Goswami, & Ahmad, 2006; McFeeters, 1996; Rogers &
Kearney, 2004; Xu, 2006). Combinations of various methods have also
been proposed to improve water extraction accuracy; examples include Jiang,
Qi, Su, Zhang, and Wu (2012), Sheng, Shah, and Smith (2008), Sun
et al. (2012) and Verpoorter, Kutser, and Tranvik (2012). Single-band
thresholding and two-band indices are commonly used water extraction
methods because of ease of use and the fact that these methods
are computationally less time-consuming than alternative approaches
(Ryu, Won, & Min, 2002).
McFeeters (1996) introduced the Normalized Difference Water
Index (NDWI) to delineate open water features using the green (band
2) and near-infrared (band 4) of Landsat TM. Rogers and Kearney
(2004) used another NDWI for water extraction where they applied
bands 3 and 5 of Landsat TM. McFeeters (1996) proposed a threshold
of 0 for extracting surface water using the raw digital number of
Landsat, where all positive NDWI values would be classiﬁed as water
and negative values as nonwater. However, Xu (2006) found that the
NDWI cannot efﬁciently suppress the signal from built-up surfaces
and using an NDWI threshold of 0 does not accurately enable discriminating
built-up surfaces from water pixels. Xu (2006) therefore proposed
another index, called Modiﬁed Normalized Difference Water
Index (MNDWI), where McFeeters (1996) NDWI was modiﬁed by
replacing band 4 by band 5 of Landsat 5 TM. The MNDWI of Xu (2006)
is one of the most widely used water indices for various applications,
including surface water mapping, land use/cover change analyses
and ecological research (Davranche, Lefebvre, & Poulin, 2010; Duan
& Bastiaanssen, 2013; Hui, Xu, Huang, Yu, & Gong, 2008; Poulin,
Davranche, & Lefebvre, 2010).
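The two normalized difference indices described above can be sketched per pixel as follows. This is a minimal illustration, not the authors' implementation: the function names, the zero-denominator fallback and the example reflectance values are assumptions made for the sketch, and only the band pairings (green with NIR for NDWI, green with band 5 SWIR for MNDWI) and the threshold of 0 come from the text.

```python
def normalized_difference(a: float, b: float) -> float:
    """Return (a - b) / (a + b); fall back to 0.0 if the denominator is zero."""
    total = a + b
    if total == 0:
        return 0.0  # assumed convention for empty pixels, not from the paper
    return (a - b) / total


def ndwi(green: float, nir: float) -> float:
    """NDWI of McFeeters (1996): Landsat TM band 2 (green) and band 4 (NIR)."""
    return normalized_difference(green, nir)


def mndwi(green: float, swir_band5: float) -> float:
    """MNDWI of Xu (2006): band 4 of NDWI is replaced by band 5 (SWIR)."""
    return normalized_difference(green, swir_band5)


def is_water(index_value: float, threshold: float = 0.0) -> bool:
    """Label a pixel as water when its index value exceeds the threshold."""
    return index_value > threshold


# Hypothetical reflectances, chosen only to exercise the functions.
water_like = {"green": 0.05, "nir": 0.02, "swir": 0.01}
soil_like = {"green": 0.05, "nir": 0.20, "swir": 0.15}
print(is_water(mndwi(water_like["green"], water_like["swir"])))
print(is_water(mndwi(soil_like["green"], soil_like["swir"])))
```

Both indices share the same arithmetic and differ only in the second band, which is why the choice of threshold rather than the formula tends to dominate their practical behaviour.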
Even though a number of water extraction techniques are described
in the literature, the choice between them is constrained by accuracy
problems. Environmental monitoring and change detection techniques
such as post-classiﬁcation comparison are likely to be less reliable
when classiﬁers of low accuracy are used (Congalton & Green, 2009;
Mucher, Steinnocher, Kressler, & Heunks, 2000). For instance, in a
study focusing on water dynamics monitoring, Ji et al. (2009) faced
two major problems in appropriately using water indices: ﬁrst, the results
obtained using different indices were inconsistent and unreliable;
second, the threshold values applied to distinguish water from nonwater
were unstable, varying with scene and locations. These authors
compared four different water indices using simulated datasets of
four satellite sensors: Landsat ETM+, Système Pour l'Observation de
la Terre (SPOT), the Advanced Space-borne Thermal Emission and
Reﬂection radiometer (ASTER), and the Moderate Resolution Imaging
Spectroradiometer (MODIS), aiming to identify the best method for
delineating water features. Among the four alternatives, they found
that the MNDWI performed best in delineating water, and featured
the most stable threshold.
Water classiﬁcation accuracy problems may be especially pronounced
in areas where the background land cover includes low albedo
surfaces such as asphalt roads in urban areas, and shadows from
mountains, buildings and clouds. The presence of shadows may cause
misclassiﬁcation due to the similarity in reﬂectance patterns, and this
may lessen the accuracy of surface water mapping and change analysis
(Frey, Huggel, Paul, & Haeberli, 2010; Verpoorter et al., 2012; Xu, 2006).
In environments where nonwater dark surfaces are found, simple classiﬁcation
methods such as two-band water indices and single-band
thresholding may not sufﬁciently and accurately distinguish between
water pixels and nonwater dark surfaces, particularly shadows
(Verpoorter et al., 2012). In a study of land cover dynamics using
Landsat TM data, we noted accuracy problems due to failure of existing
water extraction methods in accurately distinguishing water from
shadows and low albedo urban surfaces. Particularly, no existing
water index was able to automatically separate water and shadowed
surfaces. In this paper, therefore, we introduce a multiple-band index
called Automated Water Extraction Index (AWEI), with the objectives
to: (a) improve accuracy of surface water mapping by automatically
suppressing classiﬁcation noise from shadow and other nonwater
dark surfaces, and (b) test the robustness of the new method under
different environmental conditions and evaluate its relative accuracy
in comparison with existing classiﬁcation techniques.
2. Study areas and data sources
2.1. Test sites
The accuracy and robustness of the Automated Water Extraction
Index (AWEI) were tested considering several lakes and other water
bodies in different environmental conditions ranging from humid temperate
through sub-tropical to tropical dry regions. The test water bodies
were obtained from ﬁve different countries: Denmark, Switzerland,
Ethiopia, South Africa and New Zealand. The water bodies, which include
small freshwater reservoirs, large lakes, harbors and the sea, differ
with regard to depth, turbidity, chemical composition and surface appearance.
A summary of the basic characteristics of the test sites is
shown in Table 1.
The test sites were deliberately selected so that the sub-scenes
consist of complex surface features, such as hill shade, built-up areas
and other dark surfaces as background to the water bodies. The test
sites in Switzerland, Ethiopia and South Africa are characterized by the
presence of built-up surfaces and shadows of mountains. The site in
Denmark also consists primarily of urban background but with no
major shadow problems since the terrain is predominantly ﬂat and
tall buildings in the urban area are rare. The test site in New Zealand
consists of mountain slopes with deep shadows, but no major urban
surfaces are included.
In addition to the ﬁve test sites for which detailed accuracy analyses
and comparisons were carried out, further validation of the robustness
of the new index was undertaken considering shadow-dominated
water bodies in Norway, rivers with urban surfaces and shadows from
tall buildings in Shanghai, China, and several crater lakes with built-up
background surfaces in Bishoftu, Ethiopia. However, these additional
test sites were not analyzed in detail and classiﬁcation output from
these sites is not included in the Results section; instead, the classiﬁcation
maps are included in Appendix A for visual inspection of classiﬁcation
accuracy.
2.2. Landsat images
Landsat 5 TM images were acquired from the USGS GLOVIS portal
(United States Geological Survey (USGS), 2012). All Landsat images
used are of product type L1T and with a scene quality score of 9,
which means perfect scenes with no errors detected. The images were
also georeferenced with precision better than 0.4 pixels (NASA, 2012).
The sub-scenes were all free of clouds. Descriptions of the Landsat
images are presented in Table 2.
2.3. Reference data
Reference data used in accuracy assessment are described in Table 2.
For the test site in Denmark, colored Digital Orthophoto Quadrangles
(DOQs) from year 2010 were used as reference. These aerial photos
have a spatial resolution of 12.5 cm and location accuracy better than
0.5 m (COWI, 2010). For the four other test sites, high spatial resolution
images provided by Google Earth™ were used for reference. The acquisition
dates of the reference data and the Landsat 5 TM images were
closely matched to minimize bias in the surface water boundaries that
could arise because of large differences in time. The dates of acquisition
of the Landsat images and reference data are shown in Table 2.
The “true” boundaries of all the test water bodies were digitized
manually on-screen from the reference data. In the analysis, the manually
digitized water map was used to assess the accuracy of the different
water extraction methods when applied to the Landsat images.
3. Methods
3.1. Image preprocessing
Landsat 5 TM images acquired in the form of raw digital number
were calibrated to surface reﬂectance values. Atmospheric correction
was applied to all images using the Fast Line-of-Sight Atmospheric Analysis
of Spectral Hypercubes (FLAASH) module in ENVI v. 4.8 (Exelis
Visual Information Solutions, 2010). Aerosol Optical Depth (AOD) and
total Column Water Vapor data to be used in the FLAASH atmospheric
correction module were retrieved from MODIS Terra atmospheric products
(Jimenez-Munoz, Sobrino, Mattar, & Franch, 2010) for each of the
ﬁve test sites. The MODIS Terra products that had been acquired on
the same date as the Landsat images were used to retrieve the calibration
constants. Initial visibility was estimated using the aerosol optical
thickness obtained from MODIS data using Eq. (1), where, VIS = initial
visibility, Z = mixing layer height and AOD = Aerosol Optical Depth.
The mixing layer height values for the test sites were not available.
Therefore, a daytime mixing layer height of 1.5 km was used for all
test sites (typical values range from 1 to 2 km (Butcher, Charlson,
Orians, & Wolfe, 1992)). Water vapor calibration constants for each of
the Landsat images are summarized in Table 3. The overpass time of
Landsat TM and MODIS Terra at each test site was closely matched
(less than 2 h difference).
\[ \mathrm{VIS} = \frac{3.912}{\mathrm{AOD}} \times Z \qquad (1) \]
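As a check on Eq. (1), the relation can be evaluated directly. The sketch below is illustrative only; the function name and the input validation are assumptions, while the constant 3.912 and the 1.5 km mixing layer height are taken from the text.

```python
def initial_visibility_km(aod: float, mixing_layer_height_km: float = 1.5) -> float:
    """Estimate initial visibility (km) from aerosol optical depth, as in Eq. (1)."""
    if aod <= 0:
        raise ValueError("AOD must be positive")
    return 3.912 / aod * mixing_layer_height_km


# AOD values as listed in Table 3 for the five test sites.
for site, aod in [("Denmark", 0.33), ("Switzerland", 0.08), ("Ethiopia", 0.10),
                  ("South Africa", 0.033), ("New Zealand", 0.12)]:
    print(f"{site}: {initial_visibility_km(aod):.1f} km")
```

With the 1.5 km mixing layer height, these inputs give values that agree with the visibility column of Table 3 to within rounding.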
Image-to-image co-registration between the reference data and
Landsat images was undertaken for the test sites in Denmark and
Ethiopia. Manual co-registration was performed with a Root Mean
Square Error (RMSE) of less than 0.4 pixels. At least 25 control points
were used for co-registration of each image with the reference data.
The co-registration between Landsat and Google Earth™ images at the
rest of the test sites was already highly accurate and hence there
was no need for manual co-registration. Since the L1T Landsat TM and
ETM+ products are geometrically corrected (NASA, 2012), no such
corrections were applied in the pre-processing.
3.2. Pure-pixel selection
An independent set of “pure” pixel reﬂectance values of nine major
land cover types was sampled from the six reﬂective bands of a Landsat
5 TM image of Addis Ababa, acquired on Dec. 9, 2010. The land cover
types are: water, vegetation (forest and non-forest), bright soil, dark
soil, brown soil, bright built, asphalt, other dark built and shadow.
Spectral data from these pure pixels were used to examine reﬂectance
patterns and identify land cover types that affect water extraction accuracy,
aiming to design a method that accurately discriminates between
Table 1
Characteristics of the study sites. The source of climate information is http://www.climatedata.eu/.

| Country | Water bodies | Center point coordinate (UTM) | Area (ha) | General characteristics of water bodies | Mean alt. (m) | Topography | Climate |
|---|---|---|---|---|---|---|---|
| Denmark | Several artificial lakes, a harbor and the sea (Øresund and Køge Bugt) | 6,172,085 m N, 12°34′57.42″E | 2085 | Shallow clear artificial lakes, clear seas | 9 | Predominantly flat | Temperate |
| Switzerland | Lake Lauerz | 5,209,030 m N, 469,608 m E | 289 | Clear lake | 1100 | Mountainous | Temperate |
| | Ågeri lake | 5,218,774 m N, 471,530 m E | 719 | Clear lake | | | |
| | Sihl lake | 5,218,191 m N, 484,028 m E | 1034 | Clear lake | | | |
| | Wägitaler lake | 5,214,616 m N, 494,092 m E | 402 | Clear lake | | | |
| | Klöntaler lake | 5,207,839 m N, 498,040 m E | 309 | Clear lake | | | |
| Ethiopia | Gefersa | 1,002,432 m N, 459,709 m E | 144 | Clear reservoir | 2377 | Mountainous | Tropical dry |
| | Dire | 1,011,794 m N, 493,000 m E | 106 | Turbid reservoirs | | | |
| | Legedadi | 1,002,374 m N, 497,446 m E | 423 | Turbid reservoirs | | | |
| South Africa | Berg river | 6,244,161 m S, 320,605 m E | 426 | Clear reservoir | 600 | Rugged hilly | Subtropical semi-arid |
| | Wemmershoek | 6,255,473 m S, 323,355 m E | 195 | Clear reservoir | | | |
| | Brandvlei | 6,265,857 m S, 354,413 m E | 3097 | Clear reservoir | | | |
| New Zealand | Lake Te Anau | 5,004,239 m S, 723,800 m E | 6495 | Large clear lake | 800 | Rugged hilly | Humid temperate |
Table 2
Description of Landsat TM scenes and corresponding reference data.

| Test site | Landsat acquisition date | Path | Row | Reference data and sources |
|---|---|---|---|---|
| Water bodies in Copenhagen, Denmark | June 29, 2010 | 195 | 21 | Colored Digital Orthophoto Quadrangles acquired between May and July 2010, ©COWI |
| Lakes in Switzerland | Sept 30, 2009 | 195 | 27 | Google Earth™ image acquired on Jul 1, 2009 |
| Reservoirs in Addis Ababa, Ethiopia | Dec 9, 2010 | 168 | 54 | Google Earth™ image acquired on Oct 13 and Dec 20, 2010, ©Digital globe, CNES/SPOT Image |
| Reservoirs in South Africa | Mar 29, 2010 | 175 | 83 | Google Earth™ image acquired on Jan 4, Jan 7 and Jan 18, 2010, ©GeoEye and AfriGIS |
| A lake in New Zealand | Feb 3, 2010 | 76 | 91 | Google Earth™ image acquired on Apr 09, 2010 and Feb 16, 2011, ©GeoEye |
such surfaces and water. The pure pixel data were intended neither for
classiﬁcation nor accuracy assessment and therefore, only the image of
Addis Ababa and its surroundings was used for pure pixel extraction.
The reason for choosing Addis Ababa for pure pixel extraction was
that this area includes all the major challenging features inﬂuencing
water extraction accuracy: shadow, dark built-up surfaces and other
low albedo surfaces such as black soil.
The methods used to extract pure pixels of the selected land cover
types include spectral feature space scatter plot from Minimum Noise
Fraction Transform (MNFT) images, Pixel Purity Index (PPI), manual
digitization from images accessed through Google Earth™, ground-based
land cover assessment and the familiarity of the first author
with the local area. Pure pixel samples for water were taken from the
middle of lakes to avoid mixed edge pixels. Similarly, closed-canopy high
forest in Menagesha national forest was used for
sampling vegetation pure pixels. Pure pixels of built-up land cover
were sampled from homogenous surfaces such as airport runways and
large warehouse roofs in Addis Ababa city. Since land cover types in
Addis Ababa are highly heterogeneous, detection of pure built pixels
was assisted by PPI and spectral feature space in ENVI v.4.8 (Exelis
Visual Information Solutions, 2010). Shadow pixels were extracted by
thresholding hill-shade images derived from elevation data using the
ASTER Digital Elevation Model (DEM) in mountainous parts of the test
site. Homogenous agricultural ﬁelds with exposed black, brown and
bright soils were also sampled from the outskirts of the city.
For each land cover type, 312 pure pixels were extracted from the six
reﬂective bands of the Landsat 5 TM images. Average reﬂectance values
of the pure pixels are shown in Fig. 1. Separability of the spectral
signatures of the selected nine major land cover types was tested by
Jeffries–Matusita's pairwise separability measure (Richards, 1993) in
ENVI v. 4.8. All pairs of land cover types were found to be separable
with values ranging from 1.89 to 2.0.
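To illustrate how a Jeffries–Matusita value between 0 and 2 arises, the sketch below computes the measure for a single band under a Gaussian assumption. It is a simplified, hypothetical example: the separability reported above was computed in ENVI, presumably across all six bands, whereas this version uses one band only, and the sample reflectances are made up.

```python
import math
from statistics import mean, variance
from typing import Sequence


def bhattacharyya_1d(mean1: float, var1: float, mean2: float, var2: float) -> float:
    """Bhattacharyya distance between two univariate Gaussian classes."""
    if var1 <= 0 or var2 <= 0:
        raise ValueError("variances must be positive")
    mean_term = 0.25 * (mean1 - mean2) ** 2 / (var1 + var2)
    var_term = 0.5 * math.log((var1 + var2) / (2.0 * math.sqrt(var1 * var2)))
    return mean_term + var_term


def jeffries_matusita_1d(samples1: Sequence[float], samples2: Sequence[float]) -> float:
    """Single-band JM distance; 0 means identical classes, 2 means fully separable."""
    b = bhattacharyya_1d(mean(samples1), variance(samples1),
                         mean(samples2), variance(samples2))
    return 2.0 * (1.0 - math.exp(-b))


# Hypothetical NIR reflectances for two classes (illustrative values only).
water_nir = [0.020, 0.030, 0.025, 0.022]
forest_nir = [0.35, 0.36, 0.34, 0.37]
print(round(jeffries_matusita_1d(water_nir, forest_nir), 3))
```

The measure saturates at 2 as the class means move apart relative to their spread, which is why values near 2 are read as complete separability.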
3.3. Formulation of the Automated Water Extraction Index (AWEI)
Five spectral bands of Landsat 5 TM were used in developing the
new index (AWEI) to increase the contrast between water and other
dark surfaces. The primary aim of the formulation of AWEI was to
maximize separability of water and nonwater pixels through band
differencing, addition and applying different coefﬁcients. Accordingly,
two separate equations are proposed to effectively suppress nonwater
pixels and extract surface water with improved accuracy (Eqs. (2) and
(3)). The coefﬁcients used in Eqs. (2) and (3) and the arithmetic combinations
of the chosen spectral bands were determined based on critical
examination of the reﬂectance properties of various land cover types.
The coefﬁcients of these equations are empirical results determined
based on reﬂectance patterns observed across the dataset of pure pixels
of various land cover types. An iterative process was applied to identify
parameters that maximize the separability of water and nonwater surfaces
characterized by low reﬂectance. In the ﬁnal index, the coefﬁcients
were rounded for ease of use. Particular emphasis was given to the enhancement
of the separability of water and dark surfaces such as shadow
and built-up structures that are often difﬁcult to distinguish due to
similarities in reﬂectance patterns. In addition to enhancing separability
of water and nonwater pixels, the choice of the coefﬁcients also aimed
to stabilize the threshold needed to distinguish water from nonwater
pixels by forcing nonwater pixels below 0 and water pixels above 0,
implying that 0 could be used as a reasonable starting threshold for
classifying land cover into binary classes of water and nonwater under
a wide range of environmental conditions.
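\[ \mathrm{AWEI}_{nsh} = 4 \times (\rho_{band2} - \rho_{band5}) - (0.25 \times \rho_{band4} + 2.75 \times \rho_{band7}) \qquad (2) \]
\[ \mathrm{AWEI}_{sh} = \rho_{band1} + 2.5 \times \rho_{band2} - 1.5 \times (\rho_{band4} + \rho_{band5}) - 0.25 \times \rho_{band7} \qquad (3) \]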
where ρ is the reﬂectance value of spectral bands of Landsat 5 TM:
band 1 (blue), band 2 (green), band 4 (NIR), band 5 (SWIR) and band
7 (SWIR).
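Eqs. (2) and (3) translate directly into per-pixel arithmetic. The following sketch is one possible rendering; the function and argument names are assumptions, and the example reflectances are hypothetical values picked to show the sign behaviour rather than measurements from this study.

```python
def awei_nsh(band2: float, band4: float, band5: float, band7: float) -> float:
    """Eq. (2): AWEI for scenes where shadow is not a major problem."""
    return 4.0 * (band2 - band5) - (0.25 * band4 + 2.75 * band7)


def awei_sh(band1: float, band2: float, band4: float, band5: float, band7: float) -> float:
    """Eq. (3): AWEI intended to suppress shadow and other dark surfaces."""
    return band1 + 2.5 * band2 - 1.5 * (band4 + band5) - 0.25 * band7


# Hypothetical surface reflectances (Landsat 5 TM bands 1, 2, 4, 5, 7).
water = dict(band1=0.06, band2=0.05, band4=0.02, band5=0.01, band7=0.01)
vegetation = dict(band1=0.03, band2=0.05, band4=0.35, band5=0.15, band7=0.07)

for name, px in [("water", water), ("vegetation", vegetation)]:
    nsh = awei_nsh(px["band2"], px["band4"], px["band5"], px["band7"])
    sh = awei_sh(px["band1"], px["band2"], px["band4"], px["band5"], px["band7"])
    print(f"{name}: AWEInsh={nsh:.4f}, AWEIsh={sh:.4f}")
```

For these inputs the water-like pixel is positive in both indices and the vegetation-like pixel is strongly negative, which is the behaviour the coefficients are meant to produce around a threshold of 0.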
AWEInsh is an index formulated to effectively eliminate nonwater
pixels, including dark built surfaces in areas with urban background
and AWEIsh is primarily formulated for further improvement of accuracy
by removing shadow pixels that AWEInsh may not effectively eliminate.
The subscript “nsh” in Eq. (2) is included to specify that the index is
suited for situations where shadows are not a major problem. The subscript
“sh” in Eq. (3) indicates that the equation is intended to effectively
eliminate shadow pixels and improve water extraction accuracy in
areas with shadow and/or other dark surfaces. But in areas with highly
reﬂective surfaces such as ice, snow and reﬂective roofs in urban areas,
Eq. (3) may misclassify such surfaces as water.
In Eq. (2), quadrupling the difference between the band 2 and band 5
results in large positive values for water pixels and negative values for
most nonwater pixels. To help in discriminating water from other
surfaces that have similar spectral patterns, band 4 and band 7 are
subtracted from the result and different weights are assigned to these
bands to force nonwater pixels to have even larger negative values;
this subtraction has little effect on the values
of water pixels because water has very low reflectance in the spectral
ranges of band 4 and band 7. The equation results in large negative
values for pixels covered by vegetation, soil, bright built and other
surfaces that have large reﬂectance for band 4 or 7. The equation is
also intended to enhance separability between water, dark surfaces
and other nonwater surfaces. In many cases, water absorbs almost all
of the incoming radiation in bands 4, 5 and 7 and has its
highest reflectance in bands 1 and 2 of Landsat 5 TM (Lillesand,
Kiefer, & Chipman, 2004). Shadowed surfaces also have low reﬂectance
in all spectral bands, but the magnitude of reﬂectance varies due to variation
in surface characteristics and the depth of shadow. Hence, Eq. (2)
alone may not completely eliminate all types of shadows and other low
albedo surfaces. As shown in Fig. 1, for instance, subtracting band 5 from
band 2 could yield positive values for both water and shadows. Quadrupling
the difference and subtracting bands 4 and 7 may give some
shadow pixels values similar to those of water because of the
similarity in reflectance patterns within these bands, making it
difficult to exclude shadow pixels from the water class.
Due to these limitations of Eq. (2), Eq. (3) was formulated to achieve
enhanced separability of water and shadows and/or dark surfaces. It can
Table 3
Calibration values used in atmospheric correction using FLAASH.

| Test site | Date of Landsat/MODIS Terra overpass | Average water vapor column from MODIS (g/cm²) | Water vapor multiplier | Average aerosol optical thickness from MODIS | Visibility (km) |
|---|---|---|---|---|---|
| Denmark | Jun 29, 2010 | 3.00 | 1.03 | 0.33 | 17.8 |
| Switzerland | Sept 30, 2009 | 1.43 | 0.49 | 0.08 | 73.4 |
| Ethiopia | Dec 9, 2010 | 2.30 | 0.56 | 0.10 | 58.7 |
| South Africa | Mar 29, 2010 | 1.90 | 0.46 | 0.033 | 177.8 |
| New Zealand | Feb 3, 2010 | 2.67 | 0.91 | 0.12 | 48.9 |
be noted from Fig. 1 that the largest difference between the reﬂectance
of water and shadow is found in bands 1 and 2. Therefore, adding these
two bands, while at the same time multiplying band 2 by the speciﬁed
coefﬁcient, enhances the separability between water and shadow
pixels, yielding relatively large positive values for water pixels compared
to shadow pixels. Subtracting bands 4, 5 and 7 forces nonwater
pixels in the negative direction, and the net effect of this subtraction
on water pixels is minimal compared to nonwater surfaces including
shadows, which are forced considerably below zero. Band 3 was not
used in Eqs. (2) and (3) because during the preliminary tests, including
this band did not improve separability and accuracy. From the arithmetic
formulation of Eq. (3), it may be noted that the addition of the short
wave bands (bands 1 and 2) may result in large positive values for
high albedo surfaces such as ice, cloud, and highly reﬂective building
roofs. Eq. (3) may therefore not be able to distinguish these high-albedo
surfaces from water.
Therefore, the intended use of the two AWEI equations is as follows:
1) in situations where shadows are major sources of accuracy loss but
surfaces such as snow, ice and high albedo built surfaces are not present,
AWEIsh alone is proposed to automatically enhance the separability of
pixels of water from nonwater (more importantly from shadow pixels)
so that application of a threshold close to 0 is suitable for the extraction
of surface water; 2) in areas where shadows are not a major problem,
AWEInsh alone is proposed; 3) in conditions where both high albedo
surfaces and shadow/dark surfaces are found, we propose using
Eqs. (2) and (3) sequentially in a classiﬁcation tree; 4) in areas with
no shadowed areas, no dark urban backgrounds and no high-albedo
surfaces, either of the two can be used alone.
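The four usage cases above can be read as a small decision rule. The sketch below encodes one interpretation of it; the function name, the three boolean scene descriptors and the order in which overlapping cases are resolved are assumptions, not part of the original method description.

```python
def recommend_awei(shadows: bool, dark_urban: bool, high_albedo: bool) -> str:
    """Suggest which AWEI equation(s) to apply for a scene.

    shadows:     shadows are a major source of accuracy loss
    dark_urban:  dark built-up background surfaces are present
    high_albedo: snow, ice or highly reflective built surfaces are present
    """
    if not (shadows or dark_urban or high_albedo):
        return "either"               # case 4: no complicating surfaces
    if high_albedo and (shadows or dark_urban):
        return "AWEInsh then AWEIsh"  # case 3: apply both sequentially
    if shadows:
        return "AWEIsh"               # case 1: shadows, no high-albedo surfaces
    return "AWEInsh"                  # case 2: shadows are not a major problem


print(recommend_awei(shadows=True, dark_urban=False, high_albedo=False))
print(recommend_awei(shadows=True, dark_urban=True, high_albedo=True))
```

Cases 2 and 4 overlap in the text, so this sketch checks the "no complicating surfaces" case first and treats AWEInsh as the fallback.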
3.4. Classiﬁcation, threshold optimization and per-pixel accuracy assessment
At the test sites in Denmark and Ethiopia, urban background
dominates the sub-scenes. Therefore, both equations of AWEI (AWEInsh
and AWEIsh) were applied sequentially: ﬁrst, AWEInsh was applied to the
image; next, AWEIsh was used to eliminate misclassiﬁed pixels with
shadows and other dark surfaces. At the test sites in Switzerland, New
Zealand and South Africa, only AWEIsh was applied because urban
surfaces are rare in these sites.
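A sequential application of the two indices might look like the sketch below, in which a pixel is kept as water only if it passes the AWEInsh test and is not subsequently rejected by AWEIsh. This is an interpretation of the two-step procedure, not the authors' code; the function names, the dictionary layout of a pixel and the example values are hypothetical.

```python
from typing import Dict, Iterable, List

Pixel = Dict[str, float]  # assumed keys: "b1", "b2", "b4", "b5", "b7"


def awei_nsh(p: Pixel) -> float:
    """Eq. (2)."""
    return 4.0 * (p["b2"] - p["b5"]) - (0.25 * p["b4"] + 2.75 * p["b7"])


def awei_sh(p: Pixel) -> float:
    """Eq. (3)."""
    return p["b1"] + 2.5 * p["b2"] - 1.5 * (p["b4"] + p["b5"]) - 0.25 * p["b7"]


def classify_sequential(pixels: Iterable[Pixel],
                        threshold_nsh: float = 0.0,
                        threshold_sh: float = 0.0) -> List[bool]:
    """Step 1 keeps AWEInsh candidates; step 2 drops those that fail AWEIsh."""
    result = []
    for p in pixels:
        candidate = awei_nsh(p) > threshold_nsh
        result.append(candidate and awei_sh(p) > threshold_sh)
    return result


# Made-up reflectances: a water-like pixel, and a dark pixel that passes
# the first step but is removed by the second.
water = {"b1": 0.06, "b2": 0.05, "b4": 0.02, "b5": 0.01, "b7": 0.01}
dark = {"b1": 0.015, "b2": 0.03, "b4": 0.05, "b5": 0.02, "b7": 0.003}
print(classify_sequential([water, dark]))
```

Separate thresholds are kept for the two steps because the optimal value for each index may differ slightly from the default of 0.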
To compare accuracy of the proposed water extraction technique
with other methods, we made preliminary tests of various water indices
including the Water Index (WI) of Ouma and Tateishi (2006), the
Normalized Difference Water Index (NDWI) of McFeeters (1996) and
other indices that Ji et al. (2009) used in their studies. Based on this preliminary
evaluation, it appeared that all indices, except the MNDWI,
performed poorly at our test sites. We therefore only considered
MNDWI for comparison with the new index proposed in this paper. A supervised
maximum likelihood (ML) classiﬁer was also included in our
comparison as this classiﬁer is one of the most widely used methods in
land cover classiﬁcation. For the ML classiﬁer, water and nonwater training
data were produced for each test site. The minimum size of reference
datasets for training was determined using the multinomial conservative
sample size equation described in Congalton and Green (2009). The
Fig. 1. Reflectance distributions of pure pixels of major land cover types. Each box plot shows the location of the 10th, 25th, 50th, 75th, and 90th percentiles using horizontal lines (boxes and whiskers) and the circles are 5th and 95th percentiles.
[Figure: box-plot panels for Blue band (b1), Green band (b2), Red band (b3), NIR (b4), SWIR (b5) and SWIR (b7), with reflectance on the vertical axis, and for Eq. 2 (AWEInsh), Eq. 3 (AWEIsh) and MNDWI, with index value on the vertical axis. Surface categories: Forest, Water, Shadow, Non-for. green, Asphalt, Dark built, Bright built, Bright soil, Black soil, Brown soil.]
reference data were generated by digitizing multiple polygons on the
true-color composites of Landsat bands and evenly distributing the samples
across all parts of the sub-scenes. It was easy to generate large reference
data units since the classes considered are only water and nonwater
and it is relatively easy to visually distinguish between water and
nonwater surfaces from high spatial resolution images retrieved through
Google Earth™. These images were used to differentiate nonwater dark
areas from water surfaces. No separate validation data were necessary
for accuracy assessment of the ML classiﬁer since the classiﬁcation result
was compared against the true map of water.
Since the AWEI equations are formulated to enhance separability of
water and nonwater pixels by applying coefﬁcients that force nonwater
pixels below 0 and water pixels above 0, a threshold of 0 can be used as
a default starting point. But due to variation in scene brightness and
contrast with time and space, the default threshold may not always
result in the highest possible water extraction accuracy that can be
achieved by application of the index. In order to determine the optimal
threshold, multiple thresholds were considered, and for each threshold
value corresponding commission errors (over-estimation) and omission
errors (under-estimation) were calculated and the percentage
errors were plotted against threshold values. The intersection point of
commission and omission error graphs was then considered as the
optimal threshold since it approximates the minimum possible sum of
the two error types. We evaluated the stability of optimal thresholds
of the new method and of MNDWI by examining the variation of the
optimal threshold values for the two indices across the ﬁve test sites.
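The threshold search described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names `error_curves` and `optimal_threshold`, the toy index values and the candidate thresholds are all hypothetical, and the crossing point of the two error curves is approximated by the candidate threshold at which the two errors are closest. Commission error is taken as a percentage of pixels classified as water and omission error as a percentage of reference water pixels, which is an assumption consistent with the accuracy measures listed in Table A1.

```python
import numpy as np


def error_curves(index, truth, thresholds):
    """Commission and omission error (%) of `index > t` for each threshold t."""
    index = np.asarray(index, dtype=float)
    truth = np.asarray(truth, dtype=bool)   # True where the reference map is water
    n_true = truth.sum()
    commission, omission = [], []
    for t in thresholds:
        pred = index > t                    # pixels classified as water
        n_pred = pred.sum()
        # Commission: classified water that is nonwater, as % of classified water.
        commission.append(100.0 * (pred & ~truth).sum() / n_pred if n_pred else 0.0)
        # Omission: reference water that was missed, as % of reference water.
        omission.append(100.0 * (~pred & truth).sum() / n_true if n_true else 0.0)
    return np.array(commission), np.array(omission)


def optimal_threshold(thresholds, commission, omission):
    """Candidate threshold closest to the crossing of the two error curves."""
    gap = np.abs(np.asarray(commission) - np.asarray(omission))
    return float(np.asarray(thresholds)[int(np.argmin(gap))])


# Toy data (hypothetical index values and reference labels, not from the study).
index = [-0.4, -0.1, 0.1, 0.3, 0.5, 0.6]
truth = [False, False, False, True, True, True]
thresholds = [-0.2, 0.0, 0.2]
com, om = error_curves(index, truth, thresholds)
best = optimal_threshold(thresholds, com, om)
print(best)
```

With a finer grid of candidate thresholds the selected value approaches the actual intersection of the two curves; interpolating between grid points would be a possible refinement.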
Classiﬁcation accuracy of the three methods, i.e. AWEI, MNDWI and
ML, was assessed by calculating kappa coefﬁcients and error matrices.
The accuracy comparison between AWEI and MNDWI was made at
their optimal thresholds. McNemar's statistical test was applied to
examine whether the new water extraction method signiﬁcantly
improves accuracy compared to MNDWI and ML in the test sites.
McNemar's continuity corrected chi-square statistic was computed as
shown in Eq. (4) (De Leeuw et al., 2006):
$$\chi^2 = \frac{\left(\left|f_{12} - f_{21}\right| - 1\right)^2}{f_{12} + f_{21}} \qquad (4)$$
where $f_{12}$ and $f_{21}$ denote the frequencies of cases that are correctly classified
by one classification method but wrongly classified by the other.
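As an illustration of the two accuracy measures used here, the sketch below computes a kappa coefficient from an error matrix and the continuity-corrected McNemar statistic of Eq. (4). The function names and the example counts are hypothetical and are not taken from the study; the kappa formula is the standard Cohen's kappa, which is assumed to be the variant the text refers to.

```python
def kappa_coefficient(matrix):
    """Cohen's kappa from a square error matrix (rows: classified, columns: reference)."""
    k = len(matrix)
    n = sum(sum(row) for row in matrix)
    # Observed agreement: proportion of samples on the diagonal.
    observed = sum(matrix[i][i] for i in range(k)) / n
    # Chance agreement: sum over classes of (row total * column total) / n^2.
    expected = sum(
        sum(matrix[i]) * sum(row[i] for row in matrix) for i in range(k)
    ) / (n * n)
    return (observed - expected) / (1.0 - expected)


def mcnemar_chi2(f12, f21):
    """Continuity-corrected McNemar statistic, as in Eq. (4)."""
    if f12 + f21 == 0:
        raise ValueError("f12 + f21 must be positive")
    return (abs(f12 - f21) - 1) ** 2 / (f12 + f21)


# Hypothetical 2 x 2 error matrix (water, nonwater) and discordant counts.
example_matrix = [[40, 10],
                  [5, 45]]
print(kappa_coefficient(example_matrix))
print(mcnemar_chi2(30, 10))
```

The statistic depends only on the discordant cases, i.e. samples that one method classifies correctly and the other does not; samples on which both methods agree do not enter Eq. (4).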
3.5. Sub-pixel accuracy assessment
The sensitivity of different classiﬁers to various mixtures of water
and nonwater was evaluated using sub-pixel commission–omission
errors and by plotting graphs showing the cumulative percentage of
edge pixels classiﬁed as water against the proportion of each individual
pixel covered by water for mixed edge pixels at test sites in Denmark,
Switzerland and Ethiopia. Detailed sub-pixel accuracy analysis and
comparisons were undertaken using the three reservoirs in Ethiopia
(Gefersa, Dire and Legedadi). The total number of mixed edge pixels
in the three reservoirs was 1819 (164 ha). In the sub-pixel accuracy assessment,
commission and omission errors brought about by edge
pixels were quantiﬁed by the use of an overlay analysis in ArcGIS. Any
pixels that included water and nonwater surfaces were considered to
be mixed edge pixels (Fig. 2). If a mixed edge pixel was classiﬁed as
water, the fraction of it that fell outside the “true” boundary was considered
to be sub-pixel commission error. Similarly, in cases where mixed
pixels are classiﬁed as nonwater, the fraction of these pixels that fell
inside the “true” water body was considered to be an omission error
at the sub-pixel level. Mixed pixels consisting predominantly of water
(>50% water) should ideally be classified as water, and predominantly
nonwater pixels as nonwater. In
the sub-pixel accuracy assessment, influences of misregistration artifacts
and manual digitization of true water boundaries were assumed
to be insignificant.
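The sub-pixel bookkeeping described in this section can be sketched as follows. This is a simplified stand-in for the ArcGIS overlay analysis, not a reproduction of it: the function name, the input lists and the example fractions are hypothetical. The pixel area of 0.09 ha assumes 30 m × 30 m Landsat TM pixels, which agrees with the reported 1819 mixed edge pixels covering about 164 ha.

```python
PIXEL_AREA_HA = 0.09  # assumed 30 m x 30 m pixel = 900 m^2 = 0.09 ha


def subpixel_errors(water_fraction, classified_water, pixel_area_ha=PIXEL_AREA_HA):
    """Return (commission_ha, omission_ha) summed over mixed edge pixels.

    water_fraction   : fraction of each edge pixel inside the "true" water body (0..1)
    classified_water : True if the classifier labelled the pixel as water
    """
    commission = 0.0
    omission = 0.0
    for frac, is_water in zip(water_fraction, classified_water):
        if is_water:
            # Part of a "water" pixel that lies outside the true boundary.
            commission += (1.0 - frac) * pixel_area_ha
        else:
            # Part of a "nonwater" pixel that lies inside the true water body.
            omission += frac * pixel_area_ha
    return commission, omission


# Four hypothetical edge pixels.
fractions = [0.8, 0.3, 0.6, 0.1]
labels = [True, True, False, False]
print(subpixel_errors(fractions, labels))
```

Every mixed pixel contributes to exactly one of the two error types, so even a classifier that always follows the 50% rule accumulates some sub-pixel error.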
4. Results
4.1. Water extraction maps
The outputs of water extraction using the three classiﬁers at the ﬁve
test sites are presented in Fig. 3. Visual inspection of Fig. 3 indicated that
AWEI resulted in better accuracy of surface water mapping compared to
MNDWI and ML. Particularly at test sites in Switzerland, South Africa
and New Zealand, the new index (AWEI) was consistently better in suppressing
shadow and other nonwater surfaces. In most cases, MNDWI
and especially ML produced noisy results. However, at the test sites in
Denmark and Ethiopia, visual inspection of Fig. 3 indicated only small
differences among the three classification methods.
Fig. 2. Edge pixels around Gefersa reservoir (Ethiopia) showing mixed pixels with different proportions of water (shown on a high spatial resolution image accessed through Google Earth™).
Visual inspection of classiﬁcation outputs at the three additional test
sites shown in Appendix A (Figs. A1–A3) also indicates that AWEI is
effective in extracting surface water in the presence of shadow and
urban surfaces. At Bishoftu lakes in Ethiopia, where no major shadow
surfaces were present, both AWEIsh and MNDWI resulted in (visually)
similar classiﬁcation outputs. By contrast, at the test sites in Norway
and Shanghai where dark shadows were abundant, visual inspection
clearly shows that AWEIsh suppressed shadowed surfaces more effectively
than MNDWI (shown in Appendix A).
4.2. Classiﬁcation accuracy and edge pixel effects
The results of mapping accuracy at each of the ﬁve main test sites are
summarized in Table 4. At all test sites the accuracy achieved by AWEI
was higher than that of the MNDWI and ML classifiers. Averaged over
the five test sites, the total omission and commission error of AWEI was
only about 50% of that of MNDWI and 25% of that of the ML classifier
(Fig. 4). Details of the accuracy assessment, including user's and producer's
accuracy, are shown in Appendix A (Table A1). Since visual
inspection of Fig. 3 indicates small variation in accuracy at the test sites
in Denmark and Ethiopia, the results of McNemar's chi-square test of the
significance of accuracy differences at the test sites in Denmark, Switzerland and
Ethiopia are included in Table 5. At these three test sites, significant accuracy
improvement was achieved by AWEI (P-value < 0.01) compared
to ML. At the test site in Denmark, the accuracy difference between AWEI
and MNDWI was insignificant (Table 5). ML performed worst at the test
site in South Africa (kappa coefficient 0.62), and at this test site the
highest accuracy was achieved by AWEI, with a kappa coefficient of
0.98 (Table A1 in Appendix A).
Fig. 3. Comparison of water extraction results using the three classifiers (AWEI, MNDWI and ML) at the five test sites (Denmark, Switzerland, Ethiopia, S. Africa and New Zealand).
The sub-pixel accuracy analysis is presented in Fig. 5. The comparison
shows the ability of the three classiﬁers in correctly classifying
edge pixels with various mixtures of water and nonwater components.
The vertical line in Fig. 5 indicates the 50% water–nonwater mixture and
the ﬁgure shows that among the edge pixels that AWEI classiﬁed as
water, only 13% were predominantly nonwater. Conversely, 87% of
mixed edge pixels that were classiﬁed as water were correctly classiﬁed
by AWEI. Using MNDWI, 81% of the mixed edge pixels were correctly
classiﬁed. Further analysis of mixed edge pixels at test sites in Addis
Ababa showed that sub-pixel commission error of AWEI corresponded
to an overestimation of 16.6 ha (total area of predominantly nonwater
edge pixels classiﬁed as water), and omission error corresponded to
4.3 ha (total area of predominantly water edge pixels classiﬁed as
nonwater). For comparison, edge pixel commission and omission errors
of MNDWI corresponded to overestimation and underestimation of
18.3 ha and 4 ha, respectively. At this site edge pixel omission and commission
error of ML corresponded to 49.3 ha and 0.6 ha, respectively.
Based on the sum of total overestimation and underestimation of edge
pixels, AWEI performed slightly better than MNDWI, and ML achieved
the lowest accuracy in classifying mixed edge pixels.
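The ranking in the last sentence follows directly from summing the two edge-pixel error areas reported above for each classifier. The short sketch below only repeats that arithmetic; the dictionary layout and variable names are illustrative.

```python
# (commission_ha, omission_ha) for mixed edge pixels, as reported in the text.
edge_errors_ha = {
    "AWEI": (16.6, 4.3),
    "MNDWI": (18.3, 4.0),
    "ML": (0.6, 49.3),
}

# Total misestimated area per classifier, rounded to one decimal place.
totals = {name: round(com + om, 1) for name, (com, om) in edge_errors_ha.items()}

# Classifiers ordered from smallest to largest total edge-pixel error.
ranking = sorted(totals, key=totals.get)
print(totals)
print(ranking)
```

The totals are 20.9 ha for AWEI, 22.3 ha for MNDWI and 49.9 ha for ML. ML has by far the smallest commission error, but this is outweighed by its large omission error.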
4.3. Optimal threshold and its variability
A comparison of the stability of the optimum thresholds of AWEI and
MNDWI is shown in Fig. 6. It clearly appears that the optimal threshold
of MNDWI at different test sites exhibited large variation compared
to AWEI. The optimal threshold of MNDWI ranged from 0.005 in
Denmark to 0.6 in South Africa, whereas for AWEI the optimal threshold
only varied from −0.15 (AWEInsh in Denmark) to 0.045 (AWEIsh in
South Africa), and in the three other sites the optimal threshold of
AWEIsh was 0 (Fig. 6).
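As a rough worked comparison that uses only the endpoint values quoted above (the symbol $\Delta$ for the span of optimal thresholds is introduced here for illustration):

```latex
\begin{align*}
\Delta_{\mathrm{MNDWI}} &= 0.6 - 0.005 = 0.595,\\
\Delta_{\mathrm{AWEI}}  &= 0.045 - (-0.15) = 0.195.
\end{align*}
```

On these figures the optimal threshold of MNDWI spans a range roughly three times wider than that of AWEI, with the caveat that the two indices are not necessarily on the same numerical scale.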
5. Discussion and perspectives
The new water extraction index introduced in this paper contributes
to the efforts being made to improve the accuracy of surface water
mapping and change analysis for various environmental studies and
applications. This method uses a simple and systematic technique of
enhancing class separability without a need for additional data to
remove shadow and dark surface noises, which are often major causes
of misclassiﬁcation in surface water mapping. Using a simple classiﬁcation
tree approach, the AWEI was shown to extract surface water with
high accuracy, particularly in mountainous areas where hills cast
shadows on background surfaces and in urban areas with complex
land cover. AWEI is not only a simple technique but was also shown
to be robust under various environmental conditions and for different
types of water bodies.
Fig. 4. Total classification error (combined commission and omission error, in %) of the AWEI, MNDWI and ML classifiers. The box plots show the variability of classification errors among the test sites. Each box plot shows the 10th, 25th, 50th, 75th, and 90th percentiles as horizontal lines (boxes and whiskers); the circles mark the 5th and 95th percentiles.
Table 5
Summary of McNemar's continuity corrected χ² test for differences in classification accuracy (AWEI compared with MNDWI and with ML).
Test site      Classifier   χ² (vs. MNDWI)   χ² (vs. ML)   P-value (vs. MNDWI)   P-value (vs. ML)
Denmark        AWEI         0.8              114           0.30                  0.00
Switzerland    AWEI         408.0            619           0.00                  0.00
Ethiopia       AWEI         19.0             30            0.00                  0.00
Fig. 5. Cumulative frequency of mixed edge pixels classified as water (average of test sites in Denmark, Switzerland and Ethiopia). Horizontal axis: percentage of water in edge pixels (0–100); vertical axis: cumulative % of edge pixels classified as water (0–100). Curves: AWEI (A), MNDWI (M) and maximum likelihood (L).
Table 4
Summary of classification accuracy (kappa coefficient) of the three classifiers by test site.
Classifier   Denmark   Switzerland   Ethiopia   S. Africa   New Zealand
AWEI         0.93      0.95          0.97       0.98        0.98
MNDWI        0.92      0.89          0.95       0.94        0.90
ML           0.89      0.81          0.93       0.62        0.97
In many water indices, the lack of stability of the threshold is a problem
(Ji et al., 2009), making it difﬁcult to decide which value should be
used in classiﬁcation trees. The lack of a reasonably stable threshold
may make the classiﬁcation more time-consuming and lead to a subjective
choice of threshold which may also affect accuracy. In addition to
accuracy improvement, our new index was also shown to have a relatively
stable optimal threshold which makes the use of the method
even simpler. It should be noted that in our study, images from all test
sites were atmospherically corrected applying the FLAASH module in
ENVI v.4.8. In classifying images that are calibrated to TOA reﬂectance,
but with no atmospheric corrections, the optimal thresholds may differ
slightly from what is observed in this study.
A number of authors contributed to previous research on the improvement
of surface water mapping accuracy using remotely sensed
data, including those that have emphasized the improvement of general
land cover classiﬁcation accuracies (Aguirre-Gutierrez, Seijmonsbergen,
& Duivenvoorden, 2012; Rozenstein & Karnieli, 2011) and surface water
mapping in particular (Ji et al., 2009; Sun et al., 2012; Verpoorter et al.,
2012; Xu, 2006). In a recent work of Verpoorter et al. (2012), a six-step
water extraction method called GeoCover Water Bodies Extraction
Method (GWEM) was introduced. In GWEM, the authors proposed a
combination of various classiﬁcation techniques for improvement of
accuracy. The same authors identiﬁed shadows of cloud and mountains
as major sources of accuracy problems and proposed a method where
elevation data were used to detect shadows and water surfaces that
overlap with shadow were removed from the classiﬁcation dataset.
Our new method, which automatically suppresses shadow pixels without
the need for other data input or separate shadow detection procedures,
may ease surface water mapping, particularly in situations
where mapping, monitoring and change detection of surface water
resources across multiple scenes or over regional and global scales are
required.
Despite a number of surface water mapping and accuracy improvement
methods reported in the literature, limited research has been undertaken
on accuracy assessment at sub-pixel level. This is particularly
important when satellite images such as Landsat are used. Because of
the limited spatial resolution of reﬂective bands of Landsat TM, edge
pixels cover relatively large areas which likely consist of a mixture of
water and nonwater components. In the use of Landsat TM data for
environmental studies where monitoring and detecting changes in
waterline are of interest, the accuracy of classifying mixed edge pixels
may become an important issue.
As mentioned in the Results section, when applying the ML classiﬁer
to the reservoirs around Addis Ababa, a substantial number of edge
pixels that predominantly consist of water were classiﬁed as nonwater,
thus obviously leading to underestimation of surface water extents. The
implication of this could be that even if the water boundary expands by
a certain distance, thus changing the proportion of water in mixed edge
pixels, say from 40% to 60%, the ML method could still classify the pixels
as nonwater since it seems that the classiﬁer is sensitive to nonwater
components of the mixture. The ability of different classiﬁers to classify
such mixed pixels correctly into water and nonwater classes may
vary depending on the spectral bands and algorithms used. The
nonwater components of the mixture could be composed of many
combinations of different land cover types. So, the reﬂectance values
of mixed pixels can vary considerably, even for pixels where the proportion
of water is similar (Ji et al., 2009). The relative improvement
in sub-pixel accuracy achieved by AWEI may make it suitable for
consistent and reliable estimation of surface water dynamics using
Landsat data.
Though the new water extraction index was tested under a wide
range of environmental conditions and water body types, several variables
that were not considered at our test sites are likely to affect the accuracy
of water extraction methods. Seasonal and daily variation in the
angle of the sun, atmospheric composition, and changes in biophysical
and chemical properties of water bodies, such as changes in phytoplankton
(Zhang et al., 2010) may inﬂuence the reﬂectance patterns of
water bodies. The use of different atmospheric correction methods
may also inﬂuence thresholds and accuracies. Therefore one may need
to consider the importance and type of atmospheric correction applied
in the image preprocessing stage in evaluating accuracies of different
water extraction methods. AWEI was tested using Landsat TM data
only and its use may therefore need to be evaluated on data from
other sensors.
In our test cases, we did not consider the inﬂuence of seasonal variation
in appearances of water bodies. Therefore, the robustness of the
new method also needs to be tested in different seasons. In addition
to the five test sites examined in detail, we included three additional
sites in Appendix A (Figs. A1–A3). Nevertheless, more sites may need
to be included for a thorough evaluation of the performance of the
index.
6. Conclusion
The main purpose of this study was to devise a method that improves
water extraction accuracy by increasing spectral separability between
water and nonwater surfaces, particularly in areas with shadows
and urban backgrounds that are often major causes of low classiﬁcation
accuracy. Using Landsat 5 TM data, we introduced a new automated
water extraction method (AWEI) and compared its per-pixel and subpixel
accuracy and threshold stability with that of the MNDWI and
ML classiﬁers. AWEI signiﬁcantly improved accuracy in areas where
shadow and other dark surfaces were the main sources of classiﬁcation
errors.
A sub-pixel analysis of errors at the edges of water bodies revealed
that the AWEI classiﬁer was relatively more accurate in classifying
edge pixels compared to the MNDWI and ML classiﬁcation methods. Besides,
the optimal threshold of AWEI was shown to be less variable with
images of different locations and times compared to that of MNDWI.
Therefore, AWEI is proposed as an alternative and improved water
index, especially in extracting water information from areas where
noisy results are expected because of the presence of shadows and
built-up surfaces. This new method would also be suitable for surface
water change detection studies since it classiﬁes edge pixels with high
accuracy and with a stable threshold.
Fig. 6. Threshold variability and distribution of index values for AWEI and MNDWI at the five test sites (Denmark, Switzerland, Ethiopia, South Africa and New Zealand). Dashed lines show the mean optimal threshold of the five test sites, and the symbol “x” shows the optimal threshold for each site. The labels “nsh” and “sh” mark the AWEI variant (AWEInsh or AWEIsh) used at each site.
Appendix A
Fig. A1. Water extraction images applying AWEIsh (top row) and MNDWI (middle row) at the test site in Norway using Landsat 5 TM acquired on Aug 13, 2011 (2382 by 2382 pixels, top-left corner coordinate in UTM: 6,989,655 N, 371,295 E). Panels: AWEIsh at 0 threshold; AWEIsh at 0.17 threshold; MNDWI at 0.0 threshold; MNDWI at 0.7 threshold; Landsat true color composite; Google 3D view showing water and mountain shadow.
Fig. A3. Water extraction images applying AWEIsh (top row) and MNDWI (middle row) at the test site in Bishoftu, Ethiopia using Landsat ETM+ acquired on Nov 27, 2002 (400 by 400 pixels, top-left corner coordinate UTM: 3,461,925 N, 350,415 E). The location of actual water bodies is shown by the “X” marks on the true color composite of the Landsat ETM+ image (bottom image). Panels: AWEIsh at 0.0 threshold; AWEIsh at 0.1 threshold; MNDWI at 0.0 threshold; MNDWI at 0.15 threshold.
Fig. A2. Water extraction images applying AWEIsh (top row) and MNDWI (middle row) at the test site in Shanghai, China using Landsat ETM+ acquired on Nov 27, 2002 (400 by 400 pixels, top-left corner coordinate UTM: 3,461,925 N, 350,415 E). Panels: AWEIsh at 0.15, 0.2 and 0.25 thresholds; MNDWI at 0.2, 0.4 and 0.5 thresholds; Landsat true color composite; Google 3D view showing building shadow and river.
Table A1
Summary of accuracy assessments at the five main test sites showing various accuracy measures.
Columns: Test site; Classification method; Threshold; Land cover class; User's accuracy (%); Producer's accuracy (%); Kappa; Commission error (%); Omission error (%); Total error (%). Kappa is listed on the Water row of each threshold only.
Denmark (a) AWEInsh 0.000 Water 97.08 91.43 0.93 2.92 8.57 11.49
Nonwater 98.57 99.54 1.43 0.46 1.89
−0.2 Water 96.35 92.89 0.94 3.65 7.11 10.77
Nonwater 98.81 99.41 1.19 0.59 1.79
−0.15 Water 98.30 92.58 0.95 1.70 7.42 9.12
Nonwater 98.75 99.72 1.25 0.28 1.53
MNDWI 0.00 Water 95.13 91.04 0.92 4.87 8.96 13.82
Nonwater 98.50 99.22 1.50 0.78 2.28
0.05 Water 97.10 89.89 0.92 2.90 10.11 13.02
Nonwater 98.32 99.55 1.68 0.45 2.14
0.1 Water 98.05 88.65 0.92 1.95 11.35 13.31
Nonwater 98.12 99.70 1.88 0.30 2.18
MaxLike – Water 96.85 84.61 0.89 3.15 15.39 18.53
Nonwater 97.46 99.54 2.54 0.46 3.00
Switzerland AWEIsh −0.050 Water 52.89 96.77 0.66 47.1 3.2 50.3
Nonwater 99.84 95.81 0.2 4.2 4.4
0.100 Water 99.34 76.61 0.86 0.7 23.4 24.1
Nonwater 98.88 99.98 1.1 0.0 1.1
0.000 Water 99.01 90.96 0.95 1.0 9.0 10.0
Nonwater 99.56 99.96 0.4 0.0 0.5
MNDWI −0.005 Water 99.34 76.61 0.94 0.7 23.4 24.1
Nonwater 99.84 95.81 0.2 4.2 4.4
0.100 Water 73.92 95.64 0.82 26.1 4.4 30.4
Nonwater 99.79 98.36 0.2 1.6 1.9
0.300 Water 87.99 91.92 0.89 12.0 8.1 20.1
Nonwater 99.61 99.39 0.4 0.6 1.0
0.500 Water 96.76 78.95 0.86 3.2 21.1 24.3
Nonwater 98.99 99.87 1.0 0.1 1.1
MaxLike – Water 74.46 92.08 0.81 25.5 7.9 33.5
Nonwater 99.61 98.47 0.4 1.5 1.9
Ethiopia AWEIsh −0.050 Water 95.47 98.85 0.95 4.53 1.15 5.68
Nonwater 100.00 99.97 0.00 0.03 0.04
0.000 Water 95.47 98.85 0.97 4.53 1.15 5.68
Nonwater 100.00 99.98 0.00 0.02 0.02
0.100 Water 96.60 91.48 0.94 3.40 8.52 11.92
Nonwater 99.97 99.99 0.03 0.01 0.05
MNDWI 0.000 Water 92.57 96.66 0.95 7.43 3.34 10.77
Nonwater 99.99 99.97 0.01 0.03 0.04
0.100 Water 95.01 95.39 0.95 4.99 4.61 9.60
Nonwater 99.98 99.98 0.02 0.02 0.04
0.150 Water 97.53 92.66 0.95 2.47 7.34 9.81
Nonwater 99.97 99.99 0.03 0.01 0.04
MaxLike – Water 99.71 86.96 0.93 0.29 13.04 13.33
Nonwater 99.95 100.00 0.05 0.00 0.05
S. Africa AWEIsh 0.020 Water 83.23 98.86 0.90 16.77 1.14 17.91
Nonwater 99.97 99.50 0.03 0.50 0.53
0.045 Water 98.32 98.30 0.98 1.68 1.70 3.38
Nonwater 99.96 99.96 0.04 0.04 0.09
0.060 Water 98.43 97.51 0.98 1.57 2.49 4.06
Nonwater 99.94 99.96 0.06 0.04 0.10
MNDWI 0.300 Water 70.54 97.67 0.81 29.46 2.33 31.79
Nonwater 99.94 98.97 0.06 1.03 1.09
0.450 Water 89.07 96.37 0.92 10.93 3.63 14.56
Nonwater 99.91 99.70 0.09 0.30 0.39
0.600 Water 94.46 93.58 0.94 5.54 6.42 11.96
Nonwater 99.84 99.86 0.16 0.14 0.30
MaxLike – Water 46.74 97.12 0.62 53.26 2.88 56.14
Nonwater 99.93 97.21 0.07 2.79 2.87
N. Zealand AWEIsh −0.100 Water 98.74 99.87 0.96 1.26 0.13 1.39
Nonwater 99.29 93.65 0.71 6.35 7.06
0.000 Water 99.82 99.56 0.98 0.18 0.44 0.61
Nonwater 97.85 99.13 2.15 0.87 3.02
0.100 Water 99.90 99.45 0.98 0.10 0.55 0.65
Nonwater 97.33 99.51 2.67 0.49 3.17
MNDWI 0.000 Water 96.93 99.79 0.89 3.07 0.21 3.28
Nonwater 98.79 84.23 1.21 15.77 16.98
0.150 Water 97.39 99.44 0.90 2.61 0.56 3.16
Nonwater 96.91 86.72 3.09 13.28 16.37
0.200 Water 98.14 98.50 0.90 1.86 1.50 3.35
Nonwater 92.39 90.71 7.61 9.29 16.89
MaxLike – Water 99.84 99.18 0.97 0.16 0.82 0.98
Nonwater 96.05 99.22 3.95 0.78 4.73
(a) At this test site, shadow is not a major source of classification noise, but built-up surfaces are the predominant land cover type. Therefore, the use of AWEInsh resulted in high accuracy of
water extraction (a combined use of both AWEIsh and AWEInsh did not improve accuracy).
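The error columns of Table A1 appear to be the complements of the accuracy columns: commission error is 100 minus user's accuracy, omission error is 100 minus producer's accuracy, and total error is their sum. The sketch below checks this reading against the first Denmark row; the function name is hypothetical and the relationship is inferred from the tabulated values rather than stated explicitly in the table.

```python
def error_rates(user_accuracy, producer_accuracy):
    """Return (commission, omission, total) error in %, rounded to 2 decimals."""
    commission = round(100.0 - user_accuracy, 2)     # complement of user's accuracy
    omission = round(100.0 - producer_accuracy, 2)   # complement of producer's accuracy
    total = round(commission + omission, 2)
    return commission, omission, total


# Denmark, AWEInsh at threshold 0.000, Water class (values from Table A1).
print(error_rates(97.08, 91.43))
```

For a few rows the tabulated total differs from this sum by 0.01 (for example 13.82 where 4.87 + 8.96 gives 13.83), presumably because the published totals were computed from unrounded values.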
References
Aguirre-Gutierrez, J., Seijmonsbergen, A.C., & Duivenvoorden, J. F. (2012). Optimizing land
cover classiﬁcation accuracy for change detection, a combined pixel-based and
object-based approach in a mountainous area in Mexico. Applied Geography, 34,
29–37.
Alderman, K., Turner, L. R., & Tong, S. L. (2012). Floods and human health: A systematic
review. Environment International, 47, 37–47.
Bond, N. R., Lake, P.S., & Arthington, A. H. (2008). The impacts of drought on freshwater
ecosystems: An Australian perspective. Hydrobiologia, 600, 3–16.
Butcher, S. S., Charlson, R. J., Orians, G. H., & Wolfe, G. V. (Eds.). (1992). Global
biogeochemical cycles. London: Academic Press Limited.
Charoenpanyanet, A., & Chen, X. (2008). Satellite-based modeling of Anopheles mosquito
densities on heterogeneous land cover in Western Thailand. The International Archives
of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 37, 159–164.
Charron, D. F., Thomas, M. K., Waltner-Toews, D., Aramini, J. J., Edge, T., Kent, R. A., et al.
(2004). Vulnerability of waterborne diseases to climate change in Canada: A review.
Journal of Toxicology and Environmental Health-Part a-Current Issues, 67, 1667–1677.
Congalton, R. G., & Green, K. (2009). Assessing the accuracy of remotely sensed data:
Principles and practices (2nd ed.). Boca Raton: Lewis Publishers.
COWI (2010). DDOland2010: Danish National Digital Orthophoto, version 2010. http://
www.geodata-info.dk/Portal/ShowMetadata.aspx?id=d6a57343–c930–4d28–94ce-
85b5ca2847dc
Dambach, P., Machault, V., Lacaux, J. -P., Vignolles, C., Sié, A., & Sauerborn, R. (2012). Utilization
of combined remote sensing techniques to detect environmental variables
inﬂuencing malaria vector densities in rural West Africa. International Journal of
Health Geographics, 11.
Davranche, A., Lefebvre, G., & Poulin, B. (2010). Wetland monitoring using classiﬁcation
trees and SPOT-5 seasonal time series. Remote Sensing of Environment, 114, 552–562.
De Leeuw, J., Jia, H., Yang, L., Liu, X., Schmidt, K., & Skidmore, A. K. (2006). Comparing accuracy
assessments to infer superiority of image classiﬁcation methods. International
Journal of Remote Sensing, 27, 223–232.
Dewan, A.M., Islam, M. M., Kumamoto, T., & Nishigaki, M. (2007). Evaluating ﬂood hazard
for land-use planning in Greater Dhaka of Bangladesh using remote sensing and GIS
techniques. Water Resources Management, 21, 1601–1612.
Duan, Z., & Bastiaanssen, W. G. M. (2013). Estimating water volume variations in lakes
and reservoirs from four operational satellite altimetry databases and satellite imagery
data. Remote Sensing of Environment, 134, 403–416.
Exelis Visual Information Solutions (2010). ENVI v. 4.8. Boulder, Colorado: EXELIS.
Frey, H., Huggel, C., Paul, F., & Haeberli, W. (2010). Automated detection of glacier lakes
based on remote sensing in view of assessing associated hazard potentials. Grazer
Schriften der Geographie und Raumforschung, 45, 261–272.
Gardelle, J., Hiernaux, P., Kergoat, L., & Grippa, M. (2009). Less rain, more water in ponds:
A remote sensing study of the dynamics of surface waters from 1950 to present
in pastoral Sahel (Gourma region, Mali). Hydrology and Earth System Sciences
Discussions, 6, 5047–5083.
Giardino, C., Bresciani, M., Villa, P., & Martinelli, A. (2010). Application of remote sensing
in water resource management: The case study of Lake Trasimeno, Italy. Water
Resources Management, 24, 3885–3899.
Guttler, F. N., Niculescu, S., & Gohin, F. (2013). Turbidity retrieval and monitoring of
Danube Delta waters using multi-sensor optical remote sensing data: An integrated
view from the delta plain lakes to the western-northwestern Black Sea coastal
zone. Remote Sensing of Environment, 132, 86–101.
Haas, E. M., Bartholomé, E., & Combal, B. (2009). Time series analysis of optical remote
sensing data for the mapping of temporary surface water bodies in sub-Saharan
western Africa. Journal of Hydrology, 370, 52–63.
He, B., Oki, K., Wang, Y., Oki, T., Yamashiki, Y., Takara, K., et al. (2012). Analysis of stream
water quality and estimation of nutrient load with the aid of Quick Bird remote sensing
imagery. Hydrological Sciences Journal-Journal Des Sciences Hydrologiques, 57,
850–860.
Hui, F., Xu, B., Huang, H., Yu, Q., & Gong, P. (2008). Modelling spatial–temporal change of
Poyang Lake using multitemporal Landsat imagery. International Journal of Remote
Sensing, 29, 5767–5784.
Jain, S. K., Saraf, A. K., Goswami, A., & Ahmad, T. (2006). Flood inundation mapping using
NOAA AVHRR data. Water Resources Management, 20, 949–959.
Jain, S. K., Singh, R. D., Jain, M. K., & Lohani, A. K. (2005). Delineation of ﬂood-prone areas
using remote sensing technique. Water Resources Management, 19, 337–347.
Ji, L., Zhang, L., & Wylie, B. (2009). Analysis of dynamic thresholds for the normalized difference
water index. Photogrammetric Engineering and Remote Sensing, 75, 1307–1317.
Jiang, Z., Qi, J., Su, S., Zhang, Z., & Wu, J. (2012). Water body delineation using index composition
and HIS transformation. International Journal of Remote Sensing, 33, 3402–3421.
Jimenez-Munoz, J. C., Sobrino, J. A., Mattar, C., & Franch, B. (2010). Atmospheric correction
of optical imagery from MODIS and reanalysis atmospheric products. Remote Sensing
of Environment, 114, 2195–2210.
Kondo, H., Seo, N., Yasuda, T., Hasizume, M., Koido, Y., Ninomiya, N., et al. (2002).
Post-ﬂood-infectious diseases in Mozambique. Prehospital and Disaster Medicine, 17,
126–133.
Lacaux, J. P., Tourre, Y. M., Vignolles, C., Ndione, J. A., & Lafaye, M. (2007). Classiﬁcation of
ponds from high-spatial resolution remote sensing: Application to Rift Valley Fever
epidemics in Senegal. Remote Sensing of Environment, 106, 66–74.
Lake, P.S. (2003). Ecological effects of perturbation by drought in ﬂowing waters.
Freshwater Biology, 48, 1161–1172.
Li, K. Z., Wu, S. H., Dai, E. F., & Xu, Z. C. (2012). Flood loss analysis and quantitative risk
assessment in China. Natural Hazards, 63, 737–760.
Lillesand, T. M., Kiefer, R. W., & Chipman, J. W. (2004). Remote sensing and image interpretation
(5th ed.): John Wiley & Sons, Inc.
Lira, J. (2006). Segmentation and morphology of open water bodies from multispectral
images. International Journal of Remote Sensing, 27, 4015–4038.
McFeeters, S. K. (1996). The use of Normalized Difference Water Index (NDWI) in the
delineation of open water features. International Journal of Remote Sensing, 17,
1425–1432.
Morss, R. E., Wilhelmi, O. V., Downton, M. W., & Gruntfest, E. (2005). Flood risk, uncertainty,
and scientiﬁc information for decision making — Lessons from an interdisciplinary
project. Bulletin of the American Meteorological Society, 86 (1593-+).
Mucher, C. A., Steinnocher, K. T., Kressler, F. P., & Heunks, C. (2000). Land cover characterization
and change detection for environmental monitoring of pan-Europe.
International Journal of Remote Sensing, 21, 1159–1181.
NASA (2012). Landsat 7 science data users handbook. Online.
Novoa, S., Chust, G., Sagarminaga, Y., Revilla, M., Borja, A., & Franco, J. (2012). Water quality
assessment using satellite-derived chlorophyll-a within the European directives,
in the southeastern Bay of Biscay. Marine Pollution Bulletin, 64, 739–750.
Ouma, Y. O., & Tateishi, R. (2006). A water index for rapid mapping of shoreline changes
of ﬁve East African Rift Valley lakes: An empirical analysis using Landsat TM and
ETM+ data. International Journal of Remote Sensing, 27, 3153–3181.
Poulin, B., Davranche, A., & Lefebvre, G. (2010). Ecological assessment of Phragmites
australis wetlands using multi-season SPOT-5 scenes. Remote Sensing of Environment,
114, 1602–1609.
Prigent, C., Papa, F., Aires, F., Jimenez, C., Rossow, W. B., & Matthews, E. (2012). Changes in
land surface water dynamics since the 1990s and relation to population pressure.
Geophysical Research Letters, 39, L08403.
Proud, S. R., Fensholt, R., Rasmussen, L. Y., & Sandholt, I. (2011). Rapid response ﬂood detection
using the MSG geostationary satellite. International Journal of Applied Earth
Observation and Geoinformation, 13, 536–544.
Richards, J. A. (1993). Remote sensing digital image analysis: An introduction (2nd ed.)
Berlin: Springer-Verlag.
Rogers, A. S., & Kearney, M. S. (2004). Reducing signature variability in unmixing coastal
marsh Thematic Mapper scenes using spectral indices. International Journal of Remote
Sensing, 25, 2317–2335.
Rozenstein, O., & Karnieli, A. (2011). Comparison of methods for land-use classiﬁcation
incorporating remote sensing and GIS inputs. Applied Geography, 31, 533–544.
Ryu, J. H., Won, J. S., & Min, K. D. (2002). Waterline extraction from Landsat TM data in a
tidal ﬂat — A case study in Gomso Bay, Korea. Remote Sensing of Environment, 83,
442–456.
Sethre, P. R., Rundquist, B. C., & Todhunter, P. E. (2005). Remote detection of prairie pothole
ponds in the Devils Lake Basin, North Dakota. GIScience and Remote Sensing, 42,
277–296.
Sheng, Y. W., Shah, C. A., & Smith, L. C. (2008). Automated image registration for hydrologic
change detection in the lake-rich Arctic. IEEE Geoscience and Remote Sensing
Letters, 5, 414–418.
Sun, F., Sun, W., Chen, J., & Gong, P. (2012). Comparison and improvement of methods for
identifying waterbodies in remotely sensed imagery. International Journal of Remote
Sensing, 33, 6854–6875.
United States Geological Survey (USGS) (2012). Landsat data archive. Global Visualization
Viewer (GLOVIS).
Verpoorter, C., Kutser, T., & Tranvik, L. (2012). Automated mapping of water bodies using
Landsat multispectral data. Limnology and Oceanography-Methods, 10, 1037–1050.
Xu, H. (2006). Modiﬁcation of normalised difference water index (NDWI) to enhance
open water features in remotely sensed imagery. International Journal of Remote
Sensing, 27, 3025–3033.
Zhang, Y. L., Feng, L. Q., Li, J. S., Luo, L. C., Yin, Y., Liu, M. L., et al. (2010). Seasonal–spatial
variation and remote sensing of phytoplankton absorption in Lake Taihu, a large eutrophic
and shallow lake in China. Journal of Plankton Research, 32, 1023–1037.