4.1. OGA19’s Performance on Proximal Remote Sensing Datasets
The performance comparison between OGA19 and the other semi-empirical algorithms (HUN08, MIS14 and LIU17) indicates that OGA19 yielded the highest correlation when tuned using all eight datasets (collected in three different reservoirs in three different years) (Table 3), implying that OGA19 is more geographically transferable than the other remote sensing algorithms (Table 1) across the different environmental conditions of the three study sites (Table 2). For the validation dataset (Table 5), OGA19 had the lowest RMSE and MAE for the ECR and GR datasets (Figure 4A,B), but only the second lowest RMSE for MR, with its scatter plot lying closer to the 1:1 line (Figure 4C).
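The RMSE and MAE values used in this comparison can be reproduced with a few lines of code. The sketch below assumes the standard definitions of both metrics; the function names and the sample PC values are hypothetical and are not taken from the study's data.

```python
import math


def rmse(measured, estimated):
    """Root mean square error between paired measured and estimated values."""
    if not measured or len(measured) != len(estimated):
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(e - m) ** 2 for m, e in zip(measured, estimated)]
    return math.sqrt(sum(squared) / len(squared))


def mae(measured, estimated):
    """Mean absolute error between paired measured and estimated values."""
    if not measured or len(measured) != len(estimated):
        raise ValueError("inputs must be non-empty and of equal length")
    absolute = [abs(e - m) for m, e in zip(measured, estimated)]
    return sum(absolute) / len(absolute)


# Hypothetical PC concentrations in µg/L (illustration only).
measured_pc = [10.0, 20.0, 30.0, 40.0]
estimated_pc = [12.0, 18.0, 33.0, 37.0]

print(f"RMSE = {rmse(measured_pc, estimated_pc):.2f} µg/L")
print(f"MAE  = {mae(measured_pc, estimated_pc):.2f} µg/L")
```

Because RMSE squares the residuals, it penalizes a few large errors more heavily than MAE does, which is one reason the two metrics can rank algorithms differently for the same site.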
Although the MIS14 algorithm resulted in a lower RMSE and MAE than OGA19 when applied to the 2010 MR validation dataset, a sensitivity analysis of OGA19 and MIS14 confirmed the advantage of the former over the latter when applied to all MR datasets. Figure 7 shows three-dimensional plots with PC and chl-a concentrations on the X and Y axes and the algorithm value on the Z axis. The plot for MIS14 does not show a linear relationship between the algorithm value and PC concentration as chl-a concentration varies (Figure 7A). In contrast, a strong linear relationship between the OGA19 value and PC concentration is evident with increasing chl-a concentration (Figure 7B). This sensitivity analysis suggests that the proposed OGA19 is less sensitive to chl-a than the MIS14 algorithm and should be favorable for the retrieval of PC concentrations from remote sensing data.
OGA19 was also compared to the semi-analytical algorithm SIM05 (Figure 5 and Figure 6). The tuning results suggest similar performances between SIM05 and OGA19 (Figure 5). For each yearly dataset of a given study site, OGA19 performed slightly better than SIM05; the biggest difference was for the ECR 2007 dataset, for which the R² value was 0.85 for OGA19 as opposed to 0.16 for SIM05. For the validation results (Figure 6), the performances of both algorithms were similar, though OGA19 yielded slightly lower RMSE and MAE values than SIM05, with the exception of GR, for which SIM05 had a lower MAE. Despite their similar performance, OGA19 has an advantage over SIM05 because it is a simple expression (Equation (14)), whereas SIM05 requires calculating parameters such as aw(709), aw(665), bb, ϒ, δ and ε (Equations (16) and (17)).
OGA19 has been shown to outperform several semi-empirical and semi-analytical algorithms. Recent reviews on this topic have shown that simple algorithms such as band ratios generate accurate estimates [20,29,33,34]. In this study, the ratio between the Rrs at 709 nm and 620 nm was used to approximate the aphy at 620 nm (Figure 1). Given the high R² value of 0.9, it is worth comparing OGA19 with the band ratio between 709 and 620 nm for the estimation of PC. Table 8 shows that OGA19 performed better when tuned against the datasets from all years collected from each of the three central Indiana reservoirs. For ECR, OGA19 had an R² of 0.67 while the band ratio had an R² of 0.50; for MR, the proposed algorithm had an R² of 0.65, which is higher than the 0.52 obtained by the band ratio; for GR, both algorithms performed poorly, with R² values of 0.20 and 0.18 for OGA19 and the band ratio, respectively.
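As an illustration of how a band-ratio algorithm of this kind can be tuned, the sketch below computes the Rrs(709)/Rrs(620) ratio and fits it to measured PC with ordinary least squares, then reports R². It assumes that tuning is a simple linear fit, which the text does not state explicitly; all function names and sample values are hypothetical.

```python
def band_ratio(rrs_709, rrs_620):
    """Return Rrs(709) / Rrs(620) for one sample."""
    return rrs_709 / rrs_620


def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(x, y):
    """Coefficient of determination of the least-squares line of y on x."""
    slope, intercept = fit_line(x, y)
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Hypothetical reflectances (sr^-1) and PC concentrations (µg/L).
rrs_709 = [0.010, 0.013, 0.016, 0.021]
rrs_620 = [0.010, 0.010, 0.011, 0.012]
measured_pc = [12.0, 31.0, 48.0, 80.0]

ratios = [band_ratio(a, b) for a, b in zip(rrs_709, rrs_620)]
slope, intercept = fit_line(ratios, measured_pc)
print(f"PC = {slope:.1f} * ratio + {intercept:.1f}")
print(f"R2 = {r_squared(ratios, measured_pc):.2f}")
```

The same `fit_line` and `r_squared` helpers could be applied to any other algorithm value in place of the band ratio, which makes tuning results such as those in Table 8 directly comparable.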
The poor performance of both algorithms for GR was also observed for SIM05 (Figure 5), HUN08, MIS14, and LIU17 (Table 3), meaning that none of these remote sensing algorithms provided accurate estimates of PC for GR; this requires further explanation.
Hunter et al. [41] showed that an increase of sediments in the water column can enhance the prominence of the chl-a and PC absorption features and result in a shift in the Rrs spectra. In the Rrs spectra of the GR 2006 dataset, the chl-a absorption feature was observed to shift to approximately 680 nm and the PC absorption feature to approximately 635 nm. Therefore, it is believed that the high turbidity caused by dredging GR affected the spectral estimation of PC. In the 2005 dataset for GR, the average TSS concentration was 15.79 mg/L. After dredging started in 2006, the average TSS was 17.40 mg/L in 2006 and 23.55 mg/L in 2007, indicating an increase in TSS concentration from 2005 to 2007. Given that phytoplankton was part of TSS in this study, the ratio of chl-a concentration (µg/L) to TSS concentration (mg/L) was computed for these datasets to determine the relative dominance of phytoplankton in TSS. The ratio was 0.89, 4.25, and 6.32 µg/mg for 2005, 2006, and 2007, respectively, suggesting that dredging the reservoir led to an increase in chl-a concentration as a result of the resuspension of nutrients trapped in the sediment. Based on the observation that changing the spectral bands used in the formulation of OGA19 for the 2006 dataset improved the PC estimates (R² of 0.43), we conclude that a high sediment load in the water column shifted the absorption features and caused the poor performance of all algorithms when applied to the 2006 GR dataset. However, errors in sample collection and in the analysis of PC concentration cannot be ruled out.
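The chl-a:TSS ratio used above is a simple unit-aware division: µg/L divided by mg/L gives µg of chl-a per mg of suspended solids. The sketch below shows one way to compute a dataset mean. It assumes the mean is taken over per-sample ratios, which the text does not specify (a ratio of dataset means is the other possibility); the function names and sample values are hypothetical.

```python
def chl_to_tss_ratio(chl_ug_per_l, tss_mg_per_l):
    """Return chl-a per unit TSS in µg/mg (µg/L divided by mg/L)."""
    if tss_mg_per_l <= 0:
        raise ValueError("TSS concentration must be positive")
    return chl_ug_per_l / tss_mg_per_l


def mean_ratio(samples):
    """Mean of per-sample chl-a:TSS ratios for (chl-a, TSS) pairs."""
    ratios = [chl_to_tss_ratio(chl, tss) for chl, tss in samples]
    return sum(ratios) / len(ratios)


# Hypothetical (chl-a in µg/L, TSS in mg/L) pairs for one dataset.
samples = [(40.0, 16.0), (75.0, 18.0), (90.0, 20.0)]
print(f"mean chl-a:TSS = {mean_ratio(samples):.2f} µg/mg")
```

A rising ratio from year to year means chl-a grew faster than total suspended solids, which is the pattern the 2005 to 2007 values describe.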
Both the band ratio algorithm and OGA19 were validated on the 2010 datasets for the three central Indiana study sites. Figure 8 shows the scatterplots of measured and remotely estimated PC for ECR (Figure 8A), GR (Figure 8B), and MR (Figure 8C). OGA19 outperformed the band ratio with a lower RMSE and MAE for ECR and GR (Table 9); however, for MR, the band ratio algorithm had a lower RMSE and MAE than OGA19. This relatively weak performance of OGA19 for MR mirrors the earlier case in which MIS14 yielded a lower RMSE and MAE than OGA19.
To gain insight into these results, we examined the PC:chl-a ratio of the samples (Table 1). OGA19 showed a stronger relationship for ECR, in which chl-a dominated over PC (mean PC:chl-a = 0.95), and for GR, where PC concentrations were marginally higher than chl-a concentrations (mean PC:chl-a = 1.17). On the other hand, the MR dataset had PC concentrations higher than chl-a concentrations (mean PC:chl-a = 1.31), and for this dataset MIS14 achieved the lowest RMSE value. These results indicate that OGA19 performed strongly at a low (e.g., ECR) or intermediate (e.g., GR) PC:chl-a ratio. However, for a high PC:chl-a ratio (e.g., MR), OGA19 did not perform as well as MIS14 (Table 3) or the band ratio (Table 8).
The absorption of phytoplankton is not equivalent to the combination of the computed absorptions of isolated pigments. A complete compensation for the effect of phytoplankton pigments (other than chl-a) on the absorption at 620 nm was not possible in this study, but the fact that OGA19 performed well on the datasets collected at the three study sites in different years indicates that OGA19 was less prone to the effect of other pigments than the other remote sensing algorithms. This analysis suggests that OGA19, which was developed to remove the chl-a interference from the remote estimation of PC, overestimates the interference of chl-a when PC concentrations are high, causing an underestimation of the PC concentration (Figure 4 and Figure 8). This occurs because of the structure of the algorithm (Equation (14)), in which PC is estimated by subtracting the chl-a influence at 620 nm. When PC is higher than chl-a, the contribution of chl-a at 620 nm can be negligible and the correction for the chl-a influence is not needed. Therefore, MIS14 and the band ratio yielded a lower RMSE and MAE than OGA19 for the MR validation dataset, as opposed to the tuning results. These results corroborate the idea that the variation of PC:chl-a is an important factor influencing the performance of remote sensing algorithms [12]. It is important to highlight that OGA19 performed better at low and medium PC:chl-a levels and may therefore be more applicable for early warning of cyanobacterial blooms, because most algorithms do not perform well at low PC concentrations [26].
4.3. Comparison between OGA19 and SIM05 based on Different achl-a (620) Values
In this study, we proposed a new method for calculating the aPC (620) based on the computation of the achl-a at 620 nm and aPC at 665 nm (Equations (8) and (9)). Simis et al. [9] proposed the calculation of achl-a at 620 nm and assigned 0.24 as the value of ε. In this study, we used 0.2215 to relate achl-a at 665 nm to achl-a at 620 nm (Figure 2A). To evaluate whether this value was appropriate, we compared the sensitivity of OGA19 and SIM05 to both values, 0.2215 (φ1 in OGA19) and 0.24 (ε in SIM05). Table 11 summarizes the R² results of this comparison.
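To give a sense of scale for the two coefficients, the sketch below compares the achl-a(620) values they produce from the same achl-a(665). It assumes a simple proportional relation, achl-a(620) = factor × achl-a(665), as the text describes; how this term enters Equations (8), (9), (16) and (17) is not reproduced here, and the function names and the input absorption value are hypothetical.

```python
PHI_1 = 0.2215    # coefficient used in this study (OGA19)
EPSILON = 0.24    # coefficient assigned by Simis et al. (SIM05)


def a_chl_620(a_chl_665, factor):
    """Scale chl-a absorption at 665 nm to 620 nm (assumed proportional)."""
    return factor * a_chl_665


def relative_difference(value, reference):
    """Signed fractional difference of value with respect to reference."""
    return (value - reference) / reference


a_chl_665 = 1.0  # hypothetical chl-a absorption at 665 nm, in m^-1
with_phi = a_chl_620(a_chl_665, PHI_1)
with_eps = a_chl_620(a_chl_665, EPSILON)

print(f"achl-a(620) with 0.2215: {with_phi:.4f} m^-1")
print(f"achl-a(620) with 0.24:   {with_eps:.4f} m^-1")
print(f"relative difference: {relative_difference(with_phi, with_eps):.1%}")
```

Under this proportional assumption the relative difference does not depend on the chosen achl-a(665): the 0.2215 coefficient always gives a chl-a term about 7.7% smaller than the 0.24 coefficient.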
The results presented in Table 11 show that, across the eight datasets, OGA19 with φ1 = 0.2215 had the best performance for five of them and OGA19 with φ1 = 0.24 had the best performance for two, whereas SIM05 had the best performance for only one dataset. SIM05 improved on five datasets when ε was set to 0.2215. In contrast, OGA19 had a degraded performance on five datasets when φ1 was set to 0.24. The most notable improvement was observed for the 2007 ECR dataset, for which the original SIM05 had an R² of 0.16 and an RMSE of 20.70 µg/L, while SIM05 with ε set to 0.2215 had R² and RMSE values of 0.34 and 20.06 µg/L, respectively. Except for this dataset, the differences between SIM05 and OGA19 on the other datasets were not significant. Therefore, the value used to relate achl-a at 665 nm to achl-a at 620 nm is not a major contributor to the different performances of SIM05 and OGA19.