1. Introduction
Measurement of surface temperature (ST) is critical for a variety of Earth science applications, e.g., monitoring potential climate change [
1,
2,
3], detecting areas of drought [
4,
5,
6], predicting areas of vector-borne diseases [
7], and measuring evapotranspiration [
8,
9,
10]. Several algorithms necessary to deliver accurate ST products have been developed for existing imaging systems (e.g., MODIS, AVHRR, ABI, VIIRS and most recently, TIRS [
11,
12,
13,
14]). As technology advances and sensor systems are designed with increased sensitivity, there is a fundamental need among the scientific community to drive down errors in satellite-derived surface temperature measurements. Some ST users desire that spaceborne temperature products be retrieved to within an accuracy of 1 K or better [
15]. With an increased demand of ST product accuracy comes the apparent need for validation of these products. In the early 2000s, a comprehensive validation of MODIS ST products was conducted using a worldwide ground-based instrumentation network. The measurements acquired during this campaign were compared to MODIS-derived ST measurements to characterize the fidelity of the product [
16]. The expense involved in such a campaign is not easily repeatable, pointing to the increased need for a simpler validation tool.
In January 2018, the Committee on Earth Observation Satellites (CEOS) Working Group on Calibration and Validation (Land Product Validation Subgroup) identified six existing ground-based networks that could potentially be used to assess the fidelity of ST products [
17]. The outcome documented by this working group of thermal experts led to three findings. Firstly, the identified networks are too sparse (in a spatial sense) to validate a global product. Secondly, the instrumentation used are not ideal for ST validation as these sites were not developed specifically for this application. Thirdly, expanding the number and quality of ground-based reference data is imperative to ensuring high-quality, well-characterized surface temperature products [
17].
Nevertheless, several efforts in recent years have taken advantage of the National Oceanic and Atmospheric Administration (NOAA) Surface Radiation budget (SURFRAD) network for ST product validation [
11,
12,
13,
14]. SURFRAD is a network of seven sites in different climate regions of the United States that were surveyed for spectral uniformity over a 10 km radius [
18]. The surface temperature at each site is calculated from data measured by two Epply pyrgeometers. One pyrgeometer faces toward the Earth and a second points up toward the sky where they record the surface upwelled and sky downwelled thermal irradiance, respectively. The pyrgeometers consist of a temperature-controlled thermopile sensor that records over a broad range of the electromagnetic spectrum (4
m to 50
m). The sampling rate is once per second but the data is smoothed over a three-minute window to dampen high-frequency fluctuations in the measured signal [
18]. A silicon dome is attached to the outside of the pyrgeometer to protect the thermopile from wind, which may significantly impact measured temperature [
18].
Due to the broadband spectral nature of the pyrgeometer, two potential issues may arise. Firstly, solar reflected radiance in the mid-wave infrared may affect the surface temperature estimation. Secondly, the emissivity is not completely defined from 4 to 50
m for most materials, so the impact of emissivity uncertainty on the final recorded temperature is not apparent. Biases and residual errors reported in recent validation efforts may be attributed to these instrument limitations in conjunction with the spatial non-uniformity of several SURFRAD sites as observed by the spaceborne sensor. An inconsistent ST bias ranging from
K to 2 K and a consistent standard deviation of over 2 K are widely reported in the literature [
11,
12,
13,
14]. Given the current ground-based instrumentation, validation efforts will not be able to decouple residual error due to uncertainty in the spaceborne instrument, the algorithm used to derive ST, and the ground-based system used for validation. As such, the need for more advanced ground-based equipment is apparent to properly assess ST product fidelity. The work conducted here by the Rochester Institute of Technology (RIT) focused on the design and implementation of a small, low-cost, field-deployable prototype radiometer. The prototype radiometer was designed to serve as a validation tool for surface temperature products derived from Landsat 8 Thermal Infrared Sensor (TIRS) image data and, as such, contains two spectral channels that mirror the Landsat 8 thermal bands. By windowing the wavelengths of interest, incident solar radiation is filtered and measurements can be acquired over a spectral range where emissivity is well-defined. This paper discusses the radiometer’s lab-based characterization, field-implementation, and validation efforts against Landsat’s ST product.
2. Methodology
The radiance recorded by a sensor viewing the Earth contains contributions from both the target and the atmosphere. Mathematically the sensor-reaching radiance (
) can be expressed as
where
is the emissivity of the target,
is the blackbody radiance,
is the downwelled atmospheric radiance,
is the upwelled atmospheric radiance, and
is the atmospheric transmission (note that all variables are band-effective values). When a sensor is in close proximity to the target (3 m in this study), the upwelled atmospheric radiance (
) is zero and the atmospheric transmission (
) is equal to one. Additionally, the sensor must be calibrated in order to relate the raw detector output to at-aperture spectral radiance.
2.1. Design and Instrumentation
Similar to the pyrgeometers used in the SURFRAD network, the prototype radiometers developed here take advantage of thermopile technology. A thermopile is an electric device that records voltage as a function of the temperature difference between two thermocouples [
19]. One thermocouple remains at a known temperature while the other is sensitive to, and changes temperature with, incoming thermal radiation [
19]. For the prototype described here, ST-60 thermopiles were obtained from Dexter Research Center with spectral band-limiting filters to resemble the Landsat TIRS sensor bands shown in
Figure 1. One band (Channel A) is centered at 10.6
m and the other (Channel B) is centered at 12.3
m, both having a 52-degree field-of-view. Thermopiles were chosen primarily due to their flexibility, i.e., they are customizable to the spectral windows of interest so that band-specific emissivities can be utilized in the measurement process. The specifications of both the SURFRAD pyrgeometer and the prototype radiometer can be seen in
Table 1. The major difference between the two instruments is the spectral range to which the instruments are sensitive; the prototype radiometer is windowed to be more like Landsat.
An environmental sensor was incorporated into the radiometer package to collect temperature, humidity, and atmospheric pressure data. These data are useful for atmospheric characterization and for redundancy. An off-the-shelf Bosch BME 280 sensor was selected as the environmental sensor due to its low cost and ease of integration. The accuracy of the BME sensor is ± 0.5 K for temperature and ± 3% for relative humidity. The completed instrument, protected by a waterproof case, can be seen in
Figure 2.
2.2. Lab Based Characterization
The radiometer records voltage for each channel along with the temperature of the detector (
) and outputs from the BME sensor. For the prototype radiometer to validate the Landsat 8-derived ST product, the output channel voltage must first be related to target temperature. The output voltage to temperature relation is completed in the lab environment by using a known temperature and emissivity source. A Santa Barbara Infrared (SBIR) Infinity differential blackbody was used in the lab effort to characterize the prototype sensor. The blackbody head has an 8.05-inch square aperture surface, a spectral emissivity of greater than 0.995 (8–14
m), and is controllable over a temperature range of 248–348 K with an accuracy of 0.01 K [
20]. Once the radiometer voltage-to-temperature relationship is determined, it is used for all further measurements to convert output signal into the target temperature. Additionally, the instrument performance was characterized in the lab to determine sensitivity to environmental conditions and noise. Further characterization was then performed in real-world conditions in the field.
2.2.1. Instrument Calibration
Each prototype radiometer must be characterized and calibrated before it can be implemented in the field. An empirical relationship, derived by Dexter Research, is used to relate output voltage measurements to the target temperature, as shown in Equation (
2) [
21].
where:
= output voltage from the radiometer (Ch. A or Ch. B)
= 0.995 (SBIR blackbody)
= Temperature of the source (SBIR blackbody)
= Temperature of the optics (Assumed to be equal to )
= Temperature of the detector measured by a thermocouple (same as optics)
n, F, F1 = constants (n is the power law, and F/F1 are dependent on geometry)
Calibration coefficients for Equation (
2) (n, F, and F
) were developed by conducting temperature sweeps of the blackbody from 273 K to 318 K at 1 K increments in which radiometer data was acquired at each increment. Since
is measured by an internal thermocouple, and the emissivity and temperature of the lab blackbody are known, the three coefficients are solved by using a least-squares regression. Once the coefficients are derived, they remain fixed and the temperature of an object can be determined using Equation (
3), as long as the object’s band-effective emissivity is known.
A dwell test was performed to estimate the prototype radiometer’s change in radiance corresponding to one unit of change in temperature, known as Noise-Equivalent Change in Temperature (NE
T), at three blackbody temperatures (283, 293, 303 K). A final measurement at 283 K was repeated at the end of the test to characterize potential drift in the system. Referring to
Figure 3, measurements were obtained for each temperature for approximately one hour and the NE
T was calculated as the standard deviation of the collected data at each temperature.
Table 2 shows the average difference between the predicted versus actual blackbody temperatures (in column 2) and the corresponding NE
T values (in column 3) for the prototype radiometer.
Table 2 illustrates that, in this lab test, the prototype radiometers are able to measure the blackbody temperature to within 1.28 K of the actual temperature and exhibit an NE
T of approximately 0.20 K across a range of temperatures (283, 293, 303 K).
Figure 3 also shows that the system is stable, based on the repeatability of predicting the 283 K temperature at the end of the test.
2.2.2. Environmental Effects
Initial field experiments were conducted with the lab characterized prototype units in the Spring of 2019. As seen in
Figure 4a, a large variation in measured temperature was recorded while viewing a grass field target. This phenomenon was not observed in the lab, indicating that environmental parameters were potentially impacting the measured temperatures. To understand and potentially mitigate these effects, a series of lab tests were conducted where the radiometer was subjected to wind, heat, and vibration while staring at a 303 K blackbody.
Wind was introduced to the setup by using a fan to blow air directly over the front of the sensor. The fan was turned on and off intermittently to simulate a variable breeze and left on to simulate a constant breeze. Both cases introduced significant variations in the estimated temperatures. The noise in
Figure 4a also indicated that solar loading was a potential contributor to temperature measurement uncertainty. To simulate solar loading in the lab setting, an external heat source was introduced in close proximity to the front of the radiometer. The addition of heat to the front of the sensor changed the response initially but quickly settled out. The final test on the radiometer was a vibration test. While staring at the blackbody, the radiometer was introduced to a series of vibrations of varying intensity and duration. No correlation was established between the vibration and variation in temperature calculation.
The SURFRAD pyrgeometers design includes two features to combat the wind and solar effect. Firstly, the pyrgeometer has active heating or cooling to keep the thermopile at a near-constant temperature, eliminating the solar loading effect [
18]. Instead of using active heating and cooling to reduce the solar loading effect, the prototype radiometer was wrapped in insulation to help stabilize its internal temperature. Active heating and cooling of the unit requires extra power, which in turn requires a larger battery or constant power source, both not desired for the initial version of this remotely deployed unit. Secondly, on the exterior of the pyrgeometer, the sensor is covered with a clear silicon dome to protect the sensor from wind effects [
18]. The prototype radiometer was fitted with a plastic cone around the sensor to help block the wind. Different sizes of cones were tested and the final design included a cone around the entire unit, rather than just the thermopile sensor.
As seen in
Figure 4a, without wind protection and solar loading reduction, the temperature prediction varied by as much as 20 K from sample to sample. After the modifications for wind protection and temperature stabilization were included (
Figure 4b), the temperature variation between samples reduced to approximately 5 K.
Lab and field testing (as seen in
Figure 4b) have also shown that it takes approximately 50 min for the electronics of the system to reach thermal equilibrium. The thermistor on the thermopile will not settle out until the electronics have reached thermal equilibrium.
2.2.3. Validation of Temperature Prediction
Once the radiometer was calibrated and characterized, its performance was compared to a commercially available FLIR infrared camera. The FLIR camera (model A6751sc SLS) is documented to be accurate within 2 K [
22]. The blackbody was set to 303 K and both instruments recorded data for six minutes. The FLIR predicted the temperature as 303.54 K while the prototype radiometer predicted the temperature to be 302.50 K. Comparing the prototype radiometer to the cooled FLIR camera validated that the prototype radiometer has accuracy in line with commercially available instruments, and the process to derive the coefficients in Equation (
2) was sufficient for determining the target temperature.
2.3. Target Emissivity
To accurately predict the temperature of the surface using Equations (
1) and (
3), the emissivity of the target is required. The emissivity of the target material, which in this initial study was grass, was calculated using the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) data from EOS-1 (Terra), specifically the ASTER global emissivity dataset (ASTER-GED). The ASTER-GED consists of emissivity maps at 100 m spatial resolution from data acquired between 2000 and 2008. The emissivity of the target was estimated by establishing an empirical relationship between the effective emissivity for each band of the prototype radiometer and ASTER Band 13 (10.25–10.95
m) and Band 14 (10.95–11.65
m) respectively using the natural material emissivity data provided by the ICESS group from the University of California Santa Barbara [
23,
24,
25]. The emissivity of the geographic location of the deployed radiometer was then calculated from the empirical relationship and used in Equation (
1).
2.4. Atmospheric Study
To calculate the sensor reaching radiance from Equation (
1), the downwelled atmospheric radiance (
) must be understood. Unlike the SURFRAD radiometers, the prototype radiometer does not record the downwelled irradiance. To determine the effect of downwelled radiance, an extensive atmospheric modeling study was conducted using the MODTRAN atmospheric radiance modeling tool by varying three independent variables, in turn creating 1020 atmospheric profiles. The first variable was the overall atmospheric profile using the MODTRAN pre-defined atmospheres of mid-latitude summer, mid-latitude winter, and tropical. The second variable altered the amount of column water vapor (CWV) over the target, varying from 0.0 to 3.2 g/cm
in 0.1 g/cm
steps. The final variable tested the effect of the sensor distance to target varying the altitude from 1.21 to 121 m by 12 m increments. The maximum altitude was selected based on Federal Aviation Administration (FAA) drone regulations, in the event that the radiometer was flown on a drone to increase ground spot size.
Since the surface temperature and emissivity are unknown for the target, a range of temperatures from 250 K to 320 K and an emissivity range from 0.905 to 0.984 were used to calculate the surface leaving radiance of the target. The temperature range was selected due to possible target temperatures in the field (spanning from snow-covered to desert targets), and the emissivity range was determined by the band effective emissivity using the 113 natural material emissivity curves provided by the ICESS group from the University of California Santa Barbara [
23].
Placing the sensor at its intended height (3 m), the MODTRAN simulation confirmed that the upwelled radiance was near-zero (
= 0.001
) and transmission values were approximately 1 (
= 0.999). Using the downwelled radiance values from MODTRAN, the sensor-reaching radiance was calculated via Equation
1, and the percentage of downwelled radiance in the total radiance was calculated to determine the overall effect.
Figure 5a,b depict the downwelled radiance contribution as a percent of the total sensor-reaching radiance. From the initial MODTRAN run of 1020 atmospheric profiles, it was apparent that the pre-defined atmospheres (mid-latitude summer, mid-latitude winter, tropical) and height of the sensor had little impact on the overall percent of downwelled radiance (less than 1%). Therefore, only the results for the mid-latitude summer atmosphere (most closely represented the RIT field collect site) and a sensor height of 3 m are displayed in the figures. The column water vapor was found to be the largest influence on the downwelled radiance contribution. The percent of downwelling radiance is shown as a function of column water vapor and as a function of target emissivity and temperature. The downwelled radiance results from the maximum (bottom curve) and minimum (top curve) target emissivities bound the range of possible emissivity values. However, from
Section 2.3 the expected emissivity values for the target field for this study are 0.970 for Channel A and 0.962 for Channel B (closer to the maximum emissivity curve).
To estimate the error in surface temperature prediction by not accounting for downwelling radiance, the surface temperature in Equation (
1) was first calculated from the total sensor reaching radiance, then re-calculated by removing the downwelled contribution. When removing the highest percentage of downwelled radiance (lowest target temperature, highest CWV amount, and minimum emissivity value), the predicted temperature difference was 3.16 K. As mentioned, the intended target for this initial study is a grass field (
= 0.970). Therefore, when downwelled radiance is not accounted for, the predicted surface temperature error is 1.59 K.
Based on the MODTRAN study, the error associated with the ST prediction from the prototype radiometers, even without accounting for the downwelled radiance, are small enough to be a significant improvement over the SURFRAD sites. This omission of the downwelled radiance component removes the need for a dedicated downwelled sensor or for characterizing the atmosphere between the target and the sensor, but this effect will be re-visited for future versions of the radiometer.
2.5. Field-Based Experiments
Once the prototype radiometer was characterized in the lab, field experiments were conducted concurrently with Landsat 8 overpasses. An open grass field at the southern end of the RIT campus in Rochester, NY was selected due to its similarity to the SURFRAD site in Goodwin Creek, MS. The field is assumed to be approximately uniform over a 400-m area.
Due to orbital dynamics, Landsat 8 will pass over the same point on the Earth every 16 days. However, the RIT site is located in an overlap region of two adjacent Landsat paths (Path 17/Row 30 and Path 16/Row 30) enabling twice the measurements within the 16 day period (See
Figure 6). Before each Landsat pass, the instrument is placed in the field looking nadir at the surface of the Earth. The instrument is mounted on a crossbar between two tripods to ensure that the field of view is not obstructed and that no shadows are cast onto the target area. The height of the instrument can vary, but preliminary testing showed no significant difference in temperature prediction by differing the height range from 0.3 m to 3 m. The current setup has a height of 3 m and a second portable field unit is also available with a height of 1 m.
Figure 6 shows the setup of the RIT test rig and the portable test rig, with a radiometer attached.
The surface temperature is calculated using Equation (
3) with the calculated emissivity, derived n, F, and F1 coefficients, and the output voltages from the thermopile (the output voltages are timestamped to easily match up the Landsat overpass). For the initial study of the prototype radiometer, the final target temperature is taken as an average of the measured temperatures from the two bands.
3. Results and Discussion
To date, 21 measurements concurrent with Landsat 8 overpasses have been collected with the prototype unit.
Figure 7 shows the radiometer-measured vs. Landsat-derived surface temperature displayed by triangles. The average difference between the measured temperature and Landsat 8-derived ST is 1.37 K with a standard deviation of 1.34 K. For reference, the SURFRAD-measured temperatures at the Goodwin Creek, MS site (data overlaid in
Figure 7 as blue dots) had an average difference of 3.52 K with a standard deviation of 2.16 K as compared to the Landsat-derived ST. Goodwin Creek was chosen as the comparison site as its grass field is very similar to the RIT measurement location. Note that both data sets include a variety of cloud conditions with the pixel of interest (target) never being obstructed by clouds.
Two outlying points from the prototype radiometer are highlighted in
Figure 7 with an “x”. Both points resulted in a temperature prediction error over 4 K when compared to the Landsat 8-derived ST. On these collects, the winds were particularly high, confirming that wind will affect the accuracy of the radiometer. The high magnitude of error from the high wind suggest the need for a more advanced wind protection solution on future radiometers. The two outlier points are included in the overall average difference between measured and predicted temperature. If the outliers are not included, the average difference between measured temperature and Landsat 8-derived ST for the prototype radiometer is 0.99 K with a standard deviation of 0.87 K.
The point at a Landsat 8-derived surface temperature of 322.7 K, shown as a solid triangle, was collected by the radiometer in the Mojave desert California rather than the RIT site. The sand and sparse vegetation of the Mojave desert landscape is much different than the field at RIT (grass) so new emissivity coefficients were derived using the ASTER-GED database (
Section 2.3). Testing against a different target than the RIT field and getting a temperature error of 1.49 K, displayed the feasibility of using the radiometer in multiple environments over different types of surface targets. The surface temperature of the Mojave site (50
C) was outside the range of the radiometer lab calibration (0–45
C), demonstrating that the radiometer can be accurate over a larger temperature range.
The radiometer measurements can be used to adjust the calibration of the Landsat 8 TIRS bands.
Figure 8 shows the radiometer-measured temperature versus the Landsat 8-derived surface temperature with a 1:1 line drawn for reference. Another aspect of the radiometer data from
Figure 8 is the wide temperature range (273 K to 323 K) of the ground targets. The low cost of the prototype radiometer allows for units to be fielded in multiple climate zones around the United States, which requires that the radiometer accurately predict the ST over a large range of temperature values. As the prototype radiometer’s predictions are closely related to the one-to-one line, there is confidence in the radiometer’s ability to perform over a wider temperature range than initially tested. The error bars shown in
Figure 8 were calculated using the maximum error in temperature prediction due to the uncertainty of the target emissivity. The target emissivity range of 0.936 to 0.981 was found by calculating the band effective emissivity over 67 natural materials as referenced in
Section 2.4, and an emissivity error of 0.05 corresponding to a temperature prediction error of 1.05 K.
This initial data set demonstrates that the prototype radiometer is statistically outperforming the pyrgeometers at the Goodwin Creek SURFRAD site as compared to Landsat 8-derived ST (lower average difference and smaller standard deviation as shown in
Section 3). The studies referenced in
Section 1, in conjunction with these results, indicate that incorporating narrow-band spectral windows enable better estimates of emissivity, which reduces variation in surface temperature measurements. Future iterations of these radiometers, however, can incorporate three improvements to reduce overall temperature prediction errors. The characterization of emissivity using various thermal band combinations has been rigorously studied within the scientific community [
3,
27,
28]. An accurate emissivity measurement, using the Temperature Emissivity Separation (TES) algorithm, is defined to be within 0.015 using a minimum of three to four spectral bands [
28]. To eliminate the reliance on ASTER data and calculate the emissivity of the target with the radiometer, the next radiometer unit is designed to incorporate four additional response bands in the 8 to 9
m range. A four-channel thermopile was constructed by Dexter Laboratories [
19] and will be integrated with the existing two band thermopile creating a six band radiometer. Using the calculated emissivity from the six-band radiometer, an updated MODTRAN simulation would produce a better estimate of the downwelled radiance. Applying this modeled downwelled radiance value to Equation (
1), rather than using a value of zero, could result in lower overall temperature prediction error. Lastly, based on the two outliers from the field-collected data, further research of lowering wind interference across the sensor is required. The current wind protection device offers marginal improvement, whereas a sealed design similar to SURFRAD with high thermal transmittance should be investigated.
4. Conclusions
The number of scientific fields incorporating the application of ST products continues to increase. With this increase of users, Landsat and other Earth-observing platforms are pushed to deliver a more accurate ST product. The current ground-based network used to validate ST products (SURFRAD) collects measurements over a broad spectrum and is accurate to within 4 K of Landsat 8-derived ST [
29]. The need for a ground network that is more accurate than the 2 K accuracy of the Landsat split-window ST algorithm [
24] is apparent and was the focus of this study.
In this paper, the possibility of using low-cost, portable field radiometers to verify the Landsat 8-derived ST product was explored. This was accomplished by fabricating a prototype radiometer using a suite of low power electronics and a thermopile with narrow spectral bands that are similar to the TIRS sensor bands on Landsat 8. Lab characterizations resulted in the radiometer’s ability to predict the temperature within 1.28 K of a known temperature source, with an NET of 0.20 K. Preliminary field testing with concurrent Landsat 8 overpasses produced temperature results within 1.37 K of the Landsat 8-derived ST product. The prototype radiometer’s ability to accurately predict the target temperature, along with the low cost of production, makes it a strong candidate for building a ground-based validation network over a range of surface types and emissivities.
Continued testing of the prototype radiometer is necessary for determining the temperature prediction accuracy in different climate zones and target materials (e.g., sand, rocks, mixed vegetation). To date, the prototype radiometer has predicted the temperature of high emissivity targets. Future planned field campaigns will test the prototype radiometer against low emissivity targets, such as sand, to determine the accuracy of the temperature predictions over a range of emissivity values. This data is crucial for determining where to build a network of radiometers to validate the Landsat-8 ST product.
A future effort includes testing the ability of the six band prototype radiometer to predict the accurate surface temperature and emissivity measurements. Windowing the spectral response coupled with a more accurate emissivity measurement shows promise that the prototype radiometer has the ability to validate Landsat 8-derived ST products lower than the stringent 1 K requirement [
15]. If successful, a network of six band radiometers could measure the temperature and emissivity of a ground-based target, ensuring that users receive a more accurate worldwide Landsat ST product.