Specific Direction-Based Outlier Detection Approach for GNSS Vector Networks
Next Article in Journal
Dynamic Flying Ant Colony Optimization (DFACO) for Solving the Traveling Salesman Problem
Next Article in Special Issue
Experimental Evaluation of UWB Indoor Positioning for Indoor Track Cycling
Previous Article in Journal
A PSO-Based Uneven Dynamic Clustering Multi-Hop Routing Protocol for Wireless Sensor Networks
Previous Article in Special Issue
Background and Recent Advances in the Locata Terrestrial Positioning and Timing Technology
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Specific Direction-Based Outlier Detection Approach for GNSS Vector Networks

College of Surveying and Geo-informatics, Tongji University, Shanghai 200092, China
*
Author to whom correspondence should be addressed.
Sensors 2019, 19(8), 1836; https://doi.org/10.3390/s19081836
Submission received: 20 March 2019 / Revised: 15 April 2019 / Accepted: 16 April 2019 / Published: 17 April 2019
(This article belongs to the Collection Positioning and Navigation)

Abstract

:
In this paper we propose an outlier detection approach for GNSS vector networks based on the specific direction (i.e., SD approach), along which the test statistic constructed reaches the maximum. We derive the unit vector of this specific direction in detail, and prove that the unit vector is the same as that determined by the outlier estimates in three-dimensional (3D) approach, while the distribution of the maximum test statistic in this direction is the square root of Chi-squared distribution. Therefore, eliminating an outlier along this specific direction can get the same result as that of eliminating all three components of outlier vector in 3D approach. The mathematical equivalence of SD approach and 3D approach is further demonstrated by a real GNSS network. Moreover, preliminary application of the SD approach to detect the abnormal antenna height measurement is carried out in terms of numerical simulations of multiple baseline solutions, and it shows that the SD approach can effectively detect baselines that are directly infected by corresponding receiver antenna height errors.

1. Introduction

When a weight matrix is chosen as the inverse of observables’ covariance matrix, the weighted least-squares (WLS) estimation is the best linear unbiased estimator (BLUE), assuming that no outlier exists. However, outliers can inevitably occur in practice and cause the optimal feature of such estimation loss [1,2,3]. Therefore, outliers must be detected and then eliminated as soon as possible. Baarda [4] first introduced ‘data-snooping’ for detecting outliers in geodetic networks, where outliers are identified one by one based on the test statistic of single outlier detection approach. The test statistic can be constructed according to various statistical distributions, e.g., standard normal distribution, τ -distribution, and F-distribution [4,5,6,7], and one can determine the existence of outliers by a comparison with correspondent critical value at a given significance level [8,9]. Three types of errors, i.e., rejecting the right observation (type I error) and accepting the wrong observation (type II error) as well as locating the outlier to right observation (type III error) [10,11,12,13], are inevitably encountered in outlier detection processes. Therefore, the reliability theory is of fundamental importance in outlier detection, the content of which has been extended from the case for single outlier [4] to multiple outliers [7,14,15] and from independent observations to correlated ones [16,17]. According to the reliability theory, once the possibility of type I and type II errors is given, the Minimal Detectable Bias (MDB) and Bias-to-Noise Ratios (BNR), defined respectively as the measures of internal reliability and external reliability of geodetic networks, are uniquely determined [18,19,20]. Both MDB and BNR reflect the characteristics of a geodetic network to resist outliers, and the BNR shows impact of non-detected outlier on the final solution [21], which can be reduced or eliminated by robust methods via iteratively reweighting of observations [2,22,23,24,25]. The outlier detection and reliability theory has already been widely applied for the three-dimensional (3D) networks of Global Navigation Satellite System (GNSS) [26,27,28,29,30,31,32,33].
At present, outlier detection for 3D GNSS vector networks is performed either at one-dimensional (1D) baseline component level or 3D baseline vector level [34], and based on a fundamental set of independent loops [35,36] or by adding a redundancy constraint [37]. Different from many other geodetic measurements, the baseline vector observations of GNSS networks are not directly observed, but derived from the pseudo-range or carrier-phase measurements [38]. Hence, there can be various outliers’ sources in GNSS networks such as satellite orbit error, impractical tropospheric model, wrong measurement of GNSS antenna height, antenna centering and positioning, etc. These factors usually come from a specific direction in the space, and have varying effects on all coordinate components of baseline vectors [39]. Supposing a GNSS baseline vector is spoiled by the outlying antenna height measurement of a station, this outlier would be detected with a higher probability when the least-squares residuals are projected to the vertical direction of the station. Therefore, outlier detection in GNSS networks can be conducted by searching a specific direction, so that the test statistic for outliers along this direction reaches maximum. However, due to the existence of random observation errors, this specific direction is certainly deflected from the true one. How can we determine such a specific direction for outlier detection of a certain baseline vector? Further, once the outlier in this direction is detected, what is its impact on the other two coordinate components orthogonal to this specific direction? These issues motivate the following investigation of the paper.
The rest of this paper is organized as follows. Section 2 briefly reviews the traditional outlier detection approaches in GNSS vector networks and derives in detail the mathematical formulas of the SD approach. Section 3 presents the results of applying ‘data snooping’ to a real GNSS network by 1D, 3D, and the proposed Specific Direction-based (SD) approach, which validates the effectiveness of the SD approach. In Section 4, we apply the SD approach for the detection of wrong GNSS antenna height measurement in terms of numerical simulations of multiple baseline solutions. Section 5 gives the concluding remarks.

2. Methodology

2.1. Traditional Outlier Detection Approach for GNSS Vector Observations

Since it is usually impossible to predetermine the number of outliers, hypothesis testing is practically applied by iteratively removing the wrong observation corresponding to the largest test statistic until no statistic exceeds the critical value. If a single outlier i occurs at the ith observation, the general linear or linearized observation equation is expanded to
y = A x + h i i + ε
where y is a 3m × 1 vector of observations with m the number of baselines, A is a 3m × n design matrix with full column rank, x is an n × 1 vector of unknown parameters to be solved, h i = ( 0 ï 0 1 0 ï 0 ) T is a 3m-dimensional zero vector with only the ith element equaling to one and ε is an 3m × 1 vector of observation error with the covariance Σ . When the weight matrix in WLS is taken as P = σ 0 2 Σ 1 with σ 0 2 the prior variance factor, the least-squares estimate of the outlier ^ i and its variance σ ^ i 2 are as follows
^ i = ( h i T P Q v v P h i ) 1 h i T P Q v v P y = ( h i T P ¯ h i ) 1 h i T P ¯ y ,       σ ^ i 2 = σ 0 2 ( h i T P ¯ h i ) 1
where P ¯ = P Q v v P is called the reliability matrix of the observations [20], and Q v v = P 1 A ( A T P A ) 1 A T is the cofactor matrix of residuals vector v = Q v v P y . Then the w-test statistic for the ith observation is formulated as [4,7]:
w i = ^ i σ ^ i = h i T P ¯ y σ 0 ( h i T P ¯ h i )
If there is no outlier in the ith observation, w i follows the standard normal distribution, i.e., w i N ( 0 , 1 ) . Once the significance level α 0 of the test is given, the critical value of test statistic is uniquely determined by the distribution function. If the absolute value of w i is larger than the critical value, the estimated outlier ^ i by Equation (2) is assumed significant and thereby an outlier is detected in the ith observation. If the variance factor σ 0 2 in (3) is unknown and substituted with its unbiased estimator σ ^ 0 2 , then the test statistic (3) follows the τ -distribution with r − 1 degrees of freedom with r being the redundancy number of observations [5,6]. In particular, for uncorrelated observations case, test statistic (3) can be simplified as
w i = v i σ 0 q v i v i
where v i and q v i v i denote the ith element of the residuals vector v and the ith diagonal element of cofactor matrix Q v v in (2), respectively.
The Minimal Detectable Bias (MDB), as a measure of internal reliability, of the ith observation for single outlier case is expressed as [4,7]:
MDB ( i ) = δ 0 σ 0 h i T P ¯ h i
where δ 0 is the non-centrality parameter, which is uniquely determined by the size of type I error α 0 and type II error β 0 [40].
For the GNSS baseline networks, it is reasonable to treat the baseline vector observations in triples manner because the three components of a baseline vector are computed together by the same GNSS observations and are naturally correlated. Once an outlier occurs, all three components would be impacted. Therefore, the 3D outlier detection approach is intuitively developed specifically for the GNSS baseline vector applications [34]. To describe the 3D approach, the observation vector and design matrix in (1) is partitioned as
y = ( y 1 y 2 y m ) ,      A = ( A 1 A 2 A m ) ,      ε = ( ε 1 ε 2 ε m )
where y j and ε j are the 3 × 1 vectors of the jth baseline observation and observation error, A j is the jth 3 × n design sub-matrix. When the ith observation vector y i contains a 3D outlier vector d i , Equation (1) is rewritten with (6) as
y j = A j x + H j d i + ε j ,     j = 1 , 2 , , m
and H j = { 0 3 ,   for   j i I 3   for   j = i , where 0 3 is a 3 × 3 zero matrix and I 3 is a 3 × 3 identity matrix. The WLS estimation d ^ i of 3D outlier vector is derived from (7) as
d ^ i = P ¯ i i 1 j = 1 m P ¯ i j y j
in which, P ¯ i j denotes the ij-th 3 × 3 sub-matrix of the reliability matrix P ¯ . To determine whether or not the 3D vector of outliers exists, the test statistic T i is constructed by
T i = d ^ i T P ¯ i i d ^ i / 3 σ 0 2
If there are no outliers, T i is central F-distributed with two degrees of freedom as 3 and at given α 0 , i.e., T i F ( α 0 ; 3 , ) . If the variance factor σ 0 2 in (9) is unknown, then T i can be re-constructed following the central F-distribution of F ( α 0 ; 3 , 3 m n 3 ) if no outlier exists according to [6] (p. 302).

2.2. Specific Direction-Based (SD) Approach for GNSS Vector Observations

2.2.1. Outlier Detection in SD Approach

Supposing the outlier’s coefficient matrix of (7) is defined as H j = { 0 3 ,   for   j i u i k   for   j = i , where u i k represents the 3D directional cosines relative to three Cartesian coordinate axes and 0 3 is a 3D zero vector, similar to (3) the test statistic for the ith baseline vector observation at the kth direction of is constructed as
w i k = u i k T j = 1 m P ¯ i j y j σ 0 u i k T P ¯ i i u i k = u i k T g i σ 0 u i k T P ¯ i i u i k = g i T u i k σ 0 u i k T P ¯ i i u i k
where g i = j = 1 m P ¯ i j y j is a 3D vector and P ¯ i j is the ij-th 3 × 3 sub-matrix of the reliability matrix P ¯ . For different directions u i k , the testing values w i k in (10) are also different.
Accordingly, the outlier detection and identification should focus on finding the specific unit direction vector, supposing u i 3 , that enables the largest test statistic for (10). This can be solved by the following target function Φ ( u i 3 )
max : Φ ( u i 3 ) = w i 3 2 = ( g i T u i 3 ) 2 σ 0 2 u i 3 T P ¯ i i u i 3
For a local maximum of the target function above, its first order partial derivative must equal to zero, i.e.,
Φ ( u i 3 ) u i 3 = 2 ( g i T u i 3 ) [ ( u i 3 T P ¯ i i u i 3 ) g i T ( g i T u i 3 ) u i 3 T P ¯ i i ] σ 0 2 ( u i 3 T P ¯ i i u i 3 ) 2 = 0
The solution of (12) is
u i 3 = u i 3 T P ¯ i i u i 3 g i T u i 3 P ¯ i i 1 g i     and     g i T u i 3 = 0
where the matrix P ¯ i i 1 denotes the inverse of P ¯ i i . When g i T u i 3 = 0 , the test statistic of (10) gets the minimum value, which is not the right solution we are looking for. Since the scalar factor u i 3 T P ¯ i i u i 3 / g i T u i 3 in (13) does not impact the direction of the unit direction vector u i 3 , the first equation of (13) is simply equivalent to (14) for determination of a spatial direction
u i 3 = ± P ¯ i i 1 g i P ¯ i i 1 g i
with P ¯ i i 1 g i = g i T P ¯ i i 2 g i being the 2-norm of the vector P ¯ i i 1 g i . Here, by comparing (14) with (8), we can find that u i 3 and d ^ i are along the same direction, indicating the outlier vector derived from 3D approach can intrinsically determine the specific direction with maximum test statistic.
By taking the second order partial derivative to (12) and then substituting (14) into it, one can get
Φ 2 ( u i 3 ) u i 3 2 = P ¯ i i [ ( g i T P ¯ i i 1 g i ) I 3 P ¯ i i 1 g i g i T ] σ 0 2 g i T P ¯ i i 1 g i
where I 3 denotes the 3 × 3 identity matrix. Since the block matrix P ¯ i i is positive definite, if the matrix M i = ( g i T P ¯ i i 1 g i ) I 3 P ¯ i i 1 g i g i T is non-negative definite, the solution of (14) is the unique unit vector to get the local maximum test statistic, and it must be the global maximum one. By substituting (14) into (10), one can derive the maximum value as (16), which is utilized as test statistic in the SD approach
| w i 3 | = 1 σ 0 g i T P ¯ i i 1 g i
Considering g i = j = 1 m P ¯ i j y j , test statistic (16) can be rewritten with (8) as
| w i 3 | = 1 σ 0 ( P ¯ i i 1 g i ) T P ¯ i i P ¯ i i 1 g i = 1 σ 0 d ^ i T P ¯ i i d ^ i
Comparing (17) with (9), we can find that | w i 3 | 2 follows the Chi-squared distribution with 3 degrees of freedom. Thereby, the SD approach is mathematically equivalent to the 3D method and its critical value for | w i 3 | can be directly calculated by 3 F ( α 0 ; 3 , ) .

2.2.2. Outlier Elimination in SD Approach

If the maximum test statistic | w i 3 | by (17) is larger than its critical value, the 3D test statistic (9) will also be larger than its corresponding critical value and the whole ith baseline vector should be eliminated.
Evaluating 3D outlier estimates (8), it can be rewritten as
d ^ i = P ¯ i i 1 j = 1 m P ¯ i j y j = P ¯ i i 1 g i = P ¯ i i 1 g i P ¯ i i 1 g i P ¯ i i 1 g i = u i 3 d i 3
where d i 3 = P ¯ i i 1 g i , the outlier estimates at the other two directions ( u i 1 and u i 2 ) orthogonal to u i 3 must be zero. Therefore, in SD approach, if an outlier occurs at the ith baseline vector, the observational equation for eliminating the outlier along the specific direction u i 3 is expressed as
y i = A i x + u i 3 d i 3 + ε i ,      and y j = A j x + ε j ,                  j = 1 , 2 , , m ; j i
Then the estimates of parameter vector x and its variance can be derived via least squares adjustment. It was found that elimination of the outlier estimated in this direction will lead to the same results of elimination outlier vector d ^ i in the whole baseline vector as done in 3D approach, since the outlier scalar estimated in this specific direction u i 3 contains all the information content of 3D outlier vector d ^ i .

3. Outlier Detection and Elimination for Real GNSS Network

3.1. Data Description

The real GNSS network used in the following is shown in Figure 1 and its observation data set is given in Table A1 and Table A2 in the Appendix [41]. There are 8 sites and 16 baselines in this network, and the site N001 is fixed as known for the free network adjustment.
The square root of prior variance factor σ 0 is taken as 1 cm in the following data analysis, and probabilities of type I and type II errors are chosen as α 0 = 0.1 % and β 0 = 20 % respectively thereafter [7].

3.2. Specific Direction Validation

If the matrix M i = ( g i T P ¯ i i 1 g i ) I 3 P ¯ i i 1 g i g i T in (15) is non-negative definite, previous derivation guarantees the unit direction vector u i 3 determined by (14) is the specific direction for the ith baseline observation to achieve the maximum test statistic as expressed by (16). The unit direction vector u i k of an arbitrary direction in 3D space can be expressed as
u i k = ( cos ϕ cos λ cos ϕ sin λ sin ϕ ) T
where ϕ [ 90 , 90 ] and λ [ 0 , 360 ] are spherical coordinates. When ϕ and λ are fixed, correspondent test statistic of (10) at this direction is uniquely determined. Therefore, the specific direction, which (10) generates as the maximum test statistic, can be obtained by simply traversing the whole range space of ϕ and λ given a certain small step size. To validate the effectiveness of (14) and (16), statistic values (10) of the No. 1 baseline for all ( ϕ , λ ) direction combinations are calculated and plotted in Figure 2 given the 1 step size. It is shown that values in Figure 2 manifest a symmetric pattern with respect to the origin and there exist two maximum points of 1.4973 in two opposite directions, just corresponding to the positive and negative sign in (14). This value is slightly smaller than the maximum 1.4975, which is directly derived from analytical formula (16). The differences of the maximum test statistics by analytical formula (16) proposed in SD method and those by numerical traversal algorithm are shown in Figure 3 for 16 baseline observations, which indicate that the maximum test statistics derived from (16) are all slightly larger than those from the traversal method. It further proves the test statistic derived from (16) is the theoretical global maximum one, since the traversal method can only get an approximate maximum due to the step size limitation. Besides this, the matrices M i = ( g i T P ¯ i i 1 g i ) I 3 P ¯ i i 1 g i g i T corresponding to 16 baseline observations are all non-negative definite.

3.3. Outlier Detection

The test statistics of all baseline observations of the networks by SD, 3D, and 1D approach, calculated via (16), (9), and (3) respectively, are listed in Table 1, and the correspondent spherical coordinates denoting the specific directions in SD approach are also demonstrated. For a given α 0 = 0.1 % , the critical values of three approaches are shown in Table 2, which are respectively computed by the inverse of the standard normal cumulative distribution function N ( α 0 ; 0 , 1 ) for the 1D approach, by inverse of the central cumulative F-distribution function F ( α 0 ; 3 , ) for 3D approach, and by 3 F ( α 0 ; 3 , ) for SD approach. Besides this, test statistics larger than corresponding critical values are marked in bold font.
Since one outlier can pollute its neighboring observations and possibly causes their test statistic values exceed the critical value, the ‘data snooping’ procedure is iteratively conducted, i.e., detecting the outliers one by one. The largest test statistics among all baseline observations in each test step by different approaches are shown in Table 3 and those exceeding critical values are in bold font.
In Step 1, the outlier is detected at the No. 3 baseline by all three approaches, but the 1D method locates the outlier only at the Y-component of the baseline. Therefore, the No. 3 baseline observation should be discarded and the remaining data set is used to continue ‘data snooping’ procedure in Step 2. In Step 2, all test statistics are well lower than the corresponding critical values listed in Table 2; therefore, no outlier is detected, indicating that the current dataset is quite ‘clean’ and the ‘data snooping’ procedure can be terminated. However, three methods show obvious discrepancy at this step. By the SD and 3D method, the largest statistics are reached both at the No. 1 baseline. However, by the 1D method, the largest test statistic is located at the Z-component of the No. 9 baseline, indicating that the baseline-component-based method (1D approach) and the baseline-vector-based method (3D and SD approach) do not always lead to same results as already discussed by [34].

3.4. Outlier Elimination

After identifying the outlier observation, its influence on the final parameter estimation must be eliminated. Since the SD approach is mathematically equivalent to the 3D approach, it can be expected that eliminating the influence of outlier in specific direction (14) has the same effect as that in 3D approach by (8). Figure 4 shows the absolute differences of 21 coordinate parameters (7 unknown sites) estimated by SD and 3D approach after elimination of outliers’ influence respectively. The differences are ignorable and are merely caused by the limits of computer precision. Note that the values of parameters 12 and 13 are zero due to floating point number round-off and therefore not be presented in Figure 4. The final parameters estimation after outlier elimination of the No. 3 baseline by SD and 3D method are listed in Table 4.

4. Simulation Analysis for Detecting Abnormal GNSS Antenna Height Measurements

In this section, we adopt the SD approach to detect baseline vectors which are infected by abnormal antenna height measurement in the GNSS networks in terms of numerical simulations of multiple baseline solutions. The GNSS network used for simulations is based on Figure 1, which also consists of 8 sites and 16 baselines. Simulated baseline vector observations are generated in two steps, where firstly error-free baseline vectors are calculated by ‘true’ coordinates of each site, and secondly baseline vector noises are randomly generated according to corresponding baseline covariance matrix and then added to the error-free baseline vector. The simulation procedure is described in detail by [23], and in following simulations the estimated sites coordinates in Table 4, as well as the known coordinates of site N001 in Appendix A, are treated as the ‘true’ values for error-free baseline vector generation. Besides this, the covariance matrices in Appendix A are used to generate baseline vector noises as that in [23].
Assuming there are four GNSS receivers to carry out the measurement task of above-mentioned GNSS network, the surveying is divided into six observation sessions as arranged in Table 5 and note that only three receivers are used for sessions 2 and 5. In each session, there are at most three functional independent baseline vectors for the final network adjustment. Since the multiple baseline solutions are supposed, the baselines of each session must be stochastic dependent and the correlation coefficients from 0.2 to 0.3 between different baselines’ components are assumed during the construction of weight matrix. In the following simulations, it is assumed that the GNSS antenna height on site N006 is wrongly measured by 10 cm in observation session 2, which is possibly caused by, for example, the misreading of antenna height. Therefore, baselines No. 3 and No. 11 are directly influenced by the wrong N006 antenna height. The influence is introduced by upward continuation of the site N006 coordinates in the error-free baseline vector generation step of observation session 2, while for other sessions the coordinates of site N006 are still based on that in Table 4.
The numerical simulations are carried out 10,000 times, and for each simulation the baseline vector noises are newly produced while the 10 cm antenna height error of site N006 in session 2 is kept fixed. We use the SD approach to estimate the spatial direction, i.e., latitude and longitude, of possible outlier vector for each baseline by (14), and calculate the standard deviation (SD) of them with respect to the upward direction of site N006 by (21). In Equation (21), S D φ and S D λ stand for the SD values of latitude and longitude estimates respectively, and the upward direction of site N006 is ( φ N 006 , λ N 006 ) = ( 31.3 , 121.3 ) according to Table 4.
S D φ = i = 1 N ( φ i φ N 006 ) 2 N , S D λ = i = 1 N ( λ i λ N 006 ) 2 N     ( N = 10 , 000 )
Table 6 lists the SD values of both latitude and longitude estimates for all baselines as well as corresponding mean values over 10,000 simulations. As shown, the No.3 baseline reaches the best consistency in terms of outlier direction estimation with site N006’s antenna height direction, which is followed by the No.11 baseline for its smaller SD values compared to remaining baselines. Since the No. 3 and No. 11 are directly infected by the wrong antenna height of site N006 in session 2, the statistical results demonstrate that the SD approach can effectively determine the influence of wrong antenna height on baseline vectors in the GNSS network. Furthermore, we investigate the outlier direction estimates of other site N006-related baselines, which are observed in other sessions where the antenna height is correctly measured. Figure 5 shows the statistical distribution of outlier direction estimates for these baselines, from which we can obviously see more gathering outlier direction estimates for No. 3 and No. 11 baselines and smaller bias with respect to the antenna height direction of site N006 indicated by the corresponding red vertical line at each panel. Wih regard to the test statistics for each baseline observation calculated by (16), it turns out that among 16 baselines over 10,000 simulations, the No. 11 baseline reaches the maximum at about 99% times while the remaining part of the maximum values falls into the No. 3 baseline, which are all well beyond the critical value listed in Table 2. Therefore, it is possible to apply the SD method to detect the influence of wrongly measured receiver antenna height on baseline vectors in GNSS networks, which needs further investigation.

5. Conclusions

In this contribution, we proposed the specific direction-based outlier detection approach (SD approach), for 3D GNSS networks. By seeking the specific direction in the 3D space, the maximum test statistic of baseline vector observations is constructed. The analytical expression (14) is derived to directly obtain this specific direction and to construct the corresponding test statistic by (16). Compared to traditional 3D approach, the SD approach is derived from another point of view. It tests the baseline vector in a specific direction in which the outlier vector manifests the largest test statistic value. Evaluating (17) and (9) demonstrates that the two approaches are rigorously mathematically equivalent, while if readers want to directly investigate the spatial direction characteristic of outlier sources in the GNSS networks, the SD approach is preferred. A real GNSS network is processed to validate the effectiveness of the SD approach and the equivalence to the 3D method. Moreover, preliminary application of SD approach to detect the influence of wrong GNSS antenna height measurement on baseline vectors in the GNSS networks are carried out, which shows promising results and needs further investigation.

Author Contributions

Conceptualization, Y.S.; Methodology, Y.S. and L.Y.; Software, Y.N.; Supervision, Y.S.; Validation, Y.N.; Writing—Original Draft, Y.N. and Y.S.; Writing—Review & Editing, L.Y. and Y.S.

Funding

This research is funded by the National Key R&D Program of China (2017YFA0603103) and the National Natural Science Foundation of China (41731069 and 41504022).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The baseline vector observations and their covariance matrix of real GNSS networks are listed in Table A1 referred to in [41], where the accurate known site N001’s 3D coordinates are: X = 2830754.6300 , Y = 4650074.3450 , and Z = 3312175.0540 , with the unit of meter. The approximate 3D coordinates of seven unknown sites are listed in Table A2.
Table A1. Baseline vector observations and covariance matrix.
Table A1. Baseline vector observations and covariance matrix.
Bl.NumSta.PoEnd.Po Δ X   ( m ) Δ Y   ( m ) Δ Z   ( m ) Covariance Matrix ( × 10 6 )
1N002N001 1.5616
−119.8880516.6920−838.2730−1.2684 2.5332
−1.6092 1.6192 3.5764
2N003N001415.5670590.1690−484.37300.9704
−0.7912 1.5756
−0.9936 1.0044 2.2228
3N006N002596.3630391.2610−32.86500.8868
−0.7200 1.5160
−0.8576 0.9132 1.9000
4N002N003−535.4570−73.4720−353.89900.9180
−0.8868 2.1596
−0.2988 0.5916 0.9604
5N002N005384.0890−50.6680390.19801.0084
−0.9792 2.3960
−0.3232 0.6472 1.0364
6N003N004−650.3260−135.0610−362.49200.6004
−0.5796 1.3952
−0.1932 0.3832 0.6248
7N004N0011065.8940725.2290−121.88301.1984
−1.1608 2.7032
−0.3748 0.7292 1.1796
8N005N001−503.9770567.3630−1228.47101.0196
−0.9888 2.3016
−0.3172 0.6192 1.0156
9N005N008−1137.0770−983.7240405.97900.8212
−0.8012 1.9432
−0.2528 0.5148 0.8316
10N004N007−183.2910−458.9740478.21801.0352
−0.8360 1.4972
−0.7420 0.9900 1.3932
11N006N00360.9040317.7860−386.76100.9424
−0.7692 1.3572
−0.6940 0.9112 1.2836
12N006N004−589.4240182.7260−749.25201.1232
−0.9152 1.5996
−0.8368 1.0832 1.5328
13N006N005980.4510340.5890357.33701.3940
−1.13801.9908
−1.03961.35081.9056
14N006N007−772.7140−276.2480−271.03501.3328
−1.0844 1.8956
−0.9880 1.2792 1.8108
15N008N006156.6270643.1340−763.31901.2804
−1.0448 1.8336
−0.9552 1.2444 1.7568
16N008N007−616.0870366.8860−1034.35301.4576
−1.1908 2.1180
−1.0624 1.4064 1.9852
Table A2. Approximate coordinates of the solved-for sites.
Table A2. Approximate coordinates of the solved-for sites.
SiteX (m)Y (m)Z (m)
N002−2830634.74124649557.65143313013.3268
N003−2831170.19804649484.17733312659.4277
N004−2831820.52474649349.11663312296.9360
N005−2830250.65194649506.98123313403.5257
N006−2831231.10224649166.39103313046.1886
N007−2832003.81594648890.14273312775.1536
N008−2831387.72864648523.25653313809.5059

References

  1. Hawkins, D.M. Identification of Outliers; Chapman and Hall: London, UK, 1980; Volume 11. [Google Scholar]
  2. Hampel, F.R.; Ronchetti, E.M.; Rousseeuw, P.J.; Stahel, W.A. Robust Statistics: The Approach Based on Influence Functions; John Wiley & Sons: New York, NY, USA, 1986; pp. 307–341. [Google Scholar]
  3. Hekimoglu, S.; Erdogan, B.; Soycan, M.; Durdag, U.M. Univariate approach for detecting outliers in geodetic networks. J. Surv. Eng. 2014, 140. [Google Scholar] [CrossRef]
  4. Baarda, W. A Testing Procedure for Use in Geodetic Networks; Netherland Geodetic Commission: Delft, Switzerland, 1968; Volume 2, ISBN-13 9789061322092, ISBN-10 906132209X. [Google Scholar]
  5. Pope, A.J. The Statistics of Residuals and the Detection of Outliers; NOS 65, NGS 1; NOAA technical report; NOAA NOS: Rockville, MD, USA, 1976.
  6. Koch, K.R. Parameter Estimation and Hypothesis Testing in Linear Models, 2nd ed.; Springer-Verlag: Berlin, Germany, 1999. [Google Scholar]
  7. Teunissen, P.J.G. Testing Theory: An Introduction, 2nd ed.; Delft University Press: Delft, The Netherlands, 2006. [Google Scholar]
  8. Lehmann, R. Improved critical values for extreme normalized and studentized residuals in Gauss-Markov models. J. Geod. 2012, 86, 1137–1146. [Google Scholar] [CrossRef]
  9. Lehmann, R. On the formulation of the alternative hypothesis for geodetic outlier detection. J. Geod. 2013, 87, 373–386. [Google Scholar] [CrossRef]
  10. Koch, K.R. Deviations from the null-hypothesis to be detected by statistical tests. Bull. Géod. 1981, 55, 41–48. [Google Scholar] [CrossRef]
  11. Prószyñski, W. On outlier-hiding effects in specific Gauss-Markov models: Geodetic examples. J. Geod. 2000, 74, 581–589. [Google Scholar] [CrossRef]
  12. Gökalp, E.; Güngör, O.; Boz, Y. Evaluation of different outlier detection methods for GPS networks. Sensors 2008, 8, 7344–7358. [Google Scholar] [CrossRef]
  13. Gui, Q.; Li, X.; Gong, Y.; Li, B.; Li, G. A Bayesian unmasking method for locating multiple gross errors based on posterior probabilities of classification variables. J. Geod. 2011, 85, 191–203. [Google Scholar] [CrossRef]
  14. Knight, N.L.; Wang, J.; Rizos, C. Generalised measures of reliability for multiple outliers. J. Geod. 2010, 84, 625–635. [Google Scholar] [CrossRef] [Green Version]
  15. Yang, L.; Wang, J.; Knight, N.L.; Shen, Y. Outlier separability analysis with a multiple alternative. J. Geod. 2013, 87, 591–604. [Google Scholar] [CrossRef]
  16. Wang, J.; Chen, Y. On the reliability measure of observations. Acta Geod. Cartogr. Sin. 1994, 4, 42–51. [Google Scholar]
  17. Li, D.; Yuan, X. Error Processing and Reliability Theory, 2nd ed.; The Publishing House of Wuhan University: Wuhan, China, 2002. (In Chinese) [Google Scholar]
  18. Prószyñski, W. Criteria for internal reliability of linear least squares models. Bull. Géod. 1994, 68, 161–167. [Google Scholar] [CrossRef]
  19. Teunissen, P.J.G. Minimal detectable biases of GPS data. J. Geod. 1998, 72, 236–244. [Google Scholar] [CrossRef]
  20. Koch, K.R. Minimal detectable outliers as measures of reliability. J. Geod. 2015, 89, 483–490. [Google Scholar] [CrossRef]
  21. Teunissen, P.J.G. Distributional theory for the DIA method. J. Geod. 2018, 92, 59–80. [Google Scholar] [CrossRef]
  22. Huber, P.J. Robust Statistics; John Wiley & Sons: New York, NY, USA, 1981. [Google Scholar]
  23. Yang, Y.; Song, L.; Xu, T. Robust estimator for correlated observations based on bifactor equivalent weights. J. Geod. 2002, 76, 353–358. [Google Scholar] [CrossRef]
  24. Guo, J.; Ou, J.; Wang, H. Robust estimation for correlated observations: Two local sensitivity-based downweighting strategies. J. Geod. 2010, 84, 243–250. [Google Scholar] [CrossRef]
  25. Koch, K.R. Robust estimation by expectation maximization algorithm. J. Geod. 2013, 87, 107–116. [Google Scholar] [CrossRef]
  26. Teunissen, P.J.G.; Kleusberg, A. GPS for Geodesy, 2nd ed.; Springer: Berlin, Germany, 1998. [Google Scholar]
  27. Teunissen, P.J.G.; Montenbruck, O. Handbook of Global Navigation Satellite Systems; Springer: Cham, Switzerland, 2017. [Google Scholar]
  28. Kutterer, H. Quality aspects of a GPS reference network in Antarctica-A simulation study. J. Geod. 1998, 72, 51–63. [Google Scholar] [CrossRef]
  29. Even-Tzur, G. GPS vector configuration design for monitoring deformation networks. J. Geod. 2002, 76, 455–461. [Google Scholar] [CrossRef]
  30. Even-Tzur, G. More on sensitivity of a geodetic monitoring network. J. Appl. Geod. 2010, 4, 55–59. [Google Scholar] [CrossRef]
  31. Wu, J.; Chen, Y. Improvement of the separability of survey scheme for monitoring crustal deformations in the area of an active fault. J. Geod. 2002, 76, 77–81. [Google Scholar] [CrossRef]
  32. Wieser, A. Reliability checking for GNSS baseline and network processing. GPS Solut. 2004, 8, 55–66. [Google Scholar] [CrossRef]
  33. Aydin, C. Power of global test in deformation analysis. J. Surv. Eng. 2012, 138, 51–56. [Google Scholar] [CrossRef]
  34. Snow, K.B.; Schaffrin, B. Three-dimensional outlier detection for GPS networks and their densification via the BLIMPBE approach. GPS Solut. 2013, 7, 130–139. [Google Scholar] [CrossRef]
  35. Even-Tzur, G. Graph theory applications to GPS networks. GPS Solut. 2001, 5, 31–38. [Google Scholar] [CrossRef]
  36. Even-Tzur, G.; Nawatha, M. Gross-Error Detection in GNSS Networks Using Spanning Trees. J. Surv. Eng. 2016, 142, 04016003. [Google Scholar] [CrossRef]
  37. Koch, I.É.; Veronez, M.R.; da Silva, R.M.; Klein, I.; Matsuoka, M.T.; Gonzaga, L.; Larocca, A.P.C. Least trimmed squares estimator with redundancy constraint for outlier detection in GNSS networks. Expert Syst. Appl. 2017, 88, 230–237. [Google Scholar] [CrossRef]
  38. Klein, I.; Matsuoka, M.T.; Guzatto, M.P.; de Souza, S.F.; Veronez, M.R. On evaluation of different methods for quality control of correlated observations. Surv. Rev. 2015, 47, 28–35. [Google Scholar] [CrossRef]
  39. Leick, A.; Rapoport, L.; Tatarnikov, D. GPS Satellite Surveying, 4th ed.; John Wiley & Sons: New York, NY, USA, 2015. [Google Scholar]
  40. Aydin, C.; Demirel, H. Computation of Baarda’s lower bound of the non-centrality parameter. J. Geod. 2005, 78, 437–441. [Google Scholar] [CrossRef]
  41. Yu, H.; Shen, Y.; Yang, L.; Nie, Y. Robust M-estimation using the equivalent weights constructed by removing the influence of an outlier on the residuals. Surv. Rev. 2019, 51, 60–69. [Google Scholar] [CrossRef]
Figure 1. Shape of the GNSS network.
Figure 1. Shape of the GNSS network.
Sensors 19 01836 g001
Figure 2. The absolute test statistic values of the No. 1 baseline by numerical traversal algorithm.
Figure 2. The absolute test statistic values of the No. 1 baseline by numerical traversal algorithm.
Sensors 19 01836 g002
Figure 3. Differences between maximum absolute statistics by two calculation approaches.
Figure 3. Differences between maximum absolute statistics by two calculation approaches.
Sensors 19 01836 g003
Figure 4. The absolute value of differences between SD and 3D approach derived parameter estimation after outlier elimination. The values of parameter 12 and 13 are zero due to floating-point number round-off.
Figure 4. The absolute value of differences between SD and 3D approach derived parameter estimation after outlier elimination. The values of parameter 12 and 13 are zero due to floating-point number round-off.
Sensors 19 01836 g004
Figure 5. Statistical distribution of outlier direction estimates for site N006 related baselines. (a) for No.3 baseline, (b) for No.11 baseline, (c) for No.12 baseline, (d) for No.13 baseline, (e) for No.14 baseline and (f) for No.15 baseline (The red vertical lines stand for the latitude and longitude of site N006’s antenna height direction.).
Figure 5. Statistical distribution of outlier direction estimates for site N006 related baselines. (a) for No.3 baseline, (b) for No.11 baseline, (c) for No.12 baseline, (d) for No.13 baseline, (e) for No.14 baseline and (f) for No.15 baseline (The red vertical lines stand for the latitude and longitude of site N006’s antenna height direction.).
Sensors 19 01836 g005
Table 1. Test Statistics of baseline observations by Specific Direction-based (SD), 3D, and 1D approach.
Table 1. Test Statistics of baseline observations by Specific Direction-based (SD), 3D, and 1D approach.
Baseline Num.SD Approach3D Approach1D Approach
λ (Deg.) ϕ (Deg.)Test StatisticsTest StatisticsXYZ
15.8 118.5 1.4980.748 0.469 1.031 0.743
2−17.7 307.7 1.7300.997 0.908 0.742 0.518
352.7 210.0 4.3786.3882.395 3.4692.305
43.2 268.1 2.3161.788 1.262 2.313 0.699
534.7 267.7 2.9822.964 0.937 2.568 2.162
627.2 156.2 1.6040.858 1.422 0.670 0.287
761.5 327.9 1.7681.042 0.866 0.278 1.647
8−34.2 148.0 1.9931.324 1.425 0.101 1.527
983.0 213.3 2.6852.403 0.151 1.229 2.648
10−63.4 130.8 1.0000.333 0.375 0.496 0.975
1118.0 63.6 0.7120.169 0.608 0.588 0.083
12−19.3 344.5 2.0141.352 1.939 0.847 0.203
130.3 118.2 1.5420.792 0.308 1.184 0.990
14−5.7 315.9 0.5430.098 0.349 0.217 0.339
1570.2 141.1 1.9311.243 0.127 0.788 1.854
1666.8 140.2 0.7360.180 0.021 0.299 0.693
Table 2. Critical values of SD, 3D, and 1D test statistics.
Table 2. Critical values of SD, 3D, and 1D test statistics.
SD Approach3D Approach1D Approach
Critical Values4.0335.4223.291
Table 3. Largest test statistics in each outlier detection step.
Table 3. Largest test statistics in each outlier detection step.
Test StepBaseline Num.SD Approach3D Approach1D Approach
XYZ
134.3786.3882.395 3.4692.305
212.4131.9410.1012.1541.108
92.3071.7740.6560.7022.301
Table 4. Final sites’ coordinate parameters estimation after elimination of outliers determined by SD or 3D method.
Table 4. Final sites’ coordinate parameters estimation after elimination of outliers determined by SD or 3D method.
SiteX (m)Y (m)Z (m)
N002−2830634.74154649557.65083313013.3273
N003−2831170.19814649484.17753312659.4277
N004−2831820.52474649349.11693312296.9359
N005−2830250.65194649506.98143313403.5257
N006−2831231.10174649166.39133313046.1881
N007−2832003.81564648890.14303312775.1533
N008−2831387.72854648523.25693313809.5058
Table 5. Session arrangement for the observation of 16 baselines in the network by 4 GNSS receivers.
Table 5. Session arrangement for the observation of 16 baselines in the network by 4 GNSS receivers.
SessionReceiver StationBaseline No.
1N001, N002, N003, N0051, 2, 8
2N002, N003, N0063, 11
3N002, N003, N005, N0064, 5, 13
4N005, N006, N007, N0089, 15, 16
5N004, N006, N00710, 14
6N001, N003, N004, N0066, 7, 12
Table 6. Mean and SD values of outlier direction (latitude and longitude) estimates for all baselines (unit: degree).
Table 6. Mean and SD values of outlier direction (latitude and longitude) estimates for all baselines (unit: degree).
Baseline No.Mean Lat.Mean Long.SD Lat.SD Long.
122.7117.710.37.7
225.9114.17.68.5
331.0121.81.21.1
415.997.218.570.9
525.1118.86.53.0
626.1118.75.32.8
715.8113.616.08.9
817.7115.414.16.6
931.7123.12.53.0
1031.7122.42.02.3
1132.1120.31.21.4
1232.8122.51.91.7
1329.7121.42.01.2
1433.6123.93.23.5
1534.2124.73.84.3
1626.1129.713.249.7

Share and Cite

MDPI and ACS Style

Nie, Y.; Yang, L.; Shen, Y. Specific Direction-Based Outlier Detection Approach for GNSS Vector Networks. Sensors 2019, 19, 1836. https://doi.org/10.3390/s19081836

AMA Style

Nie Y, Yang L, Shen Y. Specific Direction-Based Outlier Detection Approach for GNSS Vector Networks. Sensors. 2019; 19(8):1836. https://doi.org/10.3390/s19081836

Chicago/Turabian Style

Nie, Yufeng, Ling Yang, and Yunzhong Shen. 2019. "Specific Direction-Based Outlier Detection Approach for GNSS Vector Networks" Sensors 19, no. 8: 1836. https://doi.org/10.3390/s19081836

APA Style

Nie, Y., Yang, L., & Shen, Y. (2019). Specific Direction-Based Outlier Detection Approach for GNSS Vector Networks. Sensors, 19(8), 1836. https://doi.org/10.3390/s19081836

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop