Aperiodic geometry design for DOA estimation using compressive sensing Sayed Zeeshan Asghar

advertisement
Aperiodic geometry design for DOA estimation
using compressive sensing
Sayed Zeeshan Asghar
Boon Poh Ng
School of Electrical and
Electronic Engineering
Nanyang Technological University
Singapore 639798
Email: SAYED1@e.ntu.edu.sg
School of Electrical and
Electronic Engineering
Nanyang Technological University
Singapore 639798
Email: EBPNG@ntu.edu.sg
Abstract—Antenna arrays used in compressive sensing based
DOA estimation algorithms are generated randomly to minimize
mutual coherence. This scheme suffers from practical limitations.
For an antenna array that is sufficiently random, some elements
of the array would almost always fall very close to each other,
which is not practicable. Rectangular arrays, although very uniform and practicable, suffer from poor performance when used
in compressive sensing algorithms that assume spatial sparsity.
Aperiodic arrays seem to offer a compromise solution. This paper
demonstrates that it is possible to design aperiodic antenna array
by using a simple disturbance optimization scheme, that can be
applied to multiple aperiodic array geometries. The optimization
scheme uses a few parameters to generate an aperiodic geometry.
We will also see that the optimized aperiodic array has better
performance than several other geometries studied in this paper
and performs very close to random array configuration.
I. I NTRODUCTION
The array geometry that is commonly used for Direction
of Arrival (DOA) estimation of signal sources is Uniform
Linear Array (ULA) or its 2-D counterpart i.e. rectangular grid
array. The neighbouring elements in these array geometries, on
each axis, are separated by a fixed distance; usually λ/2. This
configuration works well if we are looking for compression in
frequency domain. But if the design objective is to compress
the number of array elements, then this configuration is not
suitable. A random sampling of aperture is much more suitable
[7].
Herman et. al. [8] and Ender [9] studied application of
Compressive Sensing (CS) to the radar problem and gave some
initial results regarding the potential of using CS to address
DOA problem. Both of these papers however use a geometry
that is formed through random sampling of aperture. Usually,
in a randomly sampled aperture some of the elements would
lie too close to each other, which isn’t practically realisable.
The use of aperiodic geometry can reduce this problem
significantly and make the layout practically realisable. Fractal
geometries [11] have effectively been used in the design of
aperiodic arrays. These studies, however, utilize classical array
processing framework and much of the results may not be
applicable to compressive sensing scenario. In [5] Spence et.
al. introduces a design technique to generate antenna array
layouts from aperiodic tiling.
In this paper, we device an optimization technique, based
on CS, that generates an antenna array layout through a
disturbance of aperiodic tiling. This technique is well suited to
any array design methodology that can be generated through
geometrical transformations of a few initial points.
II. T HE MEASUREMENT M ATRIX
Suppose, there are m narrow-band signals with a known
centre frequency, fc , incident on an array of L sensors. We
assume that the signals originate from any of the N directions,
in which our DOA space is quantized. Then our received
signal, y, is given by
y = Ax.
(1)
y ∈ CL×1 is the measurement vector. x ∈ CN ×1 contains
source signals at the time instant t. This vector is assumed to
be sparse having just a few non-zero elements:
kxk0 = s,
(2)
where k.k0 denotes the number of non-zero elements or in
other words the sparsity level of the vector. A contains vectors
from N directions as its columns:
A = a0 . . ai . . aN −1 , i ∈ {0, ..., N − 1},
(3)
and the steering vector, ai , is given by
iT
T
T
1 h T
ai = ejkθi P = √ ejkθi p1 , ..., ejkθi pL ,
(4)
L
where pj is the three dimensional position vector for j th
sensor and kθi is the wave-number vector for source i. It is
given by kθi = 2π
λ vθi . vθi is a unit vector in the direction of
ith source.
The estimate of source vector, x̂, can be found by:
minimize
x
kxk0
(5)
subject to y = Ax,
But (5) is a combinatorial problem and is NP hard. Candes
and Romberg have proposed to use l1 −norm instead of using
l0 − norm. The theorem presented in [7] states that instead
of solving the problem given by (5), if we solve the problem
(this problem is called Basis Pursuit):
kxk1
minimize
x
(6)
subject to y = Ax,
(b) Type II
we are guaranteed to recover x with a very high probability.
This problem now becomes a linear programming optimization
problem which is relatively easy to solve.
If there is noise in the system the new model becomes y =
Ax + n. n ∈L×1 is a vector realization of random Gaussian
process. Candes and Romberg [7] suggested a solution to this
problem called Basis Pursuit deNoising (BPDN):
minimize
kxk1
subject to
ky − Axk2 < .
x
2
2
(1 − δk ) kxk ≤ kAxk ≤ (1 + δk ) kxk ,
(8)
for all x ∈ Rn such that kxk0 ≤ k.
One of the few matrices that observe this property is the
Fourier matrix [1]. Another significant parameter is the mutual
coherence, µ, which indicates the level of sparsity that can be
recovered with very high probability.
For the system y = ΦΨx, µ is defined as follows:
√
µ(Φ, Ψ) = n. max |hϕk , ψj i| .
(9)
1≤k,j≤n
If the number of measurements, m, for s-sparse vector obey
the following inequality
m ≥ Cµ(Φ, Ψ)2 s log(n),
(c) Type III
(7)
The measurement matrix, A, has a central importance to CS
theory. Its properties are crucial to the exact recovery of sparse
vectors from a very few observations. One important property
that this matrix has to observe is the restricted isometry
property [1] given by:
2
(a) Type I
(10)
for some positive constant, C, then the solution to this problem
is exact with “overwhelming probability” [1].
One such Φ, Ψ pair with low µ is the Fourier matrix and
the identity matrix pair. In our case, i.e y = Ax, where Φ =
A, Ψ = I.
Suppose there is a matrix B = AH A, the diagonal entries
of B must be unity as the vectors are normalized. The offdiagonal entries should be as close to zero as possible, to get
a lower value of µ. This depends on the column vectors of
matrix A. For a regularly sampled aperture the values of µ
are higher than that of irregular sampled aperture.
What we explore in the next section is a kind of aperiodic
geometry obtained by tiling of very simple base shapes. This
aperiodic structure was discovered by Roger Penrose in 1970
and has since been found in naturally occurring crystalline
materials as well [10]. This structure, being aperiodic, is
naturally sparse (average sensor spacing is greater than λ/2)
as it doesn’t contain any redundancies that are signature of
periodic arrays like rectangular arrays or its 1-D counterpart
i.e. uniform linear arrays.
Figure 1: Basic Danzer tiles
III. A PERIODIC T ILING
A 2-D aperiodic tiling consists of a collection of tiles that
divide the 2-D plane in such a way that it lacks horizontal
or vertical translational symmetry. A periodic tiling, however,
contains both horizontal and vertical translational symmetry. A
set of shapes, called prototiles, are used to populate the whole
2-D plane. Roger Penrose discovered a two-prototile set (kite
and dart) that could fill up a 2-D plane aperiodically, if the
tiles are placed together with certain predefined rules.
A robust iterative method called “inflation” is used for
tiling the plane aperiodically. A tile is first enlarged and then
subdivided into its constituent tiles. This process is repeated
several times iteratively until the whole plane is covered with
tiles. Figure 1 shows three prototiles know as Danzer tiles.
These three prototiles, through the process of inflation can
fill any 2-D plane aperiodically [3]. To form an antenna
array based on this geometry, we can place individual antenna
sensors at the indices of each triangle as shown in Figure 2.
IV. O PTIMIZATION SCHEME
The measurement matrix A contains an exponential at row
k, and column l
T
A(k, l) = ejk0 pk rl , k = {1, ..., L}, l = {1, ..., N },
(11)
where rl is the normalized direction vector for direction index
0
l and pk is the position vector for sensor k. k0 = 2πf
c .
Matrix B can be written as
B(u, v) =
L
X
T
T
ejk0 (pξ rv −pξ ru ) u, v ∈ {1, ..., N }.
(12)
ξ=1
There are N 2 entries in the matrix B. P is a 3 × L matrix
containing position 3-vectors of L sensors as its columns. We
choose three points p0 , p1 , p2 inside Danzer triangles of
Type-I, II and III, respectively. These initial three points pass
through a transformation matrix involving translation, rotation
and scaling sub matrices to form the next level Danzer triangle.
Now P = T p, where p = [pT0 pT1 pT2 ]T . We pose an
When u 6= v, we have N (N − 1) equations:
L
X
T
T
ejk0 ([T p]ξ rv −[T p]ξ ru ) = 0,
u 6= v.
(16)
ξ=1
For a particular u and v, u 6= v, the equation in expanded
form is
T
T
ejk0 ([T p]1 (rv −ru )) + ... + ejk0 ([T p]N (rv −ru )) = 0.
(17)
We can choose the arguments of exponentials (on a unit circle)
in such a way that their sum becomes equal to zero
ej1 + ej2 + ... + ejN = 0.
(18)
Then,
a)
[T p]T1 (rv − ru )
=
[T p]T2 (rv − ru )
=
.
1
,
k0
2
,
k0
.
.
[T p]TL (rv
b)
Figure 2: a) Level-3 Danzer aperiodic tiling b) Sensors array
antenna formed by placing sensors at the indices of each
triangle.
optimization problem:
minimize
kB − δIk2
subject to
Mu pu + Cu ≤ 0.
p
(13)
Here, Mu and Cu contains slopes and y-intercepts, respectively, of the base triangle boundaries. In its current form,
this problem does not seem to be solvable. What we do next
would pave the way for finding a solution to this problem.
Our objective is to bring B as close as possible to δI, subject
to certain conditions. There are N 2 equations to satisfy, apart
from the inequality conditions on p. Each equation looks like:
(P
T
L
jk0 ([T p]T
ξ rv −[T p]ξ ru ) = δ
if u = v
ξ=1 e
(14)
PL
T
T
jk0 ([T p]ξ rv −[T p]ξ ru )
= 0 otherwise.
ξ=1 e
When u = v, we have diagonal entries and the equations are:
L
X
ejk0 0 = δ.
(15)
ξ=1
This condition can be satisfied if we multiply each column of
A by Lδ .
− ru )
=
L
.
k0
There are L equations for each u, v pair. As [T p]Ti (rv −ru )
is a scalar, we can transpose it to get (rv − ru )T [T p]i or
equivalently (rv −ru )T [T ]i p, as the initial points denoted by p
remain constant, and only the transformation matrix changes.
As we have L such equations, we can write them in a matrix
form
T̂u,v p = ,
(19)
here T̂u,v = (rv − ru )T T and = [ k10 ... kN0 ]T , we have
L2 − L equations of the form T̂u,v p = for each u, v =
{1, 2, 3, ..., N }, u 6= v. All the combined equations could be
written into one huge matrix equation
Υp = E.
(20)


T̂1,2


.




.


 T̂1,L−1 


 T̂2,1 
 , E = [T T ...T ]T .
Here Υ = 


.




.




.




.
T̂L,L−1
The optimization problem now becomes (constrained leastsquared optimization problem):
minimize
kΥp − Ek2
subject to
Mu pu + Cu ≤ 0.
p
(21)
V. R ESULTS
We use Danzer level-2, type-I triangle as the base geometry.
There are L = 48 elements in this array. We can see the
array geometry in Fig. 3d. After optimizing the array geometry
according to the optimization problem given by (21), we get
the array shown in Fig. 3e. We compare these geometries with
three other given in Figs. 3 a, b and c. Fig. 3a shows an array of
48 elements, their positions were chosen randomly but with
a constraint that no two elements are closer to each other
than λ/3. This constraint was chosen to avoid the practical
difficulties of designing arrays that are too close to each other.
Another geometry, which is very common, is the rectangular
grid array, but its aperture has been chosen to be triangular to
match with the one chosen for Danzer array. It is shown in
Fig. 3c.
Next, We study different parameters for different geometrical configurations. There are three parameters: the minimum
nearest neighbour distance, dmin , the maximum nearest
neighbour distance, dmax and the average nearest neighbour
distance, dav , defined as following:
dmin = min(dn ),
(a)
(b)
(22)
n = 1, 2, ..., N
dmax = max(dn ),
(23)
n = 1, 2, ..., N
davg =
N
1 X
dn .
N n=1
(24)
Table I shows values of these parameters for the different
types of geometries studied. In the table, these values are
normalized through dividing each value by λ. Rectangulargrid array has all values equal to 0.5, since the separation
between the nearest neighbour for each sensor is fixed to λ/2.
For random array without any constraints dmin is the lowest as
compared to all other configurations, while dmax is the highest
as compared to the rest of the configurations. dav , the average
nearest neighbour distance, is comparable to rectangular grid
array. This shows that the utilization of aperture in this
configuration is not uniform as some of the array elements
are too close and some are too much far apart. The values
of parameters for random array configuration, but with a
Table I: Geometrical properties of different antenna array
designs. The minimum nearest neighbour distance(22). The
average nearest neighbour distance(24). The maximum nearest
neighbour distance(23)
Array Configuration
Random
Random - constrained
Danzer
Danzer - optimized
Rectangular grid
No. of
elements
48
48
48
48
179
dmin
λ
dav
λ
dmax
λ
0.0918
0.36
0.14
0.316
0.5
0.52
0.59
0.60
0.69
0.5
1.62
1.03
1.47
1.27
0.5
(c)
(d)
(e)
Figure 3: a) 48-element random array, b) 48-element constrained random array, c) 94-element triangular array (rectangular grid sampling), d) 48-element Danzer array, e) 48element optimized aperiodic array.
geometries, whether based on inflation process or some other
geometrical transformations (translation, scaling and rotation)
of base geometry.
Usually, aperiodic arrays are generated through either turning off array elements or disturbing their positions on a large
grid of array elements. Such schemes require optimization of
a large number of parameters. Our scheme requires a very
few parameters that needs optimization. This scheme can be
utilized in arrays that consist of several sub-arrays. Due to the
aperiodic structure, such arrays are inherently sparse and well
suited for applications that are using powerful capabilities of
compressive sensing algorithms that exploits sparsity in some
domain (in our case it is source-space.)
Figure 4: Mean Squared Error plot for different geometries.
constraint of λ/3 on minimum nearest neighbour separation,
are somewhat conservative as compared to completely random
configuration. For Danzer array configuration, the value of
dmin /λ is 0.14, greater than completely random configuration
but still less than desirable. For optimal configuration dmin is
quite close to λ/3 and dmax is close to 1.27λ. The value of
davg (0.69λ) for optimal configuration is largest as compared
to the rest of the configurations.
Next we compare the performance of each of these array
configurations in regard to estimating the direction of arrival of
three closely located sources. We use these array geometries
to estimate three sources, located at 19, 20 and 21 degrees
azimuth with amplitudes 1.5, 1.0 and 1.5, respectively; all at
the same elevation, 25◦ .
Fig. 4 shows the mean squared error (MSE) plot for all four
geometries using 50 iterations of monte carlo simulation. MSE
has been calculated using the following equation:
N
1 X
(xn − xnˆ)2 .
M SE =
N n=1
(25)
x̂ is the estimate of x, which is the output of the l1 minimization program (7).
We can see in Fig. 4 that the triangular array (with rectangular grid sampling, Fig 3c) has the largest MSE, followed by
Danzer array (Fig 3d), although the number of elements (179)
in the rectangular grid array is far greater than the number of
elements (48) in the Danzer array. Constrained random array
(Fig 3b) performed better than the Danzer array. Random array
configuration (Fig 3a) has slightly better performance than
optimized aperiodic array configuration (Fig 3e).
VI. C ONCLUSION
In this paper, we have introduced a design technique that
is capable of being used for generation of antenna sensor
layouts based on aperiodic tilings, specifically Danzer tilings,
in the context of compressive sensing. This technique generates antenna sensor layouts through a slight disturbance of
the inflation process through which aperiodic geometries can
be generated. We have devised an optimization scheme that
is generic in nature and can be applied to other aperiodic
R EFERENCES
[1] E.J. Candes, M.B. Wakin, “An Introduction To Compressive Sampling,”
IEEE Signal Processing Magazine, Vol. 25, No. 2, pp 21 -30, Mar 2008
[2] D. L. Donoho. M. Elad. and V. Temlyakov, “Stable recovery of sparse
overcomplete representation in the presence of noise,” IEEE Transactions on Information Theory, Vol. 52, No. 1, pp. 6-18, Jan 2006
[3] K.-P. Nischke and L. Danzer , ”A construction of inflation rules based
on n-fold symmetry” Discrete and Computational Geometry Volume 15,
Issue 2 , pp 221-236, Feb. 1996
[4] J. Chen and X. Huo, “Theoretical results on sparse representations of
multiple-measurement vectors,” IEEE Transactions on Signal Processing, Vol. 54, No 12, pp 4634-4643, Dec 2006
[5] T.G. Spence, and D.H. Werner, “Design of Broadband Planar Arrays
Based on the Optimization of Aperiodic Tilings,” IEEE Transactions on
Antennas and Propagation, Vol. 56, No. 1, pp. 76 - 86, Jan 2008
[6] K.-P. Nischke and L. Danzer , “A construction of inflation rules based
on n-fold symmetry,” Discrete & Computational Geometry Vol. 15, No
2 , pp 221-236, Feb 1996
[7] Emmanuel Cands and Justin Romberg, “Sparsity and incoherence in
compressive sampling,” Inverse Problems, Vol. 23, No. 3, 2007
[8] M. A. Herman, T. Strohmer, “High-Resolution Radar via Compressed
Sensing,” IEEE Transactions on Signal Processing, Vol. 57 , No. 6, pp
2275 - 2284, Feb 2009
[9] J. H. G. Ender, “On compressive sensing applied to radar,” Signal
Processing, Vol 90, pp 1402-1414, 2010
[10] Paul J. Steinhart, “Quasicrystals: a breif history of the impossible,”
Rendiconti Lincei, Vol 24, No. 1 Supplement, pp 85-91, Mar 2013
[11] D. H. Werner and S. Ganguly, “An overview of fractal antenna engineering research,” IEEE Antennas Propagation Magazine, Vol 45, No.
1, pp 38-57, Feb 2003
Download