Applied Fitting Theory VII Building Virtual Particles I Statement of the problem

advertisement
Paul Avery
CBX 98–38
June 8, 1998
Apr. 17, 1999 (rev.)
Applied Fitting Theory VII
Building Virtual Particles
I
Statement of the problem
In many physics analyses we encounter the problem of merging a set of particles into a single
particle so that this new “virtual” particle can be used directly in the analysis. For example, the
Ks parameters from the decay Ks → π +π − are commonly used, but we also need decays such as
D+ → K −π +π + or even complete decay chains, e.g.
B 0 → D*+π −
D*+ → D0π +
(1)
D0 → K −π +
where we want to (1) combine K −π + to form a D0 , (2) merge the D0 with a π + to make a D*+
and (3) combine the D*+ with a π − to make a B 0 . The basic idea is that once a virtual particle
has been built, we can forget about the original particles that went into it. However, the
procedure for building the new particle is complicated by the fact that the original particles have
measured parameters with associated error (covariance) matrices. The merging process has to
take this information into account in such a way that the merged parameters and their error
matrix provide the best information possible about the reconstructed particle.
In this note1 I will show how to calculate the track parameters and covariance matrix of a
particle by applying a vertex and possibly other constraints to a set of particles and then adding
together the 4–momenta of n “daughter” particles at the vertex point. The track parameters for
particle i are denoted by α i , where
p p p p = = E
x x y z xi
yi
zi
αi
i
i
i
i
i
i
1
This document can be found on the web on my fitting page at www.phys.ufl.edu/~avery/fitting.html
1
(2)
i.e., a 4–momentum and a position at which the momentum is evaluated. The full set of n tracks
is denoted by α, e.g.,
α α α= α 1
2
(3)
n
The initial covariance matrix for these particles is called Vα 0 . This has the block diagonal form
V
=
01
Vα 0
V0 2
V0 n
(4)
expressing the fact that the particles are initially independent. The track parameters for the virtual
pV
.
particle are called α V =
xV
II
Building a virtual particle with vertex constraints
Equations defining the virtual particle
The motion of a charged particle in an arbitrarily oriented, fixed magnetic field was described
in Appendix II in ref. [1]. A particle starting out with momentum p 0 at position x 0 will have a
momentum p at position x given by p = p 0 + a x − x 0 × h , where a = −0.299792458 BQ , where
1
6
B is the magnetic field in Tesla, Q is the charge of the particle and h is a unit vector in the
direction of magnetic field. The virtual particle parameters at the point x = x x , x y , x z can then
3
8
be calculated by adding the contributions from n tracks:
3
3z
3x
8 3
− x 8 − a 3x
− x 8 − a 3y
8
−x 8
−x 8
pV x = ∑ p0 i x + az i y0 i − x y − ayi z0 i − x z
i
pV y = ∑ p0 i y + a x i
i
pV z = ∑ p0 i z + a yi
i
0i
0i
z
zi
0i
x
xi
0i
x
y
EV = ∑ E0 i
i
xV = x
where, e.g., a x i = −0.299792458 Bx Qi , Qi is the charge on the particle and Bx is the x
component of the B field.
2
(5)
Applying the vertex constraint
The following is a brief summary of the derivation in ref. [1]. We assume that the constraint
equations can be linearized by expanding about convenient points α A and x A . Since the
covariance matrix of the vertex may be known in advance (sometimes known as a “prior”
covariance matrix), we write the overall χ 2 condition as
1
χ2 = α − α0
6 V 1α − α 6 + 1x − x 6 V 1x − x 6 + 2λ 0Dδα + Eδx + d5
−1
α0
T
−1
x0
T
0
0
T
(6)
0
where δα = α − α A and δx = x − x A . The terms represent, respectively, the contribution to the
χ 2 from the tracks, the vertex and the constraints. λ is the vector of Lagrange multipliers, one
per constraint, while x 0 and Vx 0 are the initial vertex location and its covariance matrix. If the
vertex location is unknown one can set its covariance matrix to a large value.
There are two constraint equations for each track. For example, in a solenoidal field
0 = px i ∆yi − p y i ∆xi −
0 = ∆zi −
pz i
az i
sin
−1
az i
2
4∆x
2
i
+ ∆yi2
9
3
8
a z i px i ∆xi + p y i ∆yi /
(7)
p⊥2 i
where ∆xi = x x − xi , etc. A more complicated pair of equations can be derived for B fields
oriented along other directions [1]. Whatever the B field orientation, the E and D matrices have
the simple form
E E E= E D
D=
1
D 1
2
n
D2
(8)
n
where each Ei is a 2 × 3 matrix and each Di is a 2 × 7 matrix. Each row of these matrices
corresponds to a single constraint.
To cast the χ 2 equation into a familiar form, we write it using the “extended” matrices
x
~ = α α
α 1
V
=
x0
Vα~ 0
n
giving the new equation
1
V01
~ −α
~
χ2 = α
0
V0 n
E
~ E
D=
E
1
D1
2
D2
n
6 V 1α~ − α~ 6 + 2λ 4D~ δα~ + d9
T
−1
~0
α
T
0
The minimization of this χ 2 with respect to all the variables has the solution [1]:
3
D (9)
n
(10)
α = α 0 − Vα 0 DT λ
x = x 0 − Vx 0 E T λ
1
= V 1Dδα
= 3DV D
6
λ = VD~ Dδα 0 + Eδx 0 + d = λ 0 + VDEδx
λ0
D
VD~
0
T
α0
+d
6
+ EVx 0 E T
8
(11)
−1
= VD − VD EVx E T VD
with the matrix VD defined by
V
=
D1
VD
VD 2
VD n
4
VD i = Di V0 i DTi
9
−1
(12)
The χ 2 is given by the simplex expression
3
8
χ 2 = λ T VD−~1λ = λ T VD−1 + EVx 0 E T λ
1
= λ T Dδα 0 + Eδx 0 + d
6
(13)
The covariance matrices are
3
Vx = Vx−01 + VE−1
8
−1
Vα = Vα 0 − Vα 0 DT VDDVα 0 + Vα 0 DT VDEVx ET VDDVα 0
0 5
(14)
cov α , x = − Vα 0 DT VDEVx
Solution for the virtual particle parameters
We are now ready to construct the virtual particle parameters and covariance matrix. The
parameters can be written in matrix form, following eqn. (5):
pV = Aα + Bx = ∑ Aiα i + Bx
i
(15)
xV = x
where
1
A = A1 A 2 0
∑ a
A 6 B=
−∑ a
0
−∑ az i
zi
n
and
4
yi
0
∑ ax i
0
∑ayi −∑ ax i 0
0
(16)
1
0
A =
0
0
0
1
0
0
i
0
0
1
0
0
0
0 −az i
0 ayi
1
0
az i
0
−ax i
−a y i
ax i
0
0
0
(17)
This part is easy since the track parameters are determined from the vertex constraint above. The
harder part is getting the covariance matrix right. It can be written as
Vα V =
V
cov1x
pV
V , pV
6
1
cov pV , x V
Vx V
6
(18)
From the definition of the virtual particle parameters in eqn. (15)
1
0 5
0 5
Vp V = AVα AT + A cov α , x BT + B cov x, α AT + BVx BT
6
0 5
cov pV , x V = A cov α , x + BVx
Vx V = Vx
(19)
Using the matrices in eqn. (14), we find the covariance matrix for the virtual particle
Vα V =
∑ 3A V
i
T
i 0 i Ai
8
− S1i VDiS1Ti + TVx TT
− Vx TT
where S1i = A i V0 i DTi , S3 i = A i V0 i DTi VD i Ei and T = − B + ∑ i S3i .
− TVx
Vx
(20)
The construction of the virtual particle, including track parameters and covariance matrix, is
now complete. The particle can be used freely in subsequent fits with the understanding that
when it is used in a fit, its daughters’ track parameters will not be updated. To make it possible to
update the daughter tracks when the virtual particle participates in a later fit, one would first have
to use eqns. (11) and (14) to recalculate the daughter parameters and their covariance matrices,
including the correlations between tracks introduced by the vertex constraint. This is not possible
in the KWFIT package, which does not keep track of inter-track correlations.
III
Building virtual particles when a subset of particles is verticized
Building virtual particles is more complicated than discussed above because we want the
flexibility of merging different kinds of particles, some of which have poor or no position
information and so cannot be easily used in a vertex constraint. I divide the particles into three
classes:
1. Particles which are used to determine the vertex.
2. Particles which are forced to pass through the vertex determined by the first class, but
which are not used to find the vertex. One might use soft pions from D*+ → D 0π + in this
way.
5
3. Particles which have no position information at all (such as π 0 s and photons) which only
contribute to the total 4-momentum.
It is necessary to re-derive the equations from the beginning to account for these changes. The
overall χ 2 condition for the vertex is, as before,
1
χ2 = α − α 0
6
T
1
6 1
Vα−10 α − α 0 + x − x 0
6
T
1
0
6
Vx−01 x − x 0 + 2λ T Dδα + Eδx + d
5
(21)
where this time only class 1 and class 2 particles are included. Care must be taken when taking
partial derivatives, however. Classes 1 and 2 will both be used in the ∂ / ∂ α i partials, since both
sets of particles must pass though the vertex, but only class 1 particles will participate in ∂ / ∂ x
because they alone determine the vertex. Taking partials with respect to x, α and λ in this manner
to minimize the χ 2 yields the following system of equations
1 6
1α − α 6 + D λ = 0
Vx−01 x − x 0 + E ′ T λ = 0
Vα−10
T
(22)
0
Dδα + Eδx + d = 0
where Ei′ = Ei for class 1 particles and Ei′ = 0 for all others. Solving for the unknowns α and x
yields
α = α 0 − V0 DT λ
x = x 0 − Vx 0 E′ T λ
1
6
= 3DV D + EV E′ 8
= V − V E3 V + V 8
λ = VD~ Dδα 0 + Eδx 0 + d
VD~
α0
D
(23)
T −1
T
x0
D
−1
x0
−1 −1 T
E′ VD
E′
where VE−1′ = E′ T VDE = E′ T VDE′ . This final equality follows from the definition of Ei′ . Also
note that VD~ is no longer symmetric, a fact which greatly complicates the matrix algebra.
After an absolutely incredible amount of matrix manipulation, the covariance matrices can be
shown to reduce to the following forms
3
Vx = Vx−01 + VE−1′
8
−1
Vα = Vα 0 − Vα 0 DT VDDVα 0 + Vα 0 DT VDEVx ET VDDVα 0
(24)
0 5
cov α , x = − Vα 0 DT VDEVx
which can be seen by inspection to be identical to those derived in the previous section, with the
4
exception that Vx is computed only from the class 1 tracks: Vx = Vx−01 + VE−1′
6
9
−1
.
The calculation of the virtual particle’s parameters and covariance matrix must include tracks
from all classes, including those from class 3, which were not used in the vertex calculation
above. We define the virtual particle parameters as before
pV = Aα + Bx = ∑i Aiα i + Bx
(25)
xV = x
where the sums now run over all particles and A and B are defined as before to be
0
a
A 6 B= ∑
−∑ a
0
1
A = A1 A 2 with
zi
n
1
0
=
A
0
0
0
1
0
0
i
0
0
1
0
0
0
0 −az i
0 ayi
1
0
V
cov1x
pV
V , pV
where, as before,
1
6
∑ayi −∑ ax i 0
0
0
az i
0
−ax i
−a y i
ax i
0
0
0
The covariance matrix is, by definition,
Vα V =
yi
−∑ az i
0
∑ ax i
1
cov pV , x V
Vx V
0 5
(26)
(27)
6
(28)
0 5
Vp V = AVα AT + A cov α , x BT + B cov x, α AT + BVx BT
6
0 5
cov pV , x V = A cov α , x + BVx
Vx V = Vx
(29)
Using (24) and collecting terms with the understanding that Di = 0 and VD i = 0 for class 3
particles, we find the covariance matrix for the virtual particle:
Vα V
3A V
= ∑
i
T
i 0 i Ai
8
− S1i VDiS1Ti + TVx TT
− Vx TT
− TVx
Vx
(30)
where S1i = A i V0 i DTi , S3 i = A i V0 i DTi VD i Ei and T = − B + ∑ i S3i . This is the identical result as
before, provided we use the proper values for the matrices. This algorithm is implemented in
KWFIT.
IV Building virtual particles incorporating additional constraints
Frequently we want to build virtual particles with the vertex constraint augmented by, say, a
mass constraint. The incorporation of additional constraints is not difficult at a fundamental
7
level. However, the equations derived in the last section do not work because the extra
constraints do not have the convenient block diagonal structure. Although one can include the
extra constraints simultaneously in the fit (see Appendix), it is far easier to adopt the two step
method described in Section V in ref. [1].
The strategy is to first build a virtual particle, using the standard vertex fit, and compute the
χ 2 . The additional constraints are then applied to the virtual particle itself, using one of the
constraints described in ref. [3]. The track parameters and covariance matrix are updated and a
new χ 2 is computed, which is added to the first one to form the total χ 2 . If the additional
constraints are such that they apply to the virtual particle parameters as a whole, then this two
step procedure is mathematically identical to utilizing all the constraints in one simultaneous fit,
as proven in ref. [1].
Note, however, that the input tracks are not updated by the two step procedure. To update the
track parameters using the extra information, we would have to update the track covariance
matrices after the initial vertex constraint, a process that would introduce correlations between
the tracks. This is not particularly difficult to work out, but it does not seem necessary at this
time to keep track of the updated input tracks.
8
Appendix
Directly Incorporating Additional Constraints in Vertex Fitting
We frequently want to build a virtual particle where a mass constraint is imposed in addition
to the vertex conditions. I argued in Section IV that the best way to handle this case is to build
the virtual particle first with only the vertex constraint and add the additional constraint
afterwards. This method is identical to putting all the constraints together and turning the crank,
and offers the additional advantage of returning two χ 2 values. However, for reasons of
mathematical completeness I decided to include here an algorithm for simultaneously (as
opposed to consecutively) adding additional constraints when building a virtual particle.
There is no fundamental problem in adding m additional constraints to the vertex fitting
problem. The straightforward solution was worked out in ref. [2] using the matrix D to express
the constraints. For n tracks, we have a problem with 2n + m total constraints, thus a
2 n + m × 2n + m matrix must be inverted. However, I show here an alternative method which
is computationally faster and only involves the inversion of an additional m × m matrix.
Again we expand the constraint equations about convenient values α A and x A . As in the
vertex only case, the constraint conditions are denoted H α, x = 0 , where H is a vector of length
0
5 0
5
0 5
m + 2n , and the non-vertex constraints are assumed to come first.
The χ 2 condition including all the constraints can be written in the standard linearized form:
1
χ2 = α − α 0
6
T
1
6 1
Vα−10 α − α 0 + x − x 0
6
T
1
0
6
Vx−01 x − x 0 + 2 λ T Gδα + Fδx + g
5
(31)
where δα = α − α A , δx = x − x A . F and G are matrices representing the partial derivatives of H
with respect to the vertex and track parameters, respectively, while g = H α A , x A is a column
1
6
vector representing the values of the constraints at the expansion point. These matrices can be
written
N E F = E E x
1
2
n
N
D
G=
α1
Nα 2 Nα n
1
D2
Dn
(32)
The D and E matrices are the partials of H (the vertex constraint portion) with respect to the
track and vertex parameters, respectively, as described in Section II. The N matrices are the
partial derivatives of H for the non-vertex constraints. Note that I include an N x term, allowing
the extra constraints to depend on the vertex position. The dimensions of these matrices are
tabulated below.
Ei
2×3
Di
2×7
9
Nx
m×3
Nαi
m×7
g
(m + 2n) × 1
F
(m + 2n) × 3
G
(m + 2n) × 7n
Again we proceed by defining extended matrices to incorporate vertex and track information,
e.g.,
x
~ = α α
α V
=
x0
1
Vα~ 0
n
V01
V0 n
N
E
~ G = E
E
Nα1 Nα 2 Nα n
D1
D2
Dn
x
1
2
n
(33)
The new χ equation can be written in these external variables as
2
1
~ −α
~
χ2 = α
0
6
T
1
6
4
~ ~
~ −α
~ + 2λ T G
Vα~−10 α
δα + g
0
~ and λ yields the solution [1]:
Minimizing this with respect to α
~T
~ =α
~ − V~ G
α
λ
α0
0
4
~ ~
λ = VG~ Gδα
0 +d
4
~
~
VG~ = GVα~ G T
0
9
9
(34)
9
−1
(35)
~
~
Vα~ = Vα~ 0 − Vα~ 0 G T VG~ GVα~ 0
4
~ ~
χ 2 = λ T VG~−1λ = λ T Gδα
0 +g
9
Implementing this solution requires that we invert a matrix of size (2n + m) × (2n + m). For 3
tracks with a mass constraint, the matrix is 7 × 7, which is starting to get a bit large. However, it
is possible to simplify the problem. We compute VG~ to be
N V N + ∑ N V N
E V N + D V N
=
E V N +D V N
~
NV N
NV D =~
DV N V T
x
x x0
VG~
1 x0
n x0
~0
α
~0
α
T
T
T
x
T
0
T
i αi 0i αi
T
1 01 α 1
n 0n
~0
α
−1
~
D
T
αn
N x Vx 0 E1T + N α 1V01D1T
E1Vx 0 E1T + VD−11
E n Vx 0 E1T
N x Vx 0 E nT + N α n V0 n DTn
E1Vx 0 E Tn
−1
E n Vx 0 E Tn + VDn
T −1
3
where VD−1i = Di V0 i DTi are 2 × 2 matrices and N = N x
10
8
Nα1 Nα 2 Nα n .
−1
(36)
Step aside and watch me work. To compute VG~ I use a matrix inversion technique derived in
the Appendix in ref. [2]. Consider a large matrix V which is split into a 2 × 2 block. Then it is
easy to prove that
V
−1
V
=
V
11
T
12
V12
V22
= S
−V V
−1
−1
−1 T −1
22 12 S
−1
− S −1V12 V22
−1
−1 T −1
−1
V22
V12 S V12 V22
+ V22
(37)
−1 T
V12 and V22 is invertible. You can verify this solution by multiplying the
where S = V11 − V12 V22
matrices together. The trick is to make the dimensions of V11 as small as possible while keeping
V22 invertible.
~ , which we inverted in eqn. (11), so we only need to
The lower block of VG~ is just VD−1
calculate an additional m × m matrix inversion. This yields, using the above theorem,
~
VN
− VN NVα~ 0 DT VD~
VG~ =
~
~
~
− VD~ DVα~ 0 N T VN VD~ + VD~ DVα~ 0 N T VN NVα~ 0 DT VD~
~
~
where VN−1 = NVα~ 0 N T − NVα~ 0 D T VD~ DVα~ 0 N T is the new m × m matrix.
(38)
Equations (34) and (37) constitute the solution to the vertex problem plus additional
constraints. It requires only one m × m matrix inverse in addition to what is required for the
vertex constraints. However, the calculation of the covariance matrix for the new particle (see
eqns. 17 and 18) is much more complicated than before, mostly because VG~ cannot be expressed
in a simple way. For now, I see no way of easily computing the covariance matrix, even if the
form of N is simplified (say by containing no vertex component). For this reason I vastly prefer
using the two step approach outlined in Section IV.
References
1. Paul Avery, “Applied Fitting Theory VI: Formulas for Kinematic Fitting”, CBX 97–,
http://www.phys.ufl.edu/~avery/fitting.html.
2. Paul Avery, “Applied Fitting Theory I: General Least Squares Theory”, CBX 91–72,
http://www.phys.ufl.edu/~avery/fitting.html.
11
Download