ece3528-sup-0001-Appendices

advertisement
Supporting Information
Appendix S1........................................................................................................................ 2
The Pareto front is the locus of all points in which the gradients of the performance
functions are positive-linearly dependent ....................................................................... 2
Appendix S2...................................................................................................................... 10
The Pareto front associated with 2 tasks in a 2D-mirphospace is a hyperbola ............. 10
Appendix S3...................................................................................................................... 16
The Pareto front of 2 tasks in an n-dimensional morphospace has hyperbolic
projections ..................................................................................................................... 16
Appendix S4...................................................................................................................... 19
Calculation of the deviation of the Pareto front from a straight line for 2 tasks in a 2Dmorphospace ................................................................................................................. 19
Appendix S5...................................................................................................................... 22
Each Pareto front of 2 tasks in a 2D morphospace is generated by a 1-dimensional
family of norm-pairs ..................................................................................................... 22
Appendix S6...................................................................................................................... 38
Generally, for 3 tasks in a 2D morphospace, the norms can be uniquely determined by
the shape of the Pareto front ......................................................................................... 38
Appendix S7...................................................................................................................... 43
The boundary of the 3-tasks Pareto front is composed of the three 2-tasks Pareto fronts
....................................................................................................................................... 43
Appendix S8...................................................................................................................... 55
1
The resulting Pareto front when one of the performance function is maximized in a
region ............................................................................................................................ 55
Appendix S9...................................................................................................................... 58
Bounds on the Pareto front for general performance functions show that normally it is
located in a region close to the archetype ..................................................................... 58
Appendix S10.................................................................................................................... 63
The Pareto front of r strongly concave performance functions is a connected set of
Hausdorff dimension of at most r-1. ............................................................................. 63
References ......................................................................................................................... 67
Appendix S1
The Pareto front is the locus of all points in which the gradients of the performance
functions are positive-linearly dependent
In this appendix we will analytically calculate the Pareto front for a system that needs to
perform r tasks in an n-dimensional morphospace V. Each performance function has a
single maximum - the archetype ๐‘ฃ๐‘–∗ , and it decreases monotonically with the distance
from the archetype, where the distance is derived from a general inner-product norm.
Each performance function may depend on a different inner product norm.
⇒ The performance at task i is Pi (v) ๏€ฝ pi ( v ๏€ญ vi*
2
i
) , where v ๏€ญ vi*
2
i
๏€ฝ (v ๏€ญ v*i )T M i (v ๏€ญ v*i )
, M i is a positive definite matrix, and Pi is a monotonically decreasing function of a
single argument.
We say that v is Pareto optimal relative to P1 ,..., Pr , if for every v ' ๏‚น v there exists
j ๏ƒŽ๏ป1,..., n๏ฝ such that p j ( v ๏€ญ v*j
2
j
) ๏€พ p j ( v '๏€ญ v*j
2
2
j
) . Since P1 ,..., Pr are monotonically
decreasing with their argument, it is equivalent to say that ๐‘ฃ is Pareto optimal relative to
P1 ,..., Pr if for every v ' ๏‚น v there exists j ๏ƒŽ๏ป1,..., n๏ฝ such that ๏€ญ v '๏€ญ v*j
Denote Di (v) ๏€ฝ v ๏€ญ vi*
2
i
2
j
๏€ผ ๏€ญ v ๏€ญ v*j
2
j
.
. The Pareto front associated with P1 ,..., Pr is exactly the same as
the Pareto front associated with D1 ,..., Dr . Hence, from now on, without loss of
generality, we will assume that Pi (v) ๏€ฝ Di (v)
Here we will show that the Pareto front associated with r tasks in an n-dimensional
morphospace V is given by all points ๐‘ฃ for which:
r
(I) ๏€ค๏ก i ๏‚ณ 0 , ๏ƒฅ ๏ก i ๏€พ 0 , s.t.
i ๏€ฝ1
r
๏ƒฅ ๏ก M (v ๏€ญ v ) ๏€ฝ 0 .
i ๏€ฝ1
i
i
*
i
Note that since Pi (v) ๏€ฝ ๏€ญ(v ๏€ญ v*i )T M i (v ๏€ญ v*i ) , this is equivalent to
๏‚ฎ
r
๏ƒฅ ๏ก ๏ƒ‘ P (v ) ๏€ฝ 0 .
i ๏€ฝ1
i
i
๏‚ฎ
For the rest of this Appendix we will denote gi :๏€ฝ ๏ƒ‘ Pi (v) .
In this Appendix, we will regard V as an inner-product space, using the standard
Euclidean inner product (using the standard basis of measured traits). Do not be confused
with the inner products associated with each performance function Pi , which is only used
to define Pi by measuring distance from the respective archetype ๐‘ฃ๐‘–∗ .
To show that if v is Pareto optimal it satisfies property (I), we will rely on a theorem from
a paper by Gerstenhaber(4). Our approach is to show that if v does not satisfy property
(I), it is not Pareto optimal. Note that if v does not satisfy property (I) then ๏€ขi : gi ๏‚น 0 , as
if ๏€คi : gi ๏€ฝ 0 , then property (I) will be satisfied by choosing: ๏ก i ๏€ฝ 1 , ๏€ขj ๏‚น i : ๏ก j ๏€ฝ 0 .
Here are some relevant definitions quoted from (ref):
The halfline generated by the vector u ๏ƒŽV is the set of all points {๏ฌu | ๏ฌ ๏‚ณ 0} .
3
The convex polyhedral cone framed by u1 ,..., ur ๏ƒŽV is the convex hull of their respective
halflines.
Note that this is equivalent to the set of all non-negative linear combinations
r
{๏ƒฅ ๏ฌi ui | ๏ฌi ๏‚ณ 0} .
i ๏€ฝ1
Let A be a convex polyhedral cone. L(A) is defined to be the convex hull of all linear
subspaces contained in A.
l(A) is the dimension of L(A).
A convex polyhedral cone A is said to be pointed if ๐‘™(๐ด) = 0.
Note that A is pointed if and only if L(A) does not contain a nontrivial full line (a 1dimensional linear subspace), if and only if A does not contain a full line.
Lemma 1: A convex polyhedral cone A framed by g1 ,..., g r is pointed if and only if
g1 ,..., g r do not satisfy property (I)
Proof:
1. “Only if”: If A is not pointed, it contains a full line spanned by the vector ๐‘ฅ ≠ 0. It
means that both x and -x are in A.
r
x ๏ƒŽ A ๏ƒž x ๏€ฝ ๏ƒฅ ๏ก i gi with ๏ก i ๏‚ณ 0 .
i ๏€ฝ1
r
๏€ญ x ๏ƒŽ A ๏ƒž ๏€ญ x ๏€ฝ ๏ƒฅ ๏ข i g i with ๏ขi ๏‚ณ 0 .
i ๏€ฝ1
4
r
r
r
i ๏€ฝ1
i ๏€ฝ1
i ๏€ฝ1
0 ๏€ฝ x ๏€ญ x ๏€ฝ ๏ƒฅ ๏ก i gi ๏€ซ ๏ƒฅ ๏ขi gi ๏€ฝ ๏ƒฅ (๏ก i ๏€ซ ๏ขi ) gi
๏กi ๏‚ณ 0, ๏ขi ๏‚ณ 0 ๏ƒž ๏กi ๏€ซ ๏ขi ๏‚ณ 0
r
As x ๏‚น 0 , not all ๏กi are zero, and not all ๏ขi are zero ๏ƒž ๏ƒฅ ๏ก i ๏€พ 0 and
i ๏€ฝ1
r
r
r
i ๏€ฝ1
i ๏€ฝ1
i ๏€ฝ1
r
๏ƒฅ๏ข
i ๏€ฝ1
i
๏€พ0
๏ƒž ๏ƒฅ (๏ก i ๏€ซ ๏ขi ) ๏ƒž ๏ƒฅ ๏ก i ๏€ซ ๏ƒฅ ๏ขi ๏€พ 0 .
We showed that g1 ,..., g r satisfy property (I).
Hence, if g1 ,..., g r do not satisfy property (I), A is pointed
r
2. “If”: If g1 ,..., g r satisfy property (I), there exist ๏ก1 ,..., ๏ก r ๏‚ณ 0, ๏ƒฅ ๏ก i ๏€พ 0 such that
i ๏€ฝ1
r
๏ƒฅ๏ก g
i ๏€ฝ1
i
๏€ฝ 0 . Choose i such that ๏ก i ๏‚น 0 . Denote x ๏€ฝ ๏ก i gi . Then x ๏‚น 0 .
i
x ๏ƒŽ A by
definition.
r
๏ƒฅ๏ก
j ๏€ฝ1
j ๏‚นi
j
g j ๏€ฝ ๏€ญ๏ก i gi ๏€ฝ ๏€ญ x .
r
๏ƒฅ๏ก g
j ๏€ฝ1
j ๏‚นi
j
j
๏ƒŽ A by definition ๏ƒž ๏€ญ x ๏ƒŽ A .
x ๏‚น 0, x ๏ƒŽ A and ๏€ญ x ๏ƒŽ A ๏ƒž A contains the non-trivial full line: {๏ฌ x | ๏ฌ ๏ƒŽ } ๏ƒž A is not
pointed.
Hence, if A is pointed, g1 ,..., g r do not satisfy property (I) โˆŽ
5
According to theorem 17 in (ref), a convex polyhedral cone A is pointed if and only if
there exists a half plane H ๏ƒŒ V such that, except for the origin, A is contained in the
interior of H.
The interior of a half plane is defined as {x | h ๏ƒ— x ๏€พ 0} , where h is the unit vector
perpendicular to the hyperplane separating the space into 2 halves.
Note that for a convex polyhedral cone ๐ด framed by nonzero vectors g1 ,..., g r , and for
every vector h:
(๏€ข0 ๏‚น x ๏ƒŽ A : x ๏ƒ— h ๏€พ 0) ๏ƒ› ๏€ขi : gi ๏ƒ— h ๏€พ 0 .
“ ๏ƒž ” is trivial, as gi ๏ƒŽ A
r
r
i ๏€ฝ1
i ๏€ฝ1
“ ๏ƒœ ” stems from the fact that ๏€ข0 ๏‚น x ๏ƒŽ A, ๏€ค(๏ก i ๏‚ณ 0, ๏ƒฅ ๏ก i ๏€พ 0) such that x ๏€ฝ ๏ƒฅ ๏ก i gi
Putting together all of the above, we get:
g1 ,..., g r do not satisfy property (I) ⇔
The convex polyhedral cone A framed by g1 ,..., g r is pointed ⇔
There exists a half plane H ๏ƒŒ V such that, except for the origin, A is contained in the
interior of H ⇔
There exists a vector h such that ๏€ขi : h ๏ƒ— gi ๏€พ 0 (note that ๏€ขi : gi ๏‚น 0 )
Claim 1: If there exists a vector โ„Ž, such that ๏€ขi : h ๏ƒ— gi (v) ๏€พ 0 , then v is not Pareto optimal.
Proof: Consider a vector v ' ๏€ฝ v ๏€ซ ๏ฅ h . The performance of task i of v’ is
Pi (v ') ๏€ฝ Pi (v ๏€ซ ๏ฅ h) . Since Pi is smooth, we can approximate:
6
๏‚ฎ
Pi (v ๏€ซ ๏ฅ h) ๏€ฝ Pi (v) ๏€ซ ๏ฅ h ๏ƒ—๏ƒ‘ Pi (v) ๏€ซ O(๏ฅ 2 ) ๏€ฝ Pi (v) ๏€ซ ๏ฅ h ๏ƒ— g i ๏€ซ O (๏ฅ 2 )
By assumption, h ๏ƒ— gi ๏€พ 0 so for small enough ๏ฅ , ๏€ขi : Pi (v ') ๏€พ Pi (v) .
This implies that v is not Pareto optimal, as required.
From all of the above we can deduce that if g1 (v),..., g r (v) do not satisfy property (I), v is
not Pareto optimal.
Note that we’ve shown that a Pareto optimal point must satisfy property (I), without
using any prior assumptions on the nature of the performance functions Pi (v) , beside
differentiability.
On the other hand, if a point ๐‘ฃ satisfies property (I), it is Pareto optimal:
Consider the following function ( ๏กi are given by property (I)):
r
r
i ๏€ฝ1
i ๏€ฝ1
f (u ) ๏€ฝ ๏ƒฅ ๏ก i Pi (u ) ๏€ฝ ๏€ญ ๏ƒฅ ๏ก i (u ๏€ญ vi* )T M i (u ๏€ญ vi* )
๏‚ฎ
๏‚ฎ
r
r
v is an extreme point of f as (๏ƒ‘ f )(v) ๏€ฝ ๏ƒฅ ๏ก i ๏ƒ‘ Pi (v) ๏€ฝ๏ƒฅ ๏ก i gi (v) ๏€ฝ0 .
i ๏€ฝ1
i ๏€ฝ1
f has a single maximal point, since:
๏‚ฎ
r
๏‚ฎ
๏‚ฎ
r
r
i ๏€ฝ1
i ๏€ฝ1
๏ƒ‘ ๏ƒฅ ๏ก i ๏ƒ‘ Pi (v) ๏€ฝ ๏€ญ ๏ƒ‘ ๏ƒฅ ๏ก i (u ๏€ญ vi* )T M i (u ๏€ญ vi* ) ๏€ฝ ๏€ญ2๏ƒฅ ๏ก i M i (u ๏€ญ vi* ) ๏€ฝ 0
i ๏€ฝ1
7
implies
r
vmax ๏€ฝ (๏ƒฅ ๏ก i M i ) ๏€ญ1 (๏ก i M i vi* )
i ๏€ฝ1
(∑๐‘– ๐›ผ๐‘– ๐‘€๐‘– is positive-definite as all ๐‘€๐‘– are positive definite, all ๐›ผ๐‘– are non-negative and not
all of them are zero, which implies ∑๐‘– ๐›ผ๐‘– ๐‘€๐‘– is also invertible). And r
r
i ๏€ฝ1
i ๏€ฝ1
๏ƒ‘ 2 ๏ƒฅ ๏ก i Pi (v) ๏€ฝ ๏€ญ๏ƒฅ ๏ก i M i
which is the negative of a positive definite matrix, and hence a negative definite matrix
๏ƒž vmax is a maximum. Also notice that vmax is a global maximum since f is defined
continuously on the entire space and has a single extremum.
r
So if v satisfies property (I) it maximizes
๏ƒฅ๏ก P
i i
i ๏€ฝ1
(i.e. v ๏€ฝ vmax ). If it is not Pareto optimal,
it means that there is a point v ' ๏‚น v such that ๏€ขi : Pi (v) ๏‚ฃ Pi (v ') and since ๏€ขi : ๏ก i ๏‚ณ 0 we
get
r
r
i ๏€ฝ1
i ๏€ฝ1
๏ƒฅ๏กi Pi (v) ๏€ผ ๏ƒฅ๏กi Pi (v ') , in opposed to the maximality of v. So if v satisfies property
(I), it is Pareto optimal. โˆŽ
Conclusion 1: ๐‘ฃ is Pareto optimal ๏ƒ› v satisfies property (I) ⇔ there exists no vector h
such that ๏€ขi : h ๏ƒ— gi ๏€พ 0 .
Conclusion 2: v is Pareto optimal ๏ƒ› v satisfies property (I) ๏ƒ› v maximizes
r
๏ƒฅ ๏ก P (u)
i ๏€ฝ1
(with ๏กi given by property (I)).
8
i i
In other words, the set of Pareto optimal points equals the set of all points ๐‘ฃ for which
r
there exist ๏ก i ๏‚ณ 0, ๏ƒฅ ๏ก i ๏€พ 0 , such that
i ๏€ฝ1
๏‚ฎ
r
๏ƒฅ ๏ก ๏ƒ‘ P (v ) ๏€ฝ 0
i ๏€ฝ1
i
i
๐›ผ๐‘–
Note that as ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ≠ 0, we can define ๐›ฝ๐‘– = ∑๐‘Ÿ
๐‘–=1 ๐›ผ๐‘–
∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐›ป๐‘ƒ๐‘– (๐‘ฃ) = 0 ⇔
1
∑๐‘Ÿ ๐›ผ ๐›ป๐‘ƒ๐‘– (๐‘ฃ)
∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐‘–=1 ๐‘–
. Then ๐›ฝ๐‘– ≥ 0, ∑๐‘Ÿ๐‘–=1 ๐›ฝ๐‘– = 1, and:
= 0 ⇔ ∑๐‘Ÿ๐‘–=1 ๐›ฝ๐‘– ๐›ป๐‘ƒ๐‘– (๐‘ฃ) = 0
Hence, ∃๐›ผ๐‘– ≥ 0, ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– > 0 s.t. ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐›ป๐‘ƒ๐‘– (๐‘ฃ) = 0
⇔ ∃ ๐›ผ๐‘– ≥ 0, ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– = 1, s.t. ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐›ป๐‘ƒ๐‘– (๐‘ฃ) = 0
Thus, the set of Pareto optimal points equals the set of all points ๐‘ฃ for which there exist
๐›ผ๐‘– ≥ 0, ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– = 1, such that ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐›ป๐‘ƒ๐‘– (๐‘ฃ) = 0
We consider performance functions of the form ๐‘ƒ๐‘– (๐‘ฃ) = (๐‘ฃ − ๐‘ฃ๐‘–∗ )๐‘‡ ๐‘€๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ ) ⇒
๐›ป๐‘ƒ๐‘– (๐‘ฃ) = ๐‘€๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ ). Hence,
๐‘Ÿ
๐‘Ÿ
∑ ๐›ผ๐‘– ๐›ป๐‘ƒ๐‘– (๐‘ฃ) = 0 ⇒ ∑ ๐›ผ๐‘– ๐‘€๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ ) = 0 ⇒ ๐‘ฃ = (∑ ๐›ผ๐‘– ๐‘€๐‘– )
๐‘–=1
๐‘–
๐‘–=1
−1
(๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗ )
Conclusion 3: The set of Pareto optimal points, associated with ๐‘ƒ1 , … , ๐‘ƒ๐‘Ÿ is given by:
{(∑๐‘– ๐›ผ๐‘– ๐‘€๐‘– )−1 ∑๐‘–(๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗ ) |๐›ผ๐‘– ≥ 0, ∑3๐‘–=1 ๐›ผ๐‘– = 1}
9
Appendix S2
The Pareto front associated with 2 tasks in a 2D-mirphospace is a hyperbola
We would like to prove that the Pareto front associated with 2 tasks in a 2-dimensional
morphospace is a section of a hyperbola or a line. As explained in Appendix S1, the
performance of task ๐‘– is taken to be:
๐‘ƒ๐‘– = −(๐‘ฃ − ๐‘ฃ๐‘–∗ )๐‘‡ ๐‘€๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ )
Where ๐‘€๐‘– is a positive-definite 2× 2 matrix, and ๐‘ฃ๐‘–∗ = (๐‘ฅ๐‘–∗ , ๐‘ฆ๐‘–∗ ) is the archetype for task ๐‘–.
Positive definite matrices have positive eigenvalues and are Hermitian, and thus ๐‘€๐‘– can
be diagonalized by a rotation matrix. Thus
๐œ†๐‘–,1 ²
๐‘€๐‘– = ๐‘…(๐œƒ๐‘– ) (
0
0
๐œ†๐‘–,2 ²
) ๐‘…(−๐œƒ๐‘– ) ,
where ๐‘…(๐œƒ) is an orthogonal matrix (rotation matrix by angle ๐œƒ) and ๐œ†๐‘–,1 > 0, ๐œ†๐‘–,2 > 0
(i.e. real and non-zero).
Note: The contours of such performance functions ๐‘ƒ๐‘– are concentric ellipses with
๐œ†
eccentricity ๐œ† = ๐œ†๐‘–,2 which are rotated by an angle of ๐œƒ relative to the y axis. These
๐‘–,2
contours and their parameters are widely used in the main text.
As will be explained immediately, we can assume without loss of generality that:
1. ๐‘€1 = I, implying - (๐‘ฃ − ๐‘ฃ1∗ )๐‘‡ ๐‘€1 (๐‘ฃ − ๐‘ฃ1∗ ) = (๐‘ฃ − ๐‘ฃ1∗ )๐‘‡ (๐‘ฃ − ๐‘ฃ1∗ ) .
2. ๐‘ฃ1∗ = (0,0)
3. ๐‘ฃ2∗ = (1,0)
4. One of ๐‘€2 ′๐‘  eigenvalues is 1.
Those assumptions will be true in a rotated, translated and rescaled coordinate system.
We will show that the Pareto front is a hyperbola. Since a hyperbola remains a hyperbola
under such transformations, and all such transformations are invertible, it is enough to
work in the coordinate system where the above assumptions hold.
10
The first and second assumptions are satisfied by the transformation ๐‘ฃ → ๐‘‡๐‘Ÿ(๐‘ฃ1∗ )๐‘‡1 ๐‘ฃ
where:
๐‘‡1 = ๐‘…(๐œƒ1 )
(
1
๐œ†๐‘–,1
0
0
1
๐œ†๐‘–,2
)
(then translate such that ๐‘ฃ1∗ is at (0,0), rotate such that ๐‘€1 is diagonal, and then scale such
that ๐‘€1 = ๐ผ).
Under the above transformation, the second archetype moves to:
ฬƒ
{Δ๐‘ฅ
ฬƒ } = ๐‘‡1−1 (๐‘ฃ2∗ − ๐‘ฃ1∗ )
Δ๐‘ฆ
The third assumption is satisfied, while keeping assumptions 2, and keeping ๐‘€1 scalar by
further applying:
๐‘‡2 = (
ฬƒ
Δ๐‘ฅ
ฬƒ
Δ๐‘ฆ
ฬƒ
−Δ๐‘ฆ
)
ฬƒ
Δ๐‘ฅ
(Move the second archetype to the ๐‘ฅ axis and then scale it to be at (1,0)).
ฬƒ ≠ 0 or Δ๐‘ฆ
ฬƒ ≠0
This transformation is invertible since Δ๐‘ฅ
We apply the transformation ๐‘ฃ → ๐‘‡2 ๐‘‡1 ๐‘ฃ.
In this coordinate system:
2
ฬƒ 2 + Δ๐‘ฆ
ฬƒ 2 )๐‘ฃ ๐‘‡ ๐‘ฃ
The functional โ€–๐‘ฃ − ๐‘ฃ1∗ โ€–1 becomes โ€–๐‘‡2 ๐‘‡1 ๐‘ฃ − ๐‘ฃ1∗ โ€–12 = (Δ๐‘ฅ
๐‘‡
2
The functional โ€–๐‘ฃ − ๐‘ฃ2∗ โ€–2 becomes โ€–๐‘‡2 ๐‘‡1 ๐‘ฃ − ๐‘ฃ2∗ โ€–22 = (๐‘ฃ − (1,0)) ๐‘€(๐‘ฃ − (1,0))
Where ๐‘€ = (๐‘‡2 ๐‘‡1 )๐‘‡ ๐‘€2 ๐‘‡2 ๐‘‡1
Since ๐‘€2 is positive definite and ๐‘‡2 ๐‘‡1 is invertible, ๐‘€ is positive definite.
๐œ†12
→ ๐‘€ = ๐‘…(๐œƒ) (
0
0
) ๐‘…(−๐œƒ), with ๐œ†1 > 0, ๐œ†2 > 0
๐œ†22
11
Finally, assumption 1 and 4 are reached by using the following lemma:
Lemma 1: The Pareto front is invariant to scaling of any of the norms
Proof: ๐‘ฃ is Pareto optimal with respect to ๐‘ƒ1 (๐‘ฃ), ๐‘ƒ2 (๐‘ฃ) if for every ๐‘ฃ ′ ≠ ๐‘ฃ there exists ๐‘— ∈
2
2
๐‘—
๐‘—
{1,2} such that โ€–๐‘ฃ − ๐‘ฃ๐‘—∗ โ€– < โ€–๐‘ฃ ′ − ๐‘ฃ๐‘—∗ โ€– . Let ๐›ฝ๐‘—2 > 0 be a constant for each ๐‘— ∈ {1,2}.
2
2
2
2
๐‘—
๐‘—
โ€–๐‘ฃ − ๐‘ฃ๐‘—∗ โ€– < โ€–๐‘ฃ ′ − ๐‘ฃ๐‘—∗ โ€– ⇔ ๐›ฝ๐‘—2 โ€–๐‘ฃ − ๐‘ฃ๐‘—∗ โ€– < ๐›ฝ๐‘—2 โ€–๐‘ฃ ′ − ๐‘ฃ๐‘—∗ โ€– , which means that the
๐‘—
๐‘—
norm-pairs {๐›ฝ๐‘—2 โ€–๐‘ฃ −
2
๐‘ฃ๐‘—∗ โ€– }
๐‘—
and {โ€–๐‘ฃ −
1
2
๐‘ฃ๐‘—∗ โ€– }
๐‘—
result in the same Pareto front. โˆŽ
1
We choose ๐›ฝ12 = (Δ๐‘ฅ
and ๐›ฝ22 = ๐œ†2 and we get that
ฬƒ 2 +Δ๐‘ฆ
ฬƒ 2)
1
2
ฬƒ 2 + Δ๐‘ฆ
ฬƒ 2 )๐‘ฃ ๐‘‡ ๐‘ฃ , (๐‘ฃ − (1,0))๐‘‡ ๐‘…(๐œƒ) (๐œ†1
(Δ๐‘ฅ
0
0
) ๐‘…(−๐œƒ)(๐‘ฃ − (1,0)) results in the same
๐œ†22
Pareto front as
1
๐‘‡
๐‘ฃ ๐‘‡ ๐‘ฃ , (๐‘ฃ − (1,0)) ๐‘…(๐œƒ) (0
0
๐œ†22
) ๐‘…(−๐œƒ)(๐‘ฃ − (1,0)). Denote ๐œ†2 =
๐œ†21
๐œ†22
๐œ†21
.
To conclude, we can assume without loss of generality that:
2
||๐‘ฃ − ๐‘ฃ๐‘–∗ ||๐‘– = (๐‘ฃ − ๐‘ฃ๐‘–∗ )๐‘‡ ๐‘€๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ ) = (๐‘ฃ − ๐‘ฃ๐‘–∗ )๐‘‡ ๐‘…(๐œƒ) (
1 0
) ๐‘…(−๐œƒ)(๐‘ฃ − ๐‘ฃ๐‘–∗ )
0 ๐œ†2๐‘–
With ๐‘ฃ1∗ = (0,0), ๐‘ฃ2∗ = (1,0), ๐œ†12 = 1, ๐œ†22 = ๐œ†2
As seen in Appendix S1 (conclusion 1), the Pareto front associated with 2 tasks is given
by:
{๐‘ฃ | ∃ 0 ≤ ๐›ผ ≤ 1 ๐‘ . ๐‘ก. ๐›ผ๐‘”1 (๐‘ฃ) + (1 − ๐›ผ)๐‘”2 (๐‘ฃ) = 0}
Where ๐‘”๐‘– = ๐›ป๐‘ƒ๐‘– (๐‘ฃ)
Thus, the gradients of the 2 performance functions at a point that is Pareto optimal
relative to 2 tasks point in opposite directions.
12
Let’s try to give a geometrical intuition to the above statement. The gradients of the
performance functions at point ๐‘ฃ point in opposite directions if and only if ๐‘ฃ is a
tangency point between 2 contours of the performance function.
2
Each point ๐‘ฃ is on some contour of ๐‘ƒ1 : ๐ถ1 = {๐‘ฃ ′ | ||๐‘ฃ ′ − ๐‘ฃ1∗ ||1 = ||๐‘ฃ − ๐‘ฃ1∗ ||1 2 }, and on a
2
contour of ๐‘ƒ2 : ๐ถ2 = {๐‘ฃ ′ | ||๐‘ฃ ′ − ๐‘ฃ2∗ ||2 = ||๐‘ฃ − ๐‘ฃ2∗ ||2 2 }. As mentioned in the main text, ๐ถ1
and ๐ถ2 are ellipses. Point ๐‘ฃ is a common point to ๐ถ1 and ๐ถ2 . It can be an intersection
point, an internal tangency point (when the intersection of the interiors of both ellipses is
non-empty), or an external tangency point.
The gradients of the performance functions at point ๐‘ฃ point in opposite directions ⇔ ๐‘ฃ is
an external tangency point between the 2 contours.
If the gradients of the performance function at a point ๐‘ฃ do not point in opposite
directions, it is either an internal tangency point or an intersection point between ๐ถ1 and
2
2
๐ถ2 . In both cases ๐ถ1 โ—‹ ∩ ๐ถ2 โ—‹ ≠ ๐œ™, where ๐ถ๐‘– โ—‹ = {๐‘ฃ ′ | ||๐‘ฃ ′ − ๐‘ฃ๐‘–∗ ||๐‘– < ||๐‘ฃ − ๐‘ฃ๐‘–∗ ||๐‘– } - all
points that outperform ๐‘ฃ in the ๐‘–-th task (see figure S1a). It means that there exists ๐‘ฃ ′ โˆŠ
๐ถ1 โ—‹ ∩ ๐ถ2 โ—‹ ⇒ ๐‘ƒ1 (๐‘ฃ ′ ) > ๐‘ƒ1 (๐‘ฃ) and ๐‘ƒ2 (๐‘ฃ ′ ) > ๐‘ƒ2 (๐‘ฃ), i.e., if the gradients at point ๐‘ฃ don’t
point at opposite directions, we can find a point ๐‘ฃ ′ at the neighborhood of ๐‘ฃ that performs
both tasks better than it. That is why a point for which the gradients don’t point in
opposite directions is not Pareto optimal. In case of an external tangency point ๐‘ฃ, ๐ถ1 โ—‹ ∩
๐ถ2 โ—‹ = {๐‘ฃ}, in which case there are no points that outperform ๐‘ฃ in both tasks (figure S1b).
13
Figure S1: The Pareto front is composed of tangency points between performance functions’
contours. Archetypes are marked as red dots, Pareto front in blue. (a) A point whose
contours intersect is not Pareto optimal; points in the intersection area outperform it in both
tasks. (b) A point whose contours are tangent is optimal – there is no intersection area with
other outperforming points.
Denote the Pareto front by P.F., as seen in Appendix S1 (conclusion 3):
๐‘ƒ. ๐น = {(๐›ผ๐‘€1 + (1 − ๐›ผ)๐‘€2 )−1 (๐›ผ๐‘€1 ๐‘ฃ1∗ + (1 − ๐›ผ)๐‘€2 ๐‘ฃ2∗ ) | 0 ≤ ๐›ผ ≤ 1}
⇒ ∀0 ≤ ๐›ผ ≤ 1 ∃๐‘ฃ ∈ ๐‘ƒ. ๐น โˆถ
๐‘ฃ = (๐›ผ ๐‘€1 + (1 − ๐›ผ)๐‘€2 )−1 (๐›ผ ๐‘€1 ๐‘ฃ1∗ + (1 − ๐›ผ)๐‘€2 ๐‘ฃ2∗ ) =
−1
1 0
(๐‘…(๐œƒ) (๐›ผ ๐ผ + (1 − ๐›ผ) (
)) ๐‘…(−๐œƒ) )
0 ๐œ†2
1
0
((1 − ๐›ผ) ๐‘…(๐œƒ) (
0
) ๐‘…(−๐œƒ ′ )(1,0)) =
๐œ†2
(−1 + ๐›ผ)(−๐›ผ + (−2 + ๐›ผ)๐œ†2 + ๐›ผ(−1 + ๐œ†2 )Cos[2๐œƒ]) (−1 + ๐›ผ)๐›ผ(−1 + ๐œ†2 )Cos[๐œƒ]Sin[๐œƒ]
(−
,
)
−2๐›ผ + 2(−1 + ๐›ผ)๐œ†2
๐›ผ + ๐œ†2 − ๐›ผ๐œ†2
When ๐›ผ → 0, ๐‘ฃ → (1, 0). When ๐›ผ → 1, ๐‘ฃ → (0,0). Thus, the Pareto front lies on a curve
between the archetypes ๐‘ฃ1∗ ′ and ๐‘ฃ2∗ ′ .
We next characterize this curve. Denote ๐‘ฃ = (๐‘ฅ, ๐‘ฆ). By eliminating ๐›ผ, we get that our
curve is a quadratic curve that satisfies:
14
๐‘ฅ 2 (1 − ๐œ†2 )๐‘†๐‘–๐‘›2 (2 ๐œƒ) − 2๐‘ฅ๐‘ฆ(1 − ๐œ†2 )๐‘†๐‘–๐‘›(2 ๐œƒ)๐ถ๐‘œ๐‘ (2 ๐œƒ) − ๐‘ฆ 2 (1 − ๐œ†2 )๐‘†๐‘–๐‘›2 (2 ๐œƒ)
− ๐‘ฅ (1 − ๐œ†2 )๐‘†๐‘–๐‘›2 (2 ๐œƒ) + ๐‘ฆ ((1 + ๐œ†2 − (1 − ๐œ†2 )๐ถ๐‘œ๐‘ (2 ๐œƒ)) ๐‘†๐‘–๐‘›(2 ๐œƒ) = 0
A general quadratic curve is of the form ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 + 2๐ด๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ +
๐ด=0
If J = ๐ด๐‘ฅ๐‘ฅ ๐ด๐‘ฆ๐‘ฆ -
๐ด2๐‘ฅ๐‘ฆ
๐ด๐‘ฅ๐‘ฅ
< 0 and Δ = ๐ท๐‘’๐‘ก ( ๐ด๐‘ฅ๐‘ฆ
๐ด๐‘ฅ
๐ด๐‘ฅ๐‘ฆ
๐ด๐‘ฆ๐‘ฆ
๐ด๐‘ฆ
๐ด๐‘ฅ
๐ด๐‘ฆ ) ≠ 0 then the equation
๐ด
represents a hyperbola.
For our curve:
J = ๐ด๐‘ฅ๐‘ฅ ๐ด๐‘ฆ๐‘ฆ , ๐ด2๐‘ฅ๐‘ฆ = −(−1 + λ2 )2 sin2 (2๐œƒ) , Δ = 4๐œ†2 (−1 + ๐œ†2 ) sin([2๐œƒ]4 )
J < 0 and Δ ≠ 0 unless ๐œ† = 1 ๐‘œ๐‘Ÿ ๐œƒ = 0 ๐‘œ๐‘Ÿ
๐œ‹
2
and then J = 0 and Δ = 0. This means that
the Pareto optimal points lie on a section of a hyperbola between the 2 archetypes, unless
๐œ‹
๐œ† = 1 ๐‘œ๐‘Ÿ ๐œƒ ∈ {0, 2 } and then they lie on the line between the archetypes. This matches
previous results [1] since if ๐œ† = 1 both norms are equal. This also shows that if both
๐œ‹
norms’ contours are perpendicular to the line between them (๐œƒ๐‘– ∈ {0, 2 }), the Pareto front
is the line between the archetypes.
To conclude, we showed that the Pareto front associated with 2 archetypes in a 2-D trait
space with performance functions that decrease monotonically with a general innerproduct norm distance from the archetype is a section of a hyperbola (or a line) between
the archetypes.
15
Appendix S3
The Pareto front of 2 tasks in an n-dimensional morphospace has hyperbolic
projections
In this Appendix we will characterize the Pareto front for 2 tasks in an n-dimensional
morphospace.
We would like to prove that the Pareto front in an n-dimensional trait space for a system
that needs to perform 2 tasks, when each performance function depends on a different
inner-product norm, is a 1-dimensional curve connecting the archetypes whose
projections on the principle planes of a certain coordinate system are hyperbolae (or
lines).
As shown in Appendix S1, the performance functions ๐‘ƒ๐‘– can be written as
๐‘ƒ๐‘– (๐‘ฃ) = −(๐‘ฃ − ๐‘ฃ๐‘–∗ )๐‘‡ ๐‘€๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ )
with positive-definite ๐‘€๐‘– that can be decomposed as
๐œ†2๐‘–,1
๐‘€๐‘– = ๐‘…๐‘– ( 0
0
0
0
… 0 ) ๐‘…๐‘–๐‘‡
0 ๐œ†2๐‘–,๐‘
where ๐‘…๐‘– is an orthogonal matrix. The Pareto front is the locus of all points satisfying
๐›ผ1 ๐‘€1 (๐‘ฃ − ๐‘ฃ1∗ ) + ๐›ผ2 ๐‘€2 (๐‘ฃ − ๐‘ฃ2∗ ) = 0, ๐›ผ1 + ๐›ผ2 = 1
Moreover, there exists a basis in which ๐‘€1 = ๐ผ, simply redefine:
๐œ†๐‘–,1
๐‘ฃ โˆถ= ( 0
0
0
0
… 0 ) ๐‘…๐‘›๐‘‡ ๐‘ฃ
0 ๐œ†๐‘–,๐‘›
In which ๐‘€2 (describing ๐‘ƒ2 ) is still positive-definite (as it does not depend on the basis).
Denote ๐ด๐‘– = ๐›ผ๐‘– ๐‘€๐‘– , ๐ด = ๐ด1 + ๐ด2 , ๐ต๐‘– = ๐ด−1 ๐ด๐‘–
16
๐ต1 + ๐ต2 = ๐ด−1 (๐ด1 + ๐ด2 ) = ๐ด−1 ๐ด = ๐ผ
๐‘ฃ = ๐ด−1 (๐ด1 ๐‘ฃ1∗ + ๐ด2 v2∗ ) = ๐ต1 ๐‘ฃ1∗ + ๐ต2 ๐‘ฃ2∗
The eigenvalues of ๐‘€๐‘– are positive. The eigenvalues of ๐ด๐‘– are non-negative as ๐›ผ๐‘– ≥ 0. As
๐›ผ๐‘– are real, ๐ด๐‘– are symmetric. ๐ด1 has a single eigenvalue - ๐›ผ1 , since ๐‘€1 is assumed to be
the identity matrix. The eigenvalues of ๐ด2 are ๐›ผ2 ๐œ†๐‘—2 . As ๐ด1 is scalar, the eigenvectors
of ๐ด = ๐ด1 + ๐ด2 are the same as those of ๐ด2 , with eigenvalues ๐›ผ1 + ๐›ผ2 ๐œ†๐‘—2 . These
eigenvalues are positive since at least one ๐›ผ๐‘– is positive, therefore A is invertible. The
eigenvalues of ๐ด−1 are ๐›ผ
1
1
+ ๐›ผ2 ๐œ†2๐‘—
. ๐ด is symmetric as a sum of 2 symmetric matrices.
As ๐ด1 is scalar, it commutes with ๐ด2 . [๐ด1 , ๐ด2 ] = ๐ด1 ๐ด2 − ๐ด2 ๐ด1 = 0 ⇒ [๐ด, ๐ด1 ] = 0 and
[๐ด, ๐ด2 ] = 0 ⇒ [๐ด−1 , ๐ด1 ] = 0
๐ด๐‘– ๐ด๐‘‡
−1
๐‘‡
[๐ด−1 , ๐ด2 ] = 0 ⇒ ๐ต๐‘–๐‘‡ = (๐ด−1 ๐ด๐‘– )๐‘‡ = ๐ด๐‘‡๐‘– ๐ด−1 =
and
= ๐ด๐‘– ๐ด−1 = ๐ต๐‘– .
Also, ๐ด−1 has the same eigenvectors as ๐ด, ๐ด1 and ๐ด2 so the eigenvalues of ๐ต1 are
๐›ผ1
๐›ผ1 +๐›ผ2 ๐œ†2๐‘—
=
๐›ผ
๐›ผ+(1−๐›ผ) ๐œ†2๐‘—
and of ๐ต2 are
1−๐›ผ
๐›ผ
1−๐›ผ+ 2
๐œ†
๐‘—
๐ต1 and ๐ต2 are mutually diagonalizable: Let D be such that ๐ท๐ต1 ๐ท−1 = Λ1 . Now ๐ต2 = ๐ผ −
๐ต1 ⇒ ๐ท๐ต2 ๐ท−1 = ๐ท(๐ผ − ๐ต1 )๐ท−1 = ๐ผ − Λ1
๐‘ฃ = ๐ท−1 Λ1 ๐ท ๐‘ฃ1∗ + ๐ท−1 (๐ผ − Λ1 )๐ท ๐‘ฃ2∗
= ๐‘ฃ2∗ + ๐ท−1 Λ1 ๐ท (๐‘ฃ1∗ − ๐‘ฃ2∗ ) → ๐ท(๐‘ฃ − ๐‘ฃ2∗ ) =
Λ1 ๐ท(๐‘ฃ1∗ − ๐‘ฃ2∗ )
Consider the rotated coordinate system, ๐‘ฃ โ‰” ๐ท−1 ๐‘ฃ, and rename ๐‘ฃ2∗ โˆถ= ๐ท−1 ๐‘ฃ2∗ . In this
๐›ผ
system ๐‘ฃ −
๐‘ฃ2∗
= Λ1 (๐‘ฃ −
๐‘ฃ1∗ )
๐›ผ+(1−๐›ผ) ๐œ†21
=(
โ‹ฎ
0
…
0
โ‹ฑ
…
โ‹ฎ
๐›ผ
) (๐‘ฃ − ๐‘ฃ1∗ ).
๐›ผ+(1−๐›ผ) ๐œ†2๐‘›
∗
∗
∗
∗
Denote ๐‘ฃ = (๐‘ฃ1 , … , ๐‘ฃ๐‘› ), ๐‘ฃ1∗ = (๐‘ฃ1,1
, … , ๐‘ฃ1,๐‘›
), ๐‘ฃ2∗ = (๐‘ฃ2,1
, … , ๐‘ฃ2,๐‘›
)
For every 2 coordinates ๐‘—, ๐‘˜ we get that:
17
๐‘ฃ๐‘˜ ๐‘ฃ๐‘— (๐œ†๐‘— − ๐œ†๐‘˜ ) + ๐‘ฃ๐‘˜ (๐‘ฃ1,๐‘— ๐œ†๐‘˜ − ๐‘ฃ2,๐‘— ๐œ†๐‘— ) + ๐‘ฃ๐‘— (๐‘ฃ2,๐‘˜ ๐œ†๐‘˜ − ๐‘ฃ1,๐‘˜ ๐œ†๐‘— ) + ๐‘ฃ1,๐‘˜ ๐‘ฃ2,๐‘— ๐œ†๐‘— − ๐‘ฃ1,๐‘— ๐‘ฃ2,๐‘˜ ๐œ†๐‘˜
=0
Thus, the projection of the Pareto front on any main plane is a quadratic curve, with
parameters:
2
J = ๐ด๐‘ฅ๐‘ฅ ๐ด๐‘ฆ๐‘ฆ − ๐ด2๐‘ฅ๐‘ฆ = −(๐œ†๐‘— − ๐œ†๐‘˜ ) , Δ = −(๐œ†๐‘— − ๐œ†๐‘˜ )
2
For components for which ๐œ†๐‘— ≠ ๐œ†๐‘˜ , ๐ฝ < 0 and Δ ≠ 0, so the projection on their plane is a
hyperbola. For components for which ๐œ†๐‘— = ๐œ†๐‘˜ , ๐ฝ = 0 and Δ = 0, so the projection on
their plane is a line.
To conclude, we get that the Pareto front associated with 2 archetypes in an ndimensional trait space is a 1-dimensional curve between the 2 archetypes. There exists a
coordinate system such that the projection of this curve on each principal plane is a
section of a hyperbola or a line between the projections of the archetypes.
18
Appendix S4
Calculation of the deviation of the Pareto front from a straight line for 2 tasks in a
2D-morphospace
In this Appendix, we calculate the maximal deviation of the Pareto front associated with
2 tasks in a 2D morphospace from the line between the archetypes (which is the Pareto
front in case the norms are equal).
The deviation of the front from the line between the archetypes is defined as the maximal
height โ„Ž of a point on the front with respect to the line, divided by the Euclidean distance
between the archetypes, D.
1
1
We can assume without loss of generality that the archetypes are at (− 2 , 0) and (2 , 0),
since this assumption can be satisfied by a combination of translations/rotations and
isometric scaling, all of which preserve distance ratios. Notice that in this case ๐ท = 1 so
โ„Ž
the ratio ๐ท is simply โ„Ž.
Norm ๐‘– depends on the parameters ๐œ†๐‘– , ๐œƒ๐‘– , i.e - ๐‘€๐‘– = R(θi ) (
1 0
) ๐‘…(−θi ). During the
0 ๐œ†2๐‘–
๐œ‹
solution we assume that 0 ≤ ๐œƒ๐‘– < 2 . This is possible since the Pareto front is symmetric
๐œ‹ 1
under the transformation: ๐œƒ๐‘– , ๐œ†๐‘– → ๐œƒ๐‘– + 2 , ๐œ† (see Appendix S5)
๐‘–
Appendix S1 (Conclusion 3) gives a parametric representation of the front, {๐‘ฅ(๐‘ข), ๐‘ฆ(๐‘ข)},
with a parameter 0 ≤ ๐‘ข ≤ 1. As the line between the archetypes lies on the ๐‘ฅ axis, the
maximal deviation is given by max |๐‘ฆ(๐‘ข) |. max |๐‘ฆ(๐‘ข) | = |๐‘ฆ(๐‘ข๐‘š๐‘Ž๐‘ฅ )|, for ๐‘ข๐‘š๐‘Ž๐‘ฅ with
๐œ•๐‘ข ๐‘ฆ|๐‘ข๐‘š๐‘Ž๐‘ฅ = 0 (The maximum is not obtained at the edges, since ๐‘ฆ(0) = ๐‘ฆ(1) = 0).
So, from Appendix S1:
๐‘ฆ(๐‘ข) = −
1
(−1 + ๐‘ข)๐‘ข (−(−1 + ๐œ†21 )Sin[2θ1 ](Cos[θ2 ]2 + ๐œ†22 Sin[θ2 ]2 ) − (−1 + ๐œ†22 )(−1 − ๐œ†21 + (−1 + ๐œ†21 )Cos[2θ1 ])Sin[2θ2 ])
2
2๐œ†22 + ๐‘ข(1 + ๐œ†21 + (−3 + ๐œ†21 )๐œ†22 + ๐‘ข(−1 + ๐œ†21 + ๐œ†22 − ๐œ†21 ๐œ†22 )) + (−1 + ๐‘ข)๐‘ข(−1 + ๐œ†21 )(−1 + ๐œ†22 )Cos[2(θ1 − θ2 )]
Straightforward calculation shows that ๐œ•๐‘ข ๐‘ฆ = 0 for ๐‘ข๐‘š๐‘Ž๐‘ฅ =
1
λ
1+ 1
λ2
19
Substituting this into the expression for ๐‘ฆ, we get that the maximal deviation for given
parameters ๐œ†1 , ๐œƒ1 and ๐œ†2 , ๐œƒ2 is:
โ„Ž(๐œ†1 , ๐œƒ1 ๐œ†2 , ๐œƒ2 ) = |−
(−1+๐œ†21 )(1+๐œ†22 )Sin[2θ1 ]−(−1+๐œ†22 )((−1+๐œ†21 )Sin[2(θ1 −θ2 )]+(1+๐œ†21 )Sin[2θ2 ])
2(1+๐œ†21 +4λ1 λ2 +๐œ†22 +๐œ†21 ๐œ†22 −(−1+๐œ†21 )(−1+๐œ†22 )Cos[2(θ1 −θ2 )])
|
It is bounded by:
|−
≤
=
(−1 + ๐œ†21 )(1 + ๐œ†22 )Sin[2θ1 ] − (−1 + ๐œ†22 )((−1 + ๐œ†21 )Sin[2(θ1 − θ2 )] + (1 + ๐œ†21 )Sin[2θ2 ])
|
2(1 + ๐œ†21 + 4λ1 λ2 + ๐œ†22 + ๐œ†21 ๐œ†22 − (−1 + ๐œ†21 )(−1 + ๐œ†22 )Cos[2(θ1 − θ2 )])
|(−1 + ๐œ†21 )(1 + ๐œ†22 )Sin[2θ1 ]| + |(−1 + ๐œ†22 )(−1 + ๐œ†21 )Sin[2(θ1 − θ2 )]| + |(1 + ๐œ†21 )(−1 + ๐œ†22 )Sin[2θ2 ]|
2||1 + ๐œ†21 + 4λ1 λ2 + ๐œ†22 + ๐œ†21 ๐œ†22 | − |(−1 + ๐œ†21 )(−1 + ๐œ†22 )Cos[2(θ1 − θ2 )]||
|(−1 + ๐œ†21 )(1 + ๐œ†22 )Sin[2θ1 ]| + |(−1 + ๐œ†22 )(−1 + ๐œ†21 )Sin[2(θ1 − θ2 )]| + |(1 + ๐œ†21 )(−1 + ๐œ†22 )Sin[2θ2 ]|
2(1 + ๐œ†21 + 4λ1 λ2 + ๐œ†22 + ๐œ†21 ๐œ†22 − (1 − ๐œ†21 )(1 − ๐œ†22 )|Cos[2(θ1 − θ2 )]|)
≤
(1 − ๐œ†21 )(1 + ๐œ†22 ) + (1 − ๐œ†22 )(1 − ๐œ†21 ) + (1 + ๐œ†21 )(1 − ๐œ†22 )
2(1 + ๐œ†21 + 4λ1 λ2 + ๐œ†22 + ๐œ†21 ๐œ†22 − (1 − ๐œ†21 )(1 − ๐œ†22 ))
= −
−3 + ๐œ†21 + ๐œ†22 + ๐œ†21 ๐œ†22
4(λ1 + λ2 )2
So, the maximal deviation from the line between the archetypes, for any given ๐œƒ1 , ๐œƒ2 is
bounded by
3−๐œ†21 −๐œ†22 −๐œ†21 ๐œ†22
.
4(λ1 +λ2 )2
Let’s focus on the special case when one of the performance functions depends on
Euclidean norm, i.e. – λ1 = 1.
In this case the upper bound becomes โ„Ž ≤
1−๐œ†2
โ„Ž(1, ๐œƒ1 , ๐œ†2 , ๐œƒ2 ) =
When setting ๐œƒ2 =
๐œ‹
4
, and this is a tight bound:
2(1+๐œ†2 )
(1 − ๐œ†2 )๐‘†๐‘–๐‘›[2๐œƒ2 ]
2(1 + ๐œ†2 )
this bound is obtained (as expected, ๐œƒ1 is irrelevant when ๐œ†1 = 1).
Hence, when task 1 depends on Euclidean norm, the maximal deviation of the front per
given ๐œ†2 is obtained for ๐œƒ2 =
๐œ‹
4
and is given by:
โ„Ž(1, ๐œƒ1 , ๐œ†2 , ๐œƒ2 ) ≤
20
(1 − ๐œ†2 )
2(1 + ๐œ†2 )
In this case, the deviation of the Pareto front from the line between the archetypes,
maximized on all ๐œ†2 ′๐‘ , and on all ๐œƒ2 ′ ๐‘  is half the distance between the archetypes. The
deviation approaches this value as ๐œ†2 → 0 (the contours of the second norm becomes
more and more eccentric).
21
Appendix S5
Each Pareto front of 2 tasks in a 2D morphospace is generated by a 1-dimensional
family of norm-pairs
Here, we would like to deal with the following question: Let PF be a Pareto front
associated with 2 tasks in a 2D-morphospace, where each performance decays from its
archetype with a different inner-product norm. In Appendix S2 we showed that under the
above assumptions, the Pareto front is a segment of a hyperbola (or a line) that connects
the archetypes ๐‘ฃ1∗ and ๐‘ฃ2∗ . In that case, can we deduce the norms that the performance
functions decay with from the exact shape of the Pareto front?
When approaching the question presented above, we assume that the position of the
archetypes is known. This is a reasonable assumption – for a given hyperbola/line-shaped
data set, the 2 edge points of the front are assumed to be the archetypes.
We will show that not all hyperbola-shaped datasets can be explained by the model with
its current assumptions – there are hyperbolae that are not generated by any pair of
norms. However, if for a hyperbola-shaped dataset there exists a pair of norms that
generates it, then there exists a one-dimensional family of norm-pairs that generate it.
This is expected. A hyperbola is a quadratic curve defined by an equation of the form:
๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 + 2๐ด๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ + ๐ด = 0. Hence, it is defined by 5 free
parameters (we can normalize the equation by one of the coefficients). We assume that
the position of the archetypes is known. A hyperbola-shaped front must pass through the
2 archetypes, leaving it with 3 free parameters (each point yields a single equation the
hyperbola’s coefficients must satisfy, reducing the number of free parameters by one).
The norms that the performance functions depend on are represented by the matrices
๐‘€1 , ๐‘€2 , where ๐‘€๐‘– = ๐‘…(๐œƒ๐‘– ) (
1 0
) ๐‘…(−๐œƒ๐‘– ) (Appendix S2). Hence, besides the position
0 ๐œ†2๐‘–
of the archetype ๐‘ฃ๐‘–∗ , each norm can be describes by 2 parameters - the ratio between the
eigenvalues of the norm’s matrix, ๐œ†2๐‘– , and the angle of the rotation matrix that
22
1
diagonalizes it, ๐œƒ๐‘– (๐‘ƒ๐‘– = −(๐‘ฃ − ๐‘ฃ๐‘–∗ )๐‘‡ ๐‘…(๐œƒ๐‘– ) (
0
0
) ๐‘…(−๐œƒ๐‘– )(๐‘ฃ − ๐‘ฃ๐‘–∗ )). It means that if the
๐œ†2๐‘–
location of the archetypes is known, the 2 performance functions together have 4 free
parameters. Thus, the problem of deducing the norms from the shape of the Pareto front
is expectedly degenerate, since we try to determine 4 free parameters (the norms of the
performance functions) using only 3 observed parameters (the hyperbola). Also, we
expect the family of norm-pairs that generate each hyperbola to depend only on 1
parameter – i.e. - to be 1-dimensional.
Consider a hyperbola/line-shaped dataset that is generated by a pair of norms ๐‘€1 and ๐‘€2 ,
with archetypes ๐‘ฃ1∗ , ๐‘ฃ2∗ . We can transform to a coordinate system where ๐‘ฃ1∗ = (0,0) and
๐‘ฃ2∗ = (1,0), ๐‘€1 = ๐ผ and ๐‘€2 is a positive definite matrix, using a transformation under
which a hyperbola/line remains a hyperbola/line. Such transformation was shown to exist
in Appendix S2. There is a 1-to-1 correspondence between norm-pairs that generate the
transformed front in the transformed coordinate system and the norm-pairs that generate
the front in the original coordinate system. Hence, if we show that there is a 1dimensional family of norm pairs that generate the front in the transformed system, the
conclusion will also hold in the original system.
Assume that in the transformed coordinate system, the quadratic curve that the Pareto
front lies on is represented by:
๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 + 2๐ด๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ + ๐ด = 0
We constrain the curve to go through the archetypes at (0,0) and (1,0), and get that ๐ด =
0 and ๐ด๐‘ฅ๐‘ฅ = − 2๐ด๐‘ฅ
So we expect the Pareto front to be of the form
(๐‘ƒ๐น) ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 − ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ = 0
Each quadratic curve is associated with 2 parameters, Δ and ๐ฝ, defined by:
J = ๐ด๐‘ฅ๐‘ฅ ๐ด๐‘ฆ๐‘ฆ -
๐ด2๐‘ฅ๐‘ฆ
๐ด๐‘ฅ๐‘ฅ
and Δ = det ( ๐ด๐‘ฅ๐‘ฆ
๐ด๐‘ฅ
๐ด๐‘ฅ๐‘ฆ
๐ด๐‘ฆ๐‘ฆ
๐ด๐‘ฆ
23
๐ด๐‘ฅ
๐ด2 ๐ด
๐ด๐‘ฆ ) = − ๐‘ฅ๐‘ฅ ๐‘ฆ๐‘ฆ − ๐ด๐‘ฅ๐‘ฅ ๐ด๐‘ฅ๐‘ฆ ๐ด๐‘ฆ − ๐ด๐‘ฅ๐‘ฅ A2y
4
๐ด
For a hyperbola ๐ฝ < 0 and Δ ≠ 0. For a line, ๐ฝ = 0 and Δ = 0.
We would like to find all norm-pairs (๐‘š1 , ๐‘š2 ) that generate (๐‘ƒ๐น), given that the pair
(๐ผ, ๐‘€2 ) generates it.
Let ๐‘š1 depend on parameters ๐œ†1 , ๐œƒ1 and ๐‘š2 depend on parameters ๐œ†2 , ๐œƒ2 . Namely 1
๐‘š1 = ๐‘…(๐œƒ1 ) (
0
0
1
) ๐‘…(−๐œƒ1 ) ; ๐‘š2 = ๐‘…(๐œƒ2 ) (
๐œ†12
0
0
) ๐‘…(−๐œƒ2 )
๐œ†22
As will be shown in Appendix S7 (Lemma 1), the quadratic curve on which lies a Pareto
front associated with 2 tasks can be given by the equation: ๐œ‹๐‘ง (๐‘”1 (๐‘ฃ) × ๐‘”2 (๐‘ฃ)) ≡ 0,
where
๐‘”๐‘– (๐‘ฃ) = ∇๐‘ƒ๐‘– (๐‘ฃ) = ๐‘š๐‘– (๐‘ฃ − ๐‘ฃ๐‘–∗ ).
Denote
๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ) = ๐œ‹๐‘ง (๐‘”1 (๐‘ฃ) ×
๐‘”2 (๐‘ฃ)) ⇒
๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 )
= ๐‘ฆ (1 + λ12 + λ22 + λ12 λ22 + (−1 + λ12 )(1 + λ22 )Cos[2θ1 ]
− (−1 + λ12 )(−1 + λ22 )Cos[2(θ1 − θ2 )] − (1 + λ12 )(−1 + λ22 )Cos[2θ2 ])
+ 2๐‘ฅ๐‘ฆ(−(−1 + λ12 )(1 + λ22 )Cos[2θ1 ] + (1 + λ12 )(−1 + λ22 )Cos[2θ2 ])
+ ๐‘ฅ((−1 + λ12 )(1 + λ22 )Sin[2θ1 ] − (−1 + λ1)(−1 + λ22 )Sin[2(θ1 − θ2 )]
− (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
+ ๐‘ฆ 2 ((−1 + λ12 )(1 + λ22 )Sin[2θ1 ]
+ (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] − (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
+ ๐‘ฅ 2 (−(−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1
− θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
The Pareto front associated with ๐‘š1 , ๐‘š2 is thus given by ๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ) = 0.
1
๐œ‹
Here we observe the invariance of the Pareto front under ๐œ†๐‘– → ๐œ† , ๐œƒ๐‘– → ๐œƒ๐‘– + 2, which is
๐‘–
inherent to the problem since the norms are symmetric under this transformation. This is
only a “technical” degeneracy rather than a genuine one, since ๐‘‘๐‘– (๐œ†๐‘– , ๐œƒ๐‘– ) is essentially the
1
๐œ‹
same as ๐‘‘๐‘– (๐œ† , ๐œƒ๐‘– + 2 ). The resulting norms only differ by a factor of ๐œ†, and generate the
๐‘–
24
same contours and the same Pareto front (see Appendix S2). Hence, we can assume 0 ≤
๐œ‹
๐œƒ < 2.
We would like to know when ๐‘ƒ๐น = 0 and ๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ) = 0 represent the same
line/hyperbola. This happens if and only if these quadratic forms are equal up to a factor,
i.e. - ๐‘ƒ๐น = ๐›ฝ ๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ).
We know that ๐‘ƒ๐น and ๐‘ƒ๐น(๐œ†1 = 1, ๐œ†, ๐œƒ1 = 0, ๐œƒ) represent the same quadratic curve.
Hence,
∃๐›ฝ ๐‘ . ๐‘ก. ๐‘ƒ๐น = ๐›ฝ ๐‘ƒ๐น(๐œ†1 = 1, ๐œ†, ๐œƒ1 = 0, ๐œƒ).
Examining
the
expression
for
๐‘ƒ๐น(๐œ†1 = 1, ๐œ†, ๐œƒ1 = 0, ๐œƒ), we find out that the coefficients of ๐‘ฅ 2 and of ๐‘ฆ 2 are opposite to
one another. That is – if the hyperbola was indeed generated by the given norm-pair, it
mush have ๐ด๐‘ฅ๐‘ฅ = −๐ด๐‘ฆ๐‘ฆ .
First, let’s find which norm pairs generate ๐‘ƒ๐น given that it is a line. If the front is a line,
then both ๐ฝ = 0 and Δ = 0. As ๐ด๐‘ฅ๐‘ฅ = −๐ด๐‘ฆ๐‘ฆ and ๐ฝ = −๐ด2๐‘ฆ๐‘ฆ − ๐ด2๐‘ฅ๐‘ฆ we get that ๐ด๐‘ฆ๐‘ฆ =
0 and ๐ด๐‘ฅ๐‘ฆ = 0
We look for all other ๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 , for which exists ๐›ฝ such that ๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ) = ๐›ฝ๐‘ƒ๐น.
We know that when this happens, the coefficients of ๐‘ฅ 2 and of ๐‘ฆ 2 are opposite to one
another. This results in the equation:
(๐ด) − 2(−1 + λ1 )(−1 + λ2 )Sin[2(θ1 − θ2 )] = 0
๐œ‹
This equation is satisfied if λ1 = 1, λ2 = 1 or ๐œƒ1 = ๐œƒ2 as all angles are taken modulo 2 .
Also, we know that the coefficients of ๐‘ฅ 2 , ๐‘ฆ 2 and ๐‘ฅ๐‘ฆ are zero. This results in the
following equation:
(๐ต)((−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )]
− (1 + λ12 )(−1 + λ22 )Sin[2θ2 ]) = 0
Substituting into (๐ต) each of the solutions found for equation (๐ด), we get that:
25
If ๐œ†1 = 1, equation (B) becomes: −2(−1 + λ2 )Sin[2θ2 ] = 0. It is satisfied if λ2 = 1 or
๐œ‹
θ2 = 0 (as all angles are taken modulo 2 ).
If ๐œ†2 = 1, equation (B) becomes: 2(1 − λ1 )Sin[2θ1 ] = 0. It is satisfied if λ1 = 1 or θ1 =
0.
If ๐œƒ1 = ๐œƒ2 , equation (B) becomes: 2(๐œ†1 − λ2 )Sin[2θ2 ] = 0. It is satisfied if θ2 = 0, or
๐œ†1 − λ2 = 0
To conclude, equations (๐ด) and (๐ต), which must be satisfied in order that
๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ) will represents the same line as ๐‘ƒ๐น, are satisfied if and only if:
(I)
๐œ†1 = ๐œ†2 and ๐œƒ1 = ๐œƒ2
๐‘œ๐‘Ÿ
(II)
๐œƒ1 = ๐œƒ2 = 0
Note that if ๐œ†๐‘– = 1, ๐œƒ๐‘– has no effect on the norm and we can take it to be whatever fits.
On the other hand, it can be easily checked that if one of those conditions holds,
๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ) represents a line (or intersecting lines) that the line between the
archetypes lies on.
It means that 2 norms generate the line between the archetypes if and only if one of the
above conditions holds. The meaning of (I) is that ๐‘š1 = ๐‘š2 , and (II) means that ๐‘š1 ′๐‘ 
and ๐‘š2 ′๐‘  elliptic contours each have an axis parallel to the ๐‘ฅ axis (which is parallel to
๐‘ฃ1∗ − ๐‘ฃ2∗ )
Note that if a dataset is shaped like a line, 2 Euclidean norms will always generate it. This
means that in order for the above conclusions to hold, we only need to translate, rotate
and isometrically scale the coordinate system such that the archetypes are at (0,0) and
(1,0). This implies that the above conclusions also hold in the original coordinate system.
Equality of matrices is not affected by change of basis, which covers (I). Regarding case
(II) – notice that the contours’ axes transform like regular vectors when applying
translations / rotations, and parallel vectors remain parallel under any linear
transformation.
26
To summarize – for a given archetype pair (๐‘ฃ1∗ , ๐‘ฃ2∗ ), the norm-pair (๐‘š1 , ๐‘š2 ) generates a
Pareto front which is the line between them if and only if ๐‘š1 = ๐‘š2 or both ๐‘š1 and ๐‘š2
have elliptic contours with an axis that is parallel to ๐‘ฃ1∗ − ๐‘ฃ2∗ .
After taking care of which norms generate a line-shaped front, we will assume from now
on that the front is not a straight line but a hyperbola.
We saw that on the frame we work on, the Pareto front lies on a curve given by the
equation:
๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ − ๐ด๐‘ฅ๐‘ฅ ๐‘ฆ 2 − ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ = 0
We know that ๐ด๐‘ฅ๐‘ฅ ≠ 0, since otherwise ๐›ฅ = 0 and the resulting Pareto front is not a
hyperbola, so we can normalize the coefficient of ๐‘ฅ 2 to be 1, and equate both
(normalized) representations:
๐‘ฅ 2 − ๐‘ฅ + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ − ๐‘ฆ 2 + 2๐ด๐‘ฆ ๐‘ฆ = 0
And
๐‘ฅ2 − ๐‘ฅ
+ ๐‘ฆ
(1 + λ12 + λ22 + λ12 λ22 + (−1 + λ12 )(1 + λ22 )Cos[2θ1 ] − (−1 + λ12 )(−1 + λ22 )Cos[2(θ1 − θ2 )] − (1 + λ12 )(−1 + λ22 )Cos[2θ2 ])
(−(−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
+ 2๐‘ฅ๐‘ฆ
+ ๐‘ฆ2
(−(−1 +
λ12 )(1
(−(−1 + λ12 )(1 + λ22 )Cos[2θ1 ] + (1 + λ12 )(−1 + λ22 )Cos[2θ2 ])
+ λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
((−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] − (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
(−(−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
=0
These normalized representations describe the same hyperbola if and only if the
coefficients are equal. This results in 3 equations (equation set E1):
(−1+λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ21 )(−1+λ22 )Sin[2(θ1 −θ2 )]−(1+λ21 )(−1+λ22 )Sin[2θ2 ]
(I)
−1 +
(II)
−2๐ด๐‘ฆ +
(III)
−๐ด๐‘ฅ๐‘ฆ +
(1−λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ21 )(−1+λ22 )Sin[2(θ1 −θ2 )]+(1+λ21 )(−1+λ22 )Sin[2θ2 ]
=0
(1+λ21 )(1+λ22 )+(−1+λ21 )(1+λ22 )Cos[2θ1 ]−(−1+λ21 )(−1+λ22 )Cos[2(θ1 −θ2)]−(1+λ21 )(−1+λ22 )Cos[2θ2 ]
(1−λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ22 )((−1+λ21 )Sin[2(θ1 −θ2 )]+(1+λ21 )Sin[2θ2 ])
(1−λ21 )(1+λ22 )Cos[2θ1 ]+(1+λ21 )(−1+λ22 )Cos[2θ2 ]
2
2
(1−λ1 )(1+λ2 )Sin[2θ1 ]+(−1+λ22 )((−1+λ21 )Sin[2(θ1 −θ2 )]+(1+λ21 )Sin[2θ2 ])
Consider equation (I):
27
=0
=0
2(−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )]
=0
−(−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ22 )((−1 + λ12 )Sin[2(θ1 − θ2 )] + (1 + λ12 )Sin[2θ2 ])
It is satisfied when λ12 = 1 or λ22 = 1 or θ1 = θ2 . Substituting either ๐œ†๐‘– = 1 into equations
(I) and (III) yields equations that don’t depend on ๐œƒ๐‘– (this is expected since in case ๐œ†๐‘– = 1
the respective norm is Euclidean and ๐œƒ๐‘– is meaningless). So it is safe to assume that in
this case it is true, in a sense, that ๐œƒ1 = ๐œƒ2 . From now on we’ll simply use ๐œƒ to denote
the common angle. Equations (I) and (II) now become (regardless of which condition
satisfied equation (I)):
−2๐ด๐‘ฅ๐‘ฆ + 2 cot[2๐œƒ] = 0
2๐ด๐‘ฆ (λ12 − λ22 ) + λ12 cot[๐œƒ] + λ22 tan[๐œƒ]
=0
−λ12 + λ22
This means that ๐œƒ โ‰” θ1 = θ2 =
arccot[๐ด๐‘ฅ๐‘ฆ ]
2
For the second equation we see that it reduces to
2๐ด๐‘ฆ (1 −
λ22
λ22
)
+
cot[๐œƒ]
+
tan[๐œƒ] = 0
λ12
λ12
2
λ
2
λ
It means that from this equation we can only deduce (λ2 ) . Denote ๐ถ = (λ2 ) . C depends
1
1
on the parameters of the hyperbola:
arccot[๐ด๐‘ฅ๐‘ฆ ]
]
2
๐ถ=
arccot[๐ด๐‘ฅ๐‘ฆ ]
−2๐ด๐‘ฆ + tan[
]
2
−2๐ด๐‘ฆ − cot[
Note that for consistency, the value deduced for C from equation (II) must be positive. If
๐ถ ≤ 0, it means that there is no pair of norms that generate this hyperbola. We know that
the Pareto front is generated by ๐‘š1 = ๐ผ, ๐‘š2 = ๐‘€2 , so their parameters must satisfy
28
equation
(II).
arccot[๐ด๐‘ฅ๐‘ฆ ]
๐œ† tan[
2
arccot[๐ด๐‘ฅ๐‘ฆ ]
๐œ†1 = 1, ๐œ†2 = ๐œ†, ๐œƒ1 = 0, ๐œƒ2 = ๐œƒ ⇒ 2๐ด๐‘ฆ (1 − ๐œ†) + cot[
2
]+
]=0
arccot[๐ด๐‘ฅ๐‘ฆ ]
]
2
⇒ ๐œ†2 =
=๐ถ
arccot[๐ด๐‘ฅ๐‘ฆ ]
−2๐ด๐‘ฆ + tan[
]
2
−2๐ด๐‘ฆ − cot[
๐œ†2 > 0 , therefore ๐ถ > 0. This means that for every choice of ๐œ†1 > 0, choosing ๐œ†22 =
๐ถ ๐œ†12 will result in ๐œ†22 greater than zero, and hence ๐‘š2 that depends on ๐œ†2 will represent
an inner-product norm. Hence, the parameters ๐œƒ1 = ๐œƒ2 =
arccot[๐ด๐‘ฅ๐‘ฆ ]
, ๐œ†1 , ๐œ†2 = √๐ถ๐œ†1
2
define 2 norms, ๐‘š1 and ๐‘š2 that generate (PF), for every ๐œ†1 > 0. This means that there is
a 1-dimensional family of norm-pairs that generate PF, parameterized by ๐œ†1 . Note that
choosing ๐‘š1 determines ๐‘š2 uniquely.
However, note that there exist hyperbola-shaped data sets that cannot be describes as a
Pareto front associated with 2 tasks. This is since the existence of any solution relies on C
being positive. C is defined by the parameters of the hyperbola (๐ถ =
arccot[๐ด๐‘ฅ๐‘ฆ ]
]
2
arccot[๐ด๐‘ฅ๐‘ฆ ]
−2๐ด๐‘ฆ +tan[
]
2
−2๐ด๐‘ฆ −cot[
).
We can find a hyperbola with parameters that define a negative C, which means that it is
not the Pareto front of any norm pairs. This also means that given a hyperbola, it is easy
to check whether or not it is generated by a norm-pair by simply calculating ๐ถ. An
example for a hyperbola with ๐ถ < 0 is ๐‘ฅ 2 − 2๐‘ฅ๐‘ฆ − ๐‘ฆ 2 − ๐‘ฅ + 2๐‘ฆ = 0.
So, knowing at least one norm pair that generates the Pareto front, we can deduce all
norm pairs. However, another question can be asked: can we determine from the
parameters of the hyperbola whether there are norm pairs that generate the hyperbola, and
29
if so what are the norms? We will show that in the common case, it is numerically
possible.
To approach this question, assume we have data shaped like a hyperbola section. From
the data, we can identify the edge points of the hyperbola, and change coordinate system
such that the edge points are at (0,0) and (1,0). Those points are assumed to be the 2
archetypes - ๐‘ฃ1∗ = (0,0) and ๐‘ฃ2∗ = (1,0). The new coordinate system results from the old
coordinate system by a rotation, translation and isometric scaling. Note the difference
between this transformation and the one described earlier – in both of them we transform
the archetypes to be at (0,0) and (1,0). However, earlier we transformed such that one of
the norms that generates the front is Euclidean. Here we can’t do so as we don’t know
which norm pairs generate the hyperbola. As before, we get that under such
transformations, a hyperbola remains a hyperbola, and there is a 1-to-1 correspondence
between norm-pairs that generate the transformed front in the transformed coordinate
system and the norm-pairs that generate the front in the original coordinate system.
Hence, once we find which norms generate the front in the transformed coordinate
system, we can transform back and find the norms that generate the front in the original
coordinate system. If we find that there isn’t a norm pair that generates the hyperbola in
the transformed coordinate system, this conclusion will hold in the original coordinate
system.
In this coordinate system, the hyperbola is given by the equation
๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 − ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ = 0
The ๐‘ฅ coefficient and free parameter are determined since the hyperbola has to pass
through the archetypes at (0,0) and (0,1).
We would like to know which pairs of norms, if exist, generate a Pareto front with those
parameters. We search for parameters ๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 such that ๐‘ƒ๐น(๐œ†1 , ๐œ†2 , ๐œƒ1 , ๐œƒ2 ), defined
before, describes the same curve as ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ 2 + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 − ๐ด๐‘ฅ๐‘ฅ ๐‘ฅ + 2๐ด๐‘ฆ ๐‘ฆ = 0.
30
We know that ๐ด๐‘ฅ๐‘ฅ ≠ 0, since otherwise ๐›ฅ = 0 and the Pareto front is a line, so we can
normalize the coefficient of ๐‘ฅ 2 to be 1, and equate both (normalized) representations:
๐‘ฅ 2 − ๐‘ฅ + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 + 2๐ด๐‘ฆ ๐‘ฆ = 0
and
๐‘ฅ2 − ๐‘ฅ
+ ๐‘ฆ
(1 + λ12 + λ22 + λ12 λ22 + (−1 + λ12 )(1 + λ22 )Cos[2θ1 ] − (−1 + λ12 )(−1 + λ22 )Cos[2(θ1 − θ2 )] − (1 + λ12 )(−1 + λ22 )Cos[2θ2 ])
(−(−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
+ 2๐‘ฅ๐‘ฆ
+ ๐‘ฆ2
(−(−1 +
λ12 )(1
(−(−1 + λ12 )(1 + λ22 )Cos[2θ1 ] + (1 + λ12 )(−1 + λ22 )Cos[2θ2 ])
+ λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
((−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] − (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
(−(−1 + λ12 )(1 + λ22 )Sin[2θ1 ] + (−1 + λ12 )(−1 + λ22 )Sin[2(θ1 − θ2 )] + (1 + λ12 )(−1 + λ22 )Sin[2θ2 ])
=0
These normalized representations describe the same hyperbola if and only if the
coefficients are equal. This results in 3 equations (equation set E2):
(I)
−๐ด๐‘ฆ๐‘ฆ +
(II)
−2๐ด๐‘ฆ +
(III)
−๐ด๐‘ฅ๐‘ฆ +
(−1+λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ21 )(−1+λ22 )Sin[2(θ1 −θ2 )]−(1+λ21 )(−1+λ22 )Sin[2θ2 ]
(1−λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ21 )(−1+λ22 )Sin[2(θ1 −θ2 )]+(1+λ21 )(−1+λ22 )Sin[2θ2 ]
=0
(1+λ21 )(1+λ22 )+(−1+λ21 )(1+λ22 )Cos[2θ1 ]−(−1+λ21 )(−1+λ22 )Cos[2(θ1 −θ2)]−(1+λ21 )(−1+λ22 )Cos[2θ2 ]
(1−λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ22 )((−1+λ21 )Sin[2(θ1 −θ2 )]+(1+λ21 )Sin[2θ2 ])
(1−λ21 )(1+λ22 )Cos[2θ1 ]+(1+λ21 )(−1+λ22 )Cos[2θ2 ]
(1−λ21 )(1+λ22 )Sin[2θ1 ]+(−1+λ22 )((−1+λ21 )Sin[2(θ1 −θ2 )]+(1+λ21 )Sin[2θ2 ])
=0
=0
The solution of this equation set behaves quite differently depending on ๐ด๐‘ฆ๐‘ฆ .
First assume that ๐ด๐‘ฆ๐‘ฆ = −1. In that case, equation set E2 becomes equation set E1. Note
that as mentioned under equation set E1, this scenario happens if ๐œƒ1 = ๐œƒ2 , ๐œ†1 = 1 or ๐œ†2 =
1 (one of the norms is Euclidean). We know that the resulting solution is as calculated
above for E1, and that it is valid only if the hyperbola’s parameters are such that C > 0.
Here, given only the fit of the hyperbola, we can determine if there are norms that
generate the hyperbola, and what are the norms if they exist.
When ๐ด๐‘ฆ๐‘ฆ ≠ −1, solving equations (E2.I)+(E2.III) results in:
λ12 = 1 +
2(1 + ๐ด๐‘ฆ๐‘ฆ )
−1 − ๐ด๐‘ฆ๐‘ฆ + (−1 + ๐ด๐‘ฆ๐‘ฆ )Cos[2θ1 ] + 2๐ด๐‘ฅ๐‘ฆ Sin[2θ1 ]
31
λ22 = 1 +
2(1 + ๐ด๐‘ฆ๐‘ฆ )
−1 − ๐ด๐‘ฆ๐‘ฆ + (−1 + ๐ด๐‘ฆ๐‘ฆ )Cos[2θ2 ] + 2๐ด๐‘ฅ๐‘ฆ Sin[2θ2 ]
Substituting this into equation (II) we get:
2
(IV) Sin[2θ1 ] (4(๐ด๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฆ )Cos[2θ2 ] + (−4๐ด๐‘ฅ๐‘ฆ 2 + (1 + ๐ด๐‘ฆ๐‘ฆ ) ) Sin[2θ2 ]) +
4 Cos[2θ1 ] (๐ด๐‘ฆ๐‘ฆ Cos[2θ2 ] − (๐ด๐‘ฅ๐‘ฆ ๐ด๐‘ฆ๐‘ฆ + ๐ด๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฆ )Sin[2θ2 ]) = 0
๐œ‹
For every ๐œƒ2 , we will show that there exists a single angle ๐œƒ1 (modulo 2 ), such that
equation (IV) is satisfied. (An example for specific parameters ๐ด๐‘ฆ๐‘ฆ , ๐ด๐‘ฅ๐‘ฆ , ๐ด๐‘ฆ is shown in
Fig.S2)
2
For convenience denote ๐œ‰ โ‰” (1 + ๐ด๐‘ฆ๐‘ฆ ) − 4๐ด๐‘ฅ๐‘ฆ 2 , ๐œ‚ = 4(๐ด๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฆ )
There are 2 cases:
(A) ๐œƒ2 ≠
๐œ‹
4
๐œ‹
(๐‘š๐‘œ๐‘‘ 2 ):
Now Cos[2θ2 ] ≠ 0 and we can divide the equation by it:
(Sin[2θ1 ](๐œ‚ + ๐œ‰ Tan[2θ2 ]) + 4 Cos[2θ1 ] (๐ด๐‘ฆ๐‘ฆ Cos[2θ2 ] − (๐ด๐‘ฅ๐‘ฆ ๐ด๐‘ฆ๐‘ฆ + ๐ด๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฆ )Sin[2θ2 ])) = 0
Again, there are 2 cases:
๐œ‚
(A.1) Tan[2θ2 ] = − ๐œ‰
Substituting this into equation (IV) we get the equation:
2
8(1 + ๐ด๐‘ฆ๐‘ฆ ) (๐ด๐‘ฆ๐‘ฆ + 4๐ด๐‘ฆ (๐ด๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ ))
(
) Cos[2θ1 ] = 0
๐œ‰
2
(1 + ๐ด๐‘ฆ๐‘ฆ ) (๐ด๐‘ฆ๐‘ฆ + 4๐ด๐‘ฆ (๐ด๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ )) ≠ 0. This is because we assume that ๐ด๐‘ฆ๐‘ฆ ≠ −1,
and, in addition, the Δ property of the quadratic curve is ๐›ฅ = 4(๐ด๐‘ฆ๐‘ฆ + 4๐ด๐‘ฆ (๐ด๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ ))
and for a hyperbola ๐›ฅ ≠ 0. Hence, cos[2θ1 ] must equal 0 for the equation to be satisfied,
๐œ‹
๐œ‹
meaning ๐œƒ1 = 4 (๐‘š๐‘œ๐‘‘ 2 ).
32
๐œ‚
(A.2) Tan[2θ2 ] ≠ − ๐œ‰ :
๐œ‹
๐œ‹
In that case, we can assume that ๐œƒ1 ≠ 4 . This is because when ๐œƒ1 = 4 , Cos[2θ1 ] = 0,
Sin[2θ1 ] = 1, and ๐œƒ1 =
๐œ‹
4
solves the equation only if ๐œ‚ + ๐œ‰ Tan[2θ2 ] = 0. Note that
either ๐œ‰ ≠ 0 or ๐œ‚ ≠ 0, since if both ๐œ‰ = 0 and ๐œ‚ = 0, it can be shown that the Δ property
of the quadratic curve becomes 0. This means that this equation has a solution only if
๐œ‚
Tan[2θ2 ] = − ๐œ‰ , which is assumed not to be case.
๐œ‹
So we assume ๐œƒ1 ≠ 4 . In that case, Cos[2๐œƒ1 ] ≠ 0 and we can divide the equation by it.
The equation becomes:
8(๐ด๐‘ฆ๐‘ฆ − (๐ด๐‘ฅ๐‘ฆ ๐ด๐‘ฆ๐‘ฆ + ๐ด๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฆ )) Tan[2θ2 ] + Tan[2θ1 ](๐œ‚ + ๐œ‰ Tan[2θ2 ]) = 0
Note that if ๐œ‚ + ๐œ‰ Tan[2θ2 ] ≠ 0, as was shown earlier to be the case here, then ๐œƒ1 such
that:
Tan[2θ1 ] =
−4๐ด๐‘ฆ๐‘ฆ + 4(๐ด๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐ด๐‘ฆ ) Tan[2θ2 ]
(๐œ‚ + ๐œ‰ Tan[2θ2 ])
solves equation (IV).
๐œ‹
Note that once Tan[2๐œƒ1 ] is determined, ๐œƒ1 is determined modulo 2 . However, we assume
๐œ‹
that 0 ≤ ๐œƒ1 , ๐œƒ2 < 2 , so determining ๐œƒ1 up to
๐œ‹
๐œ‹
2
is enough. Moreover, we know that if
1
๐œƒ1 → ๐œƒ1 + 2 , then if also ๐œ†1 → ๐œ† the Pareto front doesn’t change. Examining the term for
1
๐œ†1 , we see this symmetry.
๐œ‹
To conclude, for each ๐œƒ2 ≠ 4 , there is a single ๐œƒ1 in the range 0 ≤ ๐œƒ1 <
๐œƒ1 , ๐œƒ2 , ๐œ†(๐œƒ1 ), ๐œ†(๐œƒ2 ) generate the hyperbola.
๐œ‹
(B) ๐œƒ2 = 4 :
In that case, ๐œƒ1 =
๐œ‹
4
is a solution to the equation if and only if ๐œ‰ = 0.
33
๐œ‹
2
such that
๐œ‹
Otherwise, ๐œƒ1 ≠ 4 , and we can divide by cos[2๐œƒ1 ] to get:
2(๐œ‚ + ๐œ‰ tan[2θ1 ]) = 0
Again, for a hyperbola, it is not possible that both ๐œ‚ and ๐œ‰ are zero, so the only way that
๐œ‚
the equation will be satisfied is if tan[2θ1 ] = ๐œ‰ .
To conclude, for every value of ๐œƒ2 , we can determine a unique value for ๐œƒ1 in the range
๐œ‹
[0, 2 ) such that the hyperbola on which lies the Pareto front that is generated by the 2
norms ๐‘€1 (๐œ†1 (๐œƒ1 ), ๐œƒ1 ), ๐‘€2 (๐œ†2 (๐œƒ2 ), ๐œƒ2 ) is the same as the hyperbola described by ๐‘ฅ 2 −
๐‘ฅ + 2๐ด๐‘ฅ๐‘ฆ ๐‘ฅ๐‘ฆ + ๐ด๐‘ฆ๐‘ฆ ๐‘ฆ 2 + 2๐ด๐‘ฆ ๐‘ฆ = 0. The ๐œƒ1 that matches ๐œƒ2 is given by:
θ1 =
๐œ‹
If ๐œƒ2 ≠ 4 , tan[2๐œƒ2 ] ≠
4๐ด๐‘ฆ๐‘ฆ + ๐œ‚ tan[2θ2 ]
1
arctan[
]
(๐œ‚ + ๐œ‰ tan[2θ2 ])
2
๐œ‚
๐œ‰
θ1 =
๐œ‹
๐œ‹
4
๐œ‚
If ๐œƒ2 ≠ 4 , tan[2๐œƒ2 ] = ๐œ‰ , ๐œ‰ ≠ 0
θ1 =
If ๐œƒ2 =
1
๐œ‚
arctan [ ]
2
๐œ‰
๐œ‹
4
Note that this describes a 1-dimensional continuous curve. When ๐œƒ2 ≠
๐œ‚
1
๐‘‡๐‘Ž๐‘›[2๐œƒ2 ] → ๐œ‰ , then ๐œƒ1 = 2 ๐ด๐‘Ÿ๐‘๐‘‡๐‘Ž๐‘› [
4๐ด๐‘ฆ๐‘ฆ +๐œ‚ tan[2θ2 ]
(๐œ‚+๐œ‰ tan[2θ2 ])
๐œ‹
1
]→
2
๐œ‹
4
and
๐ด๐‘Ÿ๐‘๐‘‡๐‘Ž๐‘›[∞] =
๐œ‹
4
๐œ‚
which is the solution when ๐œƒ2 ≠ 4 , ๐‘‡๐‘Ž๐‘›[2๐œƒ2 ] = ๐œ‰ .
๐œ‹
๐œ‹
1
When ๐œƒ2 → 4 , then the solution when ๐œƒ2 ≠ 4 , θ1 = 2 arctan[
1
๐œ‚
๐œ‹
θ1 = 2 arctan [๐œ‰ ] which is the solution when ๐œƒ2 = 4 .
34
4๐ด๐‘ฆ๐‘ฆ +๐œ‚ tan[2θ2 ]
(๐œ‚+๐œ‰ tan[2θ2 ])
] goes to
Note that ๐œƒ2 ≠ ๐œƒ1 since we assume ๐ด๐‘ฆ๐‘ฆ ≠ −1, so when ๐œƒ2 →
๐œ‹
4
๐œ‹
4
it is okay to assume ๐œƒ1 ≠
.
To conclude, for every ๐œƒ2 , there is a single ๐œƒ1 (๐œƒ2 ), such that norms with the parameters
๐œƒ1 (๐œƒ2 ), ๐œƒ2 , ๐œ†12 (๐œƒ1 (๐œƒ2 )), ๐œ†22 (๐œƒ2 ) generate the hyperbola. However, those parameters will
represent matrices of inner-product norms only if ๐œ†12 (๐œƒ1 (๐œƒ2 )) > 0 and ๐œ†22 (๐œƒ2 ) > 0.
Hence, if there exists ๐œƒ2 such that ๐œ†12 (๐œƒ1 (๐œƒ2 )) > 0 and ๐œ†22 (๐œƒ2 ) > 0, the norms
represented by ๐œƒ1 (๐œƒ2 ), ๐œƒ2 , ๐œ†12 (๐œƒ1 (๐œƒ2 )), ๐œ†22 (๐œƒ2 ) will generate the hyperbola. We can find
all norms that generate a given hyperbola by considering all ๐œƒ2 ′๐‘  for which ๐œ†12 (๐œƒ1 (๐œƒ2 )) >
0 and ๐œ†22 (๐œƒ2 ) > 0. In that case, we know that there is a 1-dimensional family of norms
that generate this hyperbola. If there doesn’t exist such ๐œƒ2 , it means that there doesn’t
exist a pair of norms that generates the given hyperbola.
We can examine the existence of such ๐œƒ2 numerically: we plot ๐œ†12 (๐œƒ1 (๐œƒ2 )) and ๐œ†22 (๐œƒ2 ),
and check if there is an area where both ๐œ†12 and ๐œ†22 are positive. An example for such a
plot is given in Fig S3.
To conclude, we showed that we cannot uniquely determine norm pairs that generated a
given front. Some hyperbolae cannot be described as the Pareto front of 2 tasks in 2D.
For those who can be described, there is a one-dimensional family of norm-pairs that
generate them. However, it is generally enough to determine a single parameter of one of
the norms, to completely determine both norms. If such a parameter could be obtained by
other means (e.g. – a biomechanical model, etc.), the above method can be used to
exactly determine the norms, and therefore the relative importance of each trait to the
performance.
35
Fig S2: When considering the set of norm-pairs parameters that generate a given hyperbola,
๐œฝ๐Ÿ is a function of ๐œฝ๐Ÿ . Here, an example is shown for the hyperbola with parameters as
displayed in the figure.
Fig S3: ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ (๐œฝ๐Ÿ )) and ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ ), for 2 different hyperbolae. a) A plot of ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ (๐œฝ๐Ÿ )) and
๐Ÿ
๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ ) for a hyperbola with the parameters: ๐‘จ๐’™๐’š = −๐Ÿ—, ๐‘จ๐’š๐’š = −๐Ÿ“, ๐‘จ๐’š = − . It can be
๐Ÿ
seen that there exist ๐œฝ๐Ÿ ′๐’” for which both ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ (๐œฝ๐Ÿ )) and ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ ) are positive. This means
36
that there are norms whose Pareto front is the above hyperbola. b) A plot of ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ (๐œฝ๐Ÿ )) and
๐Ÿ
๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ ) for a hyperbola with the parameters: ๐‘จ๐’™๐’š = ๐Ÿ‘, ๐‘จ๐’š๐’š = ๐Ÿ, ๐‘จ๐’š = − . It can be seen
๐Ÿ‘
that there isn’t any ๐œฝ๐Ÿ such that both ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ (๐œฝ๐Ÿ )) and ๐€๐Ÿ๐Ÿ (๐œฝ๐Ÿ ) are positive. This means that
this hyperbola cannot be described as the Pareto front of any norm.
37
Appendix S6
Generally, for 3 tasks in a 2D morphospace, the norms can be uniquely determined
by the shape of the Pareto front
Let us assume that we have 3 tasks in a 2๐ท-morphospace. The matrices that describe the
norms that the performance functions depend on are denoted by ๐‘€๐‘– , for ๐‘– ∈ {1,2,3}.
Denote the parameters that ๐‘€๐‘– depends on with ๐œ†๐‘– , ๐œƒ๐‘– .
The Pareto front in this case is given by (Appendix S1):
3
๐‘ฃ = (∑ ๐›ผ๐‘– ๐‘€๐‘– )
๐‘–=1
−1
3
3
(∑ ๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗
๐‘–=1
) ๐‘ . ๐‘ก. 0 ≤ ๐›ผ๐‘– ≤ 1, ∑ ๐›ผ๐‘– = 1
๐‘–=1
As proven in Appendix S7, the boundary of ๐‘ƒ1,2,3, the 3-tasks Pareto front, is composed
of the three 2-tasks fronts. Hence, given a dataset, we can take its boundary and assume
that it represents 3 hyperbolae. Denote by ๐‘ƒ๐‘–,๐‘— the 2-tasks front between ๐‘ฃ๐‘–∗ and ๐‘ฃ๐‘—∗ . Since
there are 3 tasks, each one is associated with two 2-tasks fronts. From the front ๐‘ƒ๐‘–,๐‘— we
can deduce the 1-dimensional family of norm-pairs that generates it, as explained in
๐‘–
Appendix S5. Denote this family by โ„ฑ๐‘–,๐‘— . Denote by โ„ฑ๐‘–,๐‘—
the family of norms that are
associated with task ๐‘– and were calculated from the front ๐‘ƒ๐‘–,๐‘— . Thus, there are 2 families of
๐‘–
๐‘–
potential norms that task ๐‘– might depend on - โ„ฑ๐‘–,๐‘—
and โ„ฑ๐‘–,๐‘˜
(๐‘– ≠ ๐‘— ≠ ๐‘˜). However, each
task can depend only on one norm. It means that if the hyperbolae triad is indeed
๐‘–
generated by a norms-triad, then there must be at least one common member between โ„ฑ๐‘–,๐‘—
๐‘–
๐‘–
๐‘–
and โ„ฑ๐‘–,๐‘˜
. Denote โ„ฑ ๐‘– = โ„ฑ๐‘–,๐‘—
∩ โ„ฑ๐‘–,๐‘˜
, then โ„ฑ ๐‘– ≠ ๐œ™. Choosing a member ๐‘›๐‘– ∈ โ„ฑ ๐‘– determines
(see Appendix S5) a member ๐‘›๐‘–,๐‘— of โ„ฑ๐‘–,๐‘— and a member ๐‘›๐‘–,๐‘˜ of โ„ฑ๐‘–,๐‘˜ . Those in turn
determine a member ๐‘›๐‘— ∈ โ„ฑ๐‘— and ๐‘›๐‘˜ ∈ โ„ฑ ๐‘˜ . Define ๐‘๐‘—,๐‘˜ = {(๐‘›๐‘— , ๐‘›๐‘˜ )|๐‘›๐‘– ∈ โ„ฑ ๐‘– }. ๐‘๐‘—,๐‘˜
includes all pairs of norms on which the performance of task ๐‘— and task ๐‘˜ can depend,
deduced from looking on task ๐‘–. On the other hand, task j and task k generate ๐‘“๐‘—,๐‘˜ . This
means that the existence of a triplet of norms that generate the Pareto front requires that
๐‘๐‘—,๐‘˜ ∩ ๐‘“๐‘—,๐‘˜ ≠ ๐œ™.
38
Hence, three cases are possible:
(I)
|๐‘๐‘—,๐‘˜ ∩ โ„ฑ๐‘—,๐‘˜ | = 1. In that case, there is only one triplet of norms that generates
the given Pareto front.
(II)
|๐‘๐‘—,๐‘˜ ∩ โ„ฑ๐‘—,๐‘˜ | > 1. In that case, the norms-triplet cannot be determined
uniquely from the front.
(III)
|๐‘๐‘—,๐‘˜ ∩ โ„ฑ๐‘—,๐‘˜ | = 0. In that case, the hyperbole-bound triangular shape
corresponds to no triplet of norms and hence cannot be explained in the scope
of the current model.
Let’s check under what conditions each of the above cases occur. But first, we prove the
following lemma:
Lemma 1: Let ๐‘ƒ๐‘–,๐‘— be the Pareto front associated with tasks ๐‘– and ๐‘—.
If one of the following occurs:
๐‘—
๐‘–
(I) ๐ผ ∈ โ„ฑ๐‘–,๐‘—
or ๐ผ ∈ โ„ฑ๐‘–,๐‘— (where ๐ผ is identity matrix, representing the Euclidean norm)
(II) ∃(๐‘š1 , ๐‘š2 ) ∈ โ„ฑ๐‘–,๐‘— such that ๐œƒ1 = ๐œƒ2 .
Then for every ๐œ† > 0 there is a norm-pair that generate ๐‘ƒ๐‘–,๐‘— given by
1
0
๐‘š1 = ๐‘…[๐œƒ๐‘–,๐‘— ] (
0
) ๐‘…[−๐œƒ๐‘–,๐‘— ]
๐œ†2
1
0
๐‘š2 = ๐‘…[๐œƒ๐‘–,๐‘— ] (0 ๐ถ ๐œ†2 ) ๐‘…[−๐œƒ๐‘–,๐‘— ]
๐‘–,๐‘—
where ๐ถ๐‘–,๐‘— > 0 and ๐œƒ๐‘–,๐‘— are constant determined by the parameters of ๐‘ƒ๐‘–,๐‘— .
ฬƒ๐‘–∗ = (0,0), ๐‘ฃ๐‘—∗ โ†ฆ ฬƒ
Proof: We transform to a coordinate system where ๐‘ฃ๐‘–∗ โ†ฆ ๐‘ฃ
๐‘ฃ๐‘—∗ = (1,0).
This transformation is a combination of a translation, a rotation by an angle ๐œƒฬƒ, and an
isomorphic scaling. Under those transformations, the parameters of the norms transform
in the following way:
39
๐œ†1 โ†ฆ ๐œ†ฬƒ1 = ๐œ†1 ,
๐œ†2 โ†ฆ ๐œ†ฬƒ2 = ๐œ†2 ,
๐œƒ1 โ†ฆ ๐œƒฬƒ1 = ๐œƒ1 + ๐œƒฬƒ,
๐œƒ2 โ†ฆ ๐œƒฬƒ2 = ๐œƒ2 + ๐œƒฬƒ
It means that if condition (I) or (II) holds in the original coordinate system, it will hold in
the transformed coordinate system. So either way, from Appendix S5 we know that in the
transformed coordinate system, ๐‘ƒ๐‘–,๐‘— โ†ฆ ๐‘ƒฬƒ
๐‘–,๐‘— whose parameter ๐ด๐‘ฆ๐‘ฆ = −1. In that scenario,
ฬƒ
the family of norm pairs that generate ๐‘ƒฬƒ
๐‘–,๐‘— , โ„ฑ๐‘–,๐‘— , was fully classified in Appendix S5:
ฬƒ2
ฬƒ2
ฬƒ ฬƒ
ฬƒ
โ„ฑ
ฬƒ 1, ๐‘š
ฬƒ 2 ) ๐‘ . ๐‘ก. ๐œƒฬƒ1 = ๐œƒฬƒ2 = ๐œƒฬƒ
๐‘–,๐‘— = (๐‘š
๐‘–,๐‘— , ๐œ†2 = ๐ถ๐‘–,๐‘— ๐œ†1 , for ๐œƒ๐‘–,๐‘— , ๐ถ๐‘–,๐‘— > 0 that are determined
by the parameters of the hyperbola ๐‘ƒฬƒ
๐‘–,๐‘— . Going back to the original coordinate system, we
ฬƒ
can deduce that โ„ฑ๐‘–,๐‘— = (๐‘š1 , ๐‘š2 ) ๐‘ . ๐‘ก. ๐œƒ1 = ๐œƒ2 = ๐œƒ๐‘–,๐‘— , ๐œ†22 = ๐ถ๐‘–,๐‘— ๐œ†12, for ๐œƒ๐‘–,๐‘— = ๐œƒฬƒ
๐‘–,๐‘— − ๐œƒ ,
๐ถ๐‘–,๐‘— = ๐ถฬƒ
๐‘–,๐‘— . โˆŽ
Consider a hyperbole-bound triangular shape with vertices ๐‘ฃ1∗ , ๐‘ฃ2∗ , ๐‘ฃ3∗ . Denote โ„ฑ๐‘–,๐‘— as
above. For now we assume that there is a triplet of norms – ๐‘€1 , ๐‘€2 , ๐‘€3 - that generate that
shape, i.e. โ„ฑ๐‘–,๐‘— ∩ ๐‘๐‘–,๐‘— ≠ ๐›ท. We will attempt to characterize when this norm-triplet is
unique, and show that if it is not unique, there exists a 1-dimensional family of normtriplets that generate the same ๐‘ƒ1,2,3.
We choose coordinate system such that ๐‘€1 = ๐ผ. This is possible as shown in Appendix
S2. In that case, from lemma 1 we know that โ„ฑ1,2 - the family of norm pairs that generate
๐‘ƒ1,2 , contains norms for which ๐œƒ1 = ๐œƒ2 =: ๐œƒ1,2 and
λ22
λ21
=: ๐ถ1,2 , where ๐ถ1,2 > 0, ๐œƒ1,2 are
defined by the parameters of ๐‘ƒ1,2 . Lemma 1 also tell us that โ„ฑ1,3- the family of norm pairs
that generate ๐‘ƒ1,3 - contains norms for which ๐œƒ1 = ๐œƒ3 =: ๐œƒ1,3 and
λ23
λ21
=: ๐ถ1,3 , where ๐ถ1,3 >
0, ๐œƒ1,3 are defined by the parameters of ๐‘ƒ1,3 .
There are 2 cases, either ๐œƒ1,3 ≠ ๐œƒ1,2 or ๐œƒ1,3 = ๐œƒ1,2 .
Claim 1: If ๐œƒ1,3 ≠ ๐œƒ1,2 , then โ„ฑ 1 = {๐‘€1 } = {๐ผ}, and the norm-triplet is unique.
1
1
Proof: Let ๐‘€′ ∈ โ„ฑ 1 ⇒ ๐‘€′ ∈ โ„ฑ1,2
and ๐‘€′ ∈ โ„ฑ1,3
. From the above conclusions on โ„ฑ1,2
and โ„ฑ1,3 we deduce that on one hand ๐œƒ๐‘€′ = ๐œƒ1,3 and on the other hand ๐œƒ๐‘€′ = ๐œƒ1,2 . If
๐œƒ1,3 ≠ ๐œƒ1,2 , the only possibility to resolve the conflict is if ๐œ†๐‘€′ = 1 and then ๐‘€′ = ๐ผ,
40
since in this case the norm is Euclidean and ๐œƒ is meaningless. ๐‘€1 is thus determined
uniquely, and since ๐‘€2 is determined uniquely by ๐‘€1 and also ๐‘€3 is determined uniquely
by ๐‘€1 . This means that there is only one triplet of norms that generates the Pareto front
⇒ we can uniquely deduce the norms that he performance functions depend on from the
Pareto front. โˆŽ
Claim 2: If ๐œƒ1,3 = ๐œƒ1,2 , then there’s a 1-dimensional family of norm-triplets that
generates the given ๐‘ƒ1,2,3.
1
1
1
Proof: In that case โ„ฑ1,2
= โ„ฑ1,3
. This is true since according to lemma 1 โ„ฑ1,2
contains all
1
norms with ๐œƒ = ๐œƒ1,2 and any ๐œ† > 0, and โ„ฑ1,3
contains all norms with ๐œƒ = ๐œƒ1,3 and any
1
1
1
1
1
๐œ† > 0. Since โ„ฑ 1 = โ„ฑ1,2
∩ โ„ฑ1,3
we get that โ„ฑ 1 = โ„ฑ1,2
= โ„ฑ1,3
. โ„ฑ1,2
is infinite since โ„ฑ1,2 is
infinite (Appendix S5) and hence โ„ฑ 1 is infinite. Under the current assumptions, there is
๐‘€1 ∈ โ„ฑ 1 with (๐‘€2 , ๐‘€3 ) ∈ โ„ฑ2,3 (i.e. the norm ๐‘€2 that is paired with ๐‘€1 in โ„ฑ1,2 and the
norm ๐‘€3 that is paired with ๐‘€1 in โ„ฑ1,3 generate ๐‘ƒ2,3 ). Note that ๐‘€2 and ๐‘€3 both have the
same angle ๐œƒ. In that case, from lemma 1 we get that (๐‘š2 , ๐‘š3 ) ∈ ๐‘“2,3 ⇔ ๐œƒ2 = ๐œƒ3 =
๐œƒ2,3 ๐‘Ž๐‘›๐‘‘ ๐œ†23 = ๐ถ2,3 ๐œ†22, for ๐œƒ2,3 and ๐ถ2,3 determined by the parameters of the hyperbola.
(๐‘€2 , ๐‘€3 ) generates ๐‘ƒ2,3, and thus
๐œ†2๐‘€2
๐œ†2๐‘€3
= ๐ถ2,3 and θM2 = θM3 = ๐œƒ2,3 . In addition,
(๐‘€1 , ๐‘€2 ) ∈ โ„ฑ1,2 ⇒ ๐œ†2M2 = ๐ถ1,2 ๐œ†2๐‘€1 .
(๐‘€1 , ๐‘€3 ) ∈ โ„ฑ1,3 ⇒ ๐œ†2M3 = ๐ถ1,3 ๐œ†2๐‘€1 .
⇒ ๐ถ2,3 =
๐œ†2M3
๐œ†2M2
๐ถ1,3 ๐œ†2๐‘€1
=๐ถ
2
1,2 ๐œ†๐‘€1
Now take another
๐‘€1′
๐ถ
= ๐ถ1,3 , and θM1 = θM2 = ๐œƒ1,2 = ๐œƒ1,3 = θM3 = ๐œƒ2,3.
1,2
1
∈ โ„ฑ . We know that,
๐œ†2๐‘€′
2
=
๐ถ1,2 ๐œ†2๐‘€′
1
,
๐œ†2๐‘€′
3
=
๐ถ1,3 ๐œ†2๐‘€′
1
⇒
๐œ†2 ′
๐‘€
3
๐œ†2 ′
๐‘€
=
2
๐ถ1,3 ๐œ†2 ′
๐‘€
1
๐ถ1,2 ๐œ†2 ′
๐‘€1
=
๐ถ1,3
๐ถ1,2
= ๐ถ2,3 . We also know that θ๐‘€1′ = θ๐‘€2′ , θ๐‘€1′ = θ๐‘€3′ ⇒ θ๐‘€2′ = θ๐‘€3′ ⇒
(๐‘€2′ , ๐‘€3′ ) ∈ โ„ฑ2,3. Hence, every ๐‘€1′ ∈ โ„ฑ 1 is a part of a norm-triplet that generates the
hyperbolae triplet. Since โ„ฑ 1 is infinite, it means that there are infinite number of normtriplets that generate the given hyperbolae-bound shape. The family is defined by one of
41
the ๐œ† parameters, since all ๐œƒs are known, and every other ๐œ† is determined by the
respective ๐ถ constant. โˆŽ
Those conclusions are true in the frame where ๐‘€1 = ๐ผ – i.e. – there is a norm triplet that
generates the Pareto front in which the norm associated with task 1 is Euclidean. We can
go back to the original coordinate system by rotating, translating and rescaling the space.
Those transformations are invertible, so there is a 1-to-1 correspondence between normtriplets that generates the shape in the transformed coordinate system and norm-triplets
that generates the shape in the original coordinate system.
It can be seen that most of hyperbolae-bound triangular shapes that correspond to norms
triplets correspond to a single triplet, since it is much more common that ๐œƒ1 ≠ ๐œƒ2 .
This method can be used to find the norms that generate a given hyperbolae-bound
1
1
triangular shape. We can fit ๐‘ƒ1,2 and ๐‘ƒ1,3 , and find โ„ฑ1,2 and โ„ฑ1,3. Then, we find โ„ฑ1,2
, โ„ฑ1,3
1
1
and โ„ฑ 1 = โ„ฑ1,2
∩ โ„ฑ1,3
. If โ„ฑ 1 = ฯ• we deduce that the shape corresponds to no norms-
triplet. Otherwise, we choose ๐‘€1 ∈ โ„ฑ 1 . We change coordinate system such that ๐‘€1 = ๐ผ
while ๐‘ฃ1∗ , ๐‘ฃ2∗ are at (0,0), (1,0), respectively. From the transformed ๐‘“1,2 and ๐‘“1,3 , we find
๐œƒ1,2 and ๐œƒ1,3 . Examining if ๐œƒ1,2 = ๐œƒ1,3 determines if the solution is unique or degenerate.
42
Appendix S7
The boundary of the 3-tasks Pareto front is composed of the three 2-tasks Pareto
fronts
In this Appendix, we will show that the boundary of the Pareto front associated with 3
tasks in a 2D morphospace is composed of the 3 Pareto fronts associated with each pair
of tasks
Consider the Pareto front defined by tasks i, j , denoted by ๐‘ƒ๐‘–,๐‘— . As seen before - this front
is a section of a hyperbola (or a line as a special case).
Denote the hyperbola branch that contains ๐‘ƒ๐‘–,๐‘— by ๐ต๐‘–,๐‘— and the entire hyperbola by ๐ป๐‘–,๐‘— (in
case ๐‘ƒ๐‘–,๐‘— is a line, ๐ต๐‘–,๐‘— = ๐ป๐‘–,๐‘— ). The hyperbola branch divides the space into 3 parts – one
side of ๐ต๐‘–,๐‘— , ๐ต๐‘–,๐‘— itself and the other side of ๐ต๐‘–,๐‘— .
Denote by ๐‘ƒ1,2,3 the Pareto front associated with all three tasks.
First note that ∀๐‘–, ๐‘—: ๐‘ƒ๐‘–,๐‘— ⊆ ๐‘ƒ1,2,3 : If ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— , then for every ๐‘ฃ ′ ≠ ๐‘ฃ, there is ๐‘˜ ∈ {๐‘–, ๐‘—}
such that ๐‘ƒ๐‘˜ (๐‘ฃ ′ ) < ๐‘ƒ๐‘˜ (๐‘ฃ), and specifically there is ๐‘˜ ∈ {1,2,3} such that ๐‘ƒ๐‘˜ (๐‘ฃ ′ ) < ๐‘ƒ๐‘˜ (๐‘ฃ)
⇒ ๐‘ฃ ∈ ๐‘ƒ1,2,3 ⇒ ๐‘ƒ๐‘–,๐‘— ⊂ ๐‘ƒ1,2,3 .
For convenience sake, and without loss of generality, we assume that the gradient ๐‘”๐‘– =
๐›ป๐‘ƒ๐‘– is normalized (๐‘”๐‘– ≠ 0 except at the archetype ๐‘ฃ๐‘–∗ , everywhere else we can redefine
๐‘”
๐‘”๐‘– โ‰” โ€–๐‘”๐‘– โ€–).
๐‘–
For now, assume that for ๐‘–, ๐‘—, ๐‘˜ ∈ {1,2,3} such that ๐‘– ≠ ๐‘— ≠ ๐‘˜, ๐ป๐‘–,๐‘— ≠ ๐ป๐‘–,๐‘˜ . The case
where there are overlapping hyperbolae will be discussed later.
We will implicitly consider the 2-dimensional morphospace V as embedded in a 3dimensional vector space in the trivial way ((๐‘ฅ, ๐‘ฆ) โ†ฆ (๐‘ฅ, ๐‘ฆ, 0)) for the use of operations
such as cross product. So expressions such as ๐‘”๐‘– × ๐‘”๐‘— should be understood as operations
in the 3-dimensional space, while operations such as ๐œ•๐‘ƒ should be understood as
operations in the original 2-dimensional space.
43
Consider the functional ๐‘“๐‘–,๐‘— = ๐œ‹๐‘ง โˆ˜ (๐‘”๐‘– × ๐‘”๐‘— ), where ๐œ‹๐‘ง is the standard projection
function ๐œ‹๐‘ง (๐‘ฅ, ๐‘ฆ, ๐‘ง) = ๐‘ง. ๐‘”๐‘– × ๐‘”๐‘— = (0,0, โ€–๐‘”๐‘– โ€–โ€–๐‘”๐‘— โ€– sin[๐œƒ๐‘–,๐‘— ]), where ๐œƒ๐‘–,๐‘— is the angle
between ๐‘”๐‘– and ๐‘”๐‘— , measured anticlockwise from ๐‘”๐‘– to ๐‘”๐‘— , so we get
๐‘“(๐‘ฃ) = โ€–๐‘”๐‘– (๐‘ฃ)โ€–โ€–๐‘”๐‘— (๐‘ฃ)โ€– sin[๐œƒ๐‘–,๐‘— ]
Lemma 1: On ๐ต๐‘–,๐‘— , ๐‘“๐‘–,๐‘— ≡ 0. On one side of ๐ต๐‘–,๐‘— , close enough to it, ๐‘“๐‘–,๐‘— > 0, and on the
other side of ๐ต๐‘–,๐‘— , close enough to it, ๐‘“๐‘–,๐‘— < 0.
Proof: First note that ๐‘“ is continuous as a projection of a cross product of 2 continuous
functions.
As we demonstrated before – on the Pareto front ๐‘ƒ๐‘–,๐‘— , the gradients point in opposite
directions, meaning sin[๐œƒ๐‘–,๐‘— ] = 0, which implies ๐‘“๐‘–,๐‘— = 0.
We’ve also seen that ๐‘ƒ๐‘–,๐‘— is part of a hyperbola (or line) ๐ป๐‘–,๐‘— . ๐ป๐‘–,๐‘— is given by a quadratic
(or linear) form ๐ป such that ๐ป๐‘–,๐‘— = {๐‘ฃ|๐ป(๐‘ฃ) = 0}
๐‘“๐‘–,๐‘— is also a quadratic (or linear) form (as a cross product of two linear forms), and
๐‘“๐‘–,๐‘— |๐‘ƒ ≡ ๐ป|๐‘ƒ๐‘–,๐‘— ≡ 0, which implies ๐‘“ = ๐›ผ๐ป for some ๐›ผ ∈ โ„.
๐‘–,๐‘—
In case ๐‘ƒ๐‘–,๐‘— is a line – ๐‘“๐‘–,๐‘— is a linear functional and it is trivial that it is negative on one
half space, and positive on the other.
In the case where ๐‘ƒ๐‘–,๐‘— is a hyperbola – by definition ๐‘“๐‘–,๐‘— = ๐›ผ๐ป = 0 on the hyperbolae
(both branches). We’ll show that it changes sign between the 3 connected components of
โ„2 โˆ– ๐ป๐‘–,๐‘— - let h be the line between the 2 foci of the hyperbola. h intersects each branch
44
of the hyperbola exactly once. Consider the function ๐‘“๐‘–,๐‘— โˆ˜ โ„Ž - it is a quadratic real
function of a single parameter (since ๐‘“๐‘–,๐‘— : โ„ → โ„2 is quadratic in 2 variables and โ„Ž: โ„ →
โ„ is linear). ๐‘“๐‘–,๐‘— โˆ˜ โ„Ž = 0 on both intersections of โ„Ž with ๐ป๐‘–,๐‘— . It means that ๐‘“|โ„Ž has to
change sign once it passes the hyperbola. So on one side of a branch of the hyperbola
๐‘“๐‘–,๐‘— |โ„Ž is positive and on the other side it is negative.
Of course ๐‘“๐‘–,๐‘— is continuous, and ๐‘“๐‘–,๐‘— ≠ 0 on โ„2 โˆ– ๐ป๐‘–,๐‘— , so its sign is constant across the
connected components. Hence, on one side of ๐ต๐‘–,๐‘— , ๐‘“๐‘–,๐‘— is positive, and on the other side
๐‘“๐‘–,๐‘— is negative (as long as the other branch of the hyperbola is not approached).
All in all, in a neighborhood of ๐ต๐‘–,๐‘— (and hence of ๐‘ƒ๐‘–,๐‘— ), ๐‘“๐‘–,๐‘— is positive on one side of ๐ต๐‘–,๐‘— ,
negative on the other side of ๐ต๐‘–,๐‘— , and 0 on ๐ต๐‘–,๐‘— . โˆŽ
๐‘“๐‘–,๐‘— > 0 implies that ๐‘”๐‘— is less than ๐œ‹ radians anticlockwise than ๐‘”๐‘– , ๐‘“๐‘–,๐‘— < 0 implies that
๐‘”๐‘— is more than ๐œ‹ radians anticlockwise than ๐‘”๐‘– . We would like to show that ๐‘ƒ๐‘–,๐‘— is at the
boundary of ๐‘ƒ1,2,3, the Pareto front associated with all 3 tasks. To do so, we need to show
that
∀ ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— ๐‘Ž๐‘›๐‘‘ ∀๐œ– > 0: ∃๐‘ฃ ′ ∈ ๐ต0 (๐‘ฃ, ๐œ–) ๐‘ . ๐‘ก. ๐‘ฃ ′ ∈ โ„2 \๐‘ƒ1,2,3
where ๐ต0 (๐‘ฃ, ๐œ–) is an open ball of radius ๐œ– around ๐‘ฃ.
Theorem 2: ∀๐‘–, ๐‘— (๐‘– ≠ ๐‘—): ๐‘ƒ๐‘–,๐‘— ⊆ ๐œ•๐‘ƒ1,2,3
Proof: Assume without loss of generality that ๐‘– = 1, ๐‘— = 2 (the proof will be identical for
any pair of ๐‘–, ๐‘— as long as ๐‘– ≠ ๐‘—). Assume by negation ๐‘ƒ1,2 is not at the boundary of ๐‘ƒ1,2,3,
it means that there exist ๐‘ฃ ∈ ๐‘ƒ1,2 and ๐œ– > 0 such that ๐ต0 (๐‘ฃ, ๐œ–) ⊂ ๐‘ƒ1,2,3.
45
Let ๐œ– > 0, and ๐‘ข ∈ ๐ต0 (๐‘ฃ, ๐œ–).
Claim 2: For ๐‘ข ∈ ๐‘ƒ1,2,3, ๐‘“๐‘–,๐‘— (๐‘ข) (๐‘— ≠ ๐‘–) and ๐‘“๐‘–,๐‘˜ (๐‘ข) (๐‘˜ ≠ ๐‘–, ๐‘—) can’t be neither both
positive nor both negative.
Proof:
Assume ๐‘“๐‘–,๐‘— > 0 ⇒ ๐‘”๐‘— is less than ๐œ‹ radians anticlockwise to ๐‘”๐‘– .
In this case ๐‘“๐‘–,๐‘˜ can’t be positive: If ๐‘“๐‘–,๐‘˜ > 0, it means that ๐‘”๐‘˜ is less than ๐œ‹ radians
anticlockwise to ๐‘”๐‘– . In this case all three gradients lie in the same half-space -choose the
gradient ๐‘”๐‘š that has the maximal angle with ๐‘”๐‘– (๐‘š = ๐‘Ž๐‘Ÿ๐‘”๐‘š๐‘Ž๐‘ฅ๐‘š {๐‘“๐‘–,๐‘š }). The angle
between ๐‘”๐‘– and ๐‘”๐‘š is smaller than ๐œ‹ (because both gradients are less than ๐œ‹ radians
anticlockwise from ๐‘”๐‘– . Choose โ„Ž๐‘ข to be the unit vector bisecting the angle between ๐‘”๐‘–
and ๐‘”๐‘š (โ„Ž๐‘ข =
๐‘”ฬ‚๐‘– +๐‘”ฬ‚๐‘š
2
๐œ‹
). The angle between โ„Ž๐‘ข and each gradient is smaller than 2 . Then,
∀๐‘›: โ„Ž๐‘ข ⋅ ๐‘”๐‘› > 0. However, we showed in Appendix S1 that ๐‘ข is Pareto optimal if and
only if there doesn’t exists a vector โ„Ž๐‘ข such that ∀๐‘›: โ„Ž๐‘ข โˆ™ g n > 0. Since ๐‘ข ∈ ๐‘ƒ1,2,3 (e.g. is
Pareto optimal), we must conclude that ๐‘“๐‘–,๐‘˜ โ‰ฏ 0.
To show that ๐‘“๐‘–,๐‘— , and ๐‘“๐‘–,๐‘˜ can’t both be negative, follow the above proof while changing
the word “anticlockwise” to “clockwise”. โˆŽ
Corollary 1: For ๐‘ข ∈ ๐‘ƒ1,2,3, such that ∀๐‘–, ๐‘—: ๐‘ข ∉ ๐ป๐‘–,๐‘— - ๐‘ ๐‘”๐‘› (๐‘“1,2 (๐‘ข)) = ๐‘ ๐‘”๐‘› (๐‘“2,3 (๐‘ข)) =
๐‘ ๐‘”๐‘› (๐‘“3,1 (๐‘ข)).
46
Proof: This results directly from the claim, the anti-symmetry of ๐‘“๐‘–,๐‘— , and the fact that
๐‘“๐‘–,๐‘— (๐‘ข) = 0 ⇔ ๐‘ข ∈ ๐ป๐‘–,๐‘—
Claim 3: If ๐‘ข ∉ ๐‘ƒ1,2,3 and ๐‘ข ∉ ๐ป๐‘–,๐‘— for any ๐‘–, ๐‘—, not all ๐‘“1,2 (๐‘ข), ๐‘“2,3 (๐‘ข), ๐‘“3,1 (๐‘ข) have the
same sign (see Figure S4)
Proof: According to Conclusion 1 from Appendix S1, since ๐‘ข is not Pareto optimal, all
three gradients ๐‘”1 , ๐‘”2 , ๐‘”3 lie in the same half-space. Choose a vector ๐‘™ on the line
separating the two half-spaces such that ๐œ‹๐‘ง (๐‘™ × ๐‘”๐‘– ) are all positive (they are either all
positive or all negative since they are all in the same half-space). If we order the gradients
according to their anticlockwise angle from ๐‘™ (they are all smaller than ๐œ‹) and name them
๐‘”๐‘– , ๐‘”๐‘— , ๐‘”๐‘˜ we get that ๐‘“๐‘–,๐‘— > 0, ๐‘“๐‘—,๐‘˜ > 0 but ๐‘“๐‘˜,๐‘– < 0, since the anticlockwise angles from ๐‘–
to ๐‘—, from ๐‘— to ๐‘˜ and from ๐‘– to ๐‘˜ are smaller than ๐œ‹ radians (they are all non-zero since ๐‘ข
is not on any hyperbola), and since ๐‘“ is anti-symmetric ๐‘“๐‘–,๐‘˜ ≥ 0 ⇒ ๐‘“๐‘˜,๐‘– ≤ 0.
This proves the claim for every assignment of ๐‘–, ๐‘—, and ๐‘˜.
Corollary 2: a point ๐‘ข ∈ ๐‘‰, which is not on any hyperbola ๐ป๐‘–,๐‘— is Pareto optimal if, and
only if, ๐‘ ๐‘”๐‘› (๐‘“1,2 (๐‘ข)) = ๐‘ ๐‘”๐‘› (๐‘“2,3 (๐‘ข)) = ๐‘ ๐‘”๐‘› (๐‘“3,1 (๐‘ข)).
Now, it is clear that if ๐‘ข ∈ ๐‘ƒ๐‘–,๐‘— , in every neighborhood of ๐‘ข there are points from both
sides of ๐‘ƒ๐‘–,๐‘— , meaning points with ๐‘“๐‘–,๐‘— > 0 and points with ๐‘“๐‘–,๐‘— < 0. This means that every
point on ๐‘ƒ๐‘–,๐‘— that is not any other hyperbola ๐ป๐‘˜,๐‘™ (i.e. not an intersection point), if it has
Pareto optimal points on one side of ๐‘ƒ๐‘–,๐‘— , on the other side there are no Pareto optimal
points (since only ๐‘“๐‘–,๐‘— changes sign on ๐ป๐‘–,๐‘— ).
If the point ๐‘ข ∈ ๐‘ƒ๐‘–,๐‘— is an intersection of two or more hyperbolae ๐ป๐‘–,๐‘— and some ๐ป๐‘˜,๐‘™ (not
necessarily on ๐‘ƒ๐‘˜,๐‘™ ), then in every neighborhood there are points on ๐‘ƒ๐‘–,๐‘— that are not
47
intersection points (the number of intersection points between 2 hyperbolae/lines is finite)
and therefore in that neighborhood there are points that are not Pareto optimal. Note that
the archetypes, for example, are at the intersection of ๐‘ƒ๐‘–,๐‘— with ๐‘ƒ๐‘–,๐‘˜ .
Thus – any point ๐‘ข ∈ ๐‘ƒ๐‘–,๐‘— for every pair of ๐‘–, ๐‘—, is not in the interior of ๐‘ƒ1,2,3, but since it is
in ๐‘ƒ1,2,3 it must be in ๐œ•๐‘ƒ1,2,3. โˆŽ
Theorem 3: For every ๐‘ข ∈ ๐‘ƒ1,2,3 , if ๐‘ข ∉ ๐‘ƒ๐‘–,๐‘— for any ๐‘–, ๐‘—, then ๐‘ข ∉ ๐œ•๐‘ƒ1,2,3
Proof 3: First, we will show that ๐‘ข is not on any hyperbola. If it were on some ๐ป๐‘–,๐‘— , then
๐‘”๐‘– (๐‘ข) × ๐‘”๐‘— (๐‘ข) = 0. But since ๐‘ข ∉ ๐‘ƒ๐‘–,๐‘— then the gradients ๐‘”๐‘– and ๐‘”๐‘— cannot point to
opposite directions, so they must point in the same direction. i.e. ๐‘”๐‘– + ๐›ผ๐‘”๐‘— = 0 but ๐›ผ <
0. Since ๐‘ข is Pareto optimal, then the third gradient ๐‘”๐‘˜ (๐‘˜ ≠ ๐‘–, ๐‘—) must point in the
opposite direction (otherwise there was โ„Ž s.t. ∀๐‘›: โ„Ž ⋅ ๐‘”๐‘› > 0, namely โ„Ž =
๐‘”๐‘– +๐‘”๐‘˜
2
). So we
get that ๐‘”๐‘– and ๐‘”๐‘˜ must point in opposite directions, which means that ๐‘ข ∈ ๐‘ƒ๐‘–,๐‘˜ , which is
in contradiction to the assumption.
So ๐‘ข is not on any hyperbola, and hence there is a neighborhood of ๐‘ข that doesn’t include
any point of ๐ป๐‘–,๐‘— for each i,j. If ๐‘ข is Pareto optimal, then the signs of ๐‘“1,2 , ๐‘“2,3 , ๐‘“3,1 are all
equal on ๐‘ข, but since all ๐‘“′๐‘  change signs only on the hyperbolae, they all have the same
sign in a neighborhood of ๐‘ข, which means that there is a neighborhood of ๐‘ข which is
Pareto optimal, i.e. – ๐‘ข is in the interior of ๐‘ƒ1,2,3.โˆŽ
Theorem 4: ๐œ•๐‘ƒ1,2,3 = ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1
We showed that ∀๐‘–, ๐‘—: ๐‘ƒ๐‘–,๐‘— ⊂ ๐œ•๐‘ƒ1,2,3, and that for every ๐‘ข ∈ ๐‘ƒ1,2,3, such that ๐‘ข ∉ ๐‘ƒ๐‘–,๐‘— for
any ๐‘–, ๐‘—, then ๐‘ข ∉ ๐œ•๐‘ƒ1,2,3. All that remains to be shown is that if ๐‘ข ∉ ๐‘ƒ1,2,3, then ๐‘ข ∉
๐œ•๐‘ƒ1,2,3. That is true because ๐‘ƒ1,2,3 is a closed set. In Appendix S1 we showed that:
48
๐‘ƒ1,2,3 = {(∑๐‘– ๐›ผ๐‘– ๐‘€๐‘– )−1 ∑๐‘–(๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗ ) |๐›ผ๐‘– ≥ 0, ∑3๐‘–=1 ๐›ผ๐‘– = 1}
So ๐‘ƒ1,2,3 is the image of the (compact) unit triangle (0 ≤ ๐›ผ1 + ๐›ผ2 ≤ 1) under the
continuous mapping
−1
๐›ผ1 , ๐›ผ2 โ†ฆ ๐›ผ1 , ๐›ผ2 , 1 − ๐›ผ1 − ๐›ผ2 โ†ฆ (∑ ๐›ผ๐‘– ๐‘€๐‘– )
๐‘–
∑(๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗ )
๐‘–
(It is continuous since ∑๐‘– ๐›ผ๐‘– ๐‘€๐‘– is a non negative combination of positive definite
matrices, and not all coefficients are zero, and hence it is positive definite, and hence
always invertible).
The image of compact sets are compact, and specifically closed, so there cannot be any
points outside ๐‘ƒ1,2,3 in ๐œ•๐‘ƒ1,2,3. โˆŽ
Lemma: For every ๐‘– ≠ ๐‘— ≠ ๐‘˜, ๐ป๐‘–,๐‘— and ๐ป๐‘˜,๐‘– can intersect only once except in the
archetype ๐ด๐‘– .
Proof: First note that ๐ป๐‘—,๐‘˜ goes through any intersection point between ๐ป๐‘–,๐‘— and ๐ป๐‘˜,๐‘– . Let ๐‘ก
be an intersection point between ๐ป๐‘–,๐‘— and ๐ป๐‘˜,๐‘– . ๐‘ก ∈ ๐ป๐‘–,๐‘— . As shown earlier this implies that
๐‘”๐‘– (๐‘ก) โˆฅ ๐‘”๐‘— (๐‘ก). ๐‘ก ∈ ๐ป๐‘˜,๐‘– ⇒ ๐‘”๐‘– (๐‘ก) โˆฅ ๐‘”๐‘˜ (๐‘ก). From the above two conclusions we can
conclude that ๐‘”๐‘— (๐‘ก) โˆฅ ๐‘”๐‘˜ (๐‘ก) ⇒ ๐‘ก is on ๐ป๐‘—,๐‘˜ . We see that except the archetypes, each
intersection point between 2 of the hyperbolae is actually an intersection point between
all 3 hyperbolae. Two different hyperbolae can intersect only 4 times. As each pair of
hyperbolae intersect at an archetypes, it means that excluding the archetypes, there are at
most 3 intersection points between the hyperbolae, and specifically a total of at most 3
intersection points between any ๐‘ƒ๐‘–,๐‘— and any ๐‘ƒ๐‘˜,๐‘— .
We showed that ๐œ•๐‘ƒ1,2,3 = ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1. We now want to show that there are points
๐‘ฃ ∈ ๐‘ƒ1,2,3 such that ๐‘ฃ ∉ ๐‘ƒ๐‘–,๐‘— for any {๐‘–, ๐‘—} ∈ {1,2,3} (i.e. – there are Pareto optimal points
beside the 3 2-tasks Pareto fronts).
49
Lemma: Let ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— such that ๐‘ฃ is not an intersection point between ๐ป๐‘–,๐‘— and any other
∗)
hyperbola ๐ป๐‘–,๐‘˜ . Then, if ๐‘ฃ = (∑๐‘š ๐›ผ๐‘š ๐‘€๐‘š )−1 ∑๐‘š(๐›ผ๐‘š ๐‘€๐‘š ๐‘ฃ๐‘š
for ๐›ผ๐‘š ≥ 0, ∑3๐‘–=1 ๐›ผ๐‘– = 1,
then ๐›ผ๐‘˜ = 0, for ๐‘˜ ∈ {1,2,3}, ๐‘˜ ≠ ๐‘–, ๐‘—.
∗)
∗
Proof: ๐‘ฃ = (∑๐‘š ๐›ผ๐‘š ๐‘€๐‘š )−1 ∑๐‘š(๐›ผ๐‘š ๐‘€๐‘š ๐‘ฃ๐‘š
⇒ ∑๐‘š ๐›ผ๐‘š ๐‘€๐‘š (๐‘ฃ − ๐‘ฃ๐‘š
) = 0 . However, we
showed that we can
∗ )๐‘‡
∗
๐‘ƒ๐‘š (๐‘ฃ) = −(๐‘ฃ − ๐‘ฃ๐‘š
๐‘€๐‘š (๐‘ฃ − ๐‘ฃ๐‘š
). Hence,
assume that
∑๐‘š ๐›ผ๐‘š ๐‘”๐‘š (๐‘ฃ) = 0. ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— , so ๐‘”๐‘– ||๐‘”๐‘— ⇒ ๐›ผ๐‘– ๐‘”๐‘– + ๐›ผ๐‘— ๐‘”๐‘— = ๐‘ ๐‘”๐‘– ⇒ ๐‘ ๐‘”๐‘– + ๐›ผ๐‘˜ ๐‘”๐‘˜ = 0 (๐‘˜ ∈
{1,2,3}, ๐‘˜ ≠ ๐‘–, ๐‘—). Since ๐‘ฃ is not an intersection point between ๐ป๐‘–,๐‘— and any other
hyperbola, ๐‘ฃ is not an archetype, so ๐‘”๐‘– (๐‘ฃ) ≠ 0. Also, it means that ๐‘”๐‘– (๐‘ฃ) โˆฆ ๐‘”๐‘˜ (๐‘ฃ), as
otherwise ๐‘ฃ would be on ๐ป1,3. It means that ๐‘ ๐‘”๐‘– + ๐›ผ๐‘˜ ๐‘”๐‘˜ = 0 ⇔ ๐‘ = 0 ๐‘Ž๐‘›๐‘‘ ๐›ผ๐‘˜ = 0. โˆŽ
Theorem 5: Let ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— for any ๐‘– ≠ ๐‘—. Let ๐‘ˆ be an open environment of ๐‘ฃ. Then, there is
๐‘ฃ ′ ∈ ๐‘ˆ such that ๐‘ฃ ′ ∈ ๐‘ƒ1,2,3 โˆ• ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 .
Proof:
As mentioned earlier, ๐‘ƒ1,2,3 is the image of the triangle ๐‘‡ = {0 ≤ ๐›ผ1 + ๐›ผ2 ≤ 1} under the
map:
−1
๐‘“: ๐›ผ1 , ๐›ผ2 โ†ฆ (∑ ๐›ผ๐‘– ๐‘€๐‘– )
๐‘–
∑(๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗ )
๐‘–
As shown earlier, ๐‘“ is continuous. ๐‘“({๐›ผ1 , ๐›ผ2 }) = ๐‘ฃ ⇒ ๐‘ฃ = (∑๐‘– ๐›ผ๐‘– ๐‘€๐‘– )−1 ∑๐‘–(๐›ผ๐‘– ๐‘€๐‘– ๐‘ฃ๐‘–∗ )
where ๐›ผ3 = 1 − ๐›ผ1 − ๐›ผ2
Let ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— . Either ๐‘ฃ is an intersection point between ๐ป๐‘–,๐‘— and another hyperbola ๐ป๐‘˜,๐‘– , or
it is not.
First assume that ๐‘ฃ is not an intersection point between ๐ป๐‘–,๐‘— and any other hyperbola ๐ป๐‘˜,๐‘– .
Let ๐‘ˆ be an open environment of ๐‘ฃ that doesn’t intersect any hyperbola besides ๐‘ƒ1,2 . The
above lemma implies that for every ๐‘ฃ ′ such that ๐‘ฃ ′ ∈ ๐‘ƒ๐‘–,๐‘— ∩ ๐‘ˆ , ๐‘“ −1 (๐‘ฃ ′ ) ⊆ ๐‘‡ can’t
50
contain points for which ๐›ผ๐‘– + ๐›ผ๐‘— ≠ 1. Let ๐›ผฬƒ๐‘– , ๐›ผฬƒ๐‘— be such that ๐›ผฬƒ๐‘– ๐‘”๐‘– (๐‘ฃ) + ๐›ผฬƒ๐‘— ๐‘”๐‘— (๐‘ฃ) = 0.
Such ๐›ผฬƒ๐‘– , ๐›ผฬƒ๐‘— exist since ๐‘ฃ ∈ ๐‘ƒ๐‘–,๐‘— .
Since ๐‘“ is continuous, the origin of ๐‘ˆ is open in ๐‘‡. {๐›ผฬƒ๐‘– , ๐›ผฬƒ๐‘— } ∈ ๐‘“ −1 (๐‘ˆ) ⇒ ๐‘“ −1 (๐‘ˆ) is a
non-empty open environment of {๐›ผฬƒ๐‘– , ๐›ผฬƒ๐‘— } and hence must contain points from the interior
of the triangle ๐‘‡, i.e there are {๐›ผ๐‘–′ , ๐›ผ๐‘—′ } ∈ ๐‘“ −1 (๐‘ˆ), ๐›ผ๐‘–′ + ๐›ผ๐‘—′ < 1 such that ๐‘“({๐›ผ๐‘–′ , ๐›ผ๐‘—′ }) ∈ ๐‘ˆ.
๐‘“({๐›ผ๐‘–′ , ๐›ผ๐‘—′ }) can’t be on ๐‘ƒ๐‘–,๐‘— . Otherwise, as ๐‘ˆ doesn’t intersect any other hyperbola, we
would have gotten by the lemma that ๐›ผ๐‘˜′ = 0 , or ๐›ผ๐‘–′ + ๐›ผ๐‘—′ = 1, in contradiction to the
assumption. So, ๐‘“({๐›ผ1′ , ๐›ผ2′ }) is not on ๐‘ƒ๐‘–,๐‘— but also not on any other hyperbola, as by
assumption ๐‘ˆ does not intersect them. However, ๐‘“({๐›ผ1′ , ๐›ผ2′ }) ∈ ๐‘ƒ1,2,3 . So, we get that
๐‘“({๐›ผ1′ , ๐›ผ2′ }) ∈ ๐‘ƒ1,2,3 \๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1
If ๐‘ฃ is an intersection point between ๐ป๐‘–,๐‘— and any other hyperbola ๐ป๐‘˜,๐‘– , then since the
number of intersection points between the different hyperbolae is finite, there exists ๐‘ข ∈
๐‘ˆ ∩ ๐‘ƒ๐‘–,๐‘— that is not an intersection point between ๐‘ƒ๐‘–,๐‘— and any other hyperbola. Let ๐‘‰ ⊆ ๐‘ˆ
be an open environment of ๐‘ข, then by the above proof there exists ๐‘ฃ ′ ∈ ๐‘‰ ⊆ ๐‘ˆ such that
๐‘ฃ ′ ∈ ๐‘ƒ1,2,3 โˆ• (๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 ). This is the required ๐‘ฃ′โˆŽ
So we showed that the boundary of the 3-tasks Pareto front is exactly the 3 2-tasks Pareto
fronts, but they are not identical (i.e. it is not ‘empty’). Now we would like to better
characterize the 3-tasks Pareto front.
Examine ℜ2 ⁄โ‹ƒ๐‘–,๐‘— ๐ป๐‘–,๐‘— . It is divided into connected components, ๐‘‹๐‘˜ .
Theorem 6: Each component ๐‘‹๐‘˜ is either entirely Pareto optimal or not Pareto optimal at
all.
Proof: Let ๐‘ฅ๐‘˜ ∈ ๐‘‹๐‘˜ . By corollary 2, ๐‘ฅ๐‘˜ is Pareto optimal ⇔ ๐‘“1,2 (๐‘ฅ๐‘˜ ) = ๐‘“2,3 (๐‘ฅ๐‘˜ ) =
๐‘“3,1 (๐‘ฅ๐‘˜ ). By lemma 1, ๐‘“๐‘–,๐‘— changes signs only on ๐ป๐‘–,๐‘— , which means that the sign of each
๐‘“๐‘–,๐‘— is constant across each ๐‘‹๐‘˜ . Hence, if ๐‘ฅ๐‘˜ is Pareto optimal, ๐‘‹๐‘˜ is entirely Pareto
optimal, and if ๐‘ฅ๐‘˜ is not Pareto optimal, ๐‘‹๐‘˜ is entirely not Pareto optimal.
51
Theorem 7: If ๐‘‹๐‘˜ is unbounded it is not Pareto optimal
Proof: The Pareto front is compact as a continuous image of a compact set (the triangle
๐‘‡), and hence it is bounded.
Theorem 8: 2 adjacent connected components, ๐‘‹๐‘˜ and ๐‘‹๐‘š , cannot be both Pareto optimal.
Proof: Assume ๐‘‹๐‘˜ is Pareto optimal. It means that in this area, ๐‘“1,2 = ๐‘“2,3 = ๐‘“3,1.
Assume that without loss of generality that ๐‘‹๐‘˜ and ๐‘‹๐‘š are separated by ๐ป1,2. It means
that ๐‘“1,2 (and only ๐‘“1,2 ) has a different sign on ๐‘‹๐‘˜ and on ๐‘‹๐‘š . Hence, ๐‘“1,2 โ‰ข ๐‘“2,3 = ๐‘“3,1 on
๐‘‹๐‘š , so ๐‘‹๐‘š is not Pareto optimal.
Theorem 9: Let ๐‘ข ∈ ๐‘ƒ๐‘–,๐‘— . Then ๐‘ข is at the boundary of a Pareto optimal region.
Proof: Each ๐‘ข is at the boundary of 2 connected components ๐‘‹๐‘˜ . Theorem 5 shows that in
every open environment ๐‘ˆ of ๐‘ข there are Pareto optimal points that are not on ๐‘ƒ๐‘–,๐‘— . This
implies that one of the connected components is entirely Pareto optimal. According to
theorem 8, the second one is necessarily not Pareto optimal. This implies that ๐‘ข is at the
boundary of a Pareto optimal region.
Theorem 10: Let ๐‘ข ∈ ๐ป๐‘–,๐‘— โˆ• ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 . Then the connected components on which
boundary ๐‘ข is, are not Pareto optimal.
Proof: ๐‘ข cannot be Pareto optimal. Since ๐‘ข ∈ ๐ป๐‘–,๐‘— ⁄๐‘ƒ๐‘–,๐‘— , ๐‘”๐‘– and ๐‘”๐‘— point to the same
direction. ๐‘”๐‘˜ (๐‘˜ ≠ ๐‘– ≠ ๐‘—) must also point to the same direction (otherwise ๐‘”๐‘˜ would point
to the opposite direction than ๐‘”๐‘– and ๐‘”๐‘— , implying that ๐‘ข ∈ ๐‘ƒ๐‘–,๐‘˜ , ๐‘ƒ๐‘—,๐‘˜ in contradiction). So,
all three gradients are in the same half plane. As shown in Appendix S1, this means ๐‘ข is
not Pareto optimal. Since the Pareto front is compact, it is also closed, and has an open
compliment. Hence, there is an environment U of ๐‘ข which is not Pareto optimal. If ๐‘ข is
on the boundary of a connected component ๐‘‹๐‘˜ , then ๐‘ˆ ∩ ๐‘‹๐‘˜ ≠ Φ, meaning that ๐‘‹๐‘˜
contains points that are not Pareto optimal, and hence ๐‘‹๐‘˜ is not Pareto optimal.
52
We showed that ๐œ•๐‘ƒ1,2,3 = ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1, if for every ๐‘– ≠ ๐‘— ≠ ๐‘˜ ∈ {1,2,3}, ๐ป๐‘–,๐‘— ≠ ๐ป๐‘–,๐‘˜ ,
yet this statement is true also if there are such ๐‘–, ๐‘—, ๐‘˜.
Assume that ๐ป๐‘–,๐‘— = ๐ป๐‘–,๐‘˜ . As shown, ๐ป๐‘–,๐‘— = {๐‘ข|๐‘“๐‘–,๐‘— (๐‘ข) = 0 ⇔ ๐‘”๐‘– (๐‘ข) × ๐‘”๐‘— (๐‘ข) = 0}, and
๐ป๐‘–,๐‘˜ = {๐‘ข|๐‘”๐‘– (๐‘ข) × ๐‘”๐‘˜ (๐‘ข) = 0}. It means that on ๐ป๐‘–,๐‘— , ๐‘”๐‘– is parallel to ๐‘”๐‘— , and ๐‘”๐‘– is
parallel to ๐‘”๐‘˜ ⇒ ๐‘”๐‘— is parallel to ๐‘”๐‘˜ ⇒ ๐‘”๐‘— × ๐‘”๐‘˜ = 0 on ๐ป๐‘–,๐‘— ⇒ ๐‘“๐‘—,๐‘˜ = 0 on ๐ป๐‘–,๐‘— . As
explained earlier, this implies that ๐ป๐‘—,๐‘˜ = ๐ป๐‘–,๐‘— . ๐ป๐‘–,๐‘— divides the space into 2 (if it’s a line)
or 3 (if it’s a hyperbola) parts. Each of ๐‘“๐‘–,๐‘— , ๐‘“๐‘–,๐‘˜ , ๐‘“๐‘—,๐‘˜ changes sign when passing any branch
of ๐ป๐‘–,๐‘— . So, either they always have the same sign (expect on ๐ป๐‘–,๐‘— itself), or they never
have the same sign. As seen before - a point outside of ๐ป๐‘–,๐‘— is Pareto optimal if and only
if ๐‘“๐‘–,๐‘— , ๐‘“๐‘–,๐‘˜ and ๐‘“๐‘—,๐‘˜ all have the same sign. So either the entire space (maybe except parts
of ๐ป๐‘–,๐‘— ) is Pareto optimal, or none of it is. However, we showed that the Pareto front is
compact, and hence bounded, and hence it cannot be the entire space. So, the Pareto front
is placed on ๐ป๐‘–,๐‘— (which has no interior), i.e. - ๐‘ƒ1,2,3 = ๐œ•๐‘ƒ1,2,3.
To show that ๐œ•๐‘ƒ1,2,3 = ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 we need to show that ๐‘ƒ1,2,3 = ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 .
On one hand, ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 ⊆ ๐‘ƒ1,2,3. On the other hand, ๐‘ƒ1,2,3 ⊆ ๐ป๐‘–,๐‘— , so if ๐‘ข ∈ ๐‘ƒ1,2,3,
๐‘ข ∈ ๐ป๐‘–,๐‘— and then ๐‘”1 (๐‘ข), ๐‘”2 (๐‘ข), ๐‘”3 (๐‘ข) are either all aligned, or one is pointing away from
the other two. The first case happens if and only if the point is not Pareto optimal, so for
๐‘ข ∈ ๐‘ƒ1,2,3 the latter must happen. However, in that case ๐‘ข ∈ ๐‘ƒ๐‘š,๐‘› for ๐‘š, ๐‘› ∈ {1,2,3} ⇒
๐‘ข ∈ ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 ⇒ ๐‘ƒ1,2,3 ⊆ ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 ⇒ ๐‘ƒ1,2,3 = ๐‘ƒ1,2 ∪ ๐‘ƒ2,3 ∪ ๐‘ƒ3,1 .
53
Figure S4: For 3 tasks in 2D, the Pareto front is the locus of all points such that all ๐’‡๐’Š,๐’‹ have
the same sign. Each of the 2 tasks Pareto fronts, ๐‘ท๐’Š,๐’‹ (in blue), lies on a hyperbola ๐‘ฏ๐’Š,๐’‹ . ๐’‡๐’Š,๐’‹
switches sign when passing ๐‘ฏ๐’Š,๐’‹ . ๐‘ฏ๐Ÿ,๐Ÿ , ๐‘ฏ๐Ÿ,๐Ÿ‘ and ๐‘ฏ๐Ÿ‘,๐Ÿ are plotted in green, gray and pink,
respectively. The Pareto front associated with the three tasks, ๐‘ท๐Ÿ,๐Ÿ,๐Ÿ‘ is plotted in light blue.
54
Appendix S8
The resulting Pareto front when one of the performance function is maximized in a
region
Consider again the case of 3 tasks in ๐‘› dimensions, each with a point archetype ๐‘Ž๐‘– , and a
performance function ๐‘ƒฬƒ๐‘– that decays with an inner-product norm from the archetype.
Denote by ๐‘‡ the resulting Pareto front. Now consider the case where ๐‘ƒฬƒ1 is truncated –
instead of a point archetype there is a region, ๐ด1 , that maximizes performance. Denote
the truncated performance function by ๐‘ƒ1 . Since ๐‘ƒ1 is truncated, it means that outside the
archetypal region, its contours are identical to those of ๐‘ƒฬƒ1. For convenience - denote
๐‘ƒฬƒ2 , ๐‘ƒฬƒ3 by ๐‘ƒ2 , ๐‘ƒ3 .
We would now like to calculate the Pareto front relative to ๐‘ƒ1 , ๐‘ƒ2 , ๐‘ƒ3 .
Theorem: The Pareto front relative to ๐‘ƒ1 , ๐‘ƒ2 , ๐‘ƒ3 is composed of ๐‘ƒ2,3 ∪ (๐‘‡⁄๐ด1° ), where ๐‘ƒ2,3
is the Pareto front related to ๐‘ƒ2 and ๐‘ƒ3 .
Proof:
Lemma: If ๐‘ฅ ∉ ๐‘‡, it is not Pareto optimal.
Proof: ๐‘ฅ ∉ ๐‘‡. In Appendix S1 we saw that this means that there is ๐‘ฆ in the vicinity
of ๐‘ฅ such that ∀๐‘–: ๐‘ƒฬƒ๐‘– (๐‘ฅ) < ๐‘ƒฬƒ๐‘– (๐‘ฆ). This implies that ๐‘ƒ2 (๐‘ฅ) < ๐‘ƒ2 (๐‘ฆ), ๐‘ƒ3 (๐‘ฅ) <
๐‘ƒ3 (๐‘ฆ). When instead of considering ๐‘ƒฬƒ1 we consider ๐‘ƒ1 , it is possible that instead
of performing task 1 better than ๐‘ฅ, ๐‘ฆ performs task 1 the same as ๐‘ฅ (This happens
if both ๐‘ฅ and ๐‘ฆ are on ๐ด1 ). Hence, ๐‘ฅ is not Pareto optimal also relative to ๐‘ƒ1 , ๐‘ƒ2
and ๐‘ƒ3 .
Lemma: If ๐‘ฅ ∈ ๐‘‡ and ๐‘ฅ ∉ ๐ด1° , then ๐‘ฅ is Pareto optimal.
Proof: Let ๐‘ฅ ∈ ๐‘‡, ๐‘ฅ ∉ ๐ด1° . If there was a point ๐‘ฆ that dominated ๐‘ฅ, it would mean
that ๐‘ƒ๐‘– (๐‘ฅ) ≤ ๐‘ƒ๐‘– (๐‘ฆ), with at least one proper inequality. ๐‘ƒ2 (๐‘ฅ) ≤ ๐‘ƒ2 (๐‘ฆ) or ๐‘ƒ3 (๐‘ฅ) ≤
55
๐‘ƒ3 (๐‘ฆ) ⇒ ๐‘ƒฬƒ2 (๐‘ฅ) ≤ ๐‘ƒฬƒ2 (๐‘ฆ) or ๐‘ƒฬƒ3 (๐‘ฅ) ≤ ๐‘ƒฬƒ3 (๐‘ฆ). If ๐‘ƒ1 (๐‘ฅ) < ๐‘ƒ1 (๐‘ฆ), it means that ๐‘ฅ ∉
๐ด1 and certainly ๐‘ƒฬƒ1 (๐‘ฅ) < ๐‘ƒฬƒ1 (๐‘ฆ). For ∈ ๐œ•๐ด1 ๐‘ƒ1 (๐‘ฅ) = ๐‘ƒ1 (๐‘ฆ) ⇒ ๐‘ƒฬƒ1 (๐‘ฅ) ≤ ๐‘ƒฬƒ1 (๐‘ฆ),
and for ๐‘ฅ ∈ ๐ด1 ๐‘ƒ1 (๐‘ฅ) = ๐‘ƒ1 (๐‘ฆ) ⇒ ๐‘ƒฬƒ1 (๐‘ฅ) = ๐‘ƒฬƒ1 (๐‘ฆ). So, for ๐‘ฅ ∉ ๐ด1° , ๐‘ƒ1 (๐‘ฅ) =
๐‘ƒ1 (๐‘ฆ) ⇒ ๐‘ƒฬƒ1 (๐‘ฅ) ≤ ๐‘ƒฬƒ1 (๐‘ฆ). Thus, ๐‘ƒ๐‘– (๐‘ฅ) ≤ ๐‘ƒ๐‘– (๐‘ฆ) ⇒ ๐‘ƒฬƒ๐‘– (๐‘ฅ) ≤ ๐‘ƒฬƒ๐‘– (๐‘ฆ), and ๐‘ƒ๐‘– (๐‘ฅ) <
(๐‘ฆ) ⇒ ๐‘ƒฬƒ๐‘– (๐‘ฅ) < (๐‘ฆ), so if ๐‘ฅ is not Pareto optimal relative to {๐‘ƒ๐‘– } it is not Pareto
optimal relative to {๐‘ƒฬƒ๐‘– }. Since ๐‘ฅ ∈ ๐‘‡, it is Pareto optimal relative to {๐‘ƒฬƒ๐‘– }, and
hence it is Pareto optimal relative to {๐‘ƒ๐‘– }.
Lemma: If ๐‘ฅ ∈ ๐‘‡ ∩ ๐ด1° , then it is either on ๐‘ƒ2,3 or it is not Pareto optimal.
Proof: Let ๐‘ฅ ∈ ๐‘‡ ∩ ๐ด10 . Assume ๐‘ฅ is not on ๐‘ƒ2,3 . We would like to show that in
this case, ๐‘ฅ is not Pareto optimal. There is ๐œ–1 > 0 such that ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…
๐ต0 (๐‘ฅ, ๐œ–1 ) ⊆ ๐ด1 , since
๐‘ฅ ∈ ๐ด1° . There is ๐œ–2 > 0 such that ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…
๐ต0 (๐‘ฅ, ๐œ–2 ) does not intersect ๐‘ƒ2,3 since ๐‘ฅ is not
on that front (It was shown in Appendix S7 that the Pareto front is closed. The
argument brought there applies to any number of dimensions). Let ๐œ– =
min(๐œ–1 , ๐œ–2 ). ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…
๐ต0 (๐‘ฅ, ๐œ–) ⊆ ๐ด1 and does not intersect ๐‘ƒ2,3 . ๐‘ฅ is not Pareto optimal
relative to ๐‘ƒ2 and ๐‘ƒ3 since it is not on ๐‘ƒ2,3 . We saw in Appendix S1 that if x is not
Pareto optimal (when the performance functions decay we a norm from a point
archetype), there is a direction such that points in this direction close enough to
the point dominate it. This means that there is ๐‘ฆ ∈ ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…
๐ต0 (๐‘ฅ, ๐œ–) that performs tasks 2
ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…ฬ…
and 3 better than ๐‘ฅ does. ๐‘ฆ ∈ ๐ต
0 (๐‘ฅ, ๐œ–) ⊆ ๐ด1 ⇒ ๐‘ฆ ∈ ๐ด1 ⇒ ๐‘ฆ performs task 1 the
same as ๐‘ฅ does since they are both on the archetype of task 1 ⇒ ๐‘ฅ is dominated by
๐‘ฆ ⇒ ๐‘ฅ is not Pareto optimal.
Lemma: If ๐‘ฅ ∈ ๐‘ƒ2,3 , then ๐‘ฅ is pareto optimal (even if it is on ๐ด1 )
Proof: If ๐‘ฅ ∈ ๐‘ƒ2,3 it means that no other point performs both task 2 and task 3
better than it, which in turn means that no other point performs all 3 tasks better
than it, without regard to performance 1. Hence, ๐‘ฅ is Pareto optimal.
56
When considering the case of performance functions that decay with Euclidean norm, we
get a circular shaped archetypal region, and a resulting Pareto front as depicted in Fig. 12
in the main text.
For the special case where ๐‘Ž2 = ๐‘Ž3 , ๐‘ƒ2 = ๐‘ƒ3 , we effectively get the case of 2 tasks, one
maximized in a region and one at a single point. Such a case, where ๐‘ƒ1 and ๐‘ƒ2 /๐‘ƒ3 decay
with different inner product norms, is depicted in Fig. 11 in the main text.
57
Appendix S9
Bounds on the Pareto front for general performance functions show that normally
it is located in a region close to the archetype
We would like to find conditions as general as possible under which the Pareto front will
be constrained to an area close to the line between the archetypes. We consider the case
of more general performance functions, and of archetypes that are regions instead of
points.
We start with the case of 2 point-like archetypes: ๐‘Ž1 and ๐‘Ž2 . Denote by ๐ถ21 the contour of
the first performance function, ๐‘ƒ1 , on which lies ๐‘Ž2 . Define ๐ต21 as the set of points in
which performance 1 is greater or equal to its value on the contour (i.e. ๐ต21 = {๐‘ฅ|๐‘ƒ1 (๐‘ฅ) ≥
๐‘ƒ1 (๐‘Ž2 )}). In the same manner, denote by ๐ถ12 the contour of the second performance
function, ๐‘ƒ2 , on which lies ๐‘Ž1 , and ๐ต12 = {๐‘ฅ|๐‘ƒ2 (๐‘ฅ) ≥ ๐‘ƒ2 (๐‘Ž1 )}.
Claim: Under these conditions, the Pareto front is bounded in ๐ต21 ∩ ๐ต12.
Proof: Let ๐‘ง ∉ ๐ต21 ∩ ๐ต12. There are 3 options: ๐‘ง ∈ ๐ต21⁄๐ต12 , ๐‘ง ∈ ๐ต12⁄๐ต21 , ๐‘ง ∉ ๐ต12 ∪ ๐ต21 .
If ๐‘ง ∈ ๐ต21⁄๐ต12 , it is dominated by ๐‘Ž1 as ๐‘ƒ2 (๐‘ง) < ๐‘ƒ2 (๐‘Ž1 ) since ๐‘Ž2 ∈ ๐ต12 and ๐‘ƒ1 (๐‘ง) <
๐‘ƒ1 (๐‘Ž1 ) since ๐‘Ž1 is the archetype of task 1 so it maximizes the performance of that task.
The same argument shows that if ๐‘ง ∈ ๐ต12⁄๐ต21 , then ๐‘ง is dominated by ๐‘Ž2 . If ๐‘ง ∉ ๐ต12 ∪ ๐ต21 ,
it is dominated by any point in ๐ต12 ∪ ๐ต21. Hence, the Pareto front is restricted to the area
๐ต12 ∩ ๐ต21.
In this area the Pareto front does not have to be connected. Such scenario can arise when
the performance functions are not monotonic, or do not depend monotonically on strictly
concave functions (See Fig S5).
58
Consider now the case of ๐‘˜ point-like archetypes. Denote by ๐ถ๐‘—๐‘– the contour of the ๐‘– ๐‘กโ„Ž
performance function, ๐‘ƒ๐‘– , on which ๐‘Ž๐‘— lies. Denote by ๐ต๐‘—๐‘– the set of points in which
performance ๐‘– is greater or equal to its value on the contour (๐ต๐‘—๐‘– = {๐‘ฅ|๐‘ƒ๐‘– (๐‘ฅ) ≥ ๐‘ƒ๐‘– (๐‘Ž๐‘— )}).
For each ๐‘—, the Pareto front must lie in โ‹ƒ๐‘˜๐‘–=1, ๐ต๐‘—๐‘– , since each point outside this set is
๐‘–≠๐‘—
dominated by ๐‘Ž๐‘— . If ๐‘ฅ ∉ โ‹ƒ๐‘˜๐‘–=1, ๐ต๐‘—๐‘– ⇒ ∀๐‘– ≠ ๐‘—: ๐‘ฅ ∉ ๐ต๐‘—๐‘– ⇒ ๐‘ฅ performs task ๐‘– for ๐‘– ≠ ๐‘— worse
๐‘–≠๐‘—
than any phenotype in ๐ต๐‘—๐‘– . As ∀๐‘–: ๐‘Ž๐‘— ∈ ๐ต๐‘—๐‘– , ๐‘ฅ performs task ๐‘– worse than ๐‘Ž๐‘— . ๐‘ฅ performs
task ๐‘— worse than ๐‘Ž๐‘— . Since ๐‘Ž๐‘— is the archetype of task ๐‘—, ๐‘ฅ performs task ๐‘— worse than ๐‘Ž๐‘—
⇒ ๐‘ฅ performs all tasks worse than ๐‘Ž๐‘— ⇒ ๐‘ฅ is not Pareto optimal.
Conclusion: ๐‘ฅ ∈ Pareto front ⇒ ๐‘ฅ ∈ โ‹ƒ๐‘˜๐‘–=1, ๐ต๐‘—๐‘– for every ๐‘—.
๐‘–≠๐‘—
⇒Pareto front ⊆ โ‹‚๐‘˜๐‘—=1 โ‹ƒ๐‘˜๐‘–=1, ๐ต๐‘—๐‘–
๐‘–≠๐‘—
Consider again the case of 2 archetypes. Assume now that instead of a point, one of the
archetypes is a region. We would like to find a bound to the Pareto front in this scenario.
We assume the first archetype ๐ด1 is a closed bounded region. Let ๐‘ฅ1,2 be the point on ๐ด1
with the best performance of task 2. Since ๐ด1 is compact and ๐‘ƒ2 is continuous, such points
exist. As before, let ๐ต21 be the set of points outperforming ๐‘Ž2 in task 1, ๐ต21 = {๐‘ฅ|๐‘ƒ1 (๐‘ฅ) ≥
๐‘ƒ1 (๐‘Ž2 )}. Let ๐ต12 be the set of points outperforming ๐‘ฅ1,2 in task 2, ๐ต12 = {๐‘ฅ|๐‘ƒ2 (๐‘ฅ) ≥
๐‘ƒ2 (๐‘ฅ1,2 )}. Take a point ๐‘ง outside of ๐ต21 ∩ ๐ต12. Again there are 3 options: ๐‘ง ∈ ๐ต12⁄๐ต21 , ๐‘ง ∈
๐ต21⁄๐ต12 , ๐‘ง ∉ ๐ต21 ∪ ๐ต12. If ๐‘ง ∈ ๐ต12⁄๐ต21 , then ๐‘ƒ1 (๐‘ง) < ๐‘ƒ1 (๐‘Ž2 ) since ๐‘Ž2 ∈ ๐ถ21 and ๐‘ง ∉ ๐ถ21 , and
๐‘ƒ2 (๐‘ง) < ๐‘ƒ2 (๐‘Ž2 ) since ๐‘Ž2 is the archetype of task 2 so it maximizes the performance of
that task. It means that ๐‘ง is dominated by ๐‘Ž2 , and hence it is not Pareto optimal. If ๐‘ง ∈
๐ต21⁄๐ต12 , then ๐‘ƒ2 (๐‘ง) < ๐‘ƒ2 (๐‘ฅ1,2 ) since ๐‘ฅ1,2 ∈ ๐ต12 and ๐‘ง ∉ ๐ต12 , and ๐‘ƒ1 (๐‘ง) ≤ ๐‘ƒ1 (๐‘ฅ1,2 ) since
๐‘ฅ1,2 is on ๐ด1 , the archetype of task 1 so it maximizes the performance of that task.
Therefore, ๐‘ง is dominated by ๐‘ฅ12 and is not Pareto optimal. If ๐‘ง ∉ ๐ต21 ∪ ๐ต12, it is
59
dominated by any point in ๐ต21 ∪ ๐ต12. Hence, the Pareto front is restricted to the area ๐ต21 ∩
๐ต12 in this case as well.
We further ask what happens in the case where there are 3 archetypes – the first
archetype, ๐ด1 is a region, and the 2 other are points - ๐‘Ž2 , ๐‘Ž3 . Let ๐‘ฅ1,2 and ๐‘ฅ1,3 be the point
3
2
on ๐ด1 with the best performance of task 2 and 3, respectively. Let ๐ต1,2
, ๐ต1,2
be the set of
3
2
points that outperform ๐‘ฅ1,2 in task 2 and 3, respectively. Let ๐ต1,3
, ๐ต1,3
be the set of points
๐‘—
that outperform ๐‘ฅ1,3 in task 2 and 3, respectively. By definition - ∀๐‘ฅ ∈ ๐ต1,๐‘– , ๐‘ง ∉
๐‘—
๐ต1,๐‘– : ๐‘ƒ๐‘— (๐‘ง) < ๐‘ƒ๐‘— (๐‘ฅ).
3
2
In that case, the Pareto front must be contained in ๐ต1,2
∪ ๐ต1,2
: Any point outside of this
set is dominated by ๐‘ฅ1,2 , since ๐‘ฅ1,2 is contained in both contours, it performs tasks 2 and 3
better than any point outside of those contours, and task 1 better than any other point
3
2
since it is on ๐ด1 . The same argument applies to ๐ต1,3
∪ ๐ต1,3
. From the same arguments
brought earlier, the front is contained in ๐ต21 ∪ ๐ต23 and in ๐ต31 ∪ ๐ต32. Hence, the front is
3
3
2
2
bounded in (๐ต1,2
∪ ๐ต1,2
) ∩ (๐ต1,3
∪ ๐ต1,3
) ∩ (๐ต21 ∪ ๐ต23 ) ∩ (๐ต31 ∪ ๐ต32 )
๐‘–
๐‘˜
If all 3 archetypes are regions, each term can be replaced by (๐ต๐‘—๐‘– ∪ ๐ต๐‘—๐‘˜ ) → (๐ต๐‘—,๐‘–
∪ ๐ต๐‘—,๐‘–
)∩
๐‘–
๐‘˜
๐‘–
๐‘˜
(๐ต๐‘—,๐‘˜
∪ ๐ต๐‘—,๐‘˜
), resulting in the Pareto front bounded by โ‹‚3๐‘—=1 โ‹ƒ3๐‘–≠๐‘˜≠๐‘—,๐‘–,๐‘˜=1(๐ต๐‘—,๐‘–
∪ ๐ต๐‘—,๐‘–
)∩
๐‘–
๐‘˜
(๐ต๐‘—,๐‘˜
∪ ๐ต๐‘—,๐‘˜
)
๐‘–
๐‘˜
Conclusion: The Pareto Front ⊆ โ‹‚3๐‘—=1 (๐ต๐‘—,๐‘–
∪ ๐ต๐‘—,๐‘–
)
๐‘–≠๐‘—≠๐‘˜
60
Figure S5: The Pareto front does not have to be connected for non monotonic
performance functions. A and B: A plot of 2 chosen non-monotonic performance
functions,
5.2๐‘’
−(๐‘ฅ2 +๐‘ฆ2 )
0.1
+ 4.9๐‘’
−((๐‘ฅ−1)2 +๐‘ฆ2 )2
.5
+ 5๐‘’
−((๐‘ฅ)2 +(๐‘ฆ−1)2 )2
.5
−((๐‘ฅ)2 +(๐‘ฆ+1)2 )2
.5
+
−((๐‘ฅ−1)2 +(๐‘ฆ−1−1)2 )2
.5
+
+ 5.1๐‘’
−((๐‘ฅ+1)2 +๐‘ฆ2 )2
.5
and
2๐‘’
๐‘ƒ1 = 4.1๐‘’
๐‘ƒ2 = 4.1๐‘’
−((๐‘ฅ+1−1)2 +(๐‘ฆ−1)2 )2
.5
+ 5.1๐‘’
−((๐‘ฅ−1)2 +(๐‘ฆ−1)2 )
0.1
+ 4.9๐‘’
−((๐‘ฅ−1)2 +(๐‘ฆ+1−1)2 )2
.5
−((๐‘ฅ−1−1)2 +(๐‘ฆ−1)2 )2
.5
+ 3๐‘’
−((๐‘ฅ−1.9)2 +(๐‘ฆ+1.3)2 )2
.2
+ 5๐‘’
.
C: The Pareto front related to those performance functions is not connected. ๐ถ21 - the
contour of performance function 1 going through ๐‘Ž2 , the archetype of task 2, is in thick
61
purple. ๐ถ12 - the contour of performance function 2 going through ๐‘Ž1 , the archetype of
task 1, is in thin blue. ๐‘Ž1 and ๐‘Ž2 are red dots. The Pareto front is plotted in red. It can be
seen that it is not connected.
62
Appendix S10
The Pareto front of r strongly concave performance functions is a connected set of
Hausdorff dimension of at most r-1.
We would like to calculate the Pareto front of a system that needs to perform r tasks in an
n-dimensional trait space V = โ„› ๐‘› . Each performance function ๐‘ƒ๐‘– (๐‘ฃ) has a single
maximum - the archetype ๐‘ฃ๐‘–∗ . ๐‘ƒ๐‘– is assumed to be smooth and strongly concave.
In Appendix S1 we’ve shown that for such a system, for a Pareto optimal point v there
๐‘Ÿ
โƒ‘ ๐‘ƒ๐‘– (๐‘ฃ)} that equals zero.
exists a convex linear combination of {∇
๐‘–=1
For performance functions ๐‘ƒ๐‘– (๐‘ฃ) that decay with inner product norms, we’ve shown that
๐‘Ÿ
โƒ‘ ๐‘ƒ๐‘– (๐‘ฃ)}
if there exists a convex linear combination of {∇
that equals zero, v is Pareto
๐‘–=1
optimal.
This remains true for strongly concave performance functions:
Lemma 1: For strongly concave {๐‘ƒ๐‘– }, if there exists a convex linear combination of
๐‘Ÿ
โƒ‘ ๐‘ƒ๐‘– (๐‘ฃ)} that equals zero, then v is Pareto optimal
{∇
๐‘–=1
Proof: By assumption, there exist {๐›ผ๐‘– ≥ 0}๐‘Ÿ๐‘–=1 , ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– = 1 such that ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– โƒ‘∇ ๐‘ƒ๐‘– (๐‘ฃ) =
0.
63
Consider the function ๐‘“ = ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐‘ƒ๐‘– . Since all ๐‘ƒ๐‘– ’s are strongly concave, all ๐›ผ๐‘– ’s are
positive and are not all zero, f is strictly concave. By our assumption โƒ‘∇๐‘“(๐‘ฃ) =
∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– โƒ‘∇ ๐‘ƒ๐‘– = 0, and together those imply that v maximizes f. Since f is monotonic with
all ๐‘ƒ๐‘– ’s, this implies v is Pareto optimal.
Hence, v is Pareto optimal iff there is a convex combination of โƒ‘∇ ๐‘ƒ๐‘– (๐‘ฃ) that equals zero.
๐‘Ÿ
In addition, for each convex set {๐›ผ๐‘– }๐‘–=1 , there exists a v such that ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– โƒ‘∇ ๐‘ƒ๐‘– (๐‘ฃ) = 0.
This is because each of the ๐‘ƒ๐‘– is strongly concave on โ„› ๐‘› , and hence decays to negative
infinity. This implies that ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐‘ƒ๐‘– decays to negative infinity, so it must attain a
maximum. This maximum point v will satisfy ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– โƒ‘∇ ๐‘ƒ๐‘– (๐‘ฃ) = 0. Also, from strict
concavity, there will be only a single point v that satisfies this relation.
๐‘Ÿ
To conclude, we found that for every convex set {๐›ผ๐‘– }๐‘–=1 there is a single point v such that
∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– โƒ‘∇ ๐‘ƒ๐‘– (๐‘ฃ) = 0, v is Pareto optimal, and those are all Pareto optimal points.
Hence, the set {๐›ผ1 , … ๐›ผ๐‘Ÿ |๐›ผ๐‘– ≥ 0, ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– = 1} fully determines the Pareto front.
Denote by T := {๐›ผ1 , … ๐›ผ๐‘Ÿ |๐›ผ๐‘– ≥ 0, ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– = 1} the unit (r-1)-simplex. ๐‘‡ ⊂ โ„› ๐‘Ÿ−1. By the
above considerations, we can define a function h:T→ โ„› ๐‘› by assigning each {๐›ผ1 , … ๐›ผ๐‘Ÿ }
with the maximal point of ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ๐‘ƒ๐‘– . By the above results, we know that Im(T) equals
the Pareto front.
Lemma 2: h is continuously differentiable
64
Proof:
Take ๐›ผ โ‰” {๐›ผ1 , … , ๐›ผ๐‘Ÿ } ∈ ๐‘‡ ⊂ โ„› ๐‘Ÿ−1 . Take ๐‘ฃ = โ„Ž(๐›ผ) ∈ โ„› ๐‘› .
โƒ‘ ๐‘ƒ๐‘– (๐‘ฆ). k is continuously differentiable.
๐‘˜(๐›ผ, ๐‘ฃ) = 0 , where ๐‘˜(๐‘ฅ, ๐‘ฆ) โ‰” ∑๐‘Ÿ๐‘–=1 ๐‘ฅ๐‘– ∇
Denote ๐‘˜ = {๐‘˜1 , … , ๐‘˜๐‘› }
๐œ•๐‘˜1
๐œ•๐‘ฆ1
Then
โ‹ฎ
๐œ•๐‘˜1
(๐œ•๐‘ฆ๐‘›
โ‹ฏ
โ‹ฑ
โ‹ฏ
๐œ•๐‘˜1
๐œ•๐‘ฆ๐‘›
โ‹ฎ
๐œ•๐‘˜๐‘›
= ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ∇2 ๐‘ƒ๐‘– .
๐œ•๐‘ฆ๐‘› )
Since ๐‘ƒ๐‘– is strongly convex, ∇2 ๐‘ƒ๐‘– is positive definite for every i, and hence ∑๐‘Ÿ๐‘–=1 ๐›ผ๐‘– ∇2 ๐‘ƒ๐‘–
is positive definite, and hence not singular.
According to the implicit function theorem, this implies that there exists an open
environment U of ๐›ผ (U itself is not necessarily contained in T) , an open environment V
of v, and a unique continuously differentiable function ๐‘”: ๐‘ˆ → ๐‘‰ such that ∀๐‘ฅ ∈ ๐‘ˆ:
๐‘˜(๐‘ฅ, ๐‘”(๐‘ฅ)) = 0.
However, we know that ∀๐›ผ ′ ∈ ๐‘‡: ∃ a single v’ such that ๐‘˜(๐›ผ ′ , ๐‘ฃ) = 0. This implies that
โ„Ž|๐‘ˆ∩๐‘‡ = ๐‘”| ๐‘‡ ⇒ โ„Ž is continuously differentiable at ๐›ผ.
This is true for every ๐›ผ ∈ ๐‘‡, and hence h is continuously differentiable on T.
Corollary: For strongly concave and smooth performance functions, the Pareto front is
connected.
65
Proof: We saw that the Pareto front is the image of T under h. T is connected (as the unit
(r-1)-simplex in โ„› ๐‘Ÿ ) and h is continuous, and hence the Pareto front is connected.
Corollary: The Pareto front has Hausdorff dimension of at most r-1
Proof: This follows as T has Hausdorff dimension of r-1 (as the unit (r-1)-simplex) and h
is ๐ถ 1 .
โˆŽ
Note that v is Pareto optimal relative to ๐‘ƒ1 , … , ๐‘ƒ๐‘Ÿ ⇔ โˆ„๐‘ฃ ′ ≠ ๐‘ฃ ๐‘ . ๐‘ก ∀๐‘–: ๐‘ƒ๐‘– (๐‘ฃ) < ๐‘ƒ๐‘– (๐‘ฃ ′ ) ⇔
โˆ„๐‘ฃ ′ ≠ ๐‘ฃ ๐‘ . ๐‘ก ∀๐‘–: ๐‘“(๐‘ƒ๐‘– (๐‘ฃ)) < ๐‘“(๐‘ƒ๐‘– (๐‘ฃ′)) where ๐‘“: โ„› → โ„› is a monotonic increasing
function ⇔ v is Pareto optimal relative to ๐‘“ โˆ˜ ๐‘ƒ1 , … , ๐‘“ โˆ˜ ๐‘ƒ๐‘Ÿ
Corollary: The Pareto front relative to ๐‘ƒ1 , … , ๐‘ƒ๐‘Ÿ , where each ๐‘ƒ๐‘– is a smooth monotonically
increasing function of a strongly concave function is continuous and has a maximal
Hausdorff dimension of r-1.
66
References
1. Grant PR, Abbott I, Schluter D, Curry RL, Abbott LK (1985) Variation in the size
and shape of Darwin’s finches. Biological Journal of the Linnean Society 25:1-39.
2. Barber CB, Dobkin DP, Huhdanpaa H (1996) The quickhull algorithm for convex
hulls. ACM Trans Math Softw 22:469–483.
3. Klee V, Laskowski MC (1985) Finding the smallest triangles containing a given
convex polygon. Journal of Algorithms 6:359-375.
4. M. Gerstenhaber (1951) Theory of convex polyhedral cones, Chap. XVIII of Cowles
Commission Monograph No. 13, Activity analysis of production and allocation, ed. T. C.
Koopmans, Wiley, New York,.
67
Download