The Load Balancing Algorithm for the Star

advertisement
The Load Balancing Algorithm for the Star
Interconnection Network
Ahmad M. Awwad, Jehad Al-Sadi

Abstract—The star network is one of the promising
interconnection networks for future high speed parallel computers, it
is expected to be one of the future-generation networks. The star
network is both edge and vertex symmetry, it was shown to have
many gorgeous topological proprieties also it is owns hierarchical
structure framework. Although much of the research work has been
done on this promising network in literature, it still suffers from
having enough algorithms for load balancing problem. In this paper
we try to work on this issue by investigating and proposing an
efficient algorithm for load balancing problem for the star network.
The proposed algorithm is called Star Clustered Dimension Exchange
Method SCDEM to be implemented on the star network. The
proposed algorithm is based on the Clustered Dimension Exchange
Method (CDEM). The SCDEM algorithm is shown to be efficient in
redistributing the load balancing as evenly as possible among all
nodes of different factor networks
Keywords— Interconnection networks, Load balancing, Star
network.
I. INTRODUCTION
T
HE star graph was proposed by Akers and et al as one
of the attractive topologies, it was proposed as an
alternative to the cube network [1]. The star graph shown to
have excellent topological properties for comparable network
sizes of the cube network [2]. The star graph showed to have
many good properties over many networks including the wellknown cube network including: smaller diameter, smaller
degree, and smaller average diameter. The star graph proved
to have a hierarchical structure which will enable it building
large network size of smaller ones, the star graph is both edge
and vertex symmetry [1].
Although some algorithms proposed for the star graph such
as distributed fault-tolerant routing algorithm [3]. The
proposed algorithm adapts the routing decisions in response to
node failures, also the star network is shown to have fault
tolerance properties [3]. Anyway one of the main problems
that the star network still suffers from is having enough
algorithms for load balancing problem.
To our knowledge there is no enough results proposed in
literature about implementing and proposing efficient
algorithms for load balancing on star network. In this paper we
try to fill this gap by proposing and embedding the SCDEM
algorithm on the star graph which is based on the CDEM
Dr. Awwad works at University of Petra (awwad@uop.edu.jo),
Sadi works at Arab Open University (j_alsadi@aou.edu.jo)
Dr. Al-
algorithm which was shown to be attractive on OTISHypercube network by redistributing the load as evenly
among different processors [4]. Efficient implementation of
the SCDEM algorithm on the star network will make the star
network more acceptable network for real life application in
connection to load balancing problem. The rest of the paper is
organized as follows: In section 2 we present the necessary
basic notations and definitions, in section 3 we introduce some
of the related work on load balancing, in section 4 we present
and discuss the implementation of the SCDEM algorithm on
the star graph, also we present an example of SCDEM on S4
star network, finally section 5 concludes this paper.
II. DEFINITIONS AND BASIC TOPOLOGICAL PROPERTIES
During the last decade a huge number of interconnection
networks for High Speed Parallel Computers (HSPC) have
been investigated and proposed in literature [5] - [7]. As an
example one of these networks was the hypercube
interconnection network, also this network is known as the
binary n-cube. The star graph [1] is another example, which
has been proposed as an attractive alternative to the hypercube
network. Since its appearance the star network has attracted a
lot of research efforts. Some properties of this network have
been studied in the literature including its basic topological
properties [1], parallel path classification [4], node
connectivity [2] and embedding [8]. The authors Akers and
Krishnamurthy [3] have proved that the star graph has several
advantages over the hypercube network including a lower
degree for a fixed network size of the comparable network
sizes, a smaller diameter, and smaller average diameter.
Furthermore they showed that the star graph is maximally
fault tolerant edge, and vertex symmetric [3].
The structure of the star network can play an effective step
for proposing any algorithms on it. The authors in [21] Menn
and Somani have shown that the star graph may be seen as n 
(n-1)!, where the rows and the columns in this framework are
(n-1)-star and an n-linear array correspondingly. Also, Ferreria
and Berthome [9] claimed that the star graph may be seen also
as a rectangular framework RC (Rows by Columns) where
the rows are substar-Sn-2 and the columns are n (n-1) nodes on
each of its column.
However, there has been relatively a limited research efforts
have been dedicated to design efficient algorithms for the star
graph including computing fast Fourier transforms [10],
broadcasting [11], selection and sorting [12], [21], and Matrix
Multiplications [13] and load balancing [14], [15]. In an
attempt to overcome this problem we present an efficient
algorithm for load balancing problem on star graph to
redistribute the load balancing among all processors of the
network as evenly as possible.
Definition 1: The n-star graph, which is denoted by Sn, has
n nodes each labelled with a sole permutation n = {1,…,n}.
Any two nodes of Sn are connected if, and only if, their
corresponding permutations differ exactly in the first position
and any other position.
Fig. 1 shows the 4-star graph with 4 groups each containing
6 vertices (i.e. four copies of 3-star graphs). The degree, ,
and the diameter, , of the star graph are as follows [1], [16]:
, of the n-star graph = n-1, where n1.
the proposed algorithms on load balancing has shown a
considerably major advancement in enhancement of efficiency
and stability [18], [19]. In another research done by Zaho and
Xiao they have presented different DED-X schemes for load
balancing on homogeneous OTIS networks and they proposed
new algorithm structure called Generalized DiffusionExchange- Diffusion Method, the proposed scheme enabled
load balancing on Heterogeneous OTIS networks [20].
4231
1234
b
a
2134
3241
2431
3214
3421
3124
2314
c
, of n-star graph =  32 (n-1).
2341
d
1324
4321
3412
2413
III. BACKGROUND AND RELATED WORK
The attractive results shown and proved by researchers in
literature of the star graph make it a one of the strongest
competitor’s topology for High Speed Parallel Computers
(HSPC) and a strong candidate network for real life
applications. This fact has motivated us to investigate the load
balancing problem on the star network since the star graph
suffers from limited number of efficient algorithms proposed
for it in general and load balancing problem as specific case.
The load balancing problem has been investigated on various
types of infrastructure ranging from electronic networks [15]
and OTIS networks [4].
Load balancing problem is one of the well-known and
important types of problems which was studied from different
point views and different approaches. This problem was
studied and investigated by Ranka, Won, and Sahni [17], they
proposed and introduced the Dimension Exchange Method
(DEM) on the hypercube topology. The DEM algorithm was
based on the idea of finding the average load of neighbours’
nodes, such that the dimension of the network is n, the
neighbours which are connected on the nth dimension they will
exchange their loads to redistribute the load and achieve
evenly load balancing as possible, the processor which have
more load will broadcast the extra amount of the load to its
direct neighbour node. The main advantage of the Dimension
Exchange Method is that every node will be able to
redistribute tasks to its direct neighbours to reach even load
balancing among all nodes. Ranka and et al have achieved that
in the worst case in the DEM method to redistribute load
balancing was log2n on the cube network [17].
The researchers Zaho, Xiao, and Qin have presented hybrid
scheme of diffusion and dimension exchange called DED-X
for load balancing on Optical Transpose Interconnection
System (OTIS) [18] , [19]. The proposed algorithm works by
dividing the load balancing task to three different phases. The
results achieved on OTIS networks showed that the load
balance efficiently redistributed almost evenly. On the other
hand the achieved results of the simulation from Zaho et al of
b
a
4213
4312
1423
1432
1342
4123
4132
1243
c
d
2143
3142
Fig. 1: The 4-Star Graph
Furthermore Zaho, Xiao, and Qin have shown that the
usability of the new proposed load balancing methods to be
better than the X traditional load balancing algorithm [20].
The main objective of this paper is to propose and present a
new load balancing algorithm for the star networks named Star
Clustered Dimension Exchange Method (SCDEM) based on
the algorithm [4].
The Star Clustered Dimension Exchange Method for Load
Balancing on the Star Network
The algorithm we present in this paper SCDEM is based on
the Clustered Dimension Exchange Method CDEM for load
balancing for Optical Transpose Interconnection system on
Hypercube factor network [4]. The worst time case complexity
of CDEM for load balancing on OTIS-Hypercube was
O(Sqrt(p)*M log2 p). Also the number of communication
steps which is required by CDEM proved to be 3log2 p [4].
The main achievement of the new presented SCDEM is to
obtain even load balancing for the Sn network by
redistributing number of tasks between different nodes on
different groups. The number cooperating moves needed
between different nodes in the SDEM is 2n-1, where n is the
degree of Sn. Fig. 2 presents the SCDEM algorithm for load
balancing problem on n! Processors of Sn.
same process will be repeated continually until it reach the
The SCDEM load balancing algorithm is based on the
following 3 phases:
٠Phase 1: The load balancing of neighbour nodes of the 1st
stage is achieved by redistributing the load balancing of all
direct neighbour nodes, and only if, their corresponding
permutations differ exactly in the 1st and 2nd position.
Then redistributing the load balancing of any two neighbour
nodes that are connected if, and only if, their corresponding
permutations differ exactly in the first and 3rd position.
Keep redistributing the load balancing of any two neighbour
nodes “as above” that are connected if, and only if, their
corresponding permutations differ exactly in the first and nth
position.
٠Phase 2: Repeat phase one more time.
٠Phase 3: For each direct neighbour’s processors pi and pj
find the maximum difference weight of the directed weights
and redistribute the weight among the nodes of highest
difference following the steps 3-12.
Note that n-1 is the number of neighbours of any
processor in Sn:
neighbours pj that is n positions far away from pi.
٠Phase 2: To enhance the load balancing efficiency between
different processors of n factor networks, the algorithm
suggests repeating the steps 2 to 12 as mentioned above.
٠Phase 3: As a final phase all adjacent processors which they
differ in first position and any other position i.e. pi and pj. The
algorithms will find the maximum difference among all the
weights of these neighbors and redistribute the weight
between pi and pj with max difference, │pi - pj│ following the
steps 2-12 of the SCDEM algorithm.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
for n = 1; n ≤ n-1; n++
for all neighbour nodes pi and pj which they differ in
1st and n+1 position of Sn do in parallel
Give-and-take pi and pj total load sizes of the two
nodes
TheAverageLoad pi,j = Floor (Load pi + Load pi)/2
if ( Totalload pi >= excess AverageLoad pi,j )
Send excess load pi to the neighbour node pi
Load pi = Load pi – extra load
Load pj = Load pj + extra load
else
Receive extra load from neighbour pj
Load pi = Load pi + extra load
Load pj = Load pj – extra load
Repeat steps 3 to 12 one more time
for all adjacent processors of pi and find the max │pi
– pj,│ such that pj is the set of all neighbours of pi
where 1≤ j ≤ n-1.
redistribute the weight as in steps as in steps 3 to12
between pi and the node with the max │pi – pj│.
Fig. 2: The SCDEM load balancing Algorithm
b
3
15
a
20
40
30
10
2134
3241
7
10
2431
3214
2314
1.
2.
4231
1234
5
c
2
12
2341
d
4321
3412
2413
b
a
20
4213
2
8
2
1432
17
2
1342
14
d
13
20
4132
3421
8
1324
4
4312
3124
1243
12
3142
7
1423
4123
c
2143
Fig. 3: 4-star network – load balancing - initial state
Example: - To explain the SCDEM algorithm presented in
Fig. 2, the following example implements the load balancing
algorithm on the different factor networks of S4.
SCDEM algorithm works on redistributing load balancing
Fig. 3 shows the four factor networks 4-S3 of the Network
among all processors of the network, phases: one, two and
S4, each factor network has 6 processors with a specific load
three are done in parallel.
assigned to it. Since the degree of Sn is n-1 it follows that each
٠Phase 1: The load balancing between the processors of Sn
node connected to three other direct nodes, two of the inner
based on SCDEM algorithm is exchanged as in steps 2 to 12
in parallel, in first step the load exchange will be between all
the processors in which they differ in 1 st position and 2nd
position for all the factor networks of Sn i.e. Sn -1. Then the
group and one in outer group. The number which was assigned
next to the processor presents the starting load.
25
b
5
27
a
of this phase Fig. 6 shows the new load balancing distribution
4231
1234
28
2134
3241
5
11
of the phase 1 of the algorithm.
4231
1234
2431
3214
2314
25
8
7
3124
c
5
11
2341
d
3421
15
12
14
2134
4321
a
b
3
14
11
1432
15
12
3124
c
9
8
2341
d
15
14
4132
1243
5
13
1423
4123
4321
3412
2413
4312
a
b
11
4213
12
7
16
1432
9
8
1342
8
1243
11
12
d
Fig. 4: 4-Star network – load balancing phase 1-A
14
16
4132
17
2134
3241
5
8
Fig. 6: 4-Star network – load balancing phase 1-C
b
8
26
4231
1234
2431
3214
13
17
18
3124
c
2341
d
13
2314
b
a
9
12
12
13
7
1432
13
c
10
14
4132
3421
12
4321
3412
2413
b
1243
10
a
13
11
12
11
2143
following the steps 2-12. Fig.s 4, 5 and 6 reflects phase1: A, B
and C of the SCDEM algorithm, each pair of nodes which
they differ in 1st position, 2nd, 3rd and 4th position. At the end
10
12
13
4132
d
First we start by implanting phase 1 of the algorithm by
13
1432
c
Fig 5: 4-Star network – processors load balancing phase 1-B
12
4213
4123
1342
3142
12
9
2341
d
1423
4312
8
d
3124
1324
12
13
2431
13
4213
8
10
2413
8
1342
2134
12
3214
4321
3412
8
3421
3241
8
1324
4312
5
8
b
12
13
a
16
c
4231
1234
2314
4123
2143
3142
26
1423
c
2143
3142
a
3421
13
1324
10
4
d
2431
11
4213
13
8
2413
3412
1342
7
3214
2314
3
17
3241
5
1324
4312
b
17
17
a
11
3142
1243
10
1423
4123
c
2143
Fig.7: 4-Star network –load balancing – phase 2
In phase 2 we repeat the same steps one more time to
redistribute the load balancing among the neighbours’ nodes
as suggested in SCDEM algorithm presented in Fig. 2 by
following the steps 2- 12. Fig. 7 shows the load balancing of
As future extension of this research work we will do some
all processors at the end of phase 2.
analytical estimation including: execution time, load balancing
13
b
11
13
a
accuracy, communication steps and speed to prove the
4231
1234
13
3241
2134
12
10
SCDEM efficiency mathematically.
2431
REFERENCES
3214
[1]
2314
12
12
3124
13
c
12
10
2341
d
3421
[2]
11
4321
1324
[3]
[4]
2413
3412
b
12
4312
a
4213
12
11
12
1432
12
11
1342
11
d
12
13
4132
11
[5]
12
1243
11
1423
[6]
4123
[7]
c
2143
3142
[8]
Fig. 8: 4-Star network – load balancing – phase 3
Finally in phase 3 all adjacent nodes which differ in first
position and any other position i.e. pi and pj will redistribute
[9]
[10]
their load balancing by first finding the maximum difference
between their weights then by implementing the steps of the
[11]
algorithm between the nodes with maximum difference (Fig.
8).
[12]
The new achieved distribution of the load balance shown to
[13]
be efficient and optimal. The final distribution is achieved in
2n-1 communication steps where n is the degree of the star
[14]
network.
[15]
IV.
CONCLUSION
This paper presented an efficient algorithm for distributing
[16]
the load balancing of nodes in star network. The proposed
algorithm is called Star Clustered Dimension Exchange
[17]
Method (SCDEM) is based on the well-known algorithm
[18]
which was proposed by Mahafza et al (CDEM). The proposed
[19]
algorithm SCDEM resulted in almost even distributions load
balancing among the all nodes of star network. The algorithm
is able to redistribute load balancing among all nodes in 2n-1
communication steps which is considered efficient.
[20]
[21]
S. B. Akers, D. Harel and B. Krishnamurthy, “The Star Graph: An
Attractive Alternative to the n-Cube” Proc. Intl. Conf. Parallel
Processing, 1987, pp. 393-400.
K. Day and A. Tripathi, “A Comparative Study of Topological
Properties of Hypercubes and Star Graphs”, IEEE Trans. Parallel &
Distributed Systems, vol. 5.
Kaled Day and Abdel-Elah Al-Ayyoub, “Node-ranking schemes for the
star networks”, Journal of parallel and Distributed Computing, Vol. 63
issue 3, March 2003, pp 239-250.
B.A. Mahafzah and B.A. Jaradat, “The Load Balancing problem in
OTIS-Hypercube Interconnection Network”, J. of Supercomputing
(2008) 46, 276-297.
S. B. Akers, and B. Krishnamurthy, “A Group Theoretic Model for
Symmetric Interconnection Networks,” Proc. Intl. Conf. Parallel Proc.,
1986, pp. 216-223.
K. Day and A. Al-Ayyoub, “The Cross Product of Interconnection
Networks”, IEEE Trans. Parallel and Distributed Systems, vol. 8, no. 2,
Feb. 1997, pp. 109-118.
A. Al-Ayyoub and K. Day, “A Comparative Study of Cartesian Product
Networks”, Proc. of the Intl. Conf. on Parallel and Distributed
Processing: Techniques and Applications, vol. I, August 9-11, 1996,
Sunnyvale, CA, USA, pp. 387-390.
I. Jung and J. Chang, “Embedding Complete Binary Trees in Star
Graphs,” Journal of the Korea Information Science Society, vol. 21, no.
2, 1994, pp. 407-415.
Berthome, P., A. Ferreira, and S. Perennes, “Optimal Information
Dissemination in Star and Panckae Networks,” IEEE Trans. Parallel
and Distributed Systems, vol. 7, no. 12, Aug. 1996, pp. 1292-1300.
P. Fragopoulou and S. Akl, “A Parallel Algorithm for Computing
Fourier Transforms on the Star Graph,” IEEE Trans. Parallel &
Distributed Systems, vol. 5, no. 5, 1994, pp. 525-31.
Mendia V. and D. Sarkar, “Optimal Broadcasting on the Star Graph,”
IEEE Trans. Parallel and Distributed Systems, Vo;. 3, No. 4, 1992, pp.
389-396.
S. Rajasekaran and D. Wei, “Selection, Routing, and Sorting on the Star
Graph,” J. Parallel & Distributed Computing, vol. 41, 1997, pp. 225-33.
S. Lakshmivarahan, and S.K. Dhall, “Analysis and Design of Parallel
Algorithms Arithmetic and Matrix Problems,” McGraw-Hill Publishing
Company, 1990.
N. Imani et al, “Perfect load balancing on star interconnection network”,
J. of supercomputers, Volume 41 Issue 3, September 2007. pp. 269 –
286.
Jehad Al-Sadi, “Implementing FEFOM Load Balancing Algorithm on
the Enhanced OTIS-n-Cube Topology”, Proc. of the Second Intl. Conf.
on Advances in Electronic Devices and Circuits - EDC 2013, 47-5.
K. Day and A. Al-Ayyoub, “The Cross Product of Interconnection
Networks”, IEEE Trans. Parallel and Distributed Systems, vol. 8, no. 2,
Feb. 1997, pp. 109-118.
Ranka, Y. Won, S. Sahni, “Programming a Hypercube Multicomputer”,
IEEE Software, 5 (5): 69 – 77, 1998.
Zhao C, Xiao W, Qin Y (2007), “Hybrid diffusion schemes for load
balancing on OTIS networks”, In: ICA3PP, pp 421–432
G. Marsden, P. Marchand, P. Harvey, and S. Esener, “Optical Transpose
Interconnection System Architecture,” Optics Letters, 18(13), 1993, pp.
1083-1085.
Qin Y, Xiao W, Zhao C (2007), “GDED-X schemes for load balancing
on heterogeneous OTIS networks”, In: ICA3PP, pp 482–492.
A. Menn and A.K. Somani, “An Efficient Sorting Algorithm for the Star
Graph Interconnection Network,” Proc. Intl. Conf. on Parallel
Processing, 1990, pp.1-8.
Download