Paper : Models of Parallel Computation: A survey and Synthesis

advertisement
A critique on the research paper:
LogP: Towards a Realistic Model of Parallel Computation
[Authors : David Culler, Richard Karp y , David Patterson, Abhijit Sahay, Klaus Erik
Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken]
by
Rajvikram Singh
1
Paper : LogP: Towards a Realistic Model of Parallel Computation
[Authors : David Culler, Richard Karp y , David Patterson, Abhijit Sahay, Klaus Erik
Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken]
Main Contribution(s):
The authors of the paper have stressed more than once that the present popular
models used for depicting parallel architectures do not accurately describe the machines.
The present models, which are derived from the ideal, PRAM models are described by
the authors as too “simple” One of the main drives for presenting a new model was given
as the trend that the computing industry has been taking since the past few years.
Statistical and empirical data has been given in the paper to show that the latest trend has
been towards more “multi computer” type of machines. These are generally massively
parallel machines constituted of many nodes, each containing a powerful processor and
substantial memory and cache, so as to be regarded as independent computer in itself.
In such ‘distributed architecture’ machines, the following features can be
considered as the salient ones:  Performance of the interconnect network – The bandwidth, latency, jitter,
congestion aspects of the interconnect networks typically become important in
deciding the overall performance of the machine.
 Unsynchronized nodes – Since each of the nodes is more or less an independent
computer in itself, the various nodes in a MPP machine can no longer considered
to be synchronous.
 Shared memory absent – Since each node now carries its own memory modules,
the architecture is more suited for a message-passing system than a shared
memory one.
All the three points mentioned above are starkly opposite to the features of the
popular PRAM model, which describes a synchronized, unbounded shared memory,
MIMD model. Thus acknowledging the latest trend in the HPC industry, the authors have
suggested the LogP model, which takes into consideration, the parameters, namely:
latency, communication overheads, communication bandwidth and the number of
processors to describe a more accurate model. The model also takes care of some other
important aspects such as affect of load on network performance and affect of routing for
describing a more pragmatic model.
Evaluation of the contributions to advancing science and the state-of-the-art of the topic:
The LogP model suggested by the authors does seem to alleviate quite a few of
loopholes of the overly simplified PRAM model and in the present scenario where indeed
focus on MPP machines has shifted towards a more distributed architecture, the LogP
model definitely provides a much more accurate depiction of practical MPP machines.
Thus though the model was suggested in’93, it can still be considered as state-of-the-art
2
and of relevance even now and with HPC systems like clusters becoming popular day by
day, the model might well be of use even in the future.
Deficiencies:
Though the authors have presented a model, which they feel is accurate for
depicting present day MPP architectures, they have used parameters, which concentrate
primarily on the performance of the interconnection network. They seem to neglect the
other factors such as network topology and message-passing v/s shared memory
topology, which also affect performance.
The paper also seems to sort of re-instate the obvious concerns for designing a
parallel computing machine. And though the authors have published results of their
experiments showing the predicted performance and the actual performance for some
algorithms but they have not compared the results to those predicted by the existing
models.
Directions for future improvements:
The model seems to give too much importance to the underlying interconnect
network. Maybe some of the parameters in the model should be dropped and replaced
with others pertaining to the nodes. Especially with the recent development of
inexpensive and high speed optical interconnects, which provide excellent latency, near
zero contention environments (WDM) and bandwidths higher than the processors, it
might be a good idea to drop some of the parameters like L and o or g in lieu of future
trends. Also the authors agree that in most of the cases all the parameters are not required.
Questions –Regarding the paper:
 Can one of the parameters out of ‘o’ or ‘g’ be dropped? Or can one be
expressed as a function of the other? They seem to depict the same thing.
 The authors claim that the model takes care of network saturation due to
loading of the network. How do L, o , g or P reflect the loading of the network,
when there is no parameter to define the network topology ?
 Why haven’t the results of performance predicted by other known models
compared to the ones predicted by the LogP model? (eg. For the FFT
algorithm)
3
SUPPLEMENTARY PAPERS
Paper : Algorithms in CGM, BSP and BSP Model – A Survey
[ Author: Silvia Gotz]
Summary:
The paper has been written with the intention of evaluating the pluses and
minuses of some of the most popular parallel models that have received considerable
attention in the past few years of its publication. The paper talks about the following
models : The BSP (Bulk synchronous Parallel Model)
 The BSP* - Which is an extension of the basic BSP model but takes into account
an additional parameter B (minimum size of transmitted message that fully
utilizes n/w bandwidth) - which defines the interconnect more accurately and
leads to better bandwidth utilization.
 The CGM (Coarse Grained Multicomputer ) model
After surveying these popular models and evaluating them for different
algorithms, the author concludes that the BSP model is the one that comes close to being
the one model that can be standardized. But at the same time there exist whole classes of
problems which can not use the BSP model.
Note : Since the paper was just a survey of existing models, it by itself does not contribute
anything, nor does it contain any deficiencies which can be attributed to the survey.
4
Paper: BSP - The Bulk Synchronous Parallel model
[University of Oxford Parallel Applications Center]
Main contribution(s):
This paper is not a research paper and provides a brief but insightful view into the
BSP model developed primarily at the University of Oxford. The paper defines the
characteristics of a BSP computer, the parameters that define a BSP model and the
computation and programming model for a typical BSP computer.
5
Paper : Models of Parallel Computation: A survey and Synthesis
[ Authors: B.M Maggs, L.R Matheson and R.E Tarjan ]
Main Contribution(s):
The paper tries to form present the past trends in the field of parallel computing
and tries to explain why there has been no standard model which can define all the
existing and possible future parallel machines.
The paper evaluates the research undertaken by the computer scientists in coming
up with a unified model and the levels at which models have been introduced (eg.
Memory, network etc)
The authors stress on the point that the models should not only be ‘descriptive’ i.e they should not only reflect the existing parallel architectures, but they should also be
prescriptive – i.e they should be able to provide guidelines for the machines of the future.
And while the authors agree that the models should not contain a lot of parameters, they
have described a few parameters, which they feel symbolize a parallel architecture. They
recommend that a subset of these parameters be used for modeling.
6
Paper : Models for Parallel Computation
[ Author: Susanne E. Hambrusch]
Summary:
The paper gives a brief discussion of the most popular models, which have
received the most attention over the past years and describes the major advantages /
disadvantages of one over the other. The paper acknowledges that in order for a
successful standard model to evolve, the model should:  Accurately reflect the constraints of the existing classes of machines
 Should also broadly apply to existing as well as future machines
 Allow accurate measure and prediction of performance.
The paper was not intended as a research paper and thus does not make any major
contributions or has any deficiencies.
7
Download