Guidelines

advertisement
Guidelines for Reporting Meta-Heuristics in LOG 741
In LOG 741 there are three assignments. These assignments follow the same
framework. You have been given a black box, and should answer some simple
questions about it, based on running some tests.
About Testing
Computational testing can be a difficult and time-consuming task. There are many
different dimensions to test, and usually the combined number of possibilities is
too high for complete testing. Usually there are a number of test instances, having
similar and/or different characteristics.
The basic question one wants answered is usually:
 Which method is best?
 Which parameter values are the best for my optimization method?
 What is the performance profile over time for my method?
o Is it worth the effort to search for a longer time?
Test instances. In our tests we use randomly generated instances of 0/1 MCKP
(Multi-Constrained Knapsack Problems).They can be divided into 27 distinct
groups, with 10 instances in each group. The distinguishing instance dimensions
are:
Number of Variables (100, 250, 500)
Number of Constraints (5, 10, 30)
Knapsack Density (25%, 50%, 75%) – this is the expected number of items
needed to fill the knapsack as a percentage of variables.
Testing. Some times only 1 test instance is available. At other times millions are
available. When testing, often the basic underlying question is: Is my method better
than the competition? It is important to define what is meant by better. Some
typical examples:
 Better in one test on a specific instance
 Better in 30 tests (with different starting points) on a specific instance.
 Better in 30 tests (with different starting points) on a class of instances
 Better in 30 tests (with different starting points) on all instances
Either compare values from the same instance, or use the average of a statistically
significant number of similar instances.
Some final points




Formulate some hypothesis to prove (or disprove)
Be systematic
Make fair comparisons
Remember to test also for long times, 100.000.000 iterations? 1 hour?
Answer the questions. This is often forgotten.
Justify the answers. This means that you should explain why you reach a specific
conclusion.
Statistical significance. Local Search Heuristic choices are very often based on
some generated random number. A single computational test can thus have a high
variance in the results produced (i.e. best solution found) when repeated with
different random seeds. To produce higher statistical significance, a single test may
be repeated repeatedly, as long as different random number sequences are used.
You should consult your statistics course literature, but mainly there are two
choices for statistical tests. One is to use a non-parametric test like Mann-Whitney,
or have enough samples to approach a normal distribution, and then use a
parametric test for normal distributions.
Be able to reproduce the results. There is often a need to rerun all the tests made.
This might have different reasons, most often it is because an error in the code is
detected.The following points can help this re-testing.
 Collect all tests in a set of batch-files. A good example is given in
assignment 1.
 If a random number generator is used, reproducibility of single tests
can be guaranteed by using the same random number seed, starting the
random number generator at the same place in its pseudo-random
sequence.
About the document










Only deliver 1 document
Length 3 – 10 pages
Add page numbers
The front page of the document should contain the name of the authors
Do not hand in log-files
Use e.g. EXCEL to make graphics, and then cut and paste into the report
Related text and graphics and tables should be together
Always reference a table or graphic.
Make the document self-contained
Make the document look nice.
o Pay attention to layout.
o Remember that someone else might read it
o A nice document puts the reader (grader) in a better mood.
Other
Max group size = 2
Download