ch2 - Architecture, Design and Planning

advertisement
PART I
Experimental
considerations
Perceptua l Audio Eva lua tio n – Theo ry, M etho d a nd Applica tio n Søren Bech and Nick Zacharov
~c 2006 John Wiley & Sons, Ltd. ISBN: 0-470-86923-2
CHAPTER
2
Definition of
research question
and hypothesis
Perceptua l Audio Eva lua tio n – Theo ry, M etho d a nd Applica tio n Søren Bech and Nick Zacharov
~c 2006 John Wiley & Sons, Ltd. ISBN: 0-470-86923-2
17
hIS chapter discusses the formulation of research questions and the
principles behind establishing a testable statement based on a premise
Tor hypothesis and a number of initial conditions. The formulation of
the hypothesis, the initial conditions and the testable statement is
perhaps the most important part of any scientific experiment. Firstly
because they determine the scientific quality of the experimental
results, secondly because the truthfulness of all or some of the initial
conditions are determined by the design of the experimental set-up and
thirdly because the statistical analysis of the observed data depends on
the form of the testable statement.
The research question is a broad formulation of the problem that
is to be investigated. An example would be the research question that
Perceptua l Audio Eva lua tio n – Theo ry, M etho d a nd Applica tio n Søren Bech and Nick Zacharov ~c
2006 John Wiley & Sons, Ltd. ISBN: 0-470-86923-2
17
5
Chapter 2 Definition of research question and hypothesis
was formulated for the Archimedes project3: ‘Why is the perceived
sound quality of a loudspeaker different in different domestic sized
rooms and in different positions in the same room?’.
This question involves a large number of physical (see discussion
in Chapter 5) and psycho-acoustical (see discussion in Chapter 4)
variables that need to be carefully considered in order to obtain
meaningful experimental results. The next logical step would thus
be to divide this general question into a number of experimentally
manageable questions where subsets of these variables are examined
in more detail. However, the first immediate question one could ask
is if it has been proven that loudspeakers actually do sound different
in different rooms and in different positions.
Some researchers would argue that their practical experiences have
shown this to be the case and thus conclude that it must be a general
problem for all loudspeakers in all rooms. Others would argue that
though experience indicates that this might be true it is not possible to
draw conclusions for all loudspeakers and all rooms on the basis of a
few observations and therefore a more rigourous validation is needed
before a major research project is started.
The above dilemma, empirical versus rational knowledge, is one
of the basic questions when discussing how scientific knowledge may
be established. The following discussion will be an introduction to the
problem and provide some guidance on how to tackle this and other
related questions.
The purpose of this chapter is not to give a complete overview of the
history of scientific argumentation, but to introduce the concepts and
to provide an understanding of the general principles in formulating
a research hypothesis.
The discussion will be based on two main references, Fjelland and
Gjengedal [123] and Johansson and Lynoee [248], so only additional
references will be noted in the following text.
Scientific knowledge is traditionally characterised by three fundamental aspects:
It is theoretical Scientific knowledge is theoretical in the sense that
science does not solve practical problems per se. This is not to
3The Archimedes project was a five-year collaborative research effort funded
under the European research scheme Eureka. The partners were the Acoustics
Laboratory (now Acoustic Technology) of The Technical University of Denmark, KEF
Electronics of England, and Bang & Olufsen of Denmark. Results of the project have
been published in, for example, Bech [29–31,34]
Chapter 2 Definition of research question and hypothesis
6
say that scientific questions cannot originate from practical problems or that scientific knowledge cannot be used for practical
purposes, but if the work is aimed at solving a particular practical
situation and the gained knowledge cannot be generalised, then it
is not considered science in the traditional sense.
It is the truth The basic difference between knowledge and belief is
that knowledge can only be the truth whereas belief can be either
true or false. In addition, it is often required that knowledge must
be documented using established scientific rules and measures.
It is systematical The systematical aspect is based on the fact that
new knowledge must be based on systemised knowledge from
previous experiments.
These requirements have led to formal systems representative
of how scientific knowledge is established–the so-called axiomatic or
deductive systems. Platon (427–347 B.C.) was one of the first to develop
such a system and later Galileo Galilei (1564–1642) and Isaac Newton
(1642–1727) used such a system to organise the development of their
new theories. An axiomatic system is based on a set of definitions upon
which it is possible to establish axioms or proven hypotheses, and based
on the definitions and the axioms it is possible to establish theories.
The knowledge represented by the axioms or hypotheses must be
true knowledge for the developed theories to be the building blocks
for new knowledge. However, the definition of true knowledge and
how it is proven to be true have been debated since Galilei, who
used rational arguments (using common sense) in some cases and
empiricism (using his senses to gather experience) in others. This
fundamental division between how scientific knowledge is created is
represented by two major principles of scientific argumentation. One
is the principle of empiricism, which claims that only observations and
experiments can determine which theories or hypotheses are true. The
other, the principle of rationalism, claims that deduction using common
sense is the correct way to obtain true knowledge.
Most scientists, especially those working in areas involving both
exact sciences (for example mathematics) and social sciences (for
example psychology), will use a mixture of the two principles,
depending on the experimental situation. Perceptual experiments
such as evaluation of reproduced sound quality typically involves this
mixture, so a short description of both principles will be given in the
following text.
7
Chapter 2 Definition of research question and hypothesis
2.1 Principle of empiricism
The principle of empiricism is based on the fact that it is only observations and experiments that form the basis for the generalisation
of observations: reasoned from the particular to the general. This is
expressed in the principle of induction: ‘(the premiss:) If a certain
observation A is observed together with another observation B and
further that A has never been found without B then it follows: (the
conclusion) the more often A and B are observed together, the greater
is the likelihood that they will appear together in another situation
where one of them is observed’.
An argument based on induction for the premiss of the Archimedes
project, that the same loudspeaker sounds different in different positions in the same room, would be as follows:
Premiss 1: It has been observed through a number of listening tests,
including a range of different loudspeaker types, that the same
loudspeaker positioned in different positions (observation A) in
the same room sounds different (observation B).
Conclusion: It is true for all loudspeaker types that the same loudspeaker will sound different in all positions in the same room.
According to this approach, there would be no reason to start
a series of listening tests to prove the assumption, but instead the
experimenter should start a series of experiments to examine various
reasons for this observed difference between positions.
The problem, according to critics of the principle of empiricism, is
that the conclusion can be flawed but the premiss can still be true. This
violates another fundamental law in scientific argumentation: The law
of contradiction, which states that a statement and its negation cannot
be true at the same time.
When empiricism is the foundation for generalisation, an often
applied research strategy is to test the observed principle or law under
a range of different conditions and, if it is found to be valid in all the
tested situations, it is accepted as a general law. Examples of laws
based on empiricism can be found both in physics where Boyle’s law
states that the relation between the pressure p and the volume V of a gas
is always constant (p × V = constant), and in psychophysics where
Stevens, [403, 406] using a direct magnitude estimation procedure,
2.2 P rincip le o f r ati on a li sm
8
found that the relationship between sensation S and stimulus intensity
I can be described by a power law (S = constant × ( I I 0 )k), where I0
is a reference intensity and the exponent k depends on the modality
and the stimulus. The ITU-R recommendation BS.1387 [213], which is
intended for prediction of the perceived impairment in sound quality
of a low bit-rate codec as a function of bit rate, is also an example
of a ‘law’ or relationship that has been verified for a large number of
situations and has been accepted as an international standard.
The principle of empiricism has often been criticised, especially the
fact that the main focus is on confirmative results rather than deviating
results, which according to the critics is where scientific progress is
often made. One of the main opponents of the empirical principle
was Popper (see, for example, [350]), who in 1934 published the book
‘The logic of scientific discovery’. Popper argues that, no matter
how many supportive observations there are, there will always be a
finite probability that there are cases that do not support the previous
observations and such cases will violate the law of contradiction as
discussed earlier.
Popper’s alternative is the so-called ‘hypothetico-deductive’
method, which is discussed in the next section.
2.2 Principle of rationalism
Popper [350] suggested the ‘hypothetico-deductive’ as an alternative
(in fact, Popper argued that it is the only method) to the principle of
empiricism or inductive method discussed in the previous section.
Before discussing the hypothetico-deductive method, it is useful
to examine the principle behind deductive arguments. A deductive
argument is based on a number of premisses and a conclusion. A
simple deductive argument could be as follows:
Premiss 1: When it rains the street will be wet.
Premiss 2: It rains.
Conclusion: The street will be wet.
The deductive principle means that if the premisses are true the
conclusion must be true. The opposite, the premisses were true and
the conclusion false, would violate the law of contradiction.
9
Chapter 2 Definition of research question and hypothesis
The ‘conclusion’ is typically of a form that can be tested in an
experiment (in the example above, you would look out of the window)
and if the conclusion can be proven to be false (the street is not wet)
it follows that one of the premisses must be false (probably that it is
not raining). However, if the conclusion is found to be true (the street
is wet), it does not finally prove that the premisses are true (the street
cleaning truck could just have passed by and sprinkled water on the
street). This principle is called the asymmetry between falsification
and verification, and Popper argued that only falsification leads to true
knowledge and progress: A falsification means that the experimenter
must re-think or reformulate the premisses, which is equivalent to
scientific progress.
The hypothetico-deductive method is based on a set of general
observations that lead to the formulation of a hypothesis (see premiss
1 in the example above). The hypothesis together with a set of
initial conditions (premiss 2) leads, based on a deductive argument,
to a testable statement (the conclusion). The hypothetico-deductive
method thus reasons from the general to the particular, which is the
opposite of the inductive method discussed above.
Because the test statement is based on deductive arguments, it
follows that if the initial conditions are true but the test statement
is false then the hypothesis must be false. If the test statement was
found to be true it could be assumed, but not formally verified, that
the hypothesis was true and this would not lead to any real scientific
progress according to Popper. So the experimenter’s primary aim is, if
the hypothetico-deductive method is used, to formulate one or several
hypotheses that together with the initial conditions lead to a testable
statement that must be rejected.
If the hypothetico-deductive method is used to test the assumption
of the Archimedes project, the rationale would be as follows: The experience shows that for some loudspeaker types the same loudspeaker
will sound different in different positions in the same room. However,
it has also been found that the sound quality of certain other types are
less influenced by their positions. So, in order to follow the principle
of falsification, the focus is on the loudspeaker types that are not
influenced by their positions in the room:
Hypothesis (premiss 1): Physically identical loudspeakers positioned
in different positions in the same room sound similar.
2.2 P rincip le o f r ati on a li sm
10
Initial conditions (premiss 2): Two physically identical loudspeakers
in two different positions in the same room are compared in a
listening test and the results of the test are true representations
of the perceived sound quality of the individual loudspeakers.
Testable statement (conclusion): The perceived sound quality of the
two loudspeakers will be identical.
This strategy was used in the Archimedes project, and the results
have been reported in Bech [29], where the test statement was proven
statistically false for some of the tested loudspeakers, and true for others.
This was the experimental basis for concluding that the hypothesis was
false for at least some of the combinations of loudspeaker types and
positions, and it was decided to perform further experiments to examine
the influence of position on sound quality.
Now the basic assumption of the deductive principle is that if the
premisses are true the conclusion must be true, and further that if
the conclusion is false and the initial condition is true the hypothesis
is false. The principle also states that the testable statement should
be formulated with the purpose of falsification. The truthfulness of
the initial condition thus becomes very important and this is often
related to the quality of the experimental part of the investigation. The
above example from the Archimedes project is a good illustration of
why this can be quite complicated and difficult to achieve in the real
experimental situation.
The initial condition was that the results of the listening tests
represent the true magnitude of the perceived sound quality of the
tested loudspeakers. However, this will depend on a number of
variables such as the selected programmes (musical genre and number
of excerpts), physical properties of the loudspeaker, number and type
of subjects, physical identity of the two loudspeakers, and so on. So the
truthfulness of the initial condition is related not to the truthfulness of
a single condition but instead to a sum of conditions.
The scientific quality of the inferences of an experiment thus relies
very much on the truthfulness of each individual part constituting
the initial condition. The truthfulness of these individual parts is
determined by the variables that define the experimental conditions.
These variables are identified and discussed in the remainder of
this book.
11
Chapter 2 Definition of research question and hypothesis
2.3 Other principles of scientific argum entation
The principles of empiricism and rationalism discussed above are,
in most scientific papers, the two basic principles for establishing a
testable statement and for the subsequent discussion of the results.
However, there are a number of other relevant types of scientific argumentation that are especially important when considering practical
experiments often used in, for example, engineering in contrast to
theoretical disciplines like mathematics.
2.3.1 Probabilistic reasoning
Probabilistic reasoning is used when the truthfulness of the hypothesis
is based on a limited number of observations from a given population.
This introduces a finite probability that a rejected hypothesis is not
representative or correct for the whole population. This is a very
typical situation in medical experiments where only a limited number
of persons can be tested and likewise in listening tests where in
addition to a limited number of subjects only a limited number of
programmes, listening rooms, loudspeaker positions, and so on, can
be tested. This makes the probabilistic reasoning approach the most
typical for assessments of sound quality using subjects.
Probabilistic hypotheses can be tested using either a deductive
principle or an inductive principle, but in addition a probabilistic
hypothesis arguments can be
mathematical.
objective.
subjective.
The classical mathematical principle is always deductive, and
theoretical probabilistic theory follows the standard mathematical
rules. However, when probability calculations are used on an observed
data set rather than theory, the truth of the hypothesis becomes
testable. However, the deductive line of reasoning can still be applied
and a typical example could be as follows:
Hypothesis (premiss 1): For a standard dice, the relative frequency
of observing any one number in the range 1–6 is 1 6.
2.3 Other principles of scientific argumentation
12
Initial condition (premiss 2): A standard dice is thrown n times.
Testable statement (conclusion): Approximately 6 of the n observations will be ‘one’.
The hypothesis is now based on an observed probability and the
truthfulness is now finite depending on the number of observations
in the observed sample. A hypothesis originating from a calculated
probability (based on a number of samples) is called an ‘objective’
hypothesis.
A ‘subjective’ hypothesis refers to the fact that some probabilities,
although being objectively determined (based on a number of samples), have other conditions attached to them that must be fulfilled
before the objective probability is relevant. The evaluation of these
other conditions then becomes the subjective part. An example would
be that it is possible to calculate the objective probability that an
otologically normal person will hear a given pure tone at a given frequency and sound pressure level, but there might be other conditions
(for example, the person is tired or has a cold) that could influence
the ability to hear and thus need to be evaluated by the experimenter
before that subject is used in the listening test where the objective
probability is used.
The argument including the dice is strictly deductive even if it
includes an objective hypothesis; however, it is often the case that the
objective hypothesis is not tested and instead the hypothesis is based
on an inductive argument:
Observation (premiss 1): Approximately 6 of the numbers observed
for this dice were ‘one’.
Conclusion: The probability of getting a ‘one’ with this dice is 6.
It is important to note that if the hypothesis is based on an inductive
argument then it is impossible for the argument that includes the
hypothesis to be anything but inductive.
The above discussion hopefully illustrates that in some cases it is
very difficult to determine if an argument is deductive or inductive. In
situations where the hypothesis is based on a limited sample, which
is the case in statistics, the argument will always be inductive as
the premisses describes the sample, but often (but not always) the
conclusion refers to the whole population.
13
Chapter 2 Definition of research question and hypothesis
This means that the design of most listening tests, theoretically
speaking, will be following the inductive principle, but it is still important to apply the hypothetico-deductive principle when designing
the experiment to ensure the truthfulness of the initial conditions.
However, when the general validity of the conclusions are discussed,
it is important to consider and keep in mind the principle the hypothesis was based on. This requires a careful assessment of the statistical
assumptions and the subsequent analysis. This is discussed in more
detail in Section 6.2.
2.3.2 Argumentum ad hominem
There is another principle called ‘argumentum ad hominem’. It refers
to the principle of using personal statements from authorities within
the field as proven facts without checking them formally. This type of
arguments should be avoided, in the authors’ view, as they carry no
scientific weight in themselves. This is not to say that the authorities
being referred to are not producing scientifically valid results, but then
these should be referenced, and not personal, statements. This line of
reasoning is often applied in Hi-Fi magazines, where the reviewers
will have or gain status as authorities in their areas and quite often the
reader will use the arguments of the reviewer for buying a particular
product.
2.3.3 Conclusion by analogy
Yet another principle is ‘conclusions by analogy’, which means that
the conclusion is assumed to be valid because it is similar to the
premisses. An example would be the doctor who concluded that cold
water should be used to treat burnt skin. He drew that conclusion
by analogy from the boiled egg he used to have every morning. One
morning the egg was more hard boiled than usual and he found the
reason to be that his wife had not rinsed the egg in cold water as
she used to. He concluded that cold water could stop the process
of coagulation in protein and hence that cold water could stop the
coagulation process when used on burnt skin.
This line of reasoning is rarely used (i.e. the authors have never
seen it being applied) in auditory evaluations, but the principle should
not be excluded as not being useful.
2.4 Summary
27
2.4 Summary
The above sections have discussed the major principle for establishing
a hypothesis and based on a set of initial conditions, a testable statement. The most often applied principle in audio evaluations, based
on the experience of the authors, is that of probabilistic reasoning.
This, however, is also one of the most complicated as the line between
the principles of empiricism and rationalism is not always crystal
clear in such experiments. However, it is still the authors’ view that
rationalism and the hypothetico-deductive principle should be aimed
for although it is clear that this is not trivial and requires careful
consideration of a large number of factors. Especially, experimental
planning and control are important to ensure the truthfulness of the
initial conditions. The next chapter will discuss the fundamentals of
experimentation and how this relates to the hypothesis and testable
statement.
Download