Uploaded by walididolor

psych assesment

advertisement
lOMoARcPSD|3551091
JOJO Psychological-Assessment-Lecture-Notes
Personality (Central College)
StuDocu is not sponsored or endorsed by any college or university
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
LECTURE NOTES
PSYCHOLOGICAL ASSESSMENT
Prepared and Screened by:
Prof. Jose J. Pangngay, MS Psych, RPm
CHAPTER I: BRIEF HISTORY OF PSYCHOLOGICAL TESTING AND PROMINENT INDIVIDUALS IN PSYCHOLOGICAL ASSESSMENT
A.
Ancient Roots
• Chinese Civilization – testing was instituted as a means of selecting who, of the many applicants, would obtain government jobs
• Greek Civilization – tests were used to measure intelligence and physical skills
• European Universities – these universities relied on formal exams in conferring degrees and honors
B.
Individual Differences
• Charles Darwin – believed that despite our similarities, no two humans are exactly alike. Some of these individual differences are more “adaptive than
others and these differences lead to more complex, intelligent organisms over time.
• Francis Galton – he established the testing movement; introduced the anthropometric records of students; pioneered the application of rating-scale and
questionnaire method, and the free association technique; he also pioneered the use of statistical methods for the analysis of psychological tests He used
the Galton bar (visual discrimination length) and Galton whistle (determining the highest audible pitch). Moreover, he also noted that persons with mental
retardation tend to have diminished ability to discriminate among heat, cold and pain.
C.
Early Experimental Psychologists
• Johan Friedrich Herbart – Mathematical models of the mind; father of pedagogy as an academic discipline; went against Wundt
• Ernst Heinrich Weber – sensory thresholds; just noticeable differences (JND)
• Gustav Theodor Fechner – mathematics of sensory thresholds of experience; founder of psychophysics; considered one of the founders of
experimental psychology; Weber-Fechner Law first to relate sensation and stimulus
• Wilhelm Wundt – considered one of the founders of Psychology; first to setup a psychology laboratory
• Edward Titchner – succeeded Wundt; brought Structuralism to America; his brain is still on display in the psychology department at Cornell
• Guy Montrose Whipple – pioneer of human ability testing; conducted seminars that changed the field of psychological testing
• Louis Leon Thurstone – large contributor of factor analysis; approach to measurement was termed as the law of comparative judgment
D.
The Study of Mental Deficiency and Intelligence Testing (Theories of Intelligence)
• Jean Esquirol – provided the first accurate description of mental retardation as an entity separate from insanity.
• Edouard Seguin – pioneered modern educational methods for teaching people who are mentally retarded/intellectually disabled
• James McKeen Cattell – an American psychologist who coined the term “mental test”
• Alfred Binet – the father of IQ testing
• Lewis M. Terman – introduced the concept of IQ as determined by the mental age and chronological age
IQ Classification according to the Stanford-Binet 5 (* reflects extended IQ scores)
*176-225
: Profoundly Gifted
*161-175
: Extremely Gifted
145-160
: Very Gifted
130-144
: Gifted
120-129
: Superior
110-119
: High Average
90-109
: Average
80-89
: Low Average
70-79
: Borderline Impaired
55-69
: Mildly Impaired
40-54
: Moderately Impaired
*25-39
: Severely Impaired
*10-24
: Profoundly Impaired
• Charles Spearman – introduced the two-factor theory of intelligence (General ability or “g” – required for performance on mental tests of all kinds; and
Special abilities or “s” – required for performance on mental test of only one kind)
• Thurstone – Primary Mental Abilities
• David Wechsler – Wechsler Intelligence Tests (WISC, WAIS)
• Raymond Cattell – introduced the components of “g” (Fluid “g” – ability to see relationships as in analogies and letter and number series, also known as
the primary reasoning ability which decreases with age; and Crystallized “g” – acquired knowledge and skills which increases with age)
• Guilford – theorized the “many factor intelligence theory” (6 types of operations X 5 types of contents X 6 types of products = 180 elementary abilities)
• Vernon and Carroll – introduced the hierarchical approach in “g”
• Sternberg – introduced the “3 g’s” (Academic g, Practical g, and Creative g)
• Howard Gardner – conceptualized the multiple intelligences theory
• Henry Goddard – translated the Binet-Simon test into French
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
E.
World War I
• Robert Yerkes – pioneered the first group intelligence test known as the Army Alpha (for literate) and Army Beta (for functionally illiterate)
• Arthur S. Otis – introduced multiple choice and other “objective” item type of tests
• Robert S. Woodworth – devised the Personal Data Sheet (known as the first personality test) which aimed to identify soldiers who are at risk for shell
shock
F.
Personality Testers
• Herman Rorschach – slow rise of projective testing; Rorschach Inkblot Test
• Henry Murray & Christina Morgan – Thematic Apperception Test
• Early 1940’s – structure tests were being developed based on their better psychometric properties
• Raymond B. Cattell – 16 Personality Factors
• McCrae & Costa – Big 5 Personality Factors
G.
Psychological Testing in the Philippines
• Virgilio Enriquez – Panukat ng Ugali at Pagkatao or PUP
• Aurora R. Palacio – Panukat ng Katalinuhang Pilipino or PKP
• Anadaisy Carlota – Panukat ng Pagkataong Pilipino or PPP
• Gregorio E.H. Del Pilar – Masaklaw na Panukad ng Loob or Mapa ng Loob
• Alfredo Lagmay – Philippine Thematic Apperception Test (PTAT)
CHAPTER II: PSYCHOLOGICAL TESTING AND PSYCHOLOGICAL ASSESSMENT
A.
B.
C.
Objectives of Psychometrics
1. To measure behavior (overt and covert)
2. To describe and predict behavior and personality (traits, states, personality types, attitudes, interests, values, etc.)
3. To determine signs and symptoms of dysfunctionality (for case formulation, diagnosis, and basis for intervention/plan for action)
Psychological Testing vs. Psychological Assessment
Psychological Testing
Objective
Typically, to obtain some gauge, usually numerical in
nature, with regard to an ability or attribute
Focus
How one person or group compares with others
(nomothetic)
Process
Testing may be individual or group in nature. After test
administration, the tester will typically add up “the number
of correct answers or the number of certain types of
responses… with little if any regard for the how or
mechanics of such content”
Role of Evaluator
The tester is not the key to the process; practically
speaking, one tester may be substituted for another tester
without appreciably affecting the evaluation.
Skill of Evaluator
Testing typically requires technician-like skills in terms of
administering and scoring a test as well as in interpreting
a test result.
Outcome
Typically, testing yields a test score or series of test
scores.
Duration
Sources of Data
Shorter, lasting from few minutes to few hours
One person, the test taker only
Qualification for Use
Knowledge of tests and testing procedures
Cost
Inexpensive, especially when group testing is done
Psychological Assessment
Typically to answer a referral question, solve a problem, or
arrive at a decision through the use of tools of evaluation.
The uniqueness of a given individual, group, or situation
(idiographic)
Assessment is typically individualized. In contrast to testing,
assessment more typically focuses on how an individual
processes rather than simply the results of that processing.
The assessor is the key to the process of selecting tests and/or
other tools of evaluation as well as in drawing conclusions from
the entire evaluation.
Assessment typically requires an educated selection of tools of
evaluation, skill in evaluation, and thoughtful organization and
integration of data.
Typically, assessment entails a logical problem-solving
approach that brings to bear many sources of data designed to
shed light on a referral question.
Longer, lasting from a few hours to a few days or more
Often collateral sources, such as relatives or teachers, are used
in addition to the subject of the assessment
Knowledge of testing and other assessment methods as well as
of the specialty area assessed (psychiatric disorders, job
requirements, etc.)
Very expensive, requires intensive use of highly qualified
professionals
Assumptions about Psychological Testing and Assessment
1. Psychological traits and states exist.
• Trait - characteristic behaviors and feelings that are consistent and long lasting.
• State -temporary behaviors or feelings that depend on a person's situation and motives at a particular time
2. Psychological traits and states can be quantified and measured.
3. Test-related behavior predicts non-test-related behavior.
• Postdict it - To estimate or suppose something which took place in past; to conjecture something that occurred beforehand
• Predict - say or estimate that (a specified thing) will happen in the future or will be a consequence of something
4. Tests and other measurement techniques have strengths and weaknesses.
5. Various sources of error are part of the assessment process.
• Error – long standing assumption that factors other than what a test attempts to measure will influence performance on the test
• Error variance – the component of test score attributable to sources other than the trait or ability being measured
6. Testing and assessment can be conducted in a fair and unbiased manner.
7. Testing and assessment benefit society.
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
D.
Tools of Psychological Assessment
1. Psychological Tests – a standardized measuring device or procedure used to describe the ability, knowledge, skills or attitude of the individual
• Measurement – the process of quantifying the amount or number of a particular occurrence of event, situation, phenomenon, object or person
• Assessment – the process of synthesizing the results of measurement with reference to some norms and standards
• Evaluation – the process of judging the worth of any occurrence of event, situation, phenomenon, object or person which concludes with a
particular decision
2. Interviews – a tool of assessment in which information is gathered through direct, reciprocal communication. Has three types (structured, unstructured
and semi-structured).
3. Portfolio Assessment – a type of work sample is used as an assessment tool
4. Case-History Data – records, transcripts, and other accounts in any media that preserve archival information, official and informal accounts, and other
data and items relevant to the assessee
5. Behavioral Observation – monitoring the actions of other or oneself by visual or electronic means while recording qualitative and/or quantitative
information regarding those actions, typically for diagnostic or related purposes and either to design intervention or to measure the outcome of an
intervention.
E.
Parties in Psychological Assessment
1. Test Authors and Developer – create tests or other methods of assessment
2. Test Publishers – they publish, market, and sell tests, thus controlling their distribution
3. Test Reviewers – they prepare evaluative critiques of tests based on their technical and practical merits
4. Test Users – professionals such as clinicians, counselors, school psychologists, human resource personnel, consumer psychologists, experimental
psychologists, social psychologists, etc. that use these tests for assessment
5. Test Sponsors – institutional boards or government agencies who contract test developers or publishers for a various testing services
6. Test Takers – those who are taking the tests; those who are subject to assessment
7. Society at Large
F.
Three-Tier System of Psychological Tests
1. Level A
– these tests are those that can be administered, scored and interpreted by responsible non-psychologist who have carefully read the manual and
are familiar with the overall purpose of testing. Educational achievement tests fall into this category.
– Examples: Achievement tests and other specialized (skill-based) aptitude tests
2. Level B
– these tests require technical knowledge of test construction and use of appropriate advanced coursework in psychology and related courses
– examples: Group intelligence tests and personality tests
3. Level C
– these tests require an advanced degree in Psychology or License as Psychologist and advanced training/supervised experience in a particular
test (Examples: Projective tests, Individual Intelligence tests, Diagnostic tests)
G.
General Types of Psychological Tests According to Variable Measured
1. Ability Tests
- Assess what a person can do
- Includes Intelligence Tests, Achievement Tests and Aptitude Tests
- Best conditions are provided to elicit a person’s full capacity or maximum performance
- There are right and wrong answers
- Objective of motivation: for the examinee to do his best
2. Tests of Typical Performance
- Assess what a person usually does
- Includes personality tests, interest/attitude/values inventories
- Typical performance can still manifest itself even in conditions not deemed as best
- There are no right or wrong answers
- Objective of motivation: for the examinee to answer questions honestly
H.
Specific Types of Psychological Tests
1. Intelligence Test
– measures general potential
– Assumption: fewer assumptions about specific prior learning experiences
– Validation process: Content Validity and Construct Validity
– examples: WAIS, WISC, CFIT, RPM
2. Aptitude Test
– Measures an individual’s potential for learning a specific task, ability or skill
– Assumption: No assumptions about specific prior learning experiences
– Validation process: Content validity and Predictive Validity
– Examples: DAT, SATT
3. Achievement Test
– This test provides a measure for the amount, rate and level of learning, success or accomplishment, strengths/weaknesses in a particular subject
or task
– Assumption: Assumes prior relatively standardized educational learning experiences
– Validation process: Content validity
– Example: National Achievement Test
4. Personality Test
– measures traits, qualities, attitudes or behaviors that determine a person’s individuality
– can measure overt or covert dispositions and levels of adjustment as well
– can be measured idiographically (unique characteristics) or nomothetically (common characteristics)
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
– has three construction strategies namely: theory-guided inventories, factor-analytically derived inventories, criterion-keyed inventories
– examples: NEOPI, 16PF, MBTI, MMPI
5. Interest Inventory
– Measures an individual’s performance for certain activities or topics and thereby help determine occupational choice or make career decisions
– Measure the direction and strength of interest
– Assumption: Interests though unstable, have a certain stability or else it cannot be measured
– Stability is said to start at 17 years old
– Broad lines of interests are more stable while specific lines of interests are more unstable, they can change a lot.
– Example: CII
6. Attitude Inventory
– Direct observation on how a person behaves in relation to certain things
– Attitude questionnaires or scales (Bogardus Social Distance Scale, 1925)
– Reliabilities are good but not as high as those of tests of ability
– Attitude measures have not generally correlated very highly with actual behavior
– Specific behaviors, however, can be predicted from measures of attitude toward the specific behavior
7. Values Inventory
– Purports to measure generalized and dominant interests
– Validity is extremely difficult to determine by statistical methods
– The only observable criterion is overt behavior
– Employed less frequently than interest in vocational counseling and career decision-making
8. Diagnostic Test
– This test can uncover and focus attention on weaknesses of individuals for remedial purposes
9. Power Test
– Requires an examinee to exhibit the extent or depth of his understanding or skill
– Test with varying level of difficulty
10. Speed Test
– Requires the examinee to complete as many items as possible
– Contains items of uniform and generally simple level of difficulty
11. Creativity Test
– A test which assesses an individual’s ability to produce new/original ideas, insights or artistic creations that are accepted as being social, aesthetic
or scientific value
– Can assess the person’s capacity to find unusual or unexpected solutions for vaguely defined problems
12. Neuropsychological Test
– Measures cognitive, sensory, perceptual and motor performance to determine the extent, locus and behavioral consequences of brain damage,
given to persons with known or suspected brain dysfunction
– Example: Bender-Gestalt II
13. Objective Test
– Standardized test
– Administered individually or in groups
– Objectively scored
– There are limited number of responses
– Uses norms
– There is a high level of reliability and validity
– Examples: Personality Inventories, Group Intelligence Test
14. Projective Test
– Test with ambiguous stimuli which measures wishes, intrapsychic conflicts, dreams and unconscious motives
– Projective tests allow the examinee to respond to vague stimuli with their own impressions
– Assumption is that the examinee will project his unconscious needs, motives, and conflicts onto the neutral stimulus
– Administered individually and scored subjectively
– Have 5 types/techniques: Completion Technique, Expressive Technique, Association Technique, Construction Technique, Choice or Ordering
Technique
– With low levels of reliability and validity
– Examples: Rorschach Inkblot Test, TAT, HTP, SSCT, DAP
15. Norm-Referenced Test – raw scores are converted to standard scores
16. Criterion-Referenced Test – raw scores are referenced to specific cut-off scores
***Clinical Differences Between Projective Tests and Psychometric (Objective Tests)
Point of Comparison/Difference
Projective Test
Definiteness of Task
Allows variation in responses and recall more
individualized response pattern
Response Choice vs. Constructed Response
The subject gives whatever response seems
fitting within the range allowed by the test
direction
Response vs. Product
Watches the subject at work from a general
direction
Analysis of Results
Gross score could still be supplemented by
investigation of the individual’s reaction and
opinion
Emphasis on Critical Validation
Makes analysis of individual response
The tester is satisfied in comparing impression
based on one procedure with impression
gained from another
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
Psychometric Test
Subjects are judged in very much the same
basis
It can be more objectively scored and does not
depend on fluency or expressive skills
It concerns itself with the tangible product of
performance
Formal scoring plays large part in scoring the
test
Measured in standard norms
The tester accompanies every numerical score
with a warning regarding the error of the
measurement and every prediction with an
index that shows how likely it is to come true
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
I.
Basic Principles in the Use of Psychological Tests
1. Tests are samples of behavior
2. Tests do not reveal traits or capacities directly
3. Psychological maladjustments selectively and differentially affect the test scores
4. The psychometric and projective approaches, although indistinguishable, are mutually complementary
J.
Psychological Tests are used in the following settings:
1. Educational Settings
– Basis for admission and placement to an academic institution
– Identify developmental problems or exceptionalities for which a student may need special assistance
– Assist students for educational od vocational planning
– Intelligence tests and achievement tests are used from an early age. From kindergarten on, tests are used for placement and advancement.
– Educational institutions have to make admissions and advancement decisions regarding students. e.g, SAT, GRE, subject placement tests
– Used to assess students for special education programs. Also, used in diagnosing learning difficulties.
2. Clinical Settings
– Tests of Psychological Adjustment and tests which can classify and/or diagnose patients are used extensively.
– Psychologist generally use a number of objective and projective personality tests.
– Neuropsychological tests which examine basic mental function also fall into this category. Perceptual tests are used detecting and diagnosing
brain damage.
– For diagnosis and treatment planning
3. Counseling Settings
– Counseling in schools, prisons, government or private institutions
4. Geriatric Settings
– Assessment for the aged
5. Business Settings (Personnel Testing)
– Tests are used to assess: training needs, worker’s performance in training, success in training programs, management development, leadership
training, and selection.
– For example, the Myers -Briggs type indicator is used extensively to assess managerial potential. Type testing is used to hopefully match the right
person with the job they are most suited for.
– Selection of employees’ classification of individuals to positions suited for them
– Basis for promotion
6. Military Settings
– For proper selection of military recruits and placement in the military duties
7. Government and Organizational Credentialing
– For promotional purposes, licensing, certification or general credentialing of professionals
8. Courts
– Evaluate the mental health of people charged with a crime
– Investigating malingering cases in courts
– Making child custody/annulment/divorce decisions
9. Academic Research Settings
K.
Uses of Psychological Test
1. Classification – assigning a person to one category rather than the other
a. Placement – refers to sorting of persons into different programs appropriate to their needs/skills (example: a university mathematics placement
exam is given to students to determine if they should enroll in calculus, in algebra or in a remedial course)
b. Screening – refers to quick and simple tests/procedures to identify persons who might have special characteristics or needs (example: identifying
children with exceptional thinking and the top 10% will be singled out for a more comprehensive testing)
c. Certification – determining whether a person has at least the minimum proficiency in some discipline/activity (example: right to practice medicine
after passing the medical board exam; right to drive a car)
d. Selection – example: provision of an opportunity to attend a university; opportunity to gain employment in a company or in a government
2. Aptitude Testing
a. Low selection ratio
b. Low success ratio
3. Diagnosis and Treatment Planning – diagnosis conveys information about strengths, weaknesses, etiology and best choices for treatment (example: IQ
tests are absolutely essential in diagnosing intellectual disability)
4. Self-Knowledge – psychological tests also supply a potent source of self-knowledge and in some cases, the feedback a person receives from
psychological tests is so self-affirming that it can change the entire course of a person’s life.
5. Program Evaluation – another use of psychological tests is the systematic evaluation of educational and social programs (they are designed to provide
services which improve social conditions and community life)
a. Diagnostic Evaluation – refers to evaluation conducted before instruction.
b. Formative Evaluation – refers to evaluation conducted during or after instruction.
c. Summative Evaluation – refers to evaluation conducted at the end of a unit or a specified period of time.
6. Research – psychological tests also play a major role in both the applied and the theoretical branches of behavioral researches
L.
Steps in (Clinical) Psychological Assessment
1. Deciding what is being assessed
2. Determining the goals of assessment
3. Selecting standards for making decisions
4. Collecting assessment data
5. Making decisions and judgments
6. Communicating results
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
M.
Approaches in Psychological Assessment
1. Nomothetic Approach - characterized by efforts to learn how a limited number of personality traits can be applied to all people
2. Idiographic Approach - characterized by efforts to learn about each individual’s unique constellation of personality traits, with no attempt to characterize
each person according to any particular set of traits
N.
Making Inferences and Decisions in Psychological Testing and Assessment
1. Base Rate - An index, usually expressed as a proportion, of the extent to which a particular trait, behavior, characteristic, or attribute exists in a
population
2. Hit Rate - The proportion of people a test or other measurement procedure accurately identifies as possessing or exhibiting a particular trait, behavior,
characteristic, or attribute
3. Miss Rate - The proportion of people a test or other measurement procedure fails to identify accurately with respect to the possession or exhibition of a
trait, behavior, characteristic, or attribute; a "miss" in this context is an inaccurate classification or prediction and can be classified as:
a. False Positive (Type I error) - an inaccurate prediction or classification indicating that a testtaker did possess a trait or other attribute being
measured when in reality the testtaker did not
b. False Negative (Type II error) - an inaccurate prediction of classification indicating that a testtaker did not possess a trait or other attribute being
measured when in reality the testtaker did
O.
Cross-Cultural Testing
1. Parameters where cultures vary
– Language
– Education
– Test Content
– Speed (Tempo of Life)
2. Culture Free Tests
– An attempt to eliminate culture so nature can be isolated
– Impossible to develop such because culture is evident in its influence since birth or an individual
– The interaction between nature and nurture is cumulative and not relative
3. Culture Fair Tests
– These tests were developed because of the non-success of culture-free tests
– Nurture is not removed but parameters are common an fair to all
– Can be done using three approaches such as follows:
✓ Fair to all cultures
✓ Fair to some cultures
✓ Fair only to one culture
4. Culture Loadings
– The extent to which a test incorporates the vocabulary, concepts, traditions, knowledge, and feelings, associated with particular culture
CHAPTER III: RESEARCH REFRESHER
A.
Research Purposes
– to generate new knowledge
– to develop new gadgets, techniques
– to evaluate a program or technique
– to validate theories
B.
Steps in Research
1. Identify the problem
2. Conduct literature review
3. Identify theoretical/conceptual framework
4. Formulate hypothesis
5. Operationalize variables
6. Select research design
7. Ascertain and select sample
8. Conduct a pilot study
9. Collect data
10. Analyze data
11. Interpret results
12. Disseminate information
C.
D.
E.
Research Problems
– Research problem is a situation in need of description or quantification, solution, improvement or alteration. You can evaluate these problems by using
the following criteria:
✓ Significance of the problem
✓ Feasibility
✓ Researchability of the problem
✓ Interest of the researcher
– Sources of Problems
✓ Replication studies
✓ Experiences
✓ Intellectual curiosity
✓ Review of related literature
✓ Issues and popular concern
Hypotheses - statements of the anticipated or expected relationship between the independent and dependent variables.
– Types
✓ Null hypothesis- states no relationship between variables
✓ Alternative hypothesis- gives the predicted relationship
– Complexity
✓ Simple- one independent and one dependent variable
✓ Complex or Multivariate- 2 or more independent or dependent variable
Research Design
Research Component
Purpose
Philosophical
Assumptions
Qualitative Research Design
• To gain an understanding of underlying reasons and
motivations
• To provide insights into the setting of a problem,
generating ideas and/or hypotheses for later
quantitative research
• To uncover prevalent trends in thought and opinion
• To explore causality
• Post-positivist perspective
Quantitative Research Design
• To quantify data and generalize results from a sample to
the population of interest
• To measure the incidence of various views and opinions
in a chosen sample
• Sometimes followed by qualitative research which is used
to explore some findings further
• To suggest causality
• Positivist perspective
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
Research Component
Research Method
Time Element
Research Problem &
Hypotheses/Assumptions
Sample
Data Collection
Data Analysis
Outcome
F.
Qualitative Research Design
• Naturalistic
• Social, multiple & subjective reality where researcher
interacts with that being researched
• Phenomenology
• Case study
• Ethnography
• Grounded theory
• Cultural studies
• Conducted if time is not limited because of the extensive
interviewing
• Hypothesis is informed guess or prediction
• Hypotheses are being generated
• Usually a small number of non-representative cases.
Respondents selected to fulfill a given quota.
• Sampling depends on what needs to be learned
• More focused geographically
• Control group is not required
• Unstructured or semi-structured techniques e.g.:
individual depth interviews or group discussions.
• Non-statistical analysis (Thematic)
• Exploratory and/or investigative. Findings are not
conclusive and cannot be used to make
generalizations about the population of interest.
Research Methods
Research Method
Descriptive-Qualitative
(Case Study/Ethnography)
Descriptive-Quantitative
Correlational Analysis
Regression Analysis
Quasi-Experimental Research
Experimental Research
Meta-analysis
Quantitative Research Design
• Objective reality
• Researcher is independent of that which is researched
•
•
•
•
•
•
Experimental
Quasi-experimental
Single subject
Comparative
Correlational
Most suitable if time and resources are limited
• Question is evolving, general and flexible
• Hypotheses are being tested
• Usually a large number of cases representing the
population of interest. Randomly selected respondents.
• Sampling focus is on probability and “representativeness”.
• More dispersed geographically
• Control group or comparison is necessary to determine the
impact
• Structured techniques such as online questionnaires, and
standardized tests..
• Statistical analysis
• Used to recommend a final course of action.
Salient Features
Detailed descriptions of specific situation(s) using interviews, observations, document review.
The researcher’s task is to describe things as they are.
Numerical descriptions (frequency, average) of specific situations.
The researcher’s task is to measure things as they are.
Quantitative analyses of the strength of relationships between two or more variables.
Quantitative analyses of causal or predictive links between two or more variables.
Comparing a group that gets a particular intervention with another group that is similar in characteristics but did not
receive the intervention.
▪ There is no random assignment used.
▪ Using random assignment to assign participants to an experimental or treatment group and a control or comparison
group.
▪ Synthesis of results from multiple studies to determine the average impact of a similar intervention across the
studies.
▪
▪
▪
▪
▪
▪
▪
G.
Experiment Validity
– Experimental validity refers to the manner in which variables that influence both the results of the research and the generalizability to the population at
large
1. Internal Validity of an Experiment
– It refers to a study’s ability to determine if a causal relationship exists between one or more independent variables and one or more dependent
variables
– Threatened by the following:
• History and
• Testing
• Selection
Confounding Variables
• Statistical Regression
• Experimenter Bias
• Maturation
• Instrumentation
• Mortality
2. External Validity of an Experiment
– It refers to a study’s generalizability to the general population
• Demand Characteristics (subjects become wise to
• Order Effects (Carry Over Effects)
anticipated results)
• Treatment Interaction Effects (treatment +
• Hawthorne Effects
selection/history/testing)
H.
Sampling Techniques
1. In non-probability sampling, not every element of the population has an opportunity to be included.
Examples: accidental/convenience, quota, purposive and network/snowball.
2. In probability sampling, every member of the population has a probability of being included in the sample.
Examples: simple random sampling, stratified random sampling, cluster sampling and systematic sampling.
Research Variables
1. An independent variable is the presumed “cause”
2. The dependent variable is the presumed “effect”.
3. Extraneous variables are other factors that affects the measurement of the IV or DV
4. Intervening variables are any factor that are not directly observable in research situation but which maybe affecting the behavior of the subject.
I.
CHAPTER IV: STATISTICS REFRESHER
A.
Scales of Measurement
1. Primary Scales of Measurement
a. Nominal: a non-parametric measure that is also called categorical variable, simple classification. We do not need to count to distinguish one item
from another.
Example: Sex (Male and Female); Nationality (Filipino, Japanese, Korean); Color (Blue, Red and Yellow)
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
b. Ordinal: a non-parametric scale wherein cases are ranked or ordered; they represent position in a group where the order matters but not the
difference between the values.
Example: 1st, 2nd, 3rd, 4th and 5th; Pain threshold in a scale of 1 – 10, 10 being the highest
c. Interval: a parametric scale wherein this scale use intervals equal in amount measurement where the difference between two values is
meaningful. Moreover, the values have fixed unit and magnitude.
Example: Speed of a car (70KpH); Temperature (Fahrenheit and Celsius only)
d. Ratio: a parametric scale wherein this scale is similar to interval but include a true zero point and relative proportions on the scale make sense.
Example: Height and Weight
2. Comparative Scales of Measurement
a. Paired Comparison: a comparative technique in which a respondent is presented with two objects at a time and asked to select one object according
to some criterion. The data obtained are in ordinal nature.
Example: Pairing the different brands of cold drink with one another please put a check mark in the box corresponding to your preference.
Brand
Coke
No. of Times Preferred
✓
✓
✓
3
Coke
Pepsi
Sprite
Limca
Pepsi
Sprite
Limca
✓
✓
1
✓
2
0
b. Rank Order: respondents are presented with several items simultaneously and asked to rank them in order of priority. This is an ordinal scale that
describes the favoured and unfavoured objects, but does not reveal the distance between the objects. The resultant data in rank order is ordinal
data. This yields a better result when comparisons are required between the given objects. The major disadvantage of this technique is that only
ordinal data can be generated.
Example: Rank the following brands of cold drinks you like most and assign it a number 1. Then find the second most preferred brand and assign
it a number 2. Continue this procedure until you have ranked all the brands of cold drinks in order of preference. Also remember that no two
brands should receive the same rank order.
Brand
Rank
1
3
2
4
Coke
Pepsi
Sprite
Limca
c. Constant Sum: respondents are asked to allocate a constant sum of units such as points, rupees or chips among a set of stimulus objects with
respect to some criterion. For example, you may wish to determine how important the attributes of price, fragrance, packaging, cleaning power and
lather of a detergent are to consumers. Respondents might be asked to divide a constant sum to indicate the relative importance of the attributes.
The advantage of this technique is saving time. However, the main disadvantages of are the respondent may allocate more or fewer points than
those specified. The second problem is respondents might be confused.
Example: Between attributes of detergent, please allocate 100 points among the attributes so that your allocation reflects the relative importance
you attach to each attribute. The more points an attribute receives, the more important the attribute is. If an attribute is not at all important, assign
it zero points. If an attribute is twice as important as some other attribute, it should receive twice as many points.
Attribute
Number of Points
50
05
10
30
05
100
Price
Fragrance
Packaging
Cleaning power
Lather
Total Points
3.
d. Q-Sort Technique: This is a comparative scale that uses a rank order procedure to sort objects based on similarity with respect to some criterion.
The important characteristic of this methodology is that it is more important to make comparisons among different responses of a respondent than
the responses between different respondents. Therefore, it is a comparative method of scaling rather than an absolute rating scale. In this method
the respondent is given statements in a large number for describing the characteristics of a product or a large number of brands of products.
Example: The bag given to you contain pictures of 90 magazines. Please choose 10 magazines you prefer most, 20 magazines you like, 30
magazines which you are neutral (neither like nor dislike), 20 magazines you dislike and 10 magazines you prefer least.
Prefer Most
Like
Neutral
Dislike
Prefer Least
(10)
(20)
(30)
(20)
(10)
Non-Comparative Scales of Measurement
a. Continuous Rating Scales: the respondent’s rate the objects by placing a mark at the appropriate position on a continuous line that runs from one
extreme of the criterion variable to the other.
Example: How would you rate the TV advertisement as a guide for buying?
Strongly Agree
10
9
8
7
6
5
4
3
2
1
Strongly Disagree
b. Itemized Rating Scale: itemized rating scale is a scale having numbers or brief descriptions associated with each category. The categories are
ordered in terms of scale position and the respondents are required to select one of the limited numbers of categories that best describes the
product, brand, company or product attribute being rated. Itemized rating scales are widely used in marketing research. This can take the graphic,
verbal or numerical form.
c. Likert Scale: the respondents indicate their own attitudes by checking how strongly they agree or disagree with carefully worded statements that
range from very positive to very negative towards the attitudinal object. Respondents generally choose from five alternatives (say strongy agree,
agree, neither agree nor disagree, disagree, strongly disagree). A likert scale may include a number of items or statements. Disadvantage of Likert
scale is that it takes longer time to complete that other itemized rating scales because respondents have to read each statement. Despite the above
disadvantages, this scale has several to advantages. It is easy to construct, administer and use.
Example: I believe that ecological questions are the most important issues facing human beings today.
1
Strongly Disagree
2
Disagree
3
Neutral
4
Agree
5
Strongly Agree
d. Semantic Differential Scale: This is a seven-point rating scale with end points associated with bipolar labels (such as good and bad, complex and
simple) that have semantic meaning. It can be used to find whether a respondent has a positive or negative attitude towards an object. It has been
widely used in comparing brands and company images. It has also been used to develop advertising and promotion strategies and in a new product
development study.
Example: Please indicate you attitude towards work using the scale below:
Boring
Unnecessary
:
:
:
:
:
:
Attitude towards work
:
:
:
:
:
:
:
:
Interesting
Necessary
e. Staple Scale: The staple scale was originally developed to measure the direction and intensity of an attitude simultaneously. Modern versions of
the staple scale place a single adjective as a substitute for the semantic differential when it is difficult to create pairs of bipolar adjectives. The
modified staple scale places a single adjective in the center of an even number of numerical values.
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
Example: Select a plus number for words that you think describe personnel banking of a bank accurately. The more accurately you think the
word describes the bank, the larger the plus number you should choose. Select a minus number for words you think do not describe the bank
accurately. The less accurate you think the word describes the bank, the larger the minus number you should choose.
+3
+2
+1
Friendly Personnel
-1
-2
-3
B.
+3
+2
+1
Competitive Loan Rates
-1
-2
-3
Descriptive Statistics
1. Frequency Distributions – distribution of scores by frequency with which they occur
2. Measures of Central Tendency – a statistic that indicates the average or midmost score between the extreme scores in a distribution
Σ(fX)
ΣX
(for ungrouped distribution) ̅
X=
(for grouped distribution)
a. Mean – formula: ̅
X=
N
N
b. Median – the middle score in a distribution
c. Mode – frequently occurring score in a distribution
***Appropriate use of central tendency measure according to type of data being used:
Type of Data
Measure
Nominal Data
Mode
Ordinal Data
Median
Interval / Ratio Data (Normal)
Mean
Interval / Ratio Data (Skewed)
Median
3. Measures of Variability – a statistic that describe the amount of variation in a distribution
a. Range – the difference between the highest and the lowest scores
b. Interquartile range – the difference between Q1 and Q3
c. Semi-Interquartile range – interquartile range divided by 2
d. Standard Deviation – the square root of the averaged squared deviations about the mean
4. Measures of Location
a. Percentiles – an expression of the percentage of people whose score on a test or measure falls below a particular raw score
Formula for Percentile =
Number of students beaten
Total number of students
x 100
b. Quartiles – one of the three dividing points between the four quarters of a distribution, each typically labelled Q1, Q2 and Q3
c. Deciles – divided to 10 parts
5. Skewness - a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean
a. Positive skew
– relatively few scores fall at the positive end
– reflects a very difficult type of test
6. Kurtosis - the sharpness of the peak of a frequency-distribution curve.
C.
b. Negative skew
– relatively few scores fall at the negative end
– reflects a very easy type of test
The Normal Curve and Standard Scores
1. “z” Scores – Mean of 0, SD of 1 (Formula:
̅
X−X
SD
)
2. T scores – Mean of 50, SD of 10 (Formula: z-score X 10 + 50)
3. Stanines – Mean of 5, SD of 2 (Formula: z-score X 2 + 5)
4. Sten – Mean of 5.5, SD of 2 (Formula: z-score X 2 + 5.5)
5. IQ scores – Mean of 100, SD of 15
6. A scores – Mean of 500, SD of 100
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
D.
Inferential Statistics
1. Parametric vs. Non-Parametric Tests
Requirements
Common Statistical
Tools
•
•
•
•
•
•
•
•
Parametric Test
Normal Distribution
Homogenous Variance
Interval or Ratio Data
Pearson’s Correlation
Independent Measures t-test
One-way, independent-measures ANOVA
Paired t-test
One-way, repeated-measures ANOVA
•
•
•
•
•
•
•
•
Non-Parametric Test
Normal Distribution is not required
Homogenous Variance is not required
Nominal or Ordinal Data
Spearman’s Correlation
Mann-Whitney U test
Kruskal-Wallis H test
Wilcoxon Signed-Rank test
Friedman’s test
2. Measures of Correlation
a. Pearson’s Product Moment Correlation – parametric test for interval data
b. Spearman Rho’s Correlation – non-parametric test for ordinal data
c. Kendall’s Coefficient of Concordance – non-parametric test for ordinal data
d. Phi Coefficient – non-parametric test for dichotomous nominal data
e. Lambda – non-parametric test for 2 groups (dependent and independent variable) of nominal data
***Correlation Ranges:
1.00
: Perfect relationship
0.25 – 0.49
: Weak relationship
0.75 – 0.99
: Very strong relationship
0.01 – 0.24
: Very weak relationship
0.50 – 0.74
: Strong relationship
0.00
: No relationship
3. Measures of Prediction
a. Biserial Correlation – predictive test for artificially dichotomized and categorical data as criterion with continuous data as predictors
b. Point-Biserial Correlation – predictive test for genuinely dichotomized and categorical data as criterion with continuous data as predictors
c. Tetrachoric Correlation – predictive test for dichotomous data with categorical data as criterion and categorical data as predictors
d. Simple Linear Regression – a predictive test which involves one criterion that is continuous in nature with only one predictor that is continuous
e. Multiple Linear Regression – a predictive test which involves one criterion that is continuous in nature with more than one continuous predictor
f. Ordinal Regression – a predictive test which involves a criterion that is ordinal in nature with more than one predictors that are continuous in
4. Chi-Square Test
a. Goodness of Fit – used to measure differences and involves nominal data and only one variable with 2 or more categories
b. Test of Independence – used to measure correlation and involves nominal data and two variables with two or more categories
5. Comparison of Two Groups
a. Paired t-test – a parametric test for paired groups with normal distribution
b. Unpaired t-test – a parametric test for unpaired groups with normal distribution
c. Wilcoxon Signed-Rank Test – a non-parametric test for paired groups with non-normal distribution
d. Mann-Whitney U test – a non-parametric test for unpaired groups with non-normal distribution
6. Comparison of Three or More Groups
a. Repeated measures ANOVA – a parametric test for matched groups with normal distribution
b. One-way/Two-Way ANOVA – a parametric test for unmatched groups with normal distribution
c. Friedman F test – a non-parametric test for matched groups with non-normal distribution
d. Kruskal-Wallis H test – a non-parametric test for unmatched groups with non-normal distribution
7. Factor Analysis
CHAPTER V: PSYCHOMETRIC PROPERTIES OF A GOOD TEST
A.
Reliability – the stability or consistency of the measurement
1. Goals of Reliability
a. Estimate errors in psychological measurement
b. Devise techniques to improve testing so errors are reduced
2. Sources of Measurement Error
Source of Error
Type of Test Prone to Each Error Source
Inter-scorer differences and
Tests scored with a degree of subjectivity
Interpretation
Time Sampling Error
Tests of relatively stable traits or behavior
Content Sampling Error
Tests for which consistency of results, as a
whole, is required
Inter-item Inconsistency
Tests that require inter-item consistency
Appropriate Measures Used to Estimate Error
Scorer reliability
Test-Retest Reliability (rtt), a.k.a. Stability Coefficient
Alternate-form reliability (a.k.a. coefficient of equivalence)
or split-half reliability (a.k.a. coefficient of internal
consistency)
Split-half reliability or more stringent internal consistency
measures, such as KR-20 or Cronbach Alpha
Internal consistency measures and additional evidence
of homogeneity
Delayed alternate-form reliability
Inter-item Inconsistency and
Tests that require inter-item consistency and
Content Heterogeneity combined homogeneity
Time and Content Sampling error Tests that require stability and consistency of
combined
result, as a whole
3. Types of Reliability
a. Test-Retest Reliability
– compare the scores of individuals who have been measured twice by the instrument
– this is not applicable for tests involving reasoning and ingenuity
– longer interval will result to lower correlation coefficient while shorter interval will result to higher correlation
– the ideal time interval for test-retest reliability is 2-4 weeks
– source of error variance is time sampling
– utilizes Pearson r or Spearman rho
b. Parallel-Forms/Alternate Forms Reliability
– same persons are tested with one form on the first occasion and with another equivalent form on the second
– the administration of the second, equivalent form either takes place immediately or fairly soon.
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
4.
5.
6.
7.
B.
– the two forms should be truly paralleled, independently constructed tests designed to meet the same specifications, contain the same
number of items, have items which are expressed in the same form, have items that cover the same type of content, have items with the
same range of difficulty, and have the same instructions, time limits, illustrative examples, format and all other aspects of the test
– has the most universal applicability
– for immediate alternate forms, the source of error variance is content sampling
– for delayed alternate forms, the source of error variance is time sampling and content sampling
– utilizes Pearson r or Spearman rho
c. Split-Half Reliability
– Two scores are obtained for each person by dividing the test into equivalent halves (odd-even split or top-bottom split)
– The reliability of the test is directly related to the length of the test
– The source of error variance is content sampling
– Utilizes the Spearman-Brown Formula
d. Other Measures of Internal Consistency/Inter-Item Reliability – source of error variance is content sampling and content heterogeneity
– KR-20 – for dichotomous items with varying level of difficulty
– KR-21 – for dichotomous items with uniform level of difficulty
– Cronbach Alpha/Coefficient Alpha – for non-dichotomous items (likert or other multiple choice)
– Average Proportional Distance – focuses on the degree of difference that exists between item scores.
e. Inter-Rater/Inter-Observer Reliability
– Degree of agreement between raters on a measure
– Source of error variance is inter-scorer differences
– Often utilizes Cohen’s Kappa statistic
Reliability Ranges
– 1
: perfect reliability (may indicate redundancy and homogeneity)
– ≥ 0.9
: excellent reliability (minimum acceptability for tests used for clinical diagnoses)
– ≥ 0.8 < 0.9
: good reliability,
– ≥ 0.7 < 0.8
: acceptable reliability (minimum acceptability for psychometric tests),
– ≥ 0.6 < 0.7
: questionable reliability (but is still acceptable for research purposes),
– ≥ 0.5 < 0.6
: poor reliability,
– < 0.5
: unacceptable reliability,
– 0
: no reliability.
Standard Error of Measurement
– an index of the amount of inconsistency or the amount of expected error in an individual’s score
– the higher the reliability of the test, the lower the SEM
• Error – long standing assumption that factors other than what a test attempts to measure will influence performance on the test
• Error Variance – the component of test score attributable to sources other than the trait or ability being measured
• Trait Error – are those sources of errors that reside within an individual taking the test (such as, I didn’t study enough, I felt bad that
missed blind date, I forgot to set the alarm, excuses)
• Method Error– are those sources of errors that reside in the testing situation (such as lousy test instructions, too-warm room, or
missing pages).
• Confidence Interval – a range or band of test scores that is likely to contain the true score
• Standard error of the difference – a statistical measure that can aid a test user in determining how large a difference should be before it
is considered statistically significant
Factors Affecting Test Reliability
a. Test Format
e. Test Scoring
b. Test Difficulty
f. Test Economy
c. Test Objectivity
g. Test Adequacy
d. Test Administration
What to do about low reliability?
– Increase the number of items
– Use factor analysis and item analysis
– Use the correction of attenuation formula – a formula that is being used to determine the exact correlation between two variables if the test is
deemed affected by error
Validity – a judgment or estimate of how well a test measures what it purports to measure in a particular test
1. Types of Validity
a. Face Validity
– the least stringent type of validity, whether a test looks valid to test users, examiners and examinees
– Examples:
✓ An IQ test containing items which measure memory, mathematical ability, verbal reasoning and abstract reasoning has a good face
validity.
✓ An IQ test containing items which measure depression and anxiety has a bad face validity.
✓ A self-esteem rating scale which has items like “I know I can do what other people can do.” and “I usually feel that I would fail on a
task.” has a good face validity.
✓ Inkblot test have low face validity because test takers question whether the test really measures personality.
b. Content Validity
– Definitions and concepts
✓ whether the test covers the behavior domain to be measured which is built through the choice of appropriate content areas, questions,
tasks and items
✓ It is concerned with the extent to which the test is representative of a defined body of content consisting of topics and processes.
✓ Content validation is not done by statistical analysis but by the inspection of items. A panel of experts can review the test items and
rate them in terms of how closely they match the objective or domain specification.
✓ This considers the adequacy of representation of the conceptual domain the test is designed to cover.
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
✓ If the test items adequately represent the domain of possible items for a variable, then the test has adequate content validity.
✓ Determination of content validity is often made by expert judgment.
– Examples:
✓ Educational Content Valid Test – syllabus is covered in the test; usually follows the table of specification of the test. (Table of
specification – a blueprint of the test in terms of number of items per difficulty, topic importance, or taxonomy)
✓ Employment Content Valid Test – appropriate job-related skills are included in the test. Reflects the job specification of the test.
✓ Clinical Content Valid Test – symptoms of the disorder are all covered in the test. Reflects the diagnostic criteria for a test.
– Issues arising from lack of content validity:
✓ Construct underrepresentation-Failure to capture important components of a construct (e.g. An English test which only contains
vocabulary items but no grammar items will have a poor content validity.)
✓ Construct-irrelevant variance-Happens when scores are influenced by factors irrelevant to the construct (e.g. test anxiety, reading
speed, reading comprehension, illness)
c. Criterion-Related Validity
– What is a criterion?
✓ standard against which a test or a test score is evaluated.
✓ A criterion can be a test score, psychiatric diagnosis, training cost, index of absenteeism, amount of time.
✓ Characteristics of a criterion:
• Relevant
• Valid and Reliable
• Uncontaminated: Criterion contamination occurs if the criterion based on predictor measures; the criterion used is a criterion of
what is supposed to be the criterion
– Criterion-Related Validity Defined:
✓ indicates the test effectiveness in estimating an individual’s behavior in a particular situation
✓ Tells how well a test corresponds with a particular criterion.
✓ A judgment of how adequately a test score can be used to infer an individual’s most probable standing on some measure of interest.
– Types of Criterion-Related Validity:
✓ Concurrent Validity – the extent to which test scores may be used to estimate an individual’s present standing on a criterion
✓ Predictive – the scores on a test can predict future behavior or scores on another test taken in the future
✓ Incremental Validity – this type of validity is related to predictive validity wherein it is defined as the degree to which an additional
predictor explains something about the criterion measure that is not explained by predictors already in use
d. Construct Validity
– What is a construct?
✓ An informed scientific idea developed or hypothesized to describe or explain a behavior; something built by mental synthesis.
✓ Unobservable, presupposed traits; something that the researcher thought to have either high or low correlation with other variables
– Construct Validity defined
✓ A test designed to measure a construct must estimate the existence of an inferred, underlying characteristic based on a limited sample
of behavior
✓ Established through a series of activities in which a researcher simultaneously defines some construct and develops instrumentation to
measure it.
✓ A judgment about the appropriateness of inferences drawn from test scores regarding individual standings on a variable called
construct.
✓ Required when no criterion or universe of content is accepted as entirely adequate to define the quality being measured.
✓ Assembling evidence about what a test means.
✓ Series of statistical analysis that one variable is a separate variable.
✓ A test has a good construct validity if there is an existing psychological theory which can support what the test items are measuring.
✓ Establishing construct validity involves both logical analysis and empirical data. (Example: In measuring aggression, you have to check
all past research and theories to see how the researchers measure that variable/construct)
✓ Construct validity is like proving a theory through evidences and statistical analysis.
– Evidences of Construct Validity
✓ Test is homogenous, measuring a single construct.
• Subtest scores are correlated to the total test score.
• Coefficient alpha may be used as homogeneity evidence.
• Spearman Rho can be used to correlate an item to another item.
• Pearson or point biserial can be used to correlate an item to the total test score. (item-total correlation)
✓ Test score increases or decreases as a function of age, passage of time, or experimental manipulation.
• Some variable/construct are expected to change with age.
✓ Pretest, posttest differences
• Difference of scores from pretest and posttest of a defined construct after careful manipulation would provide validity
✓ Test scores differ from groups.
• Also called a method of contrasted group
• T-test can be used to test the difference of groups.
✓ Test scores correlate with scores on other test in accordance to what is predicted.
• Discriminant Validation
o Convergent Validity – a test correlates highly with other variables with which it should correlate (example: Extraversion
which is highly correlated sociability)
o Divergent Validity – a test does not correlate significantly with variables from which it should differ (example: Optimism
which is negatively correlated with Pessimism)
• Factor Analysis – a retained statistical technique for analyzing the interrelationships of behavior data
o Principal Components Analysis – a method of data reduction
o Common Factor Analysis – items do not make a factor, the factor should predict scores on the item and is classified into two
(Exploratory Factor Analysis for summarizing data and Confirmatory Factor Analysis for generalization of factors)
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
• Cross-Validation - Revalidation of the test to a criterion based on another group different from the original group from which the
test was validated
o Validity Shrinkage – decrease in validity after cross validation.
o Co-validation – validation of more than one test from the same group.
o Co-norming – norming more than one test from the same group
2. Test Bias
– This is a factor inherent in a test that systematically prevents accurate, impartial measurement
✓ Rating Error – a judgment resulting from the intentional or unintentional misuse of rating scales
• Severity Error/Strictness Error – less than accurate rating or error in evaluation due to the rater’s tendency to be overly critical
• Leniency Error/Generosity Error – a rating error that occurs as a result of a rater’s tendency to be too forgiving and insufficiently
critical
• Central Tendency Error – a type of rating error wherein the rater exhibits a general reluctance to issue ratings at either a positive
or negative extreme and so all or most ratings cluster in the middle of the rating continuum
• Proximity Error – rating error committed due to proximity/similarity of the traits being rated
• Primacy Effect – “first impression” affects the rating
• Contrast Effect – the prior subject of assessment affects the latter subject of assessment
• Recency Effect – tendency to rate a person based from recent recollections about that person
• Halo Effect – a type of rating error wherein the rater views the object of the rating with extreme favour and tends to bestow ratings
inflated in a positive direction
• Impression Management
• Acquiescence
• Non-acquiescence
• Faking-Good
• Faking-Bad
3. Test Fairness
– This is the extent to which a test is used in an impartial, just and equitable way
4. Factors Influencing Test Validity
a. Appropriateness of the test
e. Test Construction factors
b. Directions/Instructions
f. Length of Test
c. Reading Comprehension Level
g. Arrangement of Items
d. Item Difficulty
h. Patterns of Answer
C.
Norms – designed as reference for evaluating or interpreting individual test scores
1. Basic Concepts
a. Norm - Behavior that is usual or typical for members of a group.
b. Norms - Reference scores against which an individual’s scores are compared.
c. Norming - Process of establishing test norms.
d. Norman - Test developer who will use the norms.
2. Establishing Norms
a. Target Population
b. Normative Sample
c. Norm Group
- Size
- Geographical Location
- Socioeconomic Level
3. Types of Norms
a. Developmental Norms
– Mental Age
* Basal Age
* Ceiling Age
* Partial Credits
– Intelligence Quotient
– Grade Equivalent Norms
– Ordinal Scales
- Ethnicity
- Age Group
b. Within Group Norms
– Percentiles
– Standard Scores
c. Relativity Norms
– National Norms
– Co-norms
– Local Norms
– Subgroup Norms
CHAPTER VI: TEST DEVELOPMENT
A.
Standardization
1. When to decide to standardize a test?
a. No test exists for a particular purpose
b. The existing tests for a certain purpose are not adequate for one reason or the another
2. Basic Premises of standardization
– The independent variable is the individual being tested
– The dependent variable is his behavior
– Behavior = person x situation
– In psychological testing, we make sure that it is the person factor that will ‘stand out’ and the situation factor is controlled
– Control of extraneous variables = standardization
3. What should be standardized?
a. Test Conditions
– There should be uniformity in the testing conditions
– Physical condition
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
– Motivational condition
b. Test Administration Procedure
– There should be uniformity in the instructions and administration proper. Test administration includes carefully following standard procedures
so that the test is used in the manner specified by the test developers. The test administrator should ensure that test takers work within
conditions that maximize opportunity for optimum performance. As appropriate, test takers, parents, and organizations should be involved in
the various aspects of the testing process
– Sensitivity to Disabilities: try to help the disable subject overcome his disadvantage, such as increasing voice volume or refer to other available
tests
– Desirable Procedures of Group Testing: Be care for time, clarity, physical condition (illumination, temperature, humidity, writing surface and
noise), and guess.
c. Scoring
– There should be a consistent mechanism and procedure in scoring. Accurate measurement necessitates adequate procedures for scoring
the responses of test takers. Scoring procedures should be audited as necessary to ensure consistency and accuracy of application.
d. Interpretation
– There should be common interpretations among similar results. Many factors can impact the valid and useful interpretations of test scores.
These can be grouped into several categories including psychometric, test taker, and contextual, as well as others.
a. Psychometric Factors: Factors such as the reliability, norms, standard error of measurement, and validity of the instrument are important
when interpreting test results. Responsible test use considers these basic concepts and how each impacts the scores and hence the
interpretation of the test results.
b. Test Taker Factors: Factors such as the test taker’s group membership and how that membership may impact the results of the test is a
critical factor in the interpretation of test results. Specifically, the test user should evaluate how the test taker’s gender, age, ethnicity, race,
socioeconomic status, marital status, and so forth, impact on the individual’s results.
c. Contextual Factors: The relationship of the test to the instructional program, opportunity to learn, quality of the educational program, work
and home environment, and other factors that would assist in understanding the test results are useful in interpreting test results. For
example, if the test does not align to curriculum standards and how those standards are taught in the classroom, the test results may not
provide useful information.
4. Tasks of test developers to ensure uniformity of procedures in test administration:
– Prepare a test manual containing the ff:
i. Materials needed (test booklets & answer sheets)
ii. Time limits
iii. Oral instructions
iv. Demonstrations/examples
v. Ways of handling querries of examinees
5. Tasks of examiners/test users/psychometricians
– Ensure that test user qualifications are strictly met (training in selection, administration, scoring and interpretation of tests as well as the required
license)
– Advance preparations
i. Familiarity with the test/s
ii. Familiarity with the testing procedure
iii. Familiarity with the instructions
iv. Preparation of test materials
v. Orient proctors (for group testing)
6. Standardization sample
– A random sample of the test takers used to evaluate the performance of others
– Considered a representative sample if the sample consists of individuals that are similar to the group to be tested
B.
Objectivity
1. Time-Limit Tasks – every examinee gets the same amount of time for a given task
2. Work-Limit Tasks – every examinee has to perform the same amount of work
3. Issue of Guessing
C.
Stages in Test Development
1. Test Conceptualization – in creating a test plan, specify the following:
– Objective of the Test
– Clear definition of variables/constructs to be measured
– Target Population/Clientele
– Test Constraints and Conditions
– Content Specifications (Topics, Skills, Abilities)
– Scaling Method
✓ Comparative scaling
✓ Non-comparative scaling
– Test Format
✓ Stimulus (Interrogative, Declarative, Blanks, etc.)
✓ Mechanism of Response (Structured vs. Free)
✓ Multiple Choice
• more answer options (4-5) reduce the chance of guessing that an item is correct
• many items can aid in student comparison and reduce ambiguity, increase reliability
• Easy to score
• measures narrow facets of performance
• reading time increased with more options
• transparent clues (e.g., verb tenses or letter uses “a” or “an”) may encourage guessing
• difficult to write four or five reasonable choices
• takes more time to write questions
• test takers can get some correct answers by guessing
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
2.
3.
4.
5.
D.
✓ True or False
• Ideally a true/false question should be constructed so that an incorrect response indicates something about the student's
misunderstanding of the learning objective.
• This may be a difficult task, especially when constructing a true statement
Test Construction – be mindful of the following test construction guidelines:
– Deal with only one central thought in each item.
– Avoid irrelevant information.
– Be precise.
– Present items in a positive language
– Be brief.
– Avoid double negatives
– Avoid awkward wordings or dangling constructs.
– Avoid terms like “all” and “none”
Test Tryout
Item Analysis (Factor Analysis for Typical-Performance Tests)
Test Revision
Item Analysis
– Measures and evaluates the quality and appropriateness of test questions
– How well the items could measure ability/trait
1. Classical Test Theory
– Analyses are the easiest and the most widely used form of analyses
– Often called the “true-score model” which involves the true score formula:
𝑋𝑡𝑒 = 𝑟𝑥𝑥 (𝑋 − 𝑋̅ ) + 𝑋̅
Where:
𝑋𝑡𝑒 = True Score
𝑋 = Raw Score
𝑟𝑥𝑥 = Correlation Coefficient
𝑋̅ = Mean Score
– Assumes that a person’s test score is comprised of their “true score” plus some measurement error (X = T + e)
– Employs the following statistics
a. Item difficulty
– The proportion of examinees who got the item correctly
– The higher the item mean, the easier the item is for the group; the lower the item mean, the more difficult the item is for the group
– Formula: =
Nu + Nl
– Formula: =
Nu − Nl
N
where: Nu = number of students from the upper group who answered the item correctly
Nl = number of students from the lower group who answered the item correctly
N = total number of examinees
– 0.00-0.20 :
Very Difficult
:
Unacceptable
– 0.21-0.40 :
Difficult
:
Acceptable
– 0.41-0.60 :
Moderate
:
Highly Acceptable
– 0.61-0.80 :
Easy
:
Acceptable
– 0.81-1.00 :
Very Easy
:
Unacceptable
b. Item discrimination
– measure of how well an item is able to distinguish between examinees who are knowledgeable and not
– how well is each item related to the trait
– The discrimination index range is between -1.00 to +1.00
– The closer the index to +1, the more effectively the item distinguishes between the two groups of examinees
– The acceptable index is 0.30 and above
c.
d.
e.
f.
1
N
2
where: Nu = number of students from the upper group who answered the item correctly
Nl = number of students from the lower group who answered the item correctly
N = total number of examinees
– 0.40-above :
Very Good Item
:
Highly Acceptable
– 0.30-0.39 :
Good Item
:
Acceptable
– 0.20-0.29 :
Reasonably Good Item:
For Revision
– 0.10-0.19 :
Difficult Item
:
Unacceptable
– Below 0.19 :
Very Difficult Item
:
Unacceptable
Item reliability index - the higher the index, the greater the test’s internal consistency
Item validity index - the higher the index, the greater the test’s criterion-related validity
Distracter Analysis
– All of the incorrect options, or distractors, should be equally distracting
– preferably, each distracter should be equally selected by a greater proportion of the lower scorers than of the top group
Overall Evaluation of Test Items
DIFFICULTY LEVEL
DISCRIMINATIVE POWER
ITEM EVALUATION
Highly Acceptable
Highly Acceptable
Very Good Item
Highly Acceptable/ Acceptable
Acceptable
Good Item
Highly Acceptable/ Acceptable
Unacceptable
Revise the Item
Unacceptable
Highly Acceptable/ Acceptable
Discard the Item
Unacceptable
Unacceptable
Discard the Item
2. Item-Response Theory (Latent Trait Theory)
– Sometimes referred to as “modern psychometrics”
– Latent trait models aim to look beyond that at the underlying traits which are producing the test performance
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
CHAPTER VII: ETHICAL STANDARDS IN PSYCHOLOGICAL ASSESSMENT
A.
B.
C.
Ethics
1. Ethics Defined
– The moral framework that guides and inspires the Professional
– An agreed-on set of morals, values, professional conduct and standards accepted by a community, group, or culture
– A social, religious, or civil code of behavior considered correct, especially that of a particular group, profession, or individual
2. Professional Ethics
– It is the core of every discipline
– Addresses professional conduct and ethical behavior, issues of confidentiality, ethical principles and professional code of ethics, ethical decisionmaking
– Provide a mechanism for professional accountability
– Serve as a catalyst for improving practice
– Safeguard our clients
3. All professional ethics have relationships and dissimilarities, but all focus on:
– Protecting clients
– Professionals scope of competency
– No harm by acting responsibly and avoiding exploitation
– Protecting confidentiality and privacy
– Maintaining the integrity of the profession
4. Functions and Purposes of Ethical Codes
– Identify values for members of the organization to strive for as they perform their duties
– Set boundaries for both appropriate and inappropriate behavior
– Provide guidelines for practitioners facing difficult situations encountered in the course of work performance
– Communicate a framework for defining and monitoring relationship boundaries of all types
– Provide guidelines for day-to-day decision-making by all professionals along with the staff and volunteers in the organization
– Protect integrity and reputation of the professional and/or individual members of an organization and the organization itself
– Establish high standards of ethical and professional conduct within the culture of the organization
– Protect health and safety of clients, while promoting quality of services provided to them
– Enhance public safety
5. Limitations of Ethical Codes
– Codes can lack clarity
– A code can conflict with another code, personal values, organizational practice, or local laws and regulations
– Codes are usually reactive rather than proactive
– A code may not be adaptable to another cultural setting
6. Ethical Values
– Basic beliefs that an individual think to be true
– The bases on which an individual makes a decision regarding good or bad, right or wrong, most important or least important
– Cultural, guiding social behavior
– Organizational, guiding business or other professional behavior
7. Universal Ethical Values
– Autonomy: Enhance freedom of personal identity
– Honesty and Candor: Tell the truth
– Obedience: Obey legal and ethically permissible
– Fidelity: Don’t break promises
directives
– Loyalty: Don’t abandon
– Conscientious Refusal: Disobey illegal or unethical
– Diligence: Work hard
directives
– Discretion: Respect confidentiality and privacy
– Beneficence: Help others
– Self-improvement: Be the best that you can be
– Gratitude: “Giving back,” or passing good along to others
– Non-maleficence: Don’t hurt anyone
– Competence: Be knowledgeable and skilled
– Restitution: Make amends to persons injured
– Justice: Be fair, distribute by merit
– Self-interest: Protect yourself
– Stewardship: Use resources judiciously
8. Law and Ethics
– Law presents minimum standards of behavior in a professional field
– Ethics provides the ideal for use in decision-making
Common Ethical Issues and Debates
1. When to break confidentiality?
5. Acceptance of gifts
2. Release of psychological reports to the public
6. Dehumanization
3. Golden rule in assessing and diagnosing public figures
7. Divided Loyalties
4. Multiple relationships
8. Labelling and Self-Fulfilling Prophecy
Psychological Association of the Philippines (PAP) Ethical Principles
1. Respect for Dignity of Persons and Peoples
– Respect for the unique worth and inherent dignity of all human beings;
– Respect for the diversity among persons and peoples;
– Respect for the customs and beliefs of cultures.
2. Competent caring for the well-being of persons and peoples
– Maximizing benefits, minimizing potential harm, and offering or correcting harm.
– Application of knowledge and skills that are appropriate for the nature of a situation as well as social and cultural context.
– Adequate self-knowledge of how one’s values, experiences, culture, and social context might influence one’s actions and interpretations.
– Active concern for the well-being of individuals, families, groups, and communities;
– Taking care to do no harm to individuals, families, groups, and communities;
– Developing and maintaining competence.
3. Integrity
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
D.
E.
– Integrity is based on honesty, and on truthful, open and accurate communications.
– Maximizing impartiality and minimizing biases
– It includes recognizing, monitoring, and managing potential biases, multiple relationships, and other conflicts of interest that could result in harm
and exploitation of persons and peoples.
– Avoiding incomplete disclosure of information unless complete disclosure is culturally inappropriate, or violates confidentiality, or carries the
potential to do various harm to individuals, families, groups, or communities
– Not exploiting persons or peoples for personal, professional, or financial gain
– Complete openness and disclosure of information must be balanced with other ethical considerations, including the need to protect the safety or
confidentiality of persons and peoples, and the need to respect cultural expectations.
– Avoiding conflicts of interest and declaring them when they cannot be avoided or are inappropriate to avoid.
4. Professional and Scientific responsibilities to society
– We shall undertake continuing education and training to ensure our services continue to be relevant and applicable.
– Generate researches
Roles of a Psychometrician
1. Administering and scoring of objective personality tests; structured personality tests, excluding projective tests and other higher level of psychological
tests;
2. Interpreting the results of these tests and preparing a written report on these results; and
3. Conducting preparatory intake interviews of clients for psychological intervention sessions.
4. All the assessment reports prepared and done by the psychometrician, shall always bear the signature of the supervising psychologist who shall take
full responsibility for the integrity of the report.
Ethical Standards in Psychological Assessment
1. Responsibilities of Test Publishers
– The publisher is expected to release tests of high quality
– The publisher is expected to market product in a responsible manner
– The publisher restrict distributions of test only to person with proper qualification
2. Publication and Marketing Issues
– The most important guideline is to guard against premature release of a test
– The test authors should strive for a balanced presentation of their instruments and refrain from one-sided presentation of information
3. Competence of Test Purchasers
4. Responsibilities of Test Users
– Best interest of clients
– Informed Consent
✓ Must be presented in a clear and understandable manner to both the student & parent.
✓ Reason for the test administration.
✓ tests and evaluations procedures to be used.
✓ How assessment scores will be used.
✓ Who will have access to the results.
✓ Written informed consent must be obtained from the student’s parents, guardian or the student (if he or she has already reached ‘legal’ age).
– Human Relations
– Expertise of Test Users
– Avoiding Harassments
– Obsolete Tests and The Standard of Care
– Duty to Warn
– Consideration of Individual Differences
– Confidentiality
5. Appropriate Assessment Tool Selection
– Criteria for test selection
✓ It must be relevant to the problem
✓ Adaptable to the time available
✓ Appropriate for the patient/client
✓ Valid and reliable
✓ Familiar to the examiner
– Need for battery testing
✓ No single test proves to yield a diagnosis in all cases, or to be in all cases correct in the diagnosis it indicates.
✓ Psychological maladjustment whether mild or severe may encroach any or several of the functions tapped by the tests, leaving other
functions absolutely or relatively unimpaired.
– What test users should do?
✓ First define the purpose for testing and the population to be tested. Then, select a test for that purpose and that population based on a
thorough review of the available information and materials.
✓ Investigate potentially useful sources of information, in addition to test scores, to corroborate the information provided by tests.
✓ Read the materials provided by test developers and avoid using tests for which unclear or incomplete information is provided.
✓ Become familiar with how and when the test was developed and tried out.
✓ Read independent evaluations of a test and of possible alternative measures. Look for evidence required in supporting the claims of test
developers.
✓ Examine specimen sets, disclosed tests or samples of questions, directions, answer sheets, manuals, and score reports before selecting a
test.
✓ Ascertain whether the test content and norm group(s) or comparison group(s) is appropriate for the intended test takers.
✓ Select and use only those tests for which the skills needed to administer the test and interpret scores correctly are available.
6. Test Administration, Scoring and Interpretation
– Basic principles
✓ To ensure fair testing, the tester must become thoroughly familiar with the test. Even a simple test usually presents one or more stumbling
blocks which can be anticipated if the tester studies the manual in advance or even takes time to take the test himself before administering.
✓ The tester must maintain an impartial and scientific attitude. The tester must be keenly interested with the persons they test, and desire to
see them do well. It is the duty of the tester to obtain from each subject the best record he can produce.
✓ Establishing and maintaining rapport is necessary if the subject is to do well. That is, the subject must feel that he wants to cooperate with
the tester. Poor rapport is evident by the presence of inattention during directions, giving up before time is up, restlessness or finding fault
with the test.
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
✓ In case of individual testing, where each question is given orally, unintended help can be given by facial expression or words of
encouragement. Thereon, taking test is always concerned to know how well he is doing and watches the examiner for indications of his
success. The examiner must maintain a completely unrevealing expression, while at the same time silently assuring the subject of his
interest in what he says or do.
✓ In individual testing, the tester observes the subject’s performance with care. He notes the time to complete each task and any errors, he
watches for any unusual method of approaching the task. Observation and note taking must be done in a subtle and unobtrusive manner so
as not to indirectly or directly affect the subject’s performance of the task
– General Procedures/Guidelines
✓ Conditions of testing
• Physical Condition. The physical condition where the test is given may affect the test scores. If the ventilation and lighting are poor,
the subject will be handicapped.
• Condition of the Person. Sate of the person affects the results, if the test is given when he is fatigued, when his mind is concerned
with other problems, or when he is emotionally disturbed, results will not be a fair sample of his behavior.
• Test Condition. The testing condition can often be improved by spacing the tests to avoid cumulative fatigue. Test questionnaires,
answer sheets and other testing materials needed must always be in good condition so as not to hinder good performance.
• Condition of the Day. Time of the day may influence scores, but is rarely important. Alert subjects are more likely to give their best
than subjects who are tired and dispirited. Equally good results can be produced at any hour, however, if the subjects want to do
well.
✓ Control of the group
• Group tests are given only to those reasonably and cooperative subjects who expects to do as the tester requests. Group testing
then, is a venue for a problem in command.
• Directions should be given simply, clearly and singly. The subjects must have a chance to ask questions whenever they are
necessary but the examiner attempts to anticipate all reasonable questions by full directions.
• Effective control may be combined with good rapport if the examiner is friendly, avoid an antagonistic, overbearing or fault attitude.
• The goal of the tester is to obtain useful information about people; that is to elicit good information from the results of the test. There
is no value adhering rigidly to a testing schedule if the schedule will not give true information. Common sense is the only safe guide
in exceptional situations.
✓ Directions of the subject
• The most important responsibility of the test administrator is giving directions.
• It is imperative that the tester gives the directions exactly as provided in the manual. If the tester understands the importance of this
responsibility, it is simple to follow the printed directions, reading them word for word, adding nothing and changing nothing.
✓ Judgments left to the examiner
• The competent examiner must possess a high degree of judgment, intelligence, sensitivity to the reactions of others, and
professionalism, as well as knowledge with regards to scientific methods and experience in the use of psychometric techniques.
• No degree of mechanical perfection of the test themselves can ever take the place of good judgment and psychological insight of
the examiner.
✓ Guessing
• It is against the rules for the tester to give supplementary advices; he must retreat to such formula as “Use your judgment.” (But the
tester is not to give his group an advantage by telling them this trade secret.)
• The person taking the test is usually wise to guess freely. (But the tester is not to give his group an advantage by telling them this
trade secret.)
• From the point of view of the tester, the tendency to guess is an unstandardized aspect of the testing situation which interferes with
accurate measurement.
• The systematic advantage of the guesser is eliminated if the test manual directs everyone to guess, but guessing introduces large
chances of errors. Statistical comparison of “do not guess” instruction and “do guess” instruction show that with the latter, the test
has slightly lesser predictive value.
• The most widely accepted practice now is to educate students that wild guessing is to their disadvantage, but to encourage them to
respond when they can make an informed judgment as to the most reasonable answer even if they are uncertain.
• The motivation most helpful to valid testing is a desire on the part of the subject that the score be valid. Ideally the subject becomes
a partner in testing himself. The subject must place himself on a scale, and unless he cares about the result he cannot be
measured accurately.
• The desirability of preparing the subject for the test by appropriate advance information is increasingly recognized. This information
increases the person’s confidence, and reduces standard test anxiety that they might otherwise have.
– Scoring
✓ Hand scoring
✓ Machine scoring
7. Responsible Report Writing and Communication of Test Results
– What is a psychological report?
✓ an abstract of a sample of behavior of a patient or a client derived from results of psychological tests.
✓ A very brief sample of one’s behavior
– Criteria for a good psychological report
✓ Individualized – written specifically for the client
✓ Directly and adequately answers a referral question
✓ Clear – written in a language that can be easily understood
✓ Meaningful – perceived by the reader as clear and is understood by the reader
✓ Synthesized – details are formed into broader concepts about the specific person
✓ Delivered on time
– Principles of value in writing individualized psychological report
✓ Avoid mentioning general characteristics, which could describe almost anyone, unless the particular importance in the given case is made
clear.
✓ Describe the particular attributes of the individual fully, using as distinctive terms as possible.
✓ Simple listing of characteristics is not helpful; tell how they are related and organized in the personality.
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
✓ Information should be organized developmentally with respect to the time line of the individual life.
✓ Many of the problems of poor reports, such as vague generalizations, overqualification, clinging to the immediate data, stating the obvious
and describing stereotypes are understandable but undesirable reactions to uncertainty.
✓ Validate statements with actual behavioral responses.
✓ Avoid, if possible, the use of qualities such as “It appears”, “tends to”, etc. for these convey the psychologist’s uncertainties or indecisions.
✓ Avoid using technical terms. Present them using layman’s language
– Levels of Psychological Interpretation
✓ Level I
• There is minimal amount of any sort of
• Data are primarily treated in a sampling or correlate way
interpretation
• There is no concern with underlying constructs
• There is a minimal concern with intervening
• Found in large-scale selection testing
processes
• For psychometric approaches
✓ Level II
• Descriptive generalizations - From the particular behaviors observed, we generalize to more inclusive, although still largely behavioral
and descriptive categories. Thus, they note, a clinician might observe instances of slow bodily movements and excessive delays in
answering questions and from this infer that the patient is “retarded motorically.” With the further discovery that the patient eats and
sleeps poorly, cries easily, reports a constant sense of futility and discouragement and shows characteristic test behaviors, the
generalization is now broadened as “depressed.”
• Hypothetical constructs - Assumption of an inner state which goes logically beyond description of visible behavior. Such constructs
imply causal conditions, related personality traits and behaviors and allow prediction of future events. It is the movement from
description to construction which is the sense of clinical interpretation
✓ Level III
• The effort is to develop a coherent and inclusive theory of the individual life or a “working image” of the patient. In terms of a general
theoretical orientation, the clinician attempts a full-scale exploration of the individual’s personality, psychosocial situation, and
developmental history
– Sources of Error in Psychological Interpretation
✓ Information Overload
• Too much material, making the clinician overwhelmed
• Studies have been shown that clinical judges typically use less information than is available to them
• The need is to gather optimal, rather than maximal, amount of information of a sort digestible by the particular clinician
• Obviously, familiarity with the tests involved, type of patient, referral questions and the like figure in deciding how much of what kind of
material is collected and how extensible it can be interpreted
✓ Schematization
• All humans have a limited capacity to process information and to form concepts
• Consequently, the resulting picture is of the individual is schematized and simplified, perhaps catering to one or a few salient and
dramatic and often, pathological, characteristics
• The resulting interpretations are too organized and consistent and the person emerges as a two-dimensional creature
• The clinical interpreter has to be able to tolerate complexity and deal at one time with more data than he can comfortably handle
✓ Insufficient internal evidence for interpretation
• Ideally, interpretations should emerge as evidence converges from many sources, such as different responses and scores of the same
tests, responses of different tests, self-report, observation, etc.
• Particularly for interpretations at higher levels, supportive evidence is required
• Results from lack of tests, lack of responses
• Information between you and the client
✓ Insufficient external verification of interpretation
• Too often clinicians interpret assessment material and report on the patients without further checking on the accuracy of their
statements
• Information between you and the relevant others
• Verify statements made by patients
✓ Overinterpretation
• “Wild analysis”
• Temptation to over-interpret assessment material in pursuit of a dramatic or encompassing formulation
• Deep interpretations, seeking for unconscious motives and nuclear conflicts or those which attempt genetic reconstruction of the
personality are always to be made cautiously and only on the basis of convincing evidence
• Interpreting symbols in terms of fixed meanings is a cheap and usually inaccurate attempt at psychoanalytic interpretation
• At all times, the skillful clinician should be able to indicate the relationship between the interrupted hypothetical variable and its
referents to overt behavior
✓ Lack of Individualization
• It is perfectly possible to make correct statements which are entirely worthless because they could as well apply to anyone under most
conditions
• “Aunt Fanny syndrome”/”PT Barnum Effect”
• What makes the person unique (e.g., both patients are anxious – how does one patient manifest his anxiety)
✓ Lack of Integration
• Human personality is organized and integrated usually in hierarchical system
• It is of central importance to understand which facets of the personality are most central and which are peripheral, which needs to sub
serve others and how defensive, coping and ego functions are organized, if understanding of the personality is to be achieved
• Over-cautiousness, insufficient knowledge or a lack of a theoretical framework are sometimes revealed in contradictory interpretations
made side by side
• On the face of it, someone cannot be called both domineering and submissive
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
lOMoARcPSD|3551091
RGO 2018 Review Season
jhojo.012895@gmail.com
F.
✓ Overpathologizing
• Always highlights the negative not the positive aspect of behavior
• Emphasizes the weakness rather than the strengths of a person
• A Balance between the positive and negative must be the goal
• Sandwich method (positive-negative-positive) is a recommended approach
✓ Over-“psychologizing”
• Giving of interpretation when there is none (e.g., scratching of hands – anxious, itchy)
• Avoid generalized interpretations of overt behaviors
• Must probe into the meaning/motivations behind observed behaviors
– Essential Parts of a Psychological Report
✓ Industrial setting
• Identifying Information
• Skills and Abilities
• Test administered
• Personality Profile
• Test Results
• Summary/Recommendation
✓ Clinical setting
• Personal Information
• Test results and interpretation
• Referral question
• Summary formulation
• Test administered
• Diagnostic Impression
• Behavioral observation (Test and Interview)
• Recommendation
Rights of Test Takers
1. Be treated with courtesy, respect, and impartiality, regardless of your age, disability, ethnicity, gender, national origin, religion, sexual orientation or
other personal characteristics
2. Be tested with measures that meet professional standards and that are appropriate, given the manner in which the test results will be used
3. Receive information regarding their test results
4. Least stigmatizing label
5. Informed Consent
6. Privacy and Confidentiality
CHAPTER VIII: COMMON PSYCHOLOGICAL TESTS
A.
B.
C.
D.
Individually Administered Intelligence Tests
1. Stanford-Binet 5
2. Wechsler Scales
a. WPPSI
b. WISC c. WAIS
3. Comprehensive Test of Nonverbal Intelligence (CTONI)
4. Kaufman Assessment Battery for Children-Second Edition
5. Woodcock-Johnson III Complete Battery
6. Slosson Intelligence Scale
7. Universal Nonverbal Intelligence Test II
Group Administered Intelligence Tests
1. Raven’s Progressive Matrices
2. Standard Progressive Matrices
3. Advanced Progressive Matrices
4. Culture Fair Intelligence Test
5. Purdue Non-Language Test
6. SRA Verbal and Nonverbal Form
7. Thurstone Test of Mental Alertness
8. Revised Beta Examination
9. Wonderlic Cognitive Ability Tests
10. Otis-Lennon Mental Ability Test
11. Watson Glaser Critical Thinking Test
12. Panukat ng Katalinuhang Pilipino
Aptitude Tests
1. Differential Aptitude Tests (Fifth Edition)
2. Detroit Test of Learning Aptitude
3. Flanagan Industrial Tests
4. Armed Services Vocational Aptitude Battery
5. Employee Aptitude Survey
6. Standardized Aptitude Test for Teachers
7. Multidimensional Aptitude Battery II
8. OASIS-3 Aptitude Survey
9. Wiesen Test of Mechanical Aptitude
10. Philippine Aptitude Classification Test
Personality Tests
1. 16 Personality Factors
2. Myers-Briggs Type Indicator
3. Emotions Profile Index
E.
F.
4. Minnesota Multiphasic Personality Inventory - II
5. NEO Personality Inventory - III
6. Basic Personality Inventory
7. California Psychological Inventory
8. Personality Inventory for Children – II
9. Edward’s Personality Preference Schedule
10. BarOn Emotional Quotient Inventory
11. Taylor-Johnson Temperament Analysis
12. Panukat ng Ugali at Pagkatao
13. Panukat ng Ugaling Pilipino
Projective Tests
1. Word Association Method
2. Sentence Completion Test
a. Sack’s Sentence Completion Test
b. Rotter’s Incomplete Sentence Blank
c. Forer Structure Sentence Completion Test
3. Projective Drawings
a. Draw a person test (a person, person of the opposite sex,
and self)
b. Draw a person Intellectual Ability Test for Children &
Adults
c. House-Tree-Person
d. Kinetic Family Drawing
4. Apperception Tests
a. Children’s Apperception Test
b. Thematic Apperception Test
c. Philippine Thematic Apperception Test
5. Rorschach Inkblot Test
Neuropsychological Tests
1. Bender-Gestalt Motor Visual Test II
2. Wechsler Memory Scale
3. Trail Making Test
4. Rey-Osterrieth Complex Figure Test
5. Benton Test of Visual Memory
6. The Rivermead Behavioral Memory Test
7. Severe Cognitive Impairment Profile
Downloaded by Ynkats Hreshtak (idolorwalid@yahoo.com)
Download