+ AWL = 86 % of the Academic Corpus

advertisement

Hüsem Korkmaz

MA TEFL

Averil Coxhead

 was developed from a corpus of 5 million words with the needs of ESL/EFL learners in mind,

 contains the most widely useful 2,000 word families in

English. The

GSL has been criticised for its size (Engels, 1968), age

(Richards, 1974), and need for revision (Hwang, 1989).

Despite these criticisms, the GSL covers up to

90% of fiction texts up to 75% of nonfiction texts up to 76% of the

Academic Corpus

 Campion and

Elley (1971)

 Praninskas

(1972)

 Lynn (1973)

 Ghadessy (1979)

 Xue and Nation

(1984) University

Word List (UWL)

Corpora and identified words that occured across a range of texts

Student annotatitons above words in textbooks

Combination of the four lists mentioned

The UWL has been widely used by learners, teachers, course designers, and researchers.

However, as an amalgam of the four different studies, it lacked consistent selection principles and had many of the weaknesses of the prior work.

The corpora on which the studies were based were small and did not contain a wide and balanced range of topics.

setting vocabulary goals for language courses guiding learners in their independent study informing course and material designers in selecting texts and developing learning activities

 there is a need for a new academic word list based on data gathered from a large, well-designed corpus of academic English.

 The ideal word list would be divided into smaller, frequency-based sublists to aid in the sequencing of teaching and in materials development.

 A word list based on the occurrence of word families in a corpus of texts representing a variety of academic registers can provide information about how words are actually used

REPRESENTATION ORGANIZATION

SIZE WORD SELECTION

1.

Which lexical items occur frequently and uniformly across a wide range of academic material but are not among the first 2,000 words of English as given in the GSL (West, 1953)?

2.

Do the lexical items occur with different frequencies in arts, commerce, law, and science texts?

1.

2.

3.

4.

What percentage of the words in the Academic

Corpus does the AWL cover?

Do the lexical items identified occur frequently in an independent collection of academic texts?

How frequently do the words in the AWL occur in nonacademic texts?

How does the AWL compare with the UWL (Xue &

Nation, 1984)?

To evaluate the AWL, its coverage of these is discussed and compared with the UWL:

 (a) the Academic Corpus along with the GSL

(West, 1953)

 (b) a second collection of academic texts

 (c) a collection of fiction texts

Collecting text in electronic form

Removing its bibliography

Counting its words

Balancing the number of short, mid-lenght, and long texts

Each text was inserted into its subject area computer file

Each subject-area file was inserted into a discipline master file

= 414 academic texts, 400 authors, 3,513,330 tokens, 70,377 types in 11,666 pages

Selection criteria for words:

1.

Which lexical items occur frequently and uniformly across a wide range of academic material but are not among the first 2,000 words of English as given in the GSL (West, 1953)?

570 word families

2.

Do the lexical items occur with different frequencies in arts, commerce, law, and science texts?

14

12

10

8

6

4

2

0

12 %

Commerce

Percentages within disciplines

9,4 %

Law

9,3 %

Arts

9,1 %

Science

1.

What percentage of the words in the Academic

Corpus does the AWL cover?

The AWL accounts for 10.0 of the tokens in the Academic Corpus

GSL (2000 words) + AWL =

86 % of the Academic Corpus

2.

Do the lexical items identified occur frequently in an independent collection of academic texts?

A second corpus was made up of texts that had met the criteria for inclusion in the Academic Corpus but were not included either because they were collected too late or because the subject area they belonged to was already completer

3.

How frequently do the words in the AWL occur in nonacademic texts?

The Academic Wordlist – Project Gutenberg(50 texts)

• AWL accounts for 1,4 % of the tokens in PG collection

(lower than AWL’s 10 % coverage of Academic Corpus)

• Huge difference suggests AWL word families are mostly associated with academic writing

4.

How does the AWL compare with the UWL (Xue &

Nation, 1984)?

Though smaller than UWL, AWL gives a better return on learning, as students need to learn only

570 word families instead of 836 for the same coverage of academic texts.

UWL

401

435 word families

51 %

135

AWL

 UWL includes 570 word families that do not occur in the Academic Corpus. So, even if the students learn these words, they might rarely or never meet them in academic texts.

 AWL is smaller than UWL in size. However, it has a higher coverage of academic texts

 AWL covers a wider range of subject areas

Construct relevant teaching materials

For detailed information:

AWL: http://www.uefap.com/vocab/select/awl.htm

Project Gutenberg: http://www.gutenberg.net

Thanks for your participation and attention…

Hüsem Korkmaz

Download