English Profile Corpus Update
Fiona Barker
Cambridge ESOL
© UCLES - This publication may not be reproduced without written permission from the copyright holder
CEPC and CACE
1. Cambridge English Profile Corpus
2. Cambridge Academic Corpus of English
Cambridge English Profile Corpus
The online data collection portal has been in use for 2.5
years, with a new portal and tasks ready for trialling.
All learners complete:
• a language-learning/background questionnaire
• writing tasks available to all learners, regardless of
their (teacher-assigned) proficiency level
This results in written responses (2 or 3 per learner) and
associated metadata.
Data collected to date
As of January 2012 we have:
• 1.8 million words from 4078 learners (320k last
month)
• Most data come from learners assigned to B levels,
followed by C levels and then A levels
• 41 L1s, seven with >100 learners who’ve
completed all tasks
• 11,400 responses, between 3 and 1217 per
task (due to new task sets being introduced)
Total words collected per CEFR level
[Bar chart. Y-axis: number of words (0 to 1,300,000). Categories: Completed Questions, Abandoned Questions, Total. Series: A1/A2, B1/B2, C1/C2.]
Task popularity
• Task set 1: 8176 responses. Level A students had two
pairs to choose 2 tasks from; B and C level students
had three pairs to choose 3 tasks from.
• Task set 2: 3183 responses. Same for all levels (see
following sample).
• Task set 3: 60 responses. Same for all levels (just
introduced).
Sample tasks from set 2
Look at the pictures. Write a letter to the camera shop.
Complain about your camera.
(412 responses)
Choose 3 or more of the pictures and write a story.
(420 responses)
For this section, choose either
picture and write whatever you
feel like writing!
(427 + 634 responses)
New data collection portal
• New functional multi-level task set with task cycling (20 per
L1 per function).
• Allows longitudinal data to be collected more easily.
• Learners can upload a favourite piece of work and/or
participate in a forum with other learners.
• Corpus interface is under development.
• We aim to validate the level of data already collected and
collect further data to balance what we currently have.
2. Cambridge Academic Corpus of English
• CACE was initiated in summer 2011 by Cambridge ESOL in
collaboration with CUP.
• This will help us to identify the nature of academic English,
including how native and non-native speakers deal with the
challenge.
• It will enable the description of academic language in
relation to L1, academic level, domain, etc.
• This will allow us to specify the C levels (and beyond) for
academic contexts.
What will CACE contain?
• CACE will contain domain-specific writing collected from
sixth form and university students in various contexts.
• Non-native and native writing will be systematically
compared by domain & level.
• CACE will inform English Profile, test development, and
materials development.
CEPC and CACE
Next steps
Cambridge English Profile Corpus
• Validation of level assignment; roll-out of new
portal; development of new tasks; access to
Sketch Engine for EP researchers; access to
online samples for the wider community.
Cambridge Academic Corpus of English
• Recruit project manager; finalise design; begin
data collection; initial exploration of data.