Annexa 3 CATALAN DATABASE FOR CONSUMER APPLICATIONS Author(s): Institute: Address: email: Date: Version: Asunción Moreno, David Conejero, Gonzalo Bustamante Universitat Politècnica de Catalunya Jordi Girona 1-3 08034 Barcelona asuncion@gps.tsc.upc.es December, 2nd, 2006 v1.4 CONTENTS 1 INTRODUCTION 3 1.1 Speech file formats 7 1.2 Directory structure 7 1.3 File nomenclature 8 1.4 Label files 9 1 Annexa 3. Catalan Database for Consumer Applications 2 DATABASE DESIGN AND COLLECTION 12 2.1 Recording platform 12 2.2 Speaker recruitment 15 2.3 Design of prompting and prompt-sheet 15 3 DATABASE CONTENTS DEFINITION 3.1 Free spontaneous speech (F01-F30) 15 17 3.2 Elicited speech 3.2.1 Birth date, current date, relative date (ED1-3) 3.2.2 Time of day (ET1-2) 3.2.3 City names (EC1-2) 3.2.4 Proper names (EP1-3) 3.2.5 Spelling of proper name (EL1) 3.2.6 Yes/No answers (EQ1-2) 3.2.7 Languages (EO1) 3.2.8 Telephone numbers (EN1-3) 18 18 18 19 19 19 20 20 20 3.3 Read speech 3.3.1 Phonetically rich words (W01-05) 3.3.2 Phonetically rich sentences S01-S30 3.3.3 Isolated digits (CI1-4) 3.3.4 Isolated digit sequences (CB1) 3.3.5 Digit strings (CC1-4) 3.3.6 Telephone number (CE1) 3.3.7 Natural numbers (CN1-3) 3.3.8 Money amounts (CM1) 3.3.9 Times (CT1-2) 3.3.10 Dates (CD1-3) 3.3.11 Spelled words (CL1-3) 3.3.12 Proper Name (CP1) 3.3.13 City/street names (CO1-2) 3.3.14 Yes/No (CQ1-2) 3.3.15 Email & Web addresses (CW1-2) 3.3.16 Special keyboard characters (CK1-2) 3.3.17 Application specific commands (101-995), (401-499) 20 20 22 23 23 23 23 24 24 24 25 27 28 28 28 28 28 29 4 TRANSCRIPTION 29 4.1 Foreign words 30 4.2 Unintelligible words 30 4.3 Word Fragments 31 4.4 Verbal Deletions 31 4.5 Non-Speech Acoustic Events 31 4.6 Truncated Waveforms 32 4.7 Combination of markers 33 4.8 Transcription procedure 33 2 Annexa 3. Catalan Database for Consumer Applications 5 LEXICON 34 6 SPEAKER DEMOGRAPHIC INFORMATION 37 6.1 Accent/Regions 37 6.2 Speaker ages 38 7 RECORDING CONDITIONS 39 7.1 Office 39 7.2 Entertainment 40 7.3 Public place 40 7.4 Car 41 8 DEVIATIONS FROM GENERAL SPEECON SPECIFICATIONS 41 9 SAMPLE SHEETS 41 9.1 Sample instruction sheet 41 9.2 Sample prompt sheet 41 10 REFERENCES AND BIBLIOGRAPHY 47 11 APPENDIX 1. 48 1 11.1 List of Proper Names (CP1) 48 11.2 City/street names (CO1-2) 11.2.1 City Names 11.2.2 Street Names 49 49 51 11.3 Email & Web addresses (CW1-2) 11.3.1 CW1: WWW address from a list of 150 URL-addresses 11.3.2 CW2: Email drawn from a set of 550 e-mail addresses 53 53 54 11.4 Application words. 11.4.1 List of synonyms and their codes. 11.4.2 List of words with multiple occurrences. 60 93 94 Introduction The Catalan Database for Consumer applications was sponsored by the Catalan Government. The database follows the specifications of the Speecon EU funded project for adult speakers. Collection was performed at the Department of Signal Theory and Communications of the Universitat Politècnica de Catalunya (UPC) (Spain). 3 Annexa 3. Catalan Database for Consumer Applications Transcriptions and formatting were done by Verbio Technologies. The owner of the database is Generalitat de Catalunya. The database is distributed in 19 DVD-ROM volumes, containing the speech signals, and one CD-ROM volume, which contains documentation only. The contents of each DVD and CD is as follows: CD name: ADULT1CAD00. Contents: Documentation DVD name ADULT1CA_01 Sessions SES000 SES005 SES010 SES015 SES020 SES025 SES001 SES006 SES011 SES016 SES021 SES026 SES002 SES007 SES012 SES017 SES022 SES027 SES003 SES008 SES013 SES018 SES023 SES028 SES004 SES009 SES014 SES019 SES024 SES029 SES030 SES035 SES040 SES045 SES050 SES055 SES031 SES036 SES041 SES046 SES051 SES056 SES032 SES037 SES042 SES047 SES052 SES057 SES033 SES038 SES043 SES048 SES053 SES058 SES034 SES039 SES044 SES049 SES054 SES059 SES060 SES065 SES070 SES075 SES080 SES085 SES061 SES066 SES071 SES076 SES081 SES086 SES062 SES067 SES072 SES077 SES082 SES087 SES063 SES068 SES073 SES078 SES083 SES088 SES064 SES069 SES074 SES079 SES084 SES089 SES090 SES095 SES100 SES105 SES110 SES115 SES091 SES096 SES101 SES106 SES111 SES116 SES092 SES097 SES102 SES107 SES112 SES117 SES093 SES098 SES103 SES108 SES113 SES118 SES094 SES099 SES104 SES109 SES114 SES119 SES120 SES125 SES130 SES135 SES140 SES145 SES121 SES126 SES131 SES136 SES141 SES146 SES122 SES127 SES132 SES137 SES142 SES147 SES123 SES128 SES133 SES138 SES143 SES148 SES124 SES129 SES134 SES139 SES144 SES149 SES150 SES155 SES160 SES165 SES170 SES175 SES151 SES156 SES161 SES166 SES171 SES176 SES152 SES157 SES162 SES167 SES172 SES177 SES153 SES158 SES163 SES168 SES173 SES178 SES154 SES159 SES164 SES169 SES174 SES179 SES180 SES181 SES182 SES183 SES184 ADULT1CA_02 ADULT1CA_03 ADULT1CA_04 ADULT1CA_05 ADULT1CA_06 ADULT1CA_07 4 Annexa 3. Catalan Database for Consumer Applications SES185 SES190 SES195 SES200 SES205 SES186 SES191 SES196 SES201 SES206 SES187 SES192 SES197 SES202 SES207 SES188 SES193 SES198 SES203 SES208 SES189 SES194 SES199 SES204 SES209 SES210 SES215 SES220 SES225 SES230 SES235 SES211 SES216 SES221 SES226 SES231 SES236 SES212 SES217 SES222 SES227 SES232 SES237 SES213 SES218 SES223 SES228 SES233 SES238 SES214 SES219 SES224 SES229 SES234 SES239 SES240 SES245 SES250 SES255 SES260 SES265 SES241 SES246 SES251 SES256 SES261 SES266 SES242 SES247 SES252 SES257 SES262 SES267 SES243 SES248 SES253 SES258 SES263 SES268 SES244 SES249 SES254 SES259 SES264 SES269 SES270 SES275 SES280 SES285 SES290 SES295 SES271 SES276 SES281 SES286 SES291 SES296 SES272 SES277 SES282 SES287 SES292 SES297 SES273 SES278 SES283 SES288 SES293 SES298 SES274 SES279 SES284 SES289 SES294 SES299 SES300 SES305 SES310 SES315 SES320 SES325 SES301 SES306 SES311 SES316 SES321 SES326 SES302 SES307 SES312 SES317 SES322 SES327 SES303 SES308 SES313 SES318 SES323 SES328 SES304 SES309 SES314 SES319 SES324 SES329 SES330 SES335 SES340 SES345 SES350 SES355 SES331 SES336 SES341 SES346 SES351 SES356 SES332 SES337 SES342 SES347 SES352 SES357 SES333 SES338 SES343 SES348 SES353 SES358 SES334 SES339 SES344 SES349 SES354 SES359 SES360 SES365 SES370 SES375 SES380 SES385 SES361 SES366 SES371 SES376 SES381 SES386 SES362 SES367 SES372 SES377 SES382 SES387 SES363 SES368 SES373 SES378 SES383 SES388 SES364 SES369 SES374 SES379 SES384 SES389 SES390 SES395 SES400 SES405 SES410 SES415 SES391 SES396 SES401 SES406 SES411 SES416 SES392 SES397 SES402 SES407 SES412 SES417 SES393 SES398 SES403 SES408 SES413 SES418 SES394 SES399 SES404 SES409 SES414 SES419 ADULT1CA_08 ADULT1CA_09 ADULT1CA_10 ADULT1CA_11 ADULT1CA_12 ADULT1CA_13 ADULT1CA_14 5 Annexa 3. Catalan Database for Consumer Applications ADULT1CA_15 SES420 SES425 SES430 SES435 SES440 SES445 SES421 SES426 SES431 SES436 SES441 SES446 SES422 SES427 SES432 SES437 SES442 SES447 SES423 SES428 SES433 SES438 SES443 SES448 SES424 SES429 SES434 SES439 SES444 SES449 SES450 SES455 SES460 SES465 SES470 SES475 SES451 SES456 SES461 SES466 SES471 SES476 SES452 SES457 SES462 SES467 SES472 SES477 SES453 SES458 SES463 SES468 SES473 SES478 SES454 SES459 SES464 SES469 SES474 SES479 SES480 SES485 SES490 SES495 SES500 SES505 SES481 SES486 SES491 SES496 SES501 SES506 SES482 SES487 SES492 SES497 SES502 SES507 SES483 SES488 SES493 SES498 SES503 SES508 SES484 SES489 SES494 SES499 SES504 SES509 SES510 SES515 SES520 SES525 SES530 SES535 SES511 SES516 SES521 SES526 SES531 SES536 SES512 SES517 SES522 SES527 SES532 SES537 SES513 SES518 SES523 SES528 SES533 SES538 SES514 SES519 SES524 SES529 SES534 SES539 SES540 SES545 SES541 SES546 SES542 SES547 SES543 SES548 SES544 SES549 ADULT1CA_16 ADULT1CA_17 ADULT1CA_18 ADULT1CA_19 Table 1- Distribution of the database into CD- ROM and DVDs The list of the distribution disks and directories are contained in the README.TXT file. Further details regarding the database contents, files and directories are provided in the documentation files in the DOC directory, the speaker, recording conditions, session and lexicon tables in the TABLE directory and contents files in the INDEX directory. File types are identified with the following extensions: *.DOC - Microsoft Word V6.0 document *.LST - DOS text index file with ISO Latin 1 symbols *.TBL - DOS text file with ISO Latin 1 symbols *.TXT - DOS text file *.SES - DOS text file *.CAO - SAM label file, text file with ISO Latin 1 symbols *.CA0 - speech signal file *.CA1 - speech signal file *.CA2 - speech signal file *.CA3 - speech signal file *.PS - Adobe Postcript file Each CD-ROM of the Adult database has the following directory structure: 6 Annexa 3. Catalan Database for Consumer Applications \: COPYRIGH.TXT - copyright notice DISK.ID - UNIX volume ID file README.TXT - describes files in the distribution media ADULT1CA/ - data directory ADULT1CA\DOC: DESIGN.DOC - Catalan database documentation file ISO88591.PS - ISO 8859_1 table SAMPALEX.PS - SAMPA table SUMMAR0.TXT - database contents summary file VALREP.DOC - validation report of the database PLATFORM.DOC - description of recording platform ADULT1CA\INDEX: CONTENT0.LST - file/utterance/speaker index table ADULT1CA\TABLE: SPEAKER.TBL - speaker table REC_CON.TBL - Recording condition table LEXICON.TBL - full lexicon table SESSION.TBL - session table ADULT1CA\: - contains the data block directories BLOCK00\ BLOCK01\ ... ADULT1CA\BLOCK00: SES0002 - session directories for each session ADULT1CA\BLOCK00\SES0002: SA002ED1.CAO - SAM label file SA002ED1.CA0 - speech signal file SA002ED1.CA1 - speech signal file SA002ED1.CA2 - speech signal file SA002ED1.CA3 - speech signal file This database was designed following the specifications given in the SpeeCon project. Most important points are copied or summarized in this document. Detailed information can be found in the Bibliography. 1.1 Speech file formats The signals are stored in a raw file format, i.e. without headers in the signal file. Each of the four speech channels is recorded at 16 kHz with 16 bit quantization with the least significant byte first (“lohi” or Intel format) as (signed) integers. A description of the sample rate, the quantization, and byte order used is held in the SAM label file that corresponds to each speech file. This label file also contains information about the noise level at recording time, and the signal quality value of the speech file. 1.2 Directory structure 7 Annexa 3. Catalan Database for Consumer Applications The directory structure uses a shallow directory nesting with contiguous numbers to identify the individual sub-directories and call directories. The following three-levels directory structure is defined: \<database>\<block>\<session> Where: <database> <block> <session> Defined as <dbName><#><language code> where <dbName> is ADULT , <#> is 1 for SpeeCon, <language code> is the ISO 639 2-letter language code, CA for Catalan Defined as BLOCK<NN> Where <NN> is a number from 00 to 99 Defined as SES<NN><M> Where <NN> is the block number and <M> a session number from 0 to 9 Table 2. SpeeCon directory structure Both signal files and label files belong to the same directory, although they are distributed on different disks. In addition to the previous structure the following directories are used to store some other files: / /<database>/DOC /<database>/HTML /<database>/TABLE /<database>/INDEX /<database>/PROMPT /<database>/SOURCE README file with overview of database, DISK.ID file, and copyright file Documentation HTML access to (selected) recordings. Optional. Not included Speaker, recording condition, environment conditions, and lexicon tables Index files Prompt sheet samples. Optional. Not included Source code. Optional. Not included Table 3. Non-speech related directory structure All sessions have complete recordings for all prompted items. Exceptions can be found in the summary text files. The root directory contains three mandatory files: - COPYRIGH.TXT : a copyright text in ASCII format, - DISK.ID : an 11-character string with the volume name (required for systems that cannot read the physical volume label), - README.TXT : an ASCII text file that lists all files of the database, except for signal and label files which can be indicated by their name template. 1.3 File nomenclature 8 Annexa 3. Catalan Database for Consumer Applications File names adhere to the common subset of the ISO 9660 standard, i.e. file names with 8 characters followed by a 3-character file extension: <dbID><NNM><CCC>.<LL><F> where: <dbID> <NNM> Database Identification Code (00-ZZ), SpeeCon Adult: SA Progressive recording session number (000 to 999), where NN is the block number and M is the session number Corpus code ISO 639 language code. CA for Catalan File type code: O: orthographic label file 0,1,2,3: speech signal files for channels 0 – 3 <CCC> <LL> <F> Table 4 - SpeeCon file name conventions An explanation of the corpus codes is presented in the tables in section 3. A list of separate documentation files, tables and listings follows below: Directory TABLE INDEX DOC File LEXICON.TBL REC_COND.TBL SESSION.TBL SPEAKER.TBL CONTENT0.LST CONTENT[1|2|3].LST DESIGN.DOC ISO8859<n>.PS PLATFORM.DOC SAMPALEX.PS SPELLALT.DOC SUMMAR[0|1|2|3].TXT TRANSCRP.DOC VALREP.{TXT|DOC} mandatory mandatory mandatory mandatory mandatory optional, not included mandatory mandatory optional,not included mandatory optional, not included at least SUMMAR0.TXT. Others not included optional, not included mandatory Table 5 - Summary of documentation files 1.4 Label files Associated with each speech file is an ASCII SAM label file. Associated SAM label files are text files <CR><LF> ended (according to the DOS format). All mnemonics used in the SpeeCon Catalan Database are listed below. For each one the explanation, the format and the domain accepted for the related items are given. SAM Label LHD ELF CMT DBN SES SCD SEX Description Format Format string Label header End of label file Comment Database name Session number Speaker code Speaker gender Fixed vocabulary item %s Free-form text SPEECON_ <LL> 3-digit number 3-digit number Fixed vocabulary item: [M|F] %s %s %03d %03d %s 9 Annexa 3. Catalan Database for Consumer Applications AGE ACC DIR SRC CCD REP RED RET BEG END SAM SNB SBF SSB QNT NCH LBD LBR MIP MIT SCC DBA SNQ LBO Speaker age Integer Speaker accent Fixed vocabulary item from list of dialects Speech file directory Fixed vocabulary item from file system \<dbName>\BLOCK<NN>\SES<NN><M>\ Speech file names A comma separated list of 8.3 file names Corpus code Recording place Recording date Recording time Labeled sequence begin position Labeled sequence end position Sampling frequency Number of (8-bit) bytes per sample Sample byte order Number of significant bits per sample Quantization Number of channels Label file body Prompt text %d %s %s 3 character code Free-form text DD/Mon/YYYY HH:MM:SS Integer %8c.%3c, %8c.%3c, %8c.%3c,%8c.%3c %3c %s %02d/%3c/%4d %02d:%02d:%02d %d Integer: number of sample points in recording %d Integer: 16000 Integer: 2, signed %d %1d,%s Integer: [0|lohi] Integer: [16] %s %d Fixed vocabulary item, e.g.: PCM Integer: 4 %s %d BEG,END,<gain>,<min>,<max>,<prompt text> with <gain>, <min>,<max> optional signal values; if they are not known, the values may be left empty, but the correct number of commas must remain. <prompt text> is the text that appears on the screen. Microphone Position An attribute-value pair list, CHN0 = %s, CHN1 = %s , CHN2 = %s, CHN3= %s that indicates the position of the microphones Atribute values are: CLOSE_HEADET|CLOSE_LAVALIER|MEDI UM|FAR|CLOSE_LOUDSPEAKER Microphone type An attribute-value pair list, CHN0 = %s, CHN1 = %s , CHN2 = %s, CHN3= %s that specifies the microphones. Atribute values are: SENNHEISER_ME104|SENNHEISER_ME64| AKG|PEIKER|MBF_HAUN|NOKIA Scenario code An attribute-value pair list, ENV = %s, PLC = %s , POS = %s, SIZ = %s, AUD = %s, DRV = %s that indicates the acoustic environment and the factors that define the actual recording scenario as explained in D212. Atribute values are shown below. A value for the ENV attribute is mandatory, but if the sub-scenario is unknown, the values may be left empty. However, the correct number of commas must remain. Noise level The noise level, as measured by the noise-level meter during recording of the silence word Signal/Noise Quality Attribute value pair list, CHN0 = %f, CHN1 = %f , CHN2= %f, CHN3 = %f The SNR values estimated by the recording platform Orthographic <transcription text> transcription 10 %d, %d, %d, %d, %d, %s %s,%s,%s,%s %s,%s,%s,%s %s,%s,%s,%s,%s,%s %f %f %d, %d, %d, %s Annexa 3. Catalan Database for Consumer Applications Table 6 - SpeeCon SAM labels The atribute valuess pairs for the SCC label are: Attribute ENV PLC POS SIZ AUD DRV values comments OFFICE | ENTERTAINMENT | CAR | PUBLIC_PLACE OFFICE_nn | ENTERTAIN_nn | PUBHALL_nn | PUBOPEN_nn | nn is a two digit number in CARUPPER_nn | CARMIDDLE_nn “%02d” format whose minimal and maximal CLOSE_WALL_nn | FAR_WALL_nn | NO_WALL_nn | CODRIVER values are given in D212 IF ENV = {OFFICE | ENTERTAINMENT} : Public places are supposed SQM_00_10 | SQM_10_20 | SQM_20-30 | SQM_30+ to have a size of at least IF ENV = PUBLIC_PLACE : 100 square meter SQM_100_200 | SQM_200+ ON | OFF ENGINE_ON | ENGINE_OFF | CITY_30_70 | COUNTRY_60_100 | HIGHWAY_90_130 Table 6a - atribute values pairs for the SCC label Below is an example of a label file contained in the Database. LHD: DBN: SES: CMT: SRC: DIR: CCD: BEG: END: REP: RED: RET: CMT: SAM: SNB: SBF: SSB: QNT: NCH: CMT: SCD: SEX: AGE: ACC: CMT: SNQ: MIP: MIT: SCC: DBA: CMT: LBD: LBR: LBO: ELF: SAM, 6.1 SPEECON_CA 000 *** Speech Label Information *** SA000925.CA0,SA000925.CA1,SA000925.CA2,SA000925.CA3 \ADULT1CA\BLOCK00\SES000 925 0 26399 OFFICE_01 11/Nov/2005 12:21:37 *** Speech Data Coding *** 16000 2, signed lohi 16 PCM 4 *** Speaker Information *** 000 M 25 CENTRAL *** Recording Conditions *** CHN0=35.146,CHN1=29.571,CHN2=19.286,CHN3=8.123 CHN0=CLOSE_HEADSET,CHN1=CLOSE_LAVALIER,CHN2=MEDIUM,CHN3=FAR CHN0=SENNHEISER_ME104,CHN1=NOKIA,CHN2=MBF_HAUN,CHN3=MBF_HAUN ENV=OFFICE,PLC=OFFICE_01,POS=FAR_WALL_01,SIZ=SQM_30+,AUD=,DRV= 50.0 *** Label File Body *** 0,26399,,,,comptador 0,13199,26399,comptador 11 Annexa 3. Catalan Database for Consumer Applications 2 2.1 Database design and collection Recording platform The recording platform is specified in [1] and mainly described in DOC\PLATFORM.DOC. In this document, only a brief summary is included. The recording platform is based on a laptop using its two Type II PCMCIA slots as interface to the audio equipment. As operating system Windows ’98 applies; the interfaces cards rely on it. The recording software used was developed by UPC. Up to four microphones are recorded simultaneously. Amplifiers, measurement devices and other specifications can be found in [1]. The integrated high-pass filter was not used. The microphones to be used for the recordings depends on the particular environment to record. In the SpeeCon project, 5 recording environments were defined: Office: An office, i.e. a room where people are working at desks, usually or possibly with a computer. No discussions or meetings should be held in the office during the recordings. Entertainment (household): Living room i.e. a room with some furniture, places where people may sit down. A table, a TV or some audio equipment may be present. Instead of a living room also a hotel room may be possible. Car: Vehicle for 4 or 5 passengers Public places: A very large room (hall) or open-air. A hall should have at least 3 walls and a ceiling; more or less busy people, but not too quiet. An open area has no walls and no closed ceiling. Of course, it can be marked off by the walls of the surrounding building. In such a case, at most 2 walls may be closer than 10 meters. This allows recordings at the corner of two buildings. In all cases trees, small shops, an open cafe area, traffic as well as a pedestrian way are possible. Each of these recording environments have its specific noise characteristics, number and kind of microphones to be recorded simultaneously and position of the recording platform and microphones. From the view of recording set-ups, the office and the entertainment (household) environment are treated the same. Therefore, four scenarios with different hardware setups apply: SCENARIO office, entertainment public places car CLOSE DISTANCE MEDIUM DISTANCE Sennheiser Nokia Lavalier Sennheiser ME 104 HDC-6D ME 64 Sennheiser Nokia Lavalier Sennheiser Mikrofonbau Haun ME 104 HDC-6D ME 64 MBNM-550 E-L Sennheiser Nokia Lavalier AKG Peiker ME 104 HDC-6D Q400 Mk3 T ME15/V520-1 FAR DISTANCE Mikrofonbau Haun MBNM-550 E-L - Table 7 Microphone positions within the recording scenarios 12 - Annexa 3. Catalan Database for Consumer Applications Figure 1 gives an overview of the combined mounting positions. For the ‘close distance’ mounting position a distance of ca. 2 cm from the mouth applies for the ME104 and ca. 10 cm applies for the HDC-6D microphone. Figure 1 Microphone mounting positions. Children are not incuded in this database Within all recording scenarios the ‘medium distance’ and ‘far distance’ mounted microphones are in face-to-face position to the speaker. Figure 2 shows the microphone mounting scheme. 13 Annexa 3. Catalan Database for Consumer Applications Figure 2 Microphone mounting scheme For the ‘medium distance’ and ‘far distance’ mounting positions mounting areas are defined. Instead of mounting the microphones exactly along the ‘speech direction’ deviations up to 0.5 meters may happen. The recorded person is sitting on a chair during all the recording session. The two ‘close distance’ microphones are mounted at the speaker itself and for the ‘medium’ and the ‘far distance’ an average mounting height of ca.1.2 meter is used, allowing a deviation of ca. 50 centimetres. Regarding the reverberation properties of a place the speaker’s position relative to reflective objects, i.e. walls is important. Position labels are intended for categorising such positions in a generic way. The idea is to focus on the distance of the speaker from the next wall. A virtual stripe along all surrounding walls is defined. If the speaker located inside this stripe, his position is called ‘CLOSE_WALL’ otherwise ‘FAR_WALL’. In the office and entertainment scenarios the virtual stripe is defined to be 1 meter. For the public place scenario 2 meters are defined. These situations do not apply for the car scenario. For each recording place and specific position, the room impulse response is measured. For each session, a noise level is measured. The recording procedure is completely supervised by an operator. The procedure for a session is as follows: 1. Disconnect the equipment from the power supply 2. Measure the noise level and introduce the value on the recording system 3. Introduce the data: sex, age, speaker code and dialect of the speaker 4. Adjust the amplifiers to the speaker voice level. The position is fixed during the complete session 5. Start the recordings. The speaker should answer or read what appears in the display. 6. The operator check the recordings, SNR measurements an speech waveforms. A keystroke serve to display and record a new prompt. Another keystroke serves to repeat and record a prompt 7. At the end of the recordings, the operator saves the recordings 14 Annexa 3. Catalan Database for Consumer Applications 2.2 Speaker recruitment Speakers were recruited from several universities and associations around Catalonia. A recording platform was moved to each dialectal area to make the recordings. In each dialectal area, a responsible was named to recruit speakers. All speakers received a present to encourage their participation. 2.3 Design of prompting and prompt-sheet A total of 550 different prompt sheets were designed. The items to record were spread among the prompt sheet to avoid tiredness effect. Only spontaneous sentences, five minutes of free speech recordings, were grouped and recorded at the beginning of the recording session (SpeeCon recommendation). Section 9 includes an example of prompt sheet. 3 Database contents definition The corpus and item identifiers of the various items are specified in the tables below. 15 Annexa 3. Catalan Database for Consumer Applications Corpus Item identifier identifier Calibration data _ (underscore) 01 – 06 N 01 Free spontaneous items F 01 – 30 Corpus contents noise recording(s): medium distance: 01=mid position, 02=left position, 03=right position far distance: 04=mid position, 05=left position, 06=right position The “silence word” recording 5 minutes (session time) of free an open number of spontaneous topics spontaneous, rich context items (story out of a set of 30 topics telling) Elicited spontaneous items E D1 – D3 3 elicited dates 17 elicited spontaneous items E T1 – T2 2 elicited times E P1– P3 3 elicited proper names E C1 – C2 2 elicited city name E L1 1 elicited letter sequence E Q1 – Q2 2 elicited answers to questions E N1 – N3 3 elicited telephone numbers E O1 1 elicited language Read speech S 01 – 30 30 phonetically rich sentences W 01 – 05 5 phonetically rich words Core words (read), 31 general words and phrases, 208 application specific words and phrases C I1 – I4 4 isolated digits C B1 1 isolated digit sequence C C1 – C4 4 connected digit sequences C E1 1 telephone number C N1 – N3 3 natural numbers C M1 1 money amount C T1 – T2 2 time phrases T0 : analogue, T1 : digital C D1 – D3 3 dates 31 general words and phrases, D0 : analogue, D1 : relative and general date, D2 : digital C L1 – L3 3 letter sequences C P1 1 proper name C O1 – O2 2 city or street names C Q1 – Q2 2 questions C K1 – K2 2 special keyboard characters C W1 1 Web address C W2 1 email address 1 2 3 4 5 6 7 8 9 Y 01 – 85 01 – 40 01 – 22 01 – 57 01 – 70 01 – 33 01 – 39 01 – 12 01 – 95 01 –54 Basic IVR commands Directory navigation Editing Output control Messaging & Internet browsing Organizer functions Routing Automotive Audio & Video core word synonyms Total of 216 words and phrases per session out of 473. Table 8 – Corpus codes and contents for the adult speech data base 16 Annexa 3. Catalan Database for Consumer Applications The proposed format permits selection of all files belonging to one of the corpora by using one command (e.g. in DOS “dir /s/b ?????C??”, in UNIX “find . -name ”?????C??” -print”). A detailed specification of the individual items is given in the following subsections. 3.1 Free spontaneous speech (F01-F30) A set of 30 topic classes was defined for each of the SpeeCon application areas. The items were circulated among speakers to achieve a uniform occurrence frequency of each topic at the end. The questions were prompted by the supervisor to help the speaker to clearly understand the situation or task, but the question is not recorded. Only the answer is recorded. Three contexts were distinguished for the free spontaneous prompts: A: Mobile phone and PDA (8 prompts) F01: Deixi un missatge a un contestador d'un conegut F02: Simuli una trucada a una agència de viatges F03: Simuli una trucada a un banc F04: Simuli una trucada per a reservar un parell d'entrades de teatre o cinema F05: Simuli una trucada a una llibreria on-line per a comprar/encarregar un llibre o un disc F06: Dicti una carta de negocis breu F07: Simuli una trucada a un hotel per a confirmar la reserva d'una habitació F08: Simuli una trucada per a accedir a algun servei de telefonia mòbil Live a message Call to travel agency Call to a bank Reservation in cinema or theater Ask for a book or disk Bussiness letter Book a hotel room Mobile phone services B: Automotive and information kiosk (13 prompts) F09: Descrigui la situació del trànsit F10: Doni informació sobre on es troba i com arribar-hi F11: Simuli que introdueix en un sistema de navegació l'adreça d'un restaurant F12: Demani a un sistema de navegació la millor ruta per a accedir a un determinat destí F13: Anomeni els instruments presents en el quadre de comandament de l'automòbil F14: Simuli l'activació per veu d'un dispositiu del seu automòbil F15: Demani al supervisor qualsevol tipus d'informació sobre el sistema de gravació F16: Simuli que demana menjar en un restaurant F17: Digui la ciutat que més li agrada i perquè 17 Traffic description Position localization navigation system coordinates route to destination instruments of a dashboard activate a car device car information of recording equipment menu in a restaurant favourite city and reasons Annexa 3. Catalan Database for Consumer Applications F18: Digui els llocs d'interès de la ciutat que més li interest places in a city agrada F19: Digui quin país li agradaria visitar i perquè favourite country reasons F20: Simuli una trucada per a reservar una habitació book a hotel room d'hotel F21: Descrigui com arribar a una estació de tren how to take a train and C: Audio/Video and toys (7 prompts) F22: Expliqui la seva pel·lícula preferida F23: Simuli que controla oralment el seu equip de música F24: Simuli que li demana a la televisió que es connecti el seu programa de televisió preferit F25: Descrigui el seu programa de televisió preferit F26: Anomeni els dispositius d'àudio i vídeo que tingui F27: Faci una consulta sobre un programa de televisió F28: Digui el seu esport i esportista preferit describe film activate a music player activate TV program describe TV program audio devices names question about a TV program favourite sportman/woman D: Miscellaneous (2 prompts) F29: Digui quina impressió treu de les gravacions opinion about recordings F30: Digui quin tipus de dispositiu li agradaria controlar devices to be voice activated per veu 3.2 Elicited speech 3.2.1 Birth date, current date, relative date (ED1-3) Each speaker says three dates. Birth dates [friend | relative] Digui la data d'aniversari d'un amic Digui la data d'aniversari d'un familiar Relative and/or current dates [last holidays | week day | favourite date] Digui quan va començar les seves últimes vacances Digui un dia de la setmana Digui una data especial per a vostè 3.2.2 Time of day (ET1-2) Times can be analogue (i.e. 20:30) or digital (i.e. half past eight). Different questions were designed to have both formats. Each speaker says two times. What time is it now? Digui l'hora actual When is the news on your favorite radio or TV station? Digui a quina hora comença el seu programa de televisió/ràdio preferit 18 Annexa 3. Catalan Database for Consumer Applications At what time did you eat your last [breakfast | lunch | dinner] ? Digui a quina hora esmorzarà avui Digui a quina hora ha dinat/dinarà avui Digui a quina hora ha sopat/soparà avui At what time [did you get up | go to bed] ? Digui a quina hora s'ha aixecat avui Digui a quina hora anirà a dormir avui 3.2.3 City names (EC1-2) Each speaker says two cities. To have variabililty, different questions are formulated to different speakers Which city/town do you like [less | more] Digui la ciutat que li agrada més Digui la ciutat que li agrada menys In which city/town would you [like | dislike] to live? Digui una ciutat o poble on li agradaria viure Digui la ciutat on no li agradaria viure In which city/town did you were born? Digui on va néixer 3.2.4 Proper names (EP1-3) Each speakers says three names. Questions are formulated to receive a given name or a name plus family name and a company name. Expected answer: name What is the name of one relative [male | female]? Digui el nom d'un familiar (home) Digui el nom d'un familiar (dona) Expected answer: name + family name What is the name of one friend [male | female]? Digui el nom d'un company de l'escola Digui el nom d'una companya de l'escola Expected name of a company Name any company Digui el nom d'una empresa 3.2.5 Spelling of proper name (EL1) One proper name is spelt. Spelling the name of a person [male | female] Lletregi el nom complet d'un amic Lletregi el nom complet d'una amiga Spelling a city Lletregi el nom de la ciutat on va néixer Lletregi el nom d'una ciutat o poble proper 19 Annexa 3. Catalan Database for Consumer Applications Spelling a street Lletregi el nom del carrer on viu 3.2.6 Yes/No answers (EQ1-2) Expected answer NO (no) 'Have you lived more than 5 years abroad?' Respongui sí o no: Ha viscut més de cinc anys a l'estranger? Expected answer YES (sí) 'Are you adult (legal age)?' Respongui sí o no: És major d'edat? 3.2.7 Languages (EO1) Each speaker says one language. Questions are formulated to have variability. A language that speak Digui el nom d'un idioma que parli A language that do not speak Digui el nom d'un idioma que li agradaria aprendre 3.2.8 Telephone numbers (EN1-3) Each speaker says three telephone numbers. Questions are designed to obtain fixed and cellular telephone numbers Fixed network: Digui un número de telèfon fix The last number, either fixed or cellular, you called Digui l'últim número de telèfon al que ha trucat Cellular number Digui un número de telèfon mòbil 3.3 Read speech 3.3.1 Phonetically rich words (W01-05) Each speaker says five 'phonetically rich words'. Words were obtained from an electronic dictionary. Words were selected to have a good coverage of phones and, specifically, phones in context silence. There are 2750 different words. 20 Annexa 3. Catalan Database for Consumer Applications Phoneme p b t d k g f s z S Z m n J l L rr r j w i e E a O o u ts tS dZ B D G v N @ x jj T Counts 516 337 446 280 561 270 331 692 290 232 233 623 557 235 619 244 461 608 233 316 974 420 665 721 281 294 1650 0 22 0 280 252 252 0 238 659 1 0 1 Table 9 - List of phonemes and counts in the phonetically rich words set at transcription level 21 Annexa 3. Catalan Database for Consumer Applications 3.3.2 Phonetically rich sentences S01-S30 Each speaker read 30 sentences from a set of sentences phonetically rich. Sentences were selected from text corpora taken from newspapers and books. Sentences were chosen to have between 5 and 12 words and individually checked to ensure that they are correct grammatically and in spelling, and that there is nothing potentially offensive or inappropriate in their content. Each speaker pronounces each phoneme at least once. A greedy algorithm designed in UPC (corpus_crt) was used to select the sentences. The number of different sentences is 3406 and each sentence is repeated between four and five times (Except one sentence that is repeated only 2 times). Phoneme p b t d k g f s z S Z m n J l L rr r j w i e E a O o u ts tS dZ B D G 22 Counts 14621 9552 24683 13562 19296 4241 5635 38700 3740 2054 4694 18466 29986 3321 21846 3805 10059 16386 2171 6368 32709 14190 10833 28460 9166 10747 27728 10 2747 20 5176 3983 3119 Annexa 3. Catalan Database for Consumer Applications v N @ x jj T 8 2728 61581 54 7 35 Table 10 - Phonetically rich sentences. Phoneme counts at transcription level 3.3.3 Isolated digits (CI1-4) The digits were presented to the speakers as words. Twelve different forms are used Digit 0 1 2 3 4 5 6 7 8 9 Form Zero Un|una Dos|dues Tres Quatre Cinc Sis Set Vuit Nou Table 11 Digits and Names of digits 3.3.4 Isolated digit sequences (CB1) One isolated digit sequence is prompted. The sequence contains the twelve forms defined in 3.3.3 Example: Cinc dos sis nou dues una set zero tres quatre vuit un 3.3.5 Digit strings (CC1-4) Four digits strings of five digits each are prompted. The digits are written in twelve different forms as was defined in 3.3.3. Each speaker pronounces each digit at least once. Example : Nou dues tres nou zero 3.3.6 Telephone number (CE1) This is a read 9-13 digit telephone number. The number of digits, spacing and presentation reflects the typical telephone numbers in the language for national numbers 23 Annexa 3. Catalan Database for Consumer Applications including area codes, international numbers, and cellular network numbers. For cellular numbers, 6xx xx xx xx is used for domestic calls and 00 34 6xx xx xx xx for international calls. The spacing can vary. Examples (The spacing can vary) Spain: 975 920 884 GSM: 678 362 209 International: 00 34 932 729 369 3.3.7 Natural numbers (CN1-3) Three natural number phrases were read from the prompt sheet. The numbers chosen aimed to maximize the natural number word equivalents of: 10-19, 20...90, hundred, thousand, etc: The prompts covered numbers 10 X 1,000,000, with enough examples of words to permit training of all lexical items. Examples dos-cents dotze mil tres-cents cinquanta-dos nou-cents noranta-un mil vuit-cents seixanta-dos coma set-cents dotze 3.3.8 Money amounts (CM1) Money amounts are expressed in Euro (Euro, Euros) and Cent (Céntimos) Example cent cinquanta-cinc euros i deu cèntims 3.3.9 Times (CT1-2) Two time phrases were read: T1: One phrase in analogue form to provide adequate lexical coverage of all necessary words for training. Words AM and PM are not used in Catalan. The words are read in the following phrases: és l'una del migdia són exactament les deu i mitja del matí són gairebé les dues en punt les onze menys cinc de la nit a les tres i cinc són les dotze i disset tres quarts de quatre de la tarda és mitjanit dos quarts de cinc de la matinada una hora aproximadament les sis i quart set hores set i tretze de la tarda són les quatre i vint-i-cinc minuts un quart i mig de vuit les onze menys un minut les nou del vespre 24 Annexa 3. Catalan Database for Consumer Applications Table 12. Sentences and formats of the read analogue time. aproximadament | gairebé exactament en punt matinada migdia matí tarda nit mitjanit i menys quart mitja minut minuts hora hores approximately, nearly exactly o'clock dawn noon, midday morning afternoon, evening evening, night midnight past to quarter half minute minutes hour hours Table 13. Words in time phrases. English translation T2: One phrase in digital form. The times were prompted as words, not as digits (e.g. five past ten). Examples: 09:18 is prompted as: Nou hores i divuit minuts 3.3.10 Dates (CD1-3) D1: analogue form. The analogue dates cover all weekday names and month names (uniformly distributed). Dilluns Dimarts Dimecres Dijous Divendres Dissabte Diumenge Monday Tuesday Wednesday Thursday Friday Saturday Sunday Table 14 - Week days and English translation Gener Febrer Març Abril Maig Juny Juliol Agost Setembre Octubre Novembre Desembre January February March April May June July August September October November December Table 15 -Month names and English translation 25 Annexa 3. Catalan Database for Consumer Applications Format: Dijous, tretze de gener de mil nou-cents vuitanta-sis D2: relative and general date expressions (more ways to talk about the future than the past are used in Catalan, and therefore in this selection). Avui Demà Demà passat Ahir Abans d’ahir El cap de setmana Aquest cap de setmana El proper cap de setmana El passat cap de setmana Aquesta setmana La setmana que ve La pròxima setmana La setmana passada Aquest mes El mes passat El mes que ve Aquest any L’any passat L’any que ve Setmana Santa Dijous Sant Divendres Sant Diumenge de rams Dilluns de Pasqua Carnestoltes Nadal Nit de Nadal Cap d’any Any Nou El dia de Reis Sant Joan Tots sants La Diada L’onze de setembre Sant Jordi El primer de maig El dia següent D’avui en vuit A mitjans de setmana la setmana vinent la propera setmana el mes vinent el proper mes el cap de setmana vinent Today Tomorrow The day after tomorrow Yesterday The day before tomorrow Weekend This Weekend Next Weekend Last Weekend This week Next week Next week Last week This month Last month Next month This year Last year Next year Easter Maundy Thursday Good Friday Easter Sunday Easter Monday Carnival Christmas Christmas eve (24th of December) 31st of December 1st of January 6th of January 24th of June 1st of November 11th of September 11th of September 23rd of April 1st of May Tomorrow Next week In the middle of the week Next week Next week Next month Next Month Next weekend Table 16 - Formats and vocabulary of the Relative Dates D3: digital form. Format: 26 Annexa 3. Catalan Database for Consumer Applications Month day [del] month [de | del] year disset del cinc de dos mil tretze catorze del dotze del dinou 3.3.11 Spelled words (CL1-3) Spelling is not usual in Catalan. Syllabic pronunciation or spelling based on names is used instead. However the corpus includes spontaneous and read spelling but minimizing the detail of the spelling (capital letters, accents, apostrophes, mid dots, hyphen, ...) 3 spelled words were recorded per speaker. Names (proper names, cities and companies) and phonetically rich words were chosen to spell. The third prompt for literal spelling is an artificial (i.e. non-existing) word to balance the number of realizations per letter. Letter A B C Ç D E F G H I J K L LL M N Ñ O P Q R S T U V W X Y Z Name a be | be alta ce ce trencada de e efa ge hac i | i llatina jota ca ela ella | ela doble | doble ela ema ena enya o pe cu erra essa te u be baixa be doble | doble be baixa | doble be ics i grega zeta Repetitions 978 321 323 265 341 747 285 348 342 591 292 278 592 340 355 497 290 516 324 279 568 491 452 516 324 280 313 276 266 Table 17 Spelled letters, their Catalan names and number of prompts at prompt level The first name in the list is the most commonly used. The alternative names are locally used and won't be considered to achieve a given number of realizations. 27 Annexa 3. Catalan Database for Consumer Applications In fact “LL” and “Ñ” are not letters from the Catalan alphabet: “LL” should be spelled as two different “L”, however its use is extended and it was included on the list. Some of the most commons names come from Spain, and some of them include letter “Ñ”, that's why it was included on this list. 3.3.12 Proper Name (CP1) The proper name was drawn from a set of 150 first or last names or a combination of them. The list is included in 11.1 3.3.13 City/street names (CO1-2) 1 item drawn from a set of 275 most frequent cities. The list is in 11.2.1 1 item drawn from a set of 275 most frequent street names. The list is in 11.2.2 3.3.14 Yes/No (CQ1-2) Two items (one Yes, one No) were recorded for each speaker. Yes: Sí No: No 3.3.15 Email & Web addresses (CW1-2) CW1: WWW address from a list of 150 URL-addresses. The list is in 11.3.1 CW2: Email drawn from a set of 550 e-mail addresses. The list is in 11.3.2 3.3.16 Special keyboard characters (CK1-2) Names of two special keyboard characters were recorded for each speaker. The names of the special keyboard characters were prompted to the speakers as orthographic words. Accent obert Accent tancat Apòstrof Arrova Asterisc Barra Barra inversa barra vertical Coixinet Coma Cometes Dièresi Dos punts Espai Guió i Interrogant open accent('`') closed accent (''') apostrophe(''') ‘at’ sign (‘@’) star, asterisk (‘*’) Slash (‘/’) backslash (‘\’) pipe (‘|’) hash (‘#’) comma (‘,’) quotes ('”') mid dot ('·') Colon (‘:’) Blank | space (‘ ‘) hyphen (‘-‘) ampersand (‘&’) question mark (‘¿,?’) Table 18- List of Special Characters 28 Annexa 3. Catalan Database for Consumer Applications 3.3.17 Application specific commands (101-995), (401-499) The application commands (words and phrases) are divided into 9 different groups. The groups are shown in the tables in section 3. They are indicated by the first digit of the corpus code. The appendix includes the list of application words, their functionality, the English word and code. In some cases, the same functionality can be defined by two different words. The list of synonyms and their assigned code is also also given in the appendix. Finally, the appendix also shows the list of words with two different codes assigned. Please, note that in the database, the speech and label filenames are linked with the first word of this list. 4 Transcription The goal is to define a coarse transcription that can be performed quickly, but covers adequately the acoustic events most important for the training and testing of automatic speech recognisers. The transcription is orthographic and includes a few details that represent audible acoustic events (speech and non-speech) present in the corresponding waveform files. The symbol set aims at the localisation of the main acoustic events according to a coarse categorisation rather than a full description of all possible sounds that may appear during a recording. Extra marks contained in the transcription aid in interpreting the text form of the utterance; markers for non-speech acoustic events and distortions have been chosen such that they can be automatically removed or modified to yield the base transcription. The overall aim is to keep as much speech in the corpus as possible and to avoid the need for deleting recordings from the corpus due to some extra noises, disfluencies, etc. Normal lexical items are represented by their spellings in the normal way. Transcriptions are case SENSITIVE. Following SpeeCon reccomendations, upper case are used in spelled letters and in some and in application words with abbreviations or acronyms, such as DAT or MP3. Number sequences (times, dates, money amounts, etc.) will be spelled out to reflect what was said ("flight six one three"; "seven thirty" Letter sequences occur in spelled words, ZIP-codes, acronyms and abbreviations. Letters are transcribed in upper case, separated by a space. As some letters in Catalan have several names, some simple rules were established to deal with them. The table below shows all posible cases. Letter B B V W W W W Letter pronounced be be alta be baixa be doble doble be doble B baixa be doble 29 Transcription B B alta B baixa B doble doble B doble B baixa B doble Annexa 3. Catalan Database for Consumer Applications LL LL LL ella doble ela ela doble LL doble L L doble If a speaker pronounces letters, acronyms or abbreviations as a word, for example "British Rail" for BR, then these should be spelled out as words. No punctuation will be provided in the transcription other than those symbols used for special transcription purposes. However the label files should retain all punctuation provided to the speaker in the prompting text, including mistakes if these occurred. Mispronounced words that are nevertheless intelligible will be marked with one star 1 attached to the left of the word, which is mispronounced e.g. *transportation instead of the mispronounced transportetation Words preceded by a star include mispronunciations such as words with extra or omitted syllables, but a star should not be used to indicate pronunciations of words that represent normal dialectal (e.g., "warshed" for "washed" or "cah" for "car") or stylistic variation (e.g., "bout" for "about" or "wanna" for "want a" or for "want to"). If the speaker would not consider the pronunciation as an error, the star notation should not be used. Obviously, there may be some clear and some unclear cases; transcribers should use their best judgement. In stretches of speech that are mispronounced, each mispronounced word is marked individually. A rather frequent case is that a speaker first mispronounces a word and says it correctly. This should be transcribed as: *transportation transportation 4.1 Foreign words Both the collection of command words and the elicitation of spontaneous speech may result in a certain amount of foreign words in some databases. These words should be orthographically transcribed in their original spelling, and should be given an entry in the pronunciation lexicon that uses only phone symbols appropriate for this database language. The encoding of foreign words in the transcription follows the normal transcription conventions for a language. 4.2 Unintelligible words 1 Note: This differs from the WSJ convention, which used two stars, and also permitted the mispronunciation to be represented by modifying the orthography. 30 Annexa 3. Catalan Database for Consumer Applications Words or stretches of speech that are completely unintelligible are transcribed by a sequence of two asterisks: "**"2. The “**” marker is separated from neighbouring (intelligible) words with spaces. 4.3 Word Fragments Word fragments, i.e. instances in which the speaker did not complete a word, will be considered as mispronunciation. It is accordingly marked with a star attached to the left of the intended word. 4.4 Verbal Deletions Verbal deletion means words spoken by the user but which, in the opinion of the transcriber, are superseded by subsequent speech explicitly (e.g., "show flights i mean fares") or implicitly (e.g., "show me the fares flights to Boston"). Verbal deletions are not indicated as such in the SpeeCon corpora. Transcribers should simply indicate all the words they hear. Verbal deletions occur any time there is a repetition or restart. In repetitions, one or more words are repeated, and there may or may not be extra material inserted into the repetition, for example: show me the flights the flights to boston show me the flights the nonstop flights to boston In restarts, words are not repeated, but the speaker changes direction, as in: show me the how many flights go to boston 4.5 Non-Speech Acoustic Events Four categories of non-speech acoustic events must be transcribed. Events will only be transcribed if they are clearly distinguishable. Very low-level, i.e. non-intrusive events will be ignored. The event will be transcribed at the place of occurrence3, using the defined symbols in square brackets. For noise events that occur over a span of one or more words, the transcription should indicate the beginning of the noise, just before the first word it affects4. The first two categories of acoustic events originate from the speaker, and the other categories originate from another source. Sounds originating from the speaker usually do not overlap with the target speech, sounds originating from other sources can of course occur simultaneously with the speech. The four categories are: 2 [fil]: Filled pause. These sounds can well be modelled in a filled pause model in speech recognisers. Examples of filled pauses: uh, um, er, ah, mm. The corresponding WSJ marking was: [unintelligible] 3 It is often difficult to localise these events; transcribing the utterance first, and listening for these events in a second pass is the correct procedure. 4 There is no notation for spans/duration of noise events as in the WSJ conventions. 31 Annexa 3. Catalan Database for Consumer Applications [spk]: Speaker noise. All kinds of sounds and noises made by the speaker that are not part of the prompted text, e.g. lip smack, cough, grunt, throat clear, tongue click, loud breath, laugh, loud sigh. This marker should also be used in the case that the speaker blows into the microphone, before or after a word; it is not used if the blowing overlaps with the word (cf. clipping). Only loud speaker noises should be transcribed. [sta]: Stationary noise. This category contains background noise that is not intermittent and has a more or less stable amplitude spectrum over some time. Examples, voice babble (cocktail-party noise), background noise, sirens, wind, rain, cobble stones. [int]: Intermittent noise. This category contains noises of an intermittent nature. These noises typically occur only once (like a door slam), or have pauses between them (like phone ringing), or change their colour over time (like music). Examples: music, background speech, horn sounds, paper rustle, cross talk, ticks by the direction indicator. [sta] and [int] should only be used if the sounds are not inherent to the environment as such. E.g. in the car a stationary background noise and street noises can be expected as given with the environment of the recording. These noises should not be transcribed. Only obvious and salient deviations from the given background should be marked. With music or other speakers in the background it is possible to have both [sta] and [int]: [sta] to indicate the overall noise level and [int] to indicate sudden (louder) events in this stationary stream. The distinction between stationary and intermittent noise is considered a relevant one. Utterances containing stationary noise can be used to train robust models with noise background, especially if the marker is before the first word of the sentence. In contrast, utterances containing intermittent noise will generally be discarded for training. If a [sta] or [int] sound begins in a word, then the symbol should be put before the first word affected. Never should the symbol appear in the word, thus breaking the word up into pieces. The exact location of the noise should be established by the user via listening to the utterance a second (or third, …) time. The transcription only serves as an indication of the presence of some intermittent noise. Only the four defined categories should appear in the final transcriptions, without any further sub-categorisation. (If other symbols are used, for instance because it is easier for the transcribers, a post-hoc conversion should ensure that only the four defined categories show up in the final transcriptions). The symbols should be separated from surrounding words by spaces. 4.6 Truncated Waveforms If a speech signal file is truncated due to a recording error, the following notation is to be used: Beginning of utterance truncation: End of utterance truncation: ~transcription transcription~ 32 Annexa 3. Catalan Database for Consumer Applications Beginning and end of utterance truncation: ~transcription~ There is a difference between an utterance that is truncated and is now incomplete, but which has not damaged the initial or final words, and an utterance when word(s) have been damaged. The ~ indicates truncation of the word. 4.7 Combination of markers If more distortions apply to a word, this should not be indicated by a combination of markers (*,~). Only the marker for the most severe distortion should be used, since this one will determine if a word/utterance is useful for any purpose or not. A characteristic feature of the SpeeCon recordings is that each utterance is recorded over more than one microphone. Due to the costs associated with transcription it is not feasible to create a hand labeled transcription for each recording channel. The information that is most important for the training of a recognizer (spoken words, mispronunciations, word fragments, and a few non-speech acoustic events) is assumed to be most distinct in the close-talk channel. Thus, the transcription of the close-talk microphone channel is mandatory for SpeeCon recordings. A transcription of the other channels is not provided in SpeeCon. The use of (semi-) automatic methods that could start from the transcription of the close-talk microphone channel is left to the end-user of a data base who might want to customize the coarse transcription to his needs. 4.8 Transcription procedure The transcription was performed at Verbio Technologies. using the transcription tool NaniBD for the read items and Transcriber for the free-spontaneous sentences. The following recommendations are made for the transcription procedure. 1. Use headphones for transcribers. Each transcriber has the same type of headphone. Transcriptions are made in a quiet environment. 2. Transcriptions were made in two passes. In the first pass, the transcriber check if the speaker says correctly the prompted text or introduces the texts in case of spontaneous or elicited prompts. In a second pass, the transcriber mark noises, truncations and special events. 3. All the transcribers use the same set of speakers to be trained. Moreover, this set will belong to the sentences corrected by the validation center in the prevalidation step. 4. The transcription tool uses a set of buttons for the non-speech acoustic events. Full typing by the transcribers introduces unnecessary errors. 5. The transcription task is done on a basis of half-day employment. The standard character set used for transcription is ISO-8859-1 33 Annexa 3. Catalan Database for Consumer Applications Transcriptions are checked periodically. Two persons do the same transcription task and results are compared. 5 Lexicon The lexicon is included in the file: ADULT1CA\TABLE\ LEXICON.TBL The lexicon file is an alphabetically ordered list of distinct lexical items (essentially words in our case) which occur in the corpus with the corresponding pronunciation information. Each distinct word has a separate entry. As the lexicon is derived from the corpus it uses the same alphabetic encoding for special and accented characters as used in the transcriptions (ISO-8859). ISO 8859 Latin 1 Characters code are listed in file ADULT1CA\DOC\ISO88591.PS. We have included a frequency count for each entry in the lexicon e.g. to help indicate rare words whose transcriptions are perhaps less important or reliable. The pronunciation lexicon was produced after the transcription phase; it contain, alphabetically sorted, all words found in the "LBO:" transcription (one occurrence for each word), their number of occurrences and the list of their phonemic representations. The words appear in the lexicon exactly as they appear in the transcription. The lexicon is case insensitive. All the component words have been identified and alphabetically sorted; all fragments, mispronunciations and non speech events have been removed, and only one occurrence of each word have been selected. A software tool developed at UPC (SEGRE: Catalan Automatic Graphemes to Allophones Transcriber) has been used to translate the transcribed words to phonemic strings by using the SAMPA phonemic notation. No stress marks are provided. It is important to mention here that the company/agency names, URL and email addresses very often don't correspond to Catalan words. These names are presented to the speaker as they are written by the company, web page or email (capital letters without accents, foreign words, acronyms ...). For those names that usually are spoken in a non standard transcription, we have included the common transcription., i.e. Philips: Standard transcription pilips, Common transcription filis The full lexicon was transcribed automatically in this intermediate deliverable. Proper names, foreing words, company names, URL addresses and email addresses will be checked manually in the final deliverable. The SAMPA phonetic symbols used are: Symbol p b t d k g Description voiceless bilabial stop voiced bilabial stop voiceless dental stop voiced dental stop voiceless velar stop voiced velar stop Example pala bala tela donar cala gala 34 Transcription "pal@ "bal@ "tEl@ du"na "kal@ "gal@ Annexa 3. Catalan Database for Consumer Applications f s z S Z m n J l L rr r j w i e E a O o u ts tS dZ B D G v N @ voiceless labiodental fricative voiceless alveolar fricative voiced alveolar fricative voiceless palatal fricative voiced palatal fricative voiced bilabial nasal voiced alveolar nasal voiced palatal nasal voiced alveolar lateral voiced palatal lateral voiced alveolar trill voiced alveolar flap voiced palatal approximant voiced labiovelar approximant close front unrounded close-mid front unrounded open-mid front unrounded open central unrounded open-mid back rounded close-mid back rounded close back rounded voiceless alveolar affricate voiceless palatal affricate voiced palatal affricate voiced bilabial fricative voiced dental fricative voiced velar fricative voiced labiodental fricative voiced velar nasal central (schwa) unrounded fals sala desde caixa màgic mena nena any líquid llamp carro cara caire veuen ric cec sec sac soc soc suc tots cotxe metge acaba cada amagar afgà fang amor "fals "sal@ "dezD@ "kaS@ "maZik “mEn@ “nEn@ “aJ "likit "Lam "karru "kar@ “kajr@ "bEw@n "rik "sek "sEk "sak "sOk "sok "suk “tots "kotS@ "medZ@ @"kaB@ “kaDa @m@”Ga @v”Ga "faN @"mor Table 19 - SAMPA Catalan Allophone set Symbols used in this table show allophonic variations. The following list show the corresponding phonemes: D and d can be merged in d. B and b can be merged in b. G and g can be merged in g. N and n can be merged in n. v and f can be merged in f. Recorded words pronounced in Spanish need three additional symbols: Symbol x jj T Description voiceless velar fricative voiced palatal fricative voiceless interdental fricative Example jamón mayo zapato Transcription xa”mon “majjo “Tapato Table 20 – Additional SAMPA symbols taken from Spanish set The LEXICON.TBL file has the following format. 35 Annexa 3. Catalan Database for Consumer Applications <grapheme form> <TAB> <frequency> <TAB> <phonemic transcription> <TAB> [<alternative phonemic transcription>] One phonemic transcription is provided. As for the other tables the first row contains a heading row with the field names as part of the table and each line in the lexicon is terminated by <CR> <LF> (ASCII 13 10). Phonemic characters are separated by a space (ASCII 32). The complete set of SAMPA Symbols used in the Catalan Database is included in the file: ADULT1CA\DOC\SAMPALEX.PS Ph. Rich Ph. Rich Allophone Sentences Words Combined Total p 14621 516 15137 87841 b 9552 337 9889 59840 t 24683 446 25129 202051 d 13562 280 13842 100733 k 19296 561 19857 140510 g 4241 270 4511 14243 f 5635 331 5966 32483 s 38700 692 39392 277585 z 3740 290 4030 28878 S 2054 232 2286 11139 Z 4694 233 4927 24562 m 18466 623 19089 118836 n 29986 557 30543 197609 J 3321 235 3556 8894 l 21846 619 22465 136046 L 3805 244 4049 18612 rr 10059 461 10520 76580 r 16386 608 16994 141503 j 2171 233 2404 14818 w 6368 316 6684 42916 i 32709 974 33683 240738 e 14190 420 14610 102359 E 10833 665 11498 72538 a 28460 721 29181 169696 O 9166 281 9447 56577 o 10747 294 11041 69527 u 27728 1650 29378 177172 ts 10 0 10 25 tS 2747 22 2769 5064 dZ 20 0 20 162 B 5176 280 5456 38211 D 3983 252 4235 40685 G 3119 252 3371 21256 v 8 0 8 17 N 2728 238 2966 14137 @ 61581 659 62240 535778 x 54 1 55 437 36 Annexa 3. Catalan Database for Consumer Applications jj T 7 35 0 1 7 36 52 1088 Table 21 - Analysis of frequency of occurrence at transcription level of the phonemes (allophones) represented in the phonetically rich sentences, phonetically rich words, the combined set, and in the full database. A list of foreing words follows. Words come from English (mainly from web and URL addresses) and Spanish. alresoft barrapunto cincodias ebay elotrolado google iberia infojobs kissfm libertaddigital microsoft paginasamarillas softonic terra vilaweb 6 altavista boe cocacola elcorreodigital elpais gruposantander iddeo invertia lanetro los40 movistar patagon sport tiscali vodafone amena cadena100 creativeweb elcorteingles elperiodico heraldo iespana irc-hispano latecla lycos msn periodistadigital telecinco uji wanadoo bandaancha cadenaser default elmundo eterk hispavista imatica ircap latinmail m80radio ondacero portalmix telefonica universia ya banesto cajamadrid ebankinter elmundodeportivo everyday hotmail ine jopers lavozdegalicia meristation ozu repsol-ypf telepolis verbalia yahoo Speaker demographic information The speaker demographic information is in the file SPEAKER.TBL 6.1 Accent/Regions Catalan is the language spoken in Catalonia, Valencia, Balearic Islands and Andorra. It is also spoken in other places, such as Rosselló and Vallespir (south of France), the border between Catalonia and Aragó, and the city of Alghero in Sardinia, Italy. In Catalonia, Catalan is divided into two major dialect blocks: Eastern Catalan and Western Catalan, which can be divided into several subdialects [3]. We will consider four dialects, named CENTRAL, NORD-OCCIDENTAL, TORTOSI and GIRONI, as shown in the map below. 37 Annexa 3. Catalan Database for Consumer Applications Name of accent/region CENTRAL NORD-OCCIDENTAL GIRONI TORTOSI #Male speakers 113 49 47 52 #Female speakers 132 55 55 47 Table 21 - Distribution of adult speakers over regions 6.2 Speaker ages Age groups 15-30 31-45 46-60 over 60 TOTAL Number of Number of male female speakers speakers 145 84 23 9 261 166 85 31 7 289 Percentage of total 56,5% 30,7% 9,8% 2,9% 100,0% Table 22 - Distribution of adult speakers over age groups and sexes 38 Annexa 3. Catalan Database for Consumer Applications 7 Recording conditions Five recording environments were defined: Office: An office, i.e. a room where people are working at desks, usually or possibly with a computer. No discussions or meetings should be held in the office during the recordings. Entertainment (household): Living room i.e. a room with some furniture, places where people may sit down. A table, a TV or some audio equipment may be present. Instead of a living room also a hotel room may be possible. Car: Vehicle for 4 or 5 passengers Public places: A very large room (hall) or open-air. A hall should have at least 3 walls and a ceiling; more or less busy people, but not too quiet. An open area has no walls and no closed ceiling. Of course, it can be marked off by the walls of the surrounding building. In such a case, at most 2 walls may be closer than 10 meters. This allows recordings at the corner of two buildings. In all cases trees, small shops, an open cafe area, traffic as well as a pedestrian way are possible. The following table presents the distribution of the speakers over the main environments. Environment #Males #Females Total Office 86 114 200 Entertainment 26 49 75 115 85 200 34 41 75 Public Car Table 23 - Distribution of speakers over main environments 7.1 Office The Office environment was divided into several sub-environments. The relevant parameters for the subdivision are the office room itself (Place) and the distance of the speaker to the wall. The speaker was considered close to the nearest wall if the distance was 1m or less. Place Close Wall Far Wall OFFICE_01 42 28 OFFICE_02 2 4 OFFICE_03 11 11 OFFICE_04 19 23 OFFICE_05 10 14 OFFICE_06 2 0 39 Annexa 3. Catalan Database for Consumer Applications OFFICE_07 3 2 OFFICE_08 4 4 OFFICE_09 1 1 OFFICE_10 4 1 OFFICE_11 2 1 OFFICE_12 3 2 OFFICE_13 1 2 OFFICE_14 1 2 105 95 TOTAL Table 24 - Distribution of adult speakers over office sub-environments 7.2 Entertainment The Entertainment environment was divided into several sub-environments. The relevant parameters for the subdivision are the use of audio equipment, and the distance of the speaker to the wall. The speaker was considered close to the nearest wall if the distance was 1m or less. AUDIO ON OFF TOTAL CLOSE WALL FAR WALL 16 26 42 TOTAL 15 18 33 31 44 75 Table 25 - Distribution of adult speakers over entertainment sub-environments 7.3 Public place The Public environment was divided into several sub-environments. The relevant parameters for the subdivision are the room itself (Place), the place category (Hall or Open), and the distance of the speaker to the wall. The speaker was considered close to the nearest wall if the distance was 2 m or less; the speaker was considered as speaking without wall reverberations (No Wall) if the distance was larger than 10 m. PLACE HALL OPEN TOTAL CLOSE WALL FAR WALL 44 29 73 NO WALL 46 64 110 TOTAL 17 17 Table 28 - Distribution of adult speakers over public places 40 90 110 200 Annexa 3. Catalan Database for Consumer Applications 7.4 Car Two different types of cars were distinguished (middle and upper class) and five different driving conditions. The distribution of speakers over the sub-environments is given in the table below. DRIVING CONDITIONS ENGINE OFF ENGINE ON CITY COUNTRY HIGHWAY TOTAL MIDDLE CLASS CAR 10 9 9 6 6 40 UPPER CLASS CAR 11 8 9 4 3 35 TOTAL 21 17 18 10 9 75 Table 26 - Distribution of adult speakers over car sub-environments 8 Deviations from general SpeeCon specifications No deviations from the specifications. 9 9.1 Sample sheets Sample instruction sheet The instructions are given orally to the speaker. A small training to be familiar with spontaneous sentences and their recording time is the most relevant part of the training 9.2 Sample prompt sheet The list of items is displayed, it do not exist physically. The formats of the items have been showed in this document. An example of a promt sheet follows: _01: --- noise rec_1, microphone medium, loudspeaker mid --_02: --- noise rec_2, microphone medium, loudspeaker left --_03: --- noise rec_3, microphone medium, loudspeaker right --_04: --- noise rec_4, microphone far, loudspeaker mid --_05: --- noise rec_5, microphone far, loudspeaker left --_06: --- noise rec_6, microphone far, loudspeaker right --T01: --- Test recordings --N01: --- silence_word --F16: Simuli que demana menjar en un restaurant F17: Digui la ciutat que més li agrada i perquè F18: Digui els llocs d'interès de la ciutat que més li agrada F19: Digui quin país li agradaria visitar i perquè F20: Simuli una trucada per a reservar una habitació d'hotel 41 Annexa 3. Catalan Database for Consumer Applications F21: Descrigui com arribar a una estació de tren F22: Expliqui la seva pel·lícula preferida F23: Simuli que controla oralment el seu equip de música F24: Simuli que li demana a la televisió que es connecti el seu programa de televisió preferit F25: Descrigui el seu programa de televisió preferit 110: control de veu 923: endavant 908: cable S01: Vaig trigar gairebé un mes a arribar-hi. 911: RDS Y50: reproducció 175: estat ED1: Digui la data d'aniversari d'un amic 420: resolució 533: marcació ràpida 414: ocultar W01: folls 408: mitjà 988: natura 929: càrrega de bateria CI1: sis 562: acceptar trucada 456: imprimir 319: acceptar S02: jo també vull un canvi a l'escola 454: canviar de cara 177: força de senyal 902: balanç ED2: Digui un dia de la setmana 954: viatges 404: mínim 536: conferència W02: mango 611: horari 722: aeroport 162: diari CI2: una 569: sortida 935: DAB 992: anuncis S03: No vaig voler perdre més el temps. 178: autoavaluació 701: ubicació 615: calculadora ED3: Digui una data especial per a vostè 184: sincronitzar Y34: navegar 723: frontera W03: oposeu 806: seient 980: esports d'aigua 42 Annexa 3. Catalan Database for Consumer Applications 222: anterior CI3: cinc Y08: reintentar 930: capacitat 718: cua S04: No és ben bé el mateix complir mig any 540: atenció al client 941: teatre 904: subtítols ET1: Digui a quina hora ha dinat/dinarà avui 446: auriculars 231: suggerir 145: assistent W04: divideix 152: català 913: títol 403: màxim CI4: un 624: adreça 317: repetir 550: secretari S05: Respecte a acollir un nou membre, francament, ho veig difícil. 427: color 136: reiniciar 605: mes ET2: Digui a quina hora s'ha aixecat avui 220: següent 157: temporitzador 965: comèdia W05: boirumós 527: trucada 811: combustible 977: bàsquet CB1: set dos tres un vuit dues cinc nou sis quatre zero una 706: guiar a 564: trucades perdudes 909: AUX S06: un passeig en cotxe de cavalls 315: classificar 302: entrada 546: oficina EP1: Digui el nom d'un familiar (home) 123: DVD 442: aguts 104: activar CC1: nou set vuit una set 139: continuar 931: imatge congelada 809: PC S07: Però ho veig difícil, perquè l'ambient no està molt clar 148: canviar 919: disquet 43 Annexa 3. Catalan Database for Consumer Applications 932: AV EP2: Digui el nom d'una companya de l'escola 732: estació de tren 117: videocàmera 906: antena CC2: dues dues cinc dos cinc 172: macro 726: cruïlla 122: DAT S08: un nou museu històric, etnogràfic i antropològic. 240: ometre 556: ciutat Y02: login EP3: Digui el nom d'un company de l'escola 223: enrere 958: thriller 120: CD CC3: sis sis quatre set una 942: cultura 521: targeta de visita 202: directori S09: van guanyar amb un resultat contundent 316: desfer 438: melodia Y23: memòria de veu EC1: Digui la ciutat que li agrada més 524: incloure a favorits 229: blanc 519: DTMF CC4: cinc un zero tres dues 812: manteniment 547: busca 163: setmanal S010: Som molt insolidaris, no volem veure res lleig ni brut. 735: pictograma 526: hiperenllaç 933: so EC2: Digui on va néixer 411: canal Y12: ignorar 978: atletisme CN1: cent setanta-un mil dos-cents vuitanta-vuit 444: equalitzador Y53: medi ambient 715: àrea de descans S11: Quan surto al camp només vull guanyar i donar-ho tot. 711: nova ruta 211: final 434: silenci EL1: Lletregi el nom del carrer on viu 568: arribada 927: reducció de so 44 Annexa 3. Catalan Database for Consumer Applications 416: apropar CN2: set-cents vint-i-cinc mil cent u 132: vídeo Y44: salut 602: avui S12: Fet i fet, mentre no guanyi el pitjor... 455: expulsar 429: més contrast Y01: inactivar EQ1: Respongui sí o no: És major d'edat? 612: contactes Y54: fitness 233: com CN3: cinc-cents noranta-sis mil cent divuit coma quatre-cents noranta-tres 312: crear 805: ventilació 176: bateria S13: Taipei és una ciutat molt animada 445: altaveu 421: actualitzar 147: comprovar EQ2: Respongui sí o no: Ha viscut més de cinc anys a l'estranger? 159: hora 111: bloquejar 156: gravar CM1: cinquanta-quatre euros i onze cèntims 991: cuina 629: província 541: operadora S14: la fauna inclou els tapirs 158: rellotge 710: ruta alternativa 209: amunt EN1: Digui un número de telèfon mòbil Y15: normal 452: ràpid 948: jazz CE1: 623 489 314 138: pausa 618: jocs 604: setmana S15: la nostra hostessa revisa l' equipatge de mà 314: marcar 102: apagar 422: congelar EN2: Digui l'últim número de telèfon al que ha trucat 425: més brillant 979: esports de motor 712: distància CT1: tres quarts de quatre de la tarda 972: concurs 228: negre 45 Annexa 3. Catalan Database for Consumer Applications 129: mòbil S16: Volem que obri els seus mercats. 511: vacances 219: dreta 704: guia acústica EN3: Digui un número de telèfon fix 609: tasques 146: verificar 552: internacional CT2: quinze hores i trenta-vuit minuts 544: casa 961: bèl·lica 168: automàtic S17: vol que Mas retardi el nom del substitut de Puig 215: més 510: memòria 716: estació de servei EO1: Digui el nom d'un idioma que parli 450: play 309: enganxar 443: potenciar greus CD1: dissabte, disset de febrer de mil nou-cents seixanta-vuit 161: alarma 134: telèfon 227: verd S18: les temporades funcionen de la manera següent 720: punt de sortida 567: bústia de veu 628: codi postal CD2: Dijous sant 728: transbordador 107: registre 737: oest S19: l' activitat és prendre banys calents 151: demostració 616: editor de textos 529: trucar CD3: setze del dotze de mil nou-cents noranta-u 949: country 938: actualitat 160: data S20: Hi ha almenys nou morts. 426: més fosc 990: aficions 912: artista CL1: L I N Y O L A 632: número de fax 232: examinar 939: esports S21: Ningú ha encertat l'actriu i és molt fàcil Y28: secretària 506: enviar 46 Annexa 3. Catalan Database for Consumer Applications 983: dibuixos animats CL2: B E N Z O P I R E N S Y27: privat 943: ciència 501: missatge S22: les activitats d' estiu superen en nombre les d' hivern 973: programa de varietats 167: programa 517: fax CL3: P Ñ D L Y X F 804: recirculació 520: enviar per infraroig 432: més alt S23: la cortina de l' habitació 308: retallar 714: aparcament 225: vermell CP1: Francesc Ibáñez Fortuny 221: avançar 424: lluminositat 119: casset S24: s'han fet els amos d'Internet amb un sorteig 614: accesoris 989: medicina 237: pàgina CO1: Avinguda del Marquès de l'Argentera 995: directe 724: lloguer de cotxes 108: bloqueig S25: la història de la construcció naval i la navegació 185: refer Y24: encapçalament 305: modificar CO2: Lleida S26: un capítol de la història CQ1: NO S27: al llarg de la llengua des de Smiltyné CQ2: SI S28: Hi veig massa gent allí dalt. CK1: signe d'exclamació S29: sense accents , oi? CK2: punt i coma S30: el minibar es pot llogar CW1: www.periodistadigital.com CW2: jmv4@altavista.com 10 References and bibliography [1] Diehl, Frank (Temic), Fischer, Volker (IBM), Kiessling, Andreas (Ericsson) Marasek, Krzysztof (Sony) “Specification of Databases – Specification of recording 47 Annexa 3. Catalan Database for Consumer Applications scenarios” Deliverables D211, 212, 213, 214, 215 of the SpeeCon Project . http://www.speecon.com [2] Henk van den Heuvel (SPEX), Shaunie Shammass (NSC), Ami Moyal (NSC), Oren Gedge (NSC) “Definition of Validation Criteria” Deliverable D41 of the SpeeCon Project . http://www.speecon.com [3] Joan Veny, Els parlars catalans, Edit. Moll, Mallorca, 1993. 11 Appendix 1. 11.1 List of Proper Names (CP1) Adrià Carmona Busquets Agustí Andreu Alba Borràs Ros Albert Àlex Alexandre Lozano Alfons Martínez Virgili Alícia Sala Domínguez Andrea Segura Prat Andreu Àngela Romero Alsina Àngel Flores Àngels Ruiz López Anna Jiménez Anna María Antoni Antònia Gómez Ariadna Puig Arnau Rovira Ibars Aurora Ortiz Carla Carles Carme Carolina Esteve Catalina Sureda Chrístian Márquez Roure Clara Moreno Lara Clàudia Suárez Concepció Consol Tarrés Cristina Daniel Teixidor Oró David Dolors Eduard Arias Valls Elena Riera Elisabet Emília Pelegrí Carrasco Emili Luque Encarnació Velasco Enric Duran Eric Camps Casadevall Esther Santos Eva Ferrer Vidal Fèlix Molina Ferran Masip Figueras Francesc Ibáñez Fortuny Francesc Xavier Casellas Delgado Francisca Badia Ramírez Gabriel Godia Gemma Gerard Sabaté Rodríguez Glòria Farré Morell Guillem Mir Ignasi Inmaculada Masó Mas Irene Gallardo Planas Isabel Ivan Caballero Jaume Vila Jesús Dalmau Joana Comas Pascual Joan Carles Joan Manel Joan Vázquez Joaquim Joel Medina Casals Jordi Navarro Font Josefa Josefina Jové Josep Agustí Josep Lluís Josep Maria Solsona Nieto Judith Palau Marín Júlia Herrera Guerrero Juli Blanco Ventura Laia Frigola Laura Boada Alonso Lídia Mora Torrelles Llucia Lluís Lluisa Lourdes Carrillo Cruz Magdalena Álvarez Manel Heredia Marc Cano Marcel Margarita Maria Maria Àngels Pagès Gil 48 Annexa 3. Catalan Database for Consumer Applications Maria Antònia María del Carme Serra Soler Maria del Mar Maria Dolors Maria Isabel Piñol Pérez Maria Jesús Maria Josefa Foix Coll Maria Lluisa Sánchez Feliu Maria Rosa García Maria Teresa Marina Ballesté Marta Miró Martí Merce Meritxell Batlle Miquel Miquel Àngel Guiu Mireia Míriam Mohamed Cortès Mònica Méndez Torres Montserrat Castelló Carreras Natàlia Núria Pujol Olga Florensa Vargas Oriol Reyes Castro Òscar Coromines Patrícia Nuñez Pau Hernández Pastor Paula Pere Pilar Costa Quintana Pol Rafael Ramona Fuentes Serrano Ramon Rigau Raquel Raül Grau Ricard Aguilar Robert Roger Rosa Dolcet Rosa Maria Ortega Roser Sanz Iglesias Salvador Mateu Sandra Sara Díaz Sebastiá Peña Sergi Gutiérrez Oliveras Sílvia Ribes Sònia Rosell Susanna Teresa Moyà Soriano Tomàs Vanessa Roig Verònica Bosch Víctor Victòria Xavier Pla Casas Yolanda Tomàs Hidalgo 11.2 City/street names (CO1-2) 11.2.1 City Names Abrera Agramunt Aitona Alcanar Alcarràs Alcover Alella Alfarràs Alguaire Almacelles Almenar Alpicat Altafulla Amer Amposta Anglès Arbeca Arbúcies Arenys de Mar Arenys de Munt Argentona Artés Artesa de Segre Badalona Badia del Vallès Balaguer Balenyà Balsareny Banyoles Barberà del Vallès Barcelona Begues Begur Bell-lloc d'Urgell Bell-lloc d'Urgell Bellpuig Bellvís Berga Bescanó Bigues i Riells Blanes Breda Cabrera de Mar Cabrils Calaf Calafell Caldes de Malavella Caldes de Montbui Calella Calonge Camarles Cambrils Campdevànol Camprodon Canet de Mar Canovelles Capellades Cardedeu Cardona Cassà de la Selva Castellar del Vallès Castellbell i el Vilar Castellbisbal Castelldefels Castelló d'Empúries Celrà Centelles Cerdanyola del Vallès Cervelló Cervera Constantí Corbera de Llobregat Cornellà de Llobregat Cubelles Cunit Deltebre Dosrius el Masnou el Morell el Papiol el Perelló 49 Annexa 3. Catalan Database for Consumer Applications el Pont de Suert el Pont de Vilomara Rocafort el Prat de Llobregat els Pallaresos el Vendrell Esparreguera Esplugues de Llobregat Falset Figueres Flix Franqueses del Vallès les Gandesa Gavà Gelida Girona Gironella Granollers Guissona Hostalric Igualada Juneda la Bisbal de l'Empordà la Garriga la Jonquera l'Aldea la Llagosta l'Ametlla del Vallès l'Ametlla de Mar la Palma de Cervelló la Pobla de Segur l'Arboç la Roca del Vallès la Selva del Camp la Sénia la Seu d'Urgell la Vall d'en Bas les Borges Blanques l'Escala les Masies de Voltregà l'Espluga de Francolí l'Hospitalet de Llobregat Linyola Llagostera Llançà Lleida Lliçà d'Amunt Lliçà de Vall Llinars del Vallès Lloret de Mar Maçanet de la Selva Malgrat de Mar Manlleu Manresa Martorell Martorelles Masquefa Matadepera Mataró Moià Molins de Rei Mollerussa Mollet del Vallès Monistrol de Montserrat i Montblanc Montcada i Reixac Montgat Montmeló Montornès del Vallès Mont-roig del Camp Móra d'Ebre Móra la Nova Navarcles Navàs Òdena Olesa de Montserrat Olot Palafolls Palafrugell Palamós Palau de Plegamans Pallejà Parets del Vallès Piera Pineda de Mar Platja d'Aro Polinyà Ponts Porqueres Prats de Lluçanès Premià de Dalt Premià de Mar Puigcerdà Puig-reig Reus Ribes de Freser Ripoll Ripollet Riudoms Roda de Barà Roda de Ter Roquetes Roses Rubí Sabadell Sallent Salou Salt Santa Bàrbara Santa Coloma de Cervelló Santa Coloma de Farners Santa Coloma de Gramenet Santa Coloma de Queralt Santa Cristina d'Aro Sant Adrià de Besòs Santa Eulàlia de Ronçana Santa Margarida de Montbui Santa Margarida i els Monjos Santa Maria de Palautordera Sant Andreu de la Barca Sant Andreu de Llavaneres Sant Antoni de Vilamajor Santa Perpètua de Mogoda Sant Boi de Llobregat Sant Carles de la Ràpita Sant Celoni 50 Sant Climent de Llobregat Sant Cugat del Vallès Sant Esteve Sesrovires Sant Feliu de Codines Sant Feliu de Guíxols Sant Feliu de Llobregat Sant Fost de Campsentelles Sant Fruitós de Bages Sant Gregori Sant Hilari Sacalm Sant Hipòlit de Voltregà Sant Jaume d'Enveja Sant Joan de les Abadesses Sant Joan Despí Sant Joan de Vilatorrada Sant Joan les Fonts Sant Just Desvern Sant Martí Sarroca Santpedor Sant Pere de Ribes Sant Pere de Riudebitlles Sant Pere de Torelló Sant Pol de Mar Sant Quirze del Vallès Sant Sadurní d'Anoia Sant Vicenç de Castellet Sant Vicenç dels Horts Sant Vicenç de Montalt Sarrià de Ter Sentmenat Seva Sils Sitges Solsona Sort Subirats Súria Taradell Tarragona Tàrrega Teià Terrassa Tiana Tona Tordera Torelló Torredembarra Torregrossa Torrelles de Llobregat Torroella de Montgrí Tortosa Tossa de Mar Tremp Ulldecona Vallirana Valls Vandellòs i l'Hospitalet de l'Infan Vic Vidreres Vielha e Mijaran Viladecans Viladecavalls Vilafant Annexa 3. Catalan Database for Consumer Applications Vilafranca del Penedès Vilanova del Camí Vilanova i la Geltrú Vila-seca Vilassar de Dalt Vilassar de Mar Vilobí d'Onyar 11.2.2 Street Names Avinguda de Joan XXIII Avinguda de Josep Tarradellas Avinguda de J V Foix Avinguda de la Catedral Avinguda de la Mare de Déu de Montserrat Avinguda de la República Argentina Avinguda del Carrilet Avinguda de les Drassanes Avinguda de l'Hospital Militar Avinguda del Litoral Avinguda del Marquès de l'Argentera Avinguda del Paral·lel Avinguda del Portal de l'Àngel Avinguda dels Ports d'Europa Avinguda del Tibidabo Avinguda de Madrid Avinguda de Pau Casals Avinguda de Pedralbes Avinguda de Rius i Taulet Avinguda de Roma Avinguda de Sarrià Avinguda de Vallvidrera Avinguda de Vilanova Avinguda Diagonal Avinguda d'Icària Avinguda Meridiana Baixada de Santa Eulàlia Camí del Faig Camí del Pla Llorens Camí del Polvorí Camí dels Àngels Carrer Albesa Carrer Can Mallol Carrer Comtal Carrer d'Abdó Terradas Carrer d'Adolf Florensa Carrer d'Antoni Costa Carrer d'Aragó Carrer d'Aribau Carrer d'Armengol Carrer d'Arquímedes Carrer de Badosa Carrer de Bailèn Carrer de Balmes Carrer de Batista i Roca Carrer de Binèfar Carrer de Bonveí Carrer de Budapest Carrer de Buïgas Carrer de Calàbria Carrer de Calders Carrer de Can Bruixa Carrer de Casals Carrer de Casanova Carrer de Casp Carrer de Castelló Carrer de Colom Carrer de Croia Carrer de Cucurulla Carrer de Ferran Agulló Carrer de Fra Juníper Serra Carrer de Garriga i Roca Carrer de Girona Carrer de Grau Carrer de Gravina Carrer de Gualbes Carrer de Guerau de Liost Carrer de Jaume Cabrera Carrer de Joan Cortada Carrer de la Creu Coberta Carrer de la Democràcia Carrer de la Diputació Carrer de la Infanta Isabel Carrer de la Lleialtat Carrer de l'Alzina Carrer de la Mercè Carrer de la Metal·lúrgia Carrer de la Química Carrer de l'Arc de Sant Onofre Carrer de l'Arc de Sant Ramon del Call Carrer de la Residència Carrer de l'Argenteria Carrer de l'Arquitecte Moragas Carrer de la Sagrada Família Carrer de las Navas de Tolosa Carrer de la Torre dels Pardals Carrer del Bruc Carrer del Canó Carrer del Capità Arenas Carrer del Castell d'Argençola Carrer del Civader Carrer del Comte d'Urgell Carrer del Conflent Carrer del Consell de Cent Carrer del Doctor Roux Carrer de les Açores Carrer de les Fages Carrer de les Semoleres Carrer de l'Institut Frenopàtic Carrer de l'Isard Carrer del Marquès de Santillana Carrer del Mestre Dalmau Carrer del Pacífic Carrer del Pedraforca Carrer del Port de la Selva Carrer del Príncep Jordi Carrer del Rector Ubach Carrer del Rosselló Carrer dels Gegants Carrer dels Vivers Carrer del Terç de la Mare de Déu de Montserrat 51 Annexa 3. Catalan Database for Consumer Applications Carrer de l'Ullastre Carrer del Vinyar Carrer d'Emèrita Augusta Carrer de Montagut Carrer de Montanyans Carrer de Muntaner Carrer d'Enric Granados Carrer d'en Roig Carrer d'Entença Carrer de Numància Carrer de Obradors Carrer de Palau Solità Carrer de Pamplona Carrer de Pau Claris Carrer de Portlligat Carrer de Provença Carrer de Puig i Cadafalch Carrer de Rafael Capdevila Carrer de Ribes Carrer de Rocafort Carrer de Rogent Carrer de Roger de Llúria Carrer de Santa Mònica Carrer de Sant Cugat del Vallès Carrer de Sant Elm Carrer de Sant Hipòlit Carrer de Sant Pau Carrer de Sants Carrer de Sas Carrer de Solà Carrer de Tarragona Carrer de Trafalgar Carrer de València Carrer de Viladomat Carrer de Vilassar Carrer de Villarroel Carrer d'Òtranto Carrer Gran de Gràcia Carrer Gran de Sant Andreu Carrer Nou de la Rambla Carrer Nou de Zurbano Carretera del Prat Carretera de Vista Rica Carretera l'Observatori Fabra Gran Via de Carles III Gran Via de les Corts Catalanes Jardins de Ca n'Altamira Jardins de Can Brasó Jardins de Fabià Puigserver Jardins del Bosquet dels Encants Jardins del Camp de la Bota Jardins de Maria Luz Morales la Rambla Moll de Barcelona Moll de Catalunya Moll de Marina Moll de Pescadors Moll per a Líquids Parc Creueta del Coll Parc de Bederrida Parc de Josep Maria Serra Martí Parc de la Ciutadella Parc de la Guineueta Parc de La Maquinista de Sant Andreu Parc de la Vall d'Hebron Parc del Carmel Parc de les Aigües Parc de les Heures Parc de l'Espanya Industrial Parc de l'Estació del Nord Parc del Laberint Parc del Turó de la Peira Parc de Montjuïc Parc de Pedralbes Parc Forestal del Tibidabo Parc Forestal del Turó de Maria Parc Güell Passatge de Borrell Passatge de Colomer Passatge de Còrsega Passatge de Feliu Casanova Passatge de Foret Passatge de Joaquim Rita Passatge de Jordi Ferran Passatge del Mig del Turó Passatge del Rector Oliveras Passatge de Lucà Passatge de Maluquer Passatge de Mas de Roda Passatge de Mercedes Passatge de Xile Passeig de Colom Passeig de Fabra i Puig Passeig de Gràcia Passeig de la Bonanova Passeig de la Font de la Mulassa Passeig de la Mare de Déu del Coll Passeig de la Vall d'Hebron Passeig de la Zona Franca Passeig del Born Passeig de Lluís Companys Passeig de Manuel Girona Passeig de Maragall Passeig de Santa Coloma Passeig de Sant Joan Passeig de Torras i Bages Passeig de Valldaura Passeig de Verdum Passeig Mare de Déu de la Mercè Passeig Marítim de la Barceloneta Passeig Olímpic Plaça de Calvó Plaça de Catalunya Plaça de Fra Eloi de Bianya Plaça de Francesc Macià Plaça de Joan Casanelles Plaça de la Sagrada Família Plaça del Comte de Sert Plaça del Diamant Plaça de les Glòries Catalanes Plaça de Lesseps Plaça de l'Estació Plaça dels Campions Plaça del Sortidor Plaça dels Països Catalans Plaça de Mossèn Ferran Palau 52 Annexa 3. Catalan Database for Consumer Applications Plaça de Nemesi Ponsati Plaça de Sants Plaça d'Espanya Plaça de Tirant Lo Blanc Plaça d'Ildefons Cerdà Plaça d'Urquinaona Pla de la Boqueria Pla de Palau Port Olímpic Rambla de Catalunya Rambla de Fabra i Puig Rambla de Guipúscoa Rambla del Carmel Rambla de l'Onze de Setembre Rambla del Poblenou Rambla del Raval Rambla de Mar Rambla de Prim Riera Blanca Ronda de Dalt Ronda de la Universitat Ronda del General Mitre Ronda del Guinardó Ronda del Litoral Ronda del Mig Ronda de Sant Antoni Ronda de Sant Pau Ronda de Sant Pere Rotonda de Bellesguard Torrent de Tapioles Torrent Pantà Travessera de Dalt Travessera de Gràcia Travessera de les Corts Via Augusta Via Favència Via Júlia Via Laietana 11.3 Email & Web addresses (CW1-2) 11.3.1 CW1: WWW address from a list of 150 URL-addresses 3xl.net altavista.com avui.com barrapunto.com bbva.es boe.es cat365.net ebankinter.com ebay.com edu365.com elcorreodigital.com elmundodeportivo.es elmundo.es elotrolado.net e-noticies.com fcbarcelona.com franjadeponent.net gencat.net hotmail.com hp.com http://graciaweb.com/portal/inici.htm http://personal4.iddeo.es/joansolerm http://www.aeic.es/cat/index.htm http://www.agtat.es http://www.alresoft.com http://www.amunt.net http://www.as.com http://www.cerdanyaonline.com http://www.ciberdiari.com http://www.comarcalia.com http://www.costabrava.org http://www.diaridebarcelona.com/ http://www.diaridegirona.es http://www.dlleure.com/index_c.php http://www.elbalco.com http://www.elguaitador.com http://www.elpunt.com http://www.elsesports.com http://www.entitats.info http://www.esportcatala.com http://www.estelnet.com http://www.eterk.com http://www.festes.org http://www.futcat.org http://www.gironacomercial.info http://www.gironit.com http://www.grec.net http://www.imatica.org http://www.infolleida.com http://www.jopers.cjb.net http://www.manresa.net http://www.noticies.net http://www.pangea.org/index.ca.html http://www.prioratdigital.com http://www.puntbarra.com http://www.quinspreus.com http://www.sindicat.net http://www.smscatala.com http://www.softcatala.org http://www.solidaries.org http://www.tarragona.creativeweb.es/intern et http://www.teatral.net/asp/index.asp http://www.vallesnet.org http://www.xarxanet.org kissfm.es lamalla.net lanetro.com latecla.com lavanguardia.es lleidatur.es lycos.es 53 Annexa 3. Catalan Database for Consumer Applications m80radio.com marca.com movistar.com ondacero.es ozu.es patagon.es plus.es portalmix.com regio7.com repsol-ypf.com rockcatala.com sport.es telecinco.es telefonica.es telepolis.com terra.es tvcatalunya.com uned.es uoc.edu verbalia.com viatgeaddictes.com vilaweb.com vodafone.es wanadoo.es www.abc.es www.aeat.es www.amena.com www.bandaancha.st www.banesto.es www.barcelona-on-line.es/cat/index.asp www.barcelonaturisme.com/home2.htm www.bcn.es www.cadena100.es www.cadenaser.com www.cajamadrid.es www.catalunyaonline.com www.cat-linux.com www.cincodias.com www.cocacola.es www.educaweb.com/cat/default.asp www.elcorteingles.es www.elpais.es www.elperiodico.com www.everyday.com www.flaixfm.net www.fnac.es www.google.es www.gruposantander.es www.heraldo.es www.hispavista.com www.iberia.com www.iespana.es www.ine.es www.infojobs.net www.invertia.com www.ircap.com www.irc-hispano.org www.lacaixa.es www.latinmail.com www.lavozdegalicia.es www.libertaddigital.com www.llibertat.com www.los40.com www.meristation.com www.microsoft.com www.msn.es www.paginasamarillas.es www.periodistadigital.com www.racocatala.com www.softonic.com www.tiscali.es www.uab.edu www.ub.edu www.uji.es www.universia.es www.upc.edu www.upf.edu ya.com yahoo.es 11.3.2 CW2: Email drawn from a set of 550 e-mail addresses a1188383@hp.com a2070119@bcn.es a214362@graciaweb.com a2144829@gabarro.org a2674537@infojobs.es a2985911@teleline.es a3948738@earthlink.net a4107838@yahoo.com a4385708@t-systems.es a498918@email.com a5592705@jopers.cjb.net a6251336@telefonica.net a6659588@cerdanyaonline.com a6682220@siemens.com a6740328@yahoo.com a7605700@pacbell.net a7804314@online.es a9060423@lycos.com a9215394@excite.com a9433530@msn.es a9617718@comcast.net a9989393@ya.es aan2@cat365.net abm23@softcatala.org acr16@ciudad.com.ar adl12@gmail.com adria.casas.gil@wanadoo.com adria_ortega_gutierrez@yahoo.com afd@lycos.es afr@lacaixa.es agr@urv.net agusti106@dcs.shef.ac.uk 54 Annexa 3. Catalan Database for Consumer Applications agusti_godia_rodriguez@gmail.com agusti_pla_molina@msn.com agusti_tomas_moya@lycos.com ahb@hush.ai ahr@yahoo.es aic@menta.es ais@let.ru.nl ajf15@sindicat.net alba97@yahoo.com alba_ferrer_grau@ya.es alba_figueras_flores@wildmail.com alba.hernandez.ramos@ya.com alba_medina_miro@hispavista.com alba.ramos.vidal@e-noticies.com albert136@gmail.com albert65@astalavista.net alex.alonso.amador@email.com alexandre17@sant-andreu.com alexandre20@163.net alex.pons.gonzalez@gmail.com alfons15@costabrava.org alicia106@mail.com alicia_grau_serra@esportcatala.com alrinoi6@xrice.com amc18@yahoo.es ame11@menta.es ams@gencat.es anc@garonuna.com andrea98@nsc.co.il angel9@costabrava.org angela66@garonuna.com angela_garcia_rubio@telefonica.net angela_roig_calvo@ozu.es angels_molina_flores@aaiworldmarket.com angels_oliveras_torres@online.com anna26@wanadoo.com anna_maria.mora.aguila@astor.urv.es anna_santos_nieto@gmail.com anolfi21@mindspring.com anr@aol.com antoni_caballero_sureda@juneisp.biz antoni_gimenez_gracia@ozu.es arg7@lycos.es arnau.gil.gomez@gte.net arnau.roca.badia@charter.net ars25@gmx.net ash@vilaweb.com aurora85@lycos.com aurora_carmona_gutierrez@sant-andreu.com aurora_romero_cortes@uab.edu avv@geocities.com b8910546@cst.dk baersej14@bcn.es balgeczim11@ya.com beguhu27@leaveform.com bueimoz10@msn.es bulcainob26@til.upf.edu carles114@gte.net carme_duran_pujol@uoc.edu carolina_calvo_castillo@bellatlantic.net catalina.castro.vazquez@yahoo.com catalina.jove.molina@msn.es cca11@gte.net ccd@regions.com cch@yahoo.es cfc@rugbycatala.org cfg8@didamail.com cfo@ditifet.com cgn7@menta.es christian196@pangea.org christian_iglesias_garcia@copernic.com cipiurip0@yahoo.es cit16@t-online.de clara_munoz_puig@prodigy.net claudia178@personal4.iddeo.es claudia_hernandez_serra@geocities.com cmv@email.com concepcio125@hotmail.com concepcio.ramos.domingo@astalavista.net concepcio.roig.fernandez@gencat.es consol.duran.gutierrez@siemens.com consol.solsona.sanz@uab.edu coznahot5@netscape.net cpo4@cerdanyaonline.com crg@menta.es cristina183@grn.es csp@infojobs.es cunaza21@ikp.uni-bonn.de cuxojme26@uab.edu david.martin.comas@menta.es david.vazquez.garrido@netscape.net dds2@ebay.com derdexlu22@fib.upc.edu deryidmi29@lycos.com devvueh14@turismerural.com dgt22@sant-andreu.com dmd8@inf.ed.ac.uk doisiyru16@sant-andreu.com dolors_quintana_martinez@online.es drh17@uab.edu e1192330@leaveform.com e1591795@mat.upc.es e5411699@gitdoc.net e5495476@telefonica.com e6423322@zoog01.com e7910481@estelnet.com e8587591@verizon.net e8769183@gesi.biz e8803432@msn.es e9039973@optonline.com e9913586@netscape.net ebgenfe0@dukas.upc.es edg@ya.com eduard163@wanadoo.es eduard_costa_ortega@lacaixa.es efi21@tiscali.net efr5@telefonica.com elena123@aol.com elena.grau.puig@gmail.com elisabet17@mailpanda.com elisabet.rubio.guerrero@oportunitats.com elm15@elda.org elm@gps.tsc.upc.edu elsoxjoz19@yahoo.es emili39@charter.net emilia_masip_guerrero@retena.com 55 Annexa 3. Catalan Database for Consumer Applications emilia.ramos.fernandez@netscape.net emili_rovira_navarro@hotmail.com encarnacio.alonso.gil@ed.ac.uk encarnacio.bonet.bosch@msn.es encarnacio_carmona_cortes@festes.org encarnacio.fernandez.lara@tiscali.com encarnacio_gimenez_cano@ameritech.net encarnacio.martinez.vila@rugbycatala.org epf7@angelfire.com erfotem28@catalunyarock.com eric.perez.domingo@barcelonaturisme.com esa26@rugbycatala.org esm18@bbvanet.com esther_castro_romero@grec.net eva_hernandez_martin@t-online.de evc@tvcatalunya.com fagizmer17@mchp.siemens.de fcr@oportunitats.com felix2@bcn.es felix.ortega.martinez@mat.upf.edu ferran57@engr.smu.edu fororxa21@winterthur.es francesc155@wanadoo.es francesc_mora_torrelles@t-online.de francesc_xavier_dalmau_alvarez@email.com francesc_xavier_font_dalmau@telefonica.net francesc_xavier.heredia.bosch@nsc.co.il francisca159@pacbell.net g1072360@student.kuleuven.ac.be g5867134@bcn.es g999736@webtv.net ganxoipa16@astalavista.com gemma12@gmail.com gerard_alvarez_pons@alltheweb.com gerard_ramos_sanchez@vilaweb.com gfo1@pacbell.net ghm21@online.com gloria29@cajamadrid.es gloria_cano_recasens@nosaltres.com gloria_ramos_teixido@excite.com gmh20@asistecmo.com grr@lleidatur.es guillem.alonso.hernandez@graciaweb.com guillem.dominguez.ramos@vilaweb.com guillem.espinosa.gomez@yahoo.es hovemeb27@flaixfm.net hurgibot8@gitdoc.net huthenla18@mail.com i1022974@uab.edu i1453048@ebay.com i2021920@online.es i250754@mail.com i2559149@lacaixa.es i3347117@infojobs.es i5192558@smscatala.com i653209@gesi.biz i6861873@yahoo.co.uk i8038010@menta.net igc6@netscape.net ignasi2@xrice.com ihf21@videotron.ca iluxtem28@verizon.com inmaculada9@geocities.com inmaculada.mora.serra@yahoo.es inmaculada_ramos_medina@wanadoo.es int24@siemens.com ipzemo8@engr.smu.edu iro26@teleline.es isabel_duran_casas@attbi.com isabel_marti_morales@cajamadrid.es ivan86@vallesnet.org ivan_fortuny_pages@bcn.es ivan_pujol_fuentes@mailpanda.com ivan.roca.aguilar@inf.ed.ac.uk ivan.romero.ruiz@ya.com ivb18@optonline.com ivm@terra.es iymosva21@yahoo.es jap@costabravagirona.com jaume_alvarez_perez@i6.informatik.rwth-aachen.de jco20@estelnet.com jcv13@elpunt.com jdj@tiscali.es jdr13@fib.upc.edu jesus87@hush.ai jesusdelgado127@infolleida.com jesus_jimenez_serra@telefonica.net jevevus10@caixacatalunya.es jeyiejvon21@menta.net jgs2@educaweb.com jmd@uab.edu jmp25@teleline.es jmv4@altavista.com joana_lopez_font@quinspreus.com joana_vidal_gil@teleline.es joan_carles_lozano_delgado@yahoo.es joan_manel_morell_font@aol.com joaquim.godia.castro@urv.net joel133@elguaitador.com joemga13@gmail.com jordi.vazquez.torres@lacaixa.es josefa115@quinspreus.com josefa.martin.camps@vallesnet.org josefa_serrano_balleste@aaiworldmarket.com josefina188@comarcalia.com josefina20@prioratdigital.com josep_aguilar_rodriguez@web.de josep_lluis127@list.ru josep_lluis.pifarre.puig@ikp.uni-bonn.de josep_maria151@bcn.es josep_maria.blanco.sanchez@grec.net josep_maria.casademont.rovira@dlleure.com josep_maria_rodriguez_hernandez@videotron.ca josep_maria.serra.batlle@ed.ac.uk josep.molina.casas@catalunyagolf.com josep.rovira.mas@uab.edu jpr27@leaveform.com jrb@clemzy.com jsf@ctt.upc.edu jtr26@gmail.com judith161@uoc.edu juli32@optonline.com julia191@ya.com julia_gomez_marti@upc.edu jvc@softcatala.org jvr21@didamail.com 56 Annexa 3. Catalan Database for Consumer Applications laaireg11@citibank.com lac6@alltheweb.com laia113@gmail.com laia116@online.es laura160@ebay.com lbs@email.com lhl17@aol.com lidia175@optonline.com lidia36@pacbell.net lihsupom18@ikp.uni-bonn.de llucia.sole.lara@wanadoo.com lluis120@gmail.com lluisa_herrera_ortiz@terra.es lmp@bk.ru lourdes.serra.pujol@hotmail.passport.com lsv@agbar.es maamza22@caixacatalunya.es maf@caixacatalunya.es magdalena101@moneo.upc.es magdalena125@wanadoo.es magdalena_sola_pujol@menta.net magdalena_valls_florensa@ditifet.com manel_iglesias_pujol@tvcatalunya.com marc44@catalunyagolf.com marc_blanco_grau@gabarro.org marcel.jimenez.cortes@optonline.net marcel.vidal.diaz@agtat.es marc_gomez_casals@t-online.de marc.segura.roig@bigpond.com margarita2@gitdoc.net maria56@prioratdigital.com maria_del_carme35@yahoo.com maria_del_carme_gonzalez_ortega@earthlink.n et maria_del_carme.torres.molina@yahoo.es maria_del_mar_lopez_puig@popd.ix.netcom.co m maria_dolors36@entitats.info maria_dolors.cano.pages@yahoo.com maria_isabel136@cs.com maria_isabel_caballero_sanchez@tinet.org maria_isabel.dominguez.roca@yahoo.com maria_isabel_ortiz_molina@copernic.com maria_jesus46@gesi.biz maria_jesus78@yahoo.es maria_jesus_carrasco_gracia@globetrotter.net maria_jesus.ramirez.vicente@fib.upc.edu maria_lluisa_aguilar_roca@dcemail.com maria_lluisa_ortiz_boada@terra.es maria_pons_castro@yahoo.com maria_rosa_gimenez_parra@gmail.com maria_rosa_roca_costa@costadaurada.org maria_teresa_ramirez_medina@myway.com maria_teresa_suarez_puig@icqmail.com marina.rodriguez.carreras@ya.com marta195@winterthur.es marta_batlle_ferrer@gironit.com marti_vila_coll@ponyexpress.net mba3@msn.es mcc@manresa.net mdc25@wildmail.com mdg29@menta.net mdr26@barcelonaturisme.com merce68@die.upm.es meritxell.dominguez.vazquez@geocities.com meritxell_roca_pascual@netscape.net mfc@tvcatalunya.com mff@menta.net mfm@gmail.com mge@ebay.com mgg@telefonica.es mgr15@yahoo.es mgs1@gmail.com mgs22@asistecmo.com miquel_angel164@ed.ac.uk mireia193@tvcatalunya.com miriam187@lacaixa.es miv@ozu.es mixusziy23@t-online.de mmd28@menta.es mmj11@pacbell.net mms12@yahoo.com mocebiuj29@hp.com mohamed97@gmail.com mohamed_herrera_romero@virtualgalon.com monica117@costabrava.org monica189@opd.ub.edu monica_hernandez_lara@ciudad.com.ar monica_serra_herrera@list.ru montserrat.rubio.grau@tinetdigital.org mpm12@elpunt.com mpr2@mchp.siemens.de mpr@ebay.com mrc@osona.com mrm@videotron.ca mrs15@email.com mrt5@wanadoo.es msb@lycos.es msd22@elbalco.com msm19@telefonica.es msm26@quinspreus.com msm@excite.com msm@pie.xtec.es msp@cajamadrid.es mtp27@msn.com mubbaiu16@flashmail.com mupbactoz26@astor.urv.es nenavis7@3xl.net nidipor9@sindicat.net norjumai18@zoog01.com nsq@concentric.net nubefrox26@telefonica.net nuria.campos.herrera@gmx.net nurizin19@racocatala.com o1230325@menta.net o1387048@yahoo.com o2018208@menta.es o3253677@lycos.es o385337@comcast.net o3885756@costadaurada.org o3993639@turismerural.com o4029593@upc.edu o4276047@xrice.com o4639051@gencat.es o4654232@gitdoc.net o5151601@rugbycatala.org 57 Annexa 3. Catalan Database for Consumer Applications o609000@list.ru o6591726@ebay.com o7338883@hotmail.com o7690716@mail.com o7759535@llibertat.com o9203538@telefonica.net o9731572@gencat.es occ14@telefonica.es ocv2@lsi.upc.edu ohj4@splash.upc.es oliijti13@softcatala.org omf7@zurich.com opgupoi12@netscape.net opp@pacbell.net oscar139@ciudad.com.ar oscar_cruz_gracia@tvcatalunya.com osgaget4@jopers.cjb.net ovdimo14@telefonica.com ozdirdi7@pangea.org pau.duran.martin@hotmail.passport.com paula36@garonuna.com paula.castillo.guerrero@regions.com pdl14@lycos.es peetut13@costabravagirona.com pexemat11@bigfoot.com pilar43@msn.com pmc17@netscape.net pmm24@trm.ub.edu pol115@mat.upf.edu pol_vicente_perez@elpunt.com pos@ctt.upc.edu ppc3@terra.es ppp2@gmail.com psm16@netscape.net ptr@esportcatala.com puclomdo0@ub.edu puibiuf18@regio7.com r1294692@lycos.com r2979988@elguaitador.com r4720394@astalavista.box.sk r5143529@wanadoo.com r645747@msn.es r9226637@teatral.net rafael.palau.ortega@ya.com ramon190@garonuna.com ramon92@telefonica.net ramona_rubio_fernandez@zipmail.com.br ramon_rubio_romero@siemens.com raquel_marin_solsona@ponyexpress.net raquel_serrano_torrelles@i6.informatik.rwthaachen.de rca@oportunitats.com rdr@mindspring.com relefpa22@barcelona-on-line.es rfp4@sindicat.net rfp@uab.edu rft8@citibank.com ricard.arias.gomez@manresa.net ricard.duran.molina@t-online.de ricard.herrera.flores@catalunyarock.com ricard.nieto.hernandez@gmail.com rizejhis14@ponyexpress.net rmn11@cdti.es rmr11@infojobs.es rojabuj17@som-hi.com rorxetvuv19@netscape.net rosa168@elbalco.com rosa_maria.comas.ibanez@gitdoc.net rosa_maria.soler.heredia@verizon.com roser_cano_vazquez@citibank.com roser_dominguez_roca@geocities.com rpc2@infojobs.es rpg11@online.com rpm@sbcglobal.net s335170@esportcatala.com salvador_casals_ramos@alltheweb.com salvador.costa.tomas@terra.es salvador_mas_diaz@ya.com sandra6@ciclismegirona.com sandra_torrelles_sanchez@gamebox.net sara115@retena.com sebastia120@barcelona-on-line.es sfc@aol.com shr21@atletisme.com sih@gmail.com smd15@lycos.com smh12@cajamadrid.es smp@die.upm.es smr@menta.net sonia107@upf.edu sonia76@prodigy.net susanna_pujol_rodriguez@tiscali.es teresa74@netscape.net teresa.fernandez.pena@telefonica.com teresa.ramirez.delgado@student.kuleuven.ac.be tichodlu14@tiscali.es ticvundul24@winterthur.es tiimune15@grn.es tomas_ortega_soler@tarragona.creativeweb.es u1442199@yahoo.com u3043818@telefonica.net u3858371@163.net u830108@menta.net ubzehfo26@garonuna.com udriut26@cableway.net uygebah7@copernic.com uzleloh7@astor.urv.es vcr@pqr.ub.edu vdg@costabravagirona.com vecaleb26@gmail.com veinujbox8@gesi.biz veronica_suarez_pascual@hispavista.com victoria60@mail.com victoria.cano.castillo@vilaweb.com victoria.florensa.gracia@festes.org victoria.rovira.sola@pangea.org victoria_ruiz_martin@msn.com victor.torres.gomez@elguaitador.com viromvev26@alresoft.com vmb7@videotron.ca vmp20@gmail.com vonleef10@gps.tsc.upc.edu voxdusuf19@verizon.net vuhfeir11@yahoo.es xahuphe7@bcn.es xodijrux2@videotron.ca 58 Annexa 3. Catalan Database for Consumer Applications xoemiex12@menta.es xojerye13@softcatala.org xotehu12@urv.net xrg23@gmail.com xudeci10@mchp.siemens.de xuriesa21@pacbell.net yecosuy29@hotmail.com yeiemjat9@hotmail.com yesidput24@backwards.com yihciliu13@ebay.com yiiroj20@ebay.com yijugso6@bk.ru yofijto19@catalunyagolf.com yolanda_pons_comas@cat-linux.com yolanda.sanchez.pascual@tinet.org zamucoh24@gironit.com zaxnulgev24@webtv.net 59 11.4 Application words. ID Semantic description English word 1.1 1.2 1.3 1.4 on off standby wake_up encendre apagar standby | inactivar activar cancel password cancel·lar contrasenya register | login registre | login keylock logout voice_control bloqueig sortir control_de_veu lock unlock bloquejar desbloquejar 1.13 1.14 1.15 activate a device deactivate a device put device into standby mode awaken device from standby mode cancel current operation indicate that you want to change password gain access to a protected device/mode activate keylock leave a protected mode activate speech recognition mode lock current settings, device etc. unlock current settings, device, etc. finalize an input (as <CR>) quit a running application to adjust settings enter exit | quit set introduir sortir configurar 1.16 1.17 1.18 1.19 device name: radio device name: camcorder device name: camera device name: cassette radio camcorder camera cassette ràdio videocàmera càmera casset 1.20 1.21 1.22 device name: CD player device name: CD writer device name: player/recorder CD CD-R DAT CD CD-R DAT 1.5 1.6 1.7 1.8 1.9 1.10 1.11 1.12 Comments Activating and deactivating the system Translation Devices DAT 1.23 1.24 1.25 1.26 1.27 1.28 1.29 1.30 1.31 1.32 1.33 1.34 device name: DVD player/writer device name: HiFi system device name: MiniDisk player/recorder device name: MP3 player device name: microphone device name: television device name: mobile telephone, cellular phone device name: PDA device name: set-top box device name: video device name: computer device name: phone DVD DVD HiFi MD HiFi MD MP3 microphone TV mobile | cellular MP3 micròfon TV mòbil PDA set-top_box video computer phone PDA descodificador vídeo ordinador telèfon start service, device, function start again end service, function temporarily interrupt the current function resume the current function open an object (file/item/window) close an object (file/item /window) confirm an operation activate help mode ask for information activate a SW assistant or contact a human assistant verify an operation check if any event has happened, e.g. check mail-box switch on or off or between, e.g. functions/menu items, etc. start | activate restart abort | end | stop pause iniciar | activar reiniciar anul·lar | finalitzar | parar pausa continue open continuar obrir close tancar confirm help information assistant confirmar ajuda informació assistent verify check verificar comprovar switch # switch_on # switch_off canviar # connectar desconnectar Operating devices 1.35 1.36 1.37 1.38 1.39 1.40 1.41 1.42 1.43 1.44 1.45 1.46 1.47 1.48 Voice control settings # 1.49 greeting the system (e.g. saying Hello enables the system to define English as dialogue language) list command & control vocabulary start a demo name of the current language (e.g. for German recordings select word Deutsch) tell the recognizer to learn a new word enter spelling mode enter digit mode to record a command or message hello hola vocabulary vocabulari demo <language> demostració <idioma> learn | train aprendre | ensenyar spell digit record gramàtica dígit gravar timer temporitzador clock rellotge time hora date data alarm alarma daily diari weekly setmanal monthly mensual 1.65 1.66 address timer function; activate/program timer function address clock function; activate clock address time function; announce/display/set time address date function; announce/display/set date address alarm function; activate alarm function activate device or function daily at a preset time activate device or function weekly at a preset time activate device or function monthly at a preset time go to an earlier time go to a later time earlier later abans després 1.67 program advanced features or program 1.50 1.51 1.52 1.53 1.54 1.55 1.56 Timer control 1.57 1.58 1.59 1.60 1.61 1.62 1.63 1.64 Programming programa 1.68 1.69 1.70 1.71 1.72 1.73 1.74 options, or enter sub-menu to do this select automatic mode select manual mode reset to the system to default save current item, configuration, etc. initiate recording/activation of a macro enter settings menue; select settings functions noun, activate device/system control automatic manual reset save | store automàtic manual reiniciar memoritzar / guardar macro macro settings opcions control control System status 1.75 1.76 1.77 1.78 show the status of the system show battery status display signal strength activate checking procedure status battery signal_strength self-test 1.79 1.80 1.81 1.82 1.83 1.84 connect to network infrared interface select bluetooth operation address (internet) host go to network function synchronize data with entries network infrared bluetooth host network synchronize 1.85 try again to activate/redo redo | retry the last (unsuccessful) action (command, query, ...) again estat bateria força_de_senyal autoavaluació Connectivity PC xarxa infraroig bluetooth hoste xarxa sincronitzar refer | reintentar Directory navigation Calling up menus 2.1 2.2 go to a menu enter directory (e.g. of files) or list directory entries menu directory menú directori 2.3 2.4 2.5 show options from a menu, etc. go to a sub-menu, i.e. give more details to show an inventory of items options detail opcions detalls list llista Menu browsing 2.6 2.7 2.8 2.9 2.10 2.11 2.12 2.13 2.14 2.15 2.16 2.17 2.18 2.19 2.20 2.21 2.22 2.23 2.24 2.25 2.26 2.27 2.28 2.29 2.30 2.31 go to the beginning of a dialogue or to top-level menu noun or adverb, go to the beginning of list noun or adverb, go to the beginning of list noun or adverb, go to the beginning of list noun or adverb, go to the beginning of list go to end of list go to end of list go to end of list go up one item go up one item go down one item go down one item go left from present position go right from present position go forward one item in the list go forward one item in the list go back one item in the list go back one item in the list color: blue color: red color: yellow color: green color: black color: white search for an item ask the system to suggest home principal start principi top inici front amunt first primer/a bottom end last up plus down minus left right next forward previous back blue red yellow green black white search | lookup | find suggest final fi últim amunt més baixar menys esquerra dreta següent avançar anterior enrere blau vermell groc verd negre blanc buscar | cercar suggerir 2.35 2.36 2.37 something (e.g. hotel or a route) browse through a list conduct a fuzzy search based on written or spoken input (e.g. Krzysztof sounds like Kschischtof, reads like Christof) go to item, identified by name, number etc. go to row go to column go to page (+ number) 2.38 2.39 2.40 select menu/item select all items skip a marked item select | choose all skip | ignore 3.1 3.2 create a new item object of operation: entry; e.g. entry in a list or directory input an item dictate text change an entry correct an entry copy (to clipboard) cut from chosen source to chosen destination paste an entry insert an entry or item add an entry or an item create an entry or an item delete or remove an entry/item highlight points of interest sort elements by name, date, … undo last operation repeat last function replace an item new entry nou entrada enter dictate modify | change correct copy cut introduir dictar modificar | canviar corregir copiar retallar paste insert add create delete | remove |erase mark sort undo repeat replace enganxar insertar afegir crear esborrar | eliminar marcar classificar desfer repetir substituir 2.32 2.33 2.34 browse like examinar com go_to anar_a row column page fila columna pàgina Selecting items seleccionar | escollir tots ometre | ignorar Editing 3.3 3.4 3.5 3.6 3.7 3.8 3.9 3.10 3.11 3.12 3.13 3.14 3.15 3.16 3.17 3.18 3.19 3.20 3.21 3.22 accept current status/settings or confirm correctness edit a text, business card, entry, etc. edit/delete/go to number xxx politely ask for something okay | OK acceptar edit editar number please número si_us_plau Output control General 4.1 4.2 4.3 4.4 4.5 4.6 4.7 4.8 4.9 4.10 4.11 personal picture and/or sound set settings to normal/neutral reach the top value of a certain range reach the smallest value of a certain range set level (+ number, range etc.) set level high set level low set level medium increase or get more decrease or get less select channel/program (+ number or name) profile neutral | normal | default maximum perfil neutre | normal | per_defecte màxim minimum mínim level high low medium more less channel | program nivell alt baix mitjà més menys canal | programa VISUAL OUTPUT Display format 4.12 4.13 4.14 display results, file etc. enlarge window to full-scale hide window on a display show | display maximize hide | minimize 4.15 4.16 4.17 4.18 clear display zoom in zoom out determine format or scale of displayed object noun, e.g. scale of the map e.g. of an image in pixels refresh current display clear zoom_in zoom_out format | picture_size mostrar maximitzar ocultar | amagar | minimitzar esborrar apropar allunyar format | tamany_d’image scale resolution update | refresh escala resolució actualitzar 4.19 4.20 4.21 4.22 hold picture freeze congelar Brightness 4.23 4.24 4.25 4.26 initiate control of function enter brightness menu set picture more bright set picture less bright ‘light’ light llum brightness brighter darker lluminositat més_brillant més_fosc Color balance 4.27 enter and manipulate balance menu 4.28 4.29 4.30 enter contrast menu set contrast to be sharper set contrast to be softer color color color Contrast contrast sharper softer contrast més_contrast menys_contrast AUDIO OUTPUT Volume 4.31 4.32 4.33 4.34 4.35 4.36 enter volume menu increase volume decrease volume turn sound off turn sound on again let volume fade volume louder quieter mute | silent unmute fade volum més alt més baix silenci anul·lar_silenci esmorteir_so 4.37 4.38 4.39 choose sounds select melody/ring-tone function vibrator signal sounds melody | ring_tone vibrating_alert 4.40 4.41 tone bass to greus treble aguts 4.43 4.44 enter tone menu set bass level (+ number, range, etc.) set treble level (+ number, range, etc.) bass boost access equalizer bass-boost equalizer potenciar_greus equalitzador 4.45 output to loudspeakers (+on/off) loudspeaker Sounds and signal control sons melodia / to_de_trucada vibració Tone control 4.42 Output mode altaveu 4.46 4.47 4.48 4.49 output to headphones (+on/off) data source data target switch to hands-free (muting head-set) headphones line-in line-out hands-free 4.50 playback play | playback 4.51 4.52 4.53 4.54 4.55 play at slower speed play at faster speed play in shuffle mode change side of tape eject medium slow fast shuffle | random_play reverse eject 4.56 create hardcopy of ticket, map, time table, etc… print 4.57 Read (aloud) a file/item read auriculars entrada_de_so sortida_de_so mans_lliures Play play | reproduir reproducció lent ràpid selecció_aleatòria canviar_de_cara | girar expulsar Print imprimir Read llegir Messaging browsing and internet General message functions 5.1 5.2 5.3 5.4 5.5 5.6 5.7 5.8 5.9 enter message mode make a connection answer the call/message answer to everyone in a list forward message or call (+ name, number etc.) send message receive a message or broadcast or phone call declare message as urgent call someone back message connect answer | reply answer_all | reply_all forward | transfer | divert missatge connectar respondre respondre_a_tots reenviar | transferir | desviar send receive enviar rebre urgent callback urgent tornar-a-trucar enter voice memo program invoke vacation mode; vacation notification memo | voice_memo vacation Voice Memo 5.10 5.11 set memòria | memòria de veu vacances | Email 5.12 5.13 5.14 5.15 5.16 send/receive email enter mailbox get an item, e.g. get an e-mail from a server go to Inbox ask for information on mail header email mailbox get | receive correu_electrònic bústia rebre inbox header entrada capçalera | encapçalament enter fax menu program fax Fax 5.17 fax SMS 5.18 enter SMS menu program SMS SMS DTMF 5.19 activate transfer by DTMF DTMF DTMF Infrared/Bluetooth messaging 5.20 5.21 transfer e.g. the electronic business card via infrared interface set / edit the electronic business card beam | transfer enviar_per_infraroig transferir business_card targeta_de_visita enter/leave internet program activate/deactivate internet browser set bookmark follow hyperlink address hyperlink internet browser internet navegador bookmark follow hyperlink | link incloure_a_favorits seguir hiperenllaç | enllaç Internet browsing 5.22 5.23 5.24 5.25 5.26 TELEPHONE Dialing options 5.27 5.28 5.29 5.30 5.31 5.32 5.33 a telephone call (noun) place a call (verb) place a call (verb) call last number again place a call by saying a name place a call digit by digit initiate call by its short number call dial call redial name_dialing digit_dialing speed_dialing trucada marcar trucar remarcar trucada_per_nom trucada_per_número marcació ràpida | 5.34 5.35 5.36 5.37 5.38 choose an item from a finite list set a call-back reminder make a conference phone call use prepaid operation list recent calls, incoming and outgoing choose reminder conference prepaid call-list seleccionar recordatori conferència prepagament llista_de_trucades 5.39 5.40 5.41 emergency call(s) call customer care transfer call to human operator emergency customer_care | support operator 5.42 activate extension mode; e.g. “train new extension: ‘boat’” which can then be used for setting up a call by saying “call Anton boat” connect to extension; i.e. make a call to the ‘car phone’ of the selected name connect to extension; i.e. call the person “at home” connect to extension; i.e. call a person’s mobile number connect to extension; i.e. call a person’s office number make a call to the ‘pager’ for the selected name connect to extension; call the person’s assistant make a call to an unspecified other number of the selected name connect to extension; call the person’s secretary connect to extension; call a person “at work” use an international phone extension extensió car cotxe home | private casa | privat mobile mòbil office oficina pager busca assistant assistent other altres secretary secretari/ària work feina international internacional Shortcuts emergència atenció_al_client operadora Extensions & User profiles 5.43 5.44 5.45 5.46 5.47 5.48 5.49 5.50 5.51 5.52 5.53 5.54 5.55 5.56 5.57 5.58 number extension activate/change user profile setting (e.g. from normal to meeting) activate user profile activate user profile activate user profile activate user profile activate user profile profile perfil home outdoors city hands-free meeting casa exterior ciutat mans_lliures reunió Call handling 5.59 recording a greeting for answering to the incoming call hang-up the phone put the call on hold accept call refuse call greeting salutació hang-up hold accept_call reject_call | refuse_call | busy missed_calls received_calls duration 5.67 list missed calls list received calls getting the time length of the call go to voice mail penjar en_espera acceptar_trucada refusar_trucada rebutjar_trucada / ocupat trucades_perdudes trucades_rebudes durada 5.68 5.69 5.70 ask for arrival time ask for departure time make a reservation arrival departure book 5.60 5.61 5.62 5.63 5.64 5.65 5.66 voice_mail answering_machine | bústia_de_veu / contestador Making reservations arribada sortida reservar Organiser functions Organiser 6.1 6.2 6.3 6.4 invoke calendar application display today's tasks, appointments etc. display tomorrow's tasks, appointments etc. display calendar of week + number calendar today calendari avui tomorrow demà week setmana / 6.5 month mes 6.6 6.7 6.8 6.9 6.10 6.11 6.12 6.13 display calendar of month + month or number display calendar of week + year call agenda function set appointment date enter tasks menu display all birthdays display date(s) of delivery go to contacts remind of task, appointment etc. year agenda appointment | meeting tasks birthday schedule contacts reminder any agenda cita tasques aniversari horari contactes recordatori 6.14 6.15 6.16 6.17 6.18 activate accessories menu activate calculator activate word processor activate notes editor mode name: games accessories | utilities calculator text-editor notes games 6.19 6.20 6.21 go to phonebook operations go to address book operations activate professional address or number list activate private address or number list add name to address book add address (company, street, P.O. box, city, zip code, state, country) to address book add address categories address category address category address category address category address category add phone number(s) to address book add fax number(s) to address phone-book address-book business agenda_telefònica adreces professional private privat name address nom adreça company street city zip-code province/state country phone_number empresa carrer ciutat codi_postal província/estat país número_de_telèfon fax_number número_de_fax Accessories accesoris calculadora editor_de_textos notes jocs Address book 6.22 6.23 6.24 6.25 6.26 6.27 6.28 6.29 6.30 6.31 6.32 6.33 book add email address(es) to address book email_address adreça_de_correu_electròni c /adreça c/e Routing Positioning 7.1 7.2 7.3 enter/state the actual position show actual position's map enter GPS (global positioning system) menu locate | position map GPS ubicació / posició mapa GPS 7.4 7.5 7.6 acoustic guidance on/off activate navigation program directly guide to pre-stored destination suggest route suggest fastest route suggest shortest route suggest alternative route calculate a new route show distance to destination suggest/book/guide to a hotel/accommodation suggest/guide to a car lot suggest and/or provide info about rest areas propose a gas/service station suggest route with points of interest calculate a route avoiding traffic jams suggest points of interest acoustic_guidance guide | navigate guide_to guia_acústica guiar | navegar guiar_a route fastest_route shortest_route alternative_route new_route distance hotel ruta ruta_més_ràpida ruta_més_curta ruta_alternativa nova_ruta distància hotel parking rest_area aparcament àrea_de_descans service_station sightseeing estació_de_servei miradors traffic_jam cua Journey planning 7.7 7.8 7.9 7.10 7.11 7.12 7.13 7.14 7.15 7.16 7.17 7.18 7.19 punts_d’interès points_of_interest Enter landmarks 7.20 7.21 7.22 7.23 enter start point enter destination enter airport enter border-point start_point destination airport border punt_de_sortida destinació aeroport frontera 7.24 7.25 7.26 7.27 7.28 7.29 7.30 7.31 7.32 7.33 7.34 enter car rental station enter city centre enter name of a crossing street enter place of exhibition enter ferry crossing point enter car service (garage) enter gas / Petrol station enter hospital enter railway station enter restaurant Get general traffic information car_rental centre crossing | junction_point fair | trade_show | exhibition ferry garage gas_station hospital railway_station restaurant traffic_information lloguer_de_cotxes centre cruïlla | intersecció fira / exposició comercial transbordador taller_mecànic benzinera hospital estació_de_tren restaurant informació_del_trànsit 7.35 7.36 7.37 7.38 toggle to pictogram mode move east from current position move west from current position move north from current position move south from current position pictogram east west north pictograma est oest nord south sud Maps 7.39 Automotive General 8.1 recall driver settings recall_settings memoritzar_configuració Cabin Control 8.2 air-conditioning aire_condicionat heater re-circulation ventilation calefacció recirculació ventilació 8.6 8.7 8.8 enter or leave air-conditioning menu activate or deactivate heater air re-circulation choose air flow/fan/ventilation/blower enter seat menu set cabin temperature open or close window seat temperature window seient temperatura finestra 8.9 8.10 8.11 enter auto PC menu activate or deactivate defroster give status of fuel PC defroster fuel | gas 8.3 8.4 8.5 Vehicle control PC descongelació combustible | benzina 8.12 give status of maintenance maintenance manteniment Audio & video Sound control 9.1 9.2 select surround level set balance level surround balance 9.3 9.4 activate and deactivate videotext switch subtitles on and off videotext subtitles 9.5 select next memory position slot | position so_envolvent balanç Videotext videotext subtítols Programming channels posició | pista Connectivity 9.6 9.7 9.8 9.9 input from terrestrial antenna input from satellite dish input from cable network select auxiliary input number antenna satellite cable AUX antena satèl·lit cable AUX Electronic Program Guide + Radio Data system 9.10 9.11 9.12 enter EPG menu enter RDS menu show EPG/RDS information on artist show EPG information on program title show EPG information on director of the program show EPG information on actor name EPG RDS artist EPG RDS artista title títol director director actor actor/actriu 9.16 9.17 9.18 9.19 9.20 9.21 store with normal capacity store with extended capacity media for information storage media for information storage media for information storage media for information storage normal longplay mini-disk diskette | floppy_disc memory disc 9.22 rewind medium in the device rewind 9.13 9.14 9.15 Record normal larga duración minidisc disquet memòria disc Winding and search commands enrere 9.23 9.24 9.25 9.26 wind medium in the device object: index mark object: counter activate changer fast_forward index counter changer 9.27 Select noise reduction system noise_reduction endavant índex comptador carregador Noise reduction system reducció_de_so Portable devices options 9.28 9.29 set portable mode set battery charge on carrying_mode battery_charge mode_de_transport càrrega_de_bateria 9.30 show remaining capacity of named medium (tape, MD, CD, etc.) capacity 9.31 record still picture instead of movie audio-video track insert sound only insert video only still_picture imatge_congelada AV sound video AV so vídeo select DAB (digital audio broadcasting) ask for stored traffic information messages (TiM) D.A.B. D.A.B. traffic_messages missatges_de_trànsit select news channel select actual matters select sport channel select education channel select drama program select culture channel select science channel select pop music select rock music select easy music select classical music news current_affairs sport education drama culture science pop rock easy_listening classical Indications capacitat Video device/ camera 9.32 9.33 9.34 Radio 9.35 9.36 DAB categories 9.37 9.38 9.39 9.40 9.41 9.42 9.43 9.44 9.45 9.46 9.47 notícies actualitat esports educació teatre cultura ciència pop rock relaxant clàssica 9.48 9.49 9.50 9.51 9.52 9.53 9.54 9.55 select jazz music select country music select national music select oldies select folk music choose religion program select travel channel select documentary channel jazz country national oldies folk religion travel documentary 9.56 9.57 9.58 9.59 9.60 9.61 9.62 9.63 9.64 9.65 9.66 9.67 9.68 9.69 9.70 9.71 9.72 9.73 9.74 9.75 9.76 9.77 9.78 9.79 9.80 9.81 9.82 select movie channel select detective movie select thriller movie select adventure program select western movie select war movie select science fiction channel select fantasy movie select horror film select comedy channel select soap opera select melodrama movie select romance movie select serious movie select historical program select discussion program select game show select variety show select talk show select football match select tennis match select basketball match select athletics show select motor sport select water sport select winter sport select martial sport movie detective thriller adventure western war science_fiction fantasy horror comedy soap melodrama romance serious historical discussion game_show | quiz | contest variety_show talk_show football | soccer tennis basketball athletics motor_sport water_sport winter_sports martial_sports jazz country nacional cançons_del_passat folclore religió viatges documentals DVB categories pel·lícula de_detectius thriller aventura de_l’oest | de_vaquers bèl·lica ciència_ficció fantasia por comèdia fulletó melodrama amor seriosa històrica debat concurs programa_de_varietats tertúlia futbol tenis basquet atletisme esports_de_motor esports_d’aigua esports_d’hivern arts_marcials 9.83 9.84 9.85 9.86 9.87 select cartoons channel select music select culture program select fashion show select economy program cartoons music arts | culture fashion economics | financial_services 9.88 select program on nature nature | animals | environment 9.89 9.90 9.91 9.92 9.93 9.94 select fitness show select hobby channel select cooking program select advertisement show select gardening show select original language of the movie select live event medicine | health | fitness hobby cooking advertisement | shopping gardening original_language dibuixos_animats música art | cultura moda economia | serveis_financers natura | animals | medi ambient medicina | salut | fitness aficions cuina anuncis / compres jardineria versió_original live directe 9.95 ID Semantic description English word Code Word 1.1 1.2 1.3 1.4 activate a device deactivate a device put device into standby mode awaken device from standby mode on off standby wake_up 101 102 103 104 encendre apagar standby activar 1.5 1.6 1.7 1.8 1.9 1.10 1.11 1.12 1.13 1.14 1.15 cancel current operation indicate that you want to change password gain access to a protected device/mode activate keylock leave a protected mode activate speech recognition mode lock current settings, device etc. unlock current settings, device, etc. finalize an input (as <CR>) quit a running application to adjust settings cancel password register | login keylock logout voice_control lock unlock enter exit | quit set 105 106 107 108 109 110 111 112 113 114 115 cancel·lar contrasenya registre bloqueig sortir control_de_veu bloquejar desbloquejar introduir sortir configurar Code Synonym 1 Y01 inactivar Y02 login Code Synonym 2 1.16 1.17 1.18 1.19 device name: radio device name: camcorder device name: camera device name: cassette radio camcorder camera cassette 116 117 118 119 ràdio videocàmera càmera casset 1.20 1.21 1.22 1.23 1.24 1.25 1.26 1.27 1.28 1.29 device name: CD player device name: CD writer device name: DAT player/recorder device name: DVD player/writer device name: HiFi system device name: MiniDisk player/recorder device name: MP3 player device name: microphone device name: television device name: mobile telephone, cellular phone device name: PDA device name: set-top box device name: video device name: computer device name: phone start service, device, function start again end service, function CD CD-R DAT DVD HiFi MD MP3 microphone TV mobile | cellular PDA set-top_box video computer phone start | activate restart abort | end | stop pause continue open close confirm help information assistant 120 121 122 123 124 125 126 127 128 129 CD CD-R DAT DVD HiFi MD MP3 micròfon TV mòbil 130 131 132 133 134 135 136 137 PDA descodificador vídeo ordinador telèfon iniciar reiniciar anul·lar 138 139 140 141 142 143 144 145 pausa continuar obrir tancar confirmar ajuda informació assistent verify check 146 147 verificar comprovar 1.30 1.31 1.32 1.33 1.34 1.35 1.36 1.37 1.38 1.39 1.40 1.41 1.42 1.43 1.44 1.45 1.46 1.47 temporarily interrupt the current function resume the current function open an object (file/item/window) close an object (file/item /window) confirm an operation activate help mode ask for information activate a SW assistant or contact a human assistant verify an operation check if any event has happened, e.g. check mail-box Y03 activar Y04 finalitzar Y46 parar 1.48 switch on or off or functions/menu items, etc. 1.49 greeting the system (e.g. saying Hello enables the system to define English as dialogue language) list command & control vocabulary start a demo name of the current language (e.g. for German recordings select word Deutsch) tell the recognizer to learn a new word enter spelling mode enter digit mode to record a command or message address timer function; activate/program timer function address clock function; activate clock address time function; announce/display/set time address date function; announce/display/set date address alarm function; activate alarm function activate device or function daily at a preset time activate device or function weekly at a preset time activate device or function monthly at a preset time go to an earlier time go to a later time 1.50 1.51 1.52 1.53 1.54 1.55 1.56 1.57 1.58 1.59 1.60 1.61 1.62 1.63 1.64 1.65 1.66 1.67 1.68 1.69 1.70 1.71 between, e.g. program advanced features or options, or enter sub-menu to do this select automatic mode select manual mode reset to the system to default save current item, configuration, etc. switch switch_on switch_off hello # # 148 canviar 149 hola vocabulary demo <language> 150 151 152 vocabulari demostració <idioma> learn | train spell digit record timer 153 154 155 156 157 aprendre gramàtica dígit gravar temporitzador clock time 158 159 rellotge hora date 160 data alarm 161 alarma daily 162 diari weekly 163 setmanal monthly 164 mensual earlier later 165 166 abans després program 167 programa automatic manual reset save | store 168 169 170 171 automàtic manual reiniciar memoritzar Y05 connectar Y06 ensenyar Y07 guardar Y47 desconnectar 1.72 1.73 1.74 1.75 1.76 1.77 initiate recording/activation of a macro enter settings menue; select settings functions noun, activate device/system control show the status of the system show battery status display signal strength 1.78 1.79 1.80 1.81 1.82 1.83 1.84 1.85 activate checking procedure connect to network infrared interface select bluetooth operation address (internet) host go to network function synchronize data with PC entries try again to activate/redo the last (unsuccessful) action (command, query, ...) again 2.1 2.2 go to a menu enter directory (e.g. of files) or list directory entries show options from a menu, etc. go to a sub-menu, i.e. give more details to show an inventory of items go to the beginning of a dialogue or to toplevel menu noun or adverb, go to the beginning of list noun or adverb, go to the beginning of list noun or adverb, go to the beginning of list noun or adverb, go to the beginning of list go to end of list go to end of list go to end of list go up one item go up one item go down one item go down one item 2.3 2.4 2.5 2.6 2.7 2.8 2.9 2.10 2.11 2.12 2.13 2.14 2.15 2.16 2.17 macro settings 172 173 macro opcions control status battery signal_strengt h self-test network infrared bluetooth host network synchronize redo | retry 174 175 176 177 control estat bateria força_de_senyal 178 179 180 181 182 183 184 185 autoavaluació xarxa infraroig bluetooth hoste xarxa sincronitzar refer menu directory 201 202 menú directori options detail list home 203 204 205 206 opcions detalls llista principal start top front first bottom end last up plus down minus 207 208 209 210 211 212 213 214 215 216 217 principi inici amunt primer final fi últim amunt més baixar menys Y08 reintentar Y09 primera 2.18 2.19 2.20 2.21 2.22 2.23 2.24 2.25 2.26 2.27 2.28 2.29 2.30 go left from present position go right from present position go forward one item in the list go forward one item in the list go back one item in the list go back one item in the list color: blue color: red color: yellow color: green color: black color: white search for an item 2.31 3.3 3.4 3.5 ask the system to suggest something (e.g. hotel or a route) browse through a list conduct a fuzzy search based on written or spoken input (e.g. Krzysztof sounds like Kschischtof, reads like Christof) go to item, identified by name, number etc. go to row go to column go to page (+ number) select menu/item select all items skip a marked item create a new item object of operation: entry; e.g. entry in a list or directory input an item dictate text change an entry 3.6 3.7 3.8 3.9 correct an entry copy (to clipboard) cut from chosen source to chosen destination paste an entry 2.32 2.33 2.34 2.35 2.36 2.37 2.38 2.39 2.40 3.1 3.2 left right next forward previous back blue red yellow green black white search | lookup | find suggest 218 219 220 221 222 223 224 225 226 227 228 229 230 esquerra dreta següent avançar anterior enrere blau vermell groc verd negre blanc buscar 231 suggerir browse like 232 233 examinar com go_to row column page select | choose all skip | ignore new entry 234 235 236 237 238 239 240 301 302 anar_a fila columna pàgina seleccionar tots ometre nou entrada enter dictate modify change correct copy cut paste 303 304 305 introduir dictar modificar 306 307 308 309 corregir copiar retallar enganxar | Y10 cercar Y11 escollir Y12 ignorar Y13 canviar 3.10 3.11 3.12 3.13 insert an entry or item add an entry or an item create an entry or an item delete or remove an entry/item 3.14 3.15 3.16 3.17 3.18 3.19 3.20 highlight points of interest sort elements by name, date, … undo last operation repeat last function replace an item accept current status/settings or confirm correctness edit a text, business card, entry, etc. 3.21 3.22 4.1 4.2 edit/delete/go to number xxx politely ask for something personal picture and/or sound set settings to normal/neutral 4.3 4.4 4.5 4.6 4.7 4.8 4.9 4.10 4.11 reach the top value of a certain range reach the smallest value of a certain range set level (+ number, range etc.) set level high set level low set level medium increase or get more decrease or get less select channel/program (+ number or name) 4.12 4.13 4.14 display results, file etc. enlarge window to full-scale hide window on a display 4.15 4.16 4.17 clear display zoom in zoom out insert add create delete | remove |erase mark sort undo repeat replace okay | OK 310 311 312 313 insertar afegir crear esborrar 314 315 316 317 318 319 marcar classificar desfer repetir substituir acceptar edit 320 editar number please profile neutral | normal | default maximum minimum level high low medium more less channel | program show | display maximize hide | minimize clear zoom_in zoom_out 321 322 401 402 Y14 eliminar número si_us_plau perfil neutre Y15 normal 403 404 405 406 407 408 409 410 411 màxim mínim nivell alt baix mitjà més menys canal Y16 programa 412 413 414 mostrar maximitzar ocultar Y17 amagar 415 416 417 esborrar apropar allunyar Y48 per defecte Y49 minimitzar 4.18 determine format or scale of displayed object 4.19 4.20 4.21 noun, e.g. scale of the map e.g. of an image in pixels refresh current display 4.22 4.23 4.24 4.25 4.26 4.27 4.28 4.29 4.30 4.31 4.32 4.33 4.34 4.35 4.36 4.37 4.38 hold picture initiate control of ‘light’ function enter brightness menu set picture more bright set picture less bright enter and manipulate color balance menu enter contrast menu set contrast to be sharper set contrast to be softer enter volume menu increase volume decrease volume turn sound off turn sound on again let volume fade choose sounds select melody/ring-tone function 4.39 vibrator signal 4.40 4.41 4.42 4.43 4.44 4.45 4.46 4.47 4.48 4.49 4.50 enter tone menu set bass level (+ number, range, etc.) set treble level (+ number, range, etc.) bass boost access equalizer output to loudspeakers (+on/off) output to headphones (+on/off) data source data target switch to hands-free (muting head-set) playback format | picture_size scale resolution update | refresh freeze light brightness brighter darker color contrast sharper softer volume louder quieter mute | silent unmute fade sounds melody | ring_tone vibrating_aler t tone bass treble bass-boost equalizer loudspeaker headphones line-in line-out hands-free play | playback 418 format 419 420 421 escala resolució actualitzar 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 congelar llum lluminositat més_brillant més_fosc color contrast més_contrast menys_contrast volum més alt més baix silenci anul·lar_silenci esmorteir_so sons melodia 439 vibració 440 441 442 443 444 445 446 447 448 449 450 to greus aguts potenciar_greus equalitzador altaveu auriculars entrada_de_so sortida_de_so mans_lliures play Y18 tamany d’imatge Y19 to de trucada Y20 reproduir Y50 reproducció 4.51 4.52 4.53 play at slower speed play at faster speed play in shuffle mode 4.54 4.55 4.56 change side of tape eject medium create hardcopy of ticket, map, time table, etc… Read (aloud) a file/item enter message mode make a connection answer the call/message answer to everyone in a list 4.57 5.1 5.2 5.3 5.4 5.5 forward message or call (+ name, number etc.) 5.6 5.7 5.8 5.9 5.10 send message receive a message or broadcast or phone call declare message as urgent call someone back enter voice memo program 5.11 invoke vacation mode; set vacation notification send/receive email enter mailbox get an item, e.g. get an e-mail from a server go to Inbox ask for information on mail header enter fax menu program enter SMS menu program activate transfer by DTMF transfer e.g. the electronic business card via infrared interface set / edit the electronic business card enter/leave internet program activate/deactivate internet browser 5.12 5.13 5.14 5.15 5.16 5.17 5.18 5.19 5.20 5.21 5.22 5.23 slow fast shuffle | random_play reverse eject print 451 452 453 lent ràpid selecció_aleatòria 454 455 456 canviar_de_cara expulsar imprimir read message connect answer | reply answer_all | reply_all forward | transfer | divert send receive urgent callback memo | voice_memo vacation 457 501 502 503 504 llegir missatge connectar respondre respondre_a_tots 505 email mailbox get | receive inbox header fax SMS DTMF beam | transfer business_card internet browser Y21 girar reenviar Y22 transferir 506 507 508 509 510 enviar rebre urgent tornar-a-trucar memòria Y23 memòria de veu 511 vacances 512 513 514 515 516 517 518 519 520 correu_electrònic bústia rebre entrada capçalera fax SMS DTMF enviar_per_infraroig Y24 encapçalament Y25 transferir 521 522 523 targeta_de_visita internet navegador Y51 desviar 5.24 5.25 5.26 set bookmark follow hyperlink address hyperlink 5.27 5.28 5.29 5.30 5.31 5.32 5.33 5.34 5.35 5.36 5.37 5.38 5.39 5.40 a telephone call (noun) place a call (verb) place a call (verb) call last number again place a call by saying a name place a call digit by digit initiate call by its short number choose an item from a finite list set a call-back reminder make a conference phone call use prepaid operation list recent calls, incoming and outgoing emergency call(s) call customer care 5.41 5.42 transfer call to human operator activate extension mode; e.g. “train new extension: ‘boat’” which can then be used for setting up a call by saying “call Anton boat” connect to extension; i.e. make a call to the ‘car phone’ of the selected name connect to extension; i.e. call the person “at home” connect to extension; i.e. call a person’s mobile number connect to extension; i.e. call a person’s office number make a call to the ‘pager’ for the selected name connect to extension; call the person’s assistant make a call to an unspecified other number of the selected name 5.43 5.44 5.45 5.46 5.47 5.48 5.49 bookmark follow hyperlink | link call dial call redial name_dialing digit_dialing speed_dialing choose reminder conference prepaid call-list emergency customer_care | support operator extension 524 525 526 incloure_a_favorits seguir hiperenllaç 527 528 529 530 531 532 533 534 535 536 537 538 539 540 trucada marcar trucar remarcar trucada_per_nom trucada_per_número marcació ràpida seleccionar recordatori conferència prepagament llista_de_trucades emergència atenció_al_client 541 542 operadora extensió car 543 cotxe home | private 544 casa mobile 545 mòbil office 546 oficina pager 547 busca assistant 548 assistent other 549 altres Y26 enllaç Y27 privat 5.60 5.61 5.62 5.63 connect to extension; call the person’s secretary connect to extension; call a person “at work” use an international phone number extension activate/change user profile setting (e.g. from normal to meeting) activate user profile activate user profile activate user profile activate user profile activate user profile recording a greeting for answering to the incoming call hang-up the phone put the call on hold accept call refuse call 5.64 5.65 5.66 5.67 list missed calls list received calls getting the time length of the call go to voice mail 5.68 5.69 5.70 6.1 6.2 6.3 6.4 6.5 ask for arrival time ask for departure time make a reservation invoke calendar application display today's tasks, appointments etc. display tomorrow's tasks, appointments etc. display calendar of week + number display calendar of month + month or number display calendar of week + year call agenda function set appointment date 5.50 5.51 5.52 5.53 5.54 5.55 5.56 5.57 5.58 5.59 6.6 6.7 6.8 secretary 550 secretari Y28 secretària work international profile 551 552 553 feina internacional perfil home outdoors city hands-free meeting greeting 554 555 556 557 558 559 casa exterior ciutat mans_lliures reunió salutació hang-up hold accept_call reject_call | refuse_call | busy missed_calls received_calls duration voice_mail | answering_ma chine arrival departure book calendar today tomorrow week month 560 561 562 563 penjar en_espera acceptar_trucada refusar_trucada Y29 rebutjar_trucada 564 565 566 567 trucades_perdudes trucades_rebudes durada bústia_de_veu Y30 contestador 568 569 570 601 602 603 604 605 arribada sortida reservar calendari avui demà setmana mes year agenda appointment | meeting 606 607 608 any agenda cita Y52 ocupat 6.9 6.10 6.11 6.12 6.13 6.14 enter tasks menu display all birthdays display date(s) of delivery go to contacts remind of task, appointment etc. activate accessories menu 6.15 6.16 6.17 6.18 6.19 6.20 6.21 6.22 6.23 6.24 6.25 6.26 6.27 6.28 6.29 6.30 6.31 6.32 6.33 activate calculator activate word processor activate notes editor mode name: games go to phonebook operations go to address book operations activate professional address or number list activate private address or number list add name to address book add address (company, street, P.O. box, city, zip code, state, country) to address book add address categories address category address category address category address category address category add phone number(s) to address book add fax number(s) to address book add email address(es) to address book 7.1 enter/state the actual position 7.2 7.3 7.4 show actual position's map enter GPS (global positioning system) menu acoustic guidance on/off 7.5 activate navigation program 7.6 7.7 directly guide to pre-stored destination suggest route tasks birthday schedule contacts reminder accessories | utilities calculator text-editor notes games phone-book address-book business private name address 609 610 611 612 613 614 tasques aniversari horari contactes recordatori accesoris 615 616 617 618 619 620 621 622 623 624 calculadora editor_de_textos notes jocs agenda_telefònica adreces professional privat nom adreça company street city zip-code province/state country phone_number fax_number email_address 625 626 627 628 629 630 631 632 633 locate | position map GPS acoustic_guid ance guide | navigate guide_to route 701 empresa carrer ciutat codi_postal província país número_de_telèfon número_de_fax adreça_de_correu_electr ònic ubicació 702 703 704 mapa GPS guia_acústica 705 guiar 706 707 guiar_a ruta Y31 estat Y32 adreça_c/e Y33 posició Y34 navegar 7.8 7.9 7.10 suggest fastest route suggest shortest route suggest alternative route 7.11 7.12 7.13 7.14 7.15 7.16 calculate a new route show distance to destination suggest/book/guide to a hotel/accommodation suggest/guide to a car lot suggest and/or provide info about rest areas propose a gas/service station 7.17 7.18 7.19 suggest route with points of interest calculate a route avoiding traffic jams suggest points of interest 7.20 7.21 7.22 7.23 7.24 7.25 7.26 enter start point enter destination enter airport enter border-point enter car rental station enter city centre enter name of a crossing street 7.27 enter place of exhibition 7.28 7.29 7.30 7.31 7.32 enter ferry crossing point enter car service (garage) enter gas / Petrol station enter hospital enter railway station 7.33 7.34 enter restaurant Get general traffic information 7.35 7.36 toggle to pictogram mode move east from current position fastest_route shortest_route alternative_ro ute new_route distance hotel 708 709 710 ruta_més_ràpida ruta_més_curta ruta_alternativa 711 712 713 nova_ruta distància hotel parking rest_area service_statio n sightseeing traffic_jam points_of_inte rest start_point destination airport border car_rental centre crossing | junction_point fair | trade_show | exhibition ferry garage gas_station hospital railway_statio n restaurant traffic_inform ation pictogram east 714 715 716 aparcament àrea_de_descans estació_de_servei 717 718 719 miradors cua punts_d’interès 720 721 722 723 724 725 726 punt_de_sortida destinació aeroport frontera lloguer_de_cotxes centre cruïlla Y35 intersecció 727 fira Y36 exposició comercial 728 729 730 731 732 transbordador taller_mecànic benzinera hospital estació_de_tren 733 734 restaurant informació_del_trànsit 735 736 pictograma est 7.37 7.38 7.39 8.1 8.2 move west from current position move north from current position move south from current position recall driver settings enter or leave air-conditioning menu 8.3 8.4 8.5 8.6 8.7 8.8 8.9 8.10 8.11 8.12 9.1 9.2 9.3 9.4 9.5 9.6 9.7 9.8 9.9 9.10 9.11 9.12 9.13 9.14 9.15 9.16 9.17 9.18 9.19 activate or deactivate heater air re-circulation choose air flow/fan/ventilation/blower enter seat menu set cabin temperature open or close window enter auto PC menu activate or deactivate defroster give status of fuel give status of maintenance select surround level set balance level activate and deactivate videotext switch subtitles on and off select next memory position input from terrestrial antenna input from satellite dish input from cable network select auxiliary input number enter EPG menu enter RDS menu show EPG/RDS information on artist show EPG information on program title show EPG information on director of the program show EPG information on actor name store with normal capacity store with extended capacity media for information storage media for information storage 9.20 media for information storage west north south recall_settings airconditioning heater re-circulation ventilation seat temperature window PC defroster fuel | gas maintenance surround balance videotext subtitles slot | position antenna satellite cable AUX EPG RDS artist title director 737 738 739 801 802 oest nord sud memoritzar_configuració aire_condicionat 803 804 805 806 807 808 809 810 811 812 901 902 903 904 905 906 907 908 909 910 911 912 913 914 calefacció recirculació ventilació seient temperatura finestra PC descongelació combustible manteniment so_envolvent balanç videotext subtítols posició antena satèl·lit cable AUX EPG RDS artista títol director actor normal longplay mini-disk diskette floppy_disc memory 915 916 917 918 919 actor normal larga duración minidisc disquet 920 memòria | Y37 benzina Y38 pista Y39 actriu 9.21 9.22 9.23 9.24 9.25 9.26 9.27 media for information storage rewind medium in the device wind medium in the device object: index mark object: counter activate changer Select noise reduction system 9.28 set portable mode 9.29 set battery charge on 9.30 9.37 9.38 show remaining capacity of named medium (tape, MD, CD, etc.) record still picture instead of movie audio-video track insert sound only insert video only select DAB (digital audio broadcasting) ask for stored traffic information messages (TiM) select news channel select actual matters 9.39 9.40 9.41 9.42 9.43 9.44 9.45 9.46 9.47 9.48 9.49 9.50 9.51 9.52 select sport channel select education channel select drama program select culture channel select science channel select pop music select rock music select easy music select classical music select jazz music select country music select national music select oldies select folk music 9.31 9.32 9.33 9.34 9.35 9.36 disc rewind fast_forward index counter changer noise_reductio n carrying_mod e battery_charg e capacity 921 922 923 924 925 926 927 disc enrere endavant índex comptador carregador reducció_de_so 928 mode_de_transport 929 càrrega_de_bateria 930 capacitat still_picture AV sound video D.A.B. traffic_messag es news current_affair s sport education drama culture science pop rock easy_listening classical jazz country national oldies folk 931 932 933 934 935 936 imatge_congelada AV so vídeo D.A.B. missatges_de_trànsit 937 938 notícies actualitat 939 940 941 942 943 944 945 946 947 948 949 950 951 952 esports educació teatre cultura ciència pop rock relaxant clàssica jazz country nacional cançons_del_passat folclore 9.53 9.54 9.55 9.56 9.57 9.58 9.59 9.60 9.61 9.62 9.63 9.64 9.65 9.66 9.67 9.68 9.69 9.70 9.71 9.72 choose religion program select travel channel select documentary channel select movie channel select detective movie select thriller movie select adventure program select western movie select war movie select science fiction channel select fantasy movie select horror film select comedy channel select soap opera select melodrama movie select romance movie select serious movie select historical program select discussion program select game show 9.73 9.74 9.75 select variety show select talk show select football match 9.76 9.77 9.78 9.79 9.80 9.81 9.82 9.83 9.84 9.85 9.86 select tennis match select basketball match select athletics show select motor sport select water sport select winter sport select martial sport select cartoons channel select music select culture program select fashion show religion travel documentary movie detective thriller adventure western war science_fiction fantasy horror comedy soap melodrama romance serious historical discussion game_show | quiz | contest variety_show talk_show football | soccer tennis basketball athletics motor_sport water_sport winter_sports martial_sports cartoons music arts | culture fashion 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 religió viatges documentals pel·lícula de_detectius thriller aventura de_l’oest bèl·lica ciència_ficció fantasia por comèdia fulletó melodrama amor seriosa històrica debat concurs 973 974 975 programa_de_varietats tertúlia futbol 976 977 978 979 980 981 982 983 984 985 986 tenis basquet atletisme esports_de_motor esports_d’aigua esports_d’hivern arts_marcials dibuixos_animats música art moda Y40 de_vaquers Y41 cultura 9.87 select economy program 9.88 select program on nature 9.89 select fitness show 9.90 9.91 9.92 select hobby channel select cooking program select advertisement show 9.93 9.94 select gardening show select original language of the movie 9.95 select live event economics | financial_servi ces nature | animals | environment medicine | health | fitness hobby cooking advertisement | shopping gardening original_langu age live 987 economia Y42 serveis_financers 988 natura Y43 animals Y53 medi ambient 989 medicina Y44 salut Y54 fitness 990 991 992 aficions cuina anuncis Y45 compres 993 994 jardineria versió_original 995 directe 11.4.1 List of synonyms and their codes. Code 103 107 135 137 148 153 171 185 210 230 238 240 305 313 402 Command standby registre iniciar anul·lar canviar aprendre memoritzar refer primer buscar seleccionar ometre modificar esborrar neutre Code Y01 Y02 Y03 Y04 Y05 Y06 Y07 Y08 Y09 Y10 Y11 Y12 Y13 Y14 Y15 Synonym 1 inactivar login activar finalitzar connectar ensenyar guardar reintentar primera cercar escollir ignorar canviar eliminar normal Code Synonym 2 Y46 Y47 parar desconnectar Y48 per_defecte 411 414 418 438 450 454 505 510 516 520 526 544 550 563 567 629 633 701 705 726 727 811 905 915 960 985 987 988 989 992 canal ocultar format melodia play canviar_de_cara reenviar memòria capçalera enviar_per_infraroig hiperenllaç casa secretari refusar_trucada bústia_de_veu província adreça_de_correu_electrònic ubicació guiar cruïlla fira combustible posició actor de_l’oest art economia natura medicina anuncis Y16 Y17 Y18 Y19 Y20 Y21 Y22 Y23 Y24 Y25 Y26 Y27 Y28 Y29 Y30 Y31 Y32 Y33 Y34 Y35 Y36 Y37 Y38 Y39 Y40 Y41 Y42 Y43 Y44 Y45 11.4.2 List of words with multiple occurrences. Word with multiple occurrence activar Reference 104, Y03 programa amagar tamany_d’image to_de_trucada reproduir girar transferir memòria_de_veu encapçalament transferir enllaç privat secretària rebutjar_trucada contestador estat adreça_c/e posició navegar intersecció exposició_comercial benzina pista actriu de_vaquers cultura serveis_financers animals salut compres Y49 minimitzar Y50 reproducció Y51 desviar Y52 ocupat Y53 Y54 medi_ambient fitness amunt assistent canviar casa ciutat connectar cultura enrere entrada esborrar estat introduir mòbil mans_lliures marcar més memòria menys normal opcions perfil posició privat programa rebre recordatori reiniciar seleccionar sortir transferir trucar vídeo xarxa 209, 214 145, 548 148, Y13 544, 554 556, 627 Y05, 502 942, Y41 223, 922 302, 515 313, 415 175, Y31 113, 303 129, 545 449, 557 314, 528 215, 409 510, 920 217, 410 Y15, 916 173, 203 401, 553 Y33, 905 Y27, 622 167, Y16 507, 514 535, 613 136, 170 238, 534 109, 114 Y22, Y25 509, 529 132, 934 179, 183