Chinese Lexical Categorization: Theory and Practice * Chia-Ling Hsieh National Chiao Tung University clhsieh@mail.nctu.edu.tw Abstract This paper attempts to settle the disputes of Chinese lexical categorization by advancing a prototype-based approach from a multi-criterion perspective. Given the lack of morphological and syntactic distinctions, Chinese parts of speech cannot easily be defined by making reference to formal characteristics. This has led to a long debate over what and how many criteria will suffice for the establishment of different categories. Some linguists argue for a mono-criterion treatment, which assigns words to particular classes on either semantic or syntactic grounds (eg. Zhu 1982; Ma 1987; Chen 1990; Hu 1996). However, a more widely accepted treatment is to manifest linguistic categorization in terms of multiple grammatical levels (eg. Tang 1990; Chang 1992; CKIP 1993; Wen 1993). Though the criteria and the importance of each differ among accounts, it is widely acknowledged that the criteria are to be tested out through binary features [+] and [–]. This results in categories with homogeneous members and clear-cut boundaries. Straightforward as it is, this traditional approach is problematic in that a large number of lexical items can neither be unambiguously assigned to a single category nor be subsumed under a separate one. Chinese auxiliaries (eg. ke-neng “may”, ying-gai “should”, ke-yi “can”) are a case in point. As illustrated by their semantic import and syntactic distribution, modal auxiliary verbs are found to be inconsistently represented. The same holds true as they are re-classified into verbs and adverbs along traditional lines. To solve such dilemma, this paper provides a broader conceptualization of multiple criteria than that of previous studies. In particular, we argue for the role of semantic, syntactic and pragmatic factors in determining the categorization of Chinese lexical items. On the assumption that linguistic classes constitute fuzzy sets (Zadeh 1987; Shi 1996), we propose the prototype-based category theory. In this theory, members of each category are characterized as ranging across a spectrum from prototype to non-prototype. Differences among categories are thus relative and continuous, with transition features forming fuzzy adjacencies. In order to substantiate our hypothesis, we show that only multiple criteria can capture the intermediate behaviors of Chinese modal verbs and modal adverbs. The overlapping nature of these two sets lends strong empirical support to the prototype-based theory. An indication of this includes the inconsistent legitimacy of modal expressions being negated by bu “not” (eg. bu zen-wei “do not think” vs. *bu-xiang “do not think”) or standing alone as answers to yes-no questions (eg. que-ding “be sure” vs. *cai “guess”). Our analysis also sheds light on the identification and division of lexical classes by offering a universal basis for the theory and practice of linguistic categorization. * This paper was presented at The First Conference on Teaching of Languages, Linguistics, and Literature. May 1, 2004. Kaohsiung, Taiwan: English Department, National Kaohsiung Normal University. (NSC 92-2411-H-009-017) Reference Chang, Liensheng (張連生). 1992. Cixing queding de yuanze he fangfa. (Principles and approaches to the establishment of parts of speech). Dongbei Shida Xuebao (Journal of Dongbei Normal University) 138: 73-78. Chen, Gonghuan. (陳功煥) 1990. Yong yuyi gongneng jiangou hanyu yufa tixi de changshi (Attempt of constructing Chinese grammatical systems with semantic functions). Gannan Shifan Xueyuan Xuebao (Journal of Gannan Normal College) 5:53-60. CKIP (詞庫小組). 1993. Zongwen Cilei Fenxi (Analysis on Chinese Syntactic Categories). 3rd ed. Taipei: CKIP, Academic Sinica. Hu, Mingyang (胡明揚). 1996. Xiandai hanyu cilei wenti kaozha (Investigation on lexical categories in modern Chinese). In Mingyang Hu (ed.), Cilei Wenti Kaozha (Investigation on Lexical Categories). pp.1-21. Beijing: Beijing Yuyan Xueyuan Publishing. Ma, Xiao (馬嘯). 1987. Hanyuci fenlei de yiyi biaozhenlun pingshu: jianlun fenlei de mudi (Comments on the semantic criteria theory of Chinese lexical categorization: on the aim of categorization). Yinchuan Shizhuang Xuebao (Journal of Yinchuan Normal College) 3: 36-47. Shi, Youwei (史有為). 1996. Cilei wenti de zhengjie ji qi duice: hanyu cilei rouxing chuli shitan (Cruxes and strategies to the issues of lexical categorization: investigation on the flexible approach of Chinese lexical categories). In Mingyang Hu (ed.), Cilei Wenti Kaozha (Investigation on Lexical Categories). pp.56-92. Beijing: Beijing Yuyan Xueyuan Publishing. Tang, Tingchi (湯廷池). 1990. Hanyu de cilei: huafen de yiju yu gongyong (Chinese lexical categories: the basis and function of division). In Junsheng Chang and Fongwen Su (eds.), Proceedings of the Third Conference of Computational Linguistics. pp.159-183. Hsinchu: National Tsing Hua University. Wen, Lien (文煉). 1993. Yu fenlei youguan de jige wenti (Some issues on categorization). Hanyu Xuexi (Chinese Learning) 75:1-3. Zadeh, Lotfi Asker. 1987. Fuzzy sets. In R. R Yager et al. (eds.), Fuzzy Sets and Applications: Selected Papers by L. A. Zadeh. pp.29-44. New York: John Wiley & Sons. Zhu, Dexi (朱德熙). 1982. Yufa Jiangyi (Lectures on Grammatical Theory). Beijing: Shangwu Publising.