DC FieldValueLanguage
dc.contributorDepartment of Chinese and Bilingual Studiesen_US
dc.contributorDepartment of Englishen_US
dc.creatorHou, Ren_US
dc.creatorHuang, CRen_US
dc.creatorAhrens, Ken_US
dc.creatorLee, YMSen_US
dc.date.accessioned2021-07-06T02:41:57Z-
dc.date.available2021-07-06T02:41:57Z-
dc.identifier.issn2055-7671en_US
dc.identifier.urihttp://hdl.handle.net/10397/90417-
dc.language.isoenen_US
dc.publisherOxford University Pressen_US
dc.titleLinguistic characteristics of Chinese register based on the Menzerath—Altmann law and text clusteringen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage54en_US
dc.identifier.epage66en_US
dc.identifier.volume35en_US
dc.identifier.issue1en_US
dc.identifier.doi10.1093/llc/fqz005en_US
dcterms.abstractThis article explores the linguistic features of different registers in Chinese through text clustering driven by the Menzerath–Altmann (MA) law. We propose to calculate the average word length distribution according to clause length. The MA law predicts that texts from different registers will show differences in terms of average word length distribution in texts. As predicted by the MA law, analysis result demonstrates that average word length decreases with the increase of clause length in each register and that their relationship can be fitted by the formula y = axbe−cx⁠. We hypothesize that it is the situation type, i.e. whether the text is dialectic or monologue, that is the linguistic characteristic behind the dichotomy of word length distribution. To confirm these register-distinguishing linguistic features, texts were represented by the average word length distribution and the fitted parameters using the vector space model and clustered according to their register categories. Good clustering results show that average word length distribution in certain length clauses and their fitted parameters can be used as the distinctive characteristics of these three registers.en_US
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationDigital scholarship in the humanities, Apr. 2020, v. 35, no. 1, p. 54-66en_US
dcterms.isPartOfDigital scholarship in the humanitiesen_US
dcterms.issued2020-04-
dc.identifier.eissn2055-768Xen_US
dc.description.validate202107 bcvcen_US
dc.description.oaNot applicableen_US
dc.identifier.FolderNumbera0947-n02-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextNational Social Science Fund in China (Grant Award Number: 16BYY110), Hong Kong GRF (Grant Number 156097-15H) and The Hong Kong Polytechnic University (Grant Number 4-ZZFE)en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2023.04.30en_US
Appears in Collections:Journal/Magazine Article
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.