Skip to main content

Advertisement

Table 3 Information on the texts and the reference corpora

From: DEGREE OF INTENSITY in English-Chinese translation: a corpus-based approach

  Data used No. of word tokens Year
Wild Swans-ST All chapters 220,399 1991
Wild Swans-TT All chapters 199,493a 1992
Mao’s Last Dancer-ST Chapters 6–22 79,503 2003
Mao’s Last Dancer-TT Chapters 6–22 77,254a 2009
FLOB All categories 1,000,000 1991
Category G (Belles-lettres, biographies, essays) 178,764
LCMC All categories 1,000,000 1991 ± 2
Category G (biographies, essays) 129,746
  1. a As there are no spaces marking word boundaries in naturally-occurring Chinese texts, the word counts of the two TTs are estimated based on McEnery and Xiao’s (2003–2008) guideline of a ratio of 1:1.6 of words to characters. In comparison, LCMC is artificially segmented for word boundaries.