[Annotating clause boundary labels to the written composition corpus of Japanese elementary and junior high school students].

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Transliterated Title:
      「児童・生徒作文コーパス」に対する節境界ラベル付与.
    • Source:
      Publisher: F1000 Research Ltd Country of Publication: England NLM ID: 101594320 Publication Model: eCollection Cited Medium: Internet ISSN: 2046-1402 (Electronic) Linking ISSN: 20461402 NLM ISO Abbreviation: F1000Res Subsets: MEDLINE
    • Publication Information:
      Original Publication: London : F1000 Research Ltd
    • Subject Terms:
    • Abstract:
      To evaluate the development of children's writing ability, it is necessary not only to examine quantitative indices such as the dependency distance, but also to inquiry the types of structures they use. We conducted clause boundary labeling using Support Vector Machine (SVM) on a corpus of Japanese students' compositions to investigate the change in the tendency of clause use with the progression of school age. The analysis of clause label frequency per sentence exhibited an increase in attributive clauses, nominal clauses, quotation clauses, and continuous clauses, and a decrease in parallel clauses, conditional clauses, reason clauses, time clauses, indirect interrogative clauses, and main clauses. The analysis of dependency distance demonstrated that most of the clauses that increased had short dependency distances, while most of the clauses that decreased had long dependency distances, and that the frequency of clauses with small dependency distances increased relatively with increasing school age. In addition, there was a shift in clause selection among functionally similar clauses, such as from "-te" to continuous forms, from "-tara" to "-ba", and from "-kedo" and "-keredo" to "-ga". These results suggest a change in the children's lexical and grammatical choices, from coordinate to subordinate structures, and from spoken to written vocabulary.
      Competing Interests: No competing interests were disclosed.
      (Copyright: © 2021 Imada M et al.)
    • Publication Date:
      Date Created: 20210920 Date Completed: 20211025 Latest Revision: 20231107
    • Publication Date:
      20240829
    • Accession Number:
      PMC8424459
    • Accession Number:
      10.12688/f1000research.40669.1
    • Accession Number:
      34540204