Text Mining Applied to Distance Higher Education: A Systematic Literature Review

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Author(s): Patrícia Takaki (ORCID Patrícia Takaki (ORCID 0000-0003-4777-9683); Moisés Lima Dutra
  • Language:
    English
  • Source:
    Education and Information Technologies. 2024 29(9):10851-10878.
  • Publication Date:
    2024
  • Document Type:
    Journal Articles
    Information Analyses
  • Additional Information
    • Availability:
      Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: [email protected]; Web site: https://link.springer.com/
    • Peer Reviewed:
      Y
    • Source:
      28
    • Education Level:
      Higher Education
      Postsecondary Education
    • Subject Terms:
    • Accession Number:
      10.1007/s10639-023-12235-0
    • ISSN:
      1360-2357
      1573-7608
    • Abstract:
      Much of the data produced and consumed by students, teachers, and educational managers is in textual format. Text Mining (TM) and Natural Language Processing (NLP) have been applied in the educational context in different ways. Ideally, such applications combine computational, linguistic, pedagogical, and psychological aspects. This article aims to gather and analyze scientific publications that have applied TM and NLP techniques in textual corpora from distance-higher education through a Systematic Literature Review. Eight scientific databases were searched (ACM DL, Scopus, Web of Science, IEEE Xplore, ArXiv, SpringerLink, ScienceDirect, and ERIC), and publications from 2017 to 2021 were selected. 718 unique publications were screened to identify primary research capable of characterizing this scientific area. 52 resulting publications were fully analyzed, and some consolidated results include: 38% of works had the professors as end users, followed by students (27%) and managers (25%); the English language was present in 50% of publications, followed by the Portuguese language (13,5%) and others languages; the text mining tasks most used were text classification (27%), sentiment analysis (17%), information extraction (15%), chatbot (15%) and topic modeling (13%); LDA (Latent Dirichlet Analysis) was the technique most used (19%); the Python language was the programming language most prevalent (42%), and 54% of works do not mention any educational construct or theory. Thus, this article presents an unprecedented overview of the field of Educational Text Mining (ETM) in distance higher education and analyses the main results obtained, aiming for future research in the area.
    • Abstract:
      As Provided
    • Notes:
      https://docs.google.com/spreadsheets/d/160jzML7fSoI4JQs_GXaHNW9Srd7PJg2xsylsYDhm4jE/edit?pli=1&gid=0#gid=0
    • Publication Date:
      2024
    • Accession Number:
      EJ1430274