Improving Text Readability through Segmentation into Rheses - l'unam - université nantes angers le mans Access content directly
Conference Papers Year : 2024

Improving Text Readability through Segmentation into Rheses

Antoine Jamelot
  • Function : Author
  • PersonId : 1380054
Sophie Hamon
  • Function : Author
  • PersonId : 1380055

Abstract

Enhancing text readability is crucial for readers with challenges like dyslexia. This paper delves into the segmentation of sentences into rheses, i.e. rhythmic and semantic units. Their aim is to clarify sentence structures for improved comprehension, through a harmonious balance between syntactic accuracy, the natural rhythm of reading aloud, and the delineation of meaningful units. This study relates and compares our various attempts to improve a pre-existing rhesis segmentation tool, which is based on the selection of candidate segmentations. We also release TeRheSe (Texts with Rhesis Segmentation), a bilingual dataset, segmented into rheses, comprising 12 books from classic literature in French and English. We evaluated our approaches on this dataset, showing the efficiency of a novel approach based on token classification, reaching a F1-score of 90.0% in English (previously 85.3%) and 91.3% in French (previously 88.0%). We also study the potential of leveraging prosodic elements, though its definitive impact remains inconclusive.
Fichier principal
Vignette du fichier
Article_Rhezor.pdf (432.53 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-04566523 , version 1 (02-05-2024)

Licence

Attribution

Identifiers

  • HAL Id : hal-04566523 , version 1

Cite

Antoine Jamelot, Solen Quiniou, Sophie Hamon. Improving Text Readability through Segmentation into Rheses. LREC-COLING 2024, May 2024, Turin, Italy. ⟨hal-04566523⟩
0 View
0 Download

Share

Gmail Facebook X LinkedIn More