The Benchmark of Paragraph and Sentence Extraction Summaries using Outlier Document Filtering based Multi -Document Summarizer


Turan M., Sonmez C., Ganiz M. C.

INFORMATION TECHNOLOGY AND CONTROL, cilt.43, sa.4, ss.433-439, 2014 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 43 Sayı: 4
  • Basım Tarihi: 2014
  • Doi Numarası: 10.5755/j01.itc.43.4.7010
  • Dergi Adı: INFORMATION TECHNOLOGY AND CONTROL
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.433-439
  • Marmara Üniversitesi Adresli: Hayır

Özet

We studied outlier document filtering (ODF) for extractive sentence summarization. Our results are superior compared to the average of the participant systems' using DUC 2006. Furthermore, we add extractive paragraph summarization to the same system. It is surprising that the results are nearly the same for ROUGE metrics. Although extractive paragraph summarization has a better performance for precision, extractive sentence summarization has a slightly better performance on the recall and F-Score which is the harmonic mean of recall and precision. The ODF is successful for both extractive sentence and paragraph summarization. The similarity metric (match percent) suggested in the article prevents the domination of longer sentences/paragraphs on shorter sentences/paragraphs in selection. As a result, the ODF provides the flexibility of paragraph extraction instead of sentence extraction for simplicity and readability and less work load.