INTER AND INTRA CLUSTER ON SELF-ADAPTIVE DIFFERENTIAL EVOLUTION FOR MULTI-DOCUMENT SUMMARIZATION

  • Alifia Puspaningrum Institut Teknologi Sepuluh Nopember
  • Adhi Nurilham Institut Teknologi Sepuluh Nopember
  • Eva Firdayanti Bisono Institut Teknologi Sepuluh Nopember
  • Khoirul Umam Institut Teknologi Sepuluh Nopember
  • Agus Zainal Arifin Institut Teknologi Sepuluh Nopember
Keywords: differential evolution, inter-cluster analysis, intra-cluster analysis, multi-document, summarization

Abstract

Multi – document as one of summarization type has become more challenging issue than single-document because its larger space and its different content of each document. Hence, some of optimization algorithms consider some criteria in producing the best summary, such as relevancy, content coverage, and diversity. Those weighted criteria based on the assumption that the multi-documents are already located in the same cluster. However, in a certain condition, multi-documents consist of many categories and need to be considered too. In this paper, we propose an inter and intra cluster which consist of four weighted criteria functions (coherence, coverage, diversity, and inter-cluster analysis) to be optimized by using SaDE (Self Adaptive Differential Evolution) to get the best summary result. Therefore, the proposed method will deal not only with the value of compactness quality of the cluster within but also the separation of each cluster. Experimental results on Text Analysis Conference (TAC) 2008 datasets yields better summaries results with average ROUGE-1 on precision, recall, and f - measure 0.77, 0.07, and 0.12 compared to another method that only consider the analysis of intra-cluster.
Published
2018-06-29