A Heuristic Approach for Telugu Text Summarization with Improved Sentence Ranking

Main Article Content

Kishore Kumar Mamidala et.al

Abstract

Extracting/abstracting the condensed form of original text document by retaining its information and complete meaning is known as text summarization. The creation of manual summaries from large text documents is difficult and time-consuming for humans. Text summarization has become an important and challenging area in natural language processing. This paper presents a heuristic appraoch to extract a summary of e-news articles of the Telugu language. The method proposes new lexical parameter-based information extraction (IE) rules for scoring the sentences. Event score and Named Entity Score is a novel part in sentence scoring to identify the essential information in the text. Depending on the frequency of occurrence of event/named entites in the sentence and document, sentences are selected for summary. Data is collected from online news sources (i.e., Eenadu, Sakshi,Andhra Jyothi, Namaste Telangana) to experiment. The proposed method is compared with other techniques developed for Telugu text summarization. Evaluation metrics like precision, recall, and F1 score is used to measure the proposed method's performance. An extensive statistical and qualitative evaluation of the system's summaries has been conducted using Recall-Oriented Understudy for Gisting Evaluation (ROUGE), a standard summary evaluation tool. The results showed improved performance compared to other methods.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Article Details

How to Cite
et.al, K. K. M. (2021). A Heuristic Approach for Telugu Text Summarization with Improved Sentence Ranking. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(3), 4238–4243. Retrieved from https://www.turcomat.org/index.php/turkbilmat/article/view/1714
Section
Research Articles