TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FRAKE: Fusional Real-time Automatic Keyword Extraction

FRAKE: Fusional Real-time Automatic Keyword Extraction

Aidin Zehtab-Salmasi, Mohammad-Reza Feizi-Derakhshi, Mohamad-Ali Balafar

2021-04-10Part-Of-Speech TaggingKeyword Extraction
PaperPDFCode(official)

Abstract

Keyword extraction is the process of identifying the words or phrases that express the main concepts of text to the best of one's ability. Electronic infrastructure creates a considerable amount of text every day and at all times. This massive volume of documents makes it practically impossible for human resources to study and manage them. Nevertheless, the need for these documents to be accessed efficiently and effectively is evident in numerous purposes. A blog, news article, or technical note is considered a relatively long text since the reader aims to learn the subject based on keywords or topics. Our approach consists of a combination of two models: graph centrality features and textural features. The proposed method has been used to extract the best keyword among the candidate keywords with an optimal combination of graph centralities, such as degree, betweenness, eigenvector, closeness centrality and etc, and textural, such as Casing, Term position, Term frequency normalization, Term different sentence, Part Of Speech tagging. There have also been attempts to distinguish keywords from candidate phrases and consider them on separate keywords. For evaluating the proposed method, seven datasets were used: Semeval2010, SemEval2017, Inspec, fao30, Thesis100, pak2018, and Wikinews, with results reported as Precision, Recall, and F- measure. Our proposed method performed much better in terms of evaluation metrics in all reviewed datasets compared with available methods in literature. An approximate 16.9% increase was witnessed in F-score metric and this was much more for the Inspec in English datasets and WikiNews in forgone languages.

Results

TaskDatasetMetricValueModel
Keyword ExtractionSemEval-2017 Task-10F1 score54FRAKE
Keyword ExtractionSemEval-2017 Task-10Precision@1053.6FRAKE
Keyword ExtractionSemEval-2017 Task-10Recall@1054.4FRAKE
Keyword ExtractionInspecF1 score58.9FRAKE
Keyword ExtractionInspecPrecision@1057.2FRAKE
Keyword ExtractionInspecRecall @ 1060.7FRAKE
Keyword ExtractionSemEval 2010 Task 8F1 score37.5FRAKE
Keyword ExtractionSemEval 2010 Task 8Precision@1041.5FRAKE
Keyword ExtractionSemEval 2010 Task 8Recall@1034.3FRAKE

Related Papers

Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching2025-06-17Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs2025-06-11Urania: Differentially Private Insights into AI Use2025-06-05FiLLM -- A Filipino-optimized Large Language Model based on Southeast Asia Large Language Model (SEALLM)2025-05-25On Multilingual Encoder Language Model Compression for Low-Resource Languages2025-05-22A Comparative Analysis of Static Word Embeddings for Hungarian2025-05-12From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising2025-04-30Keyword Extraction, and Aspect Classification in Sinhala, English, and Code-Mixed Content2025-04-14