yake

Community

Single-document unsupervised keyword extraction

Copied fromcf-post-staging / yake

Versions

Installation

To install this package, run one of the following:

Conda

$conda install conda-forge::yake

Usage Tracking

Version

5 / 8 versions selected

Downloads (Last 6 months): 0

Description

Unsupervised Approach for Automatic Keyword Extraction using Text Features. YAKE! is a light-weight unsupervised automatic keyword extraction method which rests on text statistical features extracted from single documents to select the most important keywords of a text. Our system does not need to be trained on a particular set of documents, neither it depends on dictionaries, external-corpus, size of the text, language or domain. To demonstrate the merits and the significance of our proposal, we compare it against ten state-of-the-art unsupervised approaches (TF.IDF, KP-Miner, RAKE, TextRank, SingleRank, ExpandRank, TopicRank, TopicalPageRank, PositionRank and MultipartiteRank), and one supervised method (KEA). Experimental results carried out on top of twenty datasets (see Benchmark section below) show that our methods significantly outperform state-of-the-art methods under a number of collections of different sizes, languages or domains. In addition to the python package here described, we also make available a demo, an API and a mobile app.

About

Summary

Single-document unsupervised keyword extraction

Last Updated

Feb 10, 2026 at 11:07

License

GPL-3.0-only

Supported Platforms

noarch

Home

https://github.com/LIAAD/yake

GitHub Repository

https://github.com/LIAAD/yake

Documentation

https://github.com/LIAAD/yake