TFIdf

NAME

Text::TFIdf

DESCRIPTION

Text::TFIdf assists in calculating TF-IDF vectors and scores for a collection of documents.

Examples

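    use Text::TFIdf;
    use Lingua::EN::Stopwords::Short;

    # Create a document store that uses the imported stop-word list.
    my $doc-store = TFIdf.new(:trim(True), :stop-list(%stop-words));

    # Add the documents that make up the corpus.
    $doc-store.add('perl is cool');
    $doc-store.add('i like node');
    $doc-store.add('java is okay');
    $doc-store.add('perl and node are interesting meh about java');

    # Callback that receives a document id and its score.
    sub results($id, $score) {
        say $id ~ " got " ~ $score;
    }

    # Score the query against the corpus, once with the callback
    # and once printing the value returned by tfids.
    $doc-store.tfids('node perl java', &results);
    say $doc-store.tfids('node perl java');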

This example prints numbers that represent how similar the given document ('node perl java') is to each document in the corpus.
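
To give a feel for where such numbers can come from, here is a minimal, self-contained sketch of TF-IDF scoring in plain Raku. It does not use Text::TFIdf and is not necessarily how the module computes its scores: the helper names (tf, idf, score), the whitespace tokenization, and the smoothed formula 1 + log(N / (1 + df)) are all assumptions chosen for illustration, and no stop-word filtering is applied.

```raku
# Illustrative sketch only; all names and the formula are assumptions,
# not the Text::TFIdf implementation.
my @docs = 'perl is cool', 'i like node', 'java is okay',
           'perl and node are interesting meh about java';

# Split each document into lowercase, whitespace-separated tokens.
my @tokens = @docs.map({ .lc.words.List });

# Term frequency: how often $term occurs in one tokenized document.
sub tf(Str $term, @doc) {
    @doc.grep(* eq $term).elems
}

# Inverse document frequency, smoothed so that a term missing from
# the whole corpus does not cause a division by zero.
sub idf(Str $term) {
    my $containing = @tokens.grep({ tf($term, $_) > 0 }).elems;
    1 + log(@tokens.elems / (1 + $containing))
}

# Score of a query against one document: sum of tf * idf over the query terms.
sub score(Str $query, @doc) {
    [+] $query.lc.words.map({ tf($_, @doc) * idf($_) })
}

for @tokens.kv -> $id, @doc {
    say "$id got { score('node perl java', @doc) }";
}
```

Under these assumptions, a document that shares more terms with the query receives a higher score, and a document that shares none scores zero.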
