pytorch/text

Ignoring UNK words

Open

#355 opened on Jul 22, 2018

Labels: enhancement, help wanted


Description

I cannot find a way to ignore UNK words when numericalising, i.e. instead of substituting them with a 0, the word is simply dropped.

Is that implemented?

This is useful in classification problems, where you just want to remove UNK words from the input.
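To make the request concrete, here is a minimal sketch of the behaviour I mean. It is plain Python, not an existing torchtext option: the function name `numericalize_drop_unk` and the toy `stoi` mapping are hypothetical, and it assumes the vocabulary exposes a token-to-index mapping that supports membership tests.

```python
from typing import Dict, List


def numericalize_drop_unk(tokens: List[str], stoi: Dict[str, int]) -> List[int]:
    """Map tokens to indices, skipping tokens that are not in the vocabulary.

    Hypothetical helper for illustration; not part of torchtext.
    """
    # The membership test does not insert keys, so it also works
    # if stoi happens to be a defaultdict.
    return [stoi[tok] for tok in tokens if tok in stoi]


# Hypothetical toy vocabulary; index 0 is the slot that would
# normally be used for unknown words.
stoi = {"<unk>": 0, "<pad>": 1, "good": 2, "movie": 3}

tokens = ["good", "zzz", "movie"]
ids = numericalize_drop_unk(tokens, stoi)
print(ids)  # [2, 3]
```

Note that dropping words changes the sequence length, so padding would have to happen after this step rather than before it.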
