pytorch/text

Language modelling dataset only one sample

Open

#381 opened on Sep 12, 2018

View on GitHub
 (2 comments) (0 reactions) (0 assignees)Python (822 forks)batch import
datasetshelp wanted

Repository metrics

Stars
 (3,396 stars)
PR merge metrics
 (No merged PRs in 30d)

Description

from torchtext import data
from torchtext import datasets

TEXT = data.Field(lower=True, batch_first=True)

train, valid, test = datasets.WikiText2.splits(TEXT)

print('len(train)', len(train))

This returns a length of one. It should print the length of the whole dataset. I have tried both with version 0.2.3 and 0.3 and none of them worked.

Contributor guide