Reputation: 21
Sequence ids are optional in CNTK text format. I am wondering how sequence ids are used during training. When a minibatch is created from a CNTKTextFormat with sequence ids, is a line in CNTKTextFormat considered as one sample or all lines with the same sequence id together is considered as one sample?
Upvotes: 2
Views: 91
Reputation: 2859
If IDs are given, then all lines with the same sequence ID are forming together one training instance (in CNTK lingo: they are forming a sequence consisting of samples).
If IDs are missing, then each line is a new training instance (consisting of a sequence with a single sample).
Upvotes: 1