commit | e4adb6901cf99cc3295905fb64e8c906b3893f6a | |
---|---|---|
author | Marc Kupietz <kupietz@ids-mannheim.de> | Sun Sep 26 11:57:01 2021 +0200 |
committer | Marc Kupietz <kupietz@ids-mannheim.de> | Sun Sep 26 14:52:48 2021 +0200 |
tree | 24f36757d5adc1d11dbc44181f229541630c94d8 | |
parent | 5d566530ff5c8672ab92bf370b7c095a09b49df7 | |

Make sure that start and end tags for empty texts are counted

For each text, no matter if empty or not, there will be one start and
end tag count in the unigrams.

Change-Id: I9fe769ea3d8a7de7b078499f33a611a7ba4bac4d

diff --git a/src/test/resources/simple_1gram_padded.freq b/src/test/resources/simple_1gram_padded.freq
index 54522cb..b2f1b31 100644
--- a/src/test/resources/simple_1gram_padded.freq
+++ b/src/test/resources/simple_1gram_padded.freq
@@ -1,6 +1,6 @@
+«END» 7
+«START» 7
 . 3
-«END» 3
-«START» 3
 alex 3
 ich 3
 bin 2
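
The behaviour described in the commit message can be illustrated with a minimal sketch. This is not the project's implementation: the function name `count_padded_unigrams`, the token-list input format, and the sample texts are assumptions made for illustration. Only the `«START»` and `«END»` markers are taken from the test fixture above.

```python
from collections import Counter

# Padding markers as they appear in simple_1gram_padded.freq.
START, END = "«START»", "«END»"


def count_padded_unigrams(texts):
    """Count unigrams over tokenized texts (hypothetical helper).

    Every text contributes exactly one START and one END count,
    whether or not it contains any tokens.
    """
    counts = Counter()
    for tokens in texts:
        # Count the padding tags unconditionally, so empty texts are included.
        counts[START] += 1
        counts[END] += 1
        counts.update(tokens)
    return counts


# Hypothetical input: three texts, the second one empty.
texts = [["ich", "bin", "alex", "."], [], ["alex", "."]]
counts = count_padded_unigrams(texts)
```

With this input, the padding tags are each counted three times, once per text, even though only two of the texts contain tokens.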