commit | e4adb6901cf99cc3295905fb64e8c906b3893f6a | |
---|---|---|
author | Marc Kupietz <kupietz@ids-mannheim.de> | Sun Sep 26 11:57:01 2021 +0200 |
committer | Marc Kupietz <kupietz@ids-mannheim.de> | Sun Sep 26 14:52:48 2021 +0200 |
tree | 24f36757d5adc1d11dbc44181f229541630c94d8 | |
parent | 5d566530ff5c8672ab92bf370b7c095a09b49df7 | |
Make sure that start and end tags for empty texts are counted

For each text, no matter if empty or not, there will be one start and end tag count in the unigrams.

Change-Id: I9fe769ea3d8a7de7b078499f33a611a7ba4bac4d
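The counting rule described in the commit message can be illustrated with a minimal sketch. This is not the project's implementation: the function name `padded_unigram_counts` and the sample corpus are hypothetical, and the lemma and part-of-speech columns of the `.freq` file are left out. The input is an assumption chosen so that the token counts match the updated test resource (three non-empty texts plus four empty ones).

```python
from collections import Counter

START, END = "«START»", "«END»"


def padded_unigram_counts(texts):
    """Count unigrams, adding one START and one END per text.

    Empty texts contribute no tokens but still get both padding tags.
    """
    counts = Counter()
    for tokens in texts:
        counts[START] += 1      # one start tag per text, empty or not
        counts.update(tokens)   # no-op for an empty token list
        counts[END] += 1        # one end tag per text, empty or not
    return counts


# Hypothetical corpus (an assumption, not the actual test input):
# three non-empty texts and four empty ones.
texts = [
    ["ich", "bin", "alex", "."],
    ["ich", "bin", "alex", "."],
    ["ich", "heiße", "alex", "."],
    [], [], [], [],
]

counts = padded_unigram_counts(texts)
for token, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
    print(token, n)
```

With this input both padding tags are counted seven times, once per text. If empty texts were skipped, they would be counted only three times.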
diff --git a/src/test/resources/simple_1lpgram_padded.freq b/src/test/resources/simple_1lpgram_padded.freq
index ff8c4f7..117e0e3 100644
--- a/src/test/resources/simple_1lpgram_padded.freq
+++ b/src/test/resources/simple_1lpgram_padded.freq
@@ -1,7 +1,7 @@
+«END» «END» «STARTEND» 7
+«START» «START» «STARTEND» 7
 . . $. 3
 alex alex NE 3
 ich ich PPER 3
-«END» «END» «STARTEND» 3
-«START» «START» «STARTEND» 3
 bin sein VAFIN 2
 heiße heißen VAFIN 1