| commit | 2f7f6f3cc19f79bfc1709daae4b9d92c3acf5743 | [log] [tgz] |
|---|---|---|
| author | Akron <nils@diewald-online.de> | Wed Feb 11 15:12:48 2026 +0100 |
| committer | Akron <nils@diewald-online.de> | Wed Feb 11 15:12:48 2026 +0100 |
| tree | 6c3796a5b11faa2b524df2d16740b80de1206b03 | |
| parent | 3dd560ea8a65ce28f48defe665d64eb3d19ec2e9 [diff] [blame] |
Support German gender-sensitive DET, ADJ, PRON ending (from KorAP-Tokenizer) Change-Id: I8f20ecb913c0fe514b5936ab43287ca616695f16
diff --git a/testdata/de/split.txt b/testdata/de/split.txt new file mode 100644 index 0000000..14a0e37 --- /dev/null +++ b/testdata/de/split.txt
@@ -0,0 +1,11 @@ +der/die +er/sie +und/oder +Modell/Versuch +Quelle:rbb +Foto:emm +Dies(ist)falsch +das/ist/falsch +mir:geht +Vor/Nachteile +Innenminister/Innenministerinnen