KorAP: Regular Expressions
Regular expressions are patterns describing a set of strings.
In the KorAP backend a wide range of operators is supported, but only the following are guaranteed to be stable throughout the system:
Operators
.
- Any- Any symbol
()
- Group- Create a group of operands
|
- Alternation- Create alternative operands
[]
- Character Class- Group alternative characters
\
- Escape symbol- Mark the following character to be interpreted as verbatim, when the character is special (i.e. an operator or quantifier)
".eine" Frau
"Fr..de"
"Fr(ie|eu)de" []{,3} Eierkuchen
"Fre[um]de"
"b.w\."
Quantifiers
Operands in regular expressions can be quantified, meaning they are allowed to occur consecutively a specified number of times. The following quantifieres are supported:
?
- Match 0 or 1 times
*
- Match 0 or more times
+
- Match 1 or more times
{n}
- Match
n
times {n,}
- Match at least
n
times {n,m}
- Match at least
n
times but no more thanm
times
"Schif+ahrt"
"kl?eine" Kinder
"Schlos{2,3}traße"
"Rha(bar){2}"