- f66dc14 Fix end of text behaviour in case of sentence positions by Akron · 1 year, 3 months ago v0.2.2
- 8e80393 Minor performance improvements by Akron · 1 year, 7 months ago v0.2.1
- 72a6422 Introduce english clitics by Akron · 1 year, 7 months ago
- cae3911 Fix buffer bug in token writer by Akron · 1 year, 7 months ago
- df27581 Make tokenizer robust and never failing by Akron · 2 years, 8 months ago
- 00cecd1 Initialize identity for sigma < 256 by Akron · 3 years ago
- 4880fb6 Improve rune2symbol conversion by Akron · 3 years ago
- 9c3bf7f Change fmt to log for easier writing to STDOUT by Akron · 3 years, 1 month ago
- 04335c6 Update tests by Akron · 3 years, 1 month ago
- 96fdc9b Fix TokenWriter regarding sentence boundaries and remove simple TokenWriter by Akron · 3 years, 1 month ago
- 8cc2dd9 Fix buffer rewind at end of transmission by Akron · 3 years, 1 month ago
- 4f6b28c Support token offsets in token writer by Akron · 3 years, 1 month ago
- 32416ce Support offsets in token writer by Akron · 3 years, 1 month ago
- 98fbfef Improve offset handling in buffers by Akron · 3 years, 1 month ago
- a854faa Introduce EOT (end-of-transmission) marker by Akron · 3 years, 1 month ago
- e396a93 Introduce token_writer object by Akron · 3 years, 1 month ago
- 941f215 Support both matrix and da in the command by Akron · 3 years, 2 months ago
- 16c312e Serialize and deserialize matrix representation by Akron · 3 years, 2 months ago
- 1c34ce6 Introduce alternative matrix representation by Akron · 3 years, 2 months ago
- 0d0daa2 Split Foma parser from datok by Akron · 3 years, 2 months ago
- 7f1097f Rename datokenizer to datok by Akron · 3 years, 2 months ago[Renamed (99%) from datokenizer.go]
- 29e306f Combine Niu et al. (2013) and Morita et al. (2001) by Akron · 3 years, 3 months ago
- 679b486 Add skip-method proposed by Morita et al. (2001) by Akron · 3 years, 3 months ago
- 7b1faa6 Add xCheck() improvement proposed by Niu (2013) by Akron · 3 years, 3 months ago
- 34dbe97 Ignore MCS transitions instead of failing by Akron · 3 years, 3 months ago
- 0630be5 Fix parsing of end states by Akron · 3 years, 3 months ago
- 92704eb Ignore tokenend accepting transitions by Akron · 3 years, 3 months ago
- 4fa28b3 Introduce TransCount method by Akron · 3 years, 3 months ago
- 31f3c06 Ignore MCS in sigma if not used in the transducer by Akron · 3 years, 3 months ago
- de18e90 Minor optimization on edges by Akron · 3 years, 3 months ago
- 6f1c16c Added benchmark for double array creation by Akron · 3 years, 3 months ago
- ea46e8a Add ASCII fast lookup to sigma by Akron · 3 years, 3 months ago
- f1a1650 Turn uint32 array in bc array by Akron · 3 years, 3 months ago
- e61380b Added some minor comments by Akron · 3 years, 3 months ago
- 527c10c Replace zerolog with log by Akron · 3 years, 3 months ago
- bb4aac5 Optimize loading of datok files by Akron · 3 years, 3 months ago
- 8e1d69b Introduced command line tool by Akron · 3 years, 4 months ago
- 01912fc Remove unnecessary allocation for buffer recasting by Akron · 3 years, 4 months ago
- 4db3ecf Change exit operations to returning nil by Akron · 3 years, 4 months ago
- ec835ad Remove Match() method by Akron · 3 years, 4 months ago
- 6e70dc8 Fix sentence splitting tests by Akron · 3 years, 4 months ago
- 1594cb8 Fix sentence splitting by Akron · 3 years, 4 months ago
- c5d8d43 Fix check on final states by Akron · 3 years, 4 months ago
- b7e1f13 Simplify transducer (single test broken) by Akron · 3 years, 4 months ago
- df0a3ef Correctly handle final data by Akron · 3 years, 4 months ago
- 439f4ec Cleanup by Akron · 3 years, 4 months ago
- 03c92fe Support for tokenend MCS symbol by Akron · 3 years, 4 months ago
- b4bbb47 Added sentence splitter capabilities by Akron · 3 years, 4 months ago
- 3610f10 Introduce buffer with single epsilon backtrack by Akron · 3 years, 4 months ago
- 3a063ef Fix loading routine by Akron · 3 years, 4 months ago
- 524c543 Fix sigma to start with 1 by Akron · 3 years, 4 months ago
- 3f8571a Support reader/writer in transduce and add load by Akron · 3 years, 4 months ago
- 84d68e6 Support tokenend handling in transducing by Akron · 3 years, 4 months ago
- 2a4b929 Switch to 2 leading bits (30 bit addresses) by Akron · 3 years, 4 months ago
- 068874c Introduce nontoken handling in preliminary transducer by Akron · 3 years, 4 months ago
- 83e75a2 Introduce nontoken information by Akron · 3 years, 4 months ago
- 03a3c61 Rename loadLevel to loadFactor by Akron · 3 years, 4 months ago
- 3fdfec6 Turn states into uint32 pairs by Akron · 3 years, 4 months ago
- 64ffd9a Restructure and rename methods by Akron · 3 years, 4 months ago
- c17f1ca Turn special sigma values into properties by Akron · 3 years, 4 months ago
- 6247a5d Add serialization method by Akron · 3 years, 4 months ago
- 773b1ef Cache loadlevel by Akron · 3 years, 4 months ago
- d66a926 Add load factor by Akron · 3 years, 4 months ago
- f2120ca Split Tokenizer and DaTokenizer by Akron · 3 years, 4 months ago
- c9d84a6 Sort alphabet prior to xCheck by Akron · 3 years, 4 months ago
- 740f3d7 Cleanup code by Akron · 3 years, 4 months ago
- 49d27ee Fix epsilon handling in match operation by Akron · 3 years, 4 months ago
- 465a099 Add support for epsilon symbols by Akron · 3 years, 4 months ago
- 730a79c Support unknown and identity symbols by Akron · 3 years, 4 months ago
- 75ebe7f Fix foma format parser by Akron · 3 years, 4 months ago
- 8ef408b Initial commit by Akron · 3 years, 4 months ago