Unicode-based white space segmentation

A method of tokenization that uses Unicode character properties (such as a character's general category or its White_Space property) to distinguish token characters from separator characters.
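As an illustration only, the idea can be sketched in Python. The definition does not say which Unicode properties are consulted, so this sketch assumes a character is a separator when its general category is Zs, Zl, or Zp, or when it is one of the control characters that carry the White_Space property. The names `is_separator` and `segment` are hypothetical and chosen for this example.

```python
import unicodedata
from typing import List

# Assumption: control characters that have the Unicode White_Space property
# (tab, line feed, vertical tab, form feed, carriage return, next line).
_CONTROL_WHITE_SPACE = frozenset("\t\n\v\f\r\x85")


def is_separator(ch: str) -> bool:
    """Return True if ch is treated as a separator in this sketch.

    Separator = general category Zs/Zl/Zp, or a white space control character.
    """
    return ch in _CONTROL_WHITE_SPACE or unicodedata.category(ch).startswith("Z")


def segment(text: str) -> List[str]:
    """Split text into tokens at runs of separator characters."""
    tokens: List[str] = []
    current: List[str] = []
    for ch in text:
        if is_separator(ch):
            # A separator closes the token in progress, if there is one.
            if current:
                tokens.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


if __name__ == "__main__":
    # NO-BREAK SPACE (U+00A0) and IDEOGRAPHIC SPACE (U+3000) act as separators.
    print(segment("foo\u00a0bar\u3000baz\tqux"))
```

Under these assumptions, characters such as NO-BREAK SPACE and IDEOGRAPHIC SPACE split tokens just as an ASCII space does, while ZERO WIDTH SPACE (U+200B, general category Cf) does not. A different choice of properties would draw the line elsewhere.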

Creator

  • PWH617
© 2024 CSOFT International, Ltd.