python3-normality

Micro-library to normalize text strings in Python
  https://github.com/pudo/normality
  0
  no reviews



Normality is a Python micro-package that contains a small set of text normalization functions for easier re-use. These functions accept a snippet of Unicode or UTF-8 encoded text and remove various classes of characters, such as diacritics, punctuation etc. This is useful as a preparation to further text analysis.