What on Earth is Unicode Normalization?
Both \u00C7 and \u0043\u0327 seem to be producing the same character. In reality, they’re completely different — and any NLP model you build will see them both as being completely different too.
Source: towardsdatascience.com