Hello all,
I'm trying to use the Fuzzy Match step with French words (specifically, French first names).
Is there a way to use this step with special characters? My tests show that Therese (my test input) is considered closer to Terese than to Thérèse (the correct spelling). An automatic replacement would therefore leave this first name misspelled instead of merely missing its accents. I would not be cleaning my data, just making it worse ;)
I have the same problem with other accents, cedillas, diaereses, and Œ (among others).
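
To make the problem concrete, here is a minimal Python sketch, written outside the Fuzzy Match step and assuming (this is only my guess) that the step behaves like a plain Levenshtein edit distance. The helper names `levenshtein`, `fold` and `LIGATURES` are mine, for illustration only. It shows why Therese scores closer to Terese, and how folding accents on both sides before comparing might avoid that.

```python
import unicodedata


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: insertions, deletions and substitutions cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


# Ligatures have no Unicode decomposition, so they need an explicit mapping.
# (Hypothetical table, extend as needed.)
LIGATURES = {"Œ": "OE", "œ": "oe", "Æ": "AE", "æ": "ae"}


def fold(text: str) -> str:
    """Return a comparison key: ligatures expanded, accents/cedillas/diaereses removed."""
    text = "".join(LIGATURES.get(ch, ch) for ch in text)
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


proper = unicodedata.normalize("NFC", "Thérèse")  # force precomposed é and è

print(levenshtein("Therese", "Terese"))             # 1: only the "h" is missing
print(levenshtein("Therese", proper))               # 2: é and è each count as a substitution
print(levenshtein(fold("Therese"), fold(proper)))   # 0: identical once accents are folded
```

The idea would be to match on the folded key only and keep the accented reference value (Thérèse) as the replacement, so the cleaned data ends up with the proper spelling.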
Thanks in advance for your answers
Mathieu