Synonym Normalizer
Input
@two_way,pants,trousers
@two_way,trousers,kalhoty
@two_way,počítač,pocitac
@two_way,pocitac,pc,PC
Output
@two_way,pants,trousers,kalhoty
@two_way,počítač,pocitac,pc
Features
- merge lines that share at least one word
- remove duplicate words (removes words without diacritics)
- remove duplicate lines
- does not change the word order
- can show modified lines (but not removed lines)
- can change the synonym type to @two_way
- replace _ with a space
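
The merging behaviour listed above can be sketched roughly as follows. This is only an illustration in Python, not the tool's actual code: the function names normalize and _add_words are hypothetical, duplicates are assumed to be compared case-insensitively (so "PC" collapses into "pc"), variants without diacritics are kept as in the sample output, and only lines of the same synonym type are assumed to be merged.

```python
def _add_words(group, words):
    """Append words to a group, skipping case-insensitive duplicates."""
    for word in words:
        key = word.casefold()
        if key not in group["keys"]:
            group["keys"].add(key)
            group["words"].append(word)


def normalize(lines):
    """Merge synonym lines that share a word, keeping first-seen word order.

    Assumption: each line looks like "@type,word1,word2,...".
    """
    groups = []  # each group: {"type": str, "words": list, "keys": set}
    for line in lines:
        parts = [p.strip() for p in line.strip().split(",")]
        if not parts or not parts[0]:
            continue  # skip empty lines
        syn_type = parts[0]
        # "_" is replaced with a space in words only, never in the type
        words = [w.replace("_", " ") for w in parts[1:] if w]
        keys = {w.casefold() for w in words}

        # Find every existing group of the same type that shares a word
        matches = [g for g in groups
                   if g["type"] == syn_type and g["keys"] & keys]
        if matches:
            target = matches[0]
            # A line may bridge several groups: fold them into the first one
            for other in matches[1:]:
                _add_words(target, other["words"])
            groups = [g for g in groups
                      if all(g is not m for m in matches[1:])]
        else:
            target = {"type": syn_type, "words": [], "keys": set()}
            groups.append(target)
        _add_words(target, words)

    return [",".join([g["type"], *g["words"]]) for g in groups]


if __name__ == "__main__":
    sample = [
        "@two_way,pants,trousers",
        "@two_way,trousers,kalhoty",
        "@two_way,počítač,pocitac",
        "@two_way,pocitac,pc,PC",
    ]
    for row in normalize(sample):
        print(row)
```

Run on the sample input from the top of this page, the sketch prints the two output lines shown there. Exact duplicate lines disappear as a side effect, because all of their words are already present in an existing group.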
Checklist after synonym normalization
- are the synonym types correct?
- are any synonyms missing? e.g. in "@two_way,černý,černá" the form "černé" is missing
TODO
- mark a pair as a two_way synonym if it appears as two one_way synonyms
RD, 2024