Cleaning-up Persian Texts!
fix_three_dots
normalize_dates
to re-order date parts with slash as delimiterfix_misc_spacing
to remove space before braces containing numbersremove_diacritics
to remove all diacritic characters #4fix_diacritics
, props @languagetool-orgfix_misc_non_persian_chars
fix_spacing_for_punctuations
fix_numeral_symbols
fix_diacritics
fix_hamzeh
cleanup_spacing
fix_numeral_symbols
to replace percent signs and decimal separators, props @ebraminio/persiantoolsfix_persian_glyphs
to replace glyph chars, props @ebraminio/persiantoolsfix_suffix_misc
to fix hamza with double yeh, props @ebraminio/persiantoolsmarkdown_normalize_braces
markdown_normalize_lists
preserve_frontmatter
to preserve frontmatter data?!
into !?
aggresive
optioncleanup_begin_and_end
fix_spacing_for_punctuations
extracted from fix_spacing_for_braces_and_quotes
preserve_braces
, preserve_brackets
, skip_markdown_ordered_lists_numbers_conversion
normalize_ellipsis
to replace more than one ellipsis with onecleanup_zwnj
cleanup_begin_and_end
cleanup_line_breaks
to remove more than two contiguous line breakspreserve_entities
to preserve html non decoded entitiespreserve_comments
to preserve html commentspreserve_nbsps
to preserve no-break spacesfix_punctuations
bi*
*am
, *at
, *ash
, *ei
, *eid
, *eem
, *and
*hayee
, *hayam
, *hayat
, *hayash
, *hayetan
, *hayeman
, *hayeshan
*tar
, *tari
, *tarin
props @zoghal
decode_htmlentities