v2.2.0 · Tags · Jens Getreu / stringsext

v2.2.0

4c577689 · add continuous delivery pipeline · Mar 19, 2020

New feature: "same-Unicode-block"

Requires all characters in a finding to originate from the same Unicode
block. This option helps to reduce false positives, especially when
scanning for UTF-16. When set, "`stringsext`" prints only strings whose
characters all belong to one Unicode block. For example, "`-u All -n 10 -r`"
finds sequences of at least 10 consecutive Cyrillic characters or at least
10 consecutive Greek characters, but ignores strings in which Cyrillic and
Greek characters are randomly mixed. Technically, this option guarantees
that all multibyte characters of a finding, encoded as UTF-8, start with
the same leading byte.
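
The leading-byte property can be illustrated with a minimal Python sketch.
This is not the "`stringsext`" implementation. The helper names
`leading_bytes` and `same_leading_byte` are hypothetical, and the sketch
only checks the property stated above on an already decoded string.

```python
def leading_bytes(text: str) -> set:
    """Collect the first UTF-8 byte of every multibyte (non-ASCII) character.

    Hypothetical helper for illustration only.
    """
    return {ch.encode("utf-8")[0] for ch in text if ord(ch) > 0x7F}


def same_leading_byte(text: str) -> bool:
    """Return True if all multibyte characters share one UTF-8 leading byte.

    ASCII characters are single-byte and are ignored by this check
    (an assumption of this sketch).
    """
    return len(leading_bytes(text)) <= 1


# Cyrillic U+0433..U+0435: every character encodes with leading byte 0xD0.
print(same_leading_byte("где"))
# Greek U+03B1..U+03B3: every character encodes with leading byte 0xCE.
print(same_leading_byte("αβγ"))
# Mixed Cyrillic and Greek: leading bytes 0xD0 and 0xCE both occur.
print(same_leading_byte("гαдβ"))
```

In a two-byte UTF-8 sequence, one leading byte covers 64 consecutive code
points, so this sketch groups characters more finely than the named Unicode
blocks do.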