This help section describes the Charset blacklist test and the related settings available under the
page in the navigation.The email language is often declared by a so-called "character set" or "charset", which may be used detect if the email was written in a specific language/script. This feature of ORF can blacklist unwanted emails written in languages you do not normally use.
You can enable or disable the use of the Charset Blacklist on the
page in the navigation.Click button New to add a new charset to the list. To modify an existing charset, click Modify or hit Enter. Character sets can be deleted using the Delete button or the Delete key.
Click the column header of any column by which you wish to sort the charset list. To reverse sorting, click the column header again.
Right-click on the charset list and select "Import List..." or "Export List..." Alternatively, you can do this from the menu, select
or .Right-click on the list item or items and select "Search in logs..." to find log records that match the defined expression. Logs need to be loaded in the Log Viewer beforehand.
ORF searches the email subject and any body parts for character set declarations. If any of the collected charsets match with any of the charsets on the Charset blacklist, the email will be blacklisted.
Charset-based blacklisting has two major limitations.
The default list was compiled from Vamsoft spam samples and includes character sets we see most often in our foreign language spam. By default, no character set is enabled. This is because the list has to be customized for your location and email profile.
Character sets are declared in the email header, email part headers and the subject. In Microsoft® Outlook®, you can view this information under the (see Internet Headers). In the email header and part headers, the charset declaration typically looks like this:
The underlined part is the character set name. In subjects, the charset is encoded as:
Again, the underlined part is the character set name.
Before blacklisting specific charsets, it is recommended to check what languages/scripts the given charset covers. Note that it is not recommended under any circumstances to blacklist the "utf-8" charset, even when seen in foreign language spam. See the Limitations section for more information.