Replace Unwanted Characters Transform
The "Replace Unwanted Characters Transform" is a flexible component: type in the characters or words you want to remove from your source data and specify what should replace them. The component is configured with the options below.
• Input Column - Name of the input column
• Choose Action - Select one of the following actions.
■ Replace Alpha/Alphanumeric/Numeric characters and replace with user defined value - This option gives you six predefined options to replace characters. It is useful when you need a simple way to remove unwanted characters and do not know the specific characters you want to remove, but do know what you want to keep.
• Parameters - Action
o ReplaceAlpha - Remove/replace letters in the alphabet from the input string.
o ReplaceNonAlpha - Remove/replace anything other than letters in the alphabet from the input string.
o ReplaceNumeric - Remove/replace numbers from the input string.
o ReplaceNonNumeric - Remove/replace anything other than numbers from the input string.
o ReplaceAlphaNumeric - Replace/remove any characters [A-Z], [a-z], and [0-9] from the input string.
o ReplaceNonAlphaNumeric - Replace/remove any characters other than [A-Z], [a-z], and [0-9] from the input string.
• Parameters - Replace Match Character With - The character or word to replace the characters defined in the action.
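The six predefined actions can be pictured as six character classes. The following Python sketch is only an illustration of that idea and is not the component's implementation: the `ACTION_PATTERNS` table and the `apply_action` function are hypothetical names, and the ASCII-only ranges are an assumption taken from the [A-Z], [a-z], and [0-9] wording above.

```python
import re

# Hypothetical mapping from the action names above to regular expressions.
# Assumption: only ASCII letters and digits count, as the [A-Z], [a-z], [0-9]
# wording suggests; the real component may treat other alphabets differently.
ACTION_PATTERNS = {
    "ReplaceAlpha": r"[A-Za-z]",
    "ReplaceNonAlpha": r"[^A-Za-z]",
    "ReplaceNumeric": r"[0-9]",
    "ReplaceNonNumeric": r"[^0-9]",
    "ReplaceAlphaNumeric": r"[A-Za-z0-9]",
    "ReplaceNonAlphaNumeric": r"[^A-Za-z0-9]",
}


def apply_action(value, action, replace_with=""):
    """Replace every character matched by the chosen action."""
    # A function is used as the replacement so that replace_with is inserted
    # literally, even if it contains backslashes.
    return re.sub(ACTION_PATTERNS[action], lambda match: replace_with, value)


print(apply_action("(904) 555-0142", "ReplaceNonNumeric"))         # 9045550142
print(apply_action("Order #A-17", "ReplaceNonAlphaNumeric", "_"))  # Order__A_17
```

Leaving the replacement value empty removes the matched characters, which is the "know what you want to keep" case: ReplaceNonNumeric keeps only the digits of a phone number.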
■ Replace Specified characters or words with user defined value - This option gives you the ability to replace any characters or words from the input source with a user defined value. This is useful when you need to be more selective with the characters that will be replaced.
o Characters to be searched - Define what characters should be replaced. Certain characters will need to be entered as escape sequences:
■ Tab - \t
■ CarriageReturn - \r
■ LineFeed - \n
■ VerticalTab - \v
■ SingleQuote - \'
■ DoubleQuotes - \"
■ NullCharacter - \0
o Words to be searched - Define words or Unicode characters to be replaced. All words must be separated by a vertical bar ( | ). To replace a Unicode character, specify \u followed by the character's hexadecimal code (for example, to replace the Greek letter beta you would specify \u03B2).
o Replace with - Define the characters or words to replace the unwanted characters or words defined in the "Characters to be searched" and "Words to be searched" parameters. Usually this value is left blank.
o Ignore Case - Choose whether the search for the words or characters should ignore case. The default is false, meaning the search is case-sensitive.
o Maximum Replacements - Specify how many occurrences you want to replace. Setting this value to -1 will replace all occurrences in the input string.
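How these parameters might interact can be sketched in Python. This is a minimal illustration under stated assumptions, not the component's actual logic: the function `replace_specified` and its parameter names are hypothetical, the replacements are assumed to be counted from left to right across the whole string, and a Maximum Replacements value of 0 is assumed to replace nothing. Note that escape sequences such as "\t" and "\u03B2" in the example are interpreted by Python itself here.

```python
import re


def replace_specified(value, chars="", words="", replace_with="",
                      ignore_case=False, max_replacements=-1):
    """Replace the listed characters and |-separated words in value."""
    # Words are separated by a vertical bar; every character in chars is
    # treated as its own search target.
    targets = [word for word in words.split("|") if word] + list(chars)
    if not targets or max_replacements == 0:
        return value
    # Try longer targets first so a word wins over a single character.
    targets.sort(key=len, reverse=True)
    pattern = "|".join(re.escape(target) for target in targets)
    flags = re.IGNORECASE if ignore_case else 0
    # re.sub treats count=0 as "replace all", so -1 is mapped to 0.
    count = 0 if max_replacements < 0 else max_replacements
    return re.sub(pattern, lambda match: replace_with, value,
                  count=count, flags=flags)


print(replace_specified("a\tb\r\nc", chars="\t\r\n"))                   # abc
print(replace_specified("a-b-c-d", chars="-", max_replacements=2))      # abc-d
print(replace_specified("Acme INC", words="inc|llc", replace_with="Co",
                        ignore_case=True))                              # Acme Co
print(replace_specified("alpha \u03B2", words="\u03B2",
                        replace_with="beta"))                           # alpha beta
```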
■ Replace invalid characters that cannot be part of an Xml Document - Replaces "&", "<" and ">", as well as double quotes (") and single quotes (') within attributes.
■ Replace non-printable characters - Replaces all non-printable characters such as tab and carriage return.
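The last two actions can be approximated in Python as shown below. This is a rough sketch only: the text above does not say what the unwanted characters are replaced with, so replacing XML characters with entity references, removing non-printable characters by default, and treating quotes the same everywhere in the string are all assumptions. The function names are hypothetical.

```python
from xml.sax.saxutils import escape


def replace_xml_invalid(value):
    """Escape characters that cannot appear literally in XML text."""
    # escape() handles &, < and > on its own; the extra mapping covers the
    # double and single quotes that matter inside attribute values.
    return escape(value, {'"': "&quot;", "'": "&apos;"})


def replace_non_printable(value, replace_with=""):
    """Replace characters that Python classifies as non-printable."""
    # str.isprintable() is False for tab, carriage return, line feed and
    # other control characters, and True for an ordinary space.
    return "".join(ch if ch.isprintable() else replace_with for ch in value)


print(replace_xml_invalid('Tom & "Jerry" <cat>'))
# Tom &amp; &quot;Jerry&quot; &lt;cat&gt;
print(replace_non_printable("First\tName\r\n"))
# FirstName
```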
o Output Actions
■ Replace the column data with fixed data
■ Output the fixed data to a new output column - The new output column name will be "inputcolumn_output". For example, if the source column is "FirstName", the output column will be "FirstName_output".
Please see the Error Row Handling page for more information about this functionality.