
Searching with unicode regular expression

Comments

6 comments

  • Adrian Williams

    You can just copy the actual Unicode characters into your UTF-8 encoded CSV file and then import them using the CSV/Delimited node.

    Attached files

    ExtendedChar_Replacements.txt

     

  • M M

    Thanks Adrian, but I'm not certain that all the required characters are displayable, and I still need to find and replace them. The ones above are for the purposes of the query only.

  • M M

    To be honest, the inconsistent behaviour between the Create Data and the CSV nodes itself is concerning. Can a bug report be raised for this?

  • Adrian Williams

    I will confer with the team to confirm the situation, but I believe the nodes are operating correctly. The CSV/Delimited node expects the data in the file to be encoded using the character set defined by the FileCharacterSet property. The node does not support escaped Unicode characters.

    In the meantime, does this provide you with a solution for escaped UTF-8 characters?
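
    The snippet originally posted with this comment is not preserved. Judging by the reply that mentions the 'unicode_escape' option, it probably used Python's built-in codec of that name. The following is only a minimal sketch under that assumption: the helper name decode_escapes and the sample string are hypothetical, and it is not the node configuration from the original post.

```python
def decode_escapes(text: str) -> str:
    """Turn literal escape sequences such as \\u00e9 into real characters."""
    # Hypothetical helper, not taken from the original thread.
    # 'unicode_escape' decodes bytes, so the string is first turned into bytes.
    # backslashreplace rewrites any character outside Latin-1 as a \uXXXX
    # escape, so such characters survive the round trip unchanged.
    raw = text.encode("latin-1", errors="backslashreplace")
    return raw.decode("unicode_escape")


# Sample input: the escapes are literal backslash sequences, as they would
# appear in a file that stores "\u00e9" as six separate characters.
escaped = r"caf\u00e9 costs 3\u20ac"
decoded = decode_escapes(escaped)
print(decoded)  # café costs 3€
```

    Note that 'unicode_escape' interprets every backslash sequence it recognises, including \n and \t, so any literal backslashes in the data (Windows paths, for example) would need to be protected first.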

     

  • Avatar
    M M

    Ah, very nice. Was unaware of the 'unicode_escape' option.

  • Avatar
    Adrian Williams

    The team confirmed the CSV/Delimited node is operating as expected. The node is designed to import data that is encoded per a supported codec (e.g. UTF-8, UTF-16LE or ISO-8859-1).
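
    To illustrate the distinction between an encoded character and an escaped one, here is a small sketch in Python. The codec names are Python's own spellings and are an assumption for illustration; they are not necessarily the values accepted by the FileCharacterSet property. The helper encoded_forms is hypothetical.

```python
# Python codec names, assumed here as stand-ins for the codecs named above.
CODECS = ("utf-8", "utf-16-le", "iso-8859-1")


def encoded_forms(text: str) -> dict:
    """Return the bytes each codec would write to a file for the given text."""
    return {codec: text.encode(codec) for codec in CODECS}


# The same character is stored as different bytes under each codec.
forms = encoded_forms("é")
for codec, data in forms.items():
    print(codec, data)

# An escape sequence stored in a file is just six ordinary characters.
# Reading it back with a plain codec returns the same six characters;
# nothing interprets the escape.
literal = r"\u00e9"
round_trip = literal.encode("utf-8").decode("utf-8")
print(len(round_trip), round_trip == literal)  # 6 True
```

    A reader that only applies a codec therefore passes the text \u00e9 through unchanged, which is consistent with the behaviour described for the CSV/Delimited node.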

