
Fuzzy Xref Fuzzy Threshold

Comments

2 comments

  • Komsky, Eugine

    Is there a better way to compare two strings?

     

  • Stéphane O

    You first need to select the fuzzy algorithm.

    The help page will give you all the information you need:

    http://localhost:8080/docs/dist/help/Content/e-node-help/Correlation/fuzzy-join.html

     

    A few months ago I tried to use the Levenshtein distance algorithm with both Lavastorm and Data3sixty to compare millions of lines against hundreds of thousands of records. Neither Lavastorm nor Data3sixty was able to run such a huge number of comparisons on my computer.

    We wrote a Python script and ran it on a 16-core server.

    How you write the Python code has a huge influence on the processing time, but 13,000 x 800 comparisons (about 10.4 million) should be done within a few minutes. Which algorithm have you selected?
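For illustration only, here is a minimal sketch of what such a script could look like. It is not the script mentioned above: the function names (levenshtein, best_match, match_all), the worker count and the chunk size are all assumptions chosen for the example. It computes the Levenshtein distance with a two-row table and spreads the per-value work over several processes using only the standard library.

```python
from concurrent.futures import ProcessPoolExecutor
from functools import partial


def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b, keeping only two rows of the table."""
    if len(a) < len(b):
        a, b = b, a  # make b the shorter string so the rows stay small
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def best_match(value: str, candidates: list[str]) -> tuple[str, str, int]:
    """Return (value, closest candidate, distance) for one input string."""
    closest = min(candidates, key=lambda c: levenshtein(value, c))
    return value, closest, levenshtein(value, closest)


def match_all(values, candidates, workers=4):
    """Hypothetical helper: run best_match for every value in parallel."""
    with ProcessPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(partial(best_match, candidates=candidates),
                             values, chunksize=256))


if __name__ == "__main__":
    # Made-up sample data, only to show the call shape.
    left = ["Jon Smith", "Acme Corp", "Lavastrom"]
    right = ["John Smith", "ACME Corporation", "Lavastorm"]
    for row in match_all(left, right, workers=2):
        print(row)
```

A pure-Python inner loop like this is the slow part, so in practice the choice of distance implementation usually matters more than the number of cores. The process pool only helps once each worker receives a reasonably large chunk of values.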
