Delphix Products

Expand all | Collapse all

Lookupfiile Limits

Jump to Best Answer
  • 1.  Lookupfiile Limits

    Posted 09-12-2017 11:38:00 PM
    Is there a limit on the number of lines of lookup file used for Secure Lookup?
    #Masking


  • 2.  RE: Lookupfiile Limits

    Posted 09-13-2017 06:19:00 AM
    Hi Koichi, there shouldn't be limits on number of rows on SL files. However keep in mind that all data read from SL file stay in RAM while job is running, so you should increase the max size of memory in the job definition form. The default value is 1024MB and could be not enough. If you increase the max size of RAM I suggest you to increase the minimum size too for better performance of the garbage collector of the JVM. Best Regards Gianpiero Piccolo


  • 3.  RE: Lookupfiile Limits
    Best Answer

    Posted 09-13-2017 09:59:00 AM
    There are practical limits and there have been issues with multi-million files in the past.
    However, it's not recommended to create large lookup lists.

    First, you should know that no matter how many entries your lookup list has, even if they are more values than your original data has, it will always create duplicate masked values! This is intentional and built into the code.

    The protection of the data and the irreversibility stem from mapping multiple, even thousands of original data, to the same masked value. Otherwise the cardinality of an entry could still identify the original data. Take an address as an example. You may have only one customer from a small village. Now if that village's name would be replaced with a masked name but it's still the only entry in your table - then you can still identify the customer. By mapping thousands of different villages to the same masked name it's impossible to identify the original town's name.

    Perhaps counter intuitively the lesser values your Secure Lookup algorithm uses, the more you have secured your data.
    Keep in mind the goal is to protect the sensitive data - the goal is _not_ to create the best similar looking fake data possible.

    You need to find a balance between protection and a number of different values that enables to test the application. A lookup list with some hundreds or some thousands of values will usually serve best.



  • 4.  RE: Lookupfiile Limits

    Posted 09-21-2017 04:03:00 AM
    Sorry for the late reply. Thank you for your reply!