Supporting File Structures|Rule Files - Active And Inactive
During data analysis, rule files help determine the data class of each column in the source data.
Rule files specify the tests that the data must pass to achieve the data class associated with the rule file.
Rule files can be active or inactive.
Active rule files are included in the 'Generate Match Criteria' process; inactive rule files are not.
The 'Generate Match Criteria' process compares column values to the rules held in each active rule file and
attempts to determine the most likely data class for the column currently being analysed. Once a data class has
been assigned, the 'Rules To Match Functions' file is able to assign an applicable match function to the column.
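The comparison step described above can be pictured with a minimal sketch. It is an illustration only, not the product's implementation: the RuleFile class, the most_likely_rule_file function, the pass-rate scoring, the sample surname list and the 'Hypothetical_Numbers.txt' file are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class RuleFile:
    """Hypothetical stand-in for a rule file and the tests it holds."""
    name: str
    active: bool
    test: Callable[[str], bool]  # assumed: one pass/fail test per column value


def most_likely_rule_file(values: List[str], rule_files: List[RuleFile]) -> Optional[str]:
    """Return the active rule file that the most values pass, or None if none pass."""
    best_name, best_score = None, 0.0
    for rule_file in rule_files:
        if not rule_file.active:
            continue  # inactive rule files take no part in the comparison
        passed = sum(1 for value in values if rule_file.test(value))
        score = passed / len(values) if values else 0.0
        if score > best_score:
            best_name, best_score = rule_file.name, score
    return best_name


# Illustrative data only; a real rule file would hold its own tests.
SAMPLE_SURNAMES = {"SMITH", "JONES", "PATEL"}
rule_files = [
    RuleFile("RuleFileDefines_PersonSurnames.txt", True,
             lambda v: v.upper() in SAMPLE_SURNAMES),
    RuleFile("Hypothetical_Numbers.txt", False, lambda v: v.isdigit()),
]
column = ["Smith", "Jones", "Patel", "12345"]
chosen = most_likely_rule_file(column, rule_files)
```

In this sketch the surname rule file is chosen for the sample column because three of the four values pass its test. A column of digits would be left without a data class, because the only rule file that could recognise it is inactive.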
For example, a source data file might contain a column holding a person's surname.
A matching function is set up specifically to check person surname columns for possible duplications or matches.
However, for the Generate Match Criteria process to choose this function by default, all of the following must be true:
- The 'RuleFileDefines_PersonSurnames.txt' rule file must be active and appear on the screen shown below.
- The entry 'RuleFileDefines_PersonSurnames.txt ALIAS_SURNAME_MATCH()' must be included in the Rules To Match Functions screen.
- In any Rule Files prioritisation group that contains the entry 'RuleFileDefines_PersonSurnames.txt', that entry must be prioritised appropriately.
If any of these conditions is not met, the best matching function, in this case 'ALIAS_SURNAME_MATCH()', might not be chosen.
However, this need not matter. Once Generate Match Criteria has run, you can always manually override the default choice of matching function before initiating the matching process itself.
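The default choice and the manual override can be pictured with a minimal sketch. It is an illustration only: the function names, the dictionary layout and the 'HYPOTHETICAL_GENERIC_MATCH()' fallback are assumptions made for this example, and prioritisation groups are not modelled.

```python
from typing import Optional

# Assumed in-memory view of the active rule files and of the
# Rules To Match Functions entries; the real storage format may differ.
ACTIVE_RULE_FILES = {"RuleFileDefines_PersonSurnames.txt"}
RULES_TO_MATCH_FUNCTIONS = {
    "RuleFileDefines_PersonSurnames.txt": "ALIAS_SURNAME_MATCH()",
}
# Hypothetical placeholder for whatever is chosen when no specific entry applies.
FALLBACK_FUNCTION = "HYPOTHETICAL_GENERIC_MATCH()"


def default_match_function(rule_file: str) -> str:
    """Pick the default function: the rule file must be active and have an entry."""
    if rule_file in ACTIVE_RULE_FILES and rule_file in RULES_TO_MATCH_FUNCTIONS:
        return RULES_TO_MATCH_FUNCTIONS[rule_file]
    return FALLBACK_FUNCTION


def final_match_function(rule_file: str, override: Optional[str] = None) -> str:
    """A manual override, when given, replaces the default choice."""
    return override if override else default_match_function(rule_file)


default_choice = default_match_function("RuleFileDefines_PersonSurnames.txt")
unmapped_choice = default_match_function("Hypothetical_Unmapped.txt")
overridden_choice = final_match_function("Hypothetical_Unmapped.txt",
                                         override="ALIAS_SURNAME_MATCH()")
```

The last call shows why a missing entry need not matter: the override supplies 'ALIAS_SURNAME_MATCH()' even though the default lookup could not.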