Hi,
I have an Unduplicate Match job in place to test a file of more than 20 million records.
It does not output any records to the duplicate, residual, clerical or master links, even after running for a long time. However, when I ran a small subset of this file (20,000 records), it finished successfully in a few minutes and delivered the expected records.
Can anybody tell me what the issue could be? Please note that my first match pass uses strict blocking columns, which are a combination of a couple of fields.
Matching Job Does not Output any records
-
- Premium Member
- Posts: 43
- Joined: Wed Feb 08, 2012 8:12 pm
- Location: United States
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
If you look at the match statistics, is there much mention of overflow blocks? You may need to look at a blocking strategy that more finely discriminates potential duplicates, so that the block size is not exceeded.
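One way to check a blocking strategy before a long run is to count how many records fall into each block. The following is only a rough sketch in Python, outside of QualityStage: the column names (`zip`, `surname`), the sample records and the `max_block_size` value are all made up for illustration, and the real overflow limit is whatever is configured in your match pass.

```python
from collections import Counter


def oversized_blocks(records, blocking_columns, max_block_size):
    """Return {blocking key: record count} for every block larger than max_block_size.

    records          -- iterable of dicts (hypothetical stand-in for the input rows)
    blocking_columns -- column names that together form the blocking key
    max_block_size   -- assumed limit; use the overflow setting of your own pass
    """
    counts = Counter(
        tuple(record.get(col) for col in blocking_columns) for record in records
    )
    return {key: n for key, n in counts.items() if n > max_block_size}


# Hypothetical sample data, not taken from the original job.
records = [
    {"zip": "10001", "surname": "SMITH"},
    {"zip": "10001", "surname": "SMITH"},
    {"zip": "10001", "surname": "JONES"},
    {"zip": "94105", "surname": "LEE"},
]

# Blocking on zip alone puts three records in one block, which exceeds the limit.
coarse = oversized_blocks(records, ["zip"], max_block_size=2)

# Adding surname to the blocking key splits that block into smaller ones.
fine = oversized_blocks(records, ["zip", "surname"], max_block_size=2)

print(coarse)
print(fine)
```

If the coarse key reports many oversized blocks on a sample of the real file, adding another column to the blocking key is the kind of finer discrimination meant above.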
If the records don't appear on any of those links, where DO they appear? These are the only possibilities. Are you perhaps not waiting for the job to finish?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Premium Member
- Posts: 43
- Joined: Wed Feb 08, 2012 8:12 pm
- Location: United States
Thanks Ray,
:D
You were right about letting the job finish first; a little patience took care of that issue.
Now I have one more question about unduplicate matching.
Which option should I select for the following scenario?
I want two records to be considered match candidates only when both have field one populated. If either record is missing field one, the two should not be compared against each other.
Three options you have:
1. Make the two fields blocking columns in the passes in question. Blocking columns must be populated and identical for records to be considered a possible match.
2. Use the CRITICAL variable type on each column in question. This means that a column must be populated and identical in order for the records to be considered a match.
3. Using a Transformer, split the data so that records with no data in either one of those columns never go into the match. Of course, this means they will NEVER match in ANY pass :D
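To make the difference between these options concrete, here is a minimal Python sketch of the logic only. It is not QualityStage code: the function names, the `id` and `field_one` columns and the sample records are hypothetical. `candidate_pairs` imitates the blocking behaviour described in option 1 (records with a missing blocking value are never paired), and `split_by_populated` imitates the Transformer split in option 3.

```python
from itertools import combinations


def is_populated(value):
    """Treat None and blank strings as missing (an assumption for this sketch)."""
    return value is not None and str(value).strip() != ""


def split_by_populated(records, required_columns):
    """Option 3 style: route records with any missing required column away from the match."""
    eligible, excluded = [], []
    for record in records:
        if all(is_populated(record.get(col)) for col in required_columns):
            eligible.append(record)
        else:
            excluded.append(record)
    return eligible, excluded


def candidate_pairs(records, blocking_columns):
    """Option 1 style: pair only records whose blocking values are populated and identical."""
    blocks = {}
    for record in records:
        key = tuple(record.get(col) for col in blocking_columns)
        if all(is_populated(value) for value in key):
            blocks.setdefault(key, []).append(record)
    pairs = []
    for members in blocks.values():
        for left, right in combinations(members, 2):
            pairs.append((left["id"], right["id"]))
    return pairs


# Hypothetical sample data.
records = [
    {"id": 1, "field_one": "A123"},
    {"id": 2, "field_one": "A123"},
    {"id": 3, "field_one": ""},
    {"id": 4, "field_one": None},
    {"id": 5, "field_one": "B456"},
]

eligible, excluded = split_by_populated(records, ["field_one"])
pairs = candidate_pairs(records, ["field_one"])

print([r["id"] for r in eligible], [r["id"] for r in excluded])
print(pairs)
```

In this sketch the records with a missing field one are only left out of the pass that blocks on it, so another pass could still pick them up, whereas the split removes them from every pass.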
Regards,
Robert