Example Job implementing the different match models
Source data
The main source reads like:
ID | Name |
1 | Shong |
2 | Elisa |
3 | Sabrina |
The Lookup source reads as follows:
ID | |
1 | Shong1@talend.com |
1 | Shong2@talend.com |
2 | Elisa@talend.com |
3 | Sabrina@talend.com |
Now we plan to perform an inner join between the main source and the lookup source, and to produce the data structure as below based on the two sources.
ID | Name |
The result varies depending on the match model to be used.
Creating the Job
We use a tFixedFlowInput component to generate the main source.
data:image/s3,"s3://crabby-images/6d8c8/6d8c8826c65afef4ab0b245240472bb9433a86dc" alt=""
And use a second tFixedFlowInput component to generate the lookup source.
data:image/s3,"s3://crabby-images/8a777/8a777dbf83887ab9f81b8e9cc2277e6f4514e8c5" alt=""
Use tMap to perform the inner join, and output the result to a tLogRow component (with Table mode) that prints the result on the console.
data:image/s3,"s3://crabby-images/2060d/2060d41e7683280f3c44b9c231383dcc0c75c448" alt=""
data:image/s3,"s3://crabby-images/1e9f7/1e9f71b338fd343263b5a10fb9dc9a77bea08687" alt=""
Using the match models to generate different results
Unique match: this is the default option for the JOIN operation. It outputs the last matching record of the lookup source.
data:image/s3,"s3://crabby-images/1082d/1082d9211a571526e3ebe02e5a87c1209233b7a9" alt=""
The result of the JOIN by the Unique match model reads as follows:
Starting Job tMap_Match_modes at 17:46 25/09/2013. [statistics] connecting to socket on port 3367 [statistics] connected .--+-------+-------------------. | tLogRow_2 | |=-+-------+------------------=| |ID|Name |Email | |=-+-------+------------------=| |1 |Shong |Shong2@talend.com | |2 |Elisa |Elisa@talend.com | |3 |Sabrina|Sabrina1@talend.com| '--+-------+-------------------' [statistics] disconnected Job tMap_Match_modes ended at 17:46 25/09/2013. [exit code=0]
First match: it outputs the first matching record of the lookup source.
data:image/s3,"s3://crabby-images/01dd7/01dd757d2d6a79759a1b5c6528a60c7207dfc6b2" alt=""
The result of the JOIN by the First match model reads as follows:
Starting Job tMap_Match_modes at 17:51 25/09/2013. [statistics] connecting to socket on port 3942 [statistics] connected .--+-------+-------------------. | tLogRow_2 | |=-+-------+------------------=| |ID|Name |Email | |=-+-------+------------------=| |1 |Shong |Shong1@talend.com | |2 |Elisa |Elisa@talend.com | |3 |Sabrina|Sabrina1@talend.com| '--+-------+-------------------' [statistics] disconnected Job tMap_Match_modes ended at 17:51 25/09/2013. [exit code=0]
All match: it outputs all matching records of the lookup source.
data:image/s3,"s3://crabby-images/965e0/965e00404fb6aba3f366f60f1a1451cfaa0c2e2d" alt=""
The result of the JOIN by the All match model reads as follows:
Starting Job tMap_Match_modes at 17:58 25/09/2013. [statistics] connecting to socket on port 3381 [statistics] connected .--+-------+-------------------. | tLogRow_2 | |=-+-------+------------------=| |ID|Name |Email | |=-+-------+------------------=| |1 |Shong |Shong1@talend.com | |1 |Shong |Shong2@talend.com | |2 |Elisa |Elisa@talend.com | |3 |Sabrina|Sabrina1@talend.com| '--+-------+-------------------' [statistics] disconnected Job tMap_Match_modes ended at 17:58 25/09/2013. [exit code=0]