Match Up Overview 1 Introduction Match Up is
- Slides: 27
Match. Up Overview 1
Introduction Match. Up is used to compare database records and determine if duplicate records exist. Match. Up is offered as… A stand-alone Windows software solution A programmers API An Enterprise level ETL solution: SSIS , Pentaho components A Web Service An Excel Plug-In 2
Introduction: Match. Up got started partly because of this problem – too much mail! 3
NAME Mary Smith MD John Smith Jr John L Smith DDS Mr John Smith Jr Billy Smith John Smith Jr Maria Smith Dr. Mary Smith John Smith Jr ADDRESS 12 Main Street 12 Main St Twelve Main St 12 Main St CSZ Anytown, CA 99999 Anytown, CA 99999 12 Main St Anytown, CA 99999 4 Campus Way, Rm 6 Collegeville, CA 99999 12 Main St Anytown, CA 99999 PHONE 999 -2000 555 -444 -1234 EMAIL john@smithhouse. com john@work. com 555 -444 -1234 555 -200 -6666 737 -115 -8844 999 -2000 555 -444 -1234 billy 43@gaming. com billy 44@gaming. com billy 45@gaming. com maria@college. edu m. smith@lab. edu Today, the problem is more like this. How did all these get in our system, and how do we resolve them? 4
Introduction Match. Up can… Process files with different structures, field names, types or lengths and output the results to your choice of file types or database formats. FLEXIBLE. Process multiple matchcodes (matching criteria) in a single pass, each of which can be made of any part(s) of any field(s) you designate. UNLIMITED MATCHING Prioritize matched records for output, and roll up data into a single record. ADVANCED. 5
Output properties necessary to distinguish duplicate records…. FULLNAME Jeff Clareck Jeff Clarreck Mr. Charles N. Lee Sr. Mr Charles Lee Jr Phd. Charles Lee Chuck Lee Jo Tennison Sr Jo Tennison Jr Darlene Terry Dusty Terry Sam Smith Samantha Smith MD Samuel Smith ADDRESS 18 JORDAN MILL CT 5773 N Oyster Rd # 5 5773 N Oyster Rd Apt 5 5773 North Oyster Rd 5700 N Oyster Rd 1033 Bush Ave PO BOX 7 12 Main St ZIP PHONE 16100 54229 781545 -7300 54229 800 -212 -7300 54229 216. 949. 4000 54229 781545 -7300 07452 622 -545 -7000 07452 781545 -7000 92688 981545 -7343 92688 981 -545 -7666 92688 981 -545 -7344 CODE B AU IRA S CHK IRA S STOCK CHK PROP CHK S TOTAL 100 50 500 46 1000 500 244 77 880 100 40 66 100 Results Group Count MS 01 1 1 MS 01 2 1 MS 02, MS 06 3 3 MS 03, MS 06 3 3 MS 01 4 1 MS 02, MS 06 5 2 MS 03, MS 06 5 2 MS 01 6 1 MS 01 7 1 MS 01 8 1 MS 01 9 1 MS 01 10 1 6
Identify and output golden records and gather survivorship data…. 7
Or, retain all records and survivorship data…. 8
Distribution Options Choose the right solution for your needs… Matchcode Editor Programming required Match. Up Windows Desktop Match. Up Object x X x Multi Platform x real. Time - Deduping Flexible output (File type) Record Limit X Global processing? Output record Consolidation Output record priority source record update (distribute) Rapid Development updates support LWXL Match. Up Web Service no x Various file types unlimited Unlimited Various file types unlimited 50000 unlimited US, CAN, UK Global presently no survivorship golden record gathering priority scattering x x Rapid Integration direct file handling unlimited files analyzing file control toolset CASS Automatic reports generated Match. Up ETL x x X built in tool x x X x x Source, lookup Excel, csv, etc. x 18 reports x X x x X automatic x x 9
Concepts How does Match. Up determine if two records match? Lets take a look at a few examples of records that appear to be the same contact… US / Canada Query Match Smith Jr. , John Accounting Services, Inc. 12 Main St Anytown, CA 92688 Mr. J Smithe TAXman Consulting Suite 5 12 North Main Street Anytown, CA 92688 10
Concepts Germany Query Mr. J. Smithe Deutsche Bank Ltd. Berger Straße 130 60385 Frankfurt Am Main Germany Match Herr Jürgen Smithe Deutsche Bank Gmb. H Suite 5 Berger Str. 130 60385 Frankfurt Am Main DEU 11
Concepts United Kingdom Query Mr. Steven Gerrard Liverpool Football Club, LTD Anfield Road Liverpool L 4 0 TH Merseyside UK Match Steven George Gerrard, MBE Liverpool Football Club Anfield Rd Liverpool L 4 0 TH United Kingdom 12
Concepts Australia Query Match Louise Herron AM CEO Sydney Opera House Bennelong Pt Sydney NSW 2000 Australia Sydney Opera House Bennelong Point Sydney New South Wales 2000 AUS 13
Concepts Match. Up uses a records' matchkey to compare it against the keys of other records. What's a matchkey? A string of data, extracted from each record, used to compare records. The matchcode you select, determines which data is extracted from each record to build the matchkey. What's a matchcode? A matchcode is a set of rules which allow you to determine if two records should be considered duplicates. It contains the data types to be used, their size, order, their individual properties, and how they work in different combinations. Match. Up uses a predefined matchcode, or one you have created using the Matchcode Editor, to create a matchkey for each record. For a full discussion of matchcodes and matchkeys see… Understanding Matchodes Which is also available as: 2_Matchcode. Editor_Overview. pptx 14
Concepts Match. Up is distributed with the ‘Matchcode Editor’, a graphic tool which allows you to determine and create rules when comparing records. 15
You select a distributed matching strategy (matchcode), or create a custom matchcode to suit your needs. 16
View the configured properties of distributed matchcodes, or add new ones to your custom matchcode here and configure. 17
With the new Global Match. Up, you have the ability to create a global matchcode 18
Global Matchcodes have an additional set of parsed address matchcode datatypes. But we always add Country to a global matchcode. 19
Since many of the traditional Fuzzy algorithms do not comply with extended characters, we have a new algorithm for Global data 20
Using the matchcode For matching solutions with a GUI interface, you will simply select the matchcode to use. For developer solutions, you will specify and configure your matchcode in your calling application code See additional Match. Up documentation for specific order of operations. 21
Global Processing Why the difference between Global and Domestic matchcode logic? Global Deduping relies on our Global Address Object’s address splitter to parse the numerous possible global address patterns Postal codes are part of the full address line input for Global, not a separate mapped datatype. 22
Global Processing Encoding – functionality has been added so the user can specify the encoding of the database Country specific matching – Match. Up relies on the Global. Address Object for Address parsing (but does not verify the address. This requires Country to be one of the matchcode components. Knowledgebase – we are carefully adding datatype keywords to the underlying datafiles. Ex: gm. BH 23
Additional Resources • • Melissa Data Wiki Matchcode. Editor_Overview. pptx Inefficient. Matchcode_Analysis. pptx Editing. Matchcodes. pptx (coming) Match. Up_GUI. pptx Match. Up_Object. pptx Match. Up_SSIS. pptx 24
Resources Take me to this Online example 25
26
Thank You 27
- What is bioinformatics an introduction and overview
- Papercut job ticketing
- Introduction product overview
- Introduction product overview
- Introduction product overview
- Match the definition with the vocabulary word gamboled
- Match the following terms with their definitions:
- Conclusion paragraph format
- Www stands for world wide web
- Maximo work order priority
- Uml overview
- In uml is a connection among things
- Retail vertical
- Figure 12-1 provides an overview of the lymphatic vessels
- Pulmonary circulation diagram
- Texas public school finance overview
- Walmart operations
- Stylistic overview of philippine art
- How can we integrate oop with sd/sa
- Spring framework overview
- Nagios tactical overview
- Market overview managed file transfer solutions
- Nfv vs sdn
- Sbic program
- Sap mm organization structure
- Ariba registration process
- Safe overview
- Rfid technology overview