(SSIS, TSQL and MDS) - Record Linkage(Fuzzy Match)
We will implement several code samples based on a series of articles amd posts identifying similar records between two different sources or grouping of records from a single source, based on existing column string of values. We will define an approach, review actual implementations with various SQL tools(TSQL, VB,SSIS and MDS). Although we are discussing matching, we need to address several steps prior to getting to the actual use of matching algorithms.The steps are as follows: 1. Cleansing and standardization 2. Group records 3. Split records 4. Compare records and determine scores 5. Split into separate match categories 6. Analyze results of matches 7. Evaluate using match tools to determine if best algorithms have been combined.