Text Joins for Data Cleansing and Integration
SQL Scripts
The SQL scripts described in [2,3] cover the following steps:
- Import data
- Create tokens
- Create auxiliary relations
- Sampling
- Run the join
- Measure precision and recall
(The SQL scripts were tested on Microsoft SQL Server 2000 Developer Edition, Service Pack 2.)
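
The scripts themselves are not reproduced on this page, so the following is only a rough sketch of how the listed steps might fit together, not the method of [2,3]. It makes several assumptions: tokens are taken to be character 3-grams, similarity is plain Jaccard overlap over token sets, Python's built-in sqlite3 module stands in for SQL Server, and all table names (r1, r2, r1_tokens, r1_size, and so on), the sample rows, and the threshold are hypothetical. The sampling step is omitted.

```python
import sqlite3

def qgrams(text, q=3):
    """Return the set of overlapping q-grams of a padded, lower-cased string."""
    padded = "#" * (q - 1) + text.lower() + "$" * (q - 1)
    return {padded[i:i + q] for i in range(len(padded) - q + 1)}

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE r1 (tid INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE r2 (tid INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE r1_tokens (tid INTEGER, token TEXT);
    CREATE TABLE r2_tokens (tid INTEGER, token TEXT);
""")

# Import data (hypothetical sample rows).
r1 = [(1, "Microsoft Corp"), (2, "Oracle Corp"),
      (3, "International Business Machines")]
r2 = [(1, "Microsoft Corp."), (2, "Orakle Corp"), (3, "IBM")]
conn.executemany("INSERT INTO r1 VALUES (?, ?)", r1)
conn.executemany("INSERT INTO r2 VALUES (?, ?)", r2)

# Create tokens: one row per (tuple id, distinct q-gram).
for table, rows in (("r1", r1), ("r2", r2)):
    conn.executemany(
        f"INSERT INTO {table}_tokens VALUES (?, ?)",
        [(tid, tok) for tid, name in rows for tok in qgrams(name)],
    )

# Create auxiliary relations: number of distinct tokens per tuple.
conn.executescript("""
    CREATE TABLE r1_size AS SELECT tid, COUNT(*) AS n FROM r1_tokens GROUP BY tid;
    CREATE TABLE r2_size AS SELECT tid, COUNT(*) AS n FROM r2_tokens GROUP BY tid;
""")

# Run the join: pairs whose Jaccard similarity reaches the threshold.
threshold = 0.5  # assumed value, chosen for this toy data only
rows = conn.execute("""
    SELECT t1.tid, t2.tid,
           COUNT(*) * 1.0 / (s1.n + s2.n - COUNT(*)) AS sim
    FROM r1_tokens AS t1
    JOIN r2_tokens AS t2 ON t1.token = t2.token
    JOIN r1_size AS s1 ON s1.tid = t1.tid
    JOIN r2_size AS s2 ON s2.tid = t2.tid
    GROUP BY t1.tid, t2.tid, s1.n, s2.n
    HAVING COUNT(*) * 1.0 / (s1.n + s2.n - COUNT(*)) >= ?
""", (threshold,)).fetchall()
matches = {(a, b) for a, b, _ in rows}

# Measure precision and recall against hand-labelled true matches.
truth = {(1, 1), (2, 2), (3, 3)}
correct = matches & truth
precision = len(correct) / len(matches) if matches else 0.0
recall = len(correct) / len(truth)
print(sorted(matches), precision, recall)
```

In this toy data the abbreviation pair shares almost no 3-grams, so a purely token-based join cannot recover it; that is the kind of miss the recall measurement is meant to expose.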