#fuzzyjoin
This is a fork of the string similarity join algorithm implemented in Efficient Parallel Set-Similarity Joins Using MapReduce. Rares Vernica, Michael J. Carey, Chen Li SIGMOD 2010. This version can run in AWS Elastic MapReduce.
#fuzzyjoin
This is a fork of the string similarity join algorithm implemented in Efficient Parallel Set-Similarity Joins Using MapReduce. Rares Vernica, Michael J. Carey, Chen Li SIGMOD 2010. This version can run in AWS Elastic MapReduce.