Simple heuristic for measuring web page similarity (& data set) - View it on GitHub
Star
90
Rank
269922