Statistics for Visualizing and Understanding Code Duplication in Large Software Systems