Statistics for Advanced Machine Learning Techniques for Taxonomic Classification and Clustering of DNA Sequences