Statistics for Scalable Embeddings for Kernel Clustering on MapReduce