Statistics for SCALING PRE-TRAINING DATA & LANGUAGE MODELS FOR AFRICAN LANGUAGES