Based on HuggingFace script to train a transformers model from scratch. I run:
python3 training_mlm.py \\ --dataset_name wikipedia \\ --tokenizer_name roberta-bas