Managing Train/Develop Splits with the spaCy Command-Line Trainer
Question: I am training an NER model using the python -m spacy train command-line tool. I use gold.docs_to_json to convert my annotated documents to the JSON-serializable format. The command-line training tool takes both a training set and a development set, but I'm not sure how much help the command-line tools give me in managing train/dev splits. Is there a command-line tool to create train/dev splits from a single set of data? Will the spaCy training command do cross-validation for me instead of