I am training a CNN using TensorFlow, Python, and Keras. I want to evaluate the accuracy of my models using sklearn.metrics.f1_score and sklearn.metrics.cohen_kappa_score. Howev