I\'m analyzing a corpus of 8000 Instagram captions and categorizing them as low engagement and high engagement based on whether or not the like count is above or below the mean.