I have a dataset currently with around 15k entries and 100+ variables of categorical variety (gender, race, occupation), continuous variety (income, age, bank balance), and