I want to train a language model from ground up and use the most VRAM i have available, how can i calculate how much parameters can it have?