LLM from scratch using distributed computing