Hit between 3.5 and 4.25 TPS (tokens per second) on the full 671b model at Q4:
ollama pull deepseek-r1:671b
But the Hugging Face repo has 163 files of ~4.3GB each, so around 700GB in total: https://huggingface.co/deepseek-ai/DeepSeek-R1/tree/main
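
For a rough sanity check on those numbers, here is a small sketch of the arithmetic. The file count and per-file size come from the repo listing above; the bits-per-weight values are my assumptions (a flat 4 bits for Q4 and 8 bits for the original weights), and real quantized formats carry some extra scale metadata, so treat the results as ballpark lower bounds rather than exact download sizes.

```python
# Back-of-the-envelope size estimates for a 671B-parameter model.
# FILE_COUNT and GB_PER_FILE are taken from the repo listing quoted above;
# the bits-per-weight figures are assumptions for illustration only.

FILE_COUNT = 163      # weight shards in the repo
GB_PER_FILE = 4.3     # approximate size of each shard, in GB
PARAMS = 671e9        # parameter count implied by the "671b" tag


def repo_size_gb(file_count: int, gb_per_file: float) -> float:
    """Total repo size as shard count times shard size."""
    return file_count * gb_per_file


def weights_size_gb(params: float, bits_per_param: float) -> float:
    """Raw weight size in GB at a given precision, ignoring metadata."""
    return params * bits_per_param / 8 / 1e9


repo_gb = repo_size_gb(FILE_COUNT, GB_PER_FILE)
q4_gb = weights_size_gb(PARAMS, 4.0)   # assumed: exactly 4 bits per weight
q8_gb = weights_size_gb(PARAMS, 8.0)   # assumed: 8 bits per weight

print(f"repo shards:      ~{repo_gb:.0f} GB")
print(f"8-bit weights:    ~{q8_gb:.0f} GB")
print(f"4-bit weights:    ~{q4_gb:.0f} GB")
```

Under these assumptions the ~700GB repo is in line with roughly one byte per parameter, and a 4-bit quant would be about half that, which would explain why the Q4 pull is much smaller than the repo.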