This doesn't yet cover DeepSeek's RL-trained DeepSeek-R1 model(s) (to try them, check out https://ollama.com/library/deepseek-r1), but it's a pretty good visualisation of how LLMs generally work: