Image | ![]() |
---|---|
Description | scientific paper by DeepSeek Research introducing reinforcement learning techniques in the reasoning capabilities of large language models |
Author/s |
author: Daya Guo Qihao Zhu Zhuoshu Li Runxin Xu Fuli Luo Liang Wenfeng Zhenda Xie Damai Dai Yixuan Tan Honghui Ding Liyue Zhang Shirong Ma Xiaodong Liu Ruoyu Zhang Yiyuan Liu Xiaokang Zhang |
Publication date | |
Language | |
Country of origin | |
Wikipedia link | |
Copyright status | |
Missing/wrong data? | Edit Wikidata item |
Paulina is supported by:
About Paulina