DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Description: scientific paper by DeepSeek Research introducing reinforcement learning techniques to improve the reasoning capabilities of large language models
Authors: Daya Guo, Qihao Zhu, Zhuoshu Li, Runxin Xu, Fuli Luo, Liang Wenfeng, Zhenda Xie, Damai Dai, Yixuan Tan, Honghui Ding, Liyue Zhang, Shirong Ma, Xiaodong Liu, Ruoyu Zhang, Yiyuan Liu, Xiaokang Zhang

Publication date
Language
Country of origin
Wikipedia link
Copyright status