Liu, B. (2025). From Policy Optimization Foundations to Language Model Post-Training on Structured Tasks. ProQuest Dissertations and Theses.
Kopierejuvvon čuohpusbeavdái
Kopieren čuohpusbeavdái ii lihkostuvvan
Chicago-čujuhus (17. p.)
Liu, Boyi. "From Policy Optimization Foundations to Language Model Post-Training on Structured Tasks."
ProQuest Dissertations and Theses 2025.
Kopierejuvvon čuohpusbeavdái
Kopieren čuohpusbeavdái ii lihkostuvvan
MLA-čujuhus (9. p.)
Liu, Boyi. "From Policy Optimization Foundations to Language Model Post-Training on Structured Tasks."
ProQuest Dissertations and Theses, 2025.
Kopierejuvvon čuohpusbeavdái
Kopieren čuohpusbeavdái ii lihkostuvvan
Muitte dárkkistit čujuhemiid riektatvuođa, ovdal go geavahat daid iežat deavsttas.