
Commit 1030534

committed
paper
1 parent 631e387 commit 1030534

1 file changed

Lines changed: 2 additions & 2 deletions

index.html

@@ -165,7 +165,7 @@ <h1 class="title is-1 publication-title is-bold">
 <br>
 <img src="static/images/teaser.png" alt="algebraic reasoning" class="center" style="width:95%;height:auto;">
 <br>
-<figcaption style="text-align:left;">Figure1: SimKO improves pass@K performance on math tasks (AIME24/25, AMC, MATH500, Minerva, Olympiadbench) and logic tasks (Synlogic, BBH) compared to GRPO, as shown in the plots (left and middle). The Figure1 on the right shows the k-th highest candidate probabilities averaged over the dataset. The SimKO-trained model exhibits a less concentrated probability distribution compared to GRPO.</figcaption>
+<figcaption style="text-align:left;">Figure1: SimKO improves pass@K performance on math tasks (AIME24/25, AMC, MATH500, Minerva, Olympiadbench) and logic tasks (Synlogic, BBH) compared to GRPO, as shown in the plots (left and middle). The figure on the right shows the k-th highest candidate probabilities averaged over the dataset. The SimKO-trained model exhibits a less concentrated probability distribution compared to GRPO.</figcaption>
 <br>
 <h2 class="title is-3">Abstract</h2>
 <div class="content has-text-justified">
@@ -359,7 +359,7 @@ <h1 class="title is-1 mmmu">Results</h1>
 <div class="container is-max-desktop content">
 <h2 class="title is-3 has-text-centered">BibTeX</h2>
 <pre><code>
-@article{yu2025simko,
+@article{peng2025simko,
 title={SimKO: Simple Pass@K Policy Optimization},
 author={Peng, Ruotian and Ren, Yi and Yu, Zhouliang and Liu, Weiyang and Wen, Yandong},
 journal={arXiv preprint arXiv:2510.14807},
