LLM is not guaranteed to be accurate for all queries
Understanding which queries they are reliable for is important
Selective Prediction : the deployment scenario for AI where humans are involved to maintain overall accuracy by reviewing AI-generated, low-confidence outputs
Selective Prediction is hard as LLM is trained to predict not the "correct" next token but only the "next" token
It doesn't generate a confidence score also obtaining confidence score from output sequence is not straightforward
Distinguishing the correctness from likelihood scores is a challenging
ASPIRE : learns self-evaluate from target-task data
set of reference outputs
evaluation metric
threshold
training dataset randomly sampled from a target task distribution
rejection operation
selective predictor
use AUACC (area under the accuracy-coverage curve) to measure selective prediction performance
use AUROC (area under the receiver operator characteristic curve) to measure the quality of the selection score estimation
Start with LoRA
Fine-tune LLM to learn self-evaluation
use to generate different answers for each example
supposing the decoding algorithm used to generate output sequences for is
where
choose output sequences such that is maximal
use metric to determine is correct
i.e. if , it is correct
use threshold different from for evaluation (choose sufficiently large so that the wrong outputs wouldn't be labeled as correct outputs)
after sampling high-likelihood outputs, tune only for learning self-evaluation ( and are frozen)
the training objective is
where is a set of 'correct' outputs containing the reference and correct outputs with highest likelihood from , same for (If doesn't have wrong output, add a default wrong output(e.g. empty string) to )
After training , obtain the prediction solving
Also, the self-eval score is defined as
Used Beam search decoding
Overall, the selection scoring function is
where is a hyperparameter
단순히 프롬프트로 신뢰도를 찍어내는 것이 아니라, 나름의 계산과 Learning 기반으로 신뢰도를 얻어낼 수 있는게 좋았음. 다만 테스트한 모델이 좀 오래되어서, 최근의 sLLM으로도 가능한지 의문
Hello, maybe you can try to reproduce this paper. I am very interested in this paper, but unfortunately there are some details that I don’t quite understand. By the way, your article is very well written and concise.