KAIST 경영대학

02030101

TMBA

#tm_1th_2 > li:nth-child(3) > ul > li.toy_0 > a

02030101

TMBA

#mprovide > div > div > div.box.box1 > ul > li:nth-child(1) > a

02030201

IMBA

#tm_1th_2 > li:nth-child(3) > ul > li.toy_1 > a

02030201

IMBA

#mprovide > div > div > div.box.box1 > ul > li:nth-child(2) > a

02030301

EMBA

#tm_1th_2 > li:nth-child(3) > ul > li.toy_2 > a

02030301

EMBA

#mprovide > div > div > div.box.box1 > ul > li:nth-child(4) > a

02030401

PMBA

#tm_1th_2 > li:nth-child(3) > ul > li.last.toy_3 > a

02030401

PMBA

#mprovide > div > div > div.box.box1 > ul > li:nth-child(3) > a

02040101

FMBA

#tm_1th_2 > li:nth-child(4) > ul > li.toy_0 > a

02040101

FMBA

#mprovide > div > div > div.box.box3 > ul > li:nth-child(1) > a

02040201

MFE

#tm_1th_2 > li:nth-child(4) > ul > li.toy_1 > a

02040201

MFE

#mprovide > div > div > div.box.box3 > ul > li:nth-child(3) > a

02040401

IMMBA

#tm_1th_2 > li:nth-child(4) > ul > li.toy_2 > a

02040401

IMMBA

#mprovide > div > div > div.box.box3 > ul > li:nth-child(2) > a

02040501

IMMS

#tm_1th_2 > li:nth-child(4) > ul > li.toy_3 > a

02040501

IMMS

#mprovide > div > div > div.box.box3 > ul > li:nth-child(4) > a

02040601

SEMBA

#tm_1th_2 > li:nth-child(4) > ul > li.toy_4 > a

02040601

SEMBA

#mprovide > div > div > div.box.box3 > ul > li:nth-child(6) > a

02040701

#tm_1th_2 > li:nth-child(4) > ul > li.last.toy_5 > a

02040701

#mprovide > div > div > div.box.box3 > ul > li:nth-child(7) > a

02040701

admission

#txt > div.sub0303.mt_20 > div.btn_wrap > a

02040701

#mprovide > div > div > div.box.box3 > ul > li:nth-child(7) > a

인쇄

Selection of the most probable best

OPERATIONS RESEARCH2025-11

Kim, Taeho | Kim, Kyoung-Kuk | Song, Eunhye

We consider an expected-value ranking and selection (R&S) problem where all k solutions' simulation outputs depend on a common parameter whose uncertainty can be modeled by a distribution. We define the most probable best (MPB) to be the solution that has the largest probability of being optimal with respect to the distribution and design an efficient sequential sampling algorithm to learn the MPB when the parameter has a finite support. We derive the large deviations rate of the probability of falsely selecting the MPB and formulate an optimal computing budget allocation problem to find the rate-maximizing static sampling ratios. The problem is then relaxed to obtain a set of optimality conditions that are interpretable and computationally efficient to verify. We devise a series of algorithms that replace the unknown means in the optimality conditions with their estimates and prove the algorithms' sampling ratios achieve the conditions as the simulation budget increases. Furthermore, we show that the empirical performances of the algorithms can be significantly improved by adopting the kernel ridge regression for mean estimation while achieving the same asymptotic convergence results. The algorithms are benchmarked against a state-of-the-art contextual R&S algorithm and demonstrated to have superior empirical performances.

Publisher: Institute for Operations Research and the Management Sciences

Issue Date: 2025-11

Article Type: Article

Citation: OPERATIONS RESEARCH, Vol.73, No.6, pp.3199 - 3218

ISSN: 0030-364X

DOI: 10.1287/opre.2022.0343

LIST

콘텐츠담당자 : 최희정 연락처 : 02-958-3604

관심자등록

KCB ISSUE

경영대학 YouTube

Beyond Knowledge, Building Excellence Together!

1/3