02030101 TMBA TMBA #tm_1th_2 > li:nth-child(3) > ul > li.toy_0 > a 02030101 TMBA TMBA #mprovide > div > div > div.box.box1 > ul > li:nth-child(1) > a 02030201 IMBA IMBA #tm_1th_2 > li:nth-child(3) > ul > li.toy_1 > a 02030201 IMBA IMBA #mprovide > div > div > div.box.box1 > ul > li:nth-child(2) > a 02030301 EMBA EMBA #tm_1th_2 > li:nth-child(3) > ul > li.toy_2 > a 02030301 EMBA EMBA #mprovide > div > div > div.box.box1 > ul > li:nth-child(4) > a 02030401 PMBA PMBA #tm_1th_2 > li:nth-child(3) > ul > li.last.toy_3 > a 02030401 PMBA PMBA #mprovide > div > div > div.box.box1 > ul > li:nth-child(3) > a 02040101 FMBA FMBA #tm_1th_2 > li:nth-child(4) > ul > li.toy_0 > a 02040101 FMBA FMBA #mprovide > div > div > div.box.box3 > ul > li:nth-child(1) > a 02040201 MFE MFE #tm_1th_2 > li:nth-child(4) > ul > li.toy_1 > a 02040201 MFE MFE #mprovide > div > div > div.box.box3 > ul > li:nth-child(3) > a 02040401 IMMBA IMMBA #tm_1th_2 > li:nth-child(4) > ul > li.toy_2 > a 02040401 IMMBA IMMBA #mprovide > div > div > div.box.box3 > ul > li:nth-child(2) > a 02040501 IMMS IMMS #tm_1th_2 > li:nth-child(4) > ul > li.toy_3 > a 02040501 IMMS IMMS #mprovide > div > div > div.box.box3 > ul > li:nth-child(4) > a 02040601 SEMBA SEMBA #tm_1th_2 > li:nth-child(4) > ul > li.toy_4 > a 02040601 SEMBA SEMBA #mprovide > div > div > div.box.box3 > ul > li:nth-child(6) > a 02040701 GP GP #tm_1th_2 > li:nth-child(4) > ul > li.last.toy_5 > a 02040701 GP GP #mprovide > div > div > div.box.box3 > ul > li:nth-child(7) > a 02040701 admission admission #txt > div.sub0303.mt_20 > div.btn_wrap > a 02040701 GP GP #mprovide > div > div > div.box.box3 > ul > li:nth-child(7) > a
본문 바로가기 사이트 메뉴 바로가기 주메뉴 바로가기

Selection of the most probable best

OPERATIONS RESEARCH2025-11

Kim, Taeho | Kim, Kyoung-Kuk | Song, Eunhye

We consider an expected-value ranking and selection (R&S) problem where all k solutions' simulation outputs depend on a common parameter whose uncertainty can be modeled by a distribution. We define the most probable best (MPB) to be the solution that has the largest probability of being optimal with respect to the distribution and design an efficient sequential sampling algorithm to learn the MPB when the parameter has a finite support. We derive the large deviations rate of the probability of falsely selecting the MPB and formulate an optimal computing budget allocation problem to find the rate-maximizing static sampling ratios. The problem is then relaxed to obtain a set of optimality conditions that are interpretable and computationally efficient to verify. We devise a series of algorithms that replace the unknown means in the optimality conditions with their estimates and prove the algorithms' sampling ratios achieve the conditions as the simulation budget increases. Furthermore, we show that the empirical performances of the algorithms can be significantly improved by adopting the kernel ridge regression for mean estimation while achieving the same asymptotic convergence results. The algorithms are benchmarked against a state-of-the-art contextual R&S algorithm and demonstrated to have superior empirical performances.

Publisher
Institute for Operations Research and the Management Sciences
Issue Date
2025-11
Article Type
Article
Citation
OPERATIONS RESEARCH, Vol.73, No.6, pp.3199 - 3218
ISSN
0030-364X
DOI
10.1287/opre.2022.0343
만족도조사

이 페이지에서 제공하는 정보에 대하여 만족하십니까?

콘텐츠담당자 : 최희정 연락처 : 02-958-3604

관심자등록

KCB ISSUE