1-1 of 1
Keywords: ε-greedy policy
Close
Follow your search
Access your saved searches in your account

Would you like to receive an alert when new items match your search?
Close Modal
Sort by
Journal Articles
Publisher: ASME
Article Type: Research Papers
J. Comput. Inf. Sci. Eng. December 2024, 24(12): 121006.
Paper No: JCISE-24-1239
Published Online: November 5, 2024
... the true objective function after it obtains new observations. In this work, we improve the exploitation of TS by incorporating the ε -greedy policy, a well-established selection strategy in reinforcement learning. We first delineate two extremes of TS, namely the generic TS and the sample-average TS...