02-04-2023, 09:08 AM
Extra seconds do not make a difference if they do not end up leading to a new search depth. Whenever the agent runs out of processing time, it selects the move that it thinks is best based on the deepest depth for which it was able to *completely* search the tree.
It's not entirely accurate to say that the move selected will always be the same for the same search depth though. Sometimes there can be multiple different moves that are all equally evaluated as "best moves", and the agent may randomly pick among them (not exactly just randomly, but there can be a random component in this tie-breaking). Aside from that little detail, yes, it will always pick the same move (or pick from the same set of multiple best moves) for the same search depth.
It's not entirely accurate to say that the move selected will always be the same for the same search depth though. Sometimes there can be multiple different moves that are all equally evaluated as "best moves", and the agent may randomly pick among them (not exactly just randomly, but there can be a random component in this tie-breaking). Aside from that little detail, yes, it will always pick the same move (or pick from the same set of multiple best moves) for the same search depth.