AlphaZero implementation with TorchRL #1844

Unanswered
dtsaras asked this question in Q&A
Jan 29, 2024 · 1 comments · 3 replies

Hello, I am working on an implementation of AlphaZero with TorchRL and was hoping to get some hints on how to make better use of the tools it provides. Currently, I have MCTS coded from scratch, and I insert the experience into a replay buffer afterwards. I wonder how I could use the data collector and rollout methods to do this more efficiently. Any suggestions or comments are appreciated.
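For context, the workflow described here (run MCTS, then push the finished game into a replay buffer) might look roughly like the sketch below. It is a library-free illustration, not TorchRL code: the `Step` record, the `label_episode` helper, and the `deque` standing in for a replay buffer are all hypothetical names, and the sign convention for the outcome (scored from the perspective of the player to move) is an assumption.

```python
from collections import deque
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Step:
    state: Tuple[int, ...]  # encoded board position (toy encoding)
    policy: List[float]     # MCTS visit-count distribution at this state
    player: int             # +1 or -1: the player to move in this state


def label_episode(steps: List[Step], winner: int):
    """Attach the final outcome to every step of a finished game.

    `winner` is +1, -1, or 0 for a draw. The outcome is flipped so that it
    is always expressed from the perspective of the player to move.
    """
    return [(s.state, s.policy, float(winner * s.player)) for s in steps]


# Stand-in for a replay buffer: a bounded FIFO of (state, policy, outcome).
buffer = deque(maxlen=10_000)

# A toy three-move game won by player +1.
episode = [
    Step(state=(0, 0, 0), policy=[0.5, 0.5, 0.0], player=+1),
    Step(state=(1, 0, 0), policy=[0.0, 0.25, 0.75], player=-1),
    Step(state=(1, -1, 0), policy=[0.0, 0.0, 1.0], player=+1),
]
buffer.extend(label_episode(episode, winner=+1))
```

The point of the sketch is that the outcome is only known once the game ends, so the whole episode is labeled and inserted in one go instead of step by step.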


Amazing!
We'd be excited to help you implement that!

Some data points: we have an open PR to get a working version of MCTS with tensordict (#629). As you can see, it has been open for a while and hasn't progressed much, but we want to push for it in the next 6 months. Since tensordict is a good primitive for representing tree structures, it makes a lot of sense to use it to represent decision trees.

Regarding your question about the data collector and rollout, I think I'd need a bit more context to be helpful.

If you want to integrate your code as an example, this is also something we can talk about.

Happy to discuss this offline if you'd like. For that, feel free to reach out on our Slack or Discord channel :)

@dtsaras

Thanks for your quick reply. I took a quick look over the MCTS code and it looks pretty nice. I am not sure if I missed it, but I did not see an action mask to prevent expanding illegal actions. Also, I can't find the Discord server. Do you mind sharing the link?
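To make the action-mask point concrete, here is a minimal sketch of what such a mask could do at expansion time. It is not the code from PR #629: `masked_priors` and `expandable_actions` are hypothetical helpers, and the uniform fallback when the network puts no mass on any legal move is an assumption.

```python
from typing import List, Sequence


def masked_priors(priors: Sequence[float], legal: Sequence[bool]) -> List[float]:
    """Zero out illegal actions and renormalize the remaining prior mass."""
    if len(priors) != len(legal):
        raise ValueError("priors and legal mask must have the same length")
    kept = [p if ok else 0.0 for p, ok in zip(priors, legal)]
    total = sum(kept)
    if total <= 0.0:
        # Assumed fallback: spread the prior uniformly over the legal actions.
        n_legal = sum(legal)
        if n_legal == 0:
            raise ValueError("no legal actions: the state is terminal")
        return [1.0 / n_legal if ok else 0.0 for ok in legal]
    return [p / total for p in kept]


def expandable_actions(legal: Sequence[bool]) -> List[int]:
    """Indices of the actions the tree is allowed to create children for."""
    return [a for a, ok in enumerate(legal) if ok]


# Toy example: action 1 is illegal, so it gets no child and no prior mass.
priors = [0.5, 0.25, 0.25]
legal = [True, False, True]
children = expandable_actions(legal)
child_priors = masked_priors(priors, legal)
```

With a mask like this, illegal actions never become child nodes, and the prior used in the selection rule stays a proper distribution over the legal moves.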

@vmoens

vmoens Jan 29, 2024
Collaborator

Here it is: https://discord.gg/cZs26Qq3Dd

@vmoens

vmoens Jan 29, 2024
Collaborator

@mjlaali maybe you'd like to join this conversation :)
