Thanks for open-sourcing DM0!
I am curious where the hybrid training strategy described in the paper is implemented in the code: "DM0 employs a hybrid training strategy: for embodied data, gradients from the action expert are not backpropagated to the VLM to preserve generalized representations". Do you freeze the VLM to achieve this gradient flow?
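To make the question concrete, here is a minimal PyTorch sketch of what I imagine the mechanism could look like. It is not taken from the DM0 codebase: the module names (`vlm`, `lm_head`, `action_expert`) and the shapes are hypothetical stand-ins, and using `detach()` is my assumption rather than the confirmed implementation. It shows that a stop-gradient on the features blocks the action loss from reaching the VLM without freezing the VLM's parameters.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-ins for illustration only, not the DM0 modules.
vlm = nn.Linear(8, 16)            # plays the role of the VLM backbone
lm_head = nn.Linear(16, 32)       # language-modeling head on top of the VLM
action_expert = nn.Linear(16, 4)  # action head that consumes VLM features

x = torch.randn(2, 8)
tokens = torch.randint(0, 32, (2,))
actions = torch.randn(2, 4)

features = vlm(x)

# Language-modeling loss: gradients flow into the VLM as usual.
lm_loss = nn.functional.cross_entropy(lm_head(features), tokens)

# Action loss: detach() cuts the graph, so this loss cannot reach the VLM.
action_loss = nn.functional.mse_loss(action_expert(features.detach()), actions)

# Gradient of the action loss alone w.r.t. the VLM parameters.
# With the detach in place, every entry is None (no dependency).
action_grads = torch.autograd.grad(
    action_loss, list(vlm.parameters()), allow_unused=True, retain_graph=True
)

(lm_loss + action_loss).backward()

# The VLM is not frozen: it still has trainable parameters and still
# receives a gradient, but only from the language-modeling loss.
vlm_is_trainable = all(p.requires_grad for p in vlm.parameters())
vlm_has_grad = vlm.weight.grad is not None
action_expert_has_grad = action_expert.weight.grad is not None
```

If this is roughly what happens, the VLM stays trainable and the stop-gradient is applied only on the path into the action expert. Freezing (setting `requires_grad=False` on the VLM parameters) would instead block every loss from updating the VLM, which seems different from what the paper describes.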
Thanks