⚡ Customization of VERL for ai-infra #2
base: rel/finai-v0.0.1
Conversation
data = TensorDict(data, batch_size=self.config.data.train_batch_size).to(self.device_name)
metric = self.training_step(data)
- train_time += metric["train/time(s)"]
+ train_time += metric["train/time_s"]
🐞 The old key `train/time(s)` was causing an MLflow metric-logging error (parentheses are not accepted in MLflow metric names).
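The failure mode can be illustrated with a small sketch. This is not the PR's code: the validation regex mirrors MLflow's documented rule that metric keys may only contain alphanumerics, underscores, dashes, periods, spaces, and slashes, and `sanitize_metric_key` is a hypothetical helper:

```python
import re

# Characters MLflow accepts in metric/param keys: alphanumerics,
# underscore, dash, period, space, and slash.
_VALID_KEY = re.compile(r"^[/\w.\- ]*$")

def sanitize_metric_key(key: str) -> str:
    """Replace characters MLflow disallows with underscores, trimming any
    trailing underscores the substitution leaves behind."""
    if _VALID_KEY.match(key):
        return key
    return re.sub(r"[^/\w.\- ]", "_", key).rstrip("_")

# sanitize_metric_key("train/time(s)") -> "train/time_s"
```

Renaming the key at the source, as the diff does, avoids needing any sanitizer at logging time.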
def _compute_loss_and_backward(self, batch, do_backward=True, n_micro_batches=1):
    """Compute loss with optional sequence parallelism and remove padding features"""
-   use_sp = self.use_remove_padding and self.config.ulysses_sequence_parallel_size > 1
+   use_sp = self.use_remove_padding and self.config.ulysses_sequence_parallel_size >= 1
🐞 Previously this didn't let us skip padding without sequence parallelism, which made memory usage incredibly high. Super small fix for that.
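For context, "remove padding" flattens a padded batch into only its real tokens, so downstream compute scales with actual token count rather than batch_size × max_seq_len. A minimal sketch of the idea (names are illustrative, not verl's implementation):

```python
import torch

def remove_padding(input_ids: torch.Tensor, attention_mask: torch.Tensor):
    """Gather only the non-pad token ids of a padded (batch, seq_len) batch
    into one flat sequence, returning the flat tokens and their original
    flat indices (needed to scatter results back)."""
    indices = torch.nonzero(attention_mask.flatten(), as_tuple=False).flatten()
    return input_ids.flatten()[indices], indices
```

With heavily padded batches, the flat sequence can be a small fraction of the padded tensor, which is where the memory savings come from.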
    sampling_metadata: SamplingMetadata,
) -> torch.Tensor:
-   logits = original_compute_logits(hidden_states, sampling_metadata)
+   logits = original_compute_logits(hidden_states)
🐞 Failing after the vLLM 0.11.0 update, since `SamplingMetadata` was deprecated and removed. Checked the vLLM repo and made a small fix to keep this aligned.
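A more defensive way to stay compatible across vLLM versions (a sketch, not what the PR does) is to inspect the wrapped function's signature and only pass the argument if it is still accepted:

```python
import inspect

def call_compute_logits(fn, hidden_states, sampling_metadata=None):
    """Call a compute_logits implementation with or without
    sampling_metadata, depending on whether its signature still accepts
    that parameter (it was removed in newer vLLM releases)."""
    if "sampling_metadata" in inspect.signature(fn).parameters:
        return fn(hidden_states, sampling_metadata)
    return fn(hidden_states)
```

Hard-coding the new call, as the diff does, is simpler when only one vLLM version needs to be supported.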
rollout_reward_scores = data_item.non_tensor_batch.get("reward_scores", {})
extra_info["num_turns"] = num_turns
extra_info["rollout_reward_scores"] = rollout_reward_scores
extra_info["prompt_str"] = prompt_str
Passing the prompt string through, since we need it for some of our reward computation.
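As a toy illustration of why the prompt needs to be threaded through `extra_info` (the actual reward functions are different; this only shows the access pattern):

```python
def compute_reward(response: str, extra_info: dict) -> float:
    """Hypothetical prompt-aware reward: penalize a response that merely
    echoes the prompt. Any reward that compares the response against the
    prompt needs extra_info["prompt_str"] to be populated upstream."""
    prompt = extra_info.get("prompt_str", "")
    return 0.0 if response.strip() == prompt.strip() else 1.0
```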
# If the experiment does not exist, a new one will be created
experiment = mlflow.set_experiment(project_name)
mlflow.start_run(experiment_id=experiment.experiment_id, run_name=experiment_name)
mlflow_tags = os.getenv("MLFLOW_TAGS", None)
Enhanced to support MLflow tags. Good for experiment logging in general.
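One plausible shape for this (the actual format of `MLFLOW_TAGS` is not shown in the diff; a JSON object is an assumption here, and `read_mlflow_tags` is a hypothetical helper):

```python
import json
import os

def read_mlflow_tags(env_var: str = "MLFLOW_TAGS") -> dict:
    """Parse run tags from an environment variable assumed to hold a JSON
    object, e.g. '{"team": "ai-infra", "stage": "rl"}'; returns {} when
    unset. The result could then be passed to mlflow.set_tags(...)."""
    raw = os.getenv(env_var)
    return json.loads(raw) if raw else {}
```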
elif is_npu_available:
    torch.distributed.all_reduce(step_loss)
    step_loss /= self.device_mesh.size(0)
return {
Added more metrics for tracking
Overrides to the default verl behaviour, specifically for our use cases.
docker/setup_and_run.sh
fi

echo "== Setup the active user =="
if [ -n "${HOST_UID}" ] && [ -n "${HOST_GID}" ] && [ -n "${INTERCOM_USER}" ]; then
🔴 Useful locally but failing on EKS. Need to debug why.
Skipping running as the active user and defaulting to root for now. Feels like a time sink at the moment.
🔴 This is brittle and could use a review/rewrite.
Restructured a fair bit, but still not able to move it into the form of new recipes. Should do for now though.
Warning: Socket detected alerts in direct dependencies. According to the organization's Security Policy, it is recommended to resolve "Warn" alerts.
Changed a bunch of things:
- Use the same base image as the rest of the model_training/ EKS jobs
- Use uv for installation (a lot of packages still require no build isolation)
- Made the build faster and the image smaller (reduced by ~5 GB; builds in 20 min, down from 30 min earlier)
- Pin core deps with uv.lock
- Remove unnecessary steps
- Removed the hack to install ai-datasets (the image still has the API key though)
Doc with summary of changes - Link