From 41640f4add5dc2417415c953e89c6e7c0dffa44c Mon Sep 17 00:00:00 2001 From: rentainhe <596106517@qq.com> Date: Fri, 2 Aug 2024 21:21:39 +0800 Subject: [PATCH] refine installation --- README.md | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/README.md b/README.md index 7ad7c42..0129599 100644 --- a/README.md +++ b/README.md @@ -16,6 +16,12 @@ Grounded SAM 2 does not introduce significant methodological changes compared to ## Installation +Install PyTorch environment first. We use `python=3.10`, as well as `torch >= 2.3.1`, `torchvision>=0.18.1` and `cuda-12.1` in our environment to run this demo. Please follow the instructions [here](https://pytorch.org/get-started/locally/) to install both PyTorch and TorchVision dependencies. Installing both PyTorch and TorchVision with CUDA support is strongly recommended. You can easily install the latest version of PyTorch as follows: + +```bash +pip3 install torch torchvision torchaudio +``` + Since we need the CUDA compilation environment to compile the `Deformable Attention` operator used in Grounding DINO, we need to check whether the CUDA environment variables have been set correctly (which you can refer to [Grounding DINO Installation](https://github.com/IDEA-Research/GroundingDINO?tab=readme-ov-file#hammer_and_wrench-install) for more details). You can set the environment variable manually as follows if you want to build a local GPU environment for Grounding DINO to run Grounded SAM 2: ```bash