Installing aphrodite-engine on Ubuntu 24.04 server with Cuda 12.8 and RTX 5080 #1389
Unanswered
marc-ramsey
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I've been trying for several days now to get what should (?) have been a simple installation to work. However, I need to use at least Cuda 12.8 (earliest with 5080 support), which complicates matters. Using the default NVidia driver installation instructions results in cuda r12.9, which doesn't appear to be supported by PyTorch yet Forcing use of cuda r12.8 for a headless installation apparently requires the following gyrations:
sudo apt purge dkms *nvidia*sudo apt install linux-headers-$(uname -r)sudo apt updatesudo apt install nvidia-headless-570-server-opensudo apt install nvidia-utils-570-serversudo apt install nvidia-cuda-devsudo apt install cuda-toolkit-12-8That all seems to work, nvidia-smi and clinfo detect the GPU correctly, stand alone installation of torch 2.7 works fine. So:
git clone https://github.com/PygmalionAI/aphrodite-engine.gitcd aphrodite-engine/python3 -m venv venvsource venv/bin/activatepip install -r requirements-cuda.txtpip install wheel ninja==1.10export PATH=/usr/local/cuda-12.8/bin:$PATHexport LD_LIBRARY_PATH=/usr/local/cuda-12.8/lib64:$LD_LIBRARY_PATHexport CUDA_HOME=/usr/local/cuda-12.8MAX_JOBS=4 pip install -e .Clean build, no problem, how about
aphrodite run Qwen/Qwen2.5-Coder-7B-Instruct:As far as I can figure out, the main issue here is that requirements-common.txt requires PyTorch 2.6 which is built with Cuda 12.4 libraries, no sm_120. The PyTorch.org nightly build is 2.7 built with Cuda 12.8 which does have sm_120 (and it works), but is not compatible with building the current aphrodite-engine source. How do I get around this? Thanks, Marc
Beta Was this translation helpful? Give feedback.
All reactions