- Install NVIDIA Driver, CUDA Toolkit & Python Dependencies
Stable installation from requirements.txt:
chmod +x install.sh
./install.sh stable
Install the latest version of the libraries:
chmod +x install.sh
./install.sh latest
If the installed NVIDIA driver version is >= 450.80.02, install the CUDA 11.6 Toolkit:
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.0-1_all.deb
sudo dpkg -i cuda-keyring_1.0-1_all.deb
sudo apt-get update
sudo apt install cuda-toolkit-11-6
If the installed NVIDIA driver version is < 450.80.02, first remove the previously installed NVIDIA driver:
sudo apt clean
sudo apt update
sudo apt purge 'nvidia-*'
Then download and install NVIDIA driver version 510 and CUDA 11.6:
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.0-1_all.deb
sudo dpkg -i cuda-keyring_1.0-1_all.deb
sudo apt-get update
sudo apt install nvidia-driver-510 cuda-toolkit-11-6
python3 -m pip install transformers tensorboard datasets nvidia-ml-py3 python-dotenv requests huggingface_hub evaluate pyext accelerate
pip install torch --force-reinstall --extra-index-url https://download.pytorch.org/whl/cu116
sudo apt install libaio-dev ninja-build
pip install deepspeed
If the error "deepspeed: command not found" appears after installation (encountered on Ubuntu 20):
nano ~/.bashrc
Add the following line at the end of the file (replace user with your username) and save:
export PATH="/home/user/.local/bin:$PATH"
You will then need to reload your profile; do this by running the command:
source ~/.bashrc
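
Two decisions in the steps above are easy to get wrong by eye: whether the installed driver is at or above 450.80.02, and whether ~/.local/bin really ended up on PATH. The following is a minimal sketch of both checks, not part of install.sh. The helper names (version_ge, driver_branch, path_contains) and the branch labels are hypothetical, and the version comparison assumes GNU sort with the -V option, as shipped on Ubuntu.

```shell
# Sketch only: these helpers are illustrative and not part of install.sh.

MIN_DRIVER="450.80.02"   # threshold quoted in the instructions above

# Return 0 (true) if dotted version $1 is >= dotted version $2.
# Assumes GNU sort, whose -V flag sorts version strings numerically per field.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Print which branch of the instructions applies for driver version $1.
# The labels "toolkit-only" and "reinstall-driver" are made up for this sketch.
driver_branch() {
    if version_ge "$1" "$MIN_DRIVER"; then
        echo "toolkit-only"
    else
        echo "reinstall-driver"
    fi
}

# Return 0 if directory $1 is an entry of the colon-separated list $2.
path_contains() {
    case ":$2:" in
        *":$1:"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Query the driver only when nvidia-smi is actually installed.
if command -v nvidia-smi >/dev/null 2>&1; then
    installed="$(nvidia-smi --query-gpu=driver_version --format=csv,noheader | head -n 1)"
    echo "Driver ${installed}: $(driver_branch "$installed")"
else
    echo "nvidia-smi not found; no NVIDIA driver detected"
fi

# Check the PATH entry that the deepspeed fix relies on.
if path_contains "$HOME/.local/bin" "$PATH"; then
    echo "$HOME/.local/bin is on PATH"
else
    echo "$HOME/.local/bin is missing from PATH"
fi
```

Run it once before choosing a CUDA branch and again after source ~/.bashrc. If the second message still reports the directory as missing, the export line was probably not saved or points at a different home directory.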