Update README.md
README.md CHANGED
@@ -48,7 +48,7 @@ Select 'Stable + Linux + Pip + Python + ROCm' to get the specific pip installati
 
 An example command line (note the versioning of the whl file):
 
-> `pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/
+> `pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2`
 
 ### TensorFlow
 
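A quick sanity check after installing the ROCm wheel (a minimal sketch: on ROCm builds of PyTorch, `torch.version.hip` is set and the `cuda` device API maps to the AMD GPU):

```shell
# Prints the HIP/ROCm version baked into the wheel and whether a GPU is visible.
python3 -c "import torch; print(torch.version.hip, torch.cuda.is_available())"
```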
@@ -99,7 +99,7 @@ with Hugging Face models are available on the [Optimum page](https://huggingface
 
 # Serving a model with TGI
 
 Text Generation Inference (a.k.a. “TGI”) provides an end-to-end solution to deploy large language models for inference at scale.
-TGI is already usable in production on AMD Instinct™ GPUs through the docker image `ghcr.io/huggingface/text-generation-inference:
+TGI is already usable in production on AMD Instinct™ GPUs through the docker image `ghcr.io/huggingface/text-generation-inference:latest-rocm`.
 Make sure to refer to the [documentation](https://huggingface.co/docs/text-generation-inference/supported_models#supported-hardware)
 concerning the support and any limitations.
 
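A minimal launch sketch for the ROCm image (the model id is a placeholder; mounting `/dev/kfd` and `/dev/dri` is the standard way to expose AMD GPUs to a container):

```shell
# Launch TGI on an AMD GPU; ./data caches downloaded weights across runs.
docker run --rm -p 8080:80 \
  --device=/dev/kfd --device=/dev/dri \
  --group-add video --ipc=host --shm-size 1g \
  -v "$PWD/data:/data" \
  ghcr.io/huggingface/text-generation-inference:latest-rocm \
  --model-id meta-llama/Llama-2-7b-chat-hf   # placeholder model id

# Query the server once it reports ready.
curl 127.0.0.1:8080/generate \
  -X POST -H 'Content-Type: application/json' \
  -d '{"inputs": "What is ROCm?", "parameters": {"max_new_tokens": 64}}'
```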
@@ -110,6 +110,7 @@ across normal and distributed settings, with various supported optimizations and
 
 # Useful Links and Blogs
 
+- Detailed Llama-3 results in [Run TGI on AMD Instinct MI300X](https://huggingface.co/blog/huggingface-amd-mi300)
 - Detailed Llama-2 results showcasing the [Optimum benchmark on AMD Instinct MI250](https://huggingface.co/blog/huggingface-and-optimum-amd)
 - Check out our blog titled [Run a ChatGPT-like Chatbot on a Single GPU with ROCm](https://huggingface.co/blog/chatbot-amd-gpu)
 - Complete ROCm [Documentation](https://rocm.docs.amd.com/en/latest/) for installation and usage