diff --git a/.gitignore b/.gitignore
index 5805ae7..6a65503 100644
--- a/.gitignore
+++ b/.gitignore
@@ -1,3 +1,2 @@
 .venv/
-.csv
-.png
\ No newline at end of file
+*.csv
\ No newline at end of file
diff --git a/README.md b/README.md
index fba07a9..02c5ac1 100644
--- a/README.md
+++ b/README.md
@@ -9,7 +9,7 @@ To generate the required CSV file, use [llama-benchy](https://github.com/eugr/ll
 Example command to generate the CSV:
 
 ```bash
-uvx llama-benchy --base-url http://127.0.0.1:8000/v1 --model Qwen/Qwen3.6-27B --served-model-name unsloth/Qwen3.6-27B-GGUF --concurrency 1 2 4 8 16 32 --pp 128 --tg 128 --format csv
+uvx llama-benchy --base-url http://127.0.0.1:8000/v1 --model Qwen/Qwen3-0.6B --served-model-name Qwen/Qwen3-0.6B --concurrency 1 2 4 8 16 32 --pp 128 --tg 128 --format csv
 ```
 
 This will produce a CSV file with benchmark results that can be used as input for Benchy-Graph.
@@ -25,4 +25,10 @@ Replace `` with the path to your llama-benchy CSV file and `
 This example has been made with [mixa3607/ML-gfx906](https://github.com/mixa3607/ML-gfx906/tree/master/vllm) custom vLLM docker image
+
+![vLLM Qwen3-0.6B Demo](docs/img/demo.png)
\ No newline at end of file
diff --git a/docs/img/demo.png b/docs/img/demo.png
new file mode 100644
index 0000000..38016cf
Binary files /dev/null and b/docs/img/demo.png differ