Fails to run on my Spark with the Docker command from the documentation

#6
by VladNC - opened

First I had to reduce the memory to 0.90, which made it not fail immediately. But now it fails after loading the model, with:

(APIServer pid=532)   File "/usr/local/lib/python3.12/dist-packages/vllm/entrypoints/openai/parser/harmony_utils.py", line 63, in get_encoding
(APIServer pid=532)     _harmony_encoding = load_harmony_encoding(HarmonyEncodingName.HARMONY_GPT_OSS)
(APIServer pid=532)                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=532)   File "/usr/local/lib/python3.12/dist-packages/openai_harmony/__init__.py", line 699, in load_harmony_encoding
(APIServer pid=532)     inner: _PyHarmonyEncoding = _load_harmony_encoding(name)
(APIServer pid=532)                                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=532) openai_harmony.HarmonyError: error downloading or loading vocab file: failed to download or load vocab file

Sign up or log in to comment