Demo gpu sharing for mps does not start inferencing after downloading pytorch_model.bin #56

ltson4121994 · 2024-07-30T02:47:19Z

Starting Prometheus server on port 8000...
Running benchmark...
Downloading (…)cessor_config.json";: 100%|██████████| 292/292 [00:00<00:00, 27.0kB/s]
Downloading (…)"config.json";: 100%|██████████| 4.13k/4.13k [00:00<00:00, 244kB/s]
Downloading (…)"pytorch_model.bin";: 100%|██████████| 123M/123M [11:58<00:00, 171kB/s]

The line "Running inference..." is never printed, so I assume something goes wrong when the model is loaded onto the GPU. Here are the MPS server and control logs:

==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:31:59.303 Other   138] Initializing server process
[2024-07-30 02:31:59.339 Server   138] Creating server context on device 0 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:31:59.401 Server   138] Creating server context on device 1 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:31:59.456 Server   138] Created named shared memory region /cuda.shm.3e8.8a.1

==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:31:59.456 Control    58] NEW SERVER 138: Ignoring connection from user

==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:31:59.456 Server   138] Active Threads Percentage set to 0.0
[2024-07-30 02:32:36.506 Server   138] Server Priority set to 0
[2024-07-30 02:32:36.506 Server   138] Server has started
[2024-07-30 02:32:36.506 Server   138] Destroy server context on device 0
[2024-07-30 02:32:36.545 Server   138] Destroy server context on device 1

==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.581 Control    58] Server 138 exited with status 0
[2024-07-30 02:32:36.581 Control    58] Starting new server 144 for user 1000

==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:32:36.601 Other   144] Startup
[2024-07-30 02:32:36.601 Other   144] Connecting to control daemon on socket: /tmp/nvidia-mps/control

==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.601 Control    58] Accepting connection...

==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:32:36.601 Other   144] Initializing server process
[2024-07-30 02:32:36.641 Server   144] Creating server context on device 0 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:32:36.704 Server   144] Creating server context on device 1 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:32:36.768 Server   144] Created named shared memory region /cuda.shm.3e8.90.1

==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.768 Control    58] NEW SERVER 144: Ready

==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:32:36.768 Server   144] Active Threads Percentage set to 100.0
[2024-07-30 02:32:36.768 Server   144] Server Priority set to 0
[2024-07-30 02:32:36.768 Server   144] Server has started
[2024-07-30 02:32:36.768 Server   144] Received new client request
[2024-07-30 02:32:36.799 Server   144] Worker created
[2024-07-30 02:32:36.799 Server   144] Creating worker thread
[2024-07-30 02:32:36.799 Server   144] Waiting for current clients to finish

==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.847 Control    58] Accepting connection...
[2024-07-30 02:32:36.848 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:37:55.850 Control    58] Accepting connection...
[2024-07-30 02:37:55.850 Control    58] User did not send valid credentials
[2024-07-30 02:37:55.850 Control    58] Accepting connection...
[2024-07-30 02:37:55.851 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:41:25.952 Control    58] Accepting connection...
[2024-07-30 02:41:25.952 Control    58] User did not send valid credentials
[2024-07-30 02:41:25.952 Control    58] Accepting connection...
[2024-07-30 02:41:25.952 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:42:55.872 Control    58] Accepting connection...
[2024-07-30 02:42:55.872 Control    58] User did not send valid credentials
[2024-07-30 02:42:55.872 Control    58] Accepting connection...
[2024-07-30 02:42:55.872 Control    58] NEW CLIENT 0 from user 0: Server is not ready, push client to pending list
[2024-07-30 02:49:23.964 Control    58] Accepting connection...
[2024-07-30 02:49:23.964 Control    58] User did not send valid credentials
[2024-07-30 02:49:23.964 Control    58] Accepting connection...
[2024-07-30 02:49:23.964 Control    58] NEW CLIENT 0 from user 0: Server is not ready, push client to pending list
[2024-07-30 02:50:09.170 Control    58] Accepting connection...
[2024-07-30 02:50:09.247 Control    58] User did not send valid credentials
[2024-07-30 02:50:09.247 Control    58] Accepting connection...
[2024-07-30 02:50:09.247 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:51:05.370 Control    58] Accepting connection...
[2024-07-30 02:51:05.370 Control    58] User did not send valid credentials
[2024-07-30 02:51:05.370 Control    58] Accepting connection...
[2024-07-30 02:51:05.370 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:52:51.748 Control    58] Accepting connection...
[2024-07-30 02:52:51.749 Control    58] User did not send valid credentials
[2024-07-30 02:52:51.749 Control    58] Accepting connection...
[2024-07-30 02:52:51.749 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:54:55.658 Control    58] Accepting connection...
[2024-07-30 02:54:55.658 Control    58] User did not send valid credentials
[2024-07-30 02:54:55.658 Control    58] Accepting connection...
[2024-07-30 02:54:55.658 Control    58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:57:06.983 Control    58] Accepting connection...
[2024-07-30 02:57:06.984 Control    58] User did not send valid credentials
[2024-07-30 02:57:06.984 Control    58] Accepting connection...
[2024-07-30 02:57:06.984 Control    58] NEW CLIENT 0 from user 0: Server is not ready, push client to pending list
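To make the repeating pattern in control.log easier to quantify, here is a minimal sketch that counts how often a client was pushed to the pending list and how much time those entries span. It assumes the line format shown in the logs above. The names `LINE_RE`, `PENDING`, and `summarize_pending` are hypothetical helpers for illustration, not part of any NVIDIA tool, and the sample lines are copied from the log above.

```python
import re
from datetime import datetime

# Assumed line format, taken from the log excerpt above:
# [YYYY-MM-DD HH:MM:SS.mmm Component   PID] message
LINE_RE = re.compile(
    r"^\[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) (\w+)\s+(\d+)\] (.*)$"
)
PENDING = "Server is not ready, push client to pending list"


def summarize_pending(lines):
    """Return (count, span) for log lines that report a pending client.

    span is the time between the first and last matching line,
    or None if no line matches.
    """
    stamps = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match and PENDING in match.group(4):
            stamps.append(
                datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
            )
    if not stamps:
        return 0, None
    return len(stamps), stamps[-1] - stamps[0]


# Sample lines copied from the control.log excerpt above.
sample = [
    "[2024-07-30 02:32:36.848 Control    58] NEW CLIENT 0 from user 1000: "
    "Server is not ready, push client to pending list",
    "[2024-07-30 02:37:55.850 Control    58] User did not send valid credentials",
    "[2024-07-30 02:37:55.851 Control    58] NEW CLIENT 0 from user 1000: "
    "Server is not ready, push client to pending list",
]

count, span = summarize_pending(sample)
print(f"pending entries: {count}, time span: {span}")
```

To run it against the real file, the lines could be read with `open("/tmp/nvidia-mps/control.log")` and passed to `summarize_pending` in place of `sample`.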
