You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Starting Prometheus server on port 8000...
Running benchmark...
Downloading (…)cessor_config.json";: 100%|██████████| 292/292 [00:00<00:00, 27.0kB/s]
Downloading (…)"config.json";: 100%|██████████| 4.13k/4.13k [00:00<00:00, 244kB/s]
Downloading (…)"pytorch_model.bin";: 100%|██████████| 123M/123M [11:58<00:00, `171kB/s]
The line Running inference... is not printed out so I assume there is some problem when the model is loaded to GPU. Here is the MPS server log:
==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:31:59.303 Other 138] Initializing server process
[2024-07-30 02:31:59.339 Server 138] Creating server context on device 0 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:31:59.401 Server 138] Creating server context on device 1 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:31:59.456 Server 138] Created named shared memory region /cuda.shm.3e8.8a.1
==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:31:59.456 Control 58] NEW SERVER 138: Ignoring connection from user
==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:31:59.456 Server 138] Active Threads Percentage set to 0.0
[2024-07-30 02:32:36.506 Server 138] Server Priority set to 0
[2024-07-30 02:32:36.506 Server 138] Server has started
[2024-07-30 02:32:36.506 Server 138] Destroy server context on device 0
[2024-07-30 02:32:36.545 Server 138] Destroy server context on device 1
==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.581 Control 58] Server 138 exited with status 0
[2024-07-30 02:32:36.581 Control 58] Starting new server 144 for user 1000
==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:32:36.601 Other 144] Startup
[2024-07-30 02:32:36.601 Other 144] Connecting to control daemon on socket: /tmp/nvidia-mps/control
==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.601 Control 58] Accepting connection...
==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:32:36.601 Other 144] Initializing server process
[2024-07-30 02:32:36.641 Server 144] Creating server context on device 0 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:32:36.704 Server 144] Creating server context on device 1 (NVIDIA GeForce RTX 2080 Ti)
[2024-07-30 02:32:36.768 Server 144] Created named shared memory region /cuda.shm.3e8.90.1
==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.768 Control 58] NEW SERVER 144: Ready
==> /tmp/nvidia-mps/server.log <==
[2024-07-30 02:32:36.768 Server 144] Active Threads Percentage set to 100.0
[2024-07-30 02:32:36.768 Server 144] Server Priority set to 0
[2024-07-30 02:32:36.768 Server 144] Server has started
[2024-07-30 02:32:36.768 Server 144] Received new client request
[2024-07-30 02:32:36.799 Server 144] Worker created
[2024-07-30 02:32:36.799 Server 144] Creating worker thread
[2024-07-30 02:32:36.799 Server 144] Waiting for current clients to finish
==> /tmp/nvidia-mps/control.log <==
[2024-07-30 02:32:36.847 Control 58] Accepting connection...
[2024-07-30 02:32:36.848 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:37:55.850 Control 58] Accepting connection...
[2024-07-30 02:37:55.850 Control 58] User did not send valid credentials
[2024-07-30 02:37:55.850 Control 58] Accepting connection...
[2024-07-30 02:37:55.851 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:41:25.952 Control 58] Accepting connection...
[2024-07-30 02:41:25.952 Control 58] User did not send valid credentials
[2024-07-30 02:41:25.952 Control 58] Accepting connection...
[2024-07-30 02:41:25.952 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:42:55.872 Control 58] Accepting connection...
[2024-07-30 02:42:55.872 Control 58] User did not send valid credentials
[2024-07-30 02:42:55.872 Control 58] Accepting connection...
[2024-07-30 02:42:55.872 Control 58] NEW CLIENT 0 from user 0: Server is not ready, push client to pending list
[2024-07-30 02:49:23.964 Control 58] Accepting connection...
[2024-07-30 02:49:23.964 Control 58] User did not send valid credentials
[2024-07-30 02:49:23.964 Control 58] Accepting connection...
[2024-07-30 02:49:23.964 Control 58] NEW CLIENT 0 from user 0: Server is not ready, push client to pending list
[2024-07-30 02:50:09.170 Control 58] Accepting connection...
[2024-07-30 02:50:09.247 Control 58] User did not send valid credentials
[2024-07-30 02:50:09.247 Control 58] Accepting connection...
[2024-07-30 02:50:09.247 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:51:05.370 Control 58] Accepting connection...
[2024-07-30 02:51:05.370 Control 58] User did not send valid credentials
[2024-07-30 02:51:05.370 Control 58] Accepting connection...
[2024-07-30 02:51:05.370 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:52:51.748 Control 58] Accepting connection...
[2024-07-30 02:52:51.749 Control 58] User did not send valid credentials
[2024-07-30 02:52:51.749 Control 58] Accepting connection...
[2024-07-30 02:52:51.749 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:54:55.658 Control 58] Accepting connection...
[2024-07-30 02:54:55.658 Control 58] User did not send valid credentials
[2024-07-30 02:54:55.658 Control 58] Accepting connection...
[2024-07-30 02:54:55.658 Control 58] NEW CLIENT 0 from user 1000: Server is not ready, push client to pending list
[2024-07-30 02:57:06.983 Control 58] Accepting connection...
[2024-07-30 02:57:06.984 Control 58] User did not send valid credentials
[2024-07-30 02:57:06.984 Control 58] Accepting connection...
[2024-07-30 02:57:06.984 Control 58] NEW CLIENT 0 from user 0: Server is not ready, push client to pending list
The text was updated successfully, but these errors were encountered:
Starting Prometheus server on port 8000...
Running benchmark...
Downloading (…)cessor_config.json";: 100%|██████████| 292/292 [00:00<00:00, 27.0kB/s]
Downloading (…)"config.json";: 100%|██████████| 4.13k/4.13k [00:00<00:00, 244kB/s]
Downloading (…)"pytorch_model.bin";: 100%|██████████| 123M/123M [11:58<00:00, `171kB/s]
The line Running inference... is not printed out so I assume there is some problem when the model is loaded to GPU. Here is the MPS server log:
The text was updated successfully, but these errors were encountered: