Galaxy GTX 460
Was working fine. Now I get this in the client logs:
[03:12:12] mdrun_gpu returned 52
[03:12:12] NANs detected on GPU
It then shuts down the client for 24 hours.
This card is only a month old. Anything I can do to fix it?
Seems to run fine otherwise....
Edit: ok, more info
[03:16:41] Starting GUI Server
[03:16:41] Setting checkpoint frequency: 500000
[03:16:41] Setting checkpoint frequency: 500000
[03:18:38] Completed 500000 out of 50000000 steps (1%).
[03:18:38] mdrun_gpu returned 52
[03:18:38] NANs detected on GPU
[03:18:38]
[03:18:38] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:18:40] CoreStatus = 7A (122)
[03:18:40] Sending work to server
[03:18:40] Project: 6806 (Run 2142, Clone 1, Gen 30)
[03:18:40] - Read packet limit of 540015616... Set to 524286976.
[03:18:40] - Error: Could not get length of results file work/wuresults_05.dat
[03:18:40] - Error: Could not read unit 05 file. Removing from queue.
[03:18:40] EUE limit exceeded. Pausing 24 hours.
I've also deleted the contents of the work folder and the logs and core from the client folder and tried a fresh start.
Was working fine. Now I get this in the client logs:
[03:12:12] mdrun_gpu returned 52
[03:12:12] NANs detected on GPU
It then shuts down the client for 24 hours.
This card is only a month old. Anything I can do to fix it?
Seems to run fine otherwise....
Edit: ok, more info
[03:16:41] Starting GUI Server
[03:16:41] Setting checkpoint frequency: 500000
[03:16:41] Setting checkpoint frequency: 500000
[03:18:38] Completed 500000 out of 50000000 steps (1%).
[03:18:38] mdrun_gpu returned 52
[03:18:38] NANs detected on GPU
[03:18:38]
[03:18:38] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:18:40] CoreStatus = 7A (122)
[03:18:40] Sending work to server
[03:18:40] Project: 6806 (Run 2142, Clone 1, Gen 30)
[03:18:40] - Read packet limit of 540015616... Set to 524286976.
[03:18:40] - Error: Could not get length of results file work/wuresults_05.dat
[03:18:40] - Error: Could not read unit 05 file. Removing from queue.
[03:18:40] EUE limit exceeded. Pausing 24 hours.
I've also deleted the contents of the work folder and the logs and core from the client folder and tried a fresh start.
Last edited: