Welcome to MilkyWay@home

Many failed GPU workunits in a row



Message boards : Number crunching : Many failed GPU workunits in a row

robertmiles

Joined: 30 Sep 09
Posts: 211
Credit: 35,322,576
RAC: 8,011
Message 64475 - Posted: 16 Apr 2016, 19:42:03 UTC

Many GPU workunits failed in a row: at least two dozen, from various BOINC projects.

MilkyWay@Home v1.02 (opencl_nvidia):
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971920
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971926
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971928

Milkyway@Home Separation (Modified Fit) v1.36 (opencl_nvidia_101):
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971742
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971744
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971749
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971751
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1559971753

These were part of a large cluster of GPU workunits from various BOINC projects that failed at about the same time, so at least one of them must have left BOINC and the driver in a state where another GPU workunit would not start properly.

robertmiles

Message 64476 - Posted: 17 Apr 2016, 1:47:10 UTC

The OpenCL section of the Nvidia 364.72 driver, and of earlier 364.* drivers, has a problem that can cause an entire computer to lock up, or cause a few dozen OpenCL tasks (often not all from the same BOINC project) to fail with a quick Compute Error. The problem is not seen with the 362.00 driver.

Tasks from POEM@home seem the most likely to trigger this problem.

Threads on the problems:

https://www.primegrid.com/forum_thread.php?id=6769#94223

http://boinc.fzk.de/poem/forum_thread.php?id=1205#10896

robertmiles

Message 64530 - Posted: 3 May 2016, 1:43:18 UTC - in response to Message 64476.  

The 365.10 driver is now available, but it does not fix this problem.