Welcome to MilkyWay@home

Computation errors.

Phil

Joined: 3 Dec 12
Posts: 6
Credit: 372,276
RAC: 0
Message 60097 - Posted: 3 Oct 2013, 21:10:31 UTC

I'm getting computation errors on the n-body tasks again.
Richard Haselgrove

Joined: 4 Sep 12
Posts: 219
Credit: 456,474
RAC: 0
Message 60098 - Posted: 3 Oct 2013, 21:28:57 UTC - in response to Message 60097.  

I'm getting computation errors on the n-body tasks again.

Mine are running fine. Windows 7/64, BOINC v7.2.16

Looks like I got the new app, DLLs, isotropic LUA, and isotropic histogram all in one go this morning.
Phil

Joined: 3 Dec 12
Posts: 6
Credit: 372,276
RAC: 0
Message 60099 - Posted: 4 Oct 2013, 0:25:48 UTC - in response to Message 60097.  

I'm running BOINC version 7.0.64 (x64). Could this be my problem? I can't find an upgrade on the home page.
Richard Haselgrove

Joined: 4 Sep 12
Posts: 219
Credit: 456,474
RAC: 0
Message 60100 - Posted: 4 Oct 2013, 9:23:26 UTC - in response to Message 60099.  

I'm running BOINC version 7.0.64 (x64). Could this be my problem? I can't find an upgrade on the home page.

Unlikely. v7.0.64 may not be perfect for running MT applications, but it's OK.

Your tasks (which hadn't been reported when I last posted) are now all showing exit code -1073741515 (0xc0000135): "The application failed to initialise properly." This usually indicates missing or damaged DLLs. Sometimes anti-virus programs block the download of new executable files, and simple network congestion, when everybody is trying to download the same file at the same time, can have the same effect.
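For readers wondering how the two forms of the exit code relate: the negative decimal number is the same 32-bit value as the hex code, just read as a signed integer. A minimal Python sketch (the function name `to_ntstatus_hex` is made up for illustration):

```python
def to_ntstatus_hex(exit_code: int) -> str:
    """Return the unsigned 32-bit hex form of a signed exit code."""
    # Masking with 0xFFFFFFFF reinterprets a negative value as unsigned 32-bit.
    return f"0x{exit_code & 0xFFFFFFFF:08x}"


if __name__ == "__main__":
    print(to_ntstatus_hex(-1073741515))  # prints 0xc0000135
```

Searching for the hex form usually turns up more useful results than searching for the negative decimal.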

You could try downloading the two N-Body DLLs manually, and replacing the possibly damaged copies in your Milkyway project directory:

http://milkyway.cs.rpi.edu/milkyway/download/libgomp_64-1_nbody_1.38.dll (49,152 bytes)
http://milkyway.cs.rpi.edu/milkyway/download/pthreadGC2_64_nbody_1.38.dll (49,152 bytes)
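A quick way to spot a truncated or missing download is to compare each file's size on disk with the sizes quoted above. This is only a sketch: the function name and the example directory are assumptions, and the real project directory depends on where BOINC keeps its data on your machine.

```python
from pathlib import Path

# File names and sizes as quoted in the post above.
EXPECTED_SIZES = {
    "libgomp_64-1_nbody_1.38.dll": 49152,
    "pthreadGC2_64_nbody_1.38.dll": 49152,
}


def check_dll_sizes(project_dir, expected=EXPECTED_SIZES):
    """Return {filename: problem} for files that are missing or the wrong size."""
    problems = {}
    for name, size in expected.items():
        path = Path(project_dir) / name
        if not path.is_file():
            problems[name] = "missing"
        else:
            actual = path.stat().st_size
            if actual != size:
                problems[name] = f"size {actual}, expected {size}"
    return problems


if __name__ == "__main__":
    # Hypothetical location; substitute your own Milkyway project directory.
    print(check_dll_sizes("path/to/milkyway/project/directory"))
```

An empty result means both files are present with the expected size. A matching size does not prove the contents are intact, so replacing the files is still the safer fix.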
mikey

Joined: 8 May 09
Posts: 3315
Credit: 519,940,055
RAC: 22,560
Message 60102 - Posted: 4 Oct 2013, 12:24:47 UTC - in response to Message 60099.  

I'm running BOINC version 7.0.64 (x64). Could this be my problem? I can't find an upgrade on the home page.


There ARE some newer beta versions, but I am finishing most n-body units just fine with 64-bit BOINC version 7.0.64.
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=578701735

You can always download any BOINC version from here if you choose:
http://boinc.berkeley.edu/dl/?C=M;O=D
Morten Ross

Joined: 18 Feb 10
Posts: 15
Credit: 554,442,004
RAC: 0
Message 60141 - Posted: 11 Oct 2013, 8:54:53 UTC - in response to Message 60102.  

One of my hosts started experiencing computation errors one day ago.

The first change is that all tasks stay 10 or more seconds at 100% complete.
M@H Separation completes successfully, but computation times have increased due to the added 10+ seconds at 100% complete.

Thus I'm only able to crunch M@H Separation.

I've detached from and reattached to the project, with no change. The only system change was the Windows Update security patches on Wednesday.

Error excerpt:
Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified resource manager is not started or was shut down due to an error.

Failed to update checkpoint file ('separation_checkpoint_tmp' to 'separation_checkpoint') (2): No such file or directory
Failed to write final checkpoint
Failed to calculate likelihood
Morten Ross
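The log lines above suggest the application writes its checkpoint to a temporary file and then moves it over the real one. The sketch below shows that general write-then-rename pattern in Python. It is not the Separation application's actual code, and the function name is made up for illustration.

```python
import os
import tempfile


def write_checkpoint(path: str, data: bytes) -> None:
    """Write data to a temp file beside path, then rename it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix="_tmp")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())  # push the bytes to disk before renaming
        # Replace the old checkpoint in a single step, so a reader sees
        # either the old file or the new one, never a half-written file.
        os.replace(tmp_path, path)
    except OSError:
        # Clean up the temp file if the write or the rename failed.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

With this pattern, a failure at the rename step means the new checkpoint is never put in place, which is the step the first error message in the excerpt reports as failing.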
TJ

Joined: 12 Aug 09
Posts: 262
Credit: 92,631,041
RAC: 0
Message 60142 - Posted: 11 Oct 2013, 10:57:38 UTC

All my WUs failed after updating to driver 13.1. I have now updated to the latest driver, 13.9, on an HD5870 with BOINC 7.0.64.
Now the MilkyWay@Home v1.02 (opencl_amd_ati) tasks all end in a computation error within seconds, but the Milkyway@Home Separation (Modified Fit) v1.28 (opencl_amd_ati) tasks run fine.
Other projects run fine as well.
Is this still the problem introduced by the driver update at the beginning of this year, and is only 12.1 good for MilkyWay@home?
Greetings from,
TJ
mikey

Joined: 8 May 09
Posts: 3315
Credit: 519,940,055
RAC: 22,560
Message 60144 - Posted: 12 Oct 2013, 10:56:18 UTC - in response to Message 60142.  

All my WUs failed after updating to driver 13.1. I have now updated to the latest driver, 13.9, on an HD5870 with BOINC 7.0.64.
Now the MilkyWay@Home v1.02 (opencl_amd_ati) tasks all end in a computation error within seconds, but the Milkyway@Home Separation (Modified Fit) v1.28 (opencl_amd_ati) tasks run fine.
Other projects run fine as well.
Is this still the problem introduced by the driver update at the beginning of this year, and is only 12.1 good for MilkyWay@home?


I am running the beta version 13.10 and it is mostly working, but on some machines I DO get messages saying I am not using an "approved version". When I asked for a list of "approved versions", they told me they support ALL released versions, but no beta ones. I am running only the CPU units here right now and am NOT downgrading!!
Caboose-1

Joined: 26 Nov 10
Posts: 2
Credit: 11,226,655
RAC: 0
Message 60151 - Posted: 14 Oct 2013, 17:45:59 UTC - in response to Message 60144.  

I'm running into the same issue on the 13.9 drivers :/ It's frustrating.


©2024 Astroinformatics Group