Welcome to MilkyWay@home

Computation Error on NBody Model

Message boards : Number crunching : Computation Error on NBody Model
Brent

Joined: 16 Mar 10
Posts: 12
Credit: 22,284,745
RAC: 0
Message 42186 - Posted: 16 Sep 2010, 15:00:27 UTC

I have a task (de_nbody_model1_1_2789_1284175894_3) that is showing a computation error after almost 24 hours (23:45:26) of compute time! What might have gone wrong and, more importantly, how can I prevent this from happening in the future? I hate to waste almost a day's work for nothing!
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 42188 - Posted: 16 Sep 2010, 15:42:22 UTC - in response to Message 42186.  

It looks like the maximum time was exceeded.

We're still working on a way to get a better flops estimate so that BOINC doesn't kill them after some amount of time.
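A rough sketch of why a poor flops estimate turns into a killed task. It assumes the usual BOINC behavior, where the client aborts a task once its elapsed time passes the workunit's `rsc_fpops_bound` divided by the estimated speed of the host. The function names and the numbers below are made up for illustration and are not taken from this project's workunits.

```python
def max_elapsed_seconds(rsc_fpops_bound: float, host_flops: float) -> float:
    """Time limit implied by a workunit's FLOP bound on a given host (assumed model)."""
    if host_flops <= 0:
        raise ValueError("host_flops must be positive")
    return rsc_fpops_bound / host_flops


def would_be_aborted(elapsed_seconds: float, rsc_fpops_bound: float, host_flops: float) -> bool:
    """True if the task has run longer than the implied limit."""
    return elapsed_seconds > max_elapsed_seconds(rsc_fpops_bound, host_flops)


# Hypothetical values: a bound of 1e14 floating-point operations on a host
# estimated at 2e9 FLOPS gives a limit of 50,000 seconds (about 13.9 hours).
bound = 1.0e14
flops = 2.0e9
limit = max_elapsed_seconds(bound, flops)

# A task that has run for 23:45:26 would be past that limit.
elapsed = 23 * 3600 + 45 * 60 + 26
killed = would_be_aborted(elapsed, bound, flops)
```

If the bound is set from an estimate that is far too low for a slow parameter combination, the task hits the limit even though nothing is actually wrong with the computation.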

This workunit's radius and mass look pretty close to the worst possible case.
The run times can vary greatly depending on the parameters. Changing multiple parameters at the same time makes run times annoying to predict. For example, here are some of the graphs I've been making while trying to come up with a prediction using a small number of bodies. Single-variable changes lead to large changes like:



Only changing the radii, the run times seem to vary by a factor of 200.
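One simple way to summarize a single-parameter sweep like this is a least-squares fit in log-log space. This is only a sketch: the power-law form, the function name `fit_power_law`, and the data points are all assumptions for illustration, not the fit or the measurements behind the graphs in this post.

```python
import math


def fit_power_law(xs, ys):
    """Least-squares fit of y = c * x**p in log-log space; returns (c, p)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    p = sxy / sxx                        # slope of the log-log line
    c = math.exp(mean_y - p * mean_x)    # intercept, mapped back from log space
    return c, p


# Synthetic placeholder data generated from an exact power law.
# These are NOT real N-body timings; the exponent is invented.
radii = [0.1, 0.2, 0.5, 1.0, 2.0]
times = [3.0 * r ** -1.5 for r in radii]

c, p = fit_power_law(radii, times)
```

A fit like this only covers one variable at a time, which is exactly why changing several parameters together is harder to predict.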




This one's my favorite, although it hasn't been fit yet:




Magister

Joined: 22 Nov 07
Posts: 8
Credit: 2,873,855
RAC: 0
Message 42328 - Posted: 23 Sep 2010, 18:49:59 UTC - in response to Message 42188.  

Just got one also

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=150393135

and I'm not the only one on this one :-(
(retired account)
Joined: 17 Oct 08
Posts: 36
Credit: 411,744
RAC: 0
Message 42332 - Posted: 23 Sep 2010, 20:46:55 UTC

This afternoon (12:23 UTC) I finished a workunit after more than 100 hours of run time. This unit, de_nbody_model1_1_35711_1284295325_2, had previously ended with a "max. time exceeded" error for my wingmen, so I decided to give it a bit more time. I guess it was valid, because it has already been purged from the database:
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=150672863

I was not at home this afternoon, so I'm not 100% sure, but if it were invalid, it should still be in the database, right?

Regards
Alex
Magister

Joined: 22 Nov 07
Posts: 8
Credit: 2,873,855
RAC: 0
Message 42346 - Posted: 24 Sep 2010, 12:55:51 UTC - in response to Message 42328.  

got a second one...

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=150666061

I will stop using Nbody for the moment.
(retired account)
Joined: 17 Oct 08
Posts: 36
Credit: 411,744
RAC: 0
Message 42352 - Posted: 24 Sep 2010, 19:14:21 UTC

Hi, I got a second one, which I gave some additional time to finish. It did so today after 185-something hours. Since validation is inconclusive, it is still in the database: workunit id 150689696, workunit name de_nbody_model1_1_52544_1284309885, task id 196714928. I guess I won't get those 3,956.93 credits claimed? *g*
