Welcome to MilkyWay@home

Still have probelms with n-body WUs

Message boards : Number crunching : Still have probelms with n-body WUs
Message board moderation

To post messages, you must log in.

AuthorMessage
Bill Walker

Send message
Joined: 19 Aug 09
Posts: 23
Credit: 631,303
RAC: 0
Message 49155 - Posted: 6 Jun 2011, 20:05:19 UTC

Glad the project is back, but my first new WU was another N-Body task that failed within a few minutes. The task is gone now, but here is the message log for its brief existance. Anybody know what is up with these?

06/06/2011 2:55:57 PM Milkyway@home Starting de_nbody_orphan_test_2model_4_144_1306331281_1
06/06/2011 2:55:58 PM Milkyway@home Starting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 2:59:03 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:02:06 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:05:08 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:08:11 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:11:13 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:14:16 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:17:19 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:20:21 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:23:24 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:26:26 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:29:29 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:32:32 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:35:34 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:38:37 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:41:40 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:44:42 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:47:45 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:50:48 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:53:50 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_144_1306331281_1 using milkyway_nbody version 40
06/06/2011 3:54:02 PM Milkyway@home Computation for task de_nbody_orphan_test_2model_4_144_1306331281_1 finished

ID: 49155 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Send message
Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 49177 - Posted: 7 Jun 2011, 23:53:45 UTC - in response to Message 49155.  

I don't see any error here?
ID: 49177 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bill Walker

Send message
Joined: 19 Aug 09
Posts: 23
Credit: 631,303
RAC: 0
Message 49182 - Posted: 8 Jun 2011, 11:54:57 UTC - in response to Message 49177.  
Last modified: 8 Jun 2011, 12:01:44 UTC

Every time the n-body task restarts in BOINC I get a Windows error message, "Task ended in error". I sometimes get 30 or 40 of these before the WU gives up with about 10 or 20 minutes of actual CPU time (compared to 300+ hours estimated run time when it downloads). BOINC task list shows "computation error" for the task at that time. All my n-body WUs have had this behaviour for several weeks now.

Just an observation: the BOINC message list shows the task restarting, but doesn't ever show it stopping. It appears to be Windows that kills it. Also note that no other tasks are show starting in between the MW restarts (this machine has 2 CPUs, and usually runs 3 or 4 different projects).

I've tried reloaded the C++ library, as suggested in another thread. Just got another n-body WU, waiting to see what happens with this one.
ID: 49182 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bill Walker

Send message
Joined: 19 Aug 09
Posts: 23
Credit: 631,303
RAC: 0
Message 49190 - Posted: 10 Jun 2011, 0:00:30 UTC - in response to Message 49182.  

OK, here is the windows message I receive:

"milkyway_nbody_0.40_windows_x86_64_mt... has stopped working

A problem has caused the program to stop working correctly. Windows will close the program and notify you if a solution is avialable."

I have 20 of these right now. 21 by the time I had typed that. Up to 23 just before I sent this.

Here are the associated BOINC messages:

09/06/2011 6:49:34 PM Milkyway@home Computation for task de_separation_10_3s_fix20_2_188437_1307510342_0 finished
09/06/2011 6:49:34 PM Milkyway@home Starting de_nbody_orphan_test_2model_4_31278_1307524107_0
09/06/2011 6:49:34 PM Milkyway@home Starting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 6:52:36 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 6:55:39 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 6:58:44 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:01:55 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:04:57 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:08:00 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:11:03 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:14:07 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:17:09 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:20:15 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:23:18 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:26:20 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:29:23 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:32:26 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:35:28 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:38:31 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:41:33 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:44:36 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:47:39 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40
09/06/2011 7:50:41 PM Milkyway@home Restarting task de_nbody_orphan_test_2model_4_31278_1307524107_0 using milkyway_nbody version 40


According to the BOINC tasks tab the WU has been running for 44 seconds. I expect it to die shortly.

WU died at 01:08 cpu time. BOINC task tab says "computation error" but messages tab just says

09/06/2011 7:58:14 PM Milkyway@home Computation for task de_nbody_orphan_test_2model_4_31278_1307524107_0 finished

ID: 49190 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bill Walker

Send message
Joined: 19 Aug 09
Posts: 23
Credit: 631,303
RAC: 0
Message 49191 - Posted: 10 Jun 2011, 0:02:17 UTC

Here is the database info for that WU:
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=45875226
ID: 49191 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Still have probelms with n-body WUs

©2024 Astroinformatics Group