Welcome to MilkyWay@home

Stuck again

Message boards : Number crunching : Stuck again
Steven Gaber

Joined: 20 May 21
Posts: 20
Credit: 2,266,771
RAC: 3,216
Message 73944 - Posted: 2 Jul 2022, 22:05:50 UTC

Once again, Milky Way is stuck on an N-body simulation, this time for over an hour and a half.
Progress remains at 73.24%. Elapsed time is 1:38:17 and increasing, while time remaining is 00:36:15 and also increasing. If I left it alone, that could go on for days.

There have been several of these. While such a task is running, CoreTemp shows a much lower temperature, around 114-120 degrees F instead of the normal range of 152-168 degrees F. I guess the CPU is not active during those times?

When that happens, I just abort them.

Other N-body simulations don't do that and run normally.

S. Gaber
Mr P Hucker

Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 73948 - Posted: 3 Jul 2022, 11:21:04 UTC - in response to Message 73944.  
Last modified: 3 Jul 2022, 11:22:20 UTC

I have seen the same on other projects but never here. If you use BoincTasks, you can get a much better picture of everything that is happening, including how much CPU each task is really using.

I too just abort anything that isn't being productive. Chances are it'll work on the next machine it's handed to. If it doesn't, the admin can see that it has failed on every machine the server sent it to.
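
The "is it really using CPU" check can be sketched in a few lines of Python. This is only an illustration, not anything BOINC or BoincTasks provides: the `Sample` type, the `looks_stalled` function and the 5% threshold are made-up names and values, and how you collect the numbers (by hand from the task list, or from a monitoring tool) is left open. The idea is to compare two readings of the same task and flag it when progress has not moved and almost no CPU time was used in between.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One reading of a task (hypothetical structure, for illustration)."""
    elapsed_s: float  # wall-clock seconds the task has been running
    cpu_s: float      # CPU seconds the task has actually consumed
    progress: float   # fraction done, 0.0 to 1.0


def looks_stalled(earlier: Sample, later: Sample, min_cpu_ratio: float = 0.05) -> bool:
    """Return True if the task made no progress and used almost no CPU
    between the two readings. min_cpu_ratio is an assumed threshold."""
    wall = later.elapsed_s - earlier.elapsed_s
    if wall <= 0:
        return False  # readings are not in order, so nothing can be concluded
    cpu_ratio = (later.cpu_s - earlier.cpu_s) / wall
    no_progress = later.progress <= earlier.progress
    return no_progress and cpu_ratio < min_cpu_ratio


# Two readings five minutes apart, ending at 1:38:17 elapsed (5897 s),
# with progress stuck at 73.24%. The CPU figures are invented.
before = Sample(elapsed_s=5597, cpu_s=4300, progress=0.7324)
after = Sample(elapsed_s=5897, cpu_s=4302, progress=0.7324)
print(looks_stalled(before, after))  # prints True
```

A task that shows no progress but is still using a full core is not flagged here, since it may simply be in a slow phase; only the combination of frozen progress and an idle CPU counts as stalled.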
Keith Myers

Joined: 24 Jan 11
Posts: 715
Credit: 555,441,958
RAC: 38,704
Message 73951 - Posted: 3 Jul 2022, 17:10:06 UTC - in response to Message 73944.  

When you see this happening, a simple reboot will solve the issue.


©2024 Astroinformatics Group