Welcome to MilkyWay@home

Stuck again


Advanced search

Message boards : Number crunching : Stuck again
Message board moderation

To post messages, you must log in.

AuthorMessage
Steven Gaber

Send message
Joined: 20 May 21
Posts: 16
Credit: 413,255
RAC: 25
100 thousand credit badge1 year member badge
Message 73944 - Posted: 2 Jul 2022, 22:05:50 UTC

Once again, Milky Way is stuck on an N-VBody simulation for an hour and a half.
Progress remains at 73.24%, Elapsed time is 1:38: 17 and increasing while time remaining is 00:36:15 and increasing. If I left it alone, that could go on for days and days.

There have been several of these. When such a task is running, I notice that CoreTemp indicates a much lower temperature, like 114-120 degrees F instead of the normal range of 152-168 degrees. I guess the CPU is not active during those times?

When that happens, I just abort them.

Some n-body simulations don't do that, but run normally.

S, Gaber
ID: 73944 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 750
Credit: 361,752,637
RAC: 388,495
300 million credit badge11 year member badge
Message 73948 - Posted: 3 Jul 2022, 11:21:04 UTC - in response to Message 73944.  
Last modified: 3 Jul 2022, 11:22:20 UTC

Once again, Milky Way is stuck on an N-VBody simulation for an hour and a half.
Progress remains at 73.24%, Elapsed time is 1:38: 17 and increasing while time remaining is 00:36:15 and increasing. If I left it alone, that could go on for days and days.

There have been several of these. When such a task is running, I notice that CoreTemp indicates a much lower temperature, like 114-120 degrees F instead of the normal range of 152-168 degrees. I guess the CPU is not active during those times?

When that happens, I just abort them.

Some n-body simulations don't do that, but run normally.

S, Gaber
I have seen the same on other projects but never here. If you use Boinctasks you can get a much better picture of everything happening, including how much CPU each task is really using.

I too just abort anything that isn't being productive. Chances are it'll work on the next machine it's handed to. If it doesn't, the admin can see it has failed on all x machines the server tries.
ID: 73948 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileKeith Myers
Avatar

Send message
Joined: 24 Jan 11
Posts: 657
Credit: 508,257,763
RAC: 101,401
500 million credit badge12 year member badgeextraordinary contributions badge
Message 73951 - Posted: 3 Jul 2022, 17:10:06 UTC - in response to Message 73944.  

When you see this happening a simple reboot will solve the issue.
ID: 73951 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Stuck again

©2023 Astroinformatics Group