Welcome to MilkyWay@home

Posts by ritterm

21) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63514)
Posted 4 May 2015 by Profile ritterm
Post:
Could somebody pass David's message over to CERN/CMS-dev, please? I don't even have an invitation code to create a posting account.

Done.
22) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63502)
Posted 3 May 2015 by Profile ritterm
Post:
I'm glad you got it fixed, that is strange. Could they be trying to use the same file name?

Me, too! :-) I'm really not sure what's going on with the other project, but you can read more in this thread over at CERN/CMS-dev.
23) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63498)
Posted 2 May 2015 by Profile ritterm
Post:
THAT'S the file you want to look at, post it here if you can please.

Well, other than the initial remarks about the "Maximum disk usage exceeded" error and what appears to be the lack of results data at the end, the rest of the file looks virtually identical to the stderr output of a valid task.

However, I think my problem is solved. Following the suggestion of a forum post about a similar problem at another project, I checked my host's slots directories and found two "stray" VM image files left by one of the VM projects (probably CERN's CMS-dev), each of which was over 5GB. I deleted those files and slots and have been running trouble free for almost 12 hours. I'm not sure, though, that I understand why that was the problem.
24) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63487)
Posted 1 May 2015 by Profile ritterm
Post:
I suspended all tasks then resumed them one at a time and waited for one to crash. It didn't take too long, but all that's left in the directory is the stderr output file and it's only 4KB. If something "big" was generated before being deleted, I wasn't able to see anything.

The only message in the BOINC manager log related to the task is something similar to this:

Aborting task de_80_DR8_Rev_8_5_00004_1429700402_4384432_0: exceeded disk limit: 5115.01MB > 14.31MB

I just don't understand what's going on. Only about 5% of the tasks are failing for me and others don't seem to be having any problem with them.
25) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63482)
Posted 30 Apr 2015 by Profile ritterm
Post:
A bad driver by itself wouldn't cause a disk limit error - unless it's spewing out yards and yards of error messages. Look in the slot directory...

I'm afraid I'm not sure I know what to look for. These tasks are failing right away, after only 1-2 seconds of run time. I don't see anything changing in the slot directory when this happens (\ProgramData\BOINC\slots, right?).
26) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63478)
Posted 29 Apr 2015 by Profile ritterm
Post:
...I think it's to do with <rsc_disk_bound>...

That's what I was thinking, too, of course. However, I now think I might have a GPU hardware problem -- all the tasks I've checked that errored out for me have been completed by another host without a problem. If the tasks I ran had bad parameters, would the same task work for another host?

When I upgraded the video driver, I went to the AMD website, downloaded and ran their auto-detect tool, and let it pick and install a new driver. Is there anything else I need to install?
27) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63470)
Posted 27 Apr 2015 by Profile ritterm
Post:
Additional info, in case it matters...

I have local preferences set that are pretty wide-open, I think. The host has an 1TB HDD with only about 250GB used. Local disk limits are set to:

Use at most -- 150GB (most restrictive)
Leave at least -- 0.1 GB (least restrictive)
Use at most -- 50% of total (less restrictive)

The BOINC manager says that 26 GB is used for BOINC with 124GB available and that MS@H is using less than 240MB.
28) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63469)
Posted 27 Apr 2015 by Profile ritterm
Post:
These things keep coming. Updating to the latest driver doesn't seem to have helped and others running the same tasks don't seem to have any problems, either. I'm thinking I have a hardware problem... :-(
29) Message boards : Number crunching : 196 (0xc4) EXIT_DISK_LIMIT_EXCEEDED (Message 63461)
Posted 24 Apr 2015 by Profile ritterm
Post:
I'm getting a few "Maximum disk usage exceeded" errors. I don't think I've ever seen this here.

For my tasks, the client_state file shows:

<rsc_disk_bound>15000000.000000</rsc_disk_bound>

Example task result shows Peak disk usage 5,741.01 MB.
30) Message boards : News : New Nbody version 1.46 (Message 62977)
Posted 8 Jan 2015 by Profile ritterm
Post:
It seems there has been an issue with the newly added multithreading routines in this version...

Is it just the MTs? I've seen the same kind of errors in the non-MTs. WU 690529199, for example.
31) Message boards : Number crunching : MilkyWay@Home v1.02 (opencl_amd_ati) Compute Errors (Message 62846)
Posted 15 Dec 2014 by Profile ritterm
Post:
Hope they figure it out soon :)

They look to be making their way out of the system as I don't have nearly as many, but they're still out there. A recent result is WU 671517190.
32) Message boards : Number crunching : MilkyWay@Home v1.02 (opencl_amd_ati) Compute Errors (Message 62837)
Posted 15 Dec 2014 by Profile ritterm
Post:
Seeing a lot of compute errors on recent work. Wingmen are failing, too. Typical example:

WU 671510169

Stderr output contains these lines:

Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'

All of my failed work starts with "de_80_DR8_Rev_8_4". All other "de_8x" work seems okay so far.
33) Message boards : Number crunching : Feeder Down (Message 62385)
Posted 24 Sep 2014 by Profile ritterm
Post:
Sorry about that! I've been at a conference in Baltimore...I ran back to my hotel room on break to get things fixed. Sorry about the delay!

Well, certainly. You are allowed to have a life apart from keeping multiple BOINC projects running...or is that the other Travis?! :D Regardless, thanks for the effort getting us back in business here!

Cheers,

MarkR
34) Message boards : Number crunching : Feeder Down (Message 62376)
Posted 23 Sep 2014 by Profile ritterm
Post:
See the follow thread in the News area...

I guess it was too much to hope for a response in an on-topic thread... :D (Please excuse the snark...I'm not like that in real life.)
35) Message boards : Number crunching : Feeder Down (Message 62373)
Posted 23 Sep 2014 by Profile ritterm
Post:
*crickets*
36) Message boards : Number crunching : Feeder Down (Message 62368)
Posted 22 Sep 2014 by Profile ritterm
Post:
As of post time, the feeder is down. First seen by me at about 1951 UTC.
37) Message boards : News : Introduction (Message 61772)
Posted 23 May 2014 by Profile ritterm
Post:
Blurf wrote:
Welcome aboard Siddhartha!

+1 :-)
38) Message boards : Number crunching : SERVER DOWN (Message 61085)
Posted 11 Feb 2014 by Profile ritterm
Post:
Jeffery M. Thompson wrote:
This was not planned maintenance.
We had a problem with the database...

Thanks for the update, Jeff! :-)
39) Message boards : News : Some new work units running (Message 61062)
Posted 11 Feb 2014 by Profile ritterm
Post:
Jeffery M. Thompson wrote:
The error you are getting say it can not load the function. Your binary says it is an anonymous binary which means...

Okay, okay...It's not my intention to hijack this thread, but could you or another admin respond to the server issues brought up in the SERVER DOWN or Server Error: Feeder not running threads?
40) Message boards : News : incoming badges (Message 60888)
Posted 1 Feb 2014 by Profile ritterm
Post:
I've finally got some time to get badges up and going...

Well done, Travis, et al! Thanks for the good work... :-)


Previous 20 · Next 20

©2024 Astroinformatics Group