Welcome to MilkyWay@home

Must set rsc_memory_bound correctly

Jacob Klein
Joined: 22 Jun 11
Posts: 32
Credit: 41,852,496
RAC: 0
Message 61458 - Posted: 1 Apr 2014, 1:19:52 UTC

MilkyWay Team:

You need to change your work unit parameters so that <rsc_memory_bound> is set correctly. BOINC 7.3.14 alpha (and potentially future versions) reads that value, compares it to the working set size, and auto-aborts the work unit if the working set size exceeds the bound.

Right now I am getting errors because the bound on your work units is lower than the memory they actually use.

For example:
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=703493619
Exit status 198 (0xc6) EXIT_MEM_LIMIT_EXCEEDED
<core_client_version>7.3.14</core_client_version>
<![CDATA[
<message>
working set size > workunit.rsc_memory_bound: 97.08MB > 47.68MB
</message>
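
For anyone trying to read the numbers in that message, here is a rough sketch of the comparison it describes. This is not BOINC's actual code. The function names are made up for illustration, and I am assuming "MB" in the log means 2**20 bytes. Under that assumption, 47.68MB corresponds to a bound of 50,000,000 bytes.

```python
# Hypothetical sketch of the 7.3.14 alpha check, NOT the real BOINC client code.
# Assumption: the "MB" printed in the log is 2**20 bytes.
MB = 1024 * 1024


def exceeds_memory_bound(working_set_bytes, rsc_memory_bound_bytes):
    """Return True if the task's working set is larger than the workunit's bound."""
    return working_set_bytes > rsc_memory_bound_bytes


def format_abort_message(working_set_bytes, rsc_memory_bound_bytes):
    """Build a message shaped like the one in the stderr output above."""
    return "working set size > workunit.rsc_memory_bound: %.2fMB > %.2fMB" % (
        working_set_bytes / MB,
        rsc_memory_bound_bytes / MB,
    )


if __name__ == "__main__":
    wss = 97.08 * MB          # working set size reported in the log
    bound = 50_000_000        # assumed bound in bytes (about 47.68MB)
    if exceeds_memory_bound(wss, bound):
        print(format_abort_message(wss, bound))
```

If that reading is right, the bound would need to be raised above the real working set size (roughly 100MB here) for this kind of check to pass.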

Could you please promptly fix this?

Regards,
Jacob Klein
Jacob Klein
Joined: 22 Jun 11
Posts: 32
Credit: 41,852,496
RAC: 0
Message 61459 - Posted: 1 Apr 2014, 2:09:18 UTC
Last modified: 1 Apr 2014, 2:09:28 UTC

It looks like this change is being reverted for now, per David's email below.
So, there is no longer an immediate need to correct the value...
But please consider setting it correctly at some point, in case it gets used by the client in the future.


> Date: Mon, 31 Mar 2014 18:53:33 -0700
> From: d..a@ssl.berkeley.edu
> To: b..c_alpha@ssl.berkeley.edu
> Subject: Re: [boinc_alpha] 7.3.14 - Heads up - Memory bound enforcement
>
> On further thought, I'm going to change things back to the way they were, namely
>
> 1) workunit.rsc_memory_bound is used only by the server;
> it won't send a job if rsc_memory_bound > host's available RAM
> 2) the client aborts a job if working set size > host's available RAM
> 3) the client will run a set of jobs only if the sum of their WSSs
> fits in available RAM
> (i.e. if a job's WSS is close to all available RAM,
> it would run that job and nothing else)
>
> The reason for not aborting jobs when WSS > rsc_memory_bound is that
> it requires projects to come up with very accurate estimates of RAM usage,
> which I don't think is feasible in general.
> Also, it will lead to lots of aborted jobs, which is bad for volunteer morale.
>
> -- David
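
To make the three rules in David's email concrete, here is a minimal sketch of how I read them. It is only an illustration, not the real scheduler or client logic. All function names are hypothetical, and the first-fit ordering in rule 3 is my own simplification, since the email does not say how the client chooses among jobs.

```python
# Hypothetical illustration of the three rules from the email above.
# None of these functions exist in BOINC; all sizes are in bytes.


def server_should_send(rsc_memory_bound, host_available_ram):
    """Rule 1: the server won't send a job if rsc_memory_bound > available RAM."""
    return rsc_memory_bound <= host_available_ram


def client_should_abort(working_set_size, host_available_ram):
    """Rule 2: the client aborts a job only if its WSS > available RAM."""
    return working_set_size > host_available_ram


def pick_runnable(jobs, host_available_ram):
    """Rule 3: run a set of jobs only if the sum of their WSSs fits in RAM.

    `jobs` is a list of (name, working_set_size) pairs. Jobs are admitted
    in list order while they still fit (an assumed first-fit policy).
    """
    chosen = []
    used = 0
    for name, wss in jobs:
        if used + wss <= host_available_ram:
            chosen.append(name)
            used += wss
    return chosen


if __name__ == "__main__":
    GB = 1024 ** 3
    ram = 4 * GB
    jobs = [("big", int(3.9 * GB)), ("small", int(0.5 * GB))]
    # A job whose WSS is close to all available RAM runs alone.
    print(pick_runnable(jobs, ram))
```

Note that under these rules a task whose working set exceeds rsc_memory_bound is not aborted, as long as it still fits in the host's available RAM.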