Nbody 1.68 release
log in

Advanced search

Message boards : News : Nbody 1.68 release

Author Message
Sidd
Project developer
Project tester
Project scientist
Send message
Joined: 19 May 14
Posts: 65
Credit: 356,131
RAC: 12

Message 67017 - Posted: 31 Jan 2018, 15:51:18 UTC

Hi All,

A new version, v1.68, of nbody has just been released. I have not yet released the mac multi-threaded version (OpenMP). I will release this at a later date.

In this release we have added a new way of constraining the width of the stream. Previously, we were using a measure of the velocity dispersion in each histogram bin. This led us to fit our parameters quite well. Unfortunately, we found that this may not be the best method in the long run. I have added a measure of the beta coordinate dispersion which, from initial findings, will be (hopefully) easier to fit our parameters with.

As always, please let me know if there are issues.

Thank you all for your continuing support,
Sidd

Profile Jake Weiss
Volunteer moderator
Project developer
Project tester
Project scientist
Send message
Joined: 25 Feb 13
Posts: 461
Credit: 12,275,554
RAC: 409

Message 67018 - Posted: 31 Jan 2018, 19:05:19 UTC

Congrats on the new version!

Tom*
Send message
Joined: 4 Oct 11
Posts: 37
Credit: 277,747,903
RAC: 254,185

Message 67019 - Posted: 31 Jan 2018, 21:38:17 UTC
Last modified: 31 Jan 2018, 21:38:55 UTC

Sidd,

All three of my systems running 1.68 get the following error, some more than others.

<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
The data is invalid.
(0xd) - exit code 13 (0xd)
</message>
<stderr_txt>
<search_application> milkyway_nbody 1.68 Windows x86_64 double OpenMP, Crlibm </search_application>
Using OpenMP 4 max threads on a system with 8 processors
Error evaluating NBodyCtx: [string "-- /* Copyright (c) 2016 Siddhartha Shelton..."]:81: bad argument #1 to 'create' (Missing required named argument 'BetaSigma')
Failed to read input parameters file
14:52:09 (8888): called boinc_finish(13)

My I7 systems get very few but my I5 system gets a ton.

Yet are completed ok by other systems on a resend

Sidd
Project developer
Project tester
Project scientist
Send message
Joined: 19 May 14
Posts: 65
Credit: 356,131
RAC: 12

Message 67020 - Posted: 1 Feb 2018, 0:44:44 UTC - in response to Message 67019.

Thanks for letting me know!! I'm checking it out right now.

Sidd
Project developer
Project tester
Project scientist
Send message
Joined: 19 May 14
Posts: 65
Credit: 356,131
RAC: 12

Message 67022 - Posted: 1 Feb 2018, 1:10:13 UTC - in response to Message 67019.

I believe I found the workunit that was from.

Because I added an entirely new calculation, there were some new parameters needed for future flexibility. Therefore, if you were to use the old parameter files on the new binary it would give that error. It seems for some reason, that workunit did exactly that, the binary being used is the nbody v168 but the workunit is from the v166 runs, using the v166 parameter files. Before releasing, I took down the older runs, and so I was not expecting the work units to do this, and for that I apologize.

Fortunately, this error would occur right at the beginning, before anything began to run so it will not cause any wasted computational time. If you have any v166 runs in your queue, you can go ahead and cancel them so they do not give this error.

Tom*
Send message
Joined: 4 Oct 11
Posts: 37
Credit: 277,747,903
RAC: 254,185

Message 67023 - Posted: 1 Feb 2018, 4:01:47 UTC

Thanks Sidd

MossyRock
Send message
Joined: 27 Sep 17
Posts: 4
Credit: 634,323
RAC: 5,202

Message 67025 - Posted: 1 Feb 2018, 22:55:56 UTC

Sidd,

Most of my v168 runs are blowing up. Do I abort the v168 runs in queue?

Thanks.

Tom*
Send message
Joined: 4 Oct 11
Posts: 37
Credit: 277,747,903
RAC: 254,185

Message 67026 - Posted: 2 Feb 2018, 1:48:00 UTC
Last modified: 2 Feb 2018, 1:53:07 UTC

Mossy,

The problem is when the V168 application tries to process V166 data

You have same issue as I if you look inside the stderr

de_nbody_1_13_2018_v166_20k__optimizerparameters_diff_seedruns_3_1516211024_96183_4

this is the data version.

As Sidd says it only takes 2 or 3 seconds to fail, so if there are no
other ramifications (like not getting new tasks:-)) just let them run.

otherwise you have to highlite the task in the task list in BOINC
then choose properties to see the version v166 or v168

MossyRock
Send message
Joined: 27 Sep 17
Posts: 4
Credit: 634,323
RAC: 5,202

Message 67027 - Posted: 2 Feb 2018, 4:21:24 UTC - in response to Message 67026.

Tom,

Gotcha.

Thanks for letting me know how to find the mis-matches in the "ready to start" state. I just aborted a few.

Schwerrechner
Send message
Joined: 9 Feb 17
Posts: 1
Credit: 42,470
RAC: 809

Message 67028 - Posted: 2 Feb 2018, 12:23:56 UTC

Hey, I am new here. I am getting errors on nbody calculating the optimizerparameter with a Ryzen 1700. The Cpu is prime stable tho I get the erros only on nbody-optimizertasks.
Everything else works fine.

mmonnin
Send message
Joined: 2 Oct 16
Posts: 79
Credit: 64,146,845
RAC: 386,735

Message 67042 - Posted: 9 Feb 2018, 13:54:41 UTC

These are still being sent out. :(
https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1566643057

mmonnin
Send message
Joined: 2 Oct 16
Posts: 79
Credit: 64,146,845
RAC: 386,735

Message 67047 - Posted: 9 Feb 2018, 22:44:59 UTC

And another
https://milkyway.cs.rpi.edu/milkyway/result.php?resultid=2255257329

Yavanius
Avatar
Send message
Joined: 27 Jan 15
Posts: 6
Credit: 1,025,134
RAC: 14,987

Message 67050 - Posted: 10 Feb 2018, 5:26:00 UTC - in response to Message 67017.

I keep the intermittent N-body that runs and runs... the last one I aborted at 15 hours...

https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1573440465

Sometimes if I restart BOINC, they'll run properly, but I've seen them bite the dust too shortly after too...

Yavanius
Avatar
Send message
Joined: 27 Jan 15
Posts: 6
Credit: 1,025,134
RAC: 14,987

Message 67056 - Posted: 10 Feb 2018, 18:05:49 UTC - in response to Message 67017.

Went searching for an answer to this, but couldn't find an answer:

Why is the N-Body credit such a pittance with double digit credit even when run time is the same as the regular WU?

Mr McGill
Send message
Joined: 13 Nov 17
Posts: 2
Credit: 858,231
RAC: 11,212

Message 67058 - Posted: 10 Feb 2018, 22:26:24 UTC

had great hopes the new model Nbody would resolve faults: still getting Nbody trapped, sometimes suspend helps (so far i am around 5 successes to 20 failures), restart has so far not.

Second thought: with our Nbody fails being a pain in the proverbial, are they responsible for some of the failed reporting stuff? In particular their runtime can exceed their report times, which could also cause failures on single processor tasks paused to complete a multicore Nbody that never sees completion?

Profile ritterm
Avatar
Send message
Joined: 16 Jun 08
Posts: 92
Credit: 363,556,267
RAC: 21,623

Message 67060 - Posted: 11 Feb 2018, 2:36:29 UTC

How does this happen?

Stderr output
<core_client_version>7.8.6</core_client_version>
<![CDATA[
<message>
process exited with code 13 (0xd, -243)</message>
<stderr_txt>
<search_application> milkyway_nbody 1.66 Darwin x86_64 double OpenMP, Crlibm </search_application>
Using OpenMP 8 max threads on a system with 8 processors
Application version too old. Workunit requires version 1.68, but this is 1.66
Failed to read input parameters file
04:21:59 (82996): called boinc_finish(13)

</stderr_txt>
]]>
____________

Profile ritterm
Avatar
Send message
Joined: 16 Jun 08
Posts: 92
Credit: 363,556,267
RAC: 21,623

Message 67062 - Posted: 11 Feb 2018, 12:28:20 UTC - in response to Message 67022.

Sidd wrote:
It seems for some reason, that workunit did exactly that, the binary being used is the nbody v168 but the workunit is from the v166 runs, using the v166 parameter files. Before releasing, I took down the older runs...

Maybe I misunderstand, but v166 tasks are still being sent out.

Tom*
Send message
Joined: 4 Oct 11
Posts: 37
Credit: 277,747,903
RAC: 254,185

Message 67068 - Posted: 11 Feb 2018, 18:05:24 UTC
Last modified: 11 Feb 2018, 18:16:12 UTC

Think we need a new version of the application that can process both
v166 and v168 data file formats.

PLEASE

I have only been getting v166 lately is there a pointer the the v166 application?

Profile ritterm
Avatar
Send message
Joined: 16 Jun 08
Posts: 92
Credit: 363,556,267
RAC: 21,623

Message 67093 - Posted: 16 Feb 2018, 19:41:05 UTC

It looked good for a few days, but I've picked up some v166 tasks recently (see examples).


Post to thread

Message boards : News : Nbody 1.68 release


Main page · Your account · Message boards


Copyright © 2018 AstroInformatics Group