rpi_logo
New Separation Runs Started
New Separation Runs Started
log in

Advanced search

Message boards : News : New Separation Runs Started

Author Message
Jeffery M. Thompson
Volunteer moderator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 23 Sep 12
Posts: 149
Credit: 12,937,156
RAC: 4,599

Message 57700 - Posted: 28 Mar 2013, 22:14:30 UTC

New Separation Runs

ps_separation_82_3s_dr8_3
de_separation_82_3s_dr8_3


These have increased the number of steps from

ps_separation_82_3s_dr8_2
de_separation_82_3s_dr8_2

Which should fix the errors people were reporting.

Please report problems here.

Thank you,
Jeff Thompson

Herge
Send message
Joined: 11 Oct 09
Posts: 19
Credit: 192,462,972
RAC: 0

Message 57701 - Posted: 28 Mar 2013, 22:59:37 UTC

Still failing with GPU (units 330082567, 330083256, 330082966). Someone completed 330082567 though.

Smetchewit
Send message
Joined: 9 Feb 12
Posts: 3
Credit: 106,190,896
RAC: 42,132

Message 57702 - Posted: 28 Mar 2013, 23:02:58 UTC

The new Version: ps_separation_82-3s-dr8-3_1358941502_29605592
works also not,

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=330033046 old one
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=330079218 new one

Windows 7 Professionel
Radeon Sapphire 7950 3gb 810 Cpu MHZ
Driver Catalyst 13.1

Normal Mw Wu ps_separation_23_3s_sSgr_1_1358941502_29575819 works fine

Smetchewit
Send message
Joined: 9 Feb 12
Posts: 3
Credit: 106,190,896
RAC: 42,132

Message 57703 - Posted: 28 Mar 2013, 23:10:40 UTC

The WU´s failling 0.00 seconds to maximum 0.05 seconds. cpu time
all 5 new wu`s are failling.

Smetchewit
Send message
Joined: 9 Feb 12
Posts: 3
Credit: 106,190,896
RAC: 42,132

Message 57704 - Posted: 28 Mar 2013, 23:16:54 UTC

http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429564623
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429563200
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429562269
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429561997
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429560857
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429559136
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=429558309

All Wu´s are failling both of the new ones ps_separation and de_separation

Profile Arion
Avatar
Send message
Joined: 10 Aug 08
Posts: 218
Credit: 41,846,854
RAC: 0

Message 57705 - Posted: 28 Mar 2013, 23:23:05 UTC
Last modified: 28 Mar 2013, 23:24:02 UTC

de_separation_82_3s_dr8_2_1358941502_29558890_2 is reporting that it will take 87600:00:00 to complete on ATI graphics !!!!!!

System is not sending completed or calling for new ones at the present time. Unknown why other than it probably thinks I have plenty to do with that huge wu. *shrug*
____________

GaryG
Avatar
Send message
Joined: 29 Aug 12
Posts: 31
Credit: 40,781,945
RAC: 0

Message 57706 - Posted: 28 Mar 2013, 23:56:09 UTC

I got the same time estimate, 87600:00:00, on an AMD card. The task locked up the BOINC client and it would not restart. I ended up renaming the milky way directory and restarting. A new directory was created and all the nbody's ended up in a computation error, I had suspended the GPU tasks prior to this so they were recovered and all is running again.

____________

Jeffery M. Thompson
Volunteer moderator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 23 Sep 12
Posts: 149
Credit: 12,937,156
RAC: 4,599

Message 57707 - Posted: 29 Mar 2013, 0:13:10 UTC

I will pull them back down and check through all the parameters. I will also try some different variants on the test machine before reposting..

Thank you for the feedback.


Jeff Thompson

GaryG
Avatar
Send message
Joined: 29 Aug 12
Posts: 31
Credit: 40,781,945
RAC: 0

Message 57708 - Posted: 29 Mar 2013, 0:27:24 UTC

I thought the nbody's were CPU only, the ones with the 87600:00:00 time estimate come in marked for the GPU. My system appears to try to start them on the GPU and I am wondering if this is what caused the crash?

Jeffery M. Thompson
Volunteer moderator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 23 Sep 12
Posts: 149
Credit: 12,937,156
RAC: 4,599

Message 57709 - Posted: 29 Mar 2013, 0:54:40 UTC

Separation doesn't use the nbody application and has GPU support.


Jeff Thompson

Profile Arion
Avatar
Send message
Joined: 10 Aug 08
Posts: 218
Credit: 41,846,854
RAC: 0

Message 57710 - Posted: 29 Mar 2013, 2:11:42 UTC
Last modified: 29 Mar 2013, 2:58:28 UTC

I have 2 more on another system that is CPU processing only

de_separation_82_3s_dr8_1358941502_29567987_1
de_separation_82_3s_dr8_1358941502_29616483_0

Both showing the same time for completion 87600:00:00

Previous system running with ATI card has updated and now sending and receiving WUs. Don't know what happened to the one that had the extended time frame unless it borked and got tossed.

Edit - These WUs are now showing up as invalid with too many errors. Guess that sorts it out. 3 different systems processed them with same results.
____________

CraniuMod
Send message
Joined: 18 Jan 10
Posts: 1
Credit: 58,120,741
RAC: 1

Message 57712 - Posted: 29 Mar 2013, 5:32:14 UTC
Last modified: 29 Mar 2013, 5:36:07 UTC

I have several that reported 87600 hours to complete (curiously 3650 days) that I aborted as they forced others to high priority.

Computer 479613

de_separation_82-3s-dr8-3_1358941502_29610520
ps_separation_82-3s-dr8-3_1358941502_29611872
ps_separation_82-3s-dr8-3_1358941502_29611626
ps_separation_82-3s-dr8-3_1358941502_29614009
de_separation_82-3s-dr8-3_1358941502_29612832
ps_separation_82-3s-dr8-3_1358941502_29615173
ps_separation_82-3s-dr8-3_1358941502_29617289
de_separation_82-3s-dr8-3_1358941502_29613679

If you want to have a look see.

GaryG
Avatar
Send message
Joined: 29 Aug 12
Posts: 31
Credit: 40,781,945
RAC: 0

Message 57713 - Posted: 29 Mar 2013, 11:08:33 UTC

Well I am not sure what is happening but I woke up with the system down and unable to restart again. Again had to dump the Milkyway directory to recover.
The task was from the de_separation_82_3s_dr8_3 group.

GaryG
Avatar
Send message
Joined: 29 Aug 12
Posts: 31
Credit: 40,781,945
RAC: 0

Message 57714 - Posted: 29 Mar 2013, 18:00:22 UTC

Got two of these in row after a restart, which results in a client lockup. These always have the 87600:00:00 time estimate and always crash my client. This is making extremely difficult to run tasks on my system. Any chance these are going to be corrected?

ps_separation_82_3s_dr8_2_1358941502_29566368_1

<core_client_version>7.0.33</core_client_version>
<![CDATA[
<message>
- exit code -1073740940 (0xc0000374)
</message>
<stderr_txt>
BOINC: parse gpu_opencl_dev_index 0
<search_application> milkyway_separation 1.02 Windows x86 double OpenCL </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 5
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: allow_non_preferred_apps
Skipping: 1
Skipping: /allow_non_preferred_apps
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Integral area { 1819239215, 1769234796, 1008738314 } will overflow progress calculation

</stderr_txt>
]]>

____________

JZD
Send message
Joined: 31 Dec 11
Posts: 4
Credit: 82,141,812
RAC: 8,801

Message 57715 - Posted: 29 Mar 2013, 18:23:47 UTC

I have several tasks with the same error.
de_separation_82-3s-dr8-3-test_1358941502_29860874_1
stderr output

&lt;core_client_version&gt;6.10.58&lt;/core_client_version&gt; &lt;![CDATA[ &lt;message&gt; process exited with code 1 (0x1, -255) &lt;/message&gt; &lt;stderr_txt&gt; &lt;search_application&gt; milkyway_separation 1.00 Linux x86_64 double &lt;/search_application&gt; Unrecognized XML in project preferences: max_gfx_cpu_pct Skipping: 20 Skipping: /max_gfx_cpu_pct Unrecognized XML in project preferences: apps_selected Skipping: app_id Skipping: app_id Skipping: /apps_selected Unrecognized XML in project preferences: nbody_graphics_poll_period Skipping: 30 Skipping: /nbody_graphics_poll_period Unrecognized XML in project preferences: nbody_graphics_float_speed Skipping: 5 Skipping: /nbody_graphics_float_speed Unrecognized XML in project preferences: nbody_graphics_textured_point_size Skipping: 250 Skipping: /nbody_graphics_textured_point_size Unrecognized XML in project preferences: nbody_graphics_point_point_size Skipping: 40 Skipping: /nbody_graphics_point_point_size Error loading Lua script 'astronomy_parameters.txt': [string &quot;number_parameters: 4...&quot;]:1: '&lt;name&gt;' expected near '4' Error reading astronomy parameters from file 'astronomy_parameters.txt' Trying old parameters file Integral area { 1819239215, 1769234796, 1008738314 } will overflow progress calculation Error reading parameters file Failed to read parameters file 18:28:56 (17086): called boinc_finish &lt;/stderr_txt&gt; ]]&gt;

Allen Paschke
Send message
Joined: 3 Jan 13
Posts: 1
Credit: 3,640,781
RAC: 2,018

Message 57718 - Posted: 29 Mar 2013, 19:08:04 UTC - in response to Message 57700.

Today, all the Milky Way runs have ended with a "Computational Error".

I received 1 download that would require 28,000+ hours to run. I aborted this run.

Herge
Send message
Joined: 11 Oct 09
Posts: 19
Credit: 192,462,972
RAC: 0

Message 57719 - Posted: 29 Mar 2013, 19:57:47 UTC

ps_separation_82 are still being released and they all crash within a second.

Jeffery M. Thompson
Volunteer moderator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 23 Sep 12
Posts: 149
Credit: 12,937,156
RAC: 4,599

Message 57720 - Posted: 29 Mar 2013, 20:29:36 UTC

Once we close the old runs there are still some runs in the systems that process through.


I am using a versioning number at the end of the runs and that tells you which release it was.

de_p_82_3s_dr8_2
ps_p_82_3s_dr8_2

de_p_82_3s_dr8_3
ps_p_82_3s_dr8_3

Were the previous runs I have re-confirmed that they are not active runs. So old units were still in the queue with the other separation runs.

I have

de_p_82_3s_dr8_4
ps_p_82_3s_dr8_4

starting and am posting on the changes on those in the next thread.



Jeff Thompson


Post to thread

Message boards : News : New Separation Runs Started


Main page · Your account · Message boards


Copyright © 2018 AstroInformatics Group