Welcome to MilkyWay@home

CPU Calculation errors on ps-separation tasks

Message boards : Number crunching : CPU Calculation errors on ps-separation tasks
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile John Black

Send message
Joined: 3 May 10
Posts: 74
Credit: 1,532,760
RAC: 0
Message 60753 - Posted: 16 Jan 2014, 23:03:34 UTC

Hi,
I have just had three calculation failures on ps-separation tasks. below is the sderr.

<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
Incorrect function.
(0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
<search_application> milkyway_separation 1.00 Windows x86 double </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 100
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (6801): Transaction support within the specified file system resource manager is not started or was shutdown due to an error.

Failed to update checkpoint file ('separation_checkpoint_tmp' to 'separation_checkpoint') (2): No such file or directory
Write checkpoint failed
22:11:28 (964): called boinc_finish

</stderr_txt>

I have been happily calculating these for some time without any failures so I am thinking that it may be a task failure.
Does anybody know what "incorrect function" means?

My thanks to all who provide suggestions. In the meantime I will continue with the tasks in the hope that some will be ok.

John [/b]
ID: 60753 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile John Black

Send message
Joined: 3 May 10
Posts: 74
Credit: 1,532,760
RAC: 0
Message 60754 - Posted: 16 Jan 2014, 23:11:30 UTC
Last modified: 16 Jan 2014, 23:11:55 UTC

Hi again,

I now have had 5 calculation failures on the trot so I am going to stop calculating in the hope that we can find an answer on this thread.

Thanks
John
ID: 60754 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile John Black

Send message
Joined: 3 May 10
Posts: 74
Credit: 1,532,760
RAC: 0
Message 60779 - Posted: 20 Jan 2014, 21:10:02 UTC

Hi again,

more faults on these ps_sepn_10_2s_sSgrFreeInertia tasks. here is the sderr

<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.00 Windows x86 double </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 100
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Integral 0 time = 12490.993018 s
Running likelihood with 107122 stars
Likelihood time = 2.492598 s
<background_integral> 0.000520275761890 </background_integral>
<stream_integral> 736.383873646955070 1685.907943371911900 </stream_integral>
<background_likelihood> -11.472450952046016 </background_likelihood>
<stream_only_likelihood> -3.130047180413889 -21.326269355458681 </stream_only_likelihood>
<search_likelihood> -3.130047177420612 </search_likelihood>
20:25:17 (7288): called boinc_finish

</stderr_txt>

Does anybody have any suggestions what "Unrecognized XML in project preferences: max_gfx_cpu_pct" might mean?

Thanks for looking
John
ID: 60779 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile John Black

Send message
Joined: 3 May 10
Posts: 74
Credit: 1,532,760
RAC: 0
Message 60808 - Posted: 25 Jan 2014, 12:04:24 UTC

Hi again,

it seems that my problems have gone as the latest tasks are working ok. I have no idea what caused the calculation errors but, as I have not changed my set up, it must have been a faulty batch of tasks??

Anyway they seem to be working ok now.

John
ID: 60808 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Penguin

Send message
Joined: 4 Mar 12
Posts: 45
Credit: 459,790,300
RAC: 3,243
Message 60811 - Posted: 25 Jan 2014, 20:26:54 UTC - in response to Message 60808.  

I'm getting errors on the GPU tasks for those work units.
ID: 60811 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile John Black

Send message
Joined: 3 May 10
Posts: 74
Credit: 1,532,760
RAC: 0
Message 60856 - Posted: 29 Jan 2014, 7:37:18 UTC - in response to Message 60811.  

Hi Penguin5540,

yeah nobody seems to have any ideas what is going wrong with these tasks.

That said I am now getting some sort of result that the system is accepting, though, when I look at the wingmates result they seem sort of different to me. I am no expert at what the dupe should look like in fact in some cases there is no dupe???

I am going to bash on and see what happens.

Thanks for at least commenting

Regards
John
ID: 60856 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Penguin

Send message
Joined: 4 Mar 12
Posts: 45
Credit: 459,790,300
RAC: 3,243
Message 60897 - Posted: 1 Feb 2014, 20:47:34 UTC - in response to Message 60856.  

Hopefully someone will fix the gpu and cpu task errors soon.
ID: 60897 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3319
Credit: 520,257,464
RAC: 20,597
Message 60905 - Posted: 2 Feb 2014, 12:45:08 UTC - in response to Message 60856.  

Hi Penguin5540,

yeah nobody seems to have any ideas what is going wrong with these tasks.

That said I am now getting some sort of result that the system is accepting, though, when I look at the wingmates result they seem sort of different to me. I am no expert at what the dupe should look like in fact in some cases there is no dupe???

I am going to bash on and see what happens.

Thanks for at least commenting

Regards John


Part of the "wingmates" thing is this change:
from
minimum quorum 2
to
minimum quorum 1

This means you no longer need a wingmate and the system now trusts your pc to produce valid results, or the project is sending out units that thinks few pc's can mess them up. Either way it's a good thing as you no longer have to wait on your wingmate to get credits.
ID: 60905 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : CPU Calculation errors on ps-separation tasks

©2024 Astroinformatics Group