Welcome to MilkyWay@home

Computation errors?

Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 46119 - Posted: 10 Feb 2011, 1:37:02 UTC - in response to Message 46111.  

Every long task crashed on my HD 4770 under XP x86. All short tasks are running OK.
ID 120357. Any idea?
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Running Milkyway@home ATI GPU application version 0.23 (Win32, CAL 1.4) by Gipsel
ignoring unknown input argument in app_info.xml: -np
ignoring unknown input argument in app_info.xml: 8
ignoring unknown input argument in app_info.xml: -p
ignoring unknown input argument in app_info.xml: 0.3519878880479614400000000
ignoring unknown input argument in app_info.xml: 26.1891288052719350000000000
ignoring unknown input argument in app_info.xml: -2.4507573159806090000000000
ignoring unknown input argument in app_info.xml: 42.2227791818824000000000000
ignoring unknown input argument in app_info.xml: 31.2596681887050780000000000
ignoring unknown input argument in app_info.xml: 2.2478147380792306000000000
ignoring unknown input argument in app_info.xml: 0.2000000000000000000000000
ignoring unknown input argument in app_info.xml: 2.0000000000000000000000000
instructed by BOINC client to use device 0
CPU: Intel(R) Core(TM)2 Duo CPU     E8400  @ 3.00GHz (2 cores/threads) 2.9997 GHz (323ms)

CAL Runtime: 1.4.900
Found 1 CAL device

Device 0: ATI Radeon HD4700/4800 (RV740/RV770) 512 MB local RAM (remote 64 MB cached + 512 MB uncached)
GPU core clock: 825 MHz, memory clock: 845 MHz
640 shader units organized in 8 SIMDs with 16 VLIW units (5-issue), wavefront size 64 threads
supporting double precision

Starting WU on GPU 0

main integral, 1500 iterations (3500x3000), 1 streams
predicted runtime per iteration is 705 ms (33.3333 ms are allowed), dividing each iteration in 22 parts
borders of the domains at 0 160 320 480 640 800 960 1120 1272 1432 1592 1752 1912 2072 2232 2392 2552 2704 2864 3024 3184 3344 3500
1, integration, Stream Allocation : Failed to create Buffer
Kernel Execution : Uninitialized or Allocation failed Input streams.
Stream Allocation : Failed to create Buffer


</stderr_txt>
]]>
That big one requires 3000 * 3500 * 8 * 2 / (1024 * 1024) ≈ 160 MB. Maybe that much isn't available? Alternatively, ATI's drivers have had (and may still have) problems where they sometimes refuse to allocate more than some arbitrary size for a single buffer.
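The estimate above can be checked with a few lines of Python. This is only a sketch: reading the 8 as bytes per double-precision value and the 2 as two values stored per grid point is an assumption, since the post gives only the numbers, and the function name is made up for illustration.

```python
def buffer_size_mib(width: int, height: int,
                    bytes_per_value: int = 8,
                    values_per_cell: int = 2) -> float:
    """Size of a width x height buffer in MiB.

    bytes_per_value=8 and values_per_cell=2 mirror the "8 * 2" in the post;
    what those factors stand for is an assumption, not something the post states.
    """
    return width * height * bytes_per_value * values_per_cell / (1024 * 1024)


if __name__ == "__main__":
    # 3500x3000 is the integral size reported in the stderr log.
    print(f"{buffer_size_mib(3000, 3500):.1f} MiB")  # prints "160.2 MiB"
```

The log reports 512 MB of local RAM on the card, so a single buffer of this size would take roughly a third of it, before counting anything else the driver or desktop has allocated.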

ID: 46119 · Rating: 0
Profile Beyond
Joined: 15 Jul 08
Posts: 383
Credit: 729,293,740
RAC: 0
Message 46128 - Posted: 10 Feb 2011, 6:38:31 UTC - in response to Message 46119.  
Last modified: 10 Feb 2011, 6:41:13 UTC

Every long task crashed on my HD 4770 under XP x86. All short tasks are running OK.

The long tasks have been running fine on all 8 of my HD 4770 cards (512 MB). I am using the v10.12 ATI drivers with app support on all of them. All are Win7-64.
ID: 46128 · Rating: 0
Profile kashi

Joined: 30 Dec 07
Posts: 311
Credit: 149,490,184
RAC: 0
Message 46130 - Posted: 10 Feb 2011, 9:26:33 UTC

"Stream Allocation : Failed to create Buffer" errors are usually caused by insufficient system memory in the Collatz ATI application. Perhaps this could be the cause here also. Hard to tell when computers are hidden.

Could also be similar to what happened about a year ago when the task lengths were increased and the memory in the GPU was not being initialised properly for CUDA cards running Win XP. It was fixed by a new 0.23 version application. If this was the case though I would have expected more people to be reporting these errors.
ID: 46130 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 46131 - Posted: 10 Feb 2011, 9:48:39 UTC - in response to Message 46097.  


09/02/2011 11:05:23 Milkyway@home Starting de_separation_82_1s_fix_1_60710_1297245785_0
09/02/2011 11:05:23 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:23 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:24 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:24 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:25 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:26 Milkyway@home Computation for task de_separation_82_1s_fix_1_60710_1297245785_0 finished
09/02/2011 11:05:37 Milkyway@home task de_separation_82_1s_fix_1_60710_1297245785_0 suspended by user

Is there any workaround for this?

Thanks
This MIGHT be because I think I built everything targeting Windows XP.


I wonder, how can I get the executable? I could open it in Dependency Walker and see what prevents it from being executed.
ID: 46131 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 46132 - Posted: 10 Feb 2011, 10:01:45 UTC - in response to Message 46131.  


09/02/2011 11:05:23 Milkyway@home Starting de_separation_82_1s_fix_1_60710_1297245785_0
09/02/2011 11:05:23 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:23 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:24 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:24 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:25 Milkyway@home [error] Process creation failed:
09/02/2011 11:05:26 Milkyway@home Computation for task de_separation_82_1s_fix_1_60710_1297245785_0 finished
09/02/2011 11:05:37 Milkyway@home task de_separation_82_1s_fix_1_60710_1297245785_0 suspended by user

Is there any workaround for this?

Thanks
This MIGHT be because I think I built everything targeting Windows XP.


I wonder, how can I get the executable? I could open it in Dependency Walker and see what prevents it from being executed.


In fact, I just took a look at the milkyway_0.50_windows_intelx86.exe in Dependency Walker and I see that two functions from kernel32.dll are not bound:

DecodePointer(...) and EncodePointer(...)

Why are you using these?
ID: 46132 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 46133 - Posted: 10 Feb 2011, 10:09:13 UTC - in response to Message 46131.  

This MIGHT be because I think I built everything targeting Windows XP.


If it is the EncodePointer/DecodePointer issue, it is even worse: those functions are only supported starting from Windows XP SP2.
ID: 46133 · Rating: 0
Profile banditwolf
Joined: 12 Nov 07
Posts: 2425
Credit: 524,164
RAC: 0
Message 46172 - Posted: 11 Feb 2011, 19:24:37 UTC

I had one WU that just plain got stuck, and nothing ran all day on my computer. Not sure why. All others have run fine.
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.
ID: 46172 · Rating: 0
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 46175 - Posted: 11 Feb 2011, 20:09:28 UTC - in response to Message 46132.  

In fact, I just took a look at the milkyway_0.50_windows_intelx86.exe in Dependency Walker and I see that two functions from kernel32.dll are not bound:

DecodePointer(...) and EncodePointer(...)

Why are you using these?
I'm not. Something in libraryland must be.
ID: 46175 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 46185 - Posted: 12 Feb 2011, 9:14:19 UTC - in response to Message 46175.  

Is there no way to solve this problem? If it does not work on Win2000, it obviously does not work on Win98 either.

So it has to be clearly stated that Milkyway tasks can be run only starting from Windows XP SP2.
ID: 46185 · Rating: 0
TJ

Joined: 12 Aug 09
Posts: 262
Credit: 92,631,041
RAC: 0
Message 46186 - Posted: 12 Feb 2011, 13:01:25 UTC - in response to Message 46185.  
Last modified: 12 Feb 2011, 13:03:07 UTC

So it has to be clearly stated that Milkyway tasks can be run only starting from Windows XP SP2.


No.

In my experience they run with ATI cards on Vista and Win7 and with nVidia CUDA on Vista Ultimate x64. Both are GPU tasks.

I have also run them on the CPU on a Vista Ultimate x84 box, without errors.
Greetings from,
TJ
ID: 46186 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 46191 - Posted: 12 Feb 2011, 19:08:33 UTC - in response to Message 46186.  

So it has to be clearly stated that Milkyway tasks can be run only starting from Windows XP SP2.


No.

In my experience they run with ATI cards on Vista and Win7 and with nVidia CUDA on Vista Ultimate x64. Both are GPU tasks.

I have also run them on the CPU on a Vista Ultimate x84 box, without errors.


Well, I meant Windows XP SP2 and higher (obviously, that includes Vista and 7).

However, I still don't know what to do with my Win2000 laptop :) I use it for tests, so most of the time it is idle, and BOINC is a good way to keep it busy ...

I noticed that tasks started to fail recently, about 3 months ago. Before that it was working okay.
ID: 46191 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 46216 - Posted: 13 Feb 2011, 11:19:26 UTC - in response to Message 46191.  
Last modified: 13 Feb 2011, 11:19:55 UTC

I also noticed another glitch in BOINC when running it on my Windows 7 machine: one of the tasks shows a progress of 127.808%. How is this possible?

See screenshot here:
ID: 46216 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 49279 - Posted: 14 Jun 2011, 5:53:04 UTC

Any news on Windows 2000? I still can't run the tasks because they fail with "calculation error".
ID: 49279 · Rating: 0
Milchband

Joined: 19 Apr 09
Posts: 26
Credit: 37,330,714
RAC: 0
Message 49282 - Posted: 14 Jun 2011, 10:33:54 UTC

Every new 0.82 task crashed. I quit them.
ID: 49282 · Rating: 0
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 49312 - Posted: 14 Jun 2011, 22:41:19 UTC - in response to Message 49279.  

Any news on Windows 2000? I still can't run the tasks because they fail with "calculation error".
I thought I built everything for Windows 2000, but apparently I was one underscore off in defining the Windows version, and the version was missing from the link flags. I put out 0.84 (non-GPU) for 32-bit Windows. That might work; I don't have a Windows 2000 machine around to actually try it.
ID: 49312 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 49325 - Posted: 15 Jun 2011, 10:58:37 UTC - in response to Message 49312.  

Any news on Windows 2000? I still can't run the tasks because they fail with "calculation error".
I thought I built everything for Windows 2000, but apparently I was one underscore off in defining the Windows version, and the version was missing from the link flags. I put out 0.84 (non-GPU) for 32-bit Windows. That might work; I don't have a Windows 2000 machine around to actually try it.


Thank you. I just tried the updated MilkyWay@Home 0.84 (ps_test_4_1064456_0) and it fails with the same error.


ID: 49325 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 49330 - Posted: 15 Jun 2011, 14:02:11 UTC - in response to Message 49325.  

When I look at the dependencies of the binary, I see that it links against EncodePointer in kernel32.dll, which is not exported on 2k. Please see the attached screenshot:



Actually, if you don't have 2k, you can just inspect the import table of the executable and make sure it does not contain EncodePointer or DecodePointer. The simplest way is to search for occurrences using any text editor. You can also use grep if you are up to it.
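The text-search check described above could be scripted roughly like this. It is a minimal sketch that looks for the two function names as raw ASCII bytes anywhere in the file, exactly as a text editor or grep would; it does not parse the PE import table, so a hit is only a hint and a miss is not proof. The helper names are hypothetical.

```python
from __future__ import annotations

import sys
from pathlib import Path

# The two kernel32.dll functions named in this thread.
SUSPECT_IMPORTS = (b"EncodePointer", b"DecodePointer")


def find_suspect_names(data: bytes, names=SUSPECT_IMPORTS) -> list[str]:
    """Return the names whose ASCII bytes occur anywhere in data."""
    return [name.decode("ascii") for name in names if name in data]


def scan_file(path: str) -> list[str]:
    """Read a binary from disk and report which suspect names it contains."""
    return find_suspect_names(Path(path).read_bytes())


if __name__ == "__main__":
    # Usage: python scan_imports.py <executable> [<executable> ...]
    for exe in sys.argv[1:]:
        hits = scan_file(exe)
        print(exe, "->", ", ".join(hits) if hits else "no matches")
```

Running it against each released executable before publishing would catch this particular regression without needing a Windows 2000 machine.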
ID: 49330 · Rating: 0
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 49383 - Posted: 16 Jun 2011, 22:21:24 UTC - in response to Message 49330.  

When I look at the dependencies of the binary, I see that it links against EncodePointer in kernel32.dll, which is not exported on 2k. Please see the attached screenshot:



Actually, if you don't have 2k, you can just inspect the import table of the executable and make sure it does not contain EncodePointer or DecodePointer. The simplest way is to search for occurrences using any text editor. You can also use grep if you are up to it.
Try 0.86
ID: 49383 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 49389 - Posted: 17 Jun 2011, 7:31:18 UTC - in response to Message 49383.  

Same problem with 0.86:

ID: 49389 · Rating: 0
Profile Volodymyr Shcherbyna

Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 0
Message 49394 - Posted: 17 Jun 2011, 14:55:59 UTC - in response to Message 49389.  

Well, in fact, the problem is not the same.

In the 0.86 binary I don't see a static link to the CRT. It links to msvcrt.dll dynamically. Therefore this application does not run due to the absence of msvcrt.dll on the system.

In most cases machines are clean and don't have the CRT library installed. Can you compile it with the static CRT, i.e. with the /MT switch and not /MD? This will increase the size of the binary, but it will make sure people do not have to dig around to see what's wrong.

In this particular case it complains that it cannot find the function lc_codepage_func(...) (http://msdn.microsoft.com/en-us/library/ff730817.aspx), which comes from the VS 2010 CRT!

Thank you!
ID: 49394 · Rating: 0

©2024 Astroinformatics Group