Problems downloading Stars and Volume files
Author | Message |
---|---|
Joined: 30 Aug 07 Posts: 15 Credit: 6,390 RAC: 0 |
I may have spoken too soon on the other thread, but it still appears that the stars.txt and volume.txt files are hanging on download. During the night (~11:30 pm CST), while I was on the website, it locked up with too many concurrent connections. So maybe the problem is too many people trying to download the files at once. |
Joined: 8 Oct 07 Posts: 6 Credit: 75,628 RAC: 0 |
Quote: "I may have spoken too soon on the other thread, but it still appears that the stars.txt and volume.txt files are hanging on download. During the night (~11:30 pm CST), while I was on the website, it locked up with too many concurrent connections. So maybe the problem is too many people trying to download the files at once." Looks like I've got the same problem now (I didn't meet it with the many other WUs my hosts downloaded earlier today). Here's BOINC's log: 08/11/2007 17.13.14|Milkyway@home|[file_xfer] Started download of file stars.txt |
Joined: 29 Aug 07 Posts: 2 Credit: 346,188 RAC: 0 |
Even if you get stars.txt and volume.txt to download, they are deleted from the project directory once you run out of tasks. So when more tasks become available, stars.txt and volume.txt have to be downloaded again. If stars.txt and volume.txt are the same for all tasks, shouldn't they be set as non-delete in the server configuration? I solved this by making those two files read-only, but if the project changes them then all tasks will fail for me. Here are the two messages in the BOINC log showing that the project tried to delete these files when I ran out of work: Milkyway@home 11/8/2007 10:22:24 AM [error] Couldn't delete file projects/milkyway.cs.rpi.edu_milkyway/stars.txt Milkyway@home 11/8/2007 10:22:31 AM [error] Couldn't delete file projects/milkyway.cs.rpi.edu_milkyway/volume.txt |
Joined: 8 Oct 07 Posts: 24 Credit: 111,325 RAC: 0 |
I would guess that there is a limit on how many times, or for how long, the client will request the missing files before giving up. Any idea how long that might be? I was all excited about getting my first pair of WUs, only to find them stuck in download for the last 5 hours. Candidate for ATA membership. |
Joined: 31 Aug 07 Posts: 21 Credit: 21,004,179 RAC: 0 |
There may have been a problem with the server as there were 4700 (and rising) WUs waiting for validation, but that seems to be resolved now. |
Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
Quote: "There may have been a problem with the server as there were 4700 (and rising) WUs waiting for validation, but that seems to be resolved now." We've been testing the first version of using a genetic search to automatically assimilate and generate work units, and it looks like there's been a problem where it was creating too many work units. For some reason the stars.txt and volume.txt files got lost in the process of doing the search, so that's why you all had problems. Hopefully the next one we run (today or tomorrow) will work a bit more smoothly :) We definitely appreciate your work and hope you hang in there with us while we get all the bugs sorted out. |
Joined: 31 Aug 07 Posts: 21 Credit: 21,004,179 RAC: 0 |
Thanx Travis |
Joined: 30 Aug 07 Posts: 15 Credit: 6,390 RAC: 0 |
Quote: "We definitely appreciate your work and hope you hang in there with us while we get all the bugs sorted out." Still having issues, and now I'm starting to see other files not downloading as well: 11/8/2007 4:01:44 PM|Milkyway@home|[file_xfer] Started download of file volume.txt 11/8/2007 4:01:45 PM|Milkyway@home|[file_xfer] Started download of file stars.txt 11/8/2007 4:01:46 PM|Milkyway@home|[file_xfer] Temporarily failed download of stars.txt: file not found 11/8/2007 4:01:46 PM|Milkyway@home|[file_xfer] Temporarily failed download of volume.txt: file not found 11/8/2007 4:01:46 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561965_872 11/8/2007 4:01:47 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561964_863 11/8/2007 4:01:48 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561964_863: file not found 11/8/2007 4:01:48 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561964_867 11/8/2007 4:01:48 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561965_872: file not found 11/8/2007 4:01:49 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561964_862 11/8/2007 4:01:49 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561964_867: file not found 11/8/2007 4:01:50 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561964_861 11/8/2007 4:01:50 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561964_862: file not found 11/8/2007 4:01:51 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561964_860 11/8/2007 4:01:51 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561964_861: file not found 11/8/2007 4:01:52 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194561963_858 11/8/2007 4:01:52 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561964_860: file not found 11/8/2007 4:01:53 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194561963_858: file not found Are there other issues here besides the server being overwhelmed by requests? |
Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I think the server is doing OK... I'm not quite sure why files are going missing. I've started up a new search and hopefully that will fix the problem of too many work units being generated; I think that had something to do with files going missing. Hopefully I'll figure out the problem soon. |
Joined: 29 Aug 07 Posts: 115 Credit: 502,661,158 RAC: 4,584 |
Quote: "I think the server is doing OK... I'm not quite sure why files are going missing. I've started up a new search and hopefully that will fix the problem of too many work units being generated; I think that had something to do with files going missing. Hopefully I'll figure out the problem soon." About half of my machines picked up work, and about half of those had files stuck downloading. |
Joined: 3 Oct 07 Posts: 21 Credit: 49,862 RAC: 0 |
Quote: "I think the server is doing OK... I'm not quite sure why files are going missing. I've started up a new search and hopefully that will fix the problem of too many work units being generated; I think that had something to do with files going missing. Hopefully I'll figure out the problem soon." I just posted this problem in Q&A - Windows. The project directory didn't exist. I manually created it and the downloads started right up. |
Joined: 29 Aug 07 Posts: 115 Credit: 502,661,158 RAC: 4,584 |
Quote: "I think the server is doing OK... I'm not quite sure why files are going missing. I've started up a new search and hopefully that will fix the problem of too many work units being generated; I think that had something to do with files going missing. Hopefully I'll figure out the problem soon." The project directory does exist on mine, but I still have stuck downloads. |
Joined: 30 Aug 07 Posts: 15 Credit: 6,390 RAC: 0 |
It's interesting how the files get stuck in the same position on the progress meter in the transfers tab: the parameters file is always at 32.36%, the stars file is at 0.00%, and the volume file is at 100.00%. Even though the volume.txt file is at 100%, it is still in the transfers tab. I have had to abort some downloads because the client did not recognize them as downloaded, yet the server would say I had the WU for hours. I wonder if the scheduler is clearing out the files too fast after the transfer? I am just shooting spitballs here. Anybody else got an idea? |
Joined: 18 Sep 07 Posts: 7 Credit: 33,809 RAC: 0 |
Quote: "It's interesting how the files get stuck in the same position on the progress meter in the transfers tab:" From my experience with failing transfers behind an NTLM proxy server: BOINC receives a data packet of around 300 kB which says (in HTML, meant for browsers) that, e.g., the authentication was wrong or omitted, or that something else went wonky. BOINC simply counts these bytes toward the amount to be transferred, so that was already enough for volume.txt, already the first third of parameters_..., and still less than 0.01% of stars.txt. Once the expected number of bytes has been transferred (volume.txt) or the connection is cleanly closed (stars.txt, parameters_...), some higher-level code notices that the checksum does not match at all, and the bogus contents are thrown away. For the large files, it possibly tries to download the full amount... The cause may be different in this case, but maybe the scenario has something in common? Peter |
Joined: 19 Sep 07 Posts: 5 Credit: 5,401 RAC: 0 |
I've noticed this morning on this WinXP machine that all files ending with a four-figure number starting with a 3 have downloaded successfully, e.g. 09/11/2007 09:07:32|Milkyway@home|Finished download of parameters_generated_1194648761_3785 However, all files ending with a five-figure number beginning with a 4 have failed because the file was not found, e.g. 09/11/2007 08:57:38|Milkyway@home|Giving up on download of parameters_generated_1194582769_44937: file not found Hope that helps, Dave. |
Joined: 8 Oct 07 Posts: 289 Credit: 3,690,838 RAC: 0 |
Is the best course of action to just abort the downloads with missing files? I assume these will never download... right? |
Joined: 3 Oct 07 Posts: 5 Credit: 329,770 RAC: 0 |
stars.txt and volume.txt downloaded, and I also got about 300 WUs to run, which I did. Now the NEW WU files to download are having the same problem that stars.txt and volume.txt had before: can't find file. I've aborted the downloads because they just keep trying but failing. ???????????? |
Joined: 28 Aug 07 Posts: 2 Credit: 1,093,644 RAC: 0 |
I just posted in 'Platform Specific - Windows' ... should've read through these posts first. I'm getting the same problem on one WinXP SP2 PC, only one download at a time. 11/9/2007 12:08:32 PM|Milkyway@home|[file_xfer] Started download of file parameters_generated_1194582644_44195 11/9/2007 12:08:34 PM|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194582644_44195: file not found 11/9/2007 12:08:34 PM|Milkyway@home|Backing off 2 hr 2 min 8 sec on download of file parameters_generated_1194582644_44195 File transferred: 326/1000 bytes (32.60%). BOINC client: 5.8.15. I aborted the others, but will check the stats if I get another. This file name also ends in a five-figure number beginning with '4', so that was a good suggestion to research from there if possible. |
Joined: 8 Oct 07 Posts: 24 Credit: 111,325 RAC: 0 |
I got the stars and volume files on the first attempt for this batch, but can't get the parameters file for any of these 5 WUs: 09/11/2007 16:46:28|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194569879_2880: file not found 09/11/2007 16:46:30|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194569879_2881: file not found 09/11/2007 16:46:33|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194569879_2882: file not found 09/11/2007 16:46:34|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194569880_2892: file not found 09/11/2007 16:46:35|Milkyway@home|[file_xfer] Temporarily failed download of parameters_generated_1194569880_2893: file not found All have 3 failed downloads and a No Reply against them already, so I'm not very hopeful 8¬( |
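
For anyone trying to work out which downloads are worth aborting, here is a rough sketch of how the log excerpts pasted in this thread could be tallied per file. It assumes the message wording seen above ("Started download of file", "Finished download of", "Temporarily failed download of", "Giving up on download of") and nothing more; the names `EVENT_RE`, `summarize_transfers`, and `never_succeeded` are made up for illustration and are not part of BOINC.

```python
import re
from collections import Counter

# Assumption: messages are worded like the excerpts quoted in this thread.
# "Backing off ... on download of file ..." lines are deliberately not matched.
EVENT_RE = re.compile(
    r"(?P<verb>Started|Finished|Temporarily failed|Giving up on) download of "
    r"(?:file )?(?P<name>[^\s:]+)"
)

def summarize_transfers(log_text):
    """Map each file name to a Counter of the transfer events seen for it."""
    events = {}
    for match in EVENT_RE.finditer(log_text):
        events.setdefault(match.group("name"), Counter())[match.group("verb")] += 1
    return events

def never_succeeded(events):
    """Files that failed at least once and never logged a finished download."""
    return sorted(
        name
        for name, counts in events.items()
        if counts["Finished"] == 0
        and (counts["Temporarily failed"] or counts["Giving up on"])
    )

# Sample lines copied from posts in this thread.
SAMPLE_LOG = """\
11/8/2007 4:01:45 PM|Milkyway@home|[file_xfer] Started download of file stars.txt
11/8/2007 4:01:46 PM|Milkyway@home|[file_xfer] Temporarily failed download of stars.txt: file not found
09/11/2007 09:07:32|Milkyway@home|Finished download of parameters_generated_1194648761_3785
09/11/2007 08:57:38|Milkyway@home|Giving up on download of parameters_generated_1194582769_44937: file not found
"""

print(never_succeeded(summarize_transfers(SAMPLE_LOG)))
```

Because the pattern is searched across the whole text rather than line by line, it should also cope with log entries that were pasted as one long run, as several posts above did. It only reports what the client logged; it cannot tell whether the file will reappear on the server later.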