Message boards :
Application Code Discussion :
problem with checkpoints 2
Message board moderation
Author | Message |
---|---|
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I'm pretty sure the application is checkpointing correctly but we're still getting the odd bad workunit. Not quite sure what's causing it, but I'm going to keep looking into the problem. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
This is happening in all the 0.14 compiled apps. Including the Gipsel app (which i think has a slightly different fix to the checkpointing problem). So I'm not quite sure if the issue is still checkpointing or if it's something else. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
Pretty sure I found the issue. At the end of calculate integrals there is: #ifdef GMLE_BOINC int retval = write_checkpoint(es); if (retval) { fprintf(stderr,"APP: astronomy checkpoint failed %d\n",retval); return retval; } #endif So in the rare case that this is the last checkpoint calculated (and there hasn't been a new one from the next integral calculation or likelihood calculation), the app will recalculate an integral. I'm going to do another update because I think this should put the last nail in the coffin to this problem. |
©2024 Astroinformatics Group