Validate error for being second

Werinbert

Joined: 30 Nov 20
Posts: 14
Credit: 7,958,883
RAC: 16
Message 1382 - Posted: 13 Oct 2021, 16:43:30 UTC

I noticed a couple of WUs that had a second task sent out, presumably because the first task timed out. The first task, however, was returned before the second task was. Consequently, the second task was marked invalid merely because a valid result had already been returned.

example: https://www.mlcathome.org/mlcathome/workunit.php?wuid=5465870
UBT - wbiz

Joined: 2 May 21
Posts: 9
Credit: 2,016,461
RAC: 2
Message 1384 - Posted: 14 Oct 2021, 4:09:58 UTC - in response to Message 1382.  

It is a bit frustrating, but if the job has been allocated to a second computer and the first computer's next report back to MLC includes the completed task, it would be silly to throw that result away.

It's not helped by users stacking up loads of jobs. In this case he has 360 jobs stacked up, with an average turnaround of 7 days on 8 cores, and regularly has jobs timing out (roughly 40%).
PDW

Joined: 1 Jul 20
Posts: 8
Credit: 25,000,139
RAC: 0
Message 1427 - Posted: 28 Nov 2021, 8:12:10 UTC

@pianoman, do you not think this is an issue?

Time and effort are being wasted on returning what look like perfectly good results, only to have them marked as invalid because the original task gets returned shortly before its replacement can be returned. By all means cancel duplicates if they haven't started, but if one is running and then completes as a valid task, it should be given credit, please. I do not have a problem with the original task getting zero credit if it is returned after its deadline and the work has already been completed by another.

If the project doesn't think the time and effort are worth credit for these results, then I'll go along with that and just abort all non-original tasks that get downloaded, to ensure I'm not going to run into the problem.

Thanks


©2023 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)