MLDG (test) doesn't respect suspend signal

Questions and Answers : Issue Discussion : MLDG (test) doesn't respect suspend signal
Message board moderation

To post messages, you must log in.

AuthorMessage
Thundergrid

Send message
Joined: 8 Sep 20
Posts: 2
Credit: 286,053
RAC: 2
Message 1264 - Posted: 15 Jul 2021, 15:49:10 UTC
Last modified: 15 Jul 2021, 15:51:01 UTC

Hi,
i've noticed today that an MLC WU didn't respect the suspend signal (nor stop) and keeps processing even when BOINC is completely stoped.
I know is a test application, but this behaviour is a bug... certainly shouldn't hijack a core.

WU in question

BOINC properties informs it as suspended... but keeps crunching.
Machine Learning Dataset Generator (test) 9.91 
ParityMachine-1626323656-7920-0
Suspended 
wrapper_001_x86_64-pc-linux-gnu


I hope you find the bug, is there anything else can i provide?

Regards
ID: 1264 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Michael Goetz
Avatar

Send message
Joined: 1 Jul 20
Posts: 8
Credit: 534,979
RAC: 0
Message 1265 - Posted: 15 Jul 2021, 18:58:52 UTC - in response to Message 1264.  

I also have seen this behavior.
Want to find one of the largest known primes? Try PrimeGrid. Or help cure disease at WCG.

ID: 1265 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pianoman [MLC@Home Admin]
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Jun 20
Posts: 462
Credit: 21,406,548
RAC: 0
Message 1266 - Posted: 15 Jul 2021, 22:00:26 UTC

Yes, I've reproduced it here. Definitely a bug, but didn't think to try it until you mentioned it here, so thank you.

It's definitely new behavior with the wrapper app. I'll look into it.
ID: 1266 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pianoman [MLC@Home Admin]
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Jun 20
Posts: 462
Credit: 21,406,548
RAC: 0
Message 1267 - Posted: 15 Jul 2021, 22:06:21 UTC

OK, I think I see whats going on and I think I know how to fix it. Expect an update later tonight. Thanks again for reporting.
ID: 1267 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Thundergrid

Send message
Joined: 8 Sep 20
Posts: 2
Credit: 286,053
RAC: 2
Message 1268 - Posted: 15 Jul 2021, 22:55:45 UTC - in response to Message 1267.  

OK, I think I see whats going on and I think I know how to fix it. Expect an update later tonight. Thanks again for reporting.


Thanks for the hard work, i'm glad to hear you found it.
ID: 1268 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pianoman [MLC@Home Admin]
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Jun 20
Posts: 462
Credit: 21,406,548
RAC: 0
Message 1269 - Posted: 16 Jul 2021, 7:12:39 UTC

Hmmm...what I though was the problem didn't actually fix it. That's disheartening. Will take another crack at it tomorrow.
ID: 1269 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pianoman [MLC@Home Admin]
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Jun 20
Posts: 462
Credit: 21,406,548
RAC: 0
Message 1271 - Posted: 17 Jul 2021, 5:38:52 UTC

OK, *NOW* I think it's finally working. At least I was able to suspend a process with the latest test build on my machine. Releasing new test WUs now, so please test at your leisure.
ID: 1271 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Questions and Answers : Issue Discussion : MLDG (test) doesn't respect suspend signal

©2024 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)