Questions and Answers :
Issue Discussion :
Rogue batch ?
Message board moderation
| Author | Message |
|---|---|
|
Send message Joined: 24 Jul 20 Posts: 8 Credit: 14,406,606 RAC: 1 |
The last 6 WU's on my machine have all failed on multiple machines. https://www.mlcathome.org/mlcathome/workunit.php?wuid=5156209 https://www.mlcathome.org/mlcathome/workunit.php?wuid=5200315 https://www.mlcathome.org/mlcathome/workunit.php?wuid=5216143 https://www.mlcathome.org/mlcathome/workunit.php?wuid=5225507 https://www.mlcathome.org/mlcathome/workunit.php?wuid=5233215 https://www.mlcathome.org/mlcathome/workunit.php?wuid=5036531 This is not productive - can it be avoided ? |
|
Send message Joined: 31 Jan 21 Posts: 1 Credit: 103,509 RAC: 0 |
9 so far - rather annoying |
|
Send message Joined: 30 Jun 20 Posts: 462 Credit: 21,406,548 RAC: 0 |
Looking at this now.. working on it. Not sure what's going on, looks like a connection issue to the database somewhere. Tracking. |
|
Send message Joined: 30 Jun 20 Posts: 462 Credit: 21,406,548 RAC: 0 |
Aha. Out of memory error on mongodb. ran out of the 32G in the server. working on a fix now. The problem was the validator script connects to another database (a mongodb, the main boinc database is mysql, also large and also running on the same server), and it interpreted a connection-to-the-db failed as a validation failed. So, multiple things to fix. |
|
Send message Joined: 9 Jul 20 Posts: 142 Credit: 11,536,204 RAC: 3 |
Thanks for looking into it! |
©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)