|
21)
Questions and Answers :
Issue Discussion :
Validate errors
(Message 984)
Posted 29 Dec 2020 by alex Post: I have checked my account. Currently i have 86 invalid results; 4 of them are cpu wu's. Tasks are 3262846, 3259758, 3291712 and 3294610. I have also 31 wu's that report as error; about 50% CPU and 50% GPU wu's. Since my first report of failing wu's the rise of the numbers slowed down. Hope this can help a little bit to figure out the source. |
|
22)
Questions and Answers :
Issue Discussion :
"No tasks sent"
(Message 964)
Posted 24 Dec 2020 by alex Post: I have no problems getting wu's. All systems are win10 (20H2) 64Bit with at least 8GB Ram, all different NVidia cards. |
|
23)
Questions and Answers :
Issue Discussion :
Setup to run 2 wu's on one GPU
(Message 957)
Posted 23 Dec 2020 by alex Post: I checked the gpu usage with gpu-z. On my GTX1060 it is around 50% and memory used below 2 GB. So it should make sense to run 2 wu's on that card. Is there a setup available? And another question: for Einstein it makes sense to run processlasso to priorize the cpu usage for the gpu-wu's. Increases the gpu output pretty well since the gpu is not long in idle waiting for the cpu. The wu's are set up as to use 0.991 cpus. I assume this is a dummy value. Maybe the output can be increased here as well with processlasso. Unfortunately it's no longer free of charge, but maybe someone has a free version available and can check this. If it works i will pay the 38 Euros for it. |
|
24)
Questions and Answers :
Windows :
6 invalid wu's
(Message 952)
Posted 21 Dec 2020 by alex Post: Murphy said: If something can go wrong, it will. Even if there is only a little chance. Thank you for keeping us informed! |
|
25)
Questions and Answers :
Issue Discussion :
Great credit
(Message 951)
Posted 21 Dec 2020 by alex Post: No need to hurry, i can live with that kind of error! No priority one issue. I wish you merry christmas! |
|
26)
Questions and Answers :
Issue Discussion :
Great credit
(Message 947)
Posted 20 Dec 2020 by alex Post: Reading about doubled credit i took a look into my account and found this: 3358630 1620482 5172 20 Dec 2020, 16:08:42 UTC 20 Dec 2020, 19:47:19 UTC Fertig und Bestätigt 5,302.74 5,175.34 4,326,400.00 Machine Learning Dataset Generator (GPU) v9.75 (cuda10200) windows_x86_64 Can i have more of these wu's please? |
|
27)
Message boards :
Cafe :
Geeking out on emerging technologies ...
(Message 930)
Posted 9 Dec 2020 by alex Post: This is a episode from TED talks that found my interest. https://www.youtube.com/watch?v=aR5N2Jl8k14 Maybe you find this interesting too. |
|
28)
Questions and Answers :
Issue Discussion :
wu's fail with err. message out of memory
(Message 928)
Posted 9 Dec 2020 by alex Post: The PC is a live backup, running nothing than BOINC and no special setups to increase GPU-load. CPU-load is around 96%. All out of the box.But BOINC cpu wu's are also running. Also rosetta wu's which are very,very memory hungry. Sometimes they are suspended with the status 'waiting for memory'. Knowing this behaviour triggered me to ask. In the meantime i have 3 failed wu' on PC 5172 with the same message and one on PC 5173 running a NVIDIA GeForce GTX 750 Ti (2048MB) driver: 457.51 OpenCL: 1.2 https://www.dropbox.com/s/5wehq33maxx12d6/gpu-z1.PNG?dl=0 |
|
29)
Questions and Answers :
Windows :
6 invalid wu's
(Message 927)
Posted 9 Dec 2020 by alex Post: Thank you for that explanation. It's a young project and failurs are an option, otherwise progress would be too slow. I accept that. Happy that you are aware of these facts. I have seen projects where the admins were not aware (for at least half a year) that the formula they used was wrong. |
|
30)
Questions and Answers :
Issue Discussion :
wu's fail with err. message out of memory
(Message 919)
Posted 8 Dec 2020 by alex Post: This wu https://www.mlcathome.org/mlcathome/result.php?resultid=3201137 failed with - Unhandled Exception Record - Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x00007FFC6C79D759 PC: Koprozessor NVIDIA GeForce GTX 1060 6GB (4095MB) driver: 457.51 So does this mean GPU-memory or main memory? Main Memory can be increased, GPU mem not. |
|
31)
Questions and Answers :
Windows :
6 invalid wu's
(Message 918)
Posted 8 Dec 2020 by alex Post: The thing is, that these wu's validate on other PC's. So a reinit could propably help. The other idea is: if the first nan occurs, does it make sense to complete the wu wasting GPU-time? If a reinit is no option, the wu could be stopped with an errorcode other than 0 (as my failed wu's do). If this is a more common problem it might be worth to look into the code. But i did not find similar posts. A windows-only problem? The number of failed wu's (Bestätigungsfehler) is now up to 9. At least this wu also failed on another PC, a windows server. https://www.mlcathome.org/mlcathome/show_host_detail.php?hostid=5082 This one validated on another win10 pc https://www.mlcathome.org/mlcathome/workunit.php?wuid=1470598 |
|
32)
Questions and Answers :
Windows :
6 invalid wu's
(Message 916)
Posted 7 Dec 2020 by alex Post: Hi, i have 6 invalid wu's on different PC's. The common thing is: all have all or some entries in stderr of loss or val_loss of nan. One example is https://www.mlcathome.org/mlcathome/result.php?resultid=3189474 Is the value 'nan' the reason for the invalid wu? If yes, why does it make sense to finish calculation? Would it be possible to simply restart the wu? |
©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)