|
1)
Questions and Answers :
Unix/Linux :
Workaround for "signal 4" problems with 9.50 linux client
(Message 437)
Posted 30 Aug 2020 by PoppaGeek Post: Mine was run on a dual opteron 2431 hex cores running ubuntu 16.04 apparently with success |
|
2)
Questions and Answers :
Unix/Linux :
Workaround for "signal 4" problems with 9.50 linux client
(Message 431)
Posted 30 Aug 2020 by PoppaGeek Post: Detected AMD Family 16 processor, switching OpenBLAS to generic Re-exec()-ing to set number of threads correctly... Machine Learning Dataset Generator v9.55 (Linux/x86_64) (libTorch: release/1.6) [2020-08-29 23:32:03 main:399] : INFO : Set logging level to 1 [2020-08-29 23:32:03 main:407] : INFO : Running in BOINC Standalone mode [2020-08-29 23:32:03 main:412] : INFO : Resolving all filenames [2020-08-29 23:32:03 main:420] : INFO : Resolved: dataset.hdf5 => dataset.hdf5 (exists = 1) [2020-08-29 23:32:03 main:420] : INFO : Resolved: model.cfg => model.cfg (exists = 0) [2020-08-29 23:32:03 main:420] : INFO : Resolved: model-final.pt => model-final.pt (exists = 0) [2020-08-29 23:32:03 main:420] : INFO : Resolved: model-input.pt => model-input.pt (exists = 0) [2020-08-29 23:32:03 main:420] : INFO : Resolved: snapshot.pt => snapshot.pt (exists = 0) [2020-08-29 23:32:03 main:434] : INFO : Dataset filename: dataset.hdf5 [2020-08-29 23:32:03 main:436] : INFO : Configuration: [2020-08-29 23:32:03 main:437] : INFO : Validation Loss Threshold: 0.0001 [2020-08-29 23:32:03 main:438] : INFO : Max Epochs: 2 [2020-08-29 23:32:03 main:439] : INFO : Batch Size: 128 [2020-08-29 23:32:03 main:440] : INFO : Patience: 10 [2020-08-29 23:32:03 main:441] : INFO : Hidden Width: 12 [2020-08-29 23:32:03 main:442] : INFO : # Recurrent Layers: 4 [2020-08-29 23:32:03 main:443] : INFO : # Backend Layers: 4 [2020-08-29 23:32:03 main:445] : INFO : Preparing Dataset [2020-08-29 23:32:03 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Xt from dataset.hdf5 into memory [2020-08-29 23:32:04 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Yt from dataset.hdf5 into memory [2020-08-29 23:32:04 load:103] : INFO : Successfully loaded dataset of 2048 examples into memory. [2020-08-29 23:32:04 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Xv from dataset.hdf5 into memory [2020-08-29 23:32:05 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Yv from dataset.hdf5 into memory [2020-08-29 23:32:05 load:103] : INFO : Successfully loaded dataset of 512 examples into memory. [2020-08-29 23:32:05 main:451] : INFO : Creating Model [2020-08-29 23:32:05 main:456] : INFO : Preparing config file [2020-08-29 23:32:05 main:468] : INFO : Creating new config file [2020-08-29 23:32:05 main:499] : INFO : Loading DataLoader into Memory [2020-08-29 23:32:05 main:502] : INFO : Starting Training [2020-08-29 23:34:11 main:514] : INFO : Epoch 1 | loss: 0.0435973 | val_loss: 0.031742 | Time: 126584 ms [2020-08-29 23:36:25 main:514] : INFO : Epoch 2 | loss: 0.0312199 | val_loss: 0.0305334 | Time: 133204 ms [2020-08-29 23:36:25 main:533] : INFO : Saving trained model to model-final.pt, val_loss 0.0305334 [2020-08-29 23:36:25 main:538] : INFO : Saving end state to config to file [2020-08-29 23:36:25 main:543] : INFO : Success, exiting.. |
|
3)
Questions and Answers :
Unix/Linux :
OS/Distribution support question?
(Message 421)
Posted 28 Aug 2020 by PoppaGeek Post: Hi! +1 |
|
4)
Questions and Answers :
Unix/Linux :
Sorry for the delay
(Message 374)
Posted 22 Aug 2020 by PoppaGeek Post: Thanks. I would assume I'll need to remove that once things are fixed? Cheers! |
|
5)
Questions and Answers :
Unix/Linux :
OS/Distribution support question?
(Message 372)
Posted 22 Aug 2020 by PoppaGeek Post: Oldest I'm running is 1 Ubuntu 16.04 x86-64 All ARM are 18.04 or 20.04 or Deb 10 |
|
6)
Questions and Answers :
Unix/Linux :
Linux/armhf and Linux/arm64 support status thread
(Message 371)
Posted 22 Aug 2020 by PoppaGeek Post: Thanks JagDoc! |
|
7)
Questions and Answers :
Unix/Linux :
Linux/armhf and Linux/arm64 support status thread
(Message 368)
Posted 22 Aug 2020 by PoppaGeek Post: No problem here with the rant. Was a UNIX admin in a production environment. I feel your pain. ;-) Good luck, hope things go better for ya soon! |
|
8)
Questions and Answers :
Unix/Linux :
Linux/armhf and Linux/arm64 support status thread
(Message 365)
Posted 22 Aug 2020 by PoppaGeek Post: Odroid c4 same as Jetson. OK with 64bit, 32bit errors out. Ubuntu 20.04 aarch64 https://www.mlcathome.org/mlcathome/results.php?hostid=1985 |
|
9)
Questions and Answers :
Unix/Linux :
Linux/armhf and Linux/arm64 support status thread
(Message 363)
Posted 22 Aug 2020 by PoppaGeek Post: My Jetson Nano is just the opposite. Does aarch64-unknown-linux-gnu fine errors out on arm-unknown-linux-gnueabihf https://www.mlcathome.org/mlcathome/results.php?hostid=1984&offset=0&show_names=0&state=0&appid= Cheers! |
|
10)
Questions and Answers :
Unix/Linux :
Sorry for the delay
(Message 350)
Posted 21 Aug 2020 by PoppaGeek Post: OK thanks for letting me know. Again thanks for keeping us updated! :-) |
|
11)
Questions and Answers :
Unix/Linux :
Sorry for the delay
(Message 346)
Posted 21 Aug 2020 by PoppaGeek Post: I have 2 pretty much identical dual Opteron systems running Ubuntu. Both did fine with 9.20 but all errors with 9.50 https://www.mlcathome.org/mlcathome/results.php?hostid=2032&offset=0&show_names=0&state=6&appid= https://www.mlcathome.org/mlcathome/results.php?hostid=293&offset=20&show_names=0&state=6&appid= |
|
12)
Questions and Answers :
Unix/Linux :
Sorry for the delay
(Message 340)
Posted 21 Aug 2020 by PoppaGeek Post: Thanks for all the updates. |
|
13)
Questions and Answers :
Unix/Linux :
Linux/armhf and Linux/arm64 support status thread
(Message 321)
Posted 13 Aug 2020 by PoppaGeek Post: Ran 2 on Odroid c4 Ubuntu 20.04 had required libs. Multi-thread, ran just those 2 work units at a time. Memory usage ran 550mb to 610. Run time 6 hours 49 min 47 sec CPU time 10 hours 46 min 46 sec https://www.mlcathome.org/mlcathome/result.php?resultid=947290 https://www.mlcathome.org/mlcathome/result.php?resultid=947346 Cheers! |
©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)