| Name | ParityModified-1647047553-29267-3-0_4 |
| Workunit | 11618266 |
| Created | 18 Apr 2022, 15:59:11 UTC |
| Sent | 18 Apr 2022, 16:00:02 UTC |
| Report deadline | 26 Apr 2022, 16:00:02 UTC |
| Received | 22 Apr 2022, 8:51:05 UTC |
| Server state | Over |
| Outcome | Computation error |
| Client state | Compute error |
| Exit status | 193 (0x000000C1) EXIT_SIGNAL |
| Computer ID | 12007 |
| Run time | 5 min 19 sec |
| CPU time | 5 min 10 sec |
| Validate state | Invalid |
| Credit | 0.00 |
| Device peak FLOPS | 1,884.93 GFLOPS |
| Application version | Machine Learning Dataset Generator (GPU) v9.80 (cuda10200) x86_64-pc-linux-gnu |
| Peak working set size | 1.82 GB |
| Peak swap size | 13.38 GB |
| Peak disk usage | 2.99 GB |
<core_client_version>7.16.6</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63)</message> <stderr_txt> DEBUG: Args: ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200 -c --maxepoch 2048 nthreads: 1 gpudev: 0 Re-exec()-ing to set environment correctly Machine Learning Dataset Generator v9.80 (Linux/x86_64) (libTorch: release/1.7 GPU: NVIDIA GeForce GTX 1050) [2022-04-18 18:00:20 main:442] : INFO : Set logging level to 1 [2022-04-18 18:00:20 main:448] : INFO : Running in BOINC Client mode [2022-04-18 18:00:20 main:451] : INFO : Resolving all filenames [2022-04-18 18:00:20 main:459] : INFO : Resolved: dataset.hdf5 => dataset.hdf5 (exists = 1) [2022-04-18 18:00:20 main:459] : INFO : Resolved: model.cfg => model.cfg (exists = 0) [2022-04-18 18:00:20 main:459] : INFO : Resolved: model-final.pt => model-final.pt (exists = 0) [2022-04-18 18:00:20 main:459] : INFO : Resolved: model-input.pt => model-input.pt (exists = 1) [2022-04-18 18:00:20 main:459] : INFO : Resolved: snapshot.pt => snapshot.pt (exists = 0) [2022-04-18 18:00:20 main:479] : INFO : Dataset filename: dataset.hdf5 [2022-04-18 18:00:20 main:481] : INFO : Configuration: [2022-04-18 18:00:20 main:482] : INFO : Model type: GRU [2022-04-18 18:00:20 main:483] : INFO : Validation Loss Threshold: 0.0001 [2022-04-18 18:00:20 main:484] : INFO : Max Epochs: 2048 [2022-04-18 18:00:20 main:485] : INFO : Batch Size: 128 [2022-04-18 18:00:20 main:486] : INFO : Learning Rate: 0.01 [2022-04-18 18:00:20 main:487] : INFO : Patience: 10 [2022-04-18 18:00:20 main:488] : INFO : Hidden Width: 12 [2022-04-18 18:00:20 main:489] : INFO : # Recurrent Layers: 4 [2022-04-18 18:00:20 main:490] : INFO : # Backend Layers: 4 [2022-04-18 18:00:20 main:491] : INFO : # Threads: 1 [2022-04-18 18:00:20 main:493] : INFO : Preparing Dataset [2022-04-18 18:00:20 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Xt from dataset.hdf5 into memory [2022-04-18 18:00:21 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Yt from dataset.hdf5 into memory [2022-04-18 18:00:22 load:106] : INFO : Successfully loaded dataset of 2048 examples into memory. [2022-04-18 18:00:22 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Xv from dataset.hdf5 into memory [2022-04-18 18:00:22 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Yv from dataset.hdf5 into memory [2022-04-18 18:00:22 load:106] : INFO : Successfully loaded dataset of 512 examples into memory. [2022-04-18 18:00:22 main:501] : INFO : Creating Model [2022-04-18 18:00:22 main:514] : INFO : Preparing config file [2022-04-18 18:00:22 main:526] : INFO : Creating new config file [2022-04-18 18:00:22 main:545] : INFO : This is a continuation WU, loading previous network [2022-04-18 18:00:23 main:566] : INFO : Loading DataLoader into Memory [2022-04-18 18:00:23 main:569] : INFO : Starting Training [2022-04-18 18:00:25 main:581] : INFO : Epoch 1 | loss: nan | val_loss: nan | Time: 2765.4 ms [2022-04-18 18:00:28 main:581] : INFO : Epoch 2 | loss: nan | val_loss: nan | Time: 2367.71 ms [2022-04-18 18:00:30 main:581] : INFO : Epoch 3 | loss: nan | val_loss: nan | Time: 2401.96 ms [2022-04-18 18:00:33 main:581] : INFO : Epoch 4 | loss: nan | val_loss: nan | Time: 2395.05 ms [2022-04-18 18:00:35 main:581] : INFO : Epoch 5 | loss: nan | val_loss: nan | Time: 2339.56 ms [2022-04-18 18:00:37 main:581] : INFO : Epoch 6 | loss: nan | val_loss: nan | Time: 2385.87 ms [2022-04-18 18:00:40 main:581] : INFO : Epoch 7 | loss: nan | val_loss: nan | Time: 2361.47 ms [2022-04-18 18:00:42 main:581] : INFO : Epoch 8 | loss: nan | val_loss: nan | Time: 2382.15 ms [2022-04-18 18:00:44 main:581] : INFO : Epoch 9 | loss: nan | val_loss: nan | Time: 2332.88 ms [2022-04-18 18:00:47 main:581] : INFO : Epoch 10 | loss: nan | val_loss: nan | Time: 2358.42 ms [2022-04-18 18:00:49 main:581] : INFO : Epoch 11 | loss: nan | val_loss: nan | Time: 2346.99 ms [2022-04-18 18:00:51 main:581] : INFO : Epoch 12 | loss: nan | val_loss: nan | Time: 2346.47 ms [2022-04-18 18:00:54 main:581] : INFO : Epoch 13 | loss: nan | val_loss: nan | Time: 2399.26 ms [2022-04-18 18:00:56 main:581] : INFO : Epoch 14 | loss: nan | val_loss: nan | Time: 2382.96 ms [2022-04-18 18:00:59 main:581] : INFO : Epoch 15 | loss: nan | val_loss: nan | Time: 2377.6 ms [2022-04-18 18:01:01 main:581] : INFO : Epoch 16 | loss: nan | val_loss: nan | Time: 2375.93 ms [2022-04-18 18:01:03 main:581] : INFO : Epoch 17 | loss: nan | val_loss: nan | Time: 2399.96 ms [2022-04-18 18:01:06 main:581] : INFO : Epoch 18 | loss: nan | val_loss: nan | Time: 2372.58 ms [2022-04-18 18:01:08 main:581] : INFO : Epoch 19 | loss: nan | val_loss: nan | Time: 2370.24 ms [2022-04-18 18:01:10 main:581] : INFO : Epoch 20 | loss: nan | val_loss: nan | Time: 2332.22 ms [2022-04-18 18:01:13 main:581] : INFO : Epoch 21 | loss: nan | val_loss: nan | Time: 2392.09 ms [2022-04-18 18:01:15 main:581] : INFO : Epoch 22 | loss: nan | val_loss: nan | Time: 2357.78 ms [2022-04-18 18:01:18 main:581] : INFO : Epoch 23 | loss: nan | val_loss: nan | Time: 2384.86 ms [2022-04-18 18:01:20 main:581] : INFO : Epoch 24 | loss: nan | val_loss: nan | Time: 2372.58 ms [2022-04-18 18:01:22 main:581] : INFO : Epoch 25 | loss: nan | val_loss: nan | Time: 2368.87 ms [2022-04-18 18:01:25 main:581] : INFO : Epoch 26 | loss: nan | val_loss: nan | Time: 2385.34 ms [2022-04-18 18:01:27 main:581] : INFO : Epoch 27 | loss: nan | val_loss: nan | Time: 2358.44 ms [2022-04-18 18:01:29 main:581] : INFO : Epoch 28 | loss: nan | val_loss: nan | Time: 2406.87 ms [2022-04-18 18:01:32 main:581] : INFO : Epoch 29 | loss: nan | val_loss: nan | Time: 2370.65 ms [2022-04-18 18:01:34 main:581] : INFO : Epoch 30 | loss: nan | val_loss: nan | Time: 2389.12 ms [2022-04-18 18:01:37 main:581] : INFO : Epoch 31 | loss: nan | val_loss: nan | Time: 2391.95 ms [2022-04-18 18:01:39 main:581] : INFO : Epoch 32 | loss: nan | val_loss: nan | Time: 2362.33 ms [2022-04-18 18:01:41 main:581] : INFO : Epoch 33 | loss: nan | val_loss: nan | Time: 2353.4 ms [2022-04-18 18:01:44 main:581] : INFO : Epoch 34 | loss: nan | val_loss: nan | Time: 2375.33 ms [2022-04-18 18:01:46 main:581] : INFO : Epoch 35 | loss: nan | val_loss: nan | Time: 2407.31 ms [2022-04-18 18:01:49 main:581] : INFO : Epoch 36 | loss: nan | val_loss: nan | Time: 2413.27 ms [2022-04-18 18:01:51 main:581] : INFO : Epoch 37 | loss: nan | val_loss: nan | Time: 2357.96 ms [2022-04-18 18:01:53 main:581] : INFO : Epoch 38 | loss: nan | val_loss: nan | Time: 2394.21 ms [2022-04-18 18:01:56 main:581] : INFO : Epoch 39 | loss: nan | val_loss: nan | Time: 2384.82 ms [2022-04-18 18:01:58 main:581] : INFO : Epoch 40 | loss: nan | val_loss: nan | Time: 2413.34 ms [2022-04-18 18:02:00 main:581] : INFO : Epoch 41 | loss: nan | val_loss: nan | Time: 2417.26 ms [2022-04-18 18:02:03 main:581] : INFO : Epoch 42 | loss: nan | val_loss: nan | Time: 2404.96 ms [2022-04-18 18:02:05 main:581] : INFO : Epoch 43 | loss: nan | val_loss: nan | Time: 2411.72 ms [2022-04-18 18:02:08 main:581] : INFO : Epoch 44 | loss: nan | val_loss: nan | Time: 2382.03 ms [2022-04-18 18:02:10 main:581] : INFO : Epoch 45 | loss: nan | val_loss: nan | Time: 2362.38 ms [2022-04-18 18:02:12 main:581] : INFO : Epoch 46 | loss: nan | val_loss: nan | Time: 2403.67 ms [2022-04-18 18:02:15 main:581] : INFO : Epoch 47 | loss: nan | val_loss: nan | Time: 2383.01 ms [2022-04-18 18:02:17 main:581] : INFO : Epoch 48 | loss: nan | val_loss: nan | Time: 2375.1 ms [2022-04-18 18:02:20 main:581] : INFO : Epoch 49 | loss: nan | val_loss: nan | Time: 2372.17 ms [2022-04-18 18:02:22 main:581] : INFO : Epoch 50 | loss: nan | val_loss: nan | Time: 2374.84 ms [2022-04-18 18:02:24 main:581] : INFO : Epoch 51 | loss: nan | val_loss: nan | Time: 2393.77 ms [2022-04-18 18:02:27 main:581] : INFO : Epoch 52 | loss: nan | val_loss: nan | Time: 2387.84 ms [2022-04-18 18:02:29 main:581] : INFO : Epoch 53 | loss: nan | val_loss: nan | Time: 2375.64 ms [2022-04-18 18:02:31 main:581] : INFO : Epoch 54 | loss: nan | val_loss: nan | Time: 2376.57 ms [2022-04-18 18:02:34 main:581] : INFO : Epoch 55 | loss: nan | val_loss: nan | Time: 2365.11 ms [2022-04-18 18:02:36 main:581] : INFO : Epoch 56 | loss: nan | val_loss: nan | Time: 2350.02 ms [2022-04-18 18:02:39 main:581] : INFO : Epoch 57 | loss: nan | val_loss: nan | Time: 2347.16 ms [2022-04-18 18:02:41 main:581] : INFO : Epoch 58 | loss: nan | val_loss: nan | Time: 2354.37 ms [2022-04-18 18:02:43 main:581] : INFO : Epoch 59 | loss: nan | val_loss: nan | Time: 2380.82 ms [2022-04-18 18:02:46 main:581] : INFO : Epoch 60 | loss: nan | val_loss: nan | Time: 2367.37 ms [2022-04-18 18:02:48 main:581] : INFO : Epoch 61 | loss: nan | val_loss: nan | Time: 2405.25 ms [2022-04-18 18:02:50 main:581] : INFO : Epoch 62 | loss: nan | val_loss: nan | Time: 2380.95 ms [2022-04-18 18:02:53 main:581] : INFO : Epoch 63 | loss: nan | val_loss: nan | Time: 2436.31 ms [2022-04-18 18:02:55 main:581] : INFO : Epoch 64 | loss: nan | val_loss: nan | Time: 2397.96 ms [2022-04-18 18:02:58 main:581] : INFO : Epoch 65 | loss: nan | val_loss: nan | Time: 2413.21 ms [2022-04-18 18:03:00 main:581] : INFO : Epoch 66 | loss: nan | val_loss: nan | Time: 2336.09 ms [2022-04-18 18:03:02 main:581] : INFO : Epoch 67 | loss: nan | val_loss: nan | Time: 2334.88 ms [2022-04-18 18:03:05 main:581] : INFO : Epoch 68 | loss: nan | val_loss: nan | Time: 2354.33 ms [2022-04-18 18:03:07 main:581] : INFO : Epoch 69 | loss: nan | val_loss: nan | Time: 2371.29 ms [2022-04-18 18:03:09 main:581] : INFO : Epoch 70 | loss: nan | val_loss: nan | Time: 2272.48 ms [2022-04-18 18:03:12 main:581] : INFO : Epoch 71 | loss: nan | val_loss: nan | Time: 2356.87 ms [2022-04-18 18:03:14 main:581] : INFO : Epoch 72 | loss: nan | val_loss: nan | Time: 2378.58 ms [2022-04-18 18:03:16 main:581] : INFO : Epoch 73 | loss: nan | val_loss: nan | Time: 2342.58 ms [2022-04-18 18:03:19 main:581] : INFO : Epoch 74 | loss: nan | val_loss: nan | Time: 2370.6 ms [2022-04-18 18:03:21 main:581] : INFO : Epoch 75 | loss: nan | val_loss: nan | Time: 2356.19 ms [2022-04-18 18:03:24 main:581] : INFO : Epoch 76 | loss: nan | val_loss: nan | Time: 2323.61 ms [2022-04-18 18:03:26 main:581] : INFO : Epoch 77 | loss: nan | val_loss: nan | Time: 2368.13 ms [2022-04-18 18:03:28 main:581] : INFO : Epoch 78 | loss: nan | val_loss: nan | Time: 2296.22 ms [2022-04-18 18:03:31 main:581] : INFO : Epoch 79 | loss: nan | val_loss: nan | Time: 2382.52 ms [2022-04-18 18:03:33 main:581] : INFO : Epoch 80 | loss: nan | val_loss: nan | Time: 2374.02 ms [2022-04-18 18:03:35 main:581] : INFO : Epoch 81 | loss: nan | val_loss: nan | Time: 2378.73 ms [2022-04-18 18:03:38 main:581] : INFO : Epoch 82 | loss: nan | val_loss: nan | Time: 2369.5 ms [2022-04-18 18:03:40 main:581] : INFO : Epoch 83 | loss: nan | val_loss: nan | Time: 2342.41 ms [2022-04-18 18:03:42 main:581] : INFO : Epoch 84 | loss: nan | val_loss: nan | Time: 2376.27 ms [2022-04-18 18:03:45 main:581] : INFO : Epoch 85 | loss: nan | val_loss: nan | Time: 2345.47 ms [2022-04-18 18:03:47 main:581] : INFO : Epoch 86 | loss: nan | val_loss: nan | Time: 2349.05 ms [2022-04-18 18:03:49 main:581] : INFO : Epoch 87 | loss: nan | val_loss: nan | Time: 2342.03 ms [2022-04-18 18:03:52 main:581] : INFO : Epoch 88 | loss: nan | val_loss: nan | Time: 2348.38 ms [2022-04-18 18:03:54 main:581] : INFO : Epoch 89 | loss: nan | val_loss: nan | Time: 2336.43 ms [2022-04-18 18:03:56 main:581] : INFO : Epoch 90 | loss: nan | val_loss: nan | Time: 2377.6 ms [2022-04-18 18:03:59 main:581] : INFO : Epoch 91 | loss: nan | val_loss: nan | Time: 2344.24 ms [2022-04-18 18:04:01 main:581] : INFO : Epoch 92 | loss: nan | val_loss: nan | Time: 2355.91 ms [2022-04-18 18:04:04 main:581] : INFO : Epoch 93 | loss: nan | val_loss: nan | Time: 2359.67 ms [2022-04-18 18:04:06 main:581] : INFO : Epoch 94 | loss: nan | val_loss: nan | Time: 2339.11 ms [2022-04-18 18:04:08 main:581] : INFO : Epoch 95 | loss: nan | val_loss: nan | Time: 2358.28 ms [2022-04-18 18:04:11 main:581] : INFO : Epoch 96 | loss: nan | val_loss: nan | Time: 2347.53 ms [2022-04-18 18:04:13 main:581] : INFO : Epoch 97 | loss: nan | val_loss: nan | Time: 2384.33 ms [2022-04-18 18:04:15 main:581] : INFO : Epoch 98 | loss: nan | val_loss: nan | Time: 2379.99 ms [2022-04-18 18:04:18 main:581] : INFO : Epoch 99 | loss: nan | val_loss: nan | Time: 2365.94 ms [2022-04-18 18:04:20 main:581] : INFO : Epoch 100 | loss: nan | val_loss: nan | Time: 2404.73 ms [2022-04-18 18:04:22 main:581] : INFO : Epoch 101 | loss: nan | val_loss: nan | Time: 2341.52 ms [2022-04-18 18:04:25 main:581] : INFO : Epoch 102 | loss: nan | val_loss: nan | Time: 2327.76 ms [2022-04-18 18:04:27 main:581] : INFO : Epoch 103 | loss: nan | val_loss: nan | Time: 2390.64 ms [2022-04-18 18:04:30 main:581] : INFO : Epoch 104 | loss: nan | val_loss: nan | Time: 2336.16 ms [2022-04-18 18:04:32 main:581] : INFO : Epoch 105 | loss: nan | val_loss: nan | Time: 2365.2 ms [2022-04-18 18:04:34 main:581] : INFO : Epoch 106 | loss: nan | val_loss: nan | Time: 2352.61 ms [2022-04-18 18:04:37 main:581] : INFO : Epoch 107 | loss: nan | val_loss: nan | Time: 2394.02 ms [2022-04-18 18:04:39 main:581] : INFO : Epoch 108 | loss: nan | val_loss: nan | Time: 2349.79 ms [2022-04-18 18:04:41 main:581] : INFO : Epoch 109 | loss: nan | val_loss: nan | Time: 2341.26 ms [2022-04-18 18:04:44 main:581] : INFO : Epoch 110 | loss: nan | val_loss: nan | Time: 2350.88 ms [2022-04-18 18:04:46 main:581] : INFO : Epoch 111 | loss: nan | val_loss: nan | Time: 2365.06 ms [2022-04-18 18:04:48 main:581] : INFO : Epoch 112 | loss: nan | val_loss: nan | Time: 2370.87 ms [2022-04-18 18:04:51 main:581] : INFO : Epoch 113 | loss: nan | val_loss: nan | Time: 2384.06 ms [2022-04-18 18:04:53 main:581] : INFO : Epoch 114 | loss: nan | val_loss: nan | Time: 2337.23 ms [2022-04-18 18:04:55 main:581] : INFO : Epoch 115 | loss: nan | val_loss: nan | Time: 2343.93 ms [2022-04-18 18:04:58 main:581] : INFO : Epoch 116 | loss: nan | val_loss: nan | Time: 2343.82 ms [2022-04-18 18:05:00 main:581] : INFO : Epoch 117 | loss: nan | val_loss: nan | Time: 2344.69 ms [2022-04-18 18:05:03 main:581] : INFO : Epoch 118 | loss: nan | val_loss: nan | Time: 2378.34 ms [2022-04-18 18:05:05 main:581] : INFO : Epoch 119 | loss: nan | val_loss: nan | Time: 2354.39 ms [2022-04-18 18:05:07 main:581] : INFO : Epoch 120 | loss: nan | val_loss: nan | Time: 2298.72 ms [2022-04-18 18:05:10 main:581] : INFO : Epoch 121 | loss: nan | val_loss: nan | Time: 2330.74 ms [2022-04-18 18:05:12 main:581] : INFO : Epoch 122 | loss: nan | val_loss: nan | Time: 2367.16 ms [2022-04-18 18:05:14 main:581] : INFO : Epoch 123 | loss: nan | val_loss: nan | Time: 2349.89 ms [2022-04-18 18:05:17 main:581] : INFO : Epoch 124 | loss: nan | val_loss: nan | Time: 2321.82 ms [2022-04-18 18:05:19 main:581] : INFO : Epoch 125 | loss: nan | val_loss: nan | Time: 2347.24 ms [2022-04-18 18:05:21 main:581] : INFO : Epoch 126 | loss: nan | val_loss: nan | Time: 2301.53 ms [2022-04-18 18:05:24 main:581] : INFO : Epoch 127 | loss: nan | val_loss: nan | Time: 2278.44 ms [2022-04-18 18:05:26 main:581] : INFO : Epoch 128 | loss: nan | val_loss: nan | Time: 2357.78 ms [2022-04-18 18:05:28 main:581] : INFO : Epoch 129 | loss: nan | val_loss: nan | Time: 2338.38 ms [2022-04-18 18:05:31 main:581] : INFO : Epoch 130 | loss: nan | val_loss: nan | Time: 2313.48 ms [2022-04-18 18:05:33 main:581] : INFO : Epoch 131 | loss: nan | val_loss: nan | Time: 2298.07 ms [2022-04-18 18:05:35 main:581] : INFO : Epoch 132 | loss: nan | val_loss: nan | Time: 2323.82 ms [2022-04-18 18:05:37 main:581] : INFO : Epoch 133 | loss: nan | val_loss: nan | Time: 2288.79 ms [2022-04-18 18:05:40 main:581] : INFO : Epoch 134 | loss: nan | val_loss: nan | Time: 2319.59 ms [2022-04-18 18:05:42 main:581] : INFO : Epoch 135 | loss: nan | val_loss: nan | Time: 2342.08 ms [2022-04-18 18:05:44 main:581] : INFO : Epoch 136 | loss: nan | val_loss: nan | Time: 2358.76 ms [2022-04-18 18:05:47 main:581] : INFO : Epoch 137 | loss: nan | val_loss: nan | Time: 2323.43 ms [2022-04-18 18:05:49 main:581] : INFO : Epoch 138 | loss: nan | val_loss: nan | Time: 2338.04 ms [2022-04-18 18:05:51 main:581] : INFO : Epoch 139 | loss: nan | val_loss: nan | Time: 2343.99 ms [2022-04-18 18:05:54 main:581] : INFO : Epoch 140 | loss: nan | val_loss: nan | Time: 2330.02 ms [2022-04-18 18:05:56 main:581] : INFO : Epoch 141 | loss: nan | val_loss: nan | Time: 2320.48 ms [2022-04-18 18:05:58 main:581] : INFO : Epoch 142 | loss: nan | val_loss: nan | Time: 2323.81 ms [2022-04-18 18:06:01 main:581] : INFO : Epoch 143 | loss: nan | val_loss: nan | Time: 2311.14 ms [2022-04-18 18:06:03 main:581] : INFO : Epoch 144 | loss: nan | val_loss: nan | Time: 2339.62 ms [2022-04-18 18:06:05 main:581] : INFO : Epoch 145 | loss: nan | val_loss: nan | Time: 2283.58 ms [2022-04-18 18:06:08 main:581] : INFO : Epoch 146 | loss: nan | val_loss: nan | Time: 2302.62 ms [2022-04-18 18:06:10 main:581] : INFO : Epoch 147 | loss: nan | val_loss: nan | Time: 2316.52 ms [2022-04-18 18:06:12 main:581] : INFO : Epoch 148 | loss: nan | val_loss: nan | Time: 2369.68 ms [2022-04-18 18:06:15 main:581] : INFO : Epoch 149 | loss: nan | val_loss: nan | Time: 2351.77 ms [2022-04-18 18:06:17 main:581] : INFO : Epoch 150 | loss: nan | val_loss: nan | Time: 2378.66 ms [2022-04-18 18:06:19 main:581] : INFO : Epoch 151 | loss: nan | val_loss: nan | Time: 2350.08 ms [2022-04-18 18:06:22 main:581] : INFO : Epoch 152 | loss: nan | val_loss: nan | Time: 2341.46 ms [2022-04-18 18:06:24 main:581] : INFO : Epoch 153 | loss: nan | val_loss: nan | Time: 2329.52 ms [2022-04-18 18:06:26 main:581] : INFO : Epoch 154 | loss: nan | val_loss: nan | Time: 2338.71 ms [2022-04-18 18:06:29 main:581] : INFO : Epoch 155 | loss: nan | val_loss: nan | Time: 2385.47 ms [2022-04-18 18:06:31 main:581] : INFO : Epoch 156 | loss: nan | val_loss: nan | Time: 2389.34 ms [2022-04-18 18:06:34 main:581] : INFO : Epoch 157 | loss: nan | val_loss: nan | Time: 2343.96 ms [2022-04-18 18:06:36 main:581] : INFO : Epoch 158 | loss: nan | val_loss: nan | Time: 2329.92 ms [2022-04-18 18:06:38 main:581] : INFO : Epoch 159 | loss: nan | val_loss: nan | Time: 2342.21 ms [2022-04-18 18:06:41 main:581] : INFO : Epoch 160 | loss: nan | val_loss: nan | Time: 2343.13 ms [2022-04-18 18:06:43 main:581] : INFO : Epoch 161 | loss: nan | val_loss: nan | Time: 2341.92 ms [2022-04-18 18:06:45 main:581] : INFO : Epoch 162 | loss: nan | val_loss: nan | Time: 2344.91 ms [2022-04-18 18:06:48 main:581] : INFO : Epoch 163 | loss: nan | val_loss: nan | Time: 2341.8 ms [2022-04-18 18:06:50 main:581] : INFO : Epoch 164 | loss: nan | val_loss: nan | Time: 2370.22 ms [2022-04-18 18:06:52 main:581] : INFO : Epoch 165 | loss: nan | val_loss: nan | Time: 2345.01 ms [2022-04-18 18:06:55 main:581] : INFO : Epoch 166 | loss: nan | val_loss: nan | Time: 2334.73 ms [2022-04-18 18:06:57 main:581] : INFO : Epoch 167 | loss: nan | val_loss: nan | Time: 2338.7 ms [2022-04-18 18:06:59 main:581] : INFO : Epoch 168 | loss: nan | val_loss: nan | Time: 2353.8 ms [2022-04-18 18:07:02 main:581] : INFO : Epoch 169 | loss: nan | val_loss: nan | Time: 2346.57 ms [2022-04-18 18:07:04 main:581] : INFO : Epoch 170 | loss: nan | val_loss: nan | Time: 2346.1 ms [2022-04-18 18:07:06 main:581] : INFO : Epoch 171 | loss: nan | val_loss: nan | Time: 2333.18 ms [2022-04-18 18:07:09 main:581] : INFO : Epoch 172 | loss: nan | val_loss: nan | Time: 2349.16 ms [2022-04-18 18:07:11 main:581] : INFO : Epoch 173 | loss: nan | val_loss: nan | Time: 2325.62 ms [2022-04-18 18:07:13 main:581] : INFO : Epoch 174 | loss: nan | val_loss: nan | Time: 2317.49 ms [2022-04-18 18:07:16 main:581] : INFO : Epoch 175 | loss: nan | val_loss: nan | Time: 2374.12 ms [2022-04-18 18:07:18 main:581] : INFO : Epoch 176 | loss: nan | val_loss: nan | Time: 2546.71 ms [2022-04-18 18:07:21 main:581] : INFO : Epoch 177 | loss: nan | val_loss: nan | Time: 2348.48 ms [2022-04-18 18:07:23 main:581] : INFO : Epoch 178 | loss: nan | val_loss: nan | Time: 2371.22 ms [2022-04-18 18:07:25 main:581] : INFO : Epoch 179 | loss: nan | val_loss: nan | Time: 2440.49 ms [2022-04-18 18:07:28 main:581] : INFO : Epoch 180 | loss: nan | val_loss: nan | Time: 2344.54 ms [2022-04-18 18:07:30 main:581] : INFO : Epoch 181 | loss: nan | val_loss: nan | Time: 2345.14 ms [2022-04-18 18:07:33 main:581] : INFO : Epoch 182 | loss: nan | val_loss: nan | Time: 2379.59 ms [2022-04-18 18:07:35 main:581] : INFO : Epoch 183 | loss: nan | val_loss: nan | Time: 2360.57 ms [2022-04-18 18:07:37 main:581] : INFO : Epoch 184 | loss: nan | val_loss: nan | Time: 2338.48 ms [2022-04-18 18:07:40 main:581] : INFO : Epoch 185 | loss: nan | val_loss: nan | Time: 2371.12 ms [2022-04-18 18:07:42 main:581] : INFO : Epoch 186 | loss: nan | val_loss: nan | Time: 2391.39 ms [2022-04-18 18:07:44 main:581] : INFO : Epoch 187 | loss: nan | val_loss: nan | Time: 2366 ms [2022-04-18 18:07:47 main:581] : INFO : Epoch 188 | loss: nan | val_loss: nan | Time: 2346.02 ms [2022-04-18 18:07:49 main:581] : INFO : Epoch 189 | loss: nan | val_loss: nan | Time: 2364.56 ms [2022-04-18 18:07:51 main:581] : INFO : Epoch 190 | loss: nan | val_loss: nan | Time: 2389.07 ms [2022-04-18 18:07:54 main:581] : INFO : Epoch 191 | loss: nan | val_loss: nan | Time: 2337.57 ms [2022-04-18 18:07:56 main:581] : INFO : Epoch 192 | loss: nan | val_loss: nan | Time: 2331.18 ms [2022-04-18 18:07:58 main:581] : INFO : Epoch 193 | loss: nan | val_loss: nan | Time: 2367.54 ms [2022-04-18 18:08:01 main:581] : INFO : Epoch 194 | loss: nan | val_loss: nan | Time: 2364.59 ms [2022-04-18 18:08:03 main:581] : INFO : Epoch 195 | loss: nan | val_loss: nan | Time: 2361.2 ms DEBUG: Args: ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200 -c --maxepoch 2048 nthreads: 1 gpudev: 0 Re-exec()-ing to set environment correctly Machine Learning Dataset Generator v9.80 (Linux/x86_64) (libTorch: release/1.7 GPU: NVIDIA GeForce GTX 1050) [2022-04-22 09:50:53 main:442] : INFO : Set logging level to 1 [2022-04-22 09:50:53 main:448] : INFO : Running in BOINC Client mode [2022-04-22 09:50:53 main:451] : INFO : Resolving all filenames [2022-04-22 09:50:53 main:459] : INFO : Resolved: dataset.hdf5 => dataset.hdf5 (exists = 1) [2022-04-22 09:50:53 main:459] : INFO : Resolved: model.cfg => model.cfg (exists = 1) [2022-04-22 09:50:53 main:459] : INFO : Resolved: model-final.pt => model-final.pt (exists = 0) [2022-04-22 09:50:53 main:459] : INFO : Resolved: model-input.pt => model-input.pt (exists = 1) [2022-04-22 09:50:53 main:459] : INFO : Resolved: snapshot.pt => snapshot.pt (exists = 1) [2022-04-22 09:50:53 main:479] : INFO : Dataset filename: dataset.hdf5 [2022-04-22 09:50:53 main:481] : INFO : Configuration: [2022-04-22 09:50:53 main:482] : INFO : Model type: GRU [2022-04-22 09:50:53 main:483] : INFO : Validation Loss Threshold: 0.0001 [2022-04-22 09:50:53 main:484] : INFO : Max Epochs: 2048 [2022-04-22 09:50:53 main:485] : INFO : Batch Size: 128 [2022-04-22 09:50:53 main:486] : INFO : Learning Rate: 0.01 [2022-04-22 09:50:53 main:487] : INFO : Patience: 10 [2022-04-22 09:50:53 main:488] : INFO : Hidden Width: 12 [2022-04-22 09:50:53 main:489] : INFO : # Recurrent Layers: 4 [2022-04-22 09:50:53 main:490] : INFO : # Backend Layers: 4 [2022-04-22 09:50:53 main:491] : INFO : # Threads: 1 [2022-04-22 09:50:53 main:493] : INFO : Preparing Dataset [2022-04-22 09:50:53 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Xt from dataset.hdf5 into memory [2022-04-22 09:50:54 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Yt from dataset.hdf5 into memory [2022-04-22 09:51:03 load:106] : INFO : Successfully loaded dataset of 2048 examples into memory. [2022-04-22 09:51:03 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Xv from dataset.hdf5 into memory [2022-04-22 09:51:03 load_hdf5_ds_into_tensor:28] : INFO : Loading Dataset /Yv from dataset.hdf5 into memory [2022-04-22 09:51:03 load:106] : INFO : Successfully loaded dataset of 512 examples into memory. [2022-04-22 09:51:03 main:501] : INFO : Creating Model [2022-04-22 09:51:03 main:514] : INFO : Preparing config file [2022-04-22 09:51:03 main:518] : INFO : Found checkpoint, attempting to load... [2022-04-22 09:51:03 main:519] : INFO : Loading config terminate called after throwing an instance of 'nlohmann::detail::type_error' what(): [json.exception.type_error.302] type must be number, but is null SIGABRT: abort called Stack trace (24 frames): ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0x37df9c)[0x56420757df9c] /lib/x86_64-linux-gnu/libpthread.so.0(+0x143c0)[0x7ff79e6b73c0] /lib/x86_64-linux-gnu/libc.so.6(gsignal+0xcb)[0x7ff71ffdb03b] /lib/x86_64-linux-gnu/libc.so.6(abort+0x12b)[0x7ff71ffba859] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x135)[0x56420762f7f5] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0x398846)[0x564207598846] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0x398891)[0x564207598891] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0x3968c4)[0x5642075968c4] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xe4af0)[0x5642072e4af0] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xe1682)[0x5642072e1682] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xd9a2b)[0x5642072d9a2b] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xd5f7f)[0x5642072d5f7f] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xcf7f4)[0x5642072cf7f4] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xe16bc)[0x5642072e16bc] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xe4c96)[0x5642072e4c96] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xe17ca)[0x5642072e17ca] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xd9b28)[0x5642072d9b28] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xd5faf)[0x5642072d5faf] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xcf848)[0x5642072cf848] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xcc66f)[0x5642072cc66f] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0xc8d8c)[0x5642072c8d8c] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0x8a2d1)[0x56420728a2d1] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0x7ff71ffbc0b3] ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200(+0x8675a)[0x56420728675a] Exiting... </stderr_txt> ]]>
©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)