Task 2986850

Name ParityModified-1605150890-24023-1_0
Workunit 1348883
Created 18 Nov 2020, 8:09:51 UTC
Sent 25 Nov 2020, 2:55:16 UTC
Report deadline 2 Dec 2020, 2:55:16 UTC
Received 25 Nov 2020, 7:02:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED
Computer ID 4379
Run time 1 sec
CPU time
Validate state Invalid
Credit 0.00
Device peak FLOPS 4,493.80 GFLOPS
Application version Machine Learning Dataset Generator (GPU) v9.80 (cuda10200)
x86_64-pc-linux-gnu
Peak disk usage 2.98 GB

Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
Disk usage limit exceeded</message>
<stderr_txt>
DEBUG: Args: ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200 -c --maxepoch 1024 
nthreads: 1 gpudev: 0
Re-exec()-ing to set environment correctly
Machine Learning Dataset Generator v9.80 (Linux/x86_64) (libTorch: release/1.7 GPU: GeForce GTX 1650 SUPER)
[2020-11-25 01:45:08	                main:442]	:	INFO	:	Set logging level to 1
[2020-11-25 01:45:08	                main:448]	:	INFO	:	Running in BOINC Client mode
[2020-11-25 01:45:08	                main:451]	:	INFO	:	Resolving all filenames
[2020-11-25 01:45:08	                main:459]	:	INFO	:	Resolved: dataset.hdf5 => ../../projects/www.mlcathome.org_mlcathome/ParityModified-train-val-dataset.hdf5 (exists = 1)
[2020-11-25 01:45:08	                main:459]	:	INFO	:	Resolved: model.cfg => ../../projects/www.mlcathome.org_mlcathome/ParityModified-1605150890-24023-1_0_r814113024_1 (exists = 0)
[2020-11-25 01:45:08	                main:459]	:	INFO	:	Resolved: model-final.pt => ../../projects/www.mlcathome.org_mlcathome/ParityModified-1605150890-24023-1_0_r814113024_0 (exists = 0)
[2020-11-25 01:45:08	                main:459]	:	INFO	:	Resolved: model-input.pt => ../../projects/www.mlcathome.org_mlcathome/ParityModified-1605150890-24023-1 (exists = 1)
[2020-11-25 01:45:08	                main:459]	:	INFO	:	Resolved: snapshot.pt => snapshot.pt (exists = 0)
[2020-11-25 01:45:08	                main:479]	:	INFO	:	Dataset filename: ../../projects/www.mlcathome.org_mlcathome/ParityModified-train-val-dataset.hdf5
[2020-11-25 01:45:08	                main:481]	:	INFO	:	Configuration: 
[2020-11-25 01:45:08	                main:482]	:	INFO	:	    Model type: GRU
[2020-11-25 01:45:08	                main:483]	:	INFO	:	    Validation Loss Threshold: 0.0001
[2020-11-25 01:45:08	                main:484]	:	INFO	:	    Max Epochs: 1024
[2020-11-25 01:45:08	                main:485]	:	INFO	:	    Batch Size: 128
[2020-11-25 01:45:08	                main:486]	:	INFO	:	    Learning Rate: 0.01
[2020-11-25 01:45:08	                main:487]	:	INFO	:	    Patience: 10
[2020-11-25 01:45:08	                main:488]	:	INFO	:	    Hidden Width: 12
[2020-11-25 01:45:08	                main:489]	:	INFO	:	    # Recurrent Layers: 4
[2020-11-25 01:45:08	                main:490]	:	INFO	:	    # Backend Layers: 4
[2020-11-25 01:45:08	                main:491]	:	INFO	:	    # Threads: 1
[2020-11-25 01:45:08	                main:493]	:	INFO	:	Preparing Dataset
[2020-11-25 01:45:08	load_hdf5_ds_into_tensor:28]	:	INFO	:	Loading Dataset /Xt from ../../projects/www.mlcathome.org_mlcathome/ParityModified-train-val-dataset.hdf5 into memory
HDF5: infinite loop closing library
      L,D_top,S_top,T_top,F,P,P,Z,FD,E,SL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL,FL

</stderr_txt>
]]>


©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)