Task 3009122

Name rand_automata_0010-1605427304-29979-1_1
Workunit 1358126
Created 20 Nov 2020, 7:35:22 UTC
Sent 25 Nov 2020, 13:12:08 UTC
Report deadline 2 Dec 2020, 13:12:08 UTC
Received 25 Nov 2020, 13:22:02 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED
Computer ID 3280
Run time 1 sec
CPU time
Validate state Invalid
Credit 0.00
Device peak FLOPS 4,385.24 GFLOPS
Application version Machine Learning Dataset Generator (GPU) v9.80 (cuda10200)
x86_64-pc-linux-gnu
Peak disk usage 2.98 GB

Stderr output

<core_client_version>7.16.6</core_client_version>
<![CDATA[
<message>
Disk usage limit exceeded</message>
<stderr_txt>
DEBUG: Args: ../../projects/www.mlcathome.org_mlcathome/mlds-gpu_9.80_x86_64-pc-linux-gnu__cuda10200 -c -a LSTM --lr 0.001 -w 64 -b 2 -s 32 --maxepoch 192 
nthreads: 1 gpudev: 0
Re-exec()-ing to set environment correctly
Machine Learning Dataset Generator v9.80 (Linux/x86_64) (libTorch: release/1.7 GPU: GeForce GTX 970)
[2020-11-25 05:20:10	                main:442]	:	INFO	:	Set logging level to 1
[2020-11-25 05:20:10	                main:448]	:	INFO	:	Running in BOINC Client mode
[2020-11-25 05:20:10	                main:451]	:	INFO	:	Resolving all filenames
[2020-11-25 05:20:10	                main:459]	:	INFO	:	Resolved: dataset.hdf5 => ../../projects/www.mlcathome.org_mlcathome/rand_automata_0010-train-val-dataset.hdf5 (exists = 1)
[2020-11-25 05:20:10	                main:459]	:	INFO	:	Resolved: model.cfg => ../../projects/www.mlcathome.org_mlcathome/rand_automata_0010-1605427304-29979-1_1_r1810946161_1 (exists = 0)
[2020-11-25 05:20:10	                main:459]	:	INFO	:	Resolved: model-final.pt => ../../projects/www.mlcathome.org_mlcathome/rand_automata_0010-1605427304-29979-1_1_r1810946161_0 (exists = 0)
[2020-11-25 05:20:10	                main:459]	:	INFO	:	Resolved: model-input.pt => ../../projects/www.mlcathome.org_mlcathome/rand_automata_0010-1605427304-29979-1 (exists = 1)
[2020-11-25 05:20:10	                main:459]	:	INFO	:	Resolved: snapshot.pt => snapshot.pt (exists = 0)
[2020-11-25 05:20:10	                main:479]	:	INFO	:	Dataset filename: ../../projects/www.mlcathome.org_mlcathome/rand_automata_0010-train-val-dataset.hdf5
[2020-11-25 05:20:10	                main:481]	:	INFO	:	Configuration: 
[2020-11-25 05:20:10	                main:482]	:	INFO	:	    Model type: LSTM
[2020-11-25 05:20:10	                main:483]	:	INFO	:	    Validation Loss Threshold: 0.0001
[2020-11-25 05:20:10	                main:484]	:	INFO	:	    Max Epochs: 192
[2020-11-25 05:20:10	                main:485]	:	INFO	:	    Batch Size: 32
[2020-11-25 05:20:10	                main:486]	:	INFO	:	    Learning Rate: 0.001
[2020-11-25 05:20:10	                main:487]	:	INFO	:	    Patience: 10
[2020-11-25 05:20:10	                main:488]	:	INFO	:	    Hidden Width: 64
[2020-11-25 05:20:10	                main:489]	:	INFO	:	    # Recurrent Layers: 4
[2020-11-25 05:20:10	                main:490]	:	INFO	:	    # Backend Layers: 2
[2020-11-25 05:20:10	                main:491]	:	INFO	:	    # Threads: 1
[2020-11-25 05:20:10	                main:493]	:	INFO	:	Preparing Dataset
HDF5: infinite loop closing library
      L,T_top,P,P,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD,FD

</stderr_txt>
]]>


©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)