128.143.199.96 and Small Download

HendricksSA
Posts: 339
Joined: Fri Jun 26, 2009 4:34 am

Post by HendricksSA »

Sorry for the late report, but I had a Project 8563 WU fail back on the 13th. The core called my machine unstable, but I think the real cause was a partial download of the WU assignment (only 1.54KiB came down). I'm not sure whether the WU itself was bad. Details follow:

Code:

03:17:05:WU01:FS00:Assigned to work server 128.143.199.96
03:17:05:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:48 from 128.143.199.96
03:17:05:WU01:FS00:Connecting to 128.143.199.96:8080
03:17:05:WU01:FS00:Downloading 1.54KiB
03:17:05:WU01:FS00:Download complete
03:17:05:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:8563 run:1 clone:5 gen:445 core:0xa3 unit:0x00000371fbcb017c5203fafcc0597b92
...........................other wu transmitting............................
03:18:21:WU01:FS00:Starting
03:18:21:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/Core_a3.fah/FahCore_a3 -dir 01 -suffix 01 -version 704 -lifeline 1668 -checkpoint 15 -np 48
03:18:21:WU01:FS00:Started FahCore on PID 5613
03:18:21:WU01:FS00:Core PID:5617
03:18:21:WU01:FS00:FahCore 0xa3 started
03:18:22:WU01:FS00:0xa3:
03:18:22:WU01:FS00:0xa3:*------------------------------*
03:18:22:WU01:FS00:0xa3:Folding@Home Gromacs SMP Core
03:18:22:WU01:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
03:18:22:WU01:FS00:0xa3:
03:18:22:WU01:FS00:0xa3:Preparing to commence simulation
03:18:22:WU01:FS00:0xa3:- Looking at optimizations...
03:18:22:WU01:FS00:0xa3:- Created dyn
03:18:22:WU01:FS00:0xa3:- Files status OK
03:18:22:WU01:FS00:0xa3:- Expanded 1064 -> 4096 (decompressed 384.9 percent)
03:18:22:WU01:FS00:0xa3:Called DecompressByteArray: compressed_data_size=1064 data_size=4096, decompressed_data_size=4096 diff=0
03:18:22:WU01:FS00:0xa3:- Digital signature verified
03:18:22:WU01:FS00:0xa3:
03:18:22:WU01:FS00:0xa3:Project: 8563 (Run 1, Clone 5, Gen 445)
03:18:22:WU01:FS00:0xa3:
03:18:22:WU01:FS00:0xa3:Assembly optimizations on if available.
03:18:22:WU01:FS00:0xa3:Entering M.D.
03:18:27:WU00:FS00:Upload 16.69%
03:18:27:WU01:FS00:0xa3:Mapping NT from 48 to 48 
03:18:27:WU01:FS00:0xa3:mdrun returned 255
03:18:27:WU01:FS00:0xa3:Going to send back what have done -- stepsTotalG=0
03:18:27:WU01:FS00:0xa3:Work fraction=0.0000 steps=0.
03:18:31:WU01:FS00:0xa3:logfile size=2228 infoLength=2228 edr=25 trr=1
03:18:31:WU01:FS00:0xa3:logfile size: 2228 info=2228 bed=25 hdr=1
03:18:31:WU01:FS00:0xa3:- Writing 2766 bytes of core data to disk...
03:18:31:WU01:FS00:0xa3:Done: 2254 -> 1167 (compressed to 51.7 percent)
03:18:31:WU01:FS00:0xa3:  ... Done.
03:18:31:WU01:FS00:0xa3:
03:18:31:WU01:FS00:0xa3:Folding@home Core Shutdown: UNSTABLE_MACHINE
03:18:32:WARNING:WU01:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
03:18:32:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:8563 run:1 clone:5 gen:445 core:0xa3 unit:0x00000371fbcb017c5203fafcc0597b92
03:18:32:WU01:FS00:Uploading 1.64KiB to 128.143.199.96
03:18:32:WU01:FS00:Connecting to 128.143.199.96:8080
03:18:33:WU00:FS00:Upload 34.03%
03:18:33:WU02:FS00:Connecting to 171.67.108.200:8080
03:18:33:WU01:FS00:Upload complete
03:18:34:WU01:FS00:Server responded WORK_ACK (400)
03:18:34:WU01:FS00:Cleaning up
03:18:35:WU02:FS00:Assigned to work server 128.143.199.96
03:18:35:WU02:FS00:Requesting new work unit for slot 00: READY cpu:48 from 128.143.199.96
03:18:35:WU02:FS00:Connecting to 128.143.199.96:8080
03:18:38:WU02:FS00:Downloading 3.63MiB
03:18:39:WU00:FS00:Upload 51.36%
03:18:44:WU02:FS00:Download 98.18%
03:18:44:WU02:FS00:Download complete
03:18:44:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8804 run:0 clone:7 gen:409 core:0xa3 unit:0x0000027dfbcb017c5193d39a44d49ea6
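
For anyone who wants to scan their own log for the same pattern (a tiny download followed by a FAULTY result), here is a minimal Python sketch. It assumes the log lines look like the ones above; the function name `find_small_faulty_downloads` and the 100 KiB threshold are my own illustrative choices, not anything defined by the client, and other client versions may format these lines differently.

```python
import re

# Matches lines such as "03:17:05:WU01:FS00:Downloading 1.54KiB"
DOWNLOAD_RE = re.compile(r":(WU\d+):FS\d+:Downloading ([\d.]+)(B|KiB|MiB|GiB)\s*$")

# Matches the result line, e.g.
# "...:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:8563 ..."
RESULT_RE = re.compile(r":(WU\d+):FS\d+:Sending unit results: .*error:(\w+) project:(\d+)")

UNIT_BYTES = {"B": 1, "KiB": 1024, "MiB": 1024**2, "GiB": 1024**3}


def find_small_faulty_downloads(log_lines, min_bytes=100 * 1024):
    """Return (wu_slot, project, download_bytes) for each FAULTY result
    whose most recent download in the same WU slot was below min_bytes.

    min_bytes is an assumed threshold chosen for illustration only.
    """
    last_download = {}  # WU slot label -> size in bytes of its latest download
    suspects = []
    for line in log_lines:
        m = DOWNLOAD_RE.search(line)
        if m:
            wu, size, unit = m.groups()
            last_download[wu] = float(size) * UNIT_BYTES[unit]
            continue
        m = RESULT_RE.search(line)
        if m:
            wu, error, project = m.groups()
            size = last_download.get(wu)
            if error == "FAULTY" and size is not None and size < min_bytes:
                suspects.append((wu, int(project), size))
    return suspects
```

Feeding it the lines of a saved log (for example `open("log.txt").read().splitlines()`, where the file name is just a placeholder) would list each WU slot that fits the pattern. A hit only means the download looks suspiciously small; it cannot tell a truncated transfer from a WU that was bad on the server.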
Joe_H
Site Admin
Posts: 8130
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Studio M1 Max 32 GB smp6
Mac Hack i7-7700K 48 GB smp4
Location: W. MA

Re: 128.143.199.96 and Small Download

Post by Joe_H »

It appears to be a bad WU; no one has successfully processed it. There are no new records in the database since the 13th, so the bad WU appears to have been detected automatically and taken off assignment.