project:17435 run:0 clone:1286 gen:36 causes FAH service to crash after reaching ~.04% complete. This happed similarly on another work unit Saturday - forcing me to dump the WU. After that, it ran well for a couple days, now it's back to the same problem.
Feb 15 17:33:05 curecoinproject1 kernel: [ 0.194735] acpi PNP0A08:00: _OSC failed (AE_NOT_FOUND); disabling ASPM
Feb 15 17:33:05 curecoinproject1 kernel: [ 1.316545] nvidia: module verification failed: signature and/or required key missing - tainting kernel
Feb 15 17:33:07 curecoinproject1 thermald[947]: THD engine start failed
Feb 15 17:33:15 curecoinproject1 NetworkManager[1116]: nm_device_get_device_type: assertion 'NM_IS_DEVICE (self)' failed
Feb 15 17:33:15 curecoinproject1 NetworkManager[1116]: <warn> [1613435595.8848] failed to enumerate oFono devices: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.ofono was not provided by any .service files
Feb 15 17:33:19 curecoinproject1 nm-dispatcher: req:2 'up' [docker0], "/etc/NetworkManager/dispatcher.d/01ifupdown": complete: failed with Script '/etc/NetworkManager/dispatcher.d/01ifupdown' exited with error status 1.
Feb 15 17:33:19 curecoinproject1 NetworkManager[1116]: <warn> [1613435599.9162] dispatcher: (3) 01ifupdown failed (failed): Script '/etc/NetworkManager/dispatcher.d/01ifupdown' exited with error status 1.
Feb 15 17:33:24 curecoinproject1 nm-dispatcher: req:3 'up' [enp2s0], "/etc/NetworkManager/dispatcher.d/01ifupdown": complete: failed with Script '/etc/NetworkManager/dispatcher.d/01ifupdown' exited with error status 1.
Feb 15 17:33:24 curecoinproject1 NetworkManager[1116]: <warn> [1613435604.1199] dispatcher: (5) 01ifupdown failed (failed): Script '/etc/NetworkManager/dispatcher.d/01ifupdown' exited with error status 1.
Feb 15 17:33:34 curecoinproject1 fwupd[2735]: (fwupd:2735): Fu-WARNING **: FuMain: failed to load AppStream data: Failed to parse /var/cache/app-info/xmls/fwupd.xml file: Error on line 2672: Entity did not end with a semicolon; most likely you used an ampersand character without intending to start an entity - escape ampersand as &
Feb 15 17:33:34 curecoinproject1 fwupd[2735]: (fwupd:2735): Fu-WARNING **: disabling plugin because: failed to coldplug uefi: UEFI firmware updating not supported
Feb 15 17:33:34 curecoinproject1 fwupd[2735]: (fwupd:2735): Fu-WARNING **: disabling plugin because: failed to coldplug raspberrypi: Raspberry PI firmware updating not supported, no /boot/start.elf
Feb 15 17:33:52 curecoinproject1 pulseaudio[1964]: [pulseaudio] bluez5-util.c: GetManagedObjects() failed: org.freedesktop.DBus.Error.TimedOut: Failed to activate service 'org.bluez': timed out
Feb 15 17:34:49 curecoinproject1 pulseaudio[1964]: [pulseaudio] module-x11-bell.c: XkbQueryExtension() failed
Feb 15 17:34:49 curecoinproject1 pulseaudio[1964]: [pulseaudio] module.c: Failed to load module "module-x11-bell" (argument: "display=:10.0 sample=bell.ogg"): initialization failed.
It is strongly recommended to use version 7.6.21 since FahCore_22 has some new arguments that are not supported by the older clients. Thus, it would be nice to simply update the client. Since you have Ubuntu 16, I think it can handle Python 2 without issues so it would be easier to upgrade.
BTW, you can have up-to 16 previous logs in the logs folder by default so you can check the file in there if needed
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time
PantherX wrote:BTW, you can have up-to 16 previous logs in the logs folder by default so you can check the file in there if needed
Thanks - yea, I haven't looked at Linux logs for a while (found them in /var/lib/fahclient/logs) ... looks like a "BAD_FRAME_CHECKSUM" upon restart, and the work unit auto-dumped in this case. Both failures came from project 17435.
Generally speaking, a cause of that could be a faulty disk drive since it is reading the checkpoint data to resume and if it fails the checksum, that's a strong indication that something is off. See if your filesystem is healthy and repair any issues if detected/needed. Also, check to see if your drive (HDD/SSD) are within normal parameters for working.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time