DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1
-
I now have DCS up and running - this is something in config.g that it doesn't like and is bombing out. I looked at line 182 in updater.cs and it is immediately after it parses the fans. I have one fan defined as an output to enable a relay for lighting, this fan is not associated with anything - nor will it ever be - needed an on off output I could fire at will and this fitted the bill, I wonder if this is the trigger for the failure - it can't associate it with anything ???
Q : Is it possible to add an 'output' type that is neither a heater nor a fan to RRF3
I have almost my entire config.g commented out and will enable stuff line by line until it breaks again ..
My system.g is below
After uncommenting a line the reset takes an eternity, it works but it is a cause for concern - way too slow resetting
-
@Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:
Q : Is it possible to add an 'output' type that is neither a heater nor a fan to RRF3
look at M42
-
So here is a 'partial' result.
DCS does not fail gracefully if it encounters something in .g files that it can't parse for whatever reason - it crashes and doesn't recover.
Sorry boys - needs to be a whole lot more friendly than this, errors should be reported and ignored, error trapping needs work.
Loss of DCS should not prevent the web server functioning - or how can you fix whatever the problem is. The web portal should report the loss of DCS, should report problems encountered parsing .g files - including the line concerned but should remain accessible to allow .g file edits / fixes. If you need DCS to make the edits then sort out the error trapping but might want to review the architecture in that case.
My config-override.g is currently empty .... this too blows DCS up ...
Apr 27 21:22:57 duet3 DuetControlServer[5864]: [info] Settings loaded Apr 27 21:22:58 duet3 DuetControlServer[5864]: [info] Environment initialized Apr 27 21:22:58 duet3 DuetControlServer[5864]: [info] Connection to Duet established Apr 27 21:22:58 duet3 DuetControlServer[5864]: [info] IPC socket created at /var/run/dsf/dcs.sock Apr 27 21:22:58 duet3 DuetControlServer[5864]: [info] Starting macro file config.g on channel Trigger Apr 27 21:22:58 duet3 DuetControlServer[5864]: [warn] M307: Heater 0 appears to be over-powered. If left on at full power, its temperature is predicted to reach 875C Apr 27 21:22:59 duet3 DuetControlServer[5864]: [info] Starting macro file config-override.g on channel Trigger Apr 27 21:22:59 duet3 DuetControlServer[5864]: [info] Finished macro file config-override.g Apr 27 21:23:00 duet3 DuetControlServer[5864]: [fatal] Abnormal program termination
-
@Garfield Wow. There is somehting going on in your environment. I can't reproduce either the M950 or config-override.g issues.
Probably a good idea to just go back to a "stable" build and wait for things to shake out a bit more. Thanks for trying anyway.
-
@gtj0 I wish I knew how to just get back to a base I can trust at this stage, was fine at RC6 but it seems that even that is busted now on my box.
I've firmware erased and tried all manner of 'stuff' now, if I comment out certain lines in config.g things keep running ...
Going to blow this back to the stable release but it has problems that I couldn't live with (some of which centred on the PINDA), I also had homing issues - hence I was into the RC's - seems like something is buried in there that erase and re-imaging isn't clearing ...
-
@Garfield Yeah it's frustrating I know. You may want to create an issue in the DSF GitHub repo with that crash anyway. If not, would you mind if I opened an issue on your behalf?
-
Absolutely no issue at all with you adding - I suppose I should create a GIT account .... even then though I need to be sure my own incompetence isn't the cause before logging anything - so I'll still run it by here.
-
@Garfield https://github.com/chrishamm/DuetSoftwareFramework/issues/128
It looks like that crash is in filament monitor code. With your config.g, maybe I can reproduce it later today.
-
Need some place I can send you a complete copy of my backup files (they're over 5 meg)
Here are the logs from tonights blood pressure testing ... if you need more please scream before I wipe this thing again ....
Tried to send only my .g files but system won't accept zip files ...
pause.g homez.g homey.g homex.g homeall.g tpre0.g tpost0.g tfree0.g stop.g sleep.g resume.g
-
@Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:
before I wipe this thing again ....
RRF3.01-RC10 duetsoftwareframework 2.1.1 reprapfirmware 2.1.1-1 RRF3.01-RC9 duetsoftwareframework 2.1.0 reprapfirmware 2.1.0-1 RRF3.01-RC8 duetsoftwareframework 2.0.0 reprapfirmware 2.0.0-1 RRF3.01-RC7 duetsoftwareframework 1.3.2 reprapfirmware 1.3.2-1 RRF3.01-RC6 duetsoftwareframework 1.3.1 reprapfirmware 1.3.1-1 RRF3.01-RC6 duetsoftwareframework 1.3.0 reprapfirmware 1.3.0-1 RRF3.01-RC4 duetsoftwareframework 1.2.5.0 reprapfirmware 1.2.5.0-1 RRF3.0 duetsoftwareframework 1.2.4.0 reprapfirmware 1.2.4.0-1 RRF3.0 duetsoftwareframework 1.2.3.1 reprapfirmware 1.2.3.1-1 RRF3.0 duetsoftwareframework 1.2.3.0 reprapfirmware 1.2.3.0-1 RRF3.0 duetsoftwareframework 1.2.2.1 reprapfirmware 1.2.2.1-1 RRF3.0RC2+1 duetsoftwareframework 1.2.2.0 reprapfirmware 1.2.2.0-1 RRF3.0RC1 duetsoftwareframework 1.2.1.0 reprapfirmware 1.2.1.0-1 RRF3.0RC1 duetsoftwareframework 1.2.0.0 reprapfirmware 1.2.0.0-1 RRF3.0beta11 duetsoftwareframework 1.1.0.5 reprapfirmware 1.1.0.5-1 RRF3.0beta11 duetsoftwareframework 1.1.0.4 reprapfirmware 1.1.0.4-1 RRF3.0beta10+2 duetsoftwareframework 1.1.0.3 reprapfirmware 1.1.0.3-1 RRF3.0beta10+2 duetsoftwareframework 1.1.0.2 reprapfirmware 1.1.0.2-1 RRF3.0beta10+2 duetsoftwareframework 1.1.0.1 reprapfirmware 1.1.0.1-1 RRF3.0beta10+2 duetsoftwareframework 1.1.0.0 reprapfirmware 1.1.0.0-1 RRF3.0beta10+1 duetsoftwareframework 1.0.4.1 reprapfirmware 1.0.4.1-1
https://forum.duet3d.com/post/149932 <-- quasi shortcut for installing a given DSF
edit: caveat emptor - full refund no questions asked if not correct...
(code is fugly as f, but shared ) -
The gifts keep on coming ... 12 out of 10 - that's 2 cases of beer I owe you ...
-
@Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:
The gifts keep on coming ... 12 out of 10 - that's 2 cases of beer I owe you ...
haha, lets see if anything catches fire first - and i suspect you need that beer more than me atm.
-
@Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:
Could really do with that 'version compatibility' matrix ....
Your wish, and all that. See https://forum.duet3d.com/topic/15928/new-tool-duet-v3-xx-rcx-to-apt-dsf-2-x-x-version-mapping for more info.
Meanwhile a couple of examples:
./DuetVersionsAll.sh Expect this to take several seconds per line of output. duetsoftwareframework 2.1.1 contains reprapfirmware version 2.1.1-1 which is internal version 3.01-RC10 duetsoftwareframework 2.1.0 contains reprapfirmware version 2.1.0-1 which is internal version 3.01-RC9 duetsoftwareframework 2.0.0 contains reprapfirmware version 2.0.0-1 which is internal version 3.01-RC8 duetsoftwareframework 1.3.2 contains reprapfirmware version 1.3.2-1 which is internal version 3.01-RC7 duetsoftwareframework 1.3.1 contains reprapfirmware version 1.3.1-1 which is internal version 3.01-RC6 ...snip...
./DuetVersion.sh Highest Available DuetSoftwareFramework = 2.1.1 Currently Installed DuetSoftwareFramework = 2.1.1 Dependencies for DSF version 2.1.1 are: duetcontrolserver (= 2.1.1) duetsd (= 1.0.6) duettools (= 2.1.1) duetwebserver (= 2.1.0) duetwebcontrol (= 2.1.5) reprapfirmware (>= 2.1.1-1) reprapfirmware (<= 2.1.1-999) ) reprapfirmware apt version 2.1.1-1 is internal version 3.01-RC10
./DuetVersion.sh 22 Highest Available DuetSoftwareFramework = 2.1.1 Currently Installed DuetSoftwareFramework = 2.1.1 Command line arg specified to request Release -22 from highest available. Highest-22 is DuetSoftwareFramework = 1.0.4.1 Dependencies for DSF version 1.0.4.1 are: duetcontrolserver (= 1.0.4.1) duetsd (= 1.0.3) duettools (= 1.0.4.1) duetwebserver (= 1.1.1.0) duetwebcontrol (= 2.0.0-5) reprapfirmware (>= 1.0.4.1-1) reprapfirmware (<= 1.0.4.1-999) ) reprapfirmware apt version 1.0.4.1-1 is internal version 3.0beta10+1
The '22' argument is asking about the release 22 lines "back" from highest available.
-
@Garfield I can reproduce your crash. It's the M591 command for the filament sensor. I'll add more info to the GitHub issue.
-
Data Point: 2.5 hour print successful, nothing else running on the Pi, most of the time disconnected from DWC.
Edit. Several more data points:
It crashed just sitting there. I only noticed after I clicked the "jobs" tab in DWC, and the file listing was blank, this was maybe an hour after the print mentioned above finished. Eventually got a "Error retrieving file list" pop up. Existing SSH session was dead. New SSH session will not connect.
I had done a "emergency stop" about an hour before the above. DWC did reconnect after that, in fact I homed the printer. Of course, that would not affect SSH, and it did not at that time.
So... No way in... Power cycle coming up.
Power cycle #1 did not bring it back. Absolutely nothing on DWC and SSH won't connect, IP of Pi does not respond to ping. HDMI and USB console shows Chromium with DWC up, but will not show, nor move, a mouse cursor. Totally hung, even on the hardware console. (The HDMI/USB was "hot plugged" after boot; this has always worked in the past.)
But interesting that it booted to Chromium and DWC before it hung. The green popup "Connected to..." was still there. It should have timed out and gone away.
Power Cycle #2 worked. DWC and SSH fine. Going to start another print.
-
-
@Danal
Yup, that sounds in-line with the sort of things I'm seeing. I wonder if your first failed power cycle was temperature related? The first time I left the crash unnoticed it cooked the Pis CPU - I noticed the smell of something warm before realising it had all crashed. After this it took a couple of resets to get things to boot normally again.
I've had occasions where it'll boot and I can SSH in with a couple of simultaneous sessions but as soon as I click connect on the DWC session on the Pi (chrome and DWC auto-load on boot), then it'll cause the crash sometimes.@gtj0 said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:
@Garfield I can reproduce your crash. It's the M591 command for the filament sensor. I'll add more info to the GitHub issue.
Which filament sensor do you use? I have a laser one that I disconnected a few versions back as it caused another bug and because I never managed to get useful data from it. I might try re-connecting to see if it's the same bug I was seeing before as the current descriptions sound very similar.
-
I use the Duet magnetic one, has worked pretty well up till now although the data reported wasn't always 'accurate'
-
Data point: Printing for 4.5 hours as I type this.
Filament Sensor: None.
Also, not that it should make any difference, it is a toolchanger.
-
@ChrisP Thanks for reporting this, I've been able to reproduce it and it looks like this is related to
CPUSchedulingPolicy=fifo
. Once that line is removed from/usr/lib/systemd/system/duetcontrolserver.service
and the Pi is restarted, the problem does not seem to occur any more. So I'll remove that particular line in the next version again.@Garfield I could reproduce your problem, too, and it will be fixed in the next DSF version. This error won't occur again in future versions. Sorry for the inconvenience.
I'm happy to help but please keep in mind that the
unstable
package feed is primarily for testing purposes and that bugs can occur - even though I perfectly understand they can be quite annoying. So thanks again to everyone testing it and reporting issues.