EPICS Controls Argonne National Laboratory

Experimental Physics and
Industrial Control System


Subject: Re: IOC Crash with No Exception Generated
From: Michael Davidsaver <[email protected]>
To: Matt Rippa <[email protected]>
Cc: Talk EPICS Tech <[email protected]>
Date: Wed, 25 Jul 2018 21:55:17 -0700

On 07/25/2018 06:52 PM, Matt Rippa via Tech-talk wrote:
> 
> We commissioned a new RTEMS IOC here in Hawaii. The system ran
> for 2 days, 18 hours and 47 minutes before it simply halted. No exception
> was generated therefore no stack trace was seen. Also no console, no iocsh,
> no log messages. A system reset recovered our system before halting in the
> same manner 2 hours later. This occurred 4 times before the night ended.
> 
> We speculate bad access or stack corruption but without a stack trace
> we're at a complete loss as to how to diagnose this. This *same software
> release* has run for several weeks at our other site with no issues.
>
> Is there a way to force an exception (or stack trace), for example with watchdog?
> 
> Many thanks for your insight!

The only time I can recall truly seeing nothing printed during a crash was
when a hardware fault occurred.  This was accompanied by a red fault LED
on the underside of the cards.

In every other case I can think of, including mistakes I made during driver
development, something was printed.

NSLS2 had a batch of mvme3100 cards suffering from silver creep corrosion,
which is related to the particular lead-free solder process used.  In the
case of the mvme3100, we were told this only involved a small number of boards
assembled in 2011.  We had boards assembled both earlier and later,
which were not affected.

https://www.google.com/search?q=silver+creep+corrosion

This condition is very obvious under small magnification.

https://www.google.com/search?q=silver+creep+corrosion&source=lnms&tbm=isch&sa=X

ANJ, 26 Jul 2018