Single-event upset

{{Short description|Change of state caused by an ionizing particle}}

{{More footnotes needed|date=August 2009}}

File:Airbus_A330-303,_Qantas_AN0743607.jpg during Qantas Flight 72 on 7 October 2008 is suspected to have resulted in an aircraft upset that nearly ended in a crash after the computers experienced several malfunctions.{{Citation

| url = https://www.microsemi.com/document-portal/doc_download/130760-neutron-seu-faq

| title = Neutron-Induced Single Event Upset (SEU) FAQ

| publisher = Microsemi Corporation

| access-date = October 7, 2018

| quote = The cause has been traced to errors in an onboard computer suspected to have been induced by cosmic rays.

| ref = none}}]]

A single-event upset (SEU), also known as a single-event error (SEE), is a change of state caused by one single ionizing particle (e.g. ions, electrons, photons) striking a sensitive node in a live micro-electronic device, such as in a microprocessor, semiconductor memory, or power transistors. The state change is a result of the free charge created by ionization in or close to an important node of a logic element (e.g. memory "bit"). The error in device output or operation caused as a result of the strike is called an SEU or a soft error.

The SEU itself is not considered permanently damaging to the transistors' or circuits' functionality, unlike the case of single-event latch-up (SEL), single-event gate rupture (SEGR), or single-event burnout (SEB). These are all examples of a general class of radiation effects in electronic devices called single-event effects (SEEs).

History

Single-event upsets were first described during above-ground nuclear testing, from 1954 to 1957, when many anomalies were observed in electronic monitoring equipment. Further problems were observed in space electronics during the 1960s, although it was difficult to separate soft failures from other forms of interference. In 1972, a Hughes satellite experienced an upset where the communication with the satellite was lost for 96 seconds and then recaptured. Scientists Dr. Edward C. Smith, Al Holman, and Dr. Dan Binder explained the anomaly as a single-event upset (SEU) and published the first SEU paper in the IEEE Transactions on Nuclear Science journal in 1975.{{Cite journal|last=Binder, Smith, Holman|date=1975|title=Satellite Anomalies from Galactic Cosmic Rays|journal=IEEE Transactions on Nuclear Science|volume=NS-22, No. 6|issue=6|pages=2675–2680|doi=10.1109/TNS.1975.4328188|bibcode=1975ITNS...22.2675B|s2cid=3032512|via=IEEE Explore}} In 1978, the first evidence of soft errors from alpha particles in packaging materials was described by Timothy C. May and M.H. Woods. In 1979, James Ziegler of IBM, along with W. Lanford of Yale, first described the mechanism whereby a sea-level cosmic ray could cause a single-event upset in electronics. 1979 also saw the world's first heavy ion "single-event effects" test at a particle accelerator facility, conducted at Lawrence Berkeley National Laboratory's [https://cyclotron.lbl.gov/home 88-Inch Cyclotron] and Bevatron.Petersen, Koga, Shoga, Pickel, & Price (2013). "The Single Event Revolution". IEEE Transactions on Nuclear Science. Vol. 60, No.3.

Cause

Terrestrial SEUs arise due to cosmic particles colliding with atoms in the atmosphere, creating cascades or showers of neutrons and protons, which in turn may interact with electronic circuits. At deep sub-micron geometries, this affects semiconductor devices in the atmosphere.

In space, high-energy ionizing particles exist as part of the natural background, referred to as galactic cosmic rays (GCRs). Solar particle events and high-energy protons trapped in the Earth's magnetosphere (Van Allen radiation belts) exacerbate this problem. The high energies associated with the phenomenon in the space particle environment generally render increased spacecraft shielding useless in terms of eliminating SEUs and catastrophic single-event phenomena (e.g. destructive latch-up). Secondary atmospheric neutrons generated by cosmic rays can also have sufficiently high energy for producing SEUs in electronics on aircraft flights over the poles or at high altitudes. Trace amounts of radioactive elements in chip packages also lead to SEUs.

Testing for SEU sensitivity

The sensitivity of a device to SEU can be empirically estimated by placing a test device in a particle stream at a cyclotron or other particle accelerator facility. This particular test methodology is especially useful for predicting the SER (soft error rate) in known space environments but can be problematic for estimating terrestrial SER from neutrons. In this case, a large number of parts must be evaluated, possibly at different altitudes, to find the actual rate of upset.

Another way to empirically estimate SEU tolerance is to use a chamber shielded from radiation, with a known radiation source, such as Caesium-137.

When testing microprocessors for SEU, the software used to exercise the device must also be evaluated to determine which sections of the device were activated when SEUs occurred.

SEUs and circuit design

{{Main|Radiation hardening}}

By definition, SEUs do not destroy the circuits involved, but they can cause errors. In space-based microprocessors, one of the most vulnerable portions is often the 1st and 2nd-level cache memories, because these must be very small and have very high speed, which means that they do not hold much charge. Often these caches are disabled if terrestrial designs are being configured to survive SEUs. Another point of vulnerability is the state machine in the microprocessor control, because of the risk of entering "dead" states (with no exits), however, these circuits must drive the entire processor, so they have relatively large transistors to provide relatively large electric currents and are not as vulnerable as one might think. Another vulnerable processor component is RAM, and more specifically static RAM (SRAM) used in cache memories. SRAM memories are usually designed with transistor sizes close to the minimum allowed by technology to allocate the maximum number of bits per unit area. Small transistor sizes and high bit density make memories one of the most susceptible components to SEUs.{{Cite journal |last1=Torrens |first1=G. |last2=Alheyasat |first2=A. |last3=Alorda |first3=B. |last4=Barcelo |first4=S. |last5=Segura |first5=J. |last6=Bota |first6=S. A. |year=2020 |title=Transistor Width Effect on the Power Supply Voltage Dependence of α-SER in CMOS 6T SRAM |url=https://ieeexplore.ieee.org/document/9047908 |journal=IEEE Transactions on Nuclear Science |volume=67 |issue=5 |pages=811–817 |doi=10.1109/TNS.2020.2983586 |bibcode=2020ITNS...67..811T |s2cid=216198845 |issn=0018-9499}} To ensure resilience to SEUs, often an error correcting memory is used, together with circuitry to periodically read (leading to correction) or scrub (if reading does not lead to correction) the memory of errors, before the errors overwhelm the error-correcting circuitry.

In digital and analog circuits, a single event may cause one or more voltages pulses (i.e. glitches) to propagate through the circuit, in which case it is referred to as a single-event transient (SET). Since the propagating pulse is not technically a change of "state" as in a memory SEU, one should differentiate between SET and SEU. If a SET propagates through digital circuitry and results in an incorrect value being latched in a sequential logic unit, it is then considered an SEU.

Hardware problems can also occur for related reasons. Under certain circumstances (of both circuit design, process design, and particle properties) a "parasitic" thyristor inherent to CMOS designs can be activated, effectively causing an apparent short-circuit from power to ground. This condition is referred to as latch-up, and in absence of constructional countermeasures, often destroys the device due to thermal runaway. Most manufacturers design to prevent latch-up and test their products to ensure that latch-up does not occur from atmospheric particle strikes. In order to prevent latch-up in space, epitaxial substrates, silicon on insulator (SOI) or silicon on sapphire (SOS) are often used to further reduce or eliminate the susceptibility.

Notable SEU

  • In the 2003 elections in Brussels's municipality Schaerbeek (Belgium), an anomalous recorded number of votes triggered an investigation that concluded an SEU was responsible for giving a candidate named Maria Vindevoghel 4,096 extra votes. The possibility of a single-event upset is suggested by the difference in votes being equivalent to a power of two, {{math|212}}.{{cite news |url=https://www.independent.co.uk/news/science/subatomic-particles-cosmic-rays-computers-change-elections-planes-autopilot-a7584616.html|title=Cosmic particles can change elections and cause planes to fall through the sky, scientists warn|access-date=5 September 2018 |date=17 February 2017 |author=Ian Johnston|work=Independent}}
  • On October 7, 2008, Qantas Flight 72 at 37,000 feet, one of the plane's three air data inertial reference units had a failure, causing incorrect data to be sent to the plane's flight control systems. This caused pitch-downs and caused severe injuries to crew and passengers. All potential causes were found to be "unlikely," or "very unlikely," except for an SEU, whose likelihood couldn't be estimated.[https://web.archive.org/web/20210210130840/https://www.lanl.gov/science/NSS/issue1_2012/story4full.shtml The Invisible Neutron Threat (2012), Target 4 Flight Path 30L Publications, Los Alamos National Laboratory]

See also

References

Further reading

;General SEU

  • T.C. May and M.H. Woods, IEEE Trans Electron Devices ED-26, 2 (1979)
  • [http://www.seutest.com www.seutest.com] - Soft-error testing resources to support the JEDEC JESD89A test protocol.
  • J. F. Ziegler and W. A. Lanford, "Effect of Cosmic Rays on Computer Memories", Science, 206, 776 (1979)
  • [http://www.research.ibm.com/journal/rd40-1.html Ziegler, et al. IBM Journal of Research and Development. Vol. 40, 1 (1996)].
  • [http://radhome.gsfc.nasa.gov/radhome/see.htm NASA Introduction to SEU] from Goddard Space Flight Center Radiation Effects Facility
  • [http://adsabs.harvard.edu/cgi-bin/nph-abs_connect NASA/Smithsonian abstract search].
  • "Estimating Rates of Single-Event Upsets", J. Zoutendyk, NASA Tech Brief, Vol. 12, No. 10, item #152, Nov. 1988.
  • [https://web.archive.org/web/20050316074002/http://www.boeing.com/assocproducts/radiationlab/publications/ Boeing Radiation Effects Laboratory, focussed on Avionics]
  • [http://www.usenix.org/events/usenix07/tech/li.html A Memory Soft Error Measurement on Production Systems, 2007 USENIX Annual Technical Conference, pp. 275–280]
  • [https://ieeexplore.ieee.org/xpls/icp.jsp?arnumber=6187516 A Highly Reliable SEU Hardened Latch and High-Performance SEU Hardened Flip-Flop, International Symposium on Quality Electronic Design (ISQED), California, USA, March 19--21, 2012]

;SEU in programmable logic devices

  • "Single-Event Upsets: Should I Worry?" Xilinx Corp.
  • "Virtex-4: Soft Errors Reduced by Nearly Half!" A. Lesea, Xilinx TecXclusive, 6 May 2005.
  • [https://web.archive.org/web/20050310163239/http://www.altera.com/products/devices/stratix/features/stx-seu.html Single Event Upsets] Altera Corp.
  • [http://www.rcnp.osaka-u.ac.jp/~annurep/2001/genkou/sec3/kobayashi.pdf Evaluation of LSI Soft Errors Induced by Terrestrial Cosmic rays and Alpha Particles] - H. Kobayashi, K. Shiraishi, H. Tsuchiya, H. Usuki (all of Sony), and Y. Nagai, K. Takahisa (Osaka University), 2001.
  • [http://scholarsarchive.byu.edu/etd/521/ SEU-Induced Persistent Error Propagation in FPGAs] K. Morgan (Brigham Young University), Aug. 2006.
  • [http://www.microsemi.com/products/fpga-soc/reliability/see Microsemi neutron immune FPGA technology.]

;SEU in microprocessors

  • Elder, J.H.; Osborn, J.; Kolasinski, W. A.; "A method for characterizing a microprocessor's vulnerability to SEU", IEEE Transactions on Nuclear Science, Dec 1988 v 35 n 6.
  • [http://crc.stanford.edu/crc_papers/CRC-TR-01-4.pdf SEU Characterization of Digital Circuits Using Weighted Test Programs]
  • [https://web.archive.org/web/20060103143222/http://www-hpc.jpl.nasa.gov/PEP/pls/papers/Fault_analysis.pdf Analysis of Application Behavior During Fault Injection]
  • [https://web.archive.org/web/20051018163723/http://flightlinux.gsfc.nasa.gov/docs/Target_Arch_Report.html Flight Linux Project]

;SEU related masters theses and doctoral dissertations

  • {{cite thesis | author=R. Islam| title=High-speed Energy-efficient Soft Error Tolerant Flip-flops| publisher=Concordia University (M. A. Sc. Thesis) | year=2011 | url=http://spectrum.library.concordia.ca/15130/| type=masters}}
  • {{cite book | author=T. Z. Fullem| title=Radiation detection using single event upsets in memory chips| publisher=Binghamton University (M. S. Thesis) | year=2006 | isbn=978-0-542-78484-2| id={{ProQuest|304928976}}}}
  • {{cite thesis | author=C. L. Howe| title=Radiation-induced energy deposition and single event upset error rates in scaled microelectronic structures | publisher=Vanderbilt University (M. S. Thesis) | year=2005 | url= http://etd.library.vanderbilt.edu/ETD-db/available/etd-11022005-161757/| type=thesis }}
  • {{cite book | author=J. A. Thompson| title=Design, Construction and Programming of a Microcontroller-Based Testbench Suitable for Radiation Testing of Microelectronic Circuits| publisher=Naval Postgraduate School (M. S. Thesis) | year=1997 | url= http://stinet.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA333415| archive-url= https://web.archive.org/web/20070930200558/http://stinet.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA333415| url-status= dead| archive-date= September 30, 2007}}
  • {{cite book | author=D. R. Roth| title=The role of charge collection in the single event upset| publisher=Clemson University (M. S. Thesis) | year=1991 }}
  • {{cite book | author=A. G. Costantine| title=An Advanced Single Event Upset Tester| publisher=Rensselaer Polytechnic Institute (Ph. D Thesis) | year=1990 }}

Category:Digital electronics

*