Fujitsu A64FX#Design
{{Short description|Microprocessor designed by Fujitsu}}
{{For|the AMD microprocessor|Athlon 64}}
{{Use dmy dates|date=March 2020}}
{{Infobox CPU
| name = A64FX
| image =
| image_size =
| caption =
| produced-start = 2019
| produced-end =
| slowest =
| fastest =
| slow-unit =
| fast-unit =
| fsb-slowest =
| fsb-fastest =
| fsb-slow-unit =
| fsb-fast-unit =
| size-from = 7 nm
| size-to =
| soldby = Fujitsu
| designfirm = Fujitsu
| manuf1 = TSMC
| core1 =
| sock1 =
| pack1 =
| arch = ARMv8.2-A with SVE and SBSA level 3
| microarch = In-house
| numcores = 48 per CPU plus optional assistant cores
| predecessor = SPARC64 V
}}
The A64FX is a 64-bit ARM architecture microprocessor designed by Fujitsu. The processor is replacing the SPARC64 V as Fujitsu's processor for supercomputer applications. It powers the Fugaku supercomputer, ranked in the TOP500 as the fastest supercomputer in the world from June 2020, until falling to second place behind Frontier in June 2022.{{Cite web |title=June 2022 {{!}} TOP500 |url=https://www.top500.org/lists/top500/2022/06/ |access-date=2023-06-23 |website=www.top500.org}}{{Cite web|title=Outline of the Development of the Supercomputer Fugaku {{!}} RIKEN Center for Computational Science RIKEN Website|url=https://www.r-ccs.riken.jp/en/fugaku/project/outline|access-date=2020-11-18|website=www.r-ccs.riken.jp|archive-date=23 January 2021|archive-url=https://web.archive.org/web/20210123110534/https://www.r-ccs.riken.jp/en/fugaku/project/outline|url-status=dead}}
Design
Fujitsu collaborated with ARM to develop the processor; it is the first processor to use the ARMv8.2-A Scalable Vector Extension SIMD instruction set with 512-bit vector implementation.
It has "Four-operand FMA with Prefix Instruction", i.e. MOVPRFX instruction followed by 3-operand FMA operation (ARM, like RISC in general, is a 3-operand machine, with no space for four operands), which get packed into a single operation in the pipeline. For the processor the designer claim ">90% execution efficiency in (D|S|H)GEMM and INT16/8 dot product".
The processor uses 32 gigabytes of HBM2 memory with a bandwidth of 1 TB per second. The processor contains 16 PCI Express generation 3 lanes to connect to accelerators (hypothetical e.g. GPUs and FPGAs). The processor also integrates a TofuD fabric controller with 10 ports implemented as 20 lanes of high-speed 28 Gbit/s to connect multiple nodes in a cluster. The reported transistor count is about 8.8 billion.
Each A64FX processor has four NUMA nodes, with each NUMA node having 12 compute cores, for a total of 48 cores per processor. Each NUMA node has its own level 2 cache, HBM2 memory, and assistant cores for non-computational purposes.
Fujitsu intends to produce lower specification machines with reduced assistant cores. Reliability, availability and serviceability (RAS) capabilities are claimed, i.e. ~128,400 error checkers in total.
In June 2020 the Fugaku supercomputer using this processor reached 442 petaFLOPS and became the fastest supercomputer in the world.
Implementations
Fujitsu designed the A64FX for the Fugaku. Fugaku held the rank of the fastest supercomputer in the world by TOP500 rankings.{{Cite web|title=Supercomputer Fugaku - Supercomputer Fugaku, A64FX 48C 2.2GHz, Tofu interconnect D {{!}} TOP500|url=https://www.top500.org/system/179807/|access-date=2020-11-18|website=www.top500.org}} Fujitsu intends to sell smaller machines with A64FX processors. Anandtech reported in June 2020 that the cost of a PRIMEHPC FX700 server, with two A64FX nodes, was {{¥|4155330|link=yes}} (c. {{US$|39000|link=yes}}).
Cray is developing supercomputers using the A64FX. The {{nobr|Isambard 2}} supercomputer is being built for a consortium in the United Kingdom, led by the University of Bristol and also including the Met Office, using the Fujitsu processors. It is an upgrade to the Isambard supercomputer which was built with the Marvell ThunderX2, another ARM architecture microprocessor.
[https://www.stonybrook.edu/ookami/ Ookami] is an open testbed system supported by NSF run by Stony Brook University and the University at Buffalo providing researchers access to A64FX processors.
See also
- Comparison of ARMv8-A cores
- SPARC64 V
- ThunderX2{{snd}} another ARM architecture high performance computing microprocessor
- [https://en.wikichip.org/wiki/hisilicon/kunpeng/920-6426 Huawei Kunpeng 920]{{snd}} also an ARM high-performance microprocessor, but developed by the Huawei-owned HiSilicon. Only available in China.
References
{{reflist|refs=
}}
{{Fujitsu}}
{{Application ARM-based chips}}
Category:Computer-related introductions in 2019