gpu computing advances in 3d electromagnetic simulation
TRANSCRIPT
![Page 1: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/1.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Session S0069:
GPU Computing Advances in 3D
Electromagnetic Simulation
Andreas Buhr, Alexander Langwost, Fabrizio Zanella
CST (Computer Simulation Technology)
![Page 2: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/2.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Abstract
Computer Simulation Technology (CST) has been providing GPU acceleration for its
3D Full Wave electromagnetic field simulation tools for several years. The latest
version of CST Studio Suite supports the full range of Tesla products on both
Windows and Linux operating systems.
Using GPU, multi-GPU and MPI-GPU Computing drastically reduces the simulation
times for CST customers. We will provide a status of current and future GPU
developments at CST and share detailed simulation results.
![Page 3: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/3.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
1992
Foundation of CST
Commercialization of MAFIA (FIT)
1998
CST MICROWAVE STUDIO® — PERFECT
BOUNDARY APPROXIMATION (PBA)®
2005
Complete Technology for 3D EM
2011
System Assembly and Modeling (SAM)
CST Milestones
CST Market (excluding CST)
![Page 4: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/4.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
CST Worldwide
![Page 5: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/5.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
CST Customers
![Page 6: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/6.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
In 2007 began support of nVidia GPU acceleration in our main
product, the MWS Time Domain Solver (FIT)
Between 2008-2010 added multiple GPU and MPI+GPU support for
the FIT solver
In 2012, we added GPU acceleration support for three other solvers:
Particle in Cell Solver
Integral Equation Direct Solver
TLM Solver
CST GPU Computing Update
![Page 7: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/7.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
• The PIC solver calculates the electromagnetic field by integrating
Maxwell’s equations
• At the same time, it calculates trajectories of particles through the
calculated field.
Particle in Cell Solver: Introduction
![Page 8: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/8.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Magnetron Benchmark
Number of Meshcells 1,610,280
Av. Particle Number 1.39e4
Time CPU 36h 40m 07s
Time GPU 9h 05m 48s
Total Speed Up 4.03
![Page 9: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/9.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Particle in Cell: Core Benchmarks
0
1
2
3
4
5
6
7
8
9
10
0 10 20 30 40
Speedup
Number of Meshcells
Millions
PIC Solver Loop Speedup
0
1
2
3
4
5
6
7
0 10 20 30 40
Speedup
Number of Meshcells
Millions
PIC Total Speedup
Comparison: nVidia Tesla C2050 vs. 2x Intel XEON E5620 @ 2.4 GHz (8 cores total)
![Page 10: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/10.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
• Boundary Element Solver
-> works on surface mesh
• Frequency Domain
• Targeted at electrically large & mid-
size structures (≥ 10 λ)
• Generates a complex dense matrix to
be factorized
Integral Equation Solver: Introduction
![Page 11: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/11.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Dielectric Lens in Ku/K-Band
14 - 28 GHz, Lens diameter=60mm
Lens: eps=3, thickness=12.2mm
Surface cells: 16k
SAsw 500
Simulation time w/ Tesla C2075: 69min
Memory used : 11 GB
Diam. 60mm
![Page 12: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/12.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Integral Equation Solver: Core Benchmarks
0
2
4
6
8
10
12
14
16
0 10 20 30 40 50 60 70
Speedup
Number of Unknowns
Thousands
I-Solver Core Speedup
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
0 10 20 30 40 50 60 70
Speedup
Number of Unknowns
Thousands
I-Solver Total Speedup
1x C2050
2x C2050
4x C2050
Comparison: nVidia Tesla C2050 vs. 2x Intel XEON E5620 @ 2.4 GHz (8 cores total)
![Page 13: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/13.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Broadband Transient Solver
Special aperture and wire modeling (EMC/EMI)
Octree localized mesh
Compact models: vents, seams, slots, shielded cables
TLM Solver: Introduction
Octree mesh SCN
![Page 14: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/14.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
• Rectangular waveguide horn antenna with parabolic reflector
• 2m diameter dish @ 3.5GHz
• Approximately 23 wavelengths
TLM GPU Speedup
2012 2012 GPU
Time 20:09 4:35
Speedup 4.4
![Page 15: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/15.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Satellite system benchmark
Shielded TWP
cable
GPU Tesla 2050 solve time 45min
8-core Westmere-EX CPU 180min: 4X speedup
Overlapping joint
compact seam
with 24 segments
Cable connects
to 3D structure
![Page 16: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/16.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Time Domain (FIT) Solver
TST
PBA
• Transient, low memory, broadband solution
• Conformal meshing (PBA, TST)
• Support for multi-GPU, MPI+GPU
![Page 17: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/17.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
PCB with USB connector/cable
20M mesh cells
12 Westmere-EX core Solver Loop=7840s
2xM2070 Solver Loop=1596s
Solver Loop Speedup = 5x
![Page 18: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/18.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
GPU Computing – Typical Performance Speedup o
f th
e s
olv
er
loop
The features which need the
largest amount of memory on
the GPU are:
• dispersive materials
• lossy metal
• open boundaries
Model Sizes (Number of Mesh Cells)
CPU Performance
Model too large Swapping Sm
all M
odel
![Page 19: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/19.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
GPU Computing – Typical Performance
0
5
10
15
20
25
1 GPU (Tesla 10) 2 GPU (Tesla 10) 4 GPU (Tesla 10) 8 GPU (Tesla 10) 1 GPU (Tesla 20) 4 GPU (Tesla 20)
Speedup
Configuration
Mean Speedup of Solver Loop (compared to dual Intel Xeon X5550, fastest memory configuration)
![Page 20: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/20.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Time Domain Core Benchmarks
Comparison: nVidia Tesla C2050 vs. 2x Intel XEON E5620 @ 2.4 GHz (8 cores total)
0
2
4
6
8
10
12
14
16
18
20
0 20 40 60 80 100
Speedup
Number of Meshcells
Millions
T Solver Core Speedup
0
2
4
6
8
10
12
14
16
0 20 40 60 80 100
Speedup
Number of Meshcells
Millions
T Solver Total Speedup
1x C2050
2x C2050
4x C2050
![Page 21: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/21.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Complex Package Benchmark (SI)
50 million mesh cells
Very high complexity
CPU (2x Quad Core Intel
Xeon E5530, 2.4 GHz) 4x Tesla 10 GPUs 4x Tesla 20 GPUs
Solver Loop Time/s 39467 1662 990
Solver Loop Speedup 1 23.7 39.8
![Page 22: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/22.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
Passenger Aircraft Benchmark: MPI+GPU
146M mesh cells, (4) MPI+GPU nodes, (2) Tesla C1060 per node
Total simulation time = 8 hours
![Page 23: GPU Computing Advances in 3D Electromagnetic Simulation](https://reader030.vdocuments.net/reader030/viewer/2022011817/61d55772d248fa6c6a0c1986/html5/thumbnails/23.jpg)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12
• Significant performance improvement using GPU
Computing
• CST GPU solvers (PIC, TLM, IE, TD) available for
various applications
• Support of up to 8 GPUs/host for TD, IE Solvers;
Cluster MPI+GPU for TD Solver
Conclusion