Son Research Lab Computational modeling for Energy/Bio Soft Materials Research

Resources

feature image

Our lab mainly utilizes massively parallel MD simulation package OpenMM using graphical processing units (GPUs). Our lab creates a high-performance supercomputing ecosystem equipped with ~150 GPUs (mostly GeForce 30 series + 32 RTX 4090 !), which enables a single simulation across multiple GPUs, or alternatively run a different simulation on each one at the same time.

GPUs/CPUs cluster

Node GPUs CPUs RAM TFlops (FP32)
15 NVIDIA GeForce RTX 4090 x 10 Intel(R) Xeon(R) Gold 6242R CPU x 40cores 250GB 825.8
18 NVIDIA GeForce RTX 4090 x 6 Intel(R) Xeon(R) Gold 6348 CPU x 112cores 500GB 495.48
19 NVIDIA GeForce RTX 4090 x 6 Intel(R) Xeon(R) Gold 6348 CPU x 112cores 500GB 495.48
20 NVIDIA GeForce RTX 4090 x 10 Intel(R) Xeon(R) Platinum 8360Y CPU x 72cores 500GB 825.8
1 NVIDIA GeForce RTX 3090 x 8 Intel(R) Xeon(R) Gold 6230 CPU x 32cores 125GB 284.64
3 NVIDIA GeForce RTX 3090 x 10 Intel(R) Xeon(R) CPU E5-2680 v4 x 28cores 250GB 355.8
11 NVIDIA GeForce RTX 3090 x 12 Intel(R) Xeon(R) Gold 6242R CPU x 40cores 250GB 426.96
12 NVIDIA GeForce RTX 3090 x 12 Intel(R) Xeon(R) Gold 6242R CPU x 40cores 250GB 426.96
13 NVIDIA GeForce RTX 3090 x 8 Intel(R) Xeon(R) Gold 6230R CPU x 52cores 125GB 284.64
16 NVIDIA GeForce RTX 3090 x 10 Intel(R) Xeon(R) Gold 6242R CPU x 40cores 250GB 355.8
5 NVIDIA GeForce RTX 3080Ti x 8 Intel(R) Xeon(R) Gold 6242R CPU x 40cores 250GB 272.8
6 NVIDIA GeForce RTX 3080Ti x 12 Intel(R) Xeon(R) CPU E5-2699 v4 x 44cores 250GB 409.2
14 NVIDIA GeForce RTX 3080Ti x 12 Intel(R) Xeon(R) Gold 6242R CPU x 40cores 250GB 409.2
17 NVIDIA GeForce RTX 3080Ti x 12 Intel(R) Xeon(R) Gold 6242R CPU x 48cores 250GB 409.2

The theoretical performance of our cluster is estimated to be 6,280 TFlops (note that the entire KISTI-5 Supercomputer Nurion’s theoretical performance is 25,700 Tflops!). These faciltiies turbocharge our research by significantly boosting our simulation speed. For more, see the benchmark (below).

Simulation speed

Benchmarks of simulation speed on typical biochemical systems. Solvents are treated either explicit-RF or explicit-PME (1.2nm for cutoff distance). Langevin / Drude Langevin integrators are employed with a step size of 1 fs. The length of bonds involving hydrogen were constrained, and water molecules were fully rigid. CUDA platform on NVIDIA GeForce RTX 3090/4090 GPUs are used with mixed precision.

feature image

System: NaCl 1M Solution

5nm box, 3512 water molecules, 75 pair of Na+/Cl- ions, total 10686 atoms

Model: TIP3P, SWM4-NDP

feature image

System: Ionic liquid BMIM+/BF4-

8.0nm box, 1600 pair of BMIM+/BF4- ions, total 48000 atoms

Model: SAPT, SAPT_Drude

feature image

System: Dihydrofolate Reductase (DHFR) / JAC benchmark system

6.2nm box, 2489 atoms protein, 7023 water molecules, total 23,558 atoms

Model: amber14ffSB, CHARMM36, CHARMM_Drude2019

feature image

System: Apolipoprotein A1 (ApoA1) / JAC benchmark system

10.8nm box, 392 protein residues, 160 POPC lipids, 21,458 water molecules, total 92,224 atoms

Model: amber14ffSB, CHARMM36, CHARMM_Drude2019