quantum-espresso/test-suite/pw_workflow_exx_nscf/benchmark.out.git.inp=ncpp-...

565 lines
22 KiB
Plaintext

Program PWSCF v.7.1 starts on 24Jan2023 at 15: 5:36
This program is part of the open-source Quantum ESPRESSO suite
for quantum simulation of materials; please cite
"P. Giannozzi et al., J. Phys.:Condens. Matter 21 395502 (2009);
"P. Giannozzi et al., J. Phys.:Condens. Matter 29 465901 (2017);
"P. Giannozzi et al., J. Chem. Phys. 152 154105 (2020);
URL http://www.quantum-espresso.org",
in publications or presentations arising from this work. More details at
http://www.quantum-espresso.org/quote
Parallel version (MPI & OpenMP), running on 4 processor cores
Number of MPI processes: 4
Threads/MPI process: 1
MPI processes distributed on 1 nodes
24275 MiB available memory on the printing compute node when the environment starts
Reading input from ncpp-gamma-restart-1.in
Current dimensions of program PWSCF are:
Max number of different atomic species (ntypx) = 10
Max number of k-points (npk) = 40000
Max angular momentum in pseudopotentials (lmaxx) = 4
IMPORTANT: XC functional enforced from input :
Exchange-correlation= B3LYP
( 7 12 9 7 0 0 0)
EXX-fraction = 0.20
Any further DFT definition will be discarded
Please, verify this is what you really want
R & G space division: proc/nbgrp/npool/nimage = 4
Subspace diagonalization in iterative solution of the eigenvalue problem:
a serial algorithm will be used
EXX: q-point mesh: 1 1 1
EXX: grid of k+q points same as grid of k-points
Parallelization info
--------------------
sticks: dense smooth PW G-vecs: dense smooth PW
Min 1086 1086 271 39805 39805 4984
Max 1087 1087 272 39806 39806 4985
Sum 4345 4345 1087 159221 159221 19939
Using Slab Decomposition
bravais-lattice index = 0
lattice parameter (alat) = 10.3290 a.u.
unit-cell volume = 1645.9647 (a.u.)^3
number of atoms/cell = 4
number of atomic types = 2
number of electrons = 8.00
number of Kohn-Sham states= 4
kinetic-energy cutoff = 80.0000 Ry
charge density cutoff = 320.0000 Ry
cutoff for Fock operator = 320.0000 Ry
scf convergence threshold = 1.0E-08
mixing beta = 0.7000
number of iterations used = 8 plain mixing
Exchange-correlation= B3LYP
( 7 12 9 7 0 0 0)
EXX-fraction = 0.20
celldm(1)= 10.329039 celldm(2)= 0.000000 celldm(3)= 0.000000
celldm(4)= 0.000000 celldm(5)= 0.000000 celldm(6)= 0.000000
crystal axes: (cart. coord. in units of alat)
a(1) = ( 0.999999 0.000000 -0.001183 )
a(2) = ( 0.000000 1.599049 0.000000 )
a(3) = ( -0.026738 0.000000 0.934101 )
reciprocal axes: (cart. coord. in units 2 pi/alat)
b(1) = ( 1.000035 -0.000000 0.028626 )
b(2) = ( -0.000000 0.625372 -0.000000 )
b(3) = ( 0.001266 -0.000000 1.070584 )
PseudoPot. # 1 for N read from file:
/home/ivan/pseudo/N.blyp-mt.UPF
MD5 check sum: b1a908916a7dc14cdd8c9b4ab1603e73
Pseudo is Norm-conserving, Zval = 5.0
Generated by new atomic code, or converted to UPF format
Using radial grid of 281 points, 1 beta functions with:
l(1) = 0
PseudoPot. # 2 for H read from file:
/home/ivan/pseudo/H.blyp-hgh.UPF
MD5 check sum: c576e69559cde411e611be972c763230
Pseudo is Norm-conserving, Zval = 1.0
Generated in analytical, separable form
Using radial grid of 929 points, 0 beta functions with:
atomic species valence mass pseudopotential
N 5.00 14.00670 N ( 1.00)
H 1.00 1.00800 H ( 1.00)
No symmetry found
s frac. trans.
isym = 1 identity
cryst. s( 1) = ( 1 0 0 )
( 0 1 0 )
( 0 0 1 )
cart. s( 1) = ( 1.0000000 0.0000000 -0.0000000 )
( 0.0000000 1.0000000 0.0000000 )
( -0.0000000 0.0000000 1.0000000 )
Cartesian axes
site n. atom positions (alat units)
1 N tau( 1) = ( -1.0184779 0.2546465 0.0000000 )
2 H tau( 2) = ( -0.8318661 0.2546465 0.0000000 )
3 H tau( 3) = ( -1.0806818 0.3773218 0.1261167 )
4 H tau( 4) = ( -1.0806818 0.0840887 0.0431805 )
Crystallographic axes
site n. atom positions (cryst. coord.)
1 N tau( 1) = ( -1.0185131 0.1592486 -0.0012895 )
2 H tau( 2) = ( -0.8318948 0.1592486 -0.0010532 )
3 H tau( 3) = ( -1.0771090 0.2359663 0.1336502 )
4 H tau( 4) = ( -1.0794831 0.0525867 0.0448601 )
number of k points= 1
cart. coord. in units 2pi/alat
k( 1) = ( 0.0000000 0.0000000 0.0000000), wk = 2.0000000
cryst. coord.
k( 1) = ( 0.0000000 0.0000000 0.0000000), wk = 2.0000000
Dense grid: 159221 G-vectors FFT dimensions: ( 60, 96, 60)
Dynamical RAM for wfc: 0.30 MB
Dynamical RAM for wfc (w. buffer): 0.30 MB
Dynamical RAM for EXX: 17.30 MB
Dynamical RAM for str. fact: 1.21 MB
Dynamical RAM for local pot: 0.00 MB
Dynamical RAM for nlocal pot: 0.08 MB
Dynamical RAM for qrad: 0.03 MB
Dynamical RAM for rho,v,vnew: 3.80 MB
Dynamical RAM for rhoin: 1.27 MB
Dynamical RAM for rho*nmix: 9.72 MB
Dynamical RAM for G-vectors: 2.58 MB
Dynamical RAM for h,s,v(r/c): 0.00 MB
Dynamical RAM for <psi|beta>: 0.00 MB
Dynamical RAM for psi: 0.61 MB
Dynamical RAM for hpsi: 0.61 MB
Dynamical RAM for wfcinit/wfcrot: 1.06 MB
Estimated static dynamical RAM per process > 31.69 MB
Estimated max dynamical RAM per process > 42.62 MB
Estimated total dynamical RAM > 170.48 MB
EXX: ACE will be initialized later
Initial potential from superposition of free atoms
starting charge 8.0000, renormalised to 8.0000
negative rho (up, down): 2.691E-03 0.000E+00
Starting wfcs are 7 randomized atomic wfcs
total cpu time spent up to now is 1.1 secs
Self-consistent Calculation
iteration # 1 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
56 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24201 MiB available memory on the node where the printing process lives
------------------
ethr = 1.00E-02, avg # of iterations = 2.0
negative rho (up, down): 7.691E-04 0.000E+00
total cpu time spent up to now is 1.1 secs
total energy = -23.19359719 Ry
estimated scf accuracy < 0.72301511 Ry
iteration # 2 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
66 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24143 MiB available memory on the node where the printing process lives
------------------
ethr = 9.04E-03, avg # of iterations = 2.0
negative rho (up, down): 3.782E-04 0.000E+00
total cpu time spent up to now is 1.2 secs
total energy = -23.41266487 Ry
estimated scf accuracy < 0.44745247 Ry
iteration # 3 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
67 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24138 MiB available memory on the node where the printing process lives
------------------
ethr = 5.59E-03, avg # of iterations = 2.0
negative rho (up, down): 1.858E-04 0.000E+00
total cpu time spent up to now is 1.3 secs
total energy = -23.47803797 Ry
estimated scf accuracy < 0.00944009 Ry
iteration # 4 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
69 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24132 MiB available memory on the node where the printing process lives
------------------
ethr = 1.18E-04, avg # of iterations = 3.0
total cpu time spent up to now is 1.4 secs
total energy = -23.48165866 Ry
estimated scf accuracy < 0.01308257 Ry
iteration # 5 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
70 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24128 MiB available memory on the node where the printing process lives
------------------
ethr = 1.18E-04, avg # of iterations = 2.0
negative rho (up, down): 2.199E-07 0.000E+00
total cpu time spent up to now is 1.4 secs
total energy = -23.48199207 Ry
estimated scf accuracy < 0.00179013 Ry
iteration # 6 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
71 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24123 MiB available memory on the node where the printing process lives
------------------
ethr = 2.24E-05, avg # of iterations = 2.0
negative rho (up, down): 1.665E-07 0.000E+00
total cpu time spent up to now is 1.5 secs
total energy = -23.48215283 Ry
estimated scf accuracy < 0.00028705 Ry
iteration # 7 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
73 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24118 MiB available memory on the node where the printing process lives
------------------
ethr = 3.59E-06, avg # of iterations = 3.0
total cpu time spent up to now is 1.6 secs
total energy = -23.48227736 Ry
estimated scf accuracy < 0.00002401 Ry
iteration # 8 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
74 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24113 MiB available memory on the node where the printing process lives
------------------
ethr = 3.00E-07, avg # of iterations = 1.0
total cpu time spent up to now is 1.6 secs
total energy = -23.48227649 Ry
estimated scf accuracy < 0.00001048 Ry
iteration # 9 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
75 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24108 MiB available memory on the node where the printing process lives
------------------
ethr = 1.31E-07, avg # of iterations = 1.0
total cpu time spent up to now is 1.7 secs
total energy = -23.48227527 Ry
estimated scf accuracy < 0.00000378 Ry
iteration # 10 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
76 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24103 MiB available memory on the node where the printing process lives
------------------
ethr = 4.72E-08, avg # of iterations = 2.0
total cpu time spent up to now is 1.8 secs
total energy = -23.48227620 Ry
estimated scf accuracy < 0.00000003 Ry
iteration # 11 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
76 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24103 MiB available memory on the node where the printing process lives
------------------
ethr = 3.59E-10, avg # of iterations = 2.0
total cpu time spent up to now is 1.8 secs
End of self-consistent calculation
k = 0.0000 0.0000 0.0000 ( 19939 PWs) bands (ev):
-20.0800 -10.5378 -10.5060 -5.1416
occupation numbers
1.0000 1.0000 1.0000 1.0000
highest occupied level (ev): -5.1416
! total energy = -23.48227618 Ry
estimated scf accuracy < 1.0E-09 Ry
convergence has been achieved in 11 iterations
Using ACE for calculation of exact exchange
EXX grid: 159221 G-vectors FFT dimensions: ( 60, 96, 60)
ACE projected onto 4 (nbndproj) and applied to 4 (nbnd) bands
EXX: now go back to refine exchange calculation
total cpu time spent up to now is 2.0 secs
Self-consistent Calculation
iteration # 1 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
84 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24107 MiB available memory on the node where the printing process lives
------------------
ethr = 3.59E-10, avg # of iterations = 10.0
total cpu time spent up to now is 2.1 secs
total energy = -23.44862404 Ry
estimated scf accuracy < 0.00014417 Ry
iteration # 2 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
84 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24101 MiB available memory on the node where the printing process lives
------------------
ethr = 1.80E-06, avg # of iterations = 1.0
total cpu time spent up to now is 2.1 secs
total energy = -23.44862619 Ry
estimated scf accuracy < 0.00000845 Ry
iteration # 3 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
84 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24096 MiB available memory on the node where the printing process lives
------------------
ethr = 1.06E-07, avg # of iterations = 2.0
total cpu time spent up to now is 2.2 secs
total energy = -23.44862678 Ry
estimated scf accuracy < 0.00000069 Ry
iteration # 4 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
84 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24090 MiB available memory on the node where the printing process lives
------------------
ethr = 8.63E-09, avg # of iterations = 2.0
total cpu time spent up to now is 2.3 secs
total energy = -23.44862685 Ry
estimated scf accuracy < 0.00000002 Ry
iteration # 5 ecut= 80.00 Ry beta= 0.70
Davidson diagonalization with overlap
---- Real-time Memory Report at c_bands before calling an iterative solver
84 MiB given to the printing process from OS
0 MiB allocation reported by mallinfo(arena+hblkhd)
24085 MiB available memory on the node where the printing process lives
------------------
ethr = 2.16E-10, avg # of iterations = 2.0
total cpu time spent up to now is 2.3 secs
End of self-consistent calculation
k = 0.0000 0.0000 0.0000 ( 19939 PWs) bands (ev):
-22.2696 -11.8426 -11.8217 -6.3930
occupation numbers
1.0000 1.0000 1.0000 1.0000
highest occupied level (ev): -6.3930
! total energy = -23.44862686 Ry
estimated scf accuracy < 2.3E-09 Ry
convergence has been achieved in 5 iterations
Using ACE for calculation of exact exchange
ACE projected onto 4 (nbndproj) and applied to 4 (nbnd) bands
total energy = -23.44868007 Ry
Harris-Foulkes estimate = -23.44868007 Ry
est. exchange err (dexx) = 0.00005321 Ry
- averaged Fock potential = 2.67880671 Ry
+ Fock energy (ACE) = -1.34011676 Ry
EXX: now go back to refine exchange calculation
Calculation (EXX) stopped after iteration # 2
Writing all to output data dir ./out/pwscf.save/
init_run : 1.01s CPU 1.03s WALL ( 1 calls)
electrons : 1.05s CPU 1.13s WALL ( 2 calls)
Called by init_run:
aceinit0 : 0.00s CPU 0.00s WALL ( 1 calls)
wfcinit : 0.01s CPU 0.01s WALL ( 1 calls)
wfcinit:atom : 0.00s CPU 0.00s WALL ( 1 calls)
wfcinit:wfcr : 0.01s CPU 0.01s WALL ( 1 calls)
potinit : 0.52s CPU 0.54s WALL ( 1 calls)
hinit0 : 0.44s CPU 0.45s WALL ( 1 calls)
Called by electrons:
c_bands : 0.23s CPU 0.24s WALL ( 16 calls)
sum_band : 0.07s CPU 0.08s WALL ( 16 calls)
v_of_rho : 0.71s CPU 0.76s WALL ( 18 calls)
v_h : 0.03s CPU 0.04s WALL ( 18 calls)
v_xc : 0.68s CPU 0.72s WALL ( 18 calls)
mix_rho : 0.09s CPU 0.10s WALL ( 16 calls)
Called by c_bands:
init_us_2 : 0.01s CPU 0.01s WALL ( 33 calls)
init_us_2:cp : 0.01s CPU 0.01s WALL ( 33 calls)
cegterg : 0.23s CPU 0.24s WALL ( 16 calls)
Called by sum_band:
sum_band:wei : 0.00s CPU 0.00s WALL ( 16 calls)
sum_band:loo : 0.04s CPU 0.04s WALL ( 16 calls)
sum_band:buf : 0.00s CPU 0.00s WALL ( 16 calls)
sum_band:ini : 0.00s CPU 0.00s WALL ( 16 calls)
Called by *egterg:
cdiaghg : 0.00s CPU 0.00s WALL ( 54 calls)
cegterg:over : 0.00s CPU 0.01s WALL ( 39 calls)
cegterg:upda : 0.00s CPU 0.00s WALL ( 39 calls)
cegterg:last : 0.00s CPU 0.00s WALL ( 38 calls)
h_psi : 0.21s CPU 0.22s WALL ( 56 calls)
g_psi : 0.00s CPU 0.00s WALL ( 39 calls)
Called by h_psi:
h_psi:calbec : 0.00s CPU 0.00s WALL ( 56 calls)
vloc_psi : 0.20s CPU 0.21s WALL ( 56 calls)
add_vuspsi : 0.00s CPU 0.00s WALL ( 56 calls)
General routines
calbec : 0.00s CPU 0.00s WALL ( 86 calls)
fft : 0.19s CPU 0.22s WALL ( 176 calls)
ffts : 0.01s CPU 0.02s WALL ( 16 calls)
fftw : 0.19s CPU 0.20s WALL ( 482 calls)
fftc : 0.06s CPU 0.07s WALL ( 64 calls)
fftcw : 0.02s CPU 0.02s WALL ( 24 calls)
Parallel routines
EXX routines
exx_grid : 0.00s CPU 0.00s WALL ( 1 calls)
exxinit : 0.05s CPU 0.07s WALL ( 2 calls)
vexx : 0.11s CPU 0.13s WALL ( 2 calls)
matcalc : 0.00s CPU 0.00s WALL ( 30 calls)
aceupdate : 0.00s CPU 0.00s WALL ( 2 calls)
vexxace : 0.00s CPU 0.00s WALL ( 25 calls)
aceinit : 0.11s CPU 0.13s WALL ( 2 calls)
PWSCF : 2.29s CPU 2.45s WALL
This run was terminated on: 15: 5:38 24Jan2023
=------------------------------------------------------------------------------=
JOB DONE.
=------------------------------------------------------------------------------=