Commit Graph

37 Commits

Author SHA1 Message Date
Ye Luo 18a0fd5843 Remove obsolete ELPA include. 2022-01-18 00:06:20 -06:00
Paolo Giannozzi 5d74f827c4 Remove make.depend files from git 2021-10-24 21:29:29 +02:00
Paolo Giannozzi 771b7da9f1 Cleanup of installation scripts and Makefiles 2021-01-29 15:59:09 +00:00
Paolo Giannozzi 229be57edb Script for dependencies and make.depend updated 2021-01-23 22:32:58 +01:00
Pietro Bonfa 8e13a4b5c2 Always use devxlib now that we have it 2020-10-22 16:51:13 +02:00
Pietro Bonfa 05cc3dac4d Merge branch 'develop' into gpu-develop (first step) 2020-02-23 20:47:07 +01:00
Stefano de Gironcoli 88c1164d06 changes needed to update KS_Solver
-splitting rotate_wfc_* and adding rotate_Hpsi_* into a DENSE diagonalization dir
-removing  cg_param, david_param, ... in favour of util_param
-implementation of ParO
-update of PW, UtilXlib, FFTXlib and install  needed for compatibility
2020-02-17 12:19:53 +01:00
Pietro Bonfa ed83176255 Merge branch 'develop' into gpu-develop 2020-02-07 19:53:18 +01:00
Paolo Giannozzi dee8f970d2 desc_init moved into LAXlib 2020-02-03 11:02:41 +00:00
Pietro Bonfa 4725c3f548 Merge branch 'develop' into gpu-develop 2020-02-02 15:02:57 +01:00
carcava 38f80cfaa1 - use the new descriptor initi subroutine 2020-02-02 00:56:40 +01:00
Paolo Giannozzi 537aecdcd3 LAXlib-related reorganization
=============================
Harmonization of three copies of desc_init (two more are in KS_Solvers/PPCG,
plus two slightly different ones in Davidson diagonalization), with some
changes for clarity (in my opinion); harmonization of various copies of
compute_distmat and of calbec_[dz]distmat.

In my opinion all these routines, plus several simolar ones that are either
present  in multiple copies or that can be easily harmonized, used in parallel
diagonalization, should be moved somewhere else, preferably LAXlib/.
The problem now is that they are CONTAINed so they use and set variables from
the calling subroutine and may use arrays passed as arrays (with :); moving
them to a separate routine requires an interface, moving them into a module may
lead to undesired dependencies. Ideally one should be able to set up and
diagonalize a distributed matrix without filling the code of calls to
 low-level LAXlib routines and without too much voodoo.
2020-01-29 20:05:02 +00:00
Paolo Giannozzi ffd53eb4da Dependencies updated
Two routines in KS_Solvers/PPCG aligned to latest LAXlib changes
2020-01-28 14:58:32 +00:00
Pietro Bonfa 52bbfac655 More devicexlib 2019-12-02 15:15:49 +01:00
Pietro Bonfa 2c8b38d336 Mergin develop with gpu-develop 2019-08-21 19:14:52 +02:00
Carlo Cavazzoni 004301add1 - re-factoring of LAXlib
now QE do not "use" modules of LAXlib any longer, but it just include interface blocks.
  In principle they can now be compiled independently.
  All this beside possible errors.
  Further clean-ups are now possible, within LAXlib and in QE source codes
2019-08-13 01:16:24 +02:00
Carlo Cavazzoni 27adf6d690 - more disentanglement with LAXlib, quite some change inside LAXlib, still few outside.
Next we have to deal with the removal of the use descriptors stuff
2019-08-10 18:49:26 +02:00
Carlo Cavazzoni 5fbc6ecc9c - LAXlib made independent from other module 2019-08-07 14:27:02 +02:00
Pietro Bonfa 284c1cd23e Merge branch 'develop' into gpu-develop 2019-04-01 11:23:33 +02:00
Paolo Giannozzi 6834a502ef [Skip-CI] Obsolete version 'svn' replaced by 'git'; various .PHONY of questionable
usefulness, referring to no longer existing procedure devised for svn, removed
2019-03-01 17:42:56 +01:00
Pietro Bonfa cdcf2699a7 Merge branch 'develop' into gpu-develop 2019-02-05 15:41:15 +01:00
Paolo Giannozzi f725126d3a More minor cleanup: use module "parallel include" in KS_Solvers 2019-02-04 10:07:52 +01:00
Pietro Bonfa a590fef748 Fixed CPU build. Same function call for GPU version of ddot (should be moved elsewhere soon). 2018-11-13 14:49:27 +01:00
Pietro Bonfa 7e2b2c462a Restoring CPU compilation 2018-10-17 12:23:57 +02:00
Pietro Bonfa eebf0236df Initial (naive) implementation of CG diagonalization algorithm. 2018-10-17 11:18:08 +02:00
Pietro Bonfa fa262106f0 Initialization (partially) ported to GPUs 2018-10-09 16:54:13 +02:00
Stefano de Gironcoli a241241d27 updated dependencies 2018-08-05 16:52:11 +02:00
Stefano de Gironcoli d936f16226 export_gstart_2_* and set_mpi_comm_4_* moved to LAXLIB
their call corrected in init_run and mp_global
a recently added bug in ppcg_k when npol=1 corrected
2018-08-05 16:52:11 +02:00
Paolo Giannozzi a06d380cf4 Replicated routine "set_bgrp_index" replaced by "divide" 2017-12-23 22:00:32 +01:00
degironc ae1805bb72 redundant duplicate module constants.f90 removed from KS_Solvers/CG
Mathematical constant PI defined as a local parameter when needed



git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@14010 c92efa57-630b-4861-b058-cf58834340f0
2017-11-25 21:07:59 +00:00
degironc aba852b428 order of input arguments in KS_Solver routines changed
bringing overlap logical flag close to the s_psi function it affects



git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13800 c92efa57-630b-4861-b058-cf58834340f0
2017-08-29 08:09:06 +00:00
degironc 0d2d3d5721 minor estetic change
git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13730 c92efa57-630b-4861-b058-cf58834340f0
2017-08-19 13:30:16 +00:00
degironc a8340b4d40 Duplicate routines cdiaghg and rdiaghg moved from KS_Solvers/XX to LAXlib.
Duplicate module mp_bands.f90 moved from KS_Solvers/XX to UtilXlib/mp_bands_util.f90
Makefiles and makedepend.sh updated
 
that should take care of the duplicate symbols




git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13712 c92efa57-630b-4861-b058-cf58834340f0
2017-08-08 21:44:44 +00:00
degironc 3e6b4f8e76 MAJOR restructuring of the FFTXlib library
In real space processors are organized in a 2D pattern.

Each processor owns data from a sub-set of Z-planes and a sub-set of Y-planes.
In reciprocal space each processor owns Z-columns that belong to a sub set of
X-values. This allows to split the processors in two sets for communication
in the YZ and XY planes.
In alternative, if the situation allows for it, a task group paralelization is used
(with ntg=nyfft) where complete XY planes of ntg wavefunctions are collected and Fourier
trasnformed in G space by different task-groups. This is preferable to the Z-proc + Y-proc
paralleization if task group can be used because a smaller number of larger ammounts of 
data are transferred. Hence three types of fft are implemented: 
 
  !
  !! ... isgn = +-1 : parallel 3d fft for rho and for the potential
  !
  !! ... isgn = +-2 : parallel 3d fft for wavefunctions
  !
  !! ... isgn = +-3 : parallel 3d fft for wavefunctions with task group
  !
  !! ... isgn = +   : G-space to R-space, output = \sum_G f(G)exp(+iG*R)
  !! ...              fft along z using pencils        (cft_1z)
  !! ...              transpose across nodes           (fft_scatter_yz)
  !! ...              fft along y using pencils        (cft_1y)
  !! ...              transpose across nodes           (fft_scatter_xy)
  !! ...              fft along x using pencils        (cft_1x)
  !
  !! ... isgn = -   : R-space to G-space, output = \int_R f(R)exp(-iG*R)/Omega
  !! ...              fft along x using pencils        (cft_1x)
  !! ...              transpose across nodes           (fft_scatter_xy)
  !! ...              fft along y using pencils        (cft_1y)
  !! ...              transpose across nodes           (fft_scatter_yz)
  !! ...              fft along z using pencils        (cft_1z)
  !
  ! If task_group_fft_is_active the FFT acts on a number of wfcs equal to 
  ! dfft%nproc2, the number of Y-sections in which a plane is divided. 
  ! Data are reshuffled by the fft_scatter_tg routine so that each of the 
  ! dfft%nproc2 subgroups (made by dfft%nproc3 procs) deals with whole planes 
  ! of a single wavefunciton.
  !

fft_type module heavily modified, a number of variables renamed with more intuitive names 
(at least to me), a number of more variables introduced for the Y-proc parallelization.

Task_group module made void. task_group management is now reduced to the logical component
 fft_desc%have_task_groups of fft_type_descriptor type variable fft_desc.

In term of interfaces, the 'easy' calling sequences are

SUBROUTINE invfft/fwfft( grid_type, f, dfft, howmany )

  !! where:
  !! 
  !! **grid_type = 'Dense'** : 
  !!   inverse/direct fourier transform of potentials and charge density f
  !!   on the dense grid (dfftp). On output, f is overwritten
  !! 
  !! **grid_type = 'Smooth'** :
  !!   inverse/direct fourier transform of  potentials and charge density f
  !!   on the smooth grid (dffts). On output, f is overwritten
  !! 
  !! **grid_type = 'Wave'** :
  !!   inverse/direct fourier transform of  wave functions f
  !!   on the smooth grid (dffts). On output, f is overwritten
  !!
  !! **grid_type = 'tgWave'** :
  !!   inverse/direct fourier transform of  wave functions f with task group
  !!   on the smooth grid (dffts). On output, f is overwritten
  !!
  !! **grid_type = 'Custom'** : 
  !!   inverse/direct fourier transform of potentials and charge density f
  !!   on a custom grid (dfft_exx). On output, f is overwritten
  !! 
  !! **grid_type = 'CustomWave'** :
  !!   inverse/direct fourier transform of  wave functions f
  !!   on a custom grid (dfft_exx). On output, f is overwritten
  !! 
  !! **dfft = FFT descriptor**, IMPORTANT NOTICE: grid is specified only by dfft.
  !!   No check is performed on the correspondence between dfft and grid_type.
  !!   grid_type is now used only to distinguish cases 'Wave' / 'CustomWave' 
  !!   from all other cases
                                                                                                 

Many more files modified.




git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13676 c92efa57-630b-4861-b058-cf58834340f0
2017-08-01 20:31:02 +00:00
giannozz 15215e2262 Compiled modules shouldn't be under revision control!
git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13663 c92efa57-630b-4861-b058-cf58834340f0
2017-07-31 16:43:50 +00:00
degironc 1b33777cbd remove some timing printing.
intra_pool_comm (the parent_comm of intra_bgrp_comm) should be the first argument
of set_mpi_comm_4_XX routines.



git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13645 c92efa57-630b-4861-b058-cf58834340f0
2017-07-29 19:48:15 +00:00
degironc 4636bca635 KS_Solvers directory has been created with three subdirectories:
KS_Solvers/CG, KS_Solvers/Davidson, KS_Solvers/Davidson_RCI.
Two are currently used by QE, the third one implements the Davidson
diagonalization within the Reverse Communication Interface paradigm,
courtesy of Micael Oliveira.

KS_Solvers routines depend only on lower level libraries, notably UtilXlib, 
LAXlib, (SCA)LAPACK, and BLAS.

reorganization can be improved. For instance some duplicated routines like
cdiaghg and rdiaghg could/should be moved in LAXlib. This could reduce the need
to include  KS_Solvers directories in the link step of many codes.    

Minimal changes to calling sequence have been made, essentially just adding
h_psi,s_psi,g_psi and h_1psi,s_1psi routines names as arguments (with a
specific calling sequence ihardcode inside the routines that agree with PWSCF one). 
This could be avoided adopting the RCI paradigm.

Compiled in serial and parallel, 177/182 pw tests passed (3 that were failing 
even before on my laptop pw-berry, pw-langevin, pw-pawatom + 2 unknown==not tested), 
12 /17 cp tests passed (some o2-us-para-pbe-X fail but the same was for the 
original version)

I assume the modified calling procedure is working and the problem lies somewhere else.
 
Randomly tested some examples in pw, ph, pwcond and it seams to work.

Please report any problem.





git-svn-id: http://qeforge.qe-forge.org/svn/q-e/trunk/espresso@13644 c92efa57-630b-4861-b058-cf58834340f0
2017-07-29 12:19:19 +00:00