Start to add hybrid oprbital rep in manual.

This commit is contained in:
Ye Luo 2018-04-20 18:07:46 -05:00
parent d07f8ea7d0
commit 93dca0e9cc
4 changed files with 6 additions and 2 deletions

View File

@ -76,7 +76,7 @@ To achieve better performance, mixed precision version (experimental) has been i
\subsubsection{Memory consideration}
When using threads, some memory objects shared by all the threads. Usually these memory are read-only when the walkers are evolving, for instance the ionic distance table and wavefunction coefficients.
If a wavefunction is represented by B-splines, the whole table is shared by all the threads. It usually takes a large chunk of memory when a large primitive cell was used in the simulation. Its actual size is reported as ``MEMORY increase XXX MB BsplineSetReader'' in the output file.
See details about how to reduce it in section~\ref{sec:splinebasis}.
See details about how to reduce it in section~\ref{sec:spo_spline}.
The other memory objects which are distinct for each walker during random walk need to be associated with individual walkers and can not be shared. This part of memory grows linearly as the number of walkers per MPI rank. Those objects include wavefunction values (Slater determinants) at given electronic configurations and electron related distance tables (electron-electron distance table). Those matrices dominate the $N^2$ scaling of the memory usage per walker.

View File

@ -3,6 +3,7 @@
\include{spo_spline}
\include{spo_gaussian}
\include{spo_hybrid}
\include{spo_pw}
\include{spo_heg}

4
manual/spo_hybrid.tex Normal file
View File

@ -0,0 +1,4 @@
\subsection{Hybrid orbital representation}
\label{sec:spo_hybrid}

View File

@ -110,4 +110,3 @@ Additional information:
\item \texttt{twistnum}. If positive, it is the index. It is recommended not to take this way since the indexing may show some uncertainty. If negative, the super twist is referred by \texttt{twist}.
\item \texttt{Spline\_Size\_Limit\_MB}. Allows to distribute the B-spline coefficient table between the host and GPU memory. The compute kernels access host memory via zero-copy. Though the performance penaty introduced by it is significant but allows large calculations to go.
\end{itemize}
\label{sec:splinebasis}