Skip to content

Enable max-mixed multicg_offset_QUDA in specific situations.#93

Open
SaltyChiang wants to merge 1 commit intomilc-qcd:developfrom
SaltyChiang:feature/max-mixed-multicg-offset-quda
Open

Enable max-mixed multicg_offset_QUDA in specific situations.#93
SaltyChiang wants to merge 1 commit intomilc-qcd:developfrom
SaltyChiang:feature/max-mixed-multicg-offset-quda

Conversation

@SaltyChiang
Copy link
Copy Markdown
Collaborator

This PR enables the max-mixed multicg_offset_QUDA solver in specific situations. This shouldn't affect the solver precision, as it just changes the sloppy precision.

The first pseudo-fermion used for the RHMC/RHMD algorithm usually has a low precision tolerance, and residues for the MD step for all shifts are not widely distributed. In such a situation, using half-precision as the sloppy precision in QUDA has a shorter wall time to solve the solution. Because the solver doesn't need to spend too many iterations in the refinement step, the faster multishift step will cover the refinement cost, and even faster.

We perform some 512-node tests on Vista on a 192^3x384 lattice:

  • The new setup reduced about 16% of the trajectory time.

NOTICE This change has not been checked in production. Although theoretically it shouldn't affect the Monte Carlo Markov Chain, we still need a production test for this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant